Table



GCCGAGGTA C I 1 1 1 I M 1 1 » H 1 1 1 M 1 11 \ GGACAT ACTGAGAGAATTTGGAATTAT AT 
GTTATGGTAGAATAAAGATCGAGGTCCATTTTrCTTATACATGAAAANTTAAATATTTAG 
T 

TTGGGATTTGAGACTT(X^TCTAGGCCTCTG>^TTTCTTTCrAG I T f IT 1C CCTACCAT 
T 

CTTTAATCGGAGTATCCAAGCCCAATCACCCTGTANCCT ATGTCCTAAAGCATCTTGAAT 
TGNTTGNTTC^MGTTTTTHCrrCAJGHAGGAG I G ICTTI IGCftCACHCCTCTTAAGCC 
TA 

TCTGGATCCCCACTTCANNCCTCTGAAGGGTTCTGTTAAAAfTrTCT AACCCT ATCTNT 
AT 

NGAATTTOTCOCC 



GCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTC 
CGCGAAACCAA£TTGGACTTAC>£AGAGTTCCAAGAAACT ATGATCCTGCTTTACATCCT 
TTTGAGGTCCCACGAGAATATATAAGAGCTTTAAAT^ 

GCAAAACCATTOCTTOCI rCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAG 
CATCCA<SAGAAGCTGGCTAC rGTCCTTTC I GGGGCGTGTCATGGAGAGGTTAGAATTTGG 
AATCTAACTCAGCGGAATTGTATCCGTACCT 



CGGAGAGGAGTCCTrACTTAGAGTNAAGCTGAAGGAGCATCACAACCOCAAAGACTGTTA 

TGTTGTGAAAT7TAGGCTGTGTTTTAATAATACTGATGATGATANGATGAAATAGTAAT 

T 

TATTGATTACT AT ATCTACT AT ATGTCCGTAAGATAGCAGGGTCTTT AT ACTCGGAATC 
T 

CATITGATCCTCATAGTTTrrATTGGTGTATTATTATCCTCATTTTACAGATAC^ 

TGAGGCTTCAGAGAGGCTGTGTAATCAAGAGTTTGTATGCCTTTCATCT 

GGACAATGCCAAGTTAGAAAAATAAATGTCTTT AGCATTA I I TTTOCI l A ATGTTTAGAA 

TATTAATAAX5TT AC7CAGAT AATCTATTCGAATTTTCTTCA A 

GAGTTG 

G 



TACTCAGTTTCCTTATCTATAACATGGGGATAATATTANGTATGCTACAT^^ 

GAGGATCAATATCTGTAAAGCTCTTAGAACATGCA ITTTTCTI MTACTAAATGGGMAAGG 
TCTGGCMGGCGCGGTGGCTCACACCTGGTAATCCCAfiCACTGTGGAAGGCTGAGGNGGGG 
GCAGTTGGGGACCGACGGGTTGTACTACTNCAATGTAAC IIUI III C TCAGAAATTHAGG 
CHAAAAGTCTTACTGACCATGTAAAGGAAATCCAACAATTATAA^ 

^^GGAGCTTATAGTCTAGTTAhKSAAACCAGACTT AAACATATGAAAAGTTTAAACATTGG 



ctctttcattgaaaggaaattanggttgaacctccaggagcccgtcagagtctgaggaga 
ggctggcttnatgtctagatacgacgacagcaaggctgcttao^gctaacagcgcattgc 
ctttcactaccggactctcctttgcagctgccttggtgatctcatcagtcagcatgtc 

TC 

taacccagagccaggctgtgc hum i g tacct 

Sequence 6 

CGCGGTGGCGGCCGCCCGGGCAGGTACCTATGACCATCTrACATtATTTTTATGGGTGGG 

GGGCATTGGCTGTGGAATGTGGGCAGTAACTTGCACAGTCAGTAACCGTNNGAGTAACTG 

GTTGTTGGCATCCCCATTCTGGCACTCCTCCTCTAGGTCTCCACCTCACACGCTGGTTTG 

TGGGCG<^GGGGCAGGTTGGTGCCGTGGGGTGTCCGGGCACTGGCTGTGCATGCCTTCTT 

CCTCTTCTGTCTCTTGGCCACCTTTTCCAAAAAGTCACCAGTGACCAATTCTCCCAGT 



GT 

TTCTTTGGGACTCAATGGCTrGGGCTTGGCATTGGGT AAAGCCGACTGGCAAGTTTCATT 
CTGACCAAGCTCTATAGTAGTCCGG>K3TGGACCTCTTGCCCTCCCTGCTCTGCGGAAAGC 
TTNCTCAGGCTTTG C TT C TTCACTT ATTTACTATTrGCGGCGTCCTGGGGGTACCCTC 
GG 

NCGCTC TAGAACT AAGTGGGATCCCCCCCGGGCTGCAAGGAATTCGAATATCAAGCCTTA 
TCGAATCCGTCNAACCTTCGAAGGGGG 



GGTGGCGGCCGAGGTACGGATACAATTCCGCTGAGTTAGATTCCAAATTCTAACCTCTCC 

ATGACACXSCCCCAGAAAGGACAGTAGCCAGCT^ 

TCCATCACGGTGACCATCCAGCGAAGOKAGG 

GGT AGCATTTAAAGCTCTTATATATTCTCG TGGGACCTCAAAAGGATGTAAAGCAGGATC 
ATAGTTTCTTGGAACTCTCTGTAAGTCCAACTTGGTTTCGCGCACATAATTG 

TT 

CCGGCTCAGCATCT 1 CACCTTCATCTCGG 1 1 GC IC 1 1 C 



AGCAACCGAGATGAAGGTGAAGAPGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAAC 
CAAGTTGGACTTACAGAGAGTTCCAAGAAACT ATGATCCTGCTTTACATCCTTTTGAGGT 
CCCACGAGAATATATaAGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGCAAAACC 
ATTCCTIGCTTCGCTGCATGGTCACCGTGATOGAGT 

GAAGCTOGCTA CTGT0C1 1 1 C TGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAAC 
TCAGCGGAATTGTATCCGTACCTCGGCCGTTGTANACT AGGGGATCCCCCGGCC 

rrACGACATGCACTGATAGCTCTCTTTGTATGAACAGGAGCTGTGGC 
AGGCCCTATGCCAGGGAGAAAGTAAGATTG<3AAAAGAG<riTA 
CACTGTGCTTAACGGGOA G AAAAACGTCTTO^ 

AGTGCT ACTGCT CTTTCATCCTGGAGCT GGAACAGACGTCACCAGTCAATCATGATGGCT 
GCTGGGTGCACTGGCT AACATCTAT AATCCCAGCACTTT GTGAGGCTGAGGGTGGGAAGA 
TTGCTrGGGGCCAGGAGTTTGAGACCAGl 1 1 1 GGGCAAATTGCAAGACCCTGTCTCTGCA 
AAAAAATATAAAATGTAGCTGAGTGTGGTGGCACCTGTAGACCCAGCCCCAGCTACTCGA 
GAGGCTGAGATGGGAGGATOGCTTGGGCCTAGGAGTTCGAGGCTGCAGTGAGCTATGATT 
GCACCACTGCACTCCAGCCTNGGTGACAGAACAMGACCTGTCTNT AAAAANCATT AAATT 



JATCCTGl 1 1 1 AACTAGCACAGACAAAACCTATGTGTTACTATCAAA 
ATAAAATTTAGAAAAACAATTTTCTTAt AAAA rTTTCTGTTI GTATTTGGACTACATAAA 
CT GGC TTTAAAATTGAGAAAT ATGCGCT AAAACCATAAGGAAAAAGCCAACAGAAAGAAC 
AAAAAGATCACAGCAATTAGGCCCGTTCTATTCAATTTTGCCATGAGCTAAAAATCACAT 
TCTTCACAAAGTAAATTACCGCCXrTGTTTTTrATTCTTAAGCAC 
G 

ATCTGAGCTTTACTAAATCGGAAAAGAAAATCTCAATTATAGAACATTTAGTTTATTTAT 
ACCTTAATGCCCGGAGA6GTAATATTTTACTTTAAAATGCATAACCCATGTGGACATGCT 
AGGTCTTCCAAAA 
Sequence 11 

CTGGGGTTTCCTCCCNAGGGGAAGCCTTGTAGAATCCACCTNGGAAANCCrTGTNGGGTN 
CCGC1TGGCCCGTNGMATGGHTGGMGTAGGGGAAGGGCAAAGTACGCCTTCAAGAATAGG 
NAAAAAAGGGANGGGGGGGGGGNACCaCTCAAGGCCTGGCAAAGGGCAAGTGGGACCAAG 
TGGC CCAAGGGGGCTTCTTGGAA TGGTGGNT CTC TCACAAGCTTTG T AANAAAGG TGG TG 
GAAGAACCAAGCCTTGNCCCTTTroTGGOTCGWGWGACCTTGGAATAAAGGGCCAAAAGG 



PCT/US00/33312 



NAANI RSGT I IGTAAAGGGGAAAG 
Sequence 12 

GCGATTGGAGGTCCCCGCGGTGGCGGOCGAGGTA C I Mi l l 1 GTTTTGTATTTTTAGTAG 
AGATGGGGTTrCACCGTGTTGGCCGGGCTGGTCTTGAACTCTTGAIT^ 
CCACCTCAGCCT CCCAATG I GC I GGGATTACAGGTG TGAGCCAOCATGCCTGGCCTTTTT 
CT TTTTTI rTTTTAAACGAAAAAATGTTTTTAATTGACAAATAAAAATGATGTATATTTA 
TGGTGTT1 1 M L ILI I I TGGATCATCAG ICICI I IC ICATCACTGAAACCTACAAATATT 
TTAAAATCTTTCGATTAAAAAAA 1 1 II UC I G ATCATTCAAC C I C I I CAAATTATTAAGAG 

atacttactttgtatgaaaaatittgtcgagatgtataatcc a i rm rrCTG GGAAG 

Sequence 13 

TTACTTACGGCGAATTGCGWCCGAGGTACCAG^ 

GATGCAGATCTGGCCCCAGTGTGAGC A T C 1 G I G 1 1 a atggtatcagacttaaagaaggaa 

agacctgatttgactgctgttggtttggtactgttccctgatcc^ 

g*gggaotcccaaagcaggtttgagctgtggtaatga^ 

acagtagaatcgtacctgcccg 



TGGCGGCCGAGGTACGGTATTXrrCTTCAAACAAGAGCAA^ 
TCAGTTGCTTGGGGGACAAACAAGAAGGAAAACTCTGAGACAG^ 
GATGACCTGGTGAAGGTCTGGAAATGGCGTGATGAGAGGCTGG^ 
GACGGACATCACCTGOGA G T GG T G T C TGT G GCACATCA^ 

ATCCAGCTCTNl I GATGCTCATATTCGTCTTTGGGACTTGGAAAATGGCAAACAGATAAA 
GTCCATAGATGCAGGACCTGTGGATGCCTGGACTTT GGCCTTTrCTCCTGATTCCCAGTN 
TCTGGCCACAGGAACTCATGTCXXXJAANGTGAACATTTTTGGT GTGGAAAGNGGGAAAAA 
GGAA 

Sequence 15

GCCCCTGCCCGGCTGGTTATGTAACAAACAAAGTCTGTGTCTGTGTGGAGTGTTGCAGGA 
CGAGTGGAAATGACTGTTTCCAAGTTCATGGCAATTCAGAAGG 

TTCCAGTGCCTGTCCGATGGAGAAGTATCTTGTAAGGAAGCAACTTCCATAAAAAGGGTC 
AGAAAGTCTGTTGACCGATCACI ICI 1 1 ICI I UGATAACTGTCTGCGGCTGCAGGAAGAG 

tc^gaggttcttcagaagagtgtgatcattggagtgattgaaggtggagatgtgatggaa 

GAGAG GCTGAGGTCAGCACGAGAGACAGCC AAGCGGCCTGTGGGTGGC 1 I C I ICI GGATG 
GTTTTCAAGGAAATCCAACA 



CGCTGGGGGCCGCCCGGGCAGGACGCGGGAAGAGGTAATTTTAATGCCATTTTCATGGGA 
CACTTGGGAGCT AGATTAGAAGAAGCCAAGACTACAATCGGGGAGATGAGTT GCAGAGGG 
NNGTGGTGAAGGTCTGAAGGAAGGTaGGAAAAGGTCGGACACATTC 

ggtggaggtggttggatatggggagtt 

tagtcctcactctgtgggcaggggca 

TTACAGCATAGGGGTCCCTTTTGTCAGGGATTTATGATGGCATCACAO 

gaccatwaattgaaaaatacatatgattggctgggcctgga^ 

agcactttgggaggctgaggtggctggatcacctgaggtcgggagttc^^ 

acxaacatggagaaaccctttctctactaaa^ 

atcxxtgtaatcccagctactagggaggctg^ggcagc^u^ttgcttgaacc 



Sequence 18

tnccgcggtggcggccgaggtacgattctac 1 G 1 I 1 I UICI ICI A GGATCAACTCGGTCA 

^ACCACAGCTCAAACCTGCTTTGGGACTXXXrrCCC^^ 
ACACTACCAAACCAACAGCAGTCAAATCAGGTCTT^ 

*>"CAGATGCTCACACTGGGGCCAGATCTGC^^ 

CCTGGTACCTGCCCG 

Sequence 19

CCGCGGTGGCQGCCCGCCGGGGCAGGTA C I I 1 1 M I H 1 1 1 1 1 1 1 1 1 I Al 1 1 1 1 1 1 1 1 1 
T 

iTTTTTTTn 1 11 NCCCCCGGGAGAGGAATTGGGAAGAGCAAATTGCTGC TGAAAATT 
TC 

TACATTGATCCAGACAAACAAGTTAGAGCAGGCTGAAAAAGAAOOGTTGG 1 O 1 1 1 1 ICTG 

TGTTCAACCAGATCAACTGGAAAAGTATAGATACCTTAATTAGCAC IC TGCTCl G HGGGA 

TTCTSSTCAGCCTGGCCCA STG G1 1 1 H 1 1 CCO.TGAACACKCCTGAAAGGGGAGCTCAT 

AATGACTGCTGTGCAGGTGGGCGGGGAGGGGGCTTCCTATTTGATTTAGW 

TGCCAGTTACCAATTMTNGGTNGCCCCATTTATACATGGKGGAAAAA^ 

Sequence 20

GAGGT ACCCAATTrTTTTAAGTrCTAAGCTAGCTTTCTCAAAGAAA 
G 

TCCATTAAAAGAGCATCTGCGAATTG1 1 1 1 IGCAGGGACTCCTAATCAGTCAGGAGAAGT 
AGAATGTAAGCAAAGTCACAAACCTCCCGTAAGAA 1 1 IGO I rCA CCAGCACACAGCTCCT 
CT GTTATGAAGGGATGAGAAGCAGACCCCAAAGCCAGTGCCACAGT C T CCC T GGAAACAG 
CAGCA{SGCTTGGGGAATGCTTCCAAAA<MSCTATGCCATTCAAGCT 
TAAAAATACAACTTAGGCCAACTGCAAGTGGCTCATGCCTGTAATTAATTCCAAC 

iATCAACTCGGTCATTACCAC 

XTGGCTCCGGATCAGGGAACACTAC 

CAAACCAACAGCAGTCAAATCAGGTCT I FOCI I C I 1 I A AGTCTGATACCATTAACACAGA 
TGCTCACACTGCGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAAT^ 
CCTGCCCG 
Sequence 22

CGCGGTGGCGGCCGAGCTACAGAGTAGAGAGAGTrcTGCAGG 

GAT AGGACCAGACCAGACCAGGCCTTGT AGGCCATGGAACGACTTTGGATTTT ACACCAA 
GTGCAACAGGTAACTGCTGGAGGGAATTCAGCAAGAGAGTGACAGGAGCTGATTGACAAT 

ttgaacgcccactctggctgccatgtggcaaatagattgtaggaagaaa 

AAGAGAGCAGTTTGGAAGCTACTACTGTTGTCCCAGAAATA 
Sequence 23

CGCGGTGGCGGCCGAGGTACANAGTAGAGAGAGTTCTGCAGGGATGAACGT 

TGAT ATGGACCAGACCAGACCAGGCCTTGT AGGCCATGGAAGGACTTT GGATTTTACACC 

AAGTGCAACAGCTAACTGCTGGAGGGAATTCAGCAAGAGA^ 

ATTTGAACGCCCACTCTGGCTGCCATGTGGCAAATAGATTGTAC 

GGAAGAGAGCACTTTGGAAGCTACT ACTGTTGTCCCAGAAATATGTaaTGG TGGC TT GGC 
CCAGGTTGGGGT 



CCGCGGTGGCGGCCGAGG TACAAAAAAAGCACANGCCTGGCTCTGGGTT AGAGaCATGCT 
GACTGATGAGATCACCAAGGCAGC TGCAAAGGAGAGTCCCGTAGTGAAAGGCAATGCGCT 

gttagctctaagcagccttgctgtcgtcgtatctag^ 

ctctgacgggctcciggaggttcaacctaai I ICC 1 1 1 caatgaaagagtgggtttccat 
GGTACCTGCCCG
Sequence 25 

CCGCGGNGGCGGCCGCCCGGGCAGGTACGCGGGAGGCACATTC 1 1 1 1 CTACGTGAAGAGT 

TTTGTAAACTGAACTTTGTTTTCAGTTCCGGCTCCAGCCATCCTGGGGTNGCTTGCCA 

AT 

agatgaatcccactcgtttgacccatgacgctccttcttttcatttctccct 
c 

ACAGCAGTGCATGTCCACCATACCACCTGAGAGTCTGTGGAATCTAATTTTCTGTTATAC 
TTCTTTGCTTACAC 



CCATCACACGCCCCAGAAAGGACAAGTAGCCAGCTTCTCTGGA^ 

GACTGCATCACGGTGA(XATXXAGCGAAGCAAGGAATGGTTTTGCAAATACTCGTTCCAG 
TTTGGTAGCATTTAAAGCTCTITATATATTCTCGTGGGACCTCA^AAGGATGTAAAGCAGG 
ATCATAGTTTCTTGGAACTCTCTGTAAGTCCAACTTGGTT7CGCGGACATAATTGTCC 
GG 

ATTCCGGCTCAGCATCTTCACCTTCATCTCGGTTGCTCTTC 
Sequence 27 

ACGCGGCGGCGGCCGAGGTACGGATACAATTCCGCTGAGTTAGATTCCAAATTCTAACCT 
CTCCATCACACGCCCCANAAAGGAGAGTAGCCAGCTTNTCTGGATGCTTTGCCAAGCAAT 



GTTTGGTA6CATTTAAAGCTCTTATATATTCTCGTGGGACCTCAAAAGGATGTAAAGCAG 
GATCATAGTTTCTTGGAACTCTCTGTAAGMCNCAACTTGGTTATCGCCGGACATAATTGG 
ACCCGGTATTTCCGGCTCAGNCATCTTCACCTTTC^TCTAA« 

CGNTCTAAGAACTAGTGGGATCCCCCCGGGGCCTGCAGGGAATTCCGATAATCAAAGGCT 
TAATCTGAATACCCGGTCGGACCCTTCGGAGGNGGGGGGGCCCCGGNTACGCCAAGCTTT 
TTTGGTTTCCCTT 



cggccgaggtactcagtttccttatctataaca7ggggataatattagtagctacatcgt 
tgttatgaggatcaatatctgtaaagctctt agaagatgc a 1 11 mci ictactaaattt 

atcccagcactgtggaaggctgaggt 
gttactactccaatgtaactgctttctcagaaatta 
aggcaaaaagtcttactgaccatgtnaaggaaatccaacaattataaacagtctctgcct 
ttaaggagcttatagtctagttaagaaaccagacttaaacatatgaaaagttaaacattg 



ACATCGTTGTTATGAGGATCAATATCTGTAAAGCTCTTAGAACATGCATTTTTCTTCTA 

c 

TAAATTTTAAGGTCTGGCAGGCGCGGTGGCTCACACCTGGTAATCCCAGCACTGTGGAAG 

GCTGAGGTGGGGGCAGTGGGGAGCGAGGGGTTGTTACTACTCCAATGTAACTGCTTTCTC 

AGAAATTAAGGCAAAAAGTCTTACTGACCATGTAAAGGGAAATNCAACAATTATAAACAG 

TCTCT 

Sequence 30 

GGCGGCCGAGGTACTCAGTTTCCTTATCTATAACATGGGGATAATATTACGTAGCTACAT 
CGTTGTTATGAGGATCAATATCTGTAAAGCTCTTAGAACATGCA I I » \ I CTTCTACTAA 
A 

TTTT AAGG TC TGGCAGGCGC GG T GGC TCAC ACC T GGTATCCCAG CACTG TGGAAGGC T GA 
GGTGGGGGCAGTGGGCVVGCGAGGGGTTGTTACTACTCCAATGTAACTGCTTTCTCAGAAA 



TTAAGGCAAAAAGTCTTACTGACCATGTAAAGGAAATC c aacaattat aaacag tctctg 
CCTTTAACGAGCTTATAGTCTAGTTAAGAAACCAGACTTAAA^TATGAAAAGTTAAACA 
TTGGCCAGGCACAGTGGCTCATGCCTATAATCCCAGOSCTTTGG 

CTAAAAATACAAAACTAGTTGGGCATGCTGGCGCATGCCTGTGATCCCAGCTA^ 

Sequence 31 

CCOGCGGTGGCGGCCGAGGTACTCAGTTTCCTTATCTATAACATGGGGATAATATTAGTA 
GCTAGATCGTTGTTATGAGGATCAATATCTGTAAAGCTCTTAGAACATGCATTTTTCTT 

c 

TACTAAATTTTAAGGTCTGGCAGGCGCGGTGGCTCACACCTGGTAATCCCAGCACTGTG 

aaggctgaggtgggggcagtggggagcgaggggttgttactactccaatgtaactgct^ 
c tcagaaatt aaggcaaaaagtctt ac tgaccatgtaaaggaaatc caacaattat aaac 
agtctctgoctttaaggagctttatagtcragttaagaaa 



GCGGCCGAGGTAC^ATGCACTTGCTTGCCATCTAAGCAGGGACAATGGCA^ 
ATGATGTTACTTTGATTCTCTGACCAAACTGGCCT GTGAGCACCCTGGGCCTTT CTTC 
CT 

CTGTCAAAGGCCTTAAGACAGGTTTACGCTGTAGCCAGGTCTGGAAGACAGAGCTGGGTT 
AAAGCTGGGTGGGAGAAGTGAAAAAGGTCAGGTTT ACATTCCT ACGCGGAAAAGGATGTA 

CC TG TTG T AGGAGAG TAAT AAATGACTTGAGAGT AAGCC T AAGCAAACTCAAGTGGGAAG 
GGGAGTGGGCTGTAAAATAGTTTAAGAGACTCTCTCAGGAAGTCAGCGTAATTGATGTGT 
AGAAAGGTAACAGTCAAGAGTTCTCCTAACAAGACAGCTTCAAAGCAGCAGCTATAGTGG 
AGCATTCCTGAGGCCTGCTGCAGATCAAAG<^TGAATGTGCAGACTGGTCCTCTTGCCGA 
GCGTTTCTTTC 
Sequence 33



TTCATATCATGATGTTACTTTGATTCTCTGACCAAACTGGGCTGTGAGCACCCTGGGC 
CT 

TTCTTCCTCTGTCAAAGGCCTTAAGACAGGTTTACCCTGTAG<XAGGCTCrTGGAAGACAG 
AGCTGGGTTAAAGCTGGGTGGGAGAAGTGAAAAAGGTCAGGTTT ACATTCCT ACGCGGAA 
AAGGATGTAACACGGGGCCACATCCTATGC C CAATCCCAAGGCAGGGAGGCAGGGAAG T G 
GCTGCCAAACC TG TT GTAGGAGAGTAAT AAATGACTTGAGAGT AAGCCTAAG CAAAC TC A 
AGTGGGAAGGGGGAGTGGGCTT 
Sequence 34 

GCGGCCGAGGTACCAGTTAAAGTCTTCTAGCCTCTATCCCCACTCCTTT^ 

GCTAATATGAGTTTTGTCTCTTACAATTTATCTGCATCTTA 
T 

CATTAAATCATTAAACATTCTATCCAAATAGGATGCCCTTCTGTGGAACTGCATATTTG 
G 

AAACCATACTGCCTGTTT AAC TTATG CACTC CAC TGGGAACTT ACAGT ATCTGTTTC CC 
A 

CAAT AC TTGCAGTCAT ATCAGTT ACAACCGC TGGG T GT G T ATT GGTTCAAAAGGACCT AC 

CTACAAGGTTATATCAATCCATTGTCCAATTTGAGAGATTTTTTCTGAATCCAGTTAAA 

A 

T AATTTTTGG C TACACCTGGGGACACTTCCCAGGACAACAATGACTTG TAGTCTAGTGCC 
CAAGAAAGC CAAAAAG GC CCGG CAAC 
Sequence 35 

GGTGGC G GC CGAGG T ACG GATACAATTC C G C T GAG TT AGATTCCAAA TTCT AAC C TC TC C 






ATCACACGCCCCAGAAAGGACAGTAGCCAGCTTCTC TGGA TGCTTTGCCAAGCAATTG^AC 
^*^^TPA^ft/^Ti^A^^^yf > ^ ( A/'^^A AO^AAGGA^T^tGI 1 T IGCAAATACTTjGTTTjCAGI 1 1 
GGTAGCATTTAAAGCTCTT ATATATTCTCGTGGGACCTCAAAAGGATGTAAAGCAGGATC 
ATAGTTTCTTGGAACTCTCTGTAAGTCO^ACTTGGTTTC 
TT 

CCGGCTCAGCATCTTCACCTTCATCTCGGTTGCTCTTC 



CATNTG TGTTTT ATTGTGAAGGGTCCTCAACTGTGTGGCTGATTCAGGCTG TCCCCACTG 

CAATGTAGGGAGAGGAGAGAAAGGGATGAAAGTGAAGGCAGGGGGGGGGATGTT^ 

ACCGGGG TGAACTTCTGCCTGAGCAAGNTGATG TTGGC TTCCGAN NGTATTTGGGACAC T 

TTCTTTCAATACAThnT^TTATTTAAGCACTTTATTCTGTGNCTGCTGCCCTG 

G 

Sequence 37 

CCGCCGTGGCGGCCGCCCGGGCAGG^.VCGCGGGGGCAACATGGCGGCC 

AGCTGCC^AGATTTGAATTACT(XACTCGTAGCTATTGCATTCCTGACGATGGCCTCTGTG 

GCTTCGTGCGATTCGCGTCCGAGCTCAGACGAGCTCC^ 

GAAGATGAGGACTATGATTTTGAAGATCGGGTCAGCGACTCGGGTTCATATTCC^ 
AGTAGCGATTATGATGATCTTGAGCCTGAATGGCTGX3ACAGTGTGCAGAAAAATGGAGAG 

GTGAACCATG 7CAGGTTCAGTGAAAATGAGATTATCATTG 
Sequence 38

CCGCCGAGGT ACTT AAGTTTTTC TTCAGTT ACAGC TACCATG TGAAAATAATTC TCTGC 
T 

TATCAAGTT TACAA CTTT AGAATTTCCTGTTTTAAAGTTTTCTCATTTACTT ATCACACA 

GTCATCTTCTTTTTGCCAAACGCTATAGTAGCACATTAAAAGGAGACT 

ACTCTGTGCAAAAAGTATTGGGTGCTTTGGTAGAAGTCTATACAGAAGACACTGGAGACA 

CAAAAATGAATTTTGTCCAGGTGAGTTGATGTCAGAAAAGGCTTAATAATGGAGATGAGG 

CCGGGCATGG T GGTTCACACC T GTAAT CCCACCTGTTTGGGAGGCTGAGGCAGGT AGATC 

ACTTGAGACCAGGAGTTTGAGACCAGCCCAGCCAACATGG^ 

-NAAAA>n>IAAAAANATNNGGTTCCTGCCCGGGCGGGCGCTTAGAACTAGTGGGATCCCC^ 
GGGCTGCANGAATTTCGATATCA 



TCCCCGCGGTGGCX^CCGCCCGGGCTGGTACGCG^BGAAAGC^^ 

ACCAGGGCATCAAACAGATGAAACAGCCTATTGTCCATTTCAACAGGATTT^ 

GGGGATGATCTTTCAAATTATCCACAACTTAATTATTTAATATTTTGATAGTCAATTACC 

TAAGACACCGCATCGTCACTGACCAATCAGAAGAGATGCCAGTAGTTGGGCGCAGTGGCA 

GCACTTTGGGAGGCTGAGTGGACAGATCACCTGGGGTCAGGAGTT«3A^ 

CTACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATGAGCCAGGCATGGTGGGCAC 

CTGTAATCCCAGCTACTTGACAGAGTGAGCCTCTGTCTCAAAAAAAAAAAAAAAAA 

Sequence 40 

GCCTCCCCGC GGTGGC G<K;CGAGGTACAGTrTAGAAAACTGTGGGGCTGAGTCCTCGGGG 
CCGTGGGGCGCAGCGTGGCTGATCACCATCATAACGGGCCTATGGGGATACATTCTCTTA 
GACATTTTGAAG T AA TT AATGC T CTCG TT AG TGATT AAGTC TG TGAA GT AGTCC TTT GC 
A 

TAATCAAATCCATGC I I I 10 1 1 IGATGCCATTGCGACAAACAGTGTAATTATAGAAGCG 
A 

GAATTCTTGATTAATCCAAGCCATTCTCGCCACCCAGGGGGGATGTAGCTGCCATTATAT 
T CATT GAGG T ATTTT CC AAAAAAGGC TGTT C TGT AGCCAG TG TTG TT AAGAT AT ACAGCA 
AAAGTCCGAGGC T CAT GCAT GGCC TGCCACGAG G GGGAAGAGCAGTTC TC GTTG TT GG TG 



TAGACATTGTGATTGTGCACATACTTNCCGCTGAGCATGGAGGAC^ 
ATGGGGTGTAGTCACAAAGGCATTGATGAAGGTGGCCCCCCATGTT 
Sequence 41 

CCCXX3CGGTGGCGGCCGCCCGGGCAGGTACACGTGCACA 

G TAT ACATGAGCCA TGCTGGTGCGCTGCACCATG GCACATGCATATCTATGTAACAAACT 

TGCATX3TTCTGCACATGTATCACAGAACTT AAAGTGTAAT AAAAAAAGAAAGAAAAACAG 

CATGCAATTCAGCCCACaCAAAAAAAAGAAGTCAAAGACAGCGAGAATTCTTAAAACAGC 

AATAAAAAGTAT AAAGTCACT CTAAAGGAATCCCCGTT AGATTAACAACACATTTCTT AA 

GAGAAATtTTAACAGGCC^GGAGAGAATGGGATGACATATTCAAAGTGTTAAAGGGGG 

AAAAACTCCACTCAAGACTACACCCAGAAAAGCTATCTTTCAGAAATGGAGATAAAAACA 

TCTTTCCCAGACAAAGAAAAACTAAGAGAATTTACTACCACTCACCAGCCTT 

A 

Sequence 42 



C GGAATCC GGACAATTATGTCCGCGAAAC CAAG TTGGAC TT ACAGAGAGTTC CAAGAAAC 

TATGATCCTGCTTTACATCCTTTTGAGGTCCCACGA^ 

T 

ACCAAAC TGGAACGAG TATTTGCAAAACCATTCCTTGCTT CGCTGGATGG TCACCG TGAT 
GGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCTGGCTACTGTCCTTTCTGGGGCGTGT 
GATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAATTGTATCCGTACCT 
Sequence 43 

ATTGGAGCTCCCCGCGGTGGCG>GCCGGAGAGCAACCGAGATGA^ 

CGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACTTACAGAGAGTTCCAAGAAAC 

TATGATCCTGCTTTACATCCTTTTGAGGTCCCACGAGAATATATAAGAGCTTTAAATGC 

T 

ACCAAACTGGAACGAGTATTTGCAAAACC A 1 1 CC 11 GC TI CGCTGGATGGTCACCGTGAT 
GGAGTCAATTGCTTG GCAAAGCATCCAGAGAAGC TGGCT ACTGTCCTTTCTGGGGCGT GT 
GATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAATTGTATCCGTACCT 



iTACI 1 1 1 1 1 1 1 1 1 1 II HI I I I It III I ICTACTCTGGAAG 




TATGATCACAAC 
TTTTTTTACATT 

AAI II I IA AAAGTGAGG 1 1 1 1 1 ACCTGATGATTGNGTAG G 1 1 I CIC CTAGCTCCAAAGT 
A 

TCCGGCTCCTACGACTCTAAATATAACCTTCAAGGAAAGNGCAGCTGGTTT^ 
TGATAATATCAAGCCATTCCTGGCTGGGCGTGG^K5GCTCATGCCTATAATCXCAGCACTT 
TGGGAGGCCCCGCGTACCT 
Sequence 45 

GGGNGGCTCCCACCGCGGTAGGCNGGCCGCCCGGGCCAGGTACGCGGGNAATTCAAGGAT 

GGGATTAAAGGATTTAAACCGTTTAGGACCCT AAAAGCATAAAAACCCCTTAGAAAGGAA 

AA T C TT AGGGCAAT ACCCA TT GGAGGGAC C TT AGGGCCTT GGGACCAAAGGAC TTT CA T G 

GAC TT AAAAACCAC CCCAAAAGGCAATTGGGCAANCCAAAANGCCCCAAAATT AGGN C CA 

AATNGGGGATTCTTAACCTTAAAACTTTAAAGGAGGCTTTNTTGGCCCC^ 

GAAAACTTTCC CC TTC NAGANGG NGGGACCCNGGGCCANC CCTTT CC NGGGAATN GGGGG 

GGGAAAAATTT 



GGAGCTCCCCCCGGTGGCGGC CGAGG TACTCGGGAGATCGTGCCACTGCCCTCCAGCCTG 

AGAGAAAGAAACTCTGTCTCTAAAAAAAAAAAGAAAGAAAGATGTCAGTGCTATTTATAG 

TAATACAAAAATTTAATGTAATTTTTGTCAAAATCTCAATGGTATATTTTTGCAGATTT^ 






TCAAA TT AT AT ATATATGATTT A T AAATT ATTGTT ATAGATTCCTGGAAAG TTAATCCA T 
CTCACCATTACATAATACCAATCTCTCTCGGCCGGGCGCaGTGGCTCACGCCTGTAGTCT 

cagcactttgggagtccgaggcgggtgaatcatgaggtccagagatcgagaccatcctgg 

CCAACAAGGTGAAACCCCATCTCTACTAAAAAT 
Sequence 47 

CTAACCTCACATTTAATTGCGTTTGCGCTCACTG^ 
TGTTCGTGCCAGCKTGCAATTTAATNGAATI 
CCGGG TTTTTGGOGGTAT1 
GAA 

CTTCGCCTNCCGCCTTCGGGGTCC 

jtacaagmgacaatgctggatgccaagcagwtcccc 

CXTACCGTCTCACTGCCCCTCAAGACTTCAA^ 

CAGATTTAGG TGGAAGAGCAGCCATGTTTGAAGGGCACATG TGATGAGTGGGGGGCAGCA 
AGATGCCATTTCTGCATCTCCCAGAAGGGATGAGTCT^ 

CC CCCAGCAGTGCTTNCCTNCTCCACCCTGCAl. i TTTN61 ICTTTOC 

CCCNAACTTTT 
Sequence 49 

G^XX«CCGAGGTACAACTAATGGAGCTCA<MAGCTGTCAAG<MTATAAGCAGTGCAACCCA 
AGACCTAAGAATCTTGATGTTGGAAAT AAAGATGGAGGAAGCT ATGACC TACACAGaGGA 

cagttatgggatggatgggaaggttaatc^gccccgtctcactgcagacatcaactggca 
aggcct agaggagct acacagtgtgaatgaaaacatctatgagtacctgcccgggcggcc 
ggctctagaact agtggatccccgg 
Sequence 50

ggcggccggangagcaaccgagatgaaggtgaagatgctgagcccggaat^^ 

atgtccgcgaaaccaagttggacttacagagagttcx^agaaactatgatcctgctttac 

atccttttgaggtcccacgagaatatataagagctttaaatgctacc^ 

TATTTGCAAAACCATTtXrrTGCTTCGCTGGATGGTCACra 

CAAAGCATCCAGAGAAGCTGGCTACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTT AGAA 

TTTGGAATCTAACTCAGCGGAATTGTATCCCGTACC 

T 

Sequence 51

NGGCGGCCGAGGTACCTCAGCATATATTGGAAGTGTTTTAGAGTTGGTGAGTTCCCCGTC 
C CTT CCAGAACTGAACGCT AGGAGGAGCAG N C AGNGAGGACAGACGTCT ATGCAGAAACA 
TGGNGAACCTCTGGAAATGACACACTCTCCGGGCNCAGGGGGCCATTCGTCCATCTTTGA 

ggtggactaatc^tggagattctngcaggg<x:ggctgctatctcagattttct 

gaaggagagagatcaacttcgatcgactccagtctgtcgggggctgatgagtgaggtggc 

agcaggcatccgcgtggttttgttgaaactggactttttattc^ 

GT 

TGTGATGATCTCATACTTTGNAGTTGNTCT ATCTGCANCACTGACTTTC 



CA 



TGG 



ttctgaaaattcatgaggctgtgttttaggtgaggctatttcttcattcactgaacng 

GG 

CACCCAACAGGCTCTTAATATGAAGACTTGGGCCCTTCCTGAGTTCTAGAAAAGCATTT^ 
TACTAGTTCTTCAGTAATTTCXCCTCCCCTTC^TTCT 

AATTGGATCTTGGGCCTCTAAGTATAGGCAAGATCATGTTTCTAAAAAGGTTCTTAGAGG 
GAGGGAG TT CC TGGGAGTG TT AT GT GG GG TGGTGCAN AA GGTGC T AACAGG T GGNTTTNT 



C TTT AGGATGAGCAGG TGG 
Sequence 53

GTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACXAAGTTGGACTTACAG 
AGAG TTCCKNGAAACT ATGATCC TGCTTTACATCCTTTTGAGGTCCCACGAGAAT ATA TA 
AGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGCAAAACCATTCC 1 1 GC I TC GCTG 
GATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAO 
CTTTCTGGGGCCGTGTGATGGAGAGCTTAGAAATTTGGAAT^ 

GTAATCACGTACCTCGGCCCGCTCT AAGAACTAGTGGGATCCCCCGCffGCTGCACGGAAA 
CCAANC1 1 1 1 iGG 

CCCCCCGCGOSGCGGCCGAGGTACACTGGGAAAATGA 

AGGGwACAGTCMAACTTGACAGGGGGGAAATCAAGTTAAATTCAGAGCTGGATTTAGATG 

ATGCCATTCTAfiAGAAGTTTGCTTTCTCCAA^ 

A 

TTTGGGAAGCATCACTGGATAAATTTTATTGAATCTATTCAAGMCAATT^ 
T 

AAAAGCTGGGAAGAAAGTGAAACTATCTCATGAAGAAGTTATGCAGAAAATCGGTGAACT 
CTTTGCTCTAAGGCACCGTATAAACTTTGAAGTTC^ . 
TT 

TCTTACTGGGGACAGGAGAAAACCNNGGGAAGGGACTTTACCGA^ 
ATTCCTTTAGCCATTTGGCCCCGAAAGANGTTAAGGG TCCAATGAAATTGAAA 
Sequence 55

TAGCAGGAGCCCCAGGAGTCTGAGCOGJtGGGACCCTCATGTCCATGCC Km GTCCCTGG 
ACNTGAAGACCTGAACrcCCCGCGCGTACTCTCGGCCCGNTTCTT AGGAACNT AGGTGGG 
ATT CXXXX&GGCCTGCT AGGGGAA TTTCCGAATATTCAAAGGCTT AATTOjAAT ACCCCG 
GTCCGAA^>IClTCGNAGGGGGGGGGGGGCGOCCGMNTTACCCCAAGC 



GCGGCGGAAGAGCACCGAGATGAAGGTGAAGATGCTGW^CGGAATOCGGACAATTATGT 
CCGCGAAACCAAGTTGGACTT ACAGAGAGTTCCAAGAAACTATGATGCT gctttacatcc 
ttttgaggtcccacgagaatatataagagctttaaatgctaccaaactggaacgagtatt 
tgcaaaaccattccttgcttcgctggatggtcaccct 

gcatccagagaagctggctactgtcctttctggggcgtgtgatggagaggttagaatttg 
gaatctaactcagcggaattgtatccgtacct 

Sequence 57 

CAGGGAATGGGAfGGWGGCTNCACCTGGGGANNCCTGAGGCCCGTGTTTGT 
GATTOCTTCAT GAAACAGN CTGG>WATGACGACTGCNGATACAGT ATTAAAGAAGACTGG 
ATGAACAGTACCT 
Sequence 58 

CGGCCGCCGGGCAGGTACGCGGGCTATTGTGATTCCO^TGACCCATA^ 

actagtcctatgac^tgtgactgggcttgggaagttc^gtgtcagktccaaaaatccta 

AGGTGGGATCTTCGCTTTGTGAAGCAAATTAATTACACAACCAAATATTGCC^ 
T 

gaggtctattgacacaatgggaacttcaacccctacttagcttagca r run 1 1 1 i ca 

A 

GAG TGAAAAG T G GTCCAC G T AGAGCACAAT AT AATTT AAGT AAAGGAAGATT AAAACAT A 
TTTTT ATCCATTTCTT ATG GTGG N N N NATT ACATG TTTT AGATTTGAG GTC CC CC TC T C 
A 

GGAAAACC C TTTCAAC TTC G T ATTATTC AC TCCTGAGT AG T ATGGGG GT AGAAAAAT GAG 
TGGGAAATCAG TTTG GTCCACT ATTTTCCCGAG TCTTTC TTG CACTTG CAAA T AC TTT C 
A 
TCAAATATTTTACCAAAAATTCTCAw^^

GGGCTCAAACC AAAG^f^ACAGGAAATTCTTf^TGGNGGGTTTT^TATCC TGGGACNATTC 
TAAATTTTAAAAAACCT AAAAAAGG TT ATTT ATTTC TTGNCHAATTTATTCAWfTGNTTT 
TTTAAA 



CACGCGGGAAAGATCAGTTGIfrTTACCTTGGCATTCAAAGA C T T 1 ICT I TG ACTCCCATG 
GTTCTCAAAGCGTGATCCTGGTCCACC^ 

CTGCA AATTC TCATTTJCTCCCTAATTTTCTGAATCANA ' 
AATCTGTTTTAAO^AACTTCCACATAAT 

GATCT AGTTTGTCCCTCTCATTTTGCAGGCAAAGAATTGAA 

<X i TO iwCAGTCATACAGCTAGGG rCTGTTTTCI A TTATTTATTTATTTATTTATTTTTA 
TTTTATTCACTTTACCCCCCAGGTATTCATAGN 1 1 FCTI IC T AAATACTCCATATTTGGA 
CTTGA C Mi l 1 ACAAOTTTCTAAT TACCA AATAAAGTCT AAAGATGGGGAAAGGTTGTGG 
GAAAACTTT ATAGAGAACATGAGATTTTGACTGAACC^GT>*AACATTAAGTAGAGAG*IAA 
AAAGAAAGGGGTGTTCTAAAGCAGTAGCGACCACACT 

ACATCCTTTTGAGG TCCCACGAGAATATATAAGAGCTTT AAATGCT ACCAAACTGGAACG 

AGTATTTGCAAAACCATTCCTTGCTTCGCTCG^ 

GGCAAAGCATCCAGAGAAGCTGGCTACTGTCCTTTCTGGGGC^ 

AATTTGGAATCTAACTCAGCGGAATTGTATCCGTACCT 

Sequence 61

T^ACTCCCGCGGTGGCGGCCGAGGTA1>CGTTACT 

GTTCTGCCCTTTGGAACATCTim^CGGTGTrcCTGTG^ 

CT 

TTATGTAATAA > C 1 U u CAATAAATAATTTTTAAAAGGACACAACAACGCCGCAGGTGAT 

CTGGAGGCTCCTGGAGGACCTCAGCGACTCAGGTCC^ 

GCTGAAGGATGGATCCACATGTTTAGAGGAGATCGAGAAATG^ 

G*GAAATGCCACAGAAAGGGGAGCTGGAGAGAATCAAAGCATGAGAGG^ 

TGTCACTGGAAGGGGTCCAX^TGGAACGCTTGAGAAGAAACGTGTGTAGCATCTAGGAGT 

AAAGACTCGCCCTGGCT GACAGCTAGTAAGGAAATGGGAACCTCAfrTGCTGCAGCCTCAA 

AGAATTGACTTTAA 

rrACAATGATGGCTGTCAACTTG t, fTTCTn A AAAAAAGACA 
ATTTGAGCAGGACGACCCTCTCCAATCTGGGTAG^ 
TAGGCTCGGAGGATGGGTACCT 



TGAGTGAGCCT AACTCACATTT AATTTGCGTTTGGCGCC TCACTGC£XX»C f 1 1 1CCAG 
TT 

CN GGGGAAACNCTG TTCGTTGCCCAGKCT GCATTTAATGGAAATCCGGCCAACGCCGCCG 
GWGGNAGGAGGGCGGGTTTTGCCGTATTTGGGGCGGCTCTTTCCCGCCTTCC^ 
TCAACTTGACTTCGGCTTGCNCCTTCGGGGTCMGTTrrCTGGCTTGCCGGGTCG 
GG*TArTCAANCCTTCAACTTCMAAAGGGGCCGGGCI^ 

CAGGAAATTNAAGGGGGGGAAT AAACCGCOAAGGGAAAAAGGAAAACANTTGTTGCAAGC 



GCAGCMMCCACCAGGGGACCCfKMSANCCCGGGCGACGCCNGANCCAA 

NHAAACCTNHHCHNTTGAGCGGNGGNNCNGHCCCGCGAGCCCGAGGAAAAGGAAGCCCAG 

CttGGAGGGGCGGNGGAN NGACGCCCNC45GGGGGCACAAACAACCN NCNAAAGGAAGAAN N 

NGCCACCCACCAANCCNNANCAA>OtCAACA^ 

GAGNAAAAAAAAAA 

ATGGACAGTGTCAGTCTCCTGATANGGNGG 

GATGGGTAGCTAATTTAAAAGCTTCT ATTATAAAATCT AGTCTCTCT G ACACTGGCCTG 
T 

CCACTGCAG TCACATCTTX^AATACTGAAGGATCCTGAGAATACCGAGCMGG TCAT GAGA 
CTT AC TCACGTCATTCACCAW 1 I I I If I GWACCTGOCCG 



GCGGTGGCGGTKTCCCGGGCAGGCCACGCGGAAATCCCX^ 

ATCCCAT ATTT AGGTTAGATAGAGAAGTGTGT ATGTGTGTGTGTGTGTGTGTTGCTCGCA 

CAGTGATGAACTGTAAACATAAATGAAGATATGGAAAAATACATCAATTAGGACAACATG 

AC^ATTTCATTAGACTCCTATCAAAGAGTATCAGTTCAC^ 

T 

AAAATTCAGATC TTGACTGTTTTCTGGGGATAAAGCANGGC TTT ACAATTTAGCAGT>Tr G 
NAGCTAGC TTGAAACAGTAAAACAACAACAGCAGAGCCTT AAGTGT A 1 1 1 I IGTG ACCTA 
AAACATGAACTCAGGGTTTCCAAATTCCTAACAA 
Sequence 67

AGGTACTTGAAGGATAAGAAATTAL PSTG1 CAAATT ACCCACAAG TTAAATGCCCATGTT 
CCAGACCTGTGGCTCTTAGTATCAGGCTTGTGATAGAGAAAAGGCTGCTATGAATTCTAC 
TCAG TGTGCTT AGAC CAAAA>GAAACCACCACAGGGATTTCACAGGC 



GGAT AAGAAATTACT GTGTCAAATT ACCCACAAG I IK H I G CCCATGTTCCAGACCTGTG 

GCTCTTACTATCAGGCTTGWGATAGAGAAAA 

AGACCAAAGGAAACCACCACAGGGATTTCACAGGC 



GGGCGATTGGGCTGGAGGAG»4GGaGCGGCKKCAGKAGGGGGGCGCCGGCC>*CCO 



GOGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTaCCCATTTCATCTTGCAC^ 
CAGGGATTGTTGCG AAGAATCAGTTGTGTTATATTGTCCAAATCATCAAAGATACCCTGA 
GGT AAATT ACTTAGGTTATTATTGGACATATCXAGTCGATAGAGCT GCCTT AGATAAGAA 
AAAGCATTTGCGGGCACCCGATTGATGTGGTTATCTTGAAGATAAA TCAGGTTT 
GTGCCTGGAAGGTTT ACTGGTGCAGCAGTCAGGGAATTCCGCACCAGG^ TG TC 

AAATTAACTaGGTTGAAGAAAACI I T G 1 CACCT AAACCATGATTG TTCAACAGGTTTCCA 
TCTAGAACCAGGCGTirTAGACTAGTGAGAOCTTGAAGAGATGGTG^ 

atgcgattatcatccaagcgtag rrcnrci atactcctgggcaaaccccagggaattgtg 
ctaaggt gatt acgggacagga a aa g cagtcggagatagttgctgtctcggaatgctccc 
tcttjoatgctaactgcagagacagagttgwcatct aaatgtaattcttccagatagg 

Sequence 70 

^TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGAATAAAAGGCTTTG 

ATCTCTTCCAATCAATCAjCACAGAGCTTGCCCTGATACT 

C T AT AT AGTTT AAGGTTTCATGTTGAAjCACCACTTTCAAGAGCT 

AT 

CTTTCACAATATGCTGGACACTTTCAATGTAACXAGACTTGAGG 
T 

CTTTTAAGGTTTCCTGGGGTGAAAGTATGCTTTCCAAGGCTTCGTGGAACCG 
GTAAAAAGACGTTTGAAGTGTATTCTTTAAAGCCATCTTCTCCCAGTTCCAGAATCATC

CGCTGTTTCCACCTCTCCAACAAAGAAAACCTGTTGTTTTGGTCATGGTCTGCTGAAGGA 

CTCGGGTCACACTTGGTATCACATTCCTTTGCAAGGGGATTTTCAAA 

Sequence 71 

AGGTACTTGAAGGATAAGAAATTACTGTGTCAAA7TACCCACAAGTTAAATGCCCATGTT 
CCACACCTGTGGCTCTTAGTATCAGGCTTGTGATAGAGAAAAGGCTGCTATGAATTCTAC 
TCAGTGTGCTTAGACCAAAGGAAACCACCACAGGGATTTCACAGGC 
Sequence 72 

GCGATTGGAGCTCCCCGCGGTGGCGGGCGAGGTACATATATCATTTATTCAAGAGGCAGA 
TTTTAAACGi 1 1 1 1 GTAAAAAGCTAAATAACACCCAGAGTGACTCAAAAAATTTCTCAA 

tttgcccaagtgaatagtaagtctagagttttttggg i rrnrm i g tgacagagttt 

TCTCTGCCGCCCAGGCTGGAGTGCAGTGGCGATCTTGGCTCACTGCAACCCCTGCCCG 
Sequence 73 

GGCTOT ^GGGGGGCAACACCGANCCGCAGAGNCACA^ 

TGGGGGGGGGAAAAACGCCGGCCCCNCCNGNCCAGCNGGACCATCNATTTNNTCCNCCNC 
C N CGGAGCN GCNCCCNAAAAAGCNCANAACAGNAGAGAN CAGN NGMCNCNGGN NGCAAAN 
CNAACANAI^NCAWGCAANGGAGGNGTMhK>ICCATGCTTT^^ 

CNACGOtCCCNNGAAGAAAAAAACGCCNCAGNAACGGGGGGGGGGNAGGACCCAGCCNGG 
GCGGNCGjCNCNAGAACCAAGMGGAACCCCCCGGGCCNGCAGGAAANCCGAAANCAAGNCN 
HANNGAAACCO^NAACCNAGANGGGGGGGGNCC 
Sequence 74 

CCGCGGTGGCGGCXXSCCGCGCAGGTACCTTGTGAGAAGAGG^ 

AGATCAGAGCAT AGTAG AGAAAGTAGCCCTGTAAACAGAGGAGAAGCAGAAAGAGAGAGG 

^^^GAGAGAGCTTTTATTTTGCTCGAGGTTAAAAAGAAAAAAAAAGCAGATTCAACTCT 

ATGTAGTGTCTGTCCCAGGTCCTAGAACTGGAATAGACCAACCAAGCCCAACCCTTCTTA 

AAAGTAAGACTNGGTGCTTCCTGATTATATATTCAACTGCCTGGAAGCATGCAAGTAAAA 

TTTCCTrGATGGCATTTC TAAGTTr CAAACATATTCTTNCTAACAAATGCATTTACAAAA 

AAATATTAGGGATTGNGGTTTTTTTGCTTNGGACT^ 

AT AATT GGGGGC CCTACCCCAAAATGGATTCTTC T C CCC T ACAGGTGGAGGGTTTCATTT 
TTTC 



GCGGCCGAGGTACGCGGGGAGGCGTTGTGGGAGGAGGTGCGGGGAGAGAGGAAGGGGCCT 
G TGCACTGAGC N GGCATCAAAC TATT AGTGGATGGCCTTGCG T CTCAATCTGCAGT AAAN 
AGGAAACTAATCTGAAAGGGAAN GANAGGAC TG TGTG N C TTTTTATTTTTTAAAATACGG 
AGTGTGCANTTTTACTGAATCTTGAATCATGCCC 



CTTGGCCCTTGGNTCGGGGGCCrmTNCCCCCCCAAG<X^TGGG6NCCCNTGGflGTANGT 
GTTN GNGGGCCCAAAT AN GAGC GGAMAGGTTAAAAN C NAAGT AACNAACGAC C G T AATC G 
TTGTAGTTCCAAATGGGGAAATTGGGGTNTTTTCGGGNGGAACCTTAAGAAAGNGGCCTT 
CCAAAATTGGNGCTT>tGGGGGGAAAGGAAAGGAATTCCCCCCTrGGCCAANAAAAAACNC 
CCACNCCAAACCCCCAAGGAAAACCGGTTGGGGNTTTTTTGGGCCCCNTNGGAAAGGGGC 
NTN G ^GA TACCTTGGGNANGGAAGG^ AAAAAATGGAATTTTCTTGGGGGGGGGGCTTTG 
G TTCTTTT AATTG N AAAAAAANATTN AATT AAC GG ACC C ATTTTTN C TTC N AACN AAATT 
AAAAG GCC C C C CAC GTTNNTTCAATTCCATC CCC CAATTTTTTNTCCCCTNCCCCTTTT 

TTANCCCC 1 1 I II II I CTAAAGNATTGGGCCAAAGNNTTNTCTTCCNTTTNTTTTNCCA 



CCNATTTTrMANGGGGGCCTTGGGGrrTTNGNGTTWTrCAANAANAAC^ ll l ll l l ll 
GN 

GGGGTAAGTCCCNACCOSNGNTANCNTTGGGTNCAAGNT^ 
AAAGGCTTGGNGGTTTCCAANGTCCNTCCAATTNTCCT" 



Sequence 77

AAAAAGNGAATI 

tttggcncaggcccctaaagccttacccrtggccaagttttt^ 
caagngggttcaaccttttaacccccttgcttggttcttggaaattggtcfn'cc 
ggggaaccaaaacaagggniagggggcntggccao:ttcaacttgggc^ 
agaaccaggaaaaggaagggggaatccattccggggaccttgggaaaagncctccttggg 

CCAAGGGGGTAATTGGGGCTTAGGCTCC^GGGGTTTNA^ 
AAAT TNGGCAA GNAAGGGGGGCCCCAACCCTTGGCCCCCAAGCCWTT^ 

at ggm 1 1 1 itcccccaaqgggaagaaaaccaaaggggaaggggctttggi 1ut ttcccc 
acctttggkaccaaggtttttcaagnaccaagggaaaaggttgggggaaaacc^ 

T GGOGG MA CCC CCGGGGAAAAGNCC TTCNTTANNCCAAAGGNTGGG TTTTGGCCCCCCAA 

CCGCTTGGGGGCC7TAANCTTTANAANTTGGGAAGGCCCCTTTTTGG 

GCCCGGAAAAAAACCGAAAATTTAAAAATTTCAAAAAAGGGAAAGGCCAAGhrrrTTCNTT 

GGTNCCCNAANAAGGN 

Sequence 78 

TCCCTTT AAGTGAGGGGTTAATTGCGCCGCTTGGGCCGTAATCATGGTCATTAGCCTGGN 
TTCCTGTG TGGAAATTG TTANTCNCGCCTCACAAATTT N CAACACCAACCATTACGGAAG 
GCCCGGGAAAGNCATTAAAAGTTGGTAAAAAGCCCTNGGGGGGTGCCTAAATGGAAGNTG 
GAGCCTAANCTTCAAGATTTTAAATTTNGCGOTTTGC<X3CGTTC^ 
TTTTCCAAhTITCaSGGGGAAAACCXJCTTGTTCCGGTNGCCC^ 

GGAAAATC GGGCTCCAAACG NCCCCGGGG N G NAG AAGGGCCN GGTTTTTGGCCG GTTATT 

TTGGGGGCCNGCCNTTCTTTTNCCGGCNTT 

Sequence 79 

GAGGTACTTTGGGCCTCTCTGGGATAGAATGTTATTCACGCAGGCACACGAAACAAGAAG 
GGCAAGTTTCCAAGGATTTCAACCTGCTTCAATCAAGAATGGGGCGGGGGGGAAAGAATG 
AAAGAAC CAGG AAT GGGTGGCCAAGGCCCAC CAGGTTTCGTTTTTNGAN TCCTCCCACCC 
T1TG GGGTT CCCCTTCCCGGCCCCGAAAAGGTGGAAGCCCGNATGGTCCCCTTTCCCATA 
ATTGGTTTTAACAGGGTAAAAATAACAACCTT^GCAAGAAAATNCTTTCAAAGGGCCTCCC 

aagncccttgc^^tggaattgggttggaag^aggtggaaaaggttc^tggttcccccaag 
nacccccaccttggcccaacttggaaacgccttggtccttggccgaattgntccaaggtn 
ggggccccc^^rtggttttggggaattggtaattccaag^iaaggaattgnaagngggaagc 

G NAANGCCC CTTGGGGC CCCAAG GGGTTTTTTCCTTGG GCNTTGGGG tt 
C TTTCTT AAGAAAACCCTAAGGTN G 
GGGGAATTCCCCCCCCCGGGGGCCTTTNGCNAGGGGNAANTTTTCNCAATTANTTCCAAA 
AGNCCTTTAATTCNGAATTTJCCCCCGGTTTNGAACCCCTTTTGNANNGGGGGGGGGGGGC 
CCGCCGGGGTTNACCCCCAAAGMCNTTTTTTGGGGNTN CCCCCNTTTAAANTNGGAAGGG 
GGGTTTAA 
Sequence 80 

TGGCGGCGATTACTGTGCGAGAGGTAAAGGATATATGTGGCTACGATTACGGCCTCTCT 



A TGATATG CAGAGGGAGTTGGGGCTTTGCC AAAGGGTAAGCACAAAGAAGGAGTCAC GGG 
TTCTGTTCGAGGCACTGTTGGGATTAGGAGCCGGAGGGGACCT ACTTTTGCAGGAACCTA 
GCATAACTTTGTGTGACGAGACTGCACAAGACAAAGCTCANGCAAGTGGCTCAGTAGTTG

GCCAGCCCAGCAGGGTCCTCTGTATGAGTGTGCACCCAGCTGAAGAGAAGAAATGGAGAG 

CAGCAATrGGAGCTTNAGGACCGGCTTGCACTGTGGCTCCAGGTTATACCACCACTGCCC 

AAAGCAAAAGCTAGAGAAGCAAGTGGAGAAATGCTGGGAGAAAGCTG 

Sequence 82

T GGCGGCGGAGGTACGCGGGGGAGTCAGTCTCAGTCAGGACACAGCATGGG 
Sequence 83 

CGAGGACCTTGTTGCAGCTCTrTATTrCTTAAGTCCC^ 
CT 

G^TTTTTTAGCTGTTTCCTCTAGTGTAGGTTCACCTWGCTAATTTT^ 
T 

AACCACCGTTACATACTACAAAATATCACTAT ATTATGAC CATGATTATATTTTNTTTTC 
Tm I CCCTTCATCAAGGAAGTTCATGAAAGAATrrCATCAAAGTTCAATGATGACCTC 

TTTTAAAAI III U I T AGTATTCTATGTAACTATCACCGATCTTTTCCCCACAGACTTCAA 

GAGGCTTrrTTAAAMATAATNTTITACATAGGCCNT^ 

CTNTTTT 



GTGGCGGCCGANGNACTNNGGCCTA1 
ANGCATNNTCCATATTf*»IAACNGC TCATCAT ATGGNGN NAANATN NNGACAGAN GGTGCA 
ANCACNWTNCACT>IGATATACNCCTTG«^ 

CCCCCCCGCGGCCTGCAAGXSGAAANTTTTCGAATAATCAAAAGCCT 

CCCGNTt^lGACCCCriTNCGAAGTGGGGGGGGGG 

NTTTATGGTTTTCNCCCTTTTTTAAAGATTGNAAGNGGG 

CC 

CGCCTmGGGNCNGmTAAAATTWCAATWK^TTACAATTAAA^ 
TT 

C CCGTTGGTTGGTTAGGAAAAAAATNTTN GATTTT AATT ACCCNGGCCTTTN CNAACNAA 
' » ' I CTT W^CCAACCCAAAACCAAATNAAACCTNAAN TCCCCCGN GGGGNAAG N C 
CNAATTAAAAAAANGATTTGGTT AAATAAGGCCMCTTGGGGGGGGGGTT 
Sequence 85 

CCGCGGT GGCGGXX^GAGGTACTTAT ATTACATTATGCTCAAATGCAAACACTT ATGC T AA 
ATGTTATATTTGGGAACAAATTGTGTAAATATACTGATGACGTCAATGGATCATTACAA 

TAATGTAGGTGCCGTGGGCAGGAAAGCTAACTTTANCTGAAAGCATCTNNAACGTGCTTA 

TTTTT(^TGGGCCCTCAAAGGAAAGGGATGAGGCCAGCCATAAGGAANGGCTTGGCCAAA 

TATAGTTCTT GTTTGTCAAGAACAACAAATCXiCATTTCACAACAGAACT AACGCTGGCAT 

GCCATTCmT<XT>uVGGTTCTTGGCGTGCAGTGAGCGAX5GCC^ 

TCATTCCTTTG 

Sequence 86 

CCCCGCGGTGGCGGCCGAGGTACATCCCTGTTTATCCCATTCCATCCACCGAGGCCCAAC 



TGACACTTTACCATATCTGGCAACAAACTGTTTGCTCTCTTCTTGGATTTCAAATCCAC 
C 

AGCTTTTACCAGGGCCAGGGCCAGGCCTCCCCCATGCAGAAGATCTTCATTGGCTGCATT 
CACCACAGCATCAACAGCATGTGTGGTGAGGTCATCTTTCCACACTGATAACTCTATCCT 
AGGAGTCAGCAT T71 ICTGAACACTTGCAGAGATTTGCTGTTGCCTTCCTGAACTGGAGA 
GACCAGGGTAGAGATACAGCCAAACTTATTCTGGAGGACTTCACACAGCTGACGCTCATT 
ATTTTTTAAAATTTTAGAAGTCATTGGT GGTTAATGG 
Sequence 87 

CGOTGGCGGCCGAGGTACTCTTCAAAATTGTCAAGGTCATGAAAGACAGCAAAAAGTGAA 



GAATTCTTACAAACTAGAGGAGACAAAGATTGGAGAAGAAACAATGACTGGCTGGGCACG 
GTGGCTCATGCCTGTAATCCACTTTGGGAGCACTTTG 



CNNCATCCCfn"ACCTATAATAAATACCAGGAAATTACGCCTTGGGGTCGTNGGTTGGNTG 
ACATTGCCCTTATTAAATNCCCCAGCCTTACCTTTGTGAAAGGGCNCTTC^ 

GCNCGAAAAAAAATTTGGCCGGCCATTrTGGNCAACNTCCCANGCNCCTNGGGGGCAANC 
AAAGAAGCCGAA 



GCCCANAAAACCGTAAAAAAGGCCGCCGTTT^TTGGCGTTTrTTCGAT^ 
CCCCTTGACCGAGCCATCACCAAAAAAATTCGACGCTCAAGG TGAAGAAGGGTTGGG CGG 
AAAACCCCCGACCAGGGAACNTATTAANAGAATAI 
GAAAGGCTTO 
NACCCGGGNATTAj 



TCTGAAGGTCAGCATGCTTACGCTCGGCCTCACATGTGTCAAAGTGATTCTGGATCTCCT 
T AAGTCGATCCAACATCTGGAGffTGCTGG TTTTCCCCATTCTCCAGTTCACG TGTTAA 
AT 

TCTCTACTTGTGATGCCAAATGTGCTTTCT>(CTTGTCTrTTCTTTCCATGCACCGTTW 
A 

CTTCCTTTAACT 



GCGATTGGAGCTCCCCGC^3GTGGCGGCCGAGGTACGCGGGATCACAAyAGCAGACAAACAG 

GAAAGACTGAACCATCTATTTGAAAAAAGTGACTTCATTCAATTGGTTCAGCCACCCG 

TC TGT AATCTC TCCA7TCTGCCCTCTTGATTTT AATGCAGCTAT AAAGGAGAGTATTTT 

A 

AAAGTGCCTCCCAGTAGGAAGAACAGTCACAAGGGACTGTTATATCAATTCAGTGTGACA 
CAAGCCCTGATTATTT AAT AGTAT AACAGCAG TGAATCAGAGTTCTTTCATC TG AC TTT 
G 

CTGACATTNCCAG CAGC TGNATATTTAATTCACAG TTAG GG GCTGGACAAAC T ACAGCCN 
TTGATCAGAATGGAAGCAGGCATCCTTGAGGTTCTTCTAGGAACAAATACAGATGTGGAC 
AAAATTTTCATTTATTCAGT 
Sequence 91 

AAGACTGAACCATCTATTTGAAAAAAGTGACTTCAITCM 

TGTAATCTCTCCATTCTGCCCTCTTGATTTTAATGCAGCTATAAAGGAGAGTATTTTAA 
A 

AGTGCCTCCCAGTAGGAAGAACAGTCACAAGGCACTGTTATATCAATTCAGTGTGACACA 
AGCCCTGATT ATTT AAT AG T AT AACAGCAGTGAATCAGAGTTC TTTCAT C TGACTTT GC 
T 

GACATTTCCAGCAGC TGTATATTT AATT CACAGTT AGG GGCTGAACAAACT ACAGCC ATT 
GATCAGAATGTAAGCAGGCATCXrrTGAGCTTCTTCTAGGAACAAATACAGATGTG 
Sequence 92 

CCCCANGAGGMCACCAAGCATCCCANCACCCTTNNTCCGGGNGGTGNAAANCCCANGGCC 
GCCAGGCAANGGCACANCAAAAMCCGGGCTGCGNCMNGAGCACNGGGCANCCCGAGAAAA 
CAAGGNCNC*AC*tACNGACNGCC*MAGAAGGGGCC^ 
GNNNAGAGCAATC 1 1 1 1 1 1 NGGGGGNGGAGCACCGGGACC 
CCCGCGGNGGOSGANATTGGGGGNGAAACCTIUMA^

GATTANATTGGGGGGNCTTAAANCCCCAGCGGCNNNNGACAGNTAATACACCTCACGTTT 



TTTCCCGGGCAGGNACAGCTCCATGAGGTGACCAAGCATC^ 

TTGCATGGCAATGGCTGCCAGGCAATGGCACATCAAAATCCGGGCAGCGTCTTGAGCACT 

GTGCAATTGAGTCAACAAGGTCTCAACTACTGACTGGCTAAGATGGGGCCTOCCCTTGGC 

CAACTTCACCATACAGTTTAGAGCAATCTTTMAGTCGNCTGAGCACCTGGACTATCA'FC 

TTGACTAGAAAGTACCT 

Sequence 95 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTATGATAACATTGCAGTCAAACATA 

TCTTGTGACACGACAG TTTTTTG TGGGGAGGAGAATT AGACCAAG TTCGGAGATAT ATTT 

TAGGAACTAAAAGGAACGTAAGATCTGGGGTACOGGGATGAGCAGCTCCACACCCTGCTC 

CTGTGTGAGCTGTGCGCTCCXUjACTGGGAAATGTCTAACTCCATC^^ 

G GGCC AGGCAAGGGGCTACTTCCAAGCXTITCATT^^ 

TATTTTCAGAAAAGAAACCAGTAAGGTGGGTTCA^ 

ATTTTAAATAATTTAGCCCATTGAAATGT^ 

TTTGCTTCTT 




GAG<>CTATCTTCAGAGGCCATCCAAGTmACCATAACAA^ 

^GAAGAGGCTGGTGATAGACAAGTTTCATGTTCACAACTTGAATTGCAGAGGTCAAGAGT 
TTAAAGAGTTTGGGATGGAAAGAAATCGAGAATTGGGCT 

GCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAGATGGCXTC^ 
GGTCAGTAGCACTT ACTGATGTAGTGGTTTGATACACACTGATTAC C I rCTTCCTTTTI 
T 

ATTCTC TGGCATTTCTCC TAT AT AAC T AGC C ACTTTT AAACAAT ATTTGTCGGCTC TTTT 
CTTC TGC TTGTCTG TAAAT ATT AGGG TTCC TGAG TCCTT ACC TAGATTTTCTTCTCTTC 
T 

TACTCCTGGCCTTTCCTTGGGAGAGTTCATAATTCACCTACTCCATCTAGATATTTGTG 
A 

tgtccaaacacatctccacgttaggcttctatttgtagcatcagacccacactttcaa 

CT 

GTCCACTAGATAGCCTCACTTGGATGCTCTGCAGGCCTAAATAACCTTTGCGGACAGATT 
AACAGGGAAAAAATATTAATAGGAAAAAATATAGATTTTTATCTGATGGTAAT 



TGCGTTGCGCTCACTTGCCCGCTTTCCAGTCGGG^3AAACCTT#GTCGTGGCCCAGCCTGCA 
TTANATTGAAATCGGCCAAACCGCCGCCGGNGGAAGAGGGCCGGTTTTGCGGTAATTGGG 
GCGCCTCTTTCCGCTTTCCTTCGCTTCACTGGACTCCGCCTTGCGGCTTCGGG^^ 
TCCGGNCTTGGCCNGGCCGAAGGCCGGGTTANTTCAGGCCTCCAGNTCAAAAAGGGCGGG 
GT AAATN AACCGGG TT AAT C CCACCAN GAAATTCAGGGGGGGAATNAACCGC CAGGGAAA 

Sequence 100 

ATATGTTrTAATATGCATATGATCCAGGCAGCATAATGTTATATTTCAAAGACAGATTTA 
TCCATTGAATTATTGTTTTTAAAAGTTGGGATTCTCTACATAGAACATATTTTCTGAAAT 



TTCAAGAATATTTTCAGGTAAATTAAGAATTAATTTTrrTC 

CAATCTATTCCATAATATAATCAATGATAAAGATTCACATGTATCACCAAATTCGAGGC 
A 

GC7TAGTTGAAAAAATTTGAAACAGCTT ACTGAATTCCATTTGCTGATTCTGNGGGGGCT 
TCCCCAATGGCATGNGTGCTCXTTTTGGATGCCTGCAGGGGTGGTC 
T>ITGTGCCACTGGGAG TTGGGGAGGCGGGCTGCTGGGG I TCCCTGGGT 
Sequence 101 

ggccgagcccaattcttga mem LC ATCCCAAACTCTTTAAACTCTTGAGCTMTGC 
A 

ATTCAMGTTGTGAACATGAAACTTGTCTATCACCAGOCTCTTCTCTGCA 1 1 CICI I ICC 
C 

TOCTTSNGTACTCCnAAAACT^ 

TGCAGGAAAT 
Sequence 102 

CGGGTCCA T AAT AATGCAATT AACAAAATCCAGGATTT AAGCyVTTTKT AT AAGATT AAAA 
AAAAAATGAGGTGG TGTCGAGTGGGGAGAGA A AAAAGCAGGAAACAAAACTGGTGAGAGG 
AAATGACCCCCTGATGAAAGATCTTAAACACCAGGCTGAAGATT^ 

GATTTGAGAAAGATTCTTGAGAACTCGTGCATAGGAATGAACT 

TGTGAGCAG 

ATTGAGCTCCCCGCGGTGGCGGCCGAGGTACTCC 1 1 IC1 1 GI 1 1 A AACGOCTCACCACTG 
AC CACGGAACGTCTTGATAGAGCCATC TAG T AATTCTT AAGTCCTACCTCATCCAACCTT 

TGCAGGAGACTCTCAATTTCTCAGTCCACATCAGCTCTNAGACC^ 



Sequence 104

TGGATTGAGCTCCCCGCt^TGGCGGCCGAGGTACACGTCAACACGGGTGGTTGCATGCAT 
TCC TCAAGTCTGT ATGACTCTACCAAGATACTGTGAAGTTGTCCTTCT GATTGCACAT 
GG 

GGaGAAAATGCTGAAACTAGTGGCCACAGATGTCTTTAATTCCAAAAACC 
Sequence 105



TGCTCCTAATATCATAAATGTTTACTTTCCGAAAAGTATTTCTGAAAGGGAGCATATTT 
T 

GGAAAG TGCAT AGGCTTGT AATCAT ACTTG TTTTCAAGTTT CAACTTTGC TATTCAAC T 
A 

GAATAATCTTGTGCAAAACCTGAGCTGATTTTCTCATCTATAAAATGGAAACAATACTT 
T 

CTGTGATAATGGGTGCAAAACACAAGGTATACTGGITTCTTTGCTCTGGATTCAA^ 
TT 

CTTCTTAGTTTCAAAATTTTAAAGGGAAACCAAAAATGTTTCATGGNCGNNMCTN 
NANGGGAN TTTTCCC NCNAAAAAAAAAAOTCAACGGGGGGGGTTTTTNCCNKTGGGGAN N 
CCCAAAAAGCCGN NTNTNGGC CAN GTTTTT N NGN N NCTTTTTGTNAGGGG NTTTNGGGCC 
NCCC T GC TTT AC CCCN TTTTTT ANATAACN NCC CCCCC TTTTGGNNTNGGGGNGGGGNNT 
TATATATNTTTNTGGGGGGGG 
Sequence 106 

GTAGTGGGCAGCGATNAGGGCTGGGGCTCTTTCCTGAGTTGTGTCAAGGTGAGAGATTGT 
GAAGAACTTGGCTTGCAGGG TTTGGGCATCAGCTGCCCATTGAGGGGCCGTTCATTGTC T 
CAAAGTGAATGTGGGGTGGTTTGATCTGCATGTGTCATTTGTATCCACACAAGTTAATTA
TTCTGCTTTTGTTGTAGTACCTTGGTTGTGAAGCAGAAGCT ACCAGGCGTNTATGTGCAA 
GCCATCTTATCGCTCTGCATTAAGTAAGATGAGGArrCACTCTTAA 
T 

TTAGT1 IX 1 1 tXACACAAATTTAAGGCCTTAACTCTTNATTTTTTCCTACAKTGGNGGG 

TTTGGAAG TAAT ATTCAT ACGGGCATGGGACCT 
Sequence 107

CAGAGAAAGCTTGCCAACGGTGATAAGTAGG r 1 1GIC1 AGCAGCACTGATGCGTCGTGGA 
AGTTGATGGTCATGAACATACAGTGTGATAACCTATCTGCC^ I III U 1 ACT 

GTGCTATGTCATTTTGGT ACTAAGG TAGGTGAATTTTCCAAG TGTTCTTGGAAATAAG 
GA 

AACATCAAGAATAATGTAAAAGCGTCATATACAATAATGAATAATAAAGAATAATGTGAA 
GGCTTGATTCAAGGTTGGGGTTTGCCAGATACATTGCA 

GTATTTAGGGATAGTGGCCAAAGTATTGT/^TGATGGCTT ATGGGAGT GTCAAGCTGGAT 
AAAAGAGTGAAAAATGGAAATAAAAAACT AATGGGATTGGTT CHANT CCGAAA T AGGCAG 
ATGGCNCCCATNGCCCCGGTTTNAAATTAGGGGG 

»*CCGGAATGGAATTCTACATCAAGTGTCTGTGCCTCt^ 

CAAGGTCATCIT IGT I gctgaacagggctggacctgtogcacttaagcacacttaaagga 

T^CTATTCTTC^TTCAGGTtXaXGAGAGAAATTGGCTCCTTA rTTTTCTTI A CCT ATTC 

TACACTTCCI 1 1 1 G I C I AGAGCCAGTTTTGCAAAGGGCACTTTTATCCATCTCAGTTAT 

CCCAGAGGTGACAGAATGAGTAAACCATATGGGGCAAATAGCATATATGAGCTAAACCAG 
NTAACTGTTAACCAAGGCACATGGTCAATGCCTTAGTA i I H 1 1 M 1 1 1 1 IAATTCTTCC 
TAAACGGTTATTTTCTAGCTGTACATTCCCAAAA 



OG6TO0GAGACAC 1 1 C I C I GACTAACCAT AGACT ATGTGGAAAATGGT AGCTGGATTGCC 

TTTGGGTGGAGTCCTTGCCGTGTGGCATAGGAAAJCAAAGGAAAGGAGA^ 

GAiSATTAATGAAAATTXTTCTCAGCCAAATAAAATCTAAAAATAGCCT 

ACGCG TGGCCCCT AAGGGTCCTAAAGAGAGAGCT AGGGGAGG TTCAGCTGGCCACAGAGA 
TGCT AAAGGTCAGGAGCAGACTTTTAGGGTTTGC TGTTTT ATAGGTTT AAAGACCAGGTC 
TGTGTTTTGATAACTGAACTT GCTAATAGCTGGCCACTTGAGTTGCTTCTTCCAGCTCT 

TGI 1 1 GI II I AAAT AAAGAGATTCAGCCAGTAAT AATGGGAAGAGCTGCAAATGACTTCC 
CCAG 

GTGCTGCC TGCACTGTGACT AAGACTTTCT GGACT ATCATCATGTTT AGGAGTT GATGAG 
ATTATAGTTTCATGTAAGTGTATCATTAGATGACAACTCTACATCrTrAGGCATGGAAA 

AA AAA1 > 1 1 1 CCTGGAAGAAAAAAAAGTGAAGATGCAACCTCCATTTAAACAAATTNGAT 
TG J lit I 1 1 GCT ATTAAGAAACTtXXSTGCTCTTTCTCXXACTCTATTATATTGTCAAAAT 
ACATCTGGAGACACTTTATAAACTTTTTCTCCTTTAAArrACCTGGTTTA 
<XTGTAGCCTGCATAAACGATAAAGGGTTAAACATA 
Sequence 111 

GO*CGCGGGATTGGCCGACGCAGCCATGGTA«;TCCAG^ 

COCA TAGG TT ACGCCCGATTCCTGGAGCTTCTT3GACTGAGGGCCGCGGTAAGCAGTGGTC 
TGGGCTCCCGC 



CGTGGCCGAGCGGTTTGCATCGCCGCTCGCGCAAGGCCATGAGGTTC 
ACGCATCGATGGCGGCACGGGCCTGTTCCGGCACGTA) 
CCAGCAATTCGCGCGATGGCAGGTGGATCAGCAGCAGCTCATl 
AGTCAGGCAAGGTCTCGCGCACTTGCACGCCGGTGAT( 
TTTCCAGATGCTGGAC GTTGAC TGTGGTGAATACGTTGATI 

TTCACCAGCACCAGTTTTGGGCTTG 
Sequence 113

GCGGCCAGCCAGACTGGACXXCTTAGCXrTCGAG 

CGACT GCCCCTGACATGGTGTTGGATTCCAGC TGCTG1 GGCCCTGAAGGTGGGTGGTGGG 

AAGAACGGGAGAA TGAAGCCAGCCTTGGGAGAGGT AGGACGCCAGCCCCGCCCAGCT GCT 

ICCAGCATCTGGATCCAGOCTGACCTGAAGCCAGCCACCTMCT 

GTMAACACCXaAAACACAGOGTTTCTGACCATTGCAACC 

T 

Sequence 114 

TTGAGCTCCC^GCGGTGGCGGCaSAGGTACGCGGGAAGC^ 

CTGTGTATGGCCAATCCAGATAAATAAGACGATCAAGTCTTT ATGAAAAGGAAAGAAAAA 
TTTGGAATGCACA TCTCTGTCCAGC TCAATTCCTCACT CCTTTTTT AAGATGGAGAGCT 
G 

TTAGGTTTG TCTACACAGTAGGAAACACCTGATTAAATAAGAGGATGGAGCCAATCTTGA 
CAAAGAAATTGGCTGCATCCAATAGAATCCCAGGGCCGGTCGTGGTO 
ATCCCAACACTTTG 
Sequence 115

GGCCGGAATCGTTGCACCAGACMAGGCCCCC^ 

TGGATTGGAACGTCCTAGGACACATGCCACGGCTrTOACCCAGtSCTGGGGGTCC 
CTCACTGGGGTTAGTTGGTCX3GAGGGGGAAGCCCX^TGGGTCX>IXAGGATGAGGTGT^ 
AACTCTATCAGGGTACCT 
Sequence 116 

GGGCCTCGTCGCTGGCGGCCAGCGAATTGGTGACG 

GGATCTCCCGCATCACCTCCAGCCCCGTGGCACCCGGAATCACCTAGGG 

ATCTGCTCGACCACGTTGTAGCGCACGCTGTCGACAT 
XiACGGGGTCTTGCCCGATCACGCGGTCAGGCGAATCG 
GCAT ACGCCTCGGCGGTGGTCCAGATCAGGCCGAGCTTGCCGGCGTrr GAAGGTCTT CGA 
CCATCGGGCTGTA4 
GTTTGGTGTO 
Sequence 117




GATGATGAGCTCCCCGCGCTI 
AAAAAGACAAAAATACCTCCTACAGTTGTTATCA1 
A 

AATTTGTCTTCTACCTCAGATCTAAACCArrTGATAACAT^ 
T 

CGT GGCCC ACTAAAACCATAGCAAATGTTTTCTCCCT AGGACACTATCTGTTTTCACAGG 

AAAATTTTTCT CATAC^AAAACTGTAGGAAAAGCCATGGGATGAGCTGAGAAGACCAAAC 

CTATCTCTTGGAAAACAACAGTAGGGAGCGTKGGATTAGGAATGTCTTTGGTGCGTG^ 

CAGCCAGACCAATCCTGAAACATC TTTC TCT GGGGACCGTAAGGCATGGAAAAATTTTC T 

ATTACACTTANGGAGGGCTTCTAGGGAAACAGGAAACCGACCAAAAATGGGAATGGGGCC 

TTAATTCAT TTTTTT 

T 

Sequence 118
tggcggatatggaggatctcttcgggagcgacgccgacagcgaagctgagcgtaaw5att
c tga ttctggatctgactcaga ttc tgatcaagagaatgctgcct c tggcagt aatgcc t 

tgtttggagatgacagtgaggacgagggagcttcacatcata6tggtagtgataatcact 
ctgaaagatcagacaatagatcagaagctttggagcgttctgaccatgagggacaatgac 

CCCTCAAGATGTTAGATCAGCACAGGTGGGATCAGAAAGCCCCTAATG 
Sequence 119 

GGTGGCGGCCGAGGTACCTGAACACCAGGCTCTTT^^ 
CT AATAT AAAACACACC GAGGC TGAAATAI 

AATGACTACCTGCAGATAGAAACCATCCAGGCTTTGGAAGAACTTi 
CTAATGAGfrTGCTGTGCCATTGTGTATGTCTGCAGATTTCCCCAGGGTTGGGATGGGTTC 
ATCCTACAACGGACAAGATGAAGTGGACATTAAGAGCAGAGCAGCATACAACGTAACTTT 

gctgaatttcatggatcctcagaaaatgccatacctgaaagaggaaccttattttggcat 

GGGGA 
Sequence 120 

GTCGCGGCCGAGGTACCCGAGCTACCAGGCTGTGGAATGAGACCGTGGAGCTTTTTCGTG 
CT AAGATGCCCGTTACGGAAACATCGCTGTCGTTTCAAGAGCT ATGGGCATTGTTTCACA 



GTTCCCAC CTT I CA GCTGCCTGGGCATTGATTGCTCAGCTACCACTATGACTAGATATGA 
TTCCATGTGCTTTTGACT AGATTCTTTGTCTCTTGTGTATGGAAAGTGAGACTTTAAGT 

ATAGTT ACTGCTGAGAGAAATAG AAGAC GTGACAACGTTTGCTTTCCCATTCAGTAGTCA 
GCGGTTGAATGGAATTATCTTCGTTTTTGGACrGACAGATTTGTTTT 

TCCCAAGCCTTACTATTCAAAGCAGAACCCTTCTGTCTTCTT^ 

CC^rrATATTCTGTTGTATTTTTTTCAAATAACTTATTACTATCTCAAGTAAAATTGTTTT 
ATGTTTTGTTTTTATCT ACCC TC TT AATCAGGGCAGGGATATGTCTGTTG TAT ATTTT A 
C 

TTTTCCCAAATCATAAAGG TTTTGGG 
Sequence 122 

CCCGCGGTGGCGGCCCGAGGTACACACTGGGATCTCCTTCACTCATTTTTTAACCCTGAC 
TGGGAC ACCAGAGACATGCTGCATC TTG T ATT AGGTGTTTCAT CTTGCAGAATGGC TGTG 
CTCCTGAAATATTTCCTGTGAAGAAAATTGTTACAATCCCATTACATCACT GGCTTTTA 

TATTAAATTGGAATGTTGGCTGGAAACAATTTTAACCC 
Sequence 123 

GCGG TGGCGGCCGCCCGGGCAGGT ACGCGGGTG tgcaac t gcaaaccag t aacc tgc tat 
ggccaattgtgaagagatggcagtctccccgtattgcccaggccggtctcaaactcctgg 

GC TCAAGCAATCTTCCCGCCCCAC TTCCCGAAGCCC T AOGATT ACGGGAG TGAGCCACCG 
CAXXXXAGCCAGAAAAAGGTTTCAAATATTGGAAAACCTTACTTTTTT 

TGCATCAAGGGGTAACAGGGACATT AGGCTTTTTTTCTC TT AGAC TCCAAACAGT AAGGT 
CAGAATTTATCAAGACATTACATAGGAGTAAGGGCACAGCCAGGGGGTGGTGGGGGGGAG 
GGACATTTTCCAGCA 
Sequence 124 

GCTCACCGCGGTGGCGGCCCGAGAAATGTCGCCAAACTGCCGTCTTCCCTCCTCGGCCGC 
TGGGACAAACACCCCACAAAATGGCGGCAGCGCCGTCGCCCTAGAATCCCCCGAGTCGCC 
TCTCCCCGCGTACCT 



Sequence 125

ATTCAACAAATATTTATGCATCAGCTACATGCCAGGATCT gtaatagattctgggtgtgc 
AGT AGTGATTACTGCAGAATGCAGACATGGTCCCTGCATTGTTGAGAGGGAGACAGCAAC 
CAAATAAACAATTACAAAAAAGTATGTAACTAATTAACAAGTGGGAGAAGGGAGTGGGAT 
TACACAGCAGAAGTGGAAGGAAGGGCCC^TTAGAGTGGTCAAAGGCTTCTTGAAGGTAA 
CATGTAAGC TGAGACC tgaagaaggatgcaaaagggccagcatgt aaggaacagagaata 

AACATCCCAGAAATAGAAAATAACACACAAAAACCT AAAGTCATTAAAGAACATGATCAT 

CTTTCAAGAACTAACCCTTGAGATCAGAGT agtttgattatagaggaaaagggtgagtgc 
AATGGAAACGTTAAAAATAGCCCAGATCACGTAGAGCTCTTAGCCTTTTGGTAGAAAAA 
Sequence 126 

GCTCCCCGCGGCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGA^ 
CAATTATGTCCGCCAAACCAAGTTGGACTTACAGAGAGTTCC^ 

TTTACATCXTTTTGAGGTGCCACGAGAATATATAAGAGCTTT AAATGCTACCAAACTGGA 
ACGAGT ATTTGCAAAACCATTCCTTGCTTCGCTGGATGGTCACCGTGATGGAGTCAATTG 
CTTGGCAAAGCATNCAGAGAAGCTGCTACTGTCCTTTCTGGGGCCGTGTGATGGAGANGT 
TAAAAATTTGGAATCTAACTC^AGNGGNAATGNATTCC^ 

ACT AGGGGGATCCCCCGGGGCTGNAGGGAATTCGANT AAAGCTTNNTTAflTCCCCGCCAC 
CWCNNGGGGGGGGNCCCCCNNCCCA 1 1 M 1 1 1 II I > fmTANGGGGGGNTAATNGCCCCC 
GGGGGAAAAAANNNANAAAAAI Mill I NTNGNGGAAAAATTTTCCCCCAAAANTNTNCA 
NNAAAAAAAAAGGGG 
Sequence 127 

GTGAAAAAACAAGAAAGCTGAGAGAAATCAACATGTTCCCAAGTGCTGTATGTGAACAAT 
AAATC TGAGACATACCTC TAAGGCTTTTC CAGAGACAAGAAGCTCTCAACCTGT AAAGAA 
TTCCTGGGACATGACTGAGAGCAATGAGAAGTCCAGTGNCAGAAGGTTAGCAGATATAGT 
GTAGAGCATACACAGATATACTATAGTTCATAACACTGGTGGCTTAGCTGTAAATCACAA 
AAT AGCAC TGGAATTATCTAGTGATCATAGCACATAGTCCAAGAAGAAAAAATTTTGATC 
TTGTCTTAAACTTTGTGGAGCCAGTGGTGAAATGAGTCACACAAAGATGCAAGAATGATT 
GAACCCAGNCCTCTTT AGAC T AACAT ATTC TTGGCCATCACC NCCAAT ATT ACAAT AAAA 
ATCAAGACCCATGAAGGAGCATACC 1 1 1 1 ICIGNAAGNAAATATTGNTTACCTCAGCTCT 
ATTGGTATTTGATGCAAAACACCCACATGCAATTTG^GATGAATAAGACATGGGAAGGGGC 
CAAAATGNNACTTCATGGTTAAGGAAAAAAAAGGAGNGGGAAGGAGGNCACCAAAGCNGG 
T NC N GNAATGG G T NAACTT GGGGCATT AT AN GGG GG N GCTTT AAAAT ACCATTTT 
Sequence 128



ACAGCGATGTTTCGGTAACGGCATCTTAGGACGAAAAAGCTCCACGGTCTCATTCCACAG 
CCTGGTAGCTCGGTACCT 



CGCGGTGGCGGCCGCCCGGGCAGGTACAGTCAACGGCCGAAAACCACTGAGCTTTTCCCT 
C TGCC TGGCACATATCCACTGCCCTGCCTTCCTTCAGCTGATGAACTCTTCATATGCCTC 
CTTTTG^TGTCAGTGGAAATGTCACTTCTTTCTAGAAGCTTCTCTGGCTCTCCCAGC 
CT 

GGC C CAGG GCT CCAGC T ATGAGC TTCCAT AACAC CCCTAG TTTTCC TCACATTGCC C TCA 
TAGTATATGGAATTTGTTCATTCAATTGCCTGGCTTCCAACAGATGCCAGCTCCAAGAAG 
GCAGGAGCTGCTTC1GGGTATTGCTTGCGATCAAGGCCCTCACACCCAACCTAATGCCTG 
GGCCAGAGGTAGGTGCTTAATAAAAAATTGTTTGAGGCCGGGGCGTGGTGGCTCACGGCT 

ataatcccagcact 

T 

Sequence 130 

gcccaagggggggcc aagcc acattatttgn ntggggcn n n ctgcccntttttn aaann a 
rTATNAAATAAACCCCCCCCNNNGGGGNGNGGGGGGGGG
GGGNGTNATAN NNG NAN N N N G7CTG^^>r^TTTNTCCTTTAATTCXa , iANAAATAAACTT 
GA 



AAGGGCGGNTAATACCN 
Sequence 131 

AAACAAGCGACTTTTAGCAAATTAACGGTTAATTTTAATG 
C 

AM CI fAACGTTITATAATTAGAAGCTGTTATATAATTAGAGCTGGACACCCACATG^ 



TGT AGACCC TTGGGAG TTGT AGGGAGTTAAAGCTGATCATTATATACTATTATATACTT 
A 

GG^TACAACCCAAGOGCAACCCCTGGCCTTT ATGAAAACCTGGAGTGAGTTATTATTTC 
CTGCTAATACAATTCTCTGCCAGCCAGTTGCTGCATCAAAACAG 
TAAAGTCACCACTTCCTCATTCTGGTCCCCAATAACCGTATAAGC^ 
GACCTCTGCCCTGTGAAGGGTTGGGCTC 
Sequence 132 

CGCGGTGGCGGCCGAACCGTGGTGGCCGTGATCGTG 
ATGTTCTGGGCGGCCAGCACAATCGCCGGCTTGCCGACGATJ 

CGCAGCCGCGTGCCCCCGGCAATGAACTGGATATTGCT 

AGGGCAAGTTCCt^GGCAAGATCGGCGCGGACGCCGTCGAGGAACGCCAGACCG 
Sequence 133 

CG«TGGCGGCtX5AGGTACGATAATTCATGCCAATTTCTTTGGGA^ 

TAATAGGTTACAAAGCAAAATTGAGATGATTTTTAAAATGCCATGCAGTTAT^ 

G 

AATAACATAAATTTTAAACAGAGACCTGAAAAAAACCCCAAAAGTATT 
CATAAACTCAATAGAAATAATTTAACTGCCTTCTCTTCACAAGAGGCAATCAGAAGGCAG 
GACTATAGTTTTCTGTG 1 1 Id 1 1 1 CGACAGGAGAGATAATTACATTTCTAGAGACCCA 

AGAAACAATTCCAT AG TTTT AATTTC 
Sequence 134



AC TCCAC CAGATGT AAAATCACTTTC TAAACAAC TATTTGACAGAC TGC TCCACAAG T CA 

TCATTCTTAGCATTTCTATAGCTGAACTTCTTTAAGTACCTGCC 

CG 

Sequence 136 

AGCrTNCCGCGGTGGCGGCCGAGGTACTTAAAAGTATATCANGGGCAGTTT^ 
GAGCCAGGGAAGGCACCCAAGGAAGTGATGGAAGAGTAGAAGTTCACCAGGTGCAGCT CA 
GGAAAGGGCTCAGCAAATTTCTCTGTAACAGGATGCAGAC^CCGCGTCCTGCCCG 
Sequence 137 

GCCGAGGTACTAAATTTAGCAACTTTATTT^TGAG 

TTGTCCTTCATGCTTACATGGATGAACTGTCAI 1 1 1 1 G I C TCCAATGGAGATGGAGAG 

TTTCTGAGGAGTTTCTTTGCTTTGACATTC^ 
G 

CTTT AGCAATAGTGCATGGAGCGGCTGCTTA" r CTCCCAGACTTCTTGGACTACTTTGC 
TT 

TAATTTCCCCAACAC TCGAGTGAAAAATGGGAAATTCTGGGCAAGAAAGATTTTTGAAC C 

ACXXXGCATTTTAAATTTTTTNACCT^^ 

CNCACCGGNGGGGGNTCX>rrTTTGGGGGGAAAA^ 

C GGGNTTTNTTTTTCCCN NCCCNTTTTTTTTNTTNTANAAAAAAAA 

CCNCI 1 1 1 1 I INAAAAAATTTT 
Sequence 138 

TNCCGCGGTGGCGGCCGAGGTACTCGGGAGGCTGAGACAGGACAATTGCT^ 
AGGTAGAGGTTGCAGTAAGCCAAGATX^TGCTAGTACACTCCAGGCTGGGTGACAGAGTA 
AGACT CCATCTCAAAAAAAAAAGAAAAAAATTGACTTTGGAACC TCAGATT ACAT ATC AG 
TTTGCATACATGCT AAACAGAGAAATGTCCTCAAAATTCAGTTACTAAAAATTACT GAT 
A 

GCT ATATAAGCATAACTGATAACTGCTATTACAAATAAATATTCCACAAATTTGGAAAG 

T 

TATTAGAGGAAGAA I II I I 1 1 ICCTTGTAATTTCCAGGTGTTTATATTAGTTGGGCCAT 
A 

Sequence 139 

TTGGAGCTCCCCGCGGTGGCGGCCGAGCCCAATTCTTGATTTCTTTCCATCCCAAACTCT 
TT AAACTCTTGACC T C TGCAATTCAAG TTG TGAACATGAAAC TTGTC T AT CAC CAG CC T 
C 



GCCCTGGGATTCTATTGG^TGCAGCCAATTTCTTTGTC 

TTAATCAGGTGTTTCCTACTGTGTAGACAAACCTAACAGCTCTCCATCTTAAAAAAGGAG 
TGAGGAATTGAGCTGGAGAGAGATGTGCATTCCAAA I I III C I II CCCTTTCATAAAGA 
C 

TTGATCGTCTTATTTATCTGGATTGGCCATACACAG 

TCCCGCGTACCT 
Sequence 135 

TTGAGCTCGCCGCGXSTGGCGGCCGAGGTACCTCTCCT^ 

TCCTGGAAAACCCCCTGGAGGAAGCGCTCCTGTTGCAGTCGGAGTGAACACCCGTCTTGT 

^A*CCACCAGCAGG<XjGATTCCTTTCTGGAGAGTCCATGTAGTGATCATCTC 

TC T GCATTTTCCCCCAGAAAGGCGAGCATG TT AC TTG TCATCTTGGGATC CGAATGACAA 



GCTCTTCACCCCTCTGAAGGGGGCTCCTCANGGGAAGGTACC 



TCCCCGCGGTGGCGGCCGCTGTGAAACAATGCTCATAGCTCTTGAAACGACAGCGATGTT 

tcx;gtaacggcatcttagcacgaaaaagctccacggtctcattccacagcc TGGTAGC TC 

GGTACC 
T 

Sequence 141 

TNCCGCGGTGGCGGCCGAGCCCATTCTTGATTTCTTTCNTCCCAAACTCTTTAAACTC 
TT 

GAC CTC TGCAA TTC AAG TTG TGAACATGAAACTTGTC TAT CA CCAGC C CC TT C T C TGCAT 

TCTCTTTCCCCCCTTGTTATGCTAAAACTTGGATGGCCTCTGAAGATACTGCTCTTCA 

CC 
CCTCTGAAGGGGGCXCCTCAGGGGAAC6TA0CT



NGGTTGCGCTCACTGCCCGNTTTTTCCAAGTCAGGGAAAACCTTNGCNGGCCCN>rnrTNG 
TTTTAANANAANN7<WGCCNNCCCCN^^ 

CCNNTTTTTCCC TTTTCCNNNAAAAAAAAAAAAWCT<><NGGCCCCCNGGN WTTTTTGGGG 



NNGACCTAACCTNACATTTAAATHGCGGTGGCGGCTTA, 
GGGAAAAACCCTWrrCCNNGCCCAAWCTTTGTWTi^^WGAAA'n 
GGGGWGAAGGGNGGGTTTTTNGCNATTAT1 



GAGCTCCCCGCGOTGGCGGCCGTTGCCCTTACATCTCTCATTTGGAACOTOACACGGTAT 
TAAATAACGGCATATGAAAGCTT AAAAGTCATCAAATACAATCACTGGGTACTTTCGATT 
ACCCAAAOCAGGCACTTTCCTAAACTCCCCACT7CTTTAI 
T 

TATTCCCCCGCGTACCTGCCC 



AC TCCCCOCGGTGGCGGCCGAGGTACCGAGCT CCNGGCTGTGGAATGAGACCGTGGAGC T 

TTITCOTGCTAAGATGCCGTTACGGAAACATCGCTGTCGTTTCAA<MGCTATGAGCATTG 

TTTCACA 



CTCCOCGCGGTGOOGGCCGTT ATGCTTAGCCWGTTTATTCTTTAT I f 1 TTTACTGGAQ 
TC 

ATTGCCAGTGATGGAAACGGTGTTTGt: 11 C 1 C I f I C AGTCAAGATCTGCACAAAGTATAG 
CATTAGGTGGTATTTATTtnTTATATTATGAGTTCTACATTCATCTTTCCAGCACTCTGA 
AGTTATCAGCAAGTTCTCAGTCAGTTCAAGGCATTGGATTCTGCTTGA I ITCTTTTTA A 

T ■ ■ - 

TCATTGTTT I7GACCCCT I TGAGAGTTTTAATAGAGAGGAGTCTGQAAGGCAGAGATCTC 
CAOCACCTAACCGTGAGAAATTTGGAACTAAGGACTTGCACTGGTCCCCAAGTTAACAGG 
GQATATACTTCCTGCATTTTCTCTGKTC I T T C Tt UCC 
Sequence 147 

TGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAAGGTGGGC AT H H 11 AAAAA 

ACCCATGGAAATAAATGCTACTTCTTGTTAGT G 1 1 b M I GAAAATAAACAAAGAAAATGC 

AAACAAAACAAAAACCATCGTCCATTCAAGCTCAAGAGTATTTAACCAATGCTCT6TTGC 

CTCTTAAAGGATtGGTAGCTATTTCCCCATCTACAAATACATGACAATTAACTAAGCCCA 

ATTCTTTAAAACT ATCTGGAATTAGGTCAAAATTATCTAA1 1 1 1 1 1 1 L I G ATTTAATT AT 

GOATTaCCGTAATCXAATAGTTGGCAACATTATAAAACCCTAACTTTACCTCATTGGTT 

T 

GGCTATACCAAGG7CTCATGGACTCTTGGACATAACCACCATTCTTTCCTNCCAACACCC 
CGGNGTACTTCAGAGT AAAACCCGGGAGCCTTCATGAT AACCA TGAAGGC CCGGAAGC TT 
CTGGCTTCCAAGGCTTTCTNTNGGCCTNACCTTCCGGTGGTTCCTTTTCT 
Sequence 148

GGGTGGCGGCCGAGGTAGCTNTGTGCGCGGTGGKCGAAAAAGCACCTGGGTCGGGTGCAG 
ACTGCGGAGCNGGGCCCTACCGTGTGCGCAGAAAGAGGAGGCGCTGGACTTATCCTACCT 
TAAGTTGAAGCAGACCAGCAATTGTTGTGACCTACAATCTCCACACCCATCTTTACTCTG 
AGCCAAGGAAGTGTCTGTTCTTGTGCTGAGTTTNAGGGGCCTTCAGCTN GN GGGAAATCC 
CNAAGA 

Sequence 149



agctccccgcggtggcggccgaggtaccttcccctgaggagcccccttcagaggggtgaa 
gagcagtatcttcagagcccatccaagttttagcataacaaggagggaaagagaatgcag 
agaagaggctggtgatagacaagtttcatgttcacaacttgaattgcagaggtcaa 
ttaaagagtttgggatggaaagaaatcaagaattgggct 

Sequence 150



ccgtaacggcatcttagcacgaaaaagctcgacggtctgattccacagoctggtagctcg 
gtacctcggccgctctagaactagt 



tttctgtcccctc tgagccatggaagatactggagtt aagaaa aattt tataaactaaag 
aaagc a actttataatctaaaagaaagcaactttccc i dc us tcl 1 1 rgaattcttattc 

CTGAAAGAATGGATAATGAATCAGGAGATGAGCAAAAAi^TATCTTTTA£lAAAGCTCTAG 

TCTTCCAAAAGCCTCTAAACTCAAACGAAACCTTTTTAAAGTAGTTTTGTA 

A 

GGTATGGCATTTCCAGAAAGTTGCAGATGAGCACCATTGGGGATTACCCAAATTCTGTCA 

cacattgaggaatgaaattcagggaattgggacaatgagctcttgggcatatgaaagaa 

GCCCTTCCTTCCTTTCT 
Sequence 152

t>iccgcggtggcggccgaggtacncctaaaaaagtactgcagcagagaagaaaacattgg 
acaaagaagaaaggcgacagaaggctagagagaggcagcagaaat^ 
c ttcacgacagaaaggctttatggaaactgcaatggatgttgattc tcctgagaat gat a 
ttcctatggagatcaccacggcagaaccagaggtttccgaggcagtatatgactgtgtta 
atcgacctactggattagttgtacctgccc 




itataaaggaagagtaaagatggagctcctcctgcctccatcacgaaagc 
acatatcatctgtccctttggattttacttccaggacgcgtgtcgtcccc^ 

TT 

G<XTTATGGTGCCGGCAGAGCCTCAGCTATCTGCCTGGGAAGTtX^^TGTCCTTGGAGAG 
aatttggaatg CAGA t AATTTTTCTT ATTTCTTGAGAGC TT AC tttaatcagcatgaca 

c 

tacctaaacactgaagatggccttatattagtaagatttc<iacaaaattaagtatacct 

A 

tgcaaactattactttggtttttaggagtttgatcagatgaagaagtnatggtatcaca 
attttttgnaag TG rmTTTATT aaaaacc 

C*fCCGCGGTGGCGTNCGGC*X5CCGCGTrrTCTGCGG^ 

TCAATGAGGTCGTCXKJCATCTTCGAGACCGATGGACAGGCGGATCGTGCCCTGGCTrG^ 

CCTGCGCCCGCCAGCGCTTCGTCGCTCATGCGGAAATGCTGTGGTG^TGGCCGGGTGGAT 

CACCAGGCTGCGGCAATCGCCCACGTTGGCCAGGTGGCTGAAGACCTTGAGGGTTTCAAT 

GAACTTCTTGCCCTGCTCGCGGTTGCCCTTGAGGTCAAAGCT 

Sequence 155

C GCGG TGGCGGCGCCCGGGCNGGTT ATAAAAACGAACA TG T A T AAACG C TT ACGC AAACC 
C TTTTT AATGTTC TGAAG TCAG TC TTT G TAAG TGAAATCGC TGGAGACT AGAAAGT ATG 
A 

AATGGCAGTCTACCTGGGCAACCTACAAAAAATTTAGCTTGAAAAGACTTCAGTCTCCGC 
TCCCCTGTTGATCIX^TGGAGTGGGGAATGGGAATTGAACCAGAACTGGAAAA
GGAAAGTTTGTTAACTACTC ITTGTI GATCrCATGGAGTGGGGAATGGGAATTGAACCAG 
AACTGGAAAATTATTTGGGAAAGTTTATTAAC 
Sequence 156 

CTGGCGGCCCCCCGNNCTGGTNCTTNCATCTW^^ 

ACGGCCATGAAATGCAATCCAGCTGAAGT ATTATCATCTTGTAGCATTTCAAAAGGAACC 
GTCGAAGTCATCCAAAGGATGGGAACGACAAT G 1 1 L TT G M GTTCCTTGGGTrTCTTA 
AT 

GATTTCTGAATCATCATTATTAATTATGGAATTCTCT^ 

TCTCCT CACTTTCTCACATC TTTTT TCT TGCAGCTCTTTCTCAG CTC T TC T T CCTTGCCT 
TTTTTT ACTGGCCTTTCCTTGTCTTACTTCAG^ 

TGAAGGGTGGTNCAAGCATCAG C T TGGTTCWAATAAAATTAATGGTGTTAI 
GGCCTTWGTTTAAAACGCAAAn 

Us?, : 




CCGCTTTTiACCCAGGCTGGGGGTGCACAGGATCT 

AAGCCCCATGGGTCCACCAGGATGAGCTGTTTAACTNTATCAGGG*^^ 

AGAA 

Sequence 159 

<XCCGCGCTGGaX3C«3CCCGGGC^GGTA^ 

TTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGA 
AGAGGCTTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTT ACATCATTT ACA 
CCGCCGACCATGGTTACCATATTGGGCAGTTTGGACrrG 

ATGACTTTGAT ATTCGTGTGCC I 1 1 1 1 1 1 ATTCGTGGTCCAAGTGTAGAACCAGGATCA 

T AGTCCCACAGATCGTTCTCAACATTGACTTGGGCCCCACGATCC^ GGGC 
TCGACACACCTCCTGATGT GGACGGCAAGTCT G TCCTCAAACTTCTGGACCGAGAAAAGC 
CAGGTAACAGGTTTCGAACAAACAAGAAGGCC 

«TACACAGGACCAA TGCTGCCCATCCACATGGAATTTACAAA 
CATTCTAGAGCGCAAAAGGCTCGAGACTTTGATGTCAGTGGATGATTCT 
GTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTAOVTCATTTA 
CCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGGAAATG 
TGATATTCGTGTGGU 1 1 1 1 1 1 1 ATTCGTGGTCCAAGTGTAGAACCAGGATCAATAGTC 
CC 

^CAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCTGGATATTGCT GGGCTCGAGAC 



caggtttcgaacaaacaagaaggccaaaa 

Sequence 161

CGAGGT ACCATCCTATTAAT ACTAACTTCTGCTTCTACAT ACT G TAGACC I I rC TGGAT 
G 

atag aaatcaatgcagcgggtgggacgagggcaccatttatattggactgactgatatgg 
ctttctataccaaaggtaaatgctgaatgagaaaatcctgactcttgcaagtatctata 

T 

accaagaagttgacctcatcactgcttatactcatctttatt^^ 



tcccaacacaggatataacccattgggcagtgcattt^tgtgg^ 
atnckxsgtc^acgcgccaatcacaagtttgctggtgttgatgctggaaac 

ACGCCGCTCCCCCCTCCCGGGAA 
Sequence 162 

GGCGGCCGAGGTACGTGGCCTGCTGGCATAGTTCTTTGACCCGTTCATATT^ 
GArTTGACTGTTGGATATTC 1 1 GC I GGATTCC TGCTTCTT ACG T AGAAATTTGCCTCTT 
T 



G4JI 1 1 1 CTGGCTCCAGAAGTTTGAGGACAGACTTGCCGTCG^ 

GCGCAGCAATATCCAGGiATCGTGGGGGCCAAGTCAATGTTGAGAACGATCTGTGGGACTA 
TTGATCGTGGTTCT ACACTTGGACCACGAATAAAAAAAGGCACACGAATATCAAAGTCAT 
ATGGCATGGAI I ICUXil I GACCAGTCCAAACTTGCCCAATATGGTAACGATGGTCGGGGG 
TGTA 

Sequence 163

GGGGCCWCGCGTCCGGGTGGCTCTATGTAGTTCTAATTTGCA 1 1 nCTCl AATGACTAACG 
ATGTTAAACATATTTTTATGTACTTGTTTCATGT ACTT GTTGATATGTCT ATTCAATTCC 

trtcaccatttttatggag c i gi 1 1 i i i i a ttattgagttgtagg a 1 1 ii i i i a tatatg 
ctgcat accaggccttt gttatatacatgc tttgcaatgtaca ttgtcttaaaatctgt 

G 

GCTTGCCTGTTCAATTCATTAGTGGT G 1 T 1 1 GTTAAOCAG TTTTT AATTTTGATCiAAGT 
G 

T AACTT ATTCATTTTrTATTATGGTTATTGCTTT ATGTTTCAGGTC 
CTCACAAATCACAAACATTATCCTATGTTTTCCTTCAAAAATTATATGG 
TTCAATCTCAAAATATTCTCTAA 1 1 1 1 1 1 1 GCTGATTTATTTCTAAAGAAATTTGAGGGA 
TTTGCTATAATGG 
Sequence 164

CCCCGCCGTGGCGGCCGCCCXXXSGCAGGTTATTTAAt I IU1 i AGIG1CICAATTTCCTCC 

TCTATAAAACAGAGATAATAGTATTTAGGOCAGAGGGTTGTGGTGAAGTGTGAATCATTT 

CTCCATGTAAAACACATAGGACAGGCTGGGCATGGTGGTGGGCACXn^GTAATCCCAGTTA 

CTTGAGAGGCTGAGACAGGAGAATCGCTTGAACGCGGGAGACGGAGGTTGCAGTGAGCGG 

AGATAGTGCCACTGCACTOIAGCCTGAGTGACAAGAGTGAGAG 

AAAAAAAAAAAAAAGTACCT 

Sequence 165 

NCCTGGCATCACCHATTAGMAATGAACCTGTT AATGCAAGGTCTTT AGAAAAACTTGAAA 
TTATTCCTGCAAGCCAAI 1 1 1 G I CCACGTGTTGAGATGATTGCT ACAATGAAAAAGAAGG 
GTGAGGAAAGAAGATGTCTGAATCCAAGAATCCGAAGGGCCG TCAAGAAATTTTAOCTGA 

aaggcaggttaggcaagggaaaaggggtctaaaaagatctggcttaaaaaccaggagggg 

GGAAGCCAAAAATCCGATGCCAA GTGC rrrTCCCAAA 
GGGCCTGGCCCTTCTTCCCMCACTTTCCCCTTACC^ 

GGCCA TTAAATTTGGTTT CTTT AAGTTTTTGGCAGGTTT ACCGCCTT AAAAAAGGGTG 
GA 

CCCAAATGGATTGGGTCCAOCCAAAATCWAGGCTTGCTTACTT ACTTCCCTGGTAAGGGA 
A 

Sequence 166 

gtggcgncccticggncaggtacttgctcagcct^^ 

^tcagcaggaccaaggtgtgaagtgggaatgaacatggatco^tcocattggatggaga 
agaaaggt ggacagcctgttcg tct CTGATG TCAGCCT AGGGCTGCX5AACAG tttg tgag 
gacttatctgttgtacct 
Sequence 167

^CNGGCCGCCCGGGCAOGTACGCGGGAATGGGCACNNT GNAGCGCAAGT AGGTC T ACAAG 
ACGCT ACTTCCCCTATCATAGAAGAGCTT ATCACCTTTCATGATCACGCCCTNGGGNATC 
^^^^CCTTATCTGCTTCCTAGTCCTGGTATGCCCTTTTCCTNAACCACTCACAAACCA 

AAACTTAACTAAATAACTTAACAATCCTNAGAACGCCTCAAGGNAAAI^AAGAAAACCCG 
TCNTGAAACTT ATTCCTGCCCCGCCCATCATCCCTT AGNTCCCTCAATTCTCGNCCC T 

CCAAN CCCCT ACCGCCAATCCCTTTTTACAATAAAACAGGACCGAAGGGTCCAAA^ 

tcccctccccntt acccattcaaaaaatcaaaattmggcccacccaaa7tggannacctt 
gaaacccctaaccgaagttacccttcgggcccgcttcttaagaaactaaggmgggaatcc 
cccccnggggcctggnaanggaaatttcggataatcaaagccttaattccgaatancccg 
gtccgaacccttc " 

CTTTTTA 
A 



ATNTTCAGGAGACGC TCNGT AGCCCTCGCGCTNTATCCTNCGG NACAGTT CTGCGGAAGA 
AGTGC^Tt^CGCCTTCCAGAGCCACATCATCGCGGNCGAAAGNGAAGCCCAGAGAGAGGT 
AGGT GTAGGAGGCCTGCAGGTACCTCGGCCGCTCT AAGAACAANGNGGATCCCCCCGGGC 
TGCAAGGGAArrCCCT TAHC AAAGCANTANTNA^ACCCGTCCGWCC^ 
CCCCGNTACCCNAANCTTTTGN NNCCCNTNATAGAGAAGGG N GAAAAAATNANGCC C NC C 
TNGGGGCAGNAAAAAAATGGGGACAATAAAGCTNTTNW^NGGGGGNTNAAAA^ 
^^^TCCCCCNACCANNAATTTTCNCNAAACAAAAAAATAAAAAANCNCCGflGGANNGAN 
AAAAAAAAN NGGNAT AAAACACCCC NGGGGGN GGGTCCCCNCAAAGNGGGGGGGGGACC N 
CCNCCCfiAACAATTAATGTGGGGNGGGNGGA^hW^TNGCCCTN fsTTTTTT NT AN NG N G 
ANNAAAAANNCTTGGNGCWGNCCCNWACTrCTAMNTAAAAAAAAANACCCCCCCNCCCCN 
CCCGGGGGMNAGNGNGG##4NGNTTNACTTTANNkGGGCNANNTT^ 
AAAAAAAAAAATACNNGGCACN NGGGAAATTTNGGGGGGGGGGG 



■nTTGAACCCCNCTTNCCGCGGNGGCGGCCGCCC^^ 

TGTATTCTGTATTATAATTGTATATTTGATTGCCTATCTCCCCTCAACTGCATTATACAT 
TTTCAT GGGTGAGCCAATGTCTTTTTCACTC TATTTCAG TGCC C TGCACATTTTCTGGC 

^^^GTAAGCATDCCATGAGTATCTGATGAATAAATGTATTTCCAAATTCAGGTTCAGT 

TCC TTAATCTGAAAAT ACAAAATCCGAAAT GCCATAAAATTCAAAGCTTTTTGAGGAC TG 
ACCTCGTGC T GAAAGGAAATGCTCATTGGAGCATT7TGGACTTCAG>TTTTCAGATT AGG 
GATATTCAACCCGTAAGAATAGTGCCAATATTCCAAAATTCAAAAAAGTCTGAAATCCAA 
AACAC TT CTGG T CCCAG GT ATTTTGGATAAGGGAT ACTCAACC TGTACCGT AAAATACAT 
GCAT AC TTTCGAT AGCACATG TGAAGGTATCTC TCT AAAATTG ACC TCATTG GTTTC GT 

CTCAAGCAAACTGACCTGGGGCCACTGAACATGGCTTTTATCGNGCCTGATGTTAATGCA 
TGTCTCI 1 1 1 IACAATA 



AAGTCTACATTTTATGTAG7GGTTAATGTTTGCTGTTCATTAGGATGGTTTCACAGTTA 

C ATACAAATGTAG AAGCAAC A GGTCCAAAAAG T AGG GC ATGATTTT C TC C ATG T AA TCCA 
G G GA GAAAAC AAG CCA TGAC C A TT GTTGGTT GGGAGAC T GAAGG TGATTG AAGGTTC ACC 
ATCATCCTCACCAACTTTTGGGCCATAATTCACCCAACCCTTTGGTGGAGCCTGAAAAAA 
ATC TGGGCAGAATGT AGGAC TTC TTTATTTTG TTTAAAGGGGT AAC ACAGAG TGC CC TT A 
TGAAGGAGTTGG AGATCCTG C AAGGAAGAGAAGG AG TGAAG GAG AG ATCAAGAGAGAG AA 



AC AA TGAGG AACATTTCATTTGACCCAACA TCC TTTAGGAG CAT AAAT G TTGACAC TAAG 
TTAT(XXTTTTGTGCTAAAATGGACAGTATTGGCAAAATGATCCACAACTTCTT ATTCT 

c 

TGGCTCTATATTGCTTTGGAAACACTT 
Sequence 171 

GGC GGCCGCCGGAGCGGCGCGGAGCA tga t GGAAG TC G T AG T AGGAAATGGCG tcgt GGC 
ATTGAGGGGGCATCCCTCCTAGAACCTCCAGGAAAAGCTaSCGGAAGAC^ 
GAGAGAGAGGCTCCAAGCAGTCTGGGAAGTGTAGTCCAGTTGGCTTAGCAGTAGT^ 
TGGGGGGGAGCCCGAGGTTCCGGGAAGGGGCTAGGCCGGCT^ 



AAAAAAAATCTCCACAAGAAGAAGCAACTCANCAGGCCC^^ 

AAT AAACAGA T ATGCATTGCATT AAAG G T AATTTTCAAAT ATTT AAGTT ACACCAAGATT 
TCCCTCCAATATGTGCCTTTCTCAAACCAATG CAAC TAATTCATTGC T AAT ACTGGGG 
CA 

TGAAI 1 1 If I GGCAAATGTTTATGGTTTTACTTTCTTC^TTAATCAAAAAANT 
Sequence 172 

CGGGTACANATTTAAGGT AGATGGACT CAGGGTAAGGATAGC TACAGCTGTGTGGGGCT G 

AAGGTCTGTGGCACTGAGCTACTGGGGAAGGAGGGCTCTGTTTTC^TNGTGAC^ 

GTTAATAAAGCACTTACTGAGGGAGCCAGAGCCCAAACTCTAAATGTGCTGTAGAAAAAG 

GGCCAAGTCATTtSACTGCACCACTCCTTCAGCCAGAGGTAGAAAGGATTTACTCTTCAGC 

CATCTGGTAGAGCCCCAAGAACAAGTTACATGTGGACAAAGGGAGGGAGAGGTATCATGG 

"' 1 GATTAATAAATNCAAACAAAGCTGAATGATAAGNACCCCAGGATGGAATACAGTCTGAG 

AAAGGCCTGGGCAAAG 

Sequence 173 

GGG<KX;GGGCCCCCGTAGGGGTTA(XCNCCGNGGGrrATTAAGGGGTT 
AAACCACCTGGCNCAhrTTTCCAACXCAAANGGTNC^^ 

C CCAG GCC TTGGGGAAAAGTTGTTTGGGG ^WUJCCCACCAAACGAATTGG NCTT GGTN GG 

GGAGGCCAACCC^ACCAATGGNCCTTGTTGI^KJTAAGAAAT 

CTTGNAAAGGGTATTTGGGTGGTTNCGTTAAWTTTGCGGAAAAAGGAAAT^ 

G TT ATTT GTTAAGAAAGC C AAAGGGTTTTGGAAAAAAATGGGGAATTTGGGAAGAACCTG 

GCCAATTGGGGTTGGGGCCCATTAAAANAATTGGGGAAGGNAAAAAATTTTGG^ 

GGTNAAGNCCANTCCTTAAGGTTCCTTAACCTTT^^ 

AGGNAACCCANTTAAAAGGGGGNANGGGANGGACCCAAAAAAAAAAACCCAGGGGGGTNT 

TTTGGTT>KSGNCCCCCAATTAAAAAAAAGGGTTAA rTTTTTTT l TT TTTCCAAAAAAAG 

G 

GAACC CANCC CC CCAAAA AGGG AAATT GGGTTGGGGGGTTNAAAAAAAATTGGGGAAAAA 
AAAAAAl 1 1 1 IAAN 1 1 1 I r AAGGGTTTTTCCAAACCTTTTTTCCCCCCTTGGCCTTGGG 
C 

CCCAANTTGGGAAAAAAANCC 7 T I I I \ J GGGCCC C NTTTTT AAAAAG G N AAAAAG GGGGG 
T H GG GCCC TTGGGGG NAANTTTTTN CCCCCAAAAAGG GGGG I I HIII IGGGTTNAAAAA 
AAGGGGGGN C CAAN TTTC N TTTC CGGG GG TTT AAAAAAAGGGAACCCTTGGG C I U III I 
TT 

Sequence 174 

GGCGAGCGGCCGCCGGGCAGGTACCCTAGGGTGTTGTTTAAAGGACTTGATAACCAGCTT 
GAAGAGGTTCCTACTGACCAGAAATGGAATGAAATTTAAGCATCAATAAGGGTAATAACT 
GCAAGAGAC TGACATC CACT AT GG TTT AAATCCATGAG G TCA C AATG AT ACTT AATTTT 

CATTATTCTGAAAACCAGTAAATAAAGGCTAAGATTCAACAAGCATTTATCCAGCCTTTC 
C TCAATG AAA T AT ATC N T AAG AGAAC CGAAT AG TT AACAT AGA GAC AT GGCC GGGCAAGG 
TGGCTCTCGCCTGTAATCCCAACACTTTGGGAGGCCCGAGGTGGGAAGATTGCTTGAGCC 
CAAGAGTTCTAGACCAGNCTGGACAACATGGTGAAACCCTGTGCCTAI
AACAAAAAAAAAGGTCCCC 
Sequence 175 

CAGGACCAAAACCTGGGGATTAAGCT AAGAAGTCTGGTGGAGAGACTCTGTGGACGTAAA 
GAAGGGAATGAACACAGAGAAACTTTCAGCCV^GATTC^ 

AGTCAAACT GGAGTGAAACC^TGCAAATGCAGCXSTGTGTGGGAAAGTCTTCCTCCCGTCA 
TTCATTCCTGGACAGGCACATGAGAGCTrCATGCT^ 




TTCCCCGGCCATCTTAI 
AGATGGC^SGCCGGAfiAGAAGACNAAAAAGTCW^TCGGAGTCGATCAACTCTAG^ 
CTCGNNATGAAAAGTGGGAAGTNCCT 



CCCCGCGGTGGCGGCCGAGGTAC i 1 1 H I I I H I I I 1 1 H t ATGAATMATTNATTTTCT 
T 

T^CAGAAAAGGATGCGCCTCCACTTAGCAAGGCT GGGCAGGATGTGGTCTCTGCATCTC 
CCCACAGACAGGGGTGGTTCTAGA 

iTACCAAACCATTTTCACTAGTTCAGGATAGGAATATTCA 
^^GATTGTCTCTGTAAAAGTGAATCACAAAAATTCCACCTGTGTAGGTGTGGGACTGGA 

TCAGAA<nTAAAAATCTACTCTGAGTTATATGTGCATCAATITTAGACATATTGCTGAT 
T 

TTATTATGAAAATGAAGTGCT Aj 



GGTACTC^ACAGTCACGCAAATTCACAGTCTGCGTGCACGGCT C TCCATT CTTCTTC TTGG 
CTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATT^^ 

T AGATAAACACAC CTCC TCTGAACCATCCTTGGGC TTCATGGGG TTGGCATTGAGGA TCC 
CTACGACAGTCCCCTGCTCCGTCTrcCAGAGCGCTTrGTGAACTTCTCCAAATAAGAAGA 
AGGACACACATTGTGTCAGG TCACGAAGATCATTCAG TTTCCAT ATGC TGAAGGTTTTT C 
CACTATTCACAC TC TGTGGCGT AAC CTTC TTCAAT AT AACCCCAAATGTCACCCAAT CT 

TTTCTTCCAGCTTCTCTCTGGCCA I C I 111 C 1 1 GATCTGAGACAGTCTGATCAGTTTTC 
G 

GCCGCTCTAGAACTAG 
Sequence 180

GGCGGCCGAAAACTGATCAGACTGTCTCAGATCAAGGAAAAGATGGCCAGAGAGAAGCTG 

GAAGAAATACGATTGGGTGACATTTGGGGTTATATTGAAGAAGGTTACGCCACAGAGTGT 

GAATAGTGGAAAAACCTTCAGCATATGGAAACTGAATGATCTTCGTGACCTGACACAATG 

TGTGTCCTTGTTCTTATTTGGAGAAGTTCACAAAGGGCTCTGGAAGACGGAGC^^GGGAC 

TGTCGTAGGGATCCTCAATGCCAACCCCATGAAGCCCAAGGATGGTTCAGAGGAGGTGTG 

TTT ATCT ATO^TCATCCTCAGAAGGTCTTAATTATGGGTGAAGCTCTTGACCTGGGAAC 

CTGTAAAGCCAAGAAGAAGAAT 

Sequence 181

GTGGCGGCCGAGGT AC TACAGTCACX^TCCTCTGAACCATCCTTGGGC TTCATGGGG TTG 
GCA TTGAGGATCCC T AC GACAGTCCC C TGC T CCGTCTTCCAGAGCGC TTT G TGAAC TT CT 



CCAAATAAGAACAAGGACACAC^TTGTGTC^GGTCACGAAGATCATTCAGTTTCCATATG 
C TGAAGGTTTTTCCAC T ATTCACACTCTG T GGC GTAAC C TTC TT CAATA T AACCC CAAA 

GTCACCCAATCTATTTCTTCCAGCTTCTCTCTGGCCATCTTTT 
Sequence 182 

GCGGCCGAGGTACATGGATACGTTCTCTTCTGGGGGCGGTCTCCAGTCXTTTCTCATGAG 
GGAGCACACTTXTCTGCCTCATTGCAGTGGCCTCAGGGATATGGAATTAAGATCCACCTG 
G TG TGATGAATAAACCCAGACTC TCAGCAACGCAGGAAAAAAAACAAAAAC TGGCTGGCG 
ATCTGGAGTAAAGGATCCTCACATCCACGTGAACCAGGAA^ 
AAAAAAAAACACTGGGAGAGCCGAACTAAAAGTCTTTTAGCACGGGTACCTGCCCG 
Sequence 183 

TCCCGCGGTGGCGGCCCGAGGTACGCGGGGAGCGGAAAGGGAGACTGTGGGGAACTAGGA 
GCAACAGCAGGCATGGACCAAAGCAGTGAAGGATGTATGAAAAAGATTAGCAGTGTGAAT 

cttgacaaacttataaatgacttctcacagatagaaaagaaaatggtagaaaccaatgga 
aaxsaacaat atactggatattcagttg gaaaaaagt aattgcc t att aaaagtaatgcaa 
gcaaaggaggtctccattaaagaagaatgtgct actcttcataatataataaaagggcta 
caacagaccattgaatatcaacagaatttgaaaggtgaaaatgaacaactaaaa^taagt 
gctgatcttataaaagagaagttaaagtctcatgaacagg^tataagaataatattgcc 

AAAC TTGTAAGTGAAATGAAAATCAAAGAGGAGGGATATAAGAAAGAAAT AAGCCAACTT 
TATCAGGGACATGGAGAGAAAAGTTGAATTAAATGAAGAAAAGCCCAAAGAACTTATANA 
GAAAAAGGNGATGGGAAffTTCANAGGTTAATGCCAAGCTTAGAAGTCAAAAAAAAAAAAA 
AAT 



ccgcggtggcggccgaggtacatggatacgttctcttctgggggcggtctccagtccttt 
ctcatgagggagcacactcctctgcctcattgcagtggcct<^gggatatggaattaaga 
tccacct ggtgtgatgaataaacccagactctcagc^acg^ggaaaaaaa 
ggctgc^gatctggagtaaaggatcctcacatccacgtgaaccaggaaactctgtgccca 
^ tcg acgaaaaaaaaacactgggagagccxiaactaaaagtcttrragcacgggtacctg 

CCCG 

Sequence 185 

CCGG^CGCCCGGCAGGTACGCGGGGGTGTCCGGCGATGGGCACGGGCATTTCTTCGTTTA 
TAGCTGTCTGTTTGCATTCTGATTGGGAACACTGGGATCATTTTCATCATGCCGACAGTG 
GTGGTAATGGATGTATC^CTTTCCATGACCCGACCTGTGTCTATTGAGGGGTCCGAGGAA 
T AC CAGC GAAGCAC T AAGTAAT ATGGAT GA TT ATGACAAAA C CTGC TT GGAG TC TGCATT 
AGTTGGTGTTTGCAATATCGTTCAGCAAGAATGGGGTGGTGCAATTCTTGCCAGGTTGTC 
CTGGTGACAGACGGNTGTCTGGCATTGNNAGAGGGCCACTGGGACATTCNhTTANCCANTC 
AAAATTAACNAAAGTGNGAGCACNN 
AT 

AT ACCANGG N NGGGCGAAT1 
CWGTTTGTCmTTCOICTTTGGGGNGNGGCCATTTTTATNTTGGGGGGNCCC 
AKAARAAACCCCCOKXCCTTTAKAAAANNGGNCCCCCCCCCGNGNGGG^^ 
AAAAAATTTTNCCCCCCCCCCCCCCGGG 
Sequence 186 

TCC CG C GGTGGC GGCCGAGGT ACTCACACG T CACCGCAAATTCACAGTC TG CG T GGAC G G 

CTCTCCATTCTTCTTCTTGGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTA 

A 

GACCTTCTGAGGATGATCGAT AGATAAACACACCTCCTCTGAACCATCCTTGGGC TTCAT 
GGGGTTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTCCAGAGCGC TTTG TG 
AAC TTCTCCAAA T AAGAACAAGGACACACA TT G T GTCAGG TCACGAAGATCATTCAGTTT 
CCATATGCTGAAGGTTmCCACTATTCACACTCTGTGGCGTAACCTTCTTCAATATAA 
CCCAAATGTCXCCCAATCTAT1



ATU 1 1 1 ICTTGATCTG 

:tagaactaggtggatccccc 




tatcattacagcacaaaagcatactcatggaatatttcccgtaaatctgcagaatcgcta 
cacagacttagtggccatccagaataaaaaTgaaattgattacctcaataaggtcctacc 
ctact acagctcctact actggattgggatccgaaagaacaataagacatx3gacatgggt 

GGGAACCAAAAAGGCTCTCACCAACGAGGCTGAGAACTGGGCTT^TAATGAA^ 



TTTTGTAACTACArGGTCTCAGATGCATCACAAAAGCAGAAGTGCCCTTTCAGC I C > IC I C 
TGTGCCATT CC 1 1 CI CAATTTCATGCTGCCT ACAGCAACAGCAT AATACTGCAAACACCC 
ATGATGTCACTCGAAGTGCTCTGTGATTGACAGAGAGGGACAGTCGTAGTCAGAGGTGGC 
TCCTCAGAGAATTCAGAACTCACTCGCTGTCXTCCAGGGGCTCATCCCTTGATTTG 
AGGGATGAAATATTCTCTGCATGAGAGAGCAGGGATGGGAAGTGATATAGGTATGTAAGG 

atggtgaagttactctaaatgtagttagacaggacagccagaatacccgaggtcttggtt 

AGGTCCTCTGTAACAAGCCGTAGAGGCCCAGAAATGTGGTGACAGCGAGACACATTTCTT 
AACTCTTACACTTGTTGAAATGAGTAGAAGGNGACATTTGGTTTCGAAATCCCTCCCC 

iCAGGTACGCGGGGAAGGAAAGCAGCTGCAAACTTCCCA 
ATCACTGCCACGATTACCCCTGGATGAAT 
TCCTCAGTGGAAATATCAACAAGACTCAGCCCACCTGCACCCAGGTGATTAAAAAGCTTT 
ATTGCTCACACAAAGCCTGTTTGGTGGTCTCTTCACATGG^GCGCGCGACATTTGGTGC 
CCTGACTTGGATCAGGGGAOCTCCCTTGGGAGATCAATCCCCTGTCCTCCTGCTCTTTGC 

TGACCAATTTTTAAT CAGGAAT ATTC TGTGAAAAAGACTAAGAT ATCAAGAGAAATT AT 
T 

AGTGCACATTATTAGAAGAGAGCTTCAGATGAAAATAAAGATCAAGAAAAAGACTCTTGC 
TTTGAGAAAGACACAAAGAAATCACATCATTCTTATTGGGATTACTGGGCTAGCCATATG 
CCAGAAAAATGAAAC TGG TCCC TTC TT ACACCAT ATACCAAAAGCNGCCCANGATGGWTT 
ACTTNAATGTNAAANCCAAAACT 

GTACCATCGCCGTCCCATTGCTCACAGGGACTGGGAAGGCGATGCC 
TGGCGGGAGCTGCTGGTGGAGAGACTCGGGATGACTCCTGCTCAGATTCAGGCCTTGCTC 
AGGAAAGGGGAAAAGTTTGGTCGAGGAGTGATAjGCGGGACTCGTTGACATTGGGGAAACT 
TTGCAAT GCCCCGAAGAC TT AAC TCCCGATGAGG TTGTGGAACTAGAAAATCAAGCT GT A 
CCCTGATGCTACAGACGAGGACATCACCTCACACATGGAAAGCGAGGAGTTGAATGGTGC 
ATACAAGGCCATCCCCGTTGCCCAGGACCTGAACGCGCCTTCTGATTGGGACAGCCGTGG 
GAAGGACAG TT A T GAAAC GAGTCAGCTGGATGACCAGAG T GC T GAAACC CACAGCCACAA 
GCAG TCCAG ATT AT AT AAGC GGAAAGC CA TGATGAGAGCAATGAGCATTCC CCATG TGAT 
TGAT AGTCAGGAAC TTTCC 
Sequence 191 

C GCCGGGCAGG T AC TCCCTGGAAAGTCCAGC TGAGAAAGC GA TCCTGCC C TC TGCTCC T C 
CCAGGG TT ACCC TC C TG T AAGTCTTC TGC TT AG TG TTCAGAATTGGGGGATGC TGGGAC T 
G GGCAAGGAC TTG T AGG CAACAC C CCAT AGCC TGC TCATGCCTGTTGGGTTGCCT ATGGA 



TCATTtXCTGCTGGGCTCACTCACCGCCTrCGTATAAGGTCCTTTrrGAGGnTATTA 
TT 

TCCTTGTCCATATAC TTGATGCTC7TCATTGGCTTGTCTGGGACCTGCCTTAGGTTCT 

cc 

GAGGCATAAAAGGGCCGGACAiSCCCCCGAGTTGGGGGAACTC^^ 
GGAACCTTGGTCATCTTAAAAATCCrrCAG^ 

^TTCCAGAATCTTCTACTTCAAGTAGTTACTGGTATGAAG^AGTTTCGGCA 
Sequence 192 

CTCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 1 H 1 1 M 1 1 1 1 H I M I II I III MC 

GGCTTGAAATACAGCTGAAATAACTGAATTTrCTACTTGJWUTO 
GNGGGGCCAAGGCCCTGGAAATGTAAACGGCCAATCT^ 

GTGAAGGGCGGGTTCTGCAAAGACAAACAGGTCTCACAGATAGTTGCCCCCGCGTACCT 
Sequence 193 

>*GGCGGCCGACGTACGCGGGGGGCTT^4AGTAGGCTTCG I C M CGGHTTTTCTCTTCCTTC 
CTCCCGCCGGC 

CGGCCGCAGCGGCAGCTACAACAACCGCGTCGCTCTCCGCTCAATTTCC^GAGCCAGCT 

TTGAAGCCAAGTGCCCCCGCGTACCT 

Sequence 195

GCAAGTGCTCGACGC»TGGCCCGAAAATGCTGCCGAAGA^ 
ATGA CGAG^TGT TGTAAAGTGTGGAA^ 

Sequence 196 

C^^TGGCGGCCGAGGTACTTTXiAGCTCATAAGCTGGTATAAAATATCAAACATTTTGACT 
GTTTAAACAACTCAAGATATGTTTTGCAAAATTACAAAACATTATACAGGTGACTT AATT 
AAT A TC T AC TCCAATT ATAXiACAACACATCATXSCTGAAGATTT AGATTTATTTGAAAACA 
CTTAGTCTAATTrATATTAGTGCAGAAAAATCACATTCAATAAACCACAATTGTAGAAG 
A 

GACAGATAAGTGTGTTTGTCACATTTTCACACAAATATAATTTGATATTTAATTAAGGG 



CCTACCCCGTCGCCTCAT 



TGATGAATCACAATCACCATG 

CGGATCCCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTG 
Sequence 197 

NCGAGGTACCTGCCTNACAGNGCAGGGCGGTATGCCGCCAAACGC^^ 
TGTCCCATTGTGGAGCGCCTCACTAACTCCATGATGATGCA 
Sequence 198 

TTGCTCAGCCTrrCCAGGCCCCTCTGATGAGCTCTCTAATCAGCAGGACCAAGGTGTGAA 
T G TGGGAATGAACATGGATCCATCC CATTGGATGGAGAAGAAAGG TGGACAGC CTG TTCG 
TCTCTCATGTCAGCCTAGGGCTGGGAACAGTTTGTGAGGACTTATCTGTTGTACCT 



ggacttgctcagcctttccaggcgcctctgatgagctctctaatcagcaggaccaaggtg 
tgaagtgggaatgaacatggatccatcccattggatcgagaagaaaggtggacagcctgt 

T CGTCTCTCATGTCAGCCT^GGCTGGGAACAGTTTGTGAGGACTT ATCTGTTGTACC 



gakxsagaaagctggaagaaatagattgggtgac^tttggggttatattgaagaaggt^ 

gccac^gagtgtgaatagtggaaaaaccttcagcatatggaaactgaatgatcttcgtga 

cctc^cacaatgtgtgtccttgttcttatttggagaagttcacaaagcgctctgg^ 
ggagcaggggactgtcgtagggatcctcaatggcaaccccatgaagccc^^
agaggagcgtgactgtgagtacct 

Sequence 201 

gccgaggtactcgggcaaagagggtgacamgttcaagctcaa^ 
gctgctgacccoggagctgcctlagcttcttgx^^ 
n^anctgatgagcaactrggacagcaacagggacaacg^ 
cctgcccgggcggcccgctctagaactagt 

Sequence 202

^GGGGCACAGAGAGGGTTTCAGAGGATCCTTGNGAAACACTAGTT aaaagatgaccgagt 
OGCGAGAAGTGCGAGGAAAGAAGGAAATTAGTCTGACTGG l l 1 fCTGTC CTGCACCATTG 

attcaatggagactgggcgggaggaaatggaagactagggttggagatgggat^^ 

GCAAGGGATGGAAAGGAAAACXK>GACAACTAATGCGTTC^ 

TATCAAAGCACTTTAAAGGAGATTANAAGGACCCAATCAGCAATAM A 1 1 i g gg cc aacct 

tta^ttctttagggaagc^ttcaaaagttccct^^ 
tattnactaaaaaagccaaaagagcaagtttn'gggtaccctgcccc^ 

TCT AAG AACCTAGGTNGGGATCCGCCCGGGGGCC^^ 

C TAOCCC 
Sequence 203



GCGGCCGCCCGGGCAGGTACGaSGGGAAGTCTNTCCrTTCTCG^ 

CGGCTGCTGTTGGTTGGGGGCCGTCCCGCTCCT AAGGCAGGAAjGATGGTGGCCGCAAAGA 

AGACGAAAAAGTCGCTGGAGTCGATCAMCTCTAGGCTGCAACTCGTTATGAAAAGTGGGA 

AGTACC 

T 

Sequence 204 

CTCCGCGCGGTGGCGGCCGAAAACTGATCAGACTGTCTCAGATCAAGG^ 

GAGAGAAGCTC<^AGAAATAGATTGGGTGACATTTGGGGTTATATTGAAGAAGGTTACGC 

CACAGAGTGTGAATAGTGGAAAAACCTTCAGCATATGGAAACTGAAlGAT^^ 

TGACACAATGTGTGTCCTTGTrCTTATTTGGAGAAGTTCACAAAGCGCrCTGGAAGACGG 

AGCAGGGGA CTT^ CGTAGGGATCCTC^TGCC^CCCCATG^GCCCAAGGATGGTTCAG 

AGGAGGTGTGTTTATCTATCGATCATCCTGAGAAGGTCTTAATTATGGGTOAAGCTC^ 

^CCTGGGAAGCTGTAAAGCCAAGAAGAAGAATGGAGAGCCGTGCACGCAGACTGTGAA 



CNCCGCGGTGGCGGCCGAAAACTGATCAGACTGTCTCAG^^ 

G^GAAGCTGGAAGAAATAGATTGGGTGACATTTGGGGTTATATTGAAGAAGGTT aogcca 

cagagtgtgaatagtggaaaaaccttgagcatatggaaactgaatgatcttcgtgacctg 
acacaatgtgt gt ccttgttcttatttggagaagttcacaaagcgctctggaagacggag 
caggggactctcgtagggatcctcaatgccaaccccatgaagcxx^aggatg^ 
gaggtgtgtitatctatcgatcatcctcagaaggtcttaattatgggtgaagctctt gac 

CTGGGAACCTGTAAAGCCAAGAAGAAGAATGGAGAGCGGTGCACGCAGACn'GTGAATTTG 

cgtgactgtgagtacct 



tcwccgcggtggcggccgaggtactcacagtcacgctcxttctga^ 

ATGGGCTTGGCATTGAGGATCCCT ACGACAGTCCCCTGCTCCGTCTTCCAGAG^ G 

TGAACTTCTCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCAT^ 

TTCCATATGCTGAAGGTTTTTCCACTATrCACACTCTGTGGC^ 

accccaaatgtcacccaatctatttcttcxaxsct^ctctctggcca 1 c f m C C 11 GAT 



tgagacagtctgatcagtttt 

Sequence 207
TCCCGCGGTt 

TAATGTGTI ictttttm 



ATGNGGC i fCNTC 1 1 CTG 

ctgaatagggtaagtgtaaaggcacaacaaat 



aacactgtatcagatctcattcgttccaaaaacgtttgagtcctag 1 1 1 1 1 1 i c tgtca 

TCTGATCAACTACCCAATGI rTGTTTTGTJ I ATTTTATAATTGGGAAGGTTCTCCAAGG 
C 

CTACCACT AAC TTTAACGAATGA TATAGATAGAGCTCAGAGCAATCTT C TCACGATCATG 

AAGTCATGTATAAAAATCAGGATTAAAACAA A GGTCATCTTjATCTCCAATCAT^ 

AAGGAAAGTCAATTATATTANGAAATGGTTAAGAGCTTGCACTCTGA^ 

GGGTTTAATCTACCTGCTGCACCCTGAAAAATTGGTATTTACCCTT 

Sequence 208 

CGCGCTGGCGGC CGCCCOGGCCGGTACATG G > I C 1 I C CT C AGAAAGT G GT 1C I 1 C CTTAA 
T OIUIM CIIir»A UAA. I iMC I ICMC II CMt^ C^GATG I i »L il C IIL! I IUI U CCA 
CTTTTTCTTCTTCCTCTTCTTC>ACTGAATAGGGTKAGT GTAAAGGCACAACAAATTAA 
C 

ACTGTATCAGATCTCATTCC TTCC AAAAACXSTTTGAGrrCCTA b 1 1 I I 1 1 1 C 1 GTCATTCT 
CATCAACTACCCAATG ITTOTTTTGTn ATTTTATAATTGGGAAGG1TCTCCAAGGCCT 

CCACTAACTTT AACGAATGATAT AGAT AGAGCTCAGAGCAATCTTCTCACGATCATGAAG 
TCATGT ATAAAAATCAGGATTAAAACAAAOGTGATCTGATCTCCAATCATTA7TGGGAAG 
AAAGTCAATTATATTAGAAATGGTTAAGAGCTTG^ 

TAATCT ACCTGCTGCAAGCCTGAAAAATTGT ATTTACCCTT GGTGAAGCTCCTATCT AT 
AAACTTAAGAATGTCTTATCTTACTGGACTGGTACTGGATTAAAAAGA 



CACCGCG GCGGCG GWCGAGCTACACGACATAGGCACATGTGCAAACACAAAGAAGGTGGG 
CATGCTGCTTCTTTCTWTCTGOCCCTAGMCCAGGCTC C T1 TGC1 1 C ACGWAAGATMMACA 
CTTTCCCATTCCTCTGAAGTTGCTGGAAGGACATTTCCCAGGAA 
GCCTATAAACTCTAGTCCCAATG114GGGATAGTCAAIOCGAACATG 



AACCT 



Sequence 210

GGTGGCGGCCCGAGGTACTCACAGTCACGCTCCTCTC^XACC^ 
TTGGCATTCAGGATCCCTACGACAGTCCGCTGCTCCGTCTTCC^ 

TCTCCAAAT AAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAG TTTCCAT 
ATGCTGAAGGTTTTTCCACTATTCACACTCTGTGGCGTAACCTTCTT 

AATGTCACCCAATCTATTTCTrTCCA^ 

CAAGTCTGATCAAGTTrrCGG 
C 

Sequence 211 

GCGGTGGCGGCCZGAGCTACTCACAGTCACCCTCCTCT 

GGTTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTCCAGAGCGCTT^ 
CTVC TCCAAA T AAGAACAAGGACACACaTTG TGTCAGGTCACGAAGAT CATTCAGTTTCC 
ATATGCTGAAGG i t 1 1 1 CCACTATTCACACTCTGTGGCGTAACCTTCTTCAATATAACC 






CAAATGTCACCCAATCTATTTCTTCCAGC7TCTCTCTGGCCATCTTTTCC7TGATCTGA 
G 

ACAAGTCTGATCAGTTTT 
Sequence 212 

GGNGGCGCCCGCCCGGGCAGGTACTTTTNAAAI IHIIIIII ICTGWAGAGACGAGGTCT 
TTCT ATGCT GTTCAGGCTGAACTTCATGGGTTTATTGGGGATGGCTAANGGATGACATTG 
GCTGGTGGTCCTTGATACCAGATAAGCCCTCAGTGTGAAGCAGCTCTTATTTTTCCTT 
GT 

CTTGAGATTGCTCTTGGAATGGAAATTAGGC 1 1 1 I II GAAGGTGTGGAGGC 1 1 1 1 TO G 
TT 

CATTTCTTCAGCAGTTACTTTTT A I II I I IT I AAAATG TTTTGACACACAAGTCTTNTGG 

ATAAATGAATCANTTCACCCAANCAGCCCGGATTTACTTCT^^ 

GT 

NGNTGAACACNTGTCCCCTTTTGAAGAAATCTGGGNCGACAGCTTATGTATCCCCATTCA 
CCCACAACACCCCCAAAAAAAAAAAAAAATTTATTGTCTTGGGGTTCCCCAGGGGAGNTT 
ACCCTTTTTAATGGAACAAAGGTNCCATTCTTGNGGAAA 

AANAAGGAAACCTTTCCCTGGGGGAAAAACAACTTGNAMAGGAAAAAATTAAAAGGAAG 
GGCCCGGGGGCC 
Sequence 213

TGAGAAGCCAGCGCTCACGCACCCGGGGTCTCTGTGCATTGACCT 
GTACCTCGGNG 

Sequence 214



GGCAGGAGAATXX3CTTGAAGCCAAGA6GCGTAAGTTGCAGTGAGCCGAGATCATGGCACT 



AAAAAGAAACAGATGAAACCAATGTGAATAATTTAT1TTAACACAATATACCTAACATAT 
TTTTATTTCAATATCTAACCAGTATAAAAA 

GCTCCTTAAGTAAACAGAAGTAATACCTGATTAATTAGAATTCGCAACCCTCATCAAGTG 
TGTGCTTATATAGAAGAAACCCAGTAAATGTTTGTTGATTGAAAGATATTAATACTCTT 



Sequence 215

TTGGCTTCTGCTGGCACGGTAACCAGACTCCX^TACAACTGCACTCTTTGTCTTTGTCA 
TG 

CiAAGCCGCGAGCG T AGAGGTTCCGCGTGCT CTGCCGGACTTGAGCAGG T CAC T GGGTCCT 
TT ACACTTGTGAATTCGAAGC TTGCCAGATGTAT CCTCAATGCATTG CCACTTCTGCC 
CC 

GGTTGTTCACAGGCTGTCTGGTACGAGATC 
CGCAGCCACCT CAAGATCACAGATTCTGCTGGCCAT ATTCTCTAl 

ATACCTGACCAACTCGTGATCCTAGACATGAAGCATGGAGTG 
GAGGCGAAAAATTACGAAGAGATTGCAAAAGTTGAGAAAGC 
Sequence 216 

CCGCGGNGGCGGCCGAGGTACTTTGGAGTCCCCTGGTTTCTCAAGAATTGGCGTTGACTC 

TTTCTTTGGCTTCTGCTGGCACGGTAACCAGACTCCCTACAACTGCACTCTTTGTCTT 

TG 



GGTCCTTTACACTTGTGAATTC GAAGCTTGCCAGATGTATCCTCAATGCATTGCCACT 
TC 

TGCCCCGGTTGTTCAGAGGCTGTCTGGTACCGAGATCTCCGACCAGTCTGGGGGCGCTGG 



CGGCCT GCGCAGCCACCTCAAGATCACAGATTCTGCTGGCCATATTCTCTACTCCAAAGA 
GGATGCAACCAAGGGGAAATTTGCCTTTACCACTGAAGATTATGACATGTTTGAAGTGTG 

ttttgagagcaagggaacagggcggatacctgaccaactcgtgatcctagacatgaagca 

TGGAGTGGAGGCGAAAAATTACGA 



CXXXCGGTGGCGGCCGACGTACTATCAAACAACATGATACA^ 

AC TACT AGTGG TCACCT GAAATCGATTTTCCCCTCCTTCACAGTAAGAGTTTT AGNTG 
AA 

TGAGTGGCGACTCATAGAGAGATTGCATTTCT GGCTTC CCTTGGAGCGATAGGTAGCCAT 

GAAGAAGAATTGTCTTGGGCCACAGATAAAATACACTGGCATCAAGGATAGCTGATXjAGC 

AA A AAAAAAAAAAAAAAAAAAAGTACCTGCC 

Sequence 218 

CCCGCGGTGGCGGCXIGAGGTACCATCCTGTTCNACAGAGCC^TTGCCTATTCCTAAATTG 

AATCCGACTGGGCGTGCCCCTCCTCGGAAC^CAACAGTAGACCTTAATAGTGGAAACATC 

GATGTGCCTCCGAACATGACAAGCTGGGCCAGCTTTCATAATGGTGTGGCTGCTGGCCTG 

AAGATAGCTCCTGCGTCCCAGATCGACTCAGCTTGGATTGTTTACAATAAGCGCAAGCAT 

GCTGAGTTGGCCAATGAGTATGCTGGCTTTCT^ 

ACCAAGCTGGCGACTCTCAATATCCATGACTACTTGACCAAGG 

ATTGGAC7GCTACTTGGTGTTTCTGCTGCAAAACTAGGCACCATGGATATGTCTATTA 
CT 

CGGCTT CTTAGCATTC^CATT(^TGCTCTCTTACCCCCAACGTCCACAGAOCTG 
Sequence 219 

GTTATTGGTGGTGAAGAGCCGNAGCAACAGTGGGCATGTCTTCTCGCGGTCGATCGGfifTT 
CTCTGGCTCCTTNTTAATTTCCTCCTGGGNAACU3CGCGACTCC^ 



GGGCT 
Sequence 220 

GGCGGCCGAGGTACCATGATATCATGTATCCTGCTTGGAC^TTTTGGGAAGGGGGACCTG 
C TGTTTGGCCAATTT ATCC T ACAGGTC TTGGACGGTGGGACCTC TT CAGAGAAGATCTGG 
T AAGGT CAGCAGCACAGTGGCCATGGAAAAAGAAAAAC TCT ACAG CAT ATTTC CGAGGAT 
CAAGGACAAGTC CAGAACGAGAT C C TCTCATTC TTCTGTCTC GGAAAAAC CCAAAACTTG 
TTGATGCAGAATACAC C AAAAAC CAGGCCTGGAAATC TAT GAAAGAT ACCTT AGGAAAGC 
CAGC TGC T AAGGATGTCCATCTTGTGGATCAC TGCAAAT ACAAG T ATCTGTTT AATTTT 
C 

GAGGCGTAGCTGCAAGTTTCCGGTTTAAACACCrrCTTCCTGTGTGGCTC^CTTGTTTT 

CC 

A TG TTGG T GATGAGTGGCT AGAATT CTTC T ATCCACAGC T GAAGCCATGGGTTCACT AT A 
T CCCAG TCAAAACAGATC T C TCCAATGTCCAAGAG CTG NT ACAATTTGT AA 
Sequence 221 

GCN GGT ACAGOVACAWjAATCAGATGCTCTTT AGAGAT C C TC CATTTCATT ACTCT AACA 
TTCTTCAATGTGGTTCCAGCCACGCATAGTCATATAGATACTACAT ATN CAAAGAT AAC 
T 

TACTGAAGCTTGTTCACAGAACCAAGCTTTCTCCTGGATAAGCTCTTCTKTCCCCTAC 
CC 

CGCACTTCTTGGGNAAGGTATTACCCCAAAATGCrcTrcAGNGGATTTAAAATAAACAAT 
TTTTT AAAAAN AN GGACACTT AA CAC TGACAAAAAAATG GGGGAAATTTTGC TCG GGCCA 
TTGGACNGCGGAAACCAAATTACCGGGTTTAACTTCCAAGNATGGCTTGTCATTTCAAAA 
ACCTGGTATTGGGGGTCCCGTTCGGAAAAAAANANATAGGATATTAACCCATNTTTTTCT 
CATAAGGACCAAGCTATTCTTACNTTTTAATCAACCCAAATTTCCTGGGGGGAAAGGNCC 






TTTCTTCTTATTTTAGGTCTTCGGGGATAGGTCTTNTANTCCCAATAAATAATTG^ 
T 

GAGGTC NTTGCCT GG TATCC TC ^MAAAGGTTGGAAACCAAGCTTGGCNACTTTATCTTCT 

TAAA Li rrcrn igggaaggaaccccaggtttcaagat ai m i n i i g gggaa 

Sequence 222



AGAGAATGGGACATTTTAGAGACTGAAGAGCATTATAAGAGCCGATGGA 
ATTTTATATCTTACTATG MI C I CANCAGATGTAGGGTTTTCTGTAGATGATGATCTCC 
A 

TATGGCCATATCTCCAAAAGANATGAATCCGACAGCNGATACAAAGTTTTTTGGGCTGGG 

TTTATTGCMTCATATAGNNCTTTGGCCCAAATGGNANGCTTCACCCTAT^ 

TT 

ATGGNCTAAATTATTANGACCCANAGGVV -AAGGAGCCT04TTAATTGGTCTCXXATCTT 
nTTI f^rCGTGGHAAGCAC AACCTGCCCTCTATGCATAT C 1 1 C CACGATCCCCAAGCT 
TTCTCATAAANTAAAAATAACCT ACCAATGGCCTGGGTTGCNT(XGTNGGGAATTTGNNT 
GGGGAAATTTGGGAAGCCANGTT 
AGAAA 
Sequence 223 



Mil I U AAGAC^TTNGGWnTT ACAATTCCCTTTGGG 



GGGCGGCOSGAGTGATGCCATCTGCAGTTTTGTGATCTGCAAT^ 

TCAGCCCATTATCTTTAATCGrGACTTTTTI^ 

GA 

GATTTTCACTGAGTTGGTGGTCAGCAATATCAGAAGGCTCATCGATTTACCT 
GTTGGCTCANCTGATGGGGGAAGTGGACCTT AAGTTGCCTGGCGGGGCTGGCCCAGCATC 
AGGATTCTTCCGGTCTCTCATGTCTCTCAAGCGAAAGGAAAAAGGAGTGATATTTGGGTC 
CCCAC TGACGGAGGAAGGCATTGCCCAGATATACCAACTGATTGAG TATCT ACACAAAAA 
CTTGCGAGTAGAGG U I ITCTI IA GAGTACCT 
Sequence 224 

CCGCCCGGGCAGGTACTCCCTGATAAAGGGGAATTTCCATGGCGTCTACAGGGATGACCT 

GAAGAAATTGCTAGAGACCGAGTGTCCTCAGTATATCAGGAAAAAGGGTGCAGACGTCTG 

GTTCAAAGAGTTGGATATCAAiCAiCTGATGGTGCAGTTAACTTCCA^ CATTCT 

GGTGATAAAGATGGGCGTGGCAGCCCACAAAAAAAGCCATGAAGAAAGCCA CAAA GAGTA 

GCTGAGTTACTGGGCCCAGAGGCTGGGCCCCTGGACATGTACAGACTCTCATTTTATGAT 

GTATCCTACTG^ATC>GGACATTTGTGTC>ATGTCAGGTGACGAGGGGAAATGAAAGTGA 

TGAGACGATGAGAGGAGTGAAATACCAAGGACGCCATACTAGGAAACCCAGGTCTATTTG 

TTATCAGAGTAAGGATCAAGCCAG^TAGCCTGTTATGTAATTTCTCCGATAAAAGATTT 

T 

GAAAGCAGGTGCTCTGGGCATCTGTATGGGGGAATCGOKCTCATAGAATTATTT^ 

GTAAATATTTGGTATCAGGCCCAGCAAGGGAAA 

Sequence 225 

CTCCCCGCGGTGGCGG<XGAGGTACTCACAGTCACGCAAATTCACAGT 

CTCTCCATTCTTCTTCTTGGCTTTACAGGTTCCCAGGTCA 

A 

GACCTTCTGAGGATGATCGATAGATAAACACACCTCCTCTGAACCATCCTTGGGCTTCAT 

GGGGTTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTCCAGAGCGCTTTGTG 

AACTTCTCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGTTT 

CX^TATGCTGAAGGTTTrrcCACTATTCACACTCTGTGGCGTAACCTT 

C 

CCCAAATGTCACCCAATCTATTTCTTCCAGCTTCTCTCTGGCCATCTTTT(XT^ 
G 

AGACAGTCTGATCAGTTTT 



GGAGATCCGGGAGCGGCAGAAGTTACGGCGACAGCTCCTCGCGGAGCAGTTGGGAGCTGA 

AAGTGCCGACAGCATTGGTGCCGTGTTAAATAGCAAAGATGAGCAGAG 

AACAAGAGAAACTTGCAGGGCTTCCTATGATACCTCTGCTGCAAATGCAAAACGTAAGTA 

TCTGGATGAAGGAGAGACAGATGAGGACAAAATCGAAGAATATAAGGATGAACTAGAAAT 

GCAACAGGATGAAGCTT ATCATCAATTCATTGTATAAAAATAAAGAGATTTTCCTGAGAG 

AACTGATTTCAAATGCTTC TGA7GCTTT AGAT AAGATAAGGCTAATATCACTGACTGAT 

G 

AAAAT 
Sequence 227 

CNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCAAAGTGATTCAGA 

CAGGCGCTGTAGCAAACGTGCAACTCTTGAGGAACACTT^ 

CAAAAAGCTACAGAAGGTCCAGGCTACTGAAAAGCATCAAGACCAAGCTGTT 

TGCGCATCACAGAGGGGGGCATGGTGTTCCACATGGGAAATTGTTAAAACAGA 

GGAGCCATCGGTGTCAATACCCT7CCTACAAACTGCATTATTAAGAAGTTC 

tggg<xw^g>ccaagccaggagatggataaaatgttaaaaa^tcaagcaac ttctgc t ac 

TTCTGAAAAGGATAATGATGATGACCAAAGTGACAAGGGTACCTCGGCCGCrCTAGAACT 
AGTG 

Sequence 228

GAGCTCCCTCCTACCCCCTAGCTGAGTAGGCCAGGTTTTC^ 

GC5AAAGTTOCTGCATATGCTGCGCAGTATGNGCCTTGAATAAAAATCCTGAAGATTAGA 
GGTTCAGGCTGCATCATCCCAAAGCAAAGAGCACCTCTTTGAAGCTCACCTGCCCGGGCG 
GCCGAGGTACI I I I 1 1 II I Ml II II 1 1 It I CAGTANGNAGCTTTAAAACAGTTACATAT 



Sequence 229

TGGCGGCCGAGGTACTACAGGATGATC 
ATGGAGTTGGGGAGAGAATGTCTAAACCTCTGGGGGTATGAACGGGTAGATGAAATTATT 
TGGGTGAAGACAAATCAACTGCAACGCATCATTCGGACAGGCCGTACCT GCCCGGGC GGT 
ACTTNN 1 1 I mill I I I! I II I I 1 1 IIIAII! III! I I 
ACATTGNTC AG TTTTT ACTTGNAAAAAGT 
NTTATAGAANAGTTTTATTGGAATGTT ATTTT ATTAAGCCNTTTTCATGGGTTATTTTTT 
TTTAAAGTTTAAAAAGTTTTTACAACANGGTGGGNGGGGGGGNTTNCACCTGGCATG 
GCACTTTTGGAGGNCCCGGCGGGCANAAACCTGANGGCGGGGAGGTTTAAAAAANCNA 
CTGNCCANATTGGN AAACCCNTN I IT 1 1 I C TTAAAATTCCCAAATTAAATTC 
C 

Sequence 230 



Sequence 231

CCACCGCGGTGGCGG^ICGAGGTACGACGTTTCCATCAGCTTGTCTGTTTCATTCCCTGAT 
G TT ACGAGCAAT ATGAC CATC TT CTGT ATTCT GGAAAC TGACAAGACGCGGC TTTT ATC T 
TCACCTTTCTCTATAGAGCTTGAGGACCCTCAGGCTCCCCCAGACCACATTCCTTGGATT 

AATNTCGGATATCAAGCCTT ATNCGATACCCGTCGACCC TTCGGAGGGGGNGGGCCCCCG 

GGTACCCCAGCCTTNTTGTTTCCCC T 1 1 1 aggtggagggggtttaaatttgccgccgcnt 
tgngcggtaaattcaatgggttcattaggcttgtcttcccctgtggtgnaaaattngtta 
atcngggctcaccaantttcccaccacaaac caatancg nagncccgggggaggccatta 
aaaaggtngtaaaaagcccttggggggttggccctaatgaagtgg^ 
attaaatttgccgtttggcgcttcacttgccccgctttttccaagtccgg 
CGGTGGOGGCCGCCCGGGCAGGTACTTT AI II 1 1 II I 1 1 I I II II I H I 1 1 1 MCTTTWA 
A 

AAAAAAAAAANGATATTTTAATATATTCAGATCCNCAAATATGAAATAAAACTAAGMNGA 

GCTGGTATTCATTTACACATAATTATCTTATACCGTTNGGAATAAGAATTTGGGGCNC 

GT 

TAGCAAACCAAAAGGCTCAAAAAGACGTCCNGATATTTAGTTCTTCTCTCCCTCTACAAA 
NGGGAAGCACTWTTTT ATCCGGCATTCCTAGGGGNGTTCCTATTTTCAA 
Sequence 233 

CGGTGGCGGCCGNCCGGGCAGGACGCGGGGGCCAGTTCTCTTCGGGGACT aactgcaacg 

GACAGACTCAAGATGATTCC C I 1 1 1 I ACCCATG1 1 1 1 C TCTACTATTGCTGCTTATTGT 

T 

AACCCTATAAACGCCAACAATCATTATGACAAGATCTrGGCTCATAGTCGTATCAGGGGT 
CGGGGAGCAAGGCCCAAATGTCTG TGCCCTTGAACANGATTTTGGGCACCAAAAAGAAAT 
ACTTCAGCCACTTGTAAGAAACTGGGTATAAAAfiMGTCCATCTGTGGGACAGWAAAAAC 

AATGGAAAAGGGCTGCCCAAGWCAfl i M il I AGCCCA TTTGACCGANTGG TTTTATTGGG 

TTTCCTTGGACCGNCCCTNAAANAAACC TTGAAGGGGGAAGGGGNGGAATCCGGAGGGGG 

AAAAGGGGGGA 

Sequence 234 

CGCGGAGGOGGCCGCCCGGGCAGGTACAGTATAGGTTG G I 1 1 rG C CTGT 1 1 I GA CGCTTT 
ATATATACGTAGACAGACATACACATCTATATATACACACACACATTTTACATATATATA 
TGAAACTGTATAATGTrGTTCGCTTCAGTGTCTGGCTGCTTTTACTT^ACATTGTGAAA^ 
T 

AATTCCTGTTATCGGWATATGGGTATCNAAATTTGKTTTGCCCTA G 1 1 1 1 I U CCTTCTC 
A 

TTGCTTTCTGAATTGGGGGCAGCTTTGCCCCTCAAGGGGAAATTTAGCAATGTCTGGAGA 
CATTTTTT1 1 ATTTTCATAATTTNGGGAGGGGACATGGGGGGAGGTTTGGTGGCTACAGG 
AACCTTAATTAAGGT7CGAGGGACAGGGGTTAGGTGCTTGAACGGTTNCCACANGTAACA 
CTTCGGGCNCGCTnO^AAGAAACCTAGGTGGGATTCCCCCCCNGGGTCTGGCNANGGAAA 
ATTCCGANTATT>JCNAAGCCTTANTCGANTACCCCGGNCGACCCTTN^^ 

Sequence 235 

CGCGGTGGGGGCCGAGGAC I I 1 1 1 1 I I I I 1 1 U M H I II I 1 I 1 1 1 1 ATAATAATTTTGT 
CATTTTTGT AGAGACAAGG TC TCCCATG TTGCC CAGGCTGGTC TCAAACTCC T AGGCT CA 
AC TGATCCTCC T ACC TCCACCTNTGCC TC CCAATT ATCCC CAATTGAGAGATGAAAATTC 
TGACAAGCTCTCAAACGTTAACTGaCTTGCCCATAAATGACAGTTCCAAAGTTATAAGGG 
CCT AGNAACNTTGAATCCAGGTNCTGTTAGNAAATTCT AGGGTTTGAGAAATCCCATATT 
TCTMTCCACTTCCCGCGGTACCCTGCCCCCGGGGCCGGGCGGCCTTCTAGGAACflTAGGT 
GGGATCCCaXCGGGGCTTGCAGGGAATTCX^TATTCAAG 

AGGGGGTTTAAATT 
Sequence 236

GCGG CCGN C CGGGCAGGNAC C TACGCCACAGACAGC CAGAGGGAAAGCGAC C CAGACAG C 
AGCCCCTCCTCGACAGGCCCACCCTGCAGCTCAGGCACCAAGAAAACAGCCGATACTGGC 



GATTTTGAAGAGC TTTCTACA T ATCC GGTTATG T AAANTTCAT A T ATG T ATTTTTTGGAA 
ATCAGTTCTTATANAACCAGCC TCCGATTCAAGTCTTTAGGCTAAAATTTTATAGGTCC 
T 



aaggg taggtatggttaacaattttggaaoctttttggtcctt AAAGAAAAAGGTTGGAC 

TTGTTTCAAWATAW H ICTNTC 1 1 ACCTWGTGAAAAGGAAAATCWTT A C I M il ICCTAA 
TTAAAAAGGAATTCTTGTTACCCTTCGGGCTCCGCTTCTTAGGAAAC^ 
^♦CCCCCCGGGGTCTTGNGAAGGNAAATTTTCGAATATTCCAAAGGCTTTTATTCGAATAC 
CCCGGCTCGGAACCTCGGNAGGGGGGGGGGGGCCCCGGGGTACCCCCAAGC 1 1 1 1 1 INGT 

Sequence 237 

GCAGTrrrGTGATCTGCAATGATTCTTCCCTTCGAGGTCAGCCCATTATCTTTAAT^ 
G 

ACTTTITTGTGGAGAAACTCCGACATGAGAAACCTGAGATTTTCACTGAGTTGGTGGTCA 
GCAATATCACAAGGCTCATCGATTTACCTGGAACTGAGTTGGCTCAGCTGATGGGGGAAG 
TGGACCTTAAG TTGCCTGGCGGGGCTGGCCCAGCATCAGGATTCTTCCGG T CTCTCATGT 
C TCTCAAGCGAAAGGAAAAAGGAG TGAT ATTTGGGTCCCCACTGACGGAGGAAGGCATTG 

. CCCAGATATACCAACTGATTGAG t atc tacacaaaaacttgcgagtagagggtttgttt a 

gagtacct 

Sequence 238 

cccgcggtgggggccgaggtacgcggggattgtgtgca^aatgagagaggggtgcaagga 
tcctgatttttcaggagttcaagogacaatggcagcccaata^ 
ccccagcacaccaggggccagttatgggcctggaaggcaa& 
gagaattgtgttagtgggtaaaaccggagcaggaaaaagtgcaacagga^ 

TGGCCGGAAAGTG1TTCATTCTGGGACTGCAGCAAAATCCATTACCAAGAAGTGTGAGAA 
ACGCAGCAGCTCATGGAAGGAAACAGAACTTGTCCGT AGTT GAGACACCAGGCATTTTCG 
ACACAGAGGTGCCCAATGC 
Sequence 239 

CCGCGGTGGCGGCCGAGGTACCAGTTAAGTGAACAGCTCGTCTAGGTCTGCTTTTGTAAC 
ACCCAAATACAATT AGCACTTCT CTGCTGGT ATTCCCTGGGCCGTCTT AATTATCT AG 
AG 

GCCAGGAGGCAAAGCCTAGCACGTAACAAAGTATGTGCTTTGTAACTGCTGATTAATTCA 

gtttcttaactaggcagaggaggtcatcagtgtatctaattcacactattaatacactg 

GTTATGGOCTAGCCCCAGNGGGAAG 

TAGGTGCTTGGCCTGGAGCTCCCAGTTTTAGGGTCGTGGTTACTGTTTCTGGTTTCCAAC 
TTATTAAAATCCAGGGGATGGACCTGGTTACCTCAGATTTCAGGTTGCCTT atggtagga 
AAAATAGGAATGCCACAGGCCAAAAAACATTAATTTTGGGGGGGATGGACTTGGGCAGNC 



Sequence 240

GWGGWGGGCCGGCOCGAGGTAC 1 U I H H I I N 1 1 I I 1 1 1 GGTATGAGTAT AGATGGC 
TA 

GTG N G TC TTTTT ATT AGC T ATCANCGTTCATTT AACAGACAAAAAATTCAAG TTCAATG 
N 

NNGGNCATTAAAATACGGAAGAATT AACAATAAGTTCaTTAATCAATCTTTCANCTG TT 
C 

CTATTTTAT(^CAAATNACTTTTGCTANAATTGGAANAAGGAT>K>TGGGAAGGGGAC^ 
GTCTTGGAAAAACGCAAACCGTAA1TGTGTTCTTTCAAATTCCATAAAAGACACTTC 
NNCAAAAAAAT AAAT AAACAAGGNAAGGG CCGCNTCATT ACC TNTT AG TTTN GGGGN G T N 
GGAAATT GAATCATGGCCAAG TGCC T AAGNGC hmTTT GC TG N TNAG TT AACCCNC CG T G 
CCCGCGTCNTAGGAAACCTATGNTGNGGATCCCCCCCGGGGCCTTGCCANGNGGAAATTT 
CGAATAATCCAAANGCCTTTaTCCGGAATA( 
GGG 
TCAGCTGCCTTCM^AAGGAGGAACAAGATCGGCAA

GAGAGT GTTGTAAAGTGTGGAAATGCCTTCTTAAAGTTTATAAAAGTAAAATCA 
Al 1 1 1 1 1 1 rCCAAAAAAAAAAAAAAAAAGTACCT 
Sequence 242 

TGTCT cagatcaaggaaaagatggccagagagaagctggaagaaatagattgggtgacat 

TTCGGGTTATATTGAAGAAGGTTACGCWACAGAGTGTGAATAGTGGAAAAACC^ 
TATGGAAACTGAATGATCTTCGTGACCTGACACAATGTGTGTCCTTGTT atttgga 
G 

aagttcacaaagcgctctggaagacggagcaggggactgtcgtagggatcctcaatgcca 



Sequence 243

gtacgcggggtgctgggattacag^scacgagocagtgggcccagcti 
tattaagctgttctggactgtcgggctccttgggcagatgctgt attatgoggataagcc 
acacac tttttgaactggcccgg tc^ggggggacataaccatrtnctg tgccacccca tc 

AATCCGCACCTATTCTGAGTGTAGGCTCCTCCCCTGCTTGAGT aatggccacagatcttg 
gctcggcactcctaagctgcatgttgaattcctgggacaaca 

ATTCTCCAGATCUrrTGGGTTGGCTTCTGGGTGCACTAGGAGATC 

caccaggaaagtactggaagtaaagtctgactctaaagaagatgaaaatct agtaattaa 
tgaagtaataaattcttccaaagggaaaaaacgcaaggj<ag 
ttgtagttctcaat< 




tcccgaaaaaan 

Sequence 244

tocacccacctcggccctccagtgtgctgggattacaggcatg 
CTGGTTI gctttctgaaccatgtc>atacagtaccaccacagttgct 
at 

ctttcattaaaacatcaccgtctagtttgagaatactttt aagcctgctggcctccttt 



tactccgtaagctt 



gggcattc i i rr ttctcttttcagcacggatc 

T 

tagccatgttttaccttgagggccgaagttaacttcagcgggagtgaacgao 

gctccactttatccagtgcactcggaagcoggagggcccccaccaaaaa 

accctc 

Sequence 245 

ccccgcggtggcggcxxxxxxsggcaggtacaattgctt 

AGGATGaCCACTAGOXACCACCTTTXACTG TTTCT ACAGTCCTGGNGAGCAAGTTTGGA 

GTTAAGGCTTCAAAATCCTGCAGCACACACATGCCGAAGGT^ 

GTCTCGTTGTAGTAGCAGTAGCGAATGTTTGTGGCTGCTATGAAGAGTTC 

TCCTGCTTT ATGTTCAG T GTTCCA I ICTT IAI 1 1 1 C 1 1 CTGCAGCTGTCGCA 

T 

Sequence 246 

GCGGCCGTGGGGATCAGCGTAGGTGAGCTGNGGCCTTTTGCGAGGTGCTGCAGCCATAGC 

TACGT GCGTTCGCTACCGAGGATTGAGCGTCTCCAGCCATCTTCTGCGCNGNCACCATCT 

ACAT AA TGAATCCCAGTATGAAGCAGCAACAAGAAGAAATCAAAAGAAGAAT AT AAAGAA 

ATACTIC TTGTCCCAAAGGAAGGAAAC TCTTGAAGGATTGAATTTCAGCCC TTC TTGCAT 

CTrGGGATCTCTrGGTTGGGAAACGGAAGGAAANAAATNGGAAGCCTTGTCOCGCAAGNG 

CTTTGTCCANAMAAAGGGGAAAACCATTCTGGGGAATGGACCCACCTTT 

CAAACC TTCCAAGCCCCTTGGGGGGGTNTATTTGGTCGCCAACACAAAAAAAT AGAAGTA 



TAAAGAAATANAGGTTANCCTTCGGGCCCGCTTCTTANGGAACCTAGNNGGGGAATCGCC 
^CGGGGCCTTGCCAGGGGAAATTCNGGAAT>nTCAAAAGCCTrTATCGGAATACCCCGTC 
CGGACCCrrCGGAGGGGGGGGGGGGCCCCGG 
Sequence 247 

ggcttgcttgactagatgagctgctatagtagccaatcctgttagacttggaccattgtt 
tgtctgaagaangggaatctgtcgctcgccctgag 

gncccaagggacttctgcaag t agccgacaaactc tgcccggcc<jcccgcca tc ttccgg 
gcccg c tctagaaact aagtitggggatccccccgggggcttgcaaggggaaatttccgaa 

TATCAAAAGCTT ATCACAA TAACCCGTCGGAACCITOG 

GGTACCCCAAGC 1 1 1 1 1 1 U I NTCCCCTTTT AAGTGGAAGGGGGTTTAAATTNGCCGCCGC 
NTTGGGCGGTAAAWT CANTGGGTCATTAGGCTTG TTTTCCCTGG TNG T CGGAAAAATTTG 
MMTT ATTCGCGCTCACCAAATTCNCACAACAAACAATAACCGAAGCCCGGGGGGA<^CCA 
TTAAAAAGGTT GGTAAAAAGfn^CTTGGGGGGTGGCfKrTAAATGGGAAGTNGAGCCTAAA 
CTTCACAATTAAATTTGCCGTTTGGCCGCTTCACTGGNCCCGCT 
Sequence 248

T HTTTTTTTTTTTrrrrnTTTTTCTTlTTI 
ATTTAATTAGGGMTGT AACAAATGGTTA 
ATTNTAGKAAGAAAAACCAAATTGAATAATTTTCT^^ 
CAMCCNTAATGAACATCACATAATGAAGTTNCTCCTTTCCAMATCTATAAA 

TGCTATAAAATCTCTTACATTTTAANACTT ACTCTTAANAAATCATC T ATTCTTCCO TC 
Sequence 249 

AGACTGTCTCAGATCAAGGAAAAGATGGOCAGAGAGAAAGCTGGAAGAAATAAGATTGGG 
TGACATTTGGGGTTATATTGAAGAAGGTTACGCCACGGAGTGTGAATAGTGGAAAAACCT 
TCAGCATATGGAAACTGAATGATCTrCGTGAOCTGACACAATGTGTGTCC I IG l IC1 1 
AT 

TTGGAGAAGTTCACAAAGCCGCTCTGGAAGACGGAGCAGG^ 

AATGCC A ACCCC ATGAAGCCGAAGGATGGTTCAGAGGAGGTGTGTTTATCTATCGATCAT 
CCTCAGAAGGTCTTAATTATGGGTGAAGCTCTT GACCTGGC^ACCTGTAAAGCCAAGAAG 
AAGAATGGAGAGCCCGTGCACGCAGACTGTGAATTTGCG T GACTGTGAGTACCT 
Sequence 250 

CGGCCGGAGTGATGCCATCTGCAG TTTTGTGATCTGCAATGATTCTT CCCTT CGAGGTCA 
GCCCATTATCTTTAATCCGGAC 1 1 1 1 1 1 GTGGAGAAACTrCCGACATGAGAAACCTGAGAT 
TTTC^CTGAGTTGGTGGTC^GCAATATC^CAAGGCTCATa^ 
GGCTCAGC TGATGGG GGAAGTGGACCTTAAGTTGCCTGGCaSGGG 
ATTCTTCCGGTCTCTCATGTCTCTCAAGCGAAAGGAAAAAGGAGTGATATTTGGGTCCCC 
ACT GACGGAGGAAGGCATTGCCCAGATATACCAACTGATTGAGTATCT ACACAAAAACTT 
GCGAGTAGAGGGTTTGTTTAGAGTACCT 
Sequence 251 

TG<XXSGCCGAGGTACCAGCA£AAA 
TCTACCCGGGAATt 
Sequence 252 

AGGT ACATTTT ACTACGGACGCTTAGGC A 1 1 CI II 1 1 C TC 

c 

G T GCACA TGCACACACACAAATGGGTGAAACAATTCTCACCATAI 
CC T GCCGAGAATTTGCATTTC TAACAAGTTCCCAGG TGATGCTGACACTGCT GGCTCA TG 
GAACCACTGCTGTAGTATTTTCCAAATTATCCTGATTCTAAGAACCACCTATGACCTGT 
G 

CTGI 1 1 1 1 1 CTGTGGTTACTGGCTCATGTCACATAAATTCTTTT aggattcaaacatgt 
T 
tgtgatattactcagtatttacatcttgcttttactgcagcatgatggaaaaattaacc

A 

CAGGTATATCATAACAAAAAGAACATGAGTTACCATTTTTCACAAAGTTCAGATATATT 
T 

AAATTAGCCTATTTAATi 

rACCAGTAG 

TATTTCAGATTTATTAAAAAACACATAGGTAACCGAGTCANAGCTTTGGCTAGGAATGAN 

TTGGAAAAGAAC TGAAGGCATAATTTXACAGGACATTCACAGTTAGTGTGC T AGAAGACA 

NGAGAGGGAAGCAGGGAAAAGTGTTTTTAAGAAAGCATTTGCGGGCCGGGACAAATGGGA 

AAGGGCCCGGGCTTTCATCGAAATTCCCTTGTTTTGCCTTGGATCCCACAATCT^ 

GGAAAAGGGTGGGGACAAGAAGXSAAGNGCCCAAGGGATGGGGAGCCACCCGATCCCAAGA 

CCAAGGAAGTANTTTTGCCGCTCCCGGGANGGGGGGGCAAATTGGATCCTTTGGAATCCT 

TCAATGGGTGGCCTNGGGGGTAGCTTAAGGGGGCCCGGTGGAATCCTCCTTTCTNGCATT 

T CCGGGGGCC GGGCNAAAT NGCC CAAGGGGGG T ACCC TTGGGGCCCGCTT CT AAGAAAC C 

TAGGG**GGGGGATTCCCCCCGGGGCTTGCAW>GGAAATTTCGGAATATCAAAA SGCTTA A 

T 

Sequence 254 

CTCACCGCGG TGGCGG MCGXGGT ACTCATGGNTGC TGNAAATCATGGCACGCCCGTTCTG 
CAGGGMTNT GCTTAGCCAGGCT CCTNTGAGATCTGGCTATTNTGNCTTGTGGATNNTCAG 
TCCCCGNGTACCTGCCCGGG 



AGTCAACAAAGAAGAAAI II C I t 1 1 1 1 CGGAGACAAAGAGATATTTCACACAGTATAGTT 
TTGCCGGCT GCAGTTTC TTCAGCTCATCCGGTTCC T AAGCACAT AAAGAAGCCAGAC TAT 



CTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATTGTGTGCAAAATCAGAGGGGGGTGCAA 

AGATCCTGATTTTTCAGGAGTTCAAGCGACAATGGCAGCCCAATACGGCAGTATGAGCTT 

CAACCCCAGCACACCAGGGGCCAGTTATGGGCCTGGAAGGCAAGAGCCCAGAAATTCCCA 

ATTGAGAATTGTGTTAGTGGGTAAAACCGGAGCAGGAAAAAGTGCAACAGGAAACAGCAT 

CCTTGGCCGGAAAGTGTTTCATTCTGGCACTGCAGCAAAATCCATTACCAAGAAG^ 

GAAACGCAGCAGCTCATGGAAGGAAACAGAACTTGTCGTAGTTGACACACCAGGCATTTT 

Sequence 256 

ANCCCACAC CACCAC NTCTGATTAATNTTTT GNATTT AAAN NTTT AGG TGGGGC T NCAC C 
ATGTTGCCCAGACTGGTNTTGAACTCCTGAGCTT AAGCAATCCACCTGCCTCGGCCTCCC 
AAAGNGTTGGGATCACAGGCGTGAGCCACCGCATCCGGCCTCATGTTC I 1 1 I ICATTAAA 
GAGAGAAATCAACTA7TCAGGACCGGCCCCCACCTTTCCTCAGGAGTCATTTCTGTTCCG 
CACAGGCCTGCTGAACTGGGTGCTTTATATAGGGNANAGGGGGGCTCATTTTTNGTTCCC 
CTGNCCCNCAAGCNTTANGGGGCAAAAANAAAACCATNCCAANAATTrGGNAAAGGGNNT 
1 1 1 1 1 1 1 1 1 1 NAAAATNNGGNNNGGGGGGGGGCCCCCCTCNCTTGNGGTCGGGNGGNTTT 
TNCMG6NGNM 
Sequence 257 

AGCTCCCCGCGGTGGCGGCCGAGGTACTCTGACTTGCAGGGCCCAAGACCGGCCTTGCGA 
GCGTCGTTGGCTGATGGGAGTAGAAGCCACAGAGAGTCTTCCTCTTGGAGGTACAGTCAA 
TTCTGAGGTTTGGGCGTCATAGACT AAACCCAGAAAAC AGAACATTGGGAAGTCTTCGGA 
ATATTCTCTATCTTCTTCACCAACGAGTAAGACCGTTTTG 
Sequence 258 

GGCCACGTGACCGACGCGAACATNGCGGCGCCCAGTGGCGTCCACCTGNTTTTCCGCAGA 
G G TTCTCAT AGAATTTTCTC TTCAC CACT CAATCAT ATCT AC T N ACA C AAG C AGTCAAG 



Sequence 259

GGTGGCGGCCGGCGGGAGGCTGACGAGAGCCCGGGAGGCGTTAGCGAAGGAAGAGAAAAA 
CCGAAGACGAAGC CACT ACAGCC CCGCGT ACCT 
Sequence 260 

GGAGCATAAAGNTGTAAAGCCTGGGTGTGCCCT AATGAGGTGAGCCTAACTTCAGATTTA 
ATTGCGTTGCGCTCACTrGNACC^TTTCCAGTCGGGCNAAACCC^^ 
TGGNATTAAATGGAAATC 
GTATTGWGGGCGGCTTCTrrCCGCCTTTTO 
GC 

TTCGGGGTNCGGTTTCWGGCTTG^GGGGCGNAGGCCGGGTAAThK^G^ 



NACCGCCANGGGGAAAANGAAACCATGffTGGAGCCAAAAAAGG 
Sequence 261 

TG T GTTGAAAAATTGTTATCN fit* CTTCACAAATTCCACACAACAT ACCGAN GCCCGG N NA 
GTC^TAAAGTGTAAAAGCCCTGGGGTGCCTTAATGTAGTGAGCTAACCTCACATTAATTG 
CGTTGNGCTCACATGCCCGCTTTTCCAAGTTCCGG 
Sequence 262 

GGGCGGCCGAGGTACCCGATAGAACATGGCATCATCACCAACTGGGACGACATGGAAAAG 
ATCTGGCACCACTCTTTCT ACAATGAGCTTCGTG7TGCCCCTGAAGAGCATCCCACCCTG 

ctcacggaggcacccctgaacccnaaggccaaccgggagaaaatgacttcaaattattgt 
ttgaga c i 1 1 1 caaatgtcccangcccatgtatgtggcttatccaggccggtcgcctgtc 

TTCTCTTATGCCTCTGGNACGCACATCCTG<iCATCTGAGCCTGGACTCTTGGAGATNGGG 
T G TTCACTCCACAAATT GTTCCCCCATTCTT ATNGAGGGGGGCTATTGC NCTTGGCC CCC 



TT ACC TAACCTGGCTT G NAAGCCGNT GGAGGAATTGGTTCNCG GGGGACCAA7TCAAAAG 

GGAAGAAAANCTGG 

Sequence 263 

GTCCGCGCGGTGGCGGGCGAGGTAC I H I II 1 1 II 1 1 1 1 1 1 1 1 1 M 1 GCAGCCG I II I ! 
C 

TTACTAGAAGCTAGGCNGAAAGAGTTGTTACTCANATTTCTTGAACTTGAGACGTCAAAG 



GGCGCCATCTTGNGAAAAGCCCCGCGNACCT 
Sequence 265 

AGCN N CCCGCGGTGGCG NTN GCC N GG GCA MCCCGCGG GGGTGGAAACCTCTT CAGCATT N 
GC TT N N N NTC AGGGGGCT AAAAAACCCAN C AACC GGGACCCCAGCTTTTC AGAAC TGCAG 
G G NAACAGCCATCATGAG N GAGGGCAC CAAGAATTCCC TGGAGAAAATCC TTCCACAGC T 



ATAGAGN GC GCAAC CAGG GN GAAAN NTTAAAC ACNGAGN NCAAAGN GGN CG NGGG NC CC N 
CGGCCGCTCT AGAAC CAGGGGACCCCCGGGCCCGCAGGGAANNCCGANANCAAAGCCNAA 
NCGAAACCCGGCNACCNNCGAGGGGGGGGCCCCGG ACCCCAGCNN NNNGGN CCCCCN NAA 
GAAAAGCCCN
Sequence 266 

AGGTAU 1 1 1 CTAGGTATTGCTGGGCAAGATCCTTGTTGGAGTCCTCCTCTTTTGCTG 
CC 

CCACTCAGAuGGATAGGCAGAGCAGAC TGGCAGACACAACAGCACAAGGAATGCAAGATGC 
ATCATTCTCACTGCCCTTACCTTCTrTGTCTACTGGGCTTCTCCCCGCGT ACCTGCCC 
GG 

GCGGNCGNTCGAGCCGCCGGGCAGGTAC TACC TGWACCAACTTrrTCATTTGGGCATCAC 

GAAAATTCGGCCTAAACAGNAATAAATGAAAATGGAfn"GGAAATCAAAGN NCTTAAAT AG 

AACAfK3AAGGCNGGGCACCX3GNGGNTCACGCCTNG NANNCCCAGCACT 

T 



iTACTTTACCTCATTTCCTACCAATCATT 



GCGAT1 

TTAAGAGAATTTGGTTGTAT* 
A 

TTTTAGC6GTGGTCGCGGCCGAGCTACGGATACAATTC 

TAACCTCTCCATCACACGCCCCAGAAAGGACAGTAGCC^GCTTCTCTGGATGCTTTGCCA 
AGCAA TTGAC T CCA TCAGGG TGACCATCCAGCGAAGCAAGGAA T GG TTTTGCAAAT ACT C 
GTTXXAGTTTGGTAGCATTTAAAGCTCTTATATATTCT 
TA 

AAGCAGGATCATAGTTTCTTGGAACTCTCTGTAAGTO^ 
TGTCCGGATTCXGGCTCAGCATCTTCACCTTCATCTCGGTTGCTCTTC 
Sequence 268

NATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC^TTTATATGAAAGTCCTC^ 
AGCAGAAAAGGAGTAACTAGATGGGCATTTTC TAT ACCAGCT AAGGCTTTAAACATAACA 
ACGTCTACTGAACTATTTTCTACTT ACTTTGACTGAATAAGCCAGTGAGATCGTGACTG 
C 

AAGT GGAAGACCTTCTGGCACT GC GACCACTAAAACTGT AACTCCAAT AATGAAGAAC TT 
CACAAAGTATTGTATATAAATTGGTGTGCACTCAGCAAGCCATGGTCTTTTCTGAACCCA 
GAAGGTG TCAATGACAAAAT AT AAT ACT AGAATGAT AACTGTGATGGCAGGCATCAACAG 
ACCTTTCAGAATAGAAATGAAAGAAAAAT GTGATTATT AAATTT C CAGACACT AACCC TT 
GACAGATATAAATTAAACACTGTAAAGAGTTATAACTTGCTTGATAGTATTGAATTTCT 
C 

TGAGAAATTACTTC 1 1 IC I i GCACCTT ATAACTTGACATTGTCAGATTT AATTTTT 
Sequence 269 

ATTGGAGCTWXCGCGGTGGCGGCCGAGGTACGCGGGATAGTGGAGGCACTGAAAGACCA 
GCAGAGGCATAAGGTTCX5GGAAGAGGTTGTTACCGTGGGCAACTCTGTCAACGAAGGCTT 
GAACCAACCTCGAGCGGCCGCCCGGGCAGGTACaGATGCACAGGAGGCCATAGGGTTTAG 
GCAAAGGGGAGCACAAAAGTTGAAGATGAGGC gctgccaccaatgctgggacttcaggcc 
agg^kjcaggagctgaggaagccacaagggaggacattttctgcaasttgctgaaccagtag 
caaccaggtcctgagaaagccctctcttgtggaagaataacagccaggaggaaaagcttt 
tcattctgcaaagctggggcagaaagtt c rwrn gaatcccgcgtacctcggcccgmtc 
tagaactantggattcccccgggctggaggaattc 

Sequence 270 

GTCTTCGG^rTTTTCTCTTCTTTTTCCAGGGCCTCCAANCCCTCGTC^ 
Sequence 271 

GGGAGGCGN N AGCGAAGGAAGAGANT N TT C GAM GAC GAAGAAAAC CCAGCGCCCCC CACG 

NACCT 

Sequence 272 

TT G GAGC T CCCCGCGGTGG C GGCCGAG T CCCAC AGTT Al 



CTCAGX5GVVAAGGAGCCTGG<3TTGATTAACTTGTGTGTC>iATGTCCCACCCGTCCCAGGTA 
ACATTTTGCCCCCTGAGGTCCGGGGTAATTTAATGGCTGCTGGACAAAACCTCCAAAGTT 
CTTGAAAGATCAGAAATGATAGCTACCT GGAGTCCAGCTGTACGGCACTTGGCGTAAAGC 
CGCTTCCCTCAAGAGTAACTACAATCTTCWIATGCACAAGATGATTAATACAGATCTTAG 
CAGAATCTTGAAAAGCCCAGGAGATCCAAAGAGCCCTTCGVkGCACCACG^ 

atcgcagagtcctaaagaagaacccactgaaaaacttgagaatcatgttgaagctaaacc 
catattgcaaagaccatgcgccggaacaccattcttcgccaggccaggaatcacaagctc 
cgggtggataaggcagctgctgc^k^anc^5gcactacaagcccaatc^atgagaaggccg 

GCGGTTGCAGGCAAGAAGCCCTGTGG TAGGTAANAAGGG 
Sequence 273 

TNTTAGGGNCAAACA<y5GCCCCAGCCC<^GWCCA^ 
NGACAAAAACCCAGGNCACCCACCTGCCCG 
Sequence 274 

GCX3ATTGGAGCTCCCCGCGGTGGCGGCCGAG<3TACCGCX5TCGATGCTATGCGCTCAGT7C 
TAGTCAGAATAATC 1 1 bC I U ATCCTCCAGCTCCCCCTGTTCCACCAAGGCAGAATTCAAG 
CCCTCATCrrGCCAAAAC TACCA CCAAAGACTTACAAACGGGAGCTTTCGCACCC 

CGGTGACAC^TGTTGGTGTTGGTATTAOfiAGATCTGCACATCCCACACCGGTGCAACAG 

TTTGCCAJGCTAAATTCAAAAAACTCCT6GTGCCAGG 

AGGAAACCTTTGCA 

Sequence 275 

CAGCGAGCACGCGT1OT(XX3CAACCCGAAA 

CCGCTCTAGAACTA 
Sequence 276 

AGGTACGTTCTATTCCTGCT CCTATTAGGTCCTTCTCACC^GCACCGGCCCTCGGTCGATT 
ACGCCTCTCCAGTTCTGCTGGGGACGTTCTAGCCTCGCXiCCANCCGCGTCGATCTTTATG 
TTATACCGTCACTCCCAGTGCCCTAATGGAACTATCCCTCCACTACTCCCCCTGGTTCTA 

TCAAGCXXTCCCCGCGTACCTGCCCG 
Sequence 277 

gac t ccccgcggt ggcggccgagg t a c gcgogcigag c g ggccc t a cc g tgtgcgcagaaa 
gtggaggcgcttgccttcagcttgtgggaaatcccgaagatggccaaagacaactcanct 
gttcgntgc ttc agggcc tgc tgatttttg gaaatg tgattattggttgtt gcggcat 

TG 

CC TGCTGCGGAGTGCATC TTCTTTG T ATC TGACCAACACAGCC TCT A CCCACTGCTTGAA 

GCCACCGACAACGATGACATCTATGGGG 

Sequence 278 

TT CGCCCGGGCAGGT ACTTTCATCCAT AAAGGCC TGCAGC TGTTTCAC TGATCC TTG CAG 
TTCATCCATCACCAACTCCATACAGTCAAAGACTrrGCTCTGGTTCTGTAATATTTTCT 

G 

GTAGTCAGGTTTTGTATTAAGAACTTCATTCTGAGAAGACCCAAGATATGTCATAGGTTC 

CACTTTGACCTCAGTAATTTTGGCCTCAGTTGATCCTCTGGACAATATCTCTTTAGCCT 

C 

CTGCTGGTAGTGAGGCAAGAGC TGATCCCAAGTCTGACGTTCTAAAGAAAACTTTGTTAT 
G T ATTCCTTCATCTCA GCCACAGATGC TTCCAAAGAAAAATCTGATGCTTTTCCATTTG 
A 

ATCTTCAAAACATrTTTGNAGAGTTCCATCAGTTTCCAGGCCGTCTGCAAAATGTTTCA 
A 

TTCTTCAGAAAGAGAAGATGCTTTGGCTCTAAAACTTTCAAGACTGAAGCCCTTAGTGGC 
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CCTTANGAAAGGGT 
Sequence 279 

CACTGTTCTTTCTTTCTAATAAAACTTTC T T 1 1 1 CGAACCTATACTGTCTTCTGTAAATT 
CTTCTTACT ACCCTATOACCCGTIMCCCAACCACTTTCCGATGCCAGGGTTCTGAC^CCT 
CACCTGGCATAAT AT AAAGTGTTTTrrnTTTATACCCTTCCACTT GGAAAGACT ACAG 
A 



AGAAAACAAGAAGGGGTAAATTTTGAACCAAGG<jAAATCATTTAAGAAGTGTCTGGTATT 
TTTCAAATTTCTG TCAGTTGTT ACATTTGTCATAAGTAAA TG TT7 AGGAATAAAGGATG 
G 

AGACATGC TTATTTTAT7 TAACTCCCCC>>>^TTAAAAANNAAAAAA.AKAAAAAAAAAA> 

AGTCCCTGCCCGGGCGGOOGCTCGAGATAAC 
Sequence 280

CCGCGGTGGCGGCCGGAGTWATGCCATCTGCAGGTTT7GTGATCTGCAATGATTCTTCCC 

TTCGAGCTCAGCCCATTATCTTTAATCCTGACTTTTTTGTGGA^ 

GA 

AACCTGAGATTT7C>£TGAGTTGGTGGTCAGCAATATCACAAGGCTCATCGATTTACCTG 
GAACTGAGTTGGCTCAGCTGATGGGGGAAGTGGACCTT AAGTTGCCTGGCGGGGCTGGCC 

TTCGGTOCCCACTGACGGAGGAAGGCA7TGCCCAGATA T ACCAAC TGATTGAGTATC T AC 

ACAAAAACTTGCGAGTAGAGGGTrTGTTTAGAGTACCT 

Sequence 281 

AGGGTGCCCCCGAAGTGAAAGAGATGGCCTGG^ 

ACTAGGGrrGAAATCCATACAATGGGAATGGTAGCAGACAAT A A^AGAAAATGAACTATT 
GATGCCCCCTACTGCACAGCAGAAGCTCTGAATCGTGTTCCTGAATGAAAGAAGTCAGAG 
ATGAAA AGAT GGGCCAGGAGTCCAGTTTCTGGAAGGCCAAGAATCGAAGTAGCAAGCT GC 
AAGCCGTTTTCCAGACAAGCNGNGATGTGGGGATGCCACAAGAATTCAGGACTGGAGG 
Sequence 282 

cgcggtggcggccgaggtacttmnactgccagaggctgtgacgn^ 

cagccttncctgcantgatnccatcccgcaggaatcr^ 

ctctg«:tgtctttccacmtttccctttcncattttgca^ 

GC 

CCTTAAGGCTTGGAAACTCACNCCACCTTCAAGCr 

GTCNGGGNAACXAGGGTGGACAAGGGGGCCAGAATCAAAGNCGTTCT^ 
ATGGGCCAAGGGGAATGGGGGCCCCAGNNNGGGGTTCCCCAAAGGCANCAAGN^ NA 
ACTTGGANACTTGGAAGTGGANGGGCCATTGGNMGGCA^ 
AGGGGAGGGGNCNGNAACCACCMCAAAAAAGGTTTGGANGGC^ 
GCCCCAGGGGAAAAAACCTTTTGGGCCCCA t H I 1 1 1 I L CAATTTTCCAATTGGGGGT 
TG 

GGCGAMTAATTTGAAAGGGGAAGGAATTANCCTTGGGNMAAAGGGGNTNGGGXSGGGG 
Sequence 283

TGGCNGCCGAGGTACAGNATTGGAAATGGATCTGTCTTTGGTAAAGA1CAGCCTATAATT 
CTTGTGCTGTTGGATATCACCCCCATGATGGGTGTCCTGGACGGTGTCUrrAATGGAACTG 
CAAGACTG TGTC CTTCCCCTCCTCAAAAGATGTCATCGCCNATCAGATATAAGAAAGACG 
GTTTGCCCTTTTCAAAAAGACCCTGGGAATGGTGGGCCCATTTCTTTGGTNGGGNCTTCC 
CAATGGCNCAAAGNAAAGGGGAAANGGG C KA TTGTGAAGAAG GAANANAGTATTTTT AC C 
TNGAAAAAGGCCATAAATGGTGNANANAAATTCTTTCCANAAATTCCNCAAGNGMGGTGG 
CANGCC CCTNT AG TAN T AAAANT ANCGN CCC AAAGGAAAGG N TCANG TTT AAAAGGGGTT 
TATTTTGTTTGTTNGGGGTAAAATCNCAAGCCCCAAATACCCCAAACTTGNCCCCTGGAA 



CTTGGCTTrTCMCAAAGGTTCNAGGCTTTC^^ 

AAACCNTTTCC 

Sequence 284 

GTGGCGGCCGCCCC<3GCAGGTACGC«3GGGCTCTAAG£ 

GAGGGGAAGv^GGCCTGTTTCGCTGTCGGGTCTCT AGTTCTTGCACGCTCTTTAAGAGTCT 

GCACTGGAGGAACTCCTGCCATTACCAGCCTNCCCTTTCTT^ 

GG AAAAACAAT WACAATTTTATTTCCATTGGCCCAAGT^ 

TGC 1 1 1 1 1 11 1 GGGCCNTTTKrCTTACCCCTTTGCCAAACCAAGAAAACTNAAATNTTG 
N 

CnuXCKCAAAMCTTCCCTTTAGTTAGn^^ 

TCNCWTGGNWAAGNCCCACCAAGANCC I M M 1 1 1 1 GGCTTTTTTGCCAATTTGGTGA 
AG 

GGAAG 
Sequence 285

TGGCGGCCGAGGTACTAGGTCCCAAATGTTTCAACCGATTTTAOCCTATGT^ 

tattatagaaggggagaggtatcctgtagtgatgtccacgtatcttggagttatgggtcg 

AGTTCT ACT ACAAAACACT AG TTTTTTTCTT CACTTACTT AAT GAGATGGCOCATAAATT 
T AATCAGGAGATGGACCAGCTTTTGGGAAAT ATGATTGAAATG tgggtttgatcgaatgg 
ACAACATTACOGAGCCTGAAAGAAGAAAACTTTCAGCTTTGGCTTTGCT C fCTC T TCTG C 
CATCTGATAATAGTGTTATCCAAGATAAATTCTGTGGGATTATAAACATTTCAGTAGAA 



ITAT AAAGAC TGT ATGTT 
lATGAATGAAACCACCCAC 



GCCTCCATGATGTCATGACGGGAAGATC 
GGATGGTCTCATl 
Sequence 286 

GOGGCCGAGTAGCCGATAGAACATGGCATCATCACCAACTGGGACGACATGGAA 

TGGCACCACTCTTT CT ACAATGAGCTTCGTGTT GCCCCT GAAGAGCATCCC ACCCTGCTC 

AC GGAGO CACCOCTGAACCCCAANGGCCCAACCCGGGAWGAAAAATGAACTTC 

TTGTTTTTGGAGAACTTTTCAAATTGGTCCCCA^ 

CC 

AAGGCCGGGTNGCCTGGTCTTCTCTTATTGCCCTTnCT^ 

caattmgwtggccntgggaacttcttgggaaagaati 

AATGGNT<XXCCCCAATTCTTATTGGAAGGGGGGCCTTAAT 
CAAATGGCCCCANTCAATTGGNCCGTrCNTTGGGGAATNCCTTGGGCCTTG 
AAGNAATTCTrCAACCTTGGAACTTAACCCCT^ 

tggaaggccggtggggcctaatttcccc TTTTCGG NTTTAACNT AACC TTGGCTTGG n aa 

GCCGGTTGGAANGKAAATTTGGTNCCCGGGGGAACCATTTCAAAGGGGGAGGA 
TNGGWGGTTTTAATTGTTAAAG CCC TT C TTGGGGWA C H I 1 1 1 GAAAAAAA 
Sequence 287 

CTCCCCGCGGTGGCGGCCGAAAACTGATCAGACTG TCTCAGATC NAGGAAAAGATGGCCA 
GAGAGAAGC TGGAAGAAAT AGATTGGGTGACATTTG GGGTT ATATTGAAGAAGGTT AC GC 
CACAGAGTGTGAATAGTGGAAAAACCTTCAGCATATGGAAACTGAATGATCTTCGMGACC 
TGACACA/fTGTGTGTCCTTGNTCTTATTTGGAGAAGTTCACANAGC^ 

AGCAGGGGACTGTCG T ATCGGATCC t caatgccaaccccatgaagcccaaggatggttca 

GAGGAGGTGTGThrrATCTATCGATCATCCTCAGAAGGTCTTAATTATGGGTGAAGCTCTT 

GACC TGGGAACCTGTAAAGCCAAGAAGAAGAATGGAGAAGCCGTGCAGGCAGACTGTGAA 

TTTTGCGTGACTGTTGAGTAGCTCCGGCCGCTCTAGAACTANTTGGATCCCCCG 



GCCAAACGC TTCCGCAAAGC TCAGTGTCCCATTGTGGAGCGCCTCACT AACTCCATGATG 

ATGCA 

Sequence 289 



PCT/US00/23312



GGACAGAC TGGCTCATNGAAGACATT NAC TNTGATGGGAC CATTTNAANCNGAT AATTTT 
TCTCATAACCTGAGAGGAGTWATCCCACGAAGTTrNGAATN l I TGI 1 1 I C CTTAATTGA 

CGTGAAAAAGAAAAGGCTGGAGCTCGAAAGAGTTTCCTTTGT AAGTGTTCCTTTATTGAA 
ATCTAT AACGAGCAGAT ATATGA TC T AC TGGAC TC TGCATCGGC TGGA 



C TTTCC TT ACAC TCATTTTCCTGTCTTT ATT ATGATAGTC T AAC TTTTTCTCC TCAAAGG 
TATAGCTGCCTTGCTTTCATGAAAACACA C 1 1 1 C CTATTGTGATTTATCAGAGGCCTTT 
C 

CATATCTCAGCCACTATGCTATGACAGATTTTATAATTAATA 



TGGCX5GCCGCCCGGGCAGGTACGCGGGGCCCGTAG 

ACAAAC^CCCTGAGACTG^GCTGCAGGAACTNAAACCATGAAGAGCCTGAl'OC'n C M GC 
CNTCCTGGCCGCCTT AWCGGAAGTAACTTTG TG TTATGAAATCACATGAAAAGCCATTGG 
GAAA1 CI 1 1 ATGGAAACTTAATTCCNCTTTTNATITAAAMCCAGGGHAAGNftAATATGT 
N 

AAAAATTCCNC! III! f ATTANWTCCCOCTCTNCAATCCAAGMAHGNATGGGGGAAGCNA 
GCKT AAAACCNCT>K^KHATT*AAIAhW>rrf<XSGTTTCTAAATAAGNAANCC rTTCTTTC TA 
AANANGWNCKTfK^TTCCCACCGATATCT^ 

TGWMAGWTTTATWTACTrmACWNAWGCA l 1 1 1 1 I I I fWWGTGNAAAAAACCOCGCWWT 
T 

AACC^4ACCCCAANTMGGGGT7TTTATATt CX3GGGHANTNACCAAAAATGGCCTNGGCCCT 
TnfTATNANAAATCNGGGCTTTNNCMTTTATAACNAG^ 

GGGMHANtKXXZiAMTATWTNJAAHATNNTTGGN NGGGGGAAAAAAAAAAAA 
Sequence 291 

GAGCCCGGGTGGCGGOCGCCGGOCAGGTA C 1 1 rfTTTTTTTTTTnTn I GG GGGAGTTA 
AATAAAATAAGCATGTCTCCATCCTTTATTCCTAAACATTTACT^ 

ACTGACAGAAATTTGAAAAAT ACCAGACAC TTCTTAAATGATTTCCCTTGGGTCAAAAT 

TACCCCTTCTTGTTTTCTCTTGC I T FT CAGGTAATTAACTCTTCTCTTTTTAGTTTGAAC 
TATGCAGTGCAACATTCCTCTGTAGTCTTTCCAAGTGGAAGG^ 

TATATTATGCCAGGTGAGGTGTCAGAACCCTGGCATCGGAAAGTGCTTGGCTCACGGGTC 

ATAGGWGTAGTAAGAAGAATTTACXGAAGACAGTATmGGTTCCGAAAAAG^ 

T 

Sequence 292 

L I I I HI 1 1 1 1 I I H 1 1 1 1 II I MM I N GCTT G I 1 1 1 IAT I I 1 1 1 
^GGTGCTCCTrGGAGTCACTGGAGTTCTACTTTGAATCCCACT 

TTTrCCTCAGGTCC^ANGTTTWTTTAAG^ 

ACCTGCCCG 

Sequence 293 

CTGGCGGCCGCCCGGGCCGGACGCGGGGACATTOGAGTGGGGATTAAGAGAAG^^ 
GCCTTGCTGGAGCTGTGTGGTCTTCTGCAAGTGAGAGTCCCAG 

GGAGGAAAAGGAGGAATTCATTTTNAGCAAGACAjCAAAGAAAAGCAG 1 II I II I I 
CANGTGCTGACGGCCACCCACCATCATCTAAAGAAGATAMCTTGGCAAATGACATGCAW 
GTTCTTCAAGGCAHAATAATTGCAGAAAATCTTCAAAGGaCCCT atctgcagatgttctg 
AATACCTCrGAGAATAGAGATTG^TTATTCMACCA&GA TACC7 AATTCAAGAAC t cg»jGa 
AATCAGG^ACGGAGACATTTTGGTCANGNTTTGCAACATT^ 
Sequence 294

GCGCTGGCGGCCGC CCGG GCAGGTACGCCGGAGGCACATT^^ 

TGT AAACTGAAC 1 1 1 G I I 1 1 LAGNKCCGGCTCCAGCCATGCTCGGGTAGCTTGCCAATAG 

ATGAATCCCACTCGTTTGACCCATGAOGCTCCTTCTTTGCATNNCTCCCTCTTTCCCC 

AC 

AGCAGNGCATGTCCACCATACCACCT GAGAGTCTGTGGAATCTAATTTTCTGTNATACTT 



CNCGCGGTGGCGGCCGGAAGAGCAACCGaGATGAAGGTGAAGATGC^ 

ggacaattatgtccgcgaaaccaagttg^sactt ac^wgagagttccaagaaact atgatcc 
tgctttaca tccttttgxgg tcccacgagaa tat ataaagagc ttt aaa tgc t accaaac 
tggaacgagtatttgcaaaaccattcc itgcti ccctggatggtcaccgtgatggactca 
attgcttggcaaagcatccagagaagctggctactgtcc ittct ggggcgtgtgatggta 
gaggttagaatttggaatctaacrrcagcggaattgtatccgacgct 



TCCCCCGGGGACCTGGAGGATGGACTTTTCCATG^ 

AAAAATCAGAGACTGGTGC TC TTGGAGAAAACTATAGTTGGCAAANTCCCATT AACCACA 

ATGACTTCAAAATTTTAAAAA 

Sequence 297

AAACCGCCGCTGAAGCCAACAAGAATTT GAGAACTG TAAATACCAAGCCTTGAAAGGGAC 
CATGGTGCGGGCTGTCAGACATAAQAAAGOCAGTCAAATTC^ 

ACAGTGATGATGAI H I GH 1 C \ GCAACTTGT ACCTCCGCCGTTGT AGAAACTTANTG 
GA 

CCCCTNAAMGGGGGGGGGGNCNCNNGGCCCNCM 
GGGGCGNGAATTTANCNKCCCCWCNGNCGGGGHAAAMAAAATACGG^ 
TTKTTNMGWGGGGGWAAHAAAAl 11 1 1 1 I N I CI CCCCCCCAAAAATAAAAAACMOGNCCC 
MCTTTCTKTGCCGGNTGGNNGNAAANMAffTATNGNGGTCCCCCCN 
A N! t llll lll I WNKNA AI 1 1 I 1 1 1 1 1 



GTGGCGGCCGAGGT ACTCCOCACCAAATATT C 1 1 IG I I G GCTTGCTTGACTAGATGAGCT 
GCT ATAGTAGTCAAT<^TGTTAGACTTGGACCATTGTTTGTCTGAAGAACTGGAATCT 
GT 

CCtCTCGCCCTGAGCACTGT ATTT ATTTCCCCTT ACTCANTCCCCAGGGGACTT CTTCCAA 
GTAAGCCGACAWACTTCTTG 
AGTAAACTTAGGTTGGGAATX 
TTCAAAAGGCCTTTATTCKGAATA 
CACCCCGNGTTA 

GGGTTTT AAAAATATTGGCCGACCGMCC TT ITGGGTCCGHTTANAAATTCCAATTGGGGG 
GKTCAATT AAGGNCCTT GKTTTATTCCCCTTMGTNGTTG<iAAAAAATTTNGTTNT AAAT 
T 

CJICCGNCKTTTCAACKAAAATTTTTCCCWAMHCAAO 

CCCGNGGGGAAG*«XIAATTAAT AAAAANNTT GGTT AAAAAAW3GCCCCTTGWGGGG 
Sequence 299 

TGGCGGCCCAGGTACTTCTGTCTTCCAGTTTTC^^ 
AT 

TGTTTKATCCT ACCACTCCCAATTAA 1 C I T l CCA 1 1 TT CGTCTGCGTTTAGTAAATGOG 
T 

TAACTAGGCTITAAATGACGCAATTCTCCCTGCGTCATGCGATTrTCAAAGGGTCTT^ 
TT 

AATTCACCCTTCCGGGTTTTAAATCCTCTTTTTTTAAAAAGAATCCG ICC I I I CAAAAAT 



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TATNTTrAAATTCACCCTTACC^CCTTTTTAAAACCTAAAAACCTTT AAAGGCTTGTTT 
TAAAGGTCCACCCTTTCATTTTTTAAATCT AAAAAAGGCCATTTGGCCCCTTTCTAATT 

GGGMT AATTNAAATTCCGGGGGCCTCrTCTTAGGTAC^TNTTCTCTTCAAATTTTTAT 

C 

CI 1 1 1 1 1 lAAAAATTACCAl 1 1 1 II II I lACCTTCCCATTGAAAGGAAAGGCCTTTNCAT 
TCTTTCAAA«XXTTCCCGGTTCAATTGGTTTTTT^ 

TTCi mil i CCCCTTTTCGCCTTCCAATGGCCCTTAANC 1 1 I CI II IC CTTMAAAGGGT 
GCCTTTCCAATTAA 1 1 1 1 1 1 1 ICCTTCTTTTAAAAAAAAATTCCTTTTA 
Sequence 300 

cgcggtggcggccgaggtacttaaggttgactggtaatcagggtaacttctgatacttat 

CACACAAGATGGTGCCTCAGCATrTAAATAAATGGAGGTAGGGGAGGGCGTGGTGGTAAC 
ATACTTTTAAACCAGCGATTGCACAGCAAACCACAATGCAAGGTATTTCTGACTOCCAAG 
ATTGCC CGTT TCCTAAAGAGCAATTCTTCTGCAGGCAACAGCAAACCTACCTTTCCTTGC 
TAACTGCTTTCAGTAAATTCTTGATGGCCTTCGATTCTGGATTCAGACATCTCTTCTCA 

CCTTCTTTTTCATTGTAGCAATGATCTCAACACGTG 
GA 

Sequence 301 

TCCCCGCGG TGGCGGCCGGAGTGATGCC TCTGCAGTTTTGTGATCTGCAATGATTCTTCC 

CTrCGAGGTCACGCCCATTATCTTrAATCCTGACTTTTTTGTGGAGAAACTCCGACAT 
GA 

G AAACXTTGAGATTTTCACTGAG TTGGTGGTCAGCAATATCACAAGGCTTCATC NGATTT A 

CCTGGAACTGAGTTGGCTCAGCTGATGGGGGAAGTGGACCTrAAGTTGCCTGGOGGGGCT 

GGCCCAGCATCAGGATTCTTCCGGTCTCTCATGTCTCTCAAGCGAAAGGAAAAAGGAGTG 

ATATTTGGGTCCCCACTGACGGAGGAAGGCATTGCCCAGATATACCAACTGAT7GAGTAT 

CTACACAAAAACTTGCGAGTAGAGGGTTTGTTTAGAGTACCTCGGCCCGCTCTAGAACTA 

GGTGGATCCC 

Sequence 302 

TTGGAGCAC CCCG CGGNGGCGTTTTGGGACGC^GGAACNGCAATGCTTCAGGACCCAGA 
GGAGCGACTCTTTAAAGGGACCACAAAANCCGCACAGAGCTGCAAACAACTATACATGAT 
ATAATATTAGAATGTGTGNACCTGCCCG 
Sequence 303 

GNGGCG TTTT AGGGCG NAACGGCC CCCCATCATGGCGGACCCT AGAGAAAGGC TCTTAGG 

g^gaccnaaacccgnngcccgaacacaaggag 

Sequence 304 

T CGCCC gagc tttc tc ttgtccatc ttctc cc gc tgctgaaatttcagttgc g ggc gc t g 

TCACCTCAGGACCCCTCCCCCCGCGTACGCTGGATAGCCTC^GGCCAGAAAGAGAGAGT 
AGCGCGAGCACAGCTAAGGCCACGGAGCGAGACATCTCGGCCCGAATGCTGTCAGCTTCA 
GGAATCCC CGCGT ACC TGCCCG 



W^AAGAGCAAAGGCTCATGTTTGCCAAGTCTGTCCTTTTGTAACAAAAAACCCAGCAGC 

TTTATCAAGCAGAATTCCACCTGTATTTCTTAACTTGCCAGAGCTGAGTCTCATGGCC 
AC 

CCTTAGCAGGAGTTGGGGAGGTATTTTTAACAAGGCACATTATCATCTCCCCCACCCAAA 
G TGG AGC T A7TGCT AATG AAAAAGAT ACAAT GAGATGTTTATGAAATT AT CTG T AGC TAT 

TAATGTCAGGTTTTTGAAATTTACTGACCTGGAAGAATACTCATAATGCAATGTGAAGT 
G 

AGAAGCAGGACAAAGAACATTTGCAATACAGTTGTATTTATAAAATTTTGTT 
Sequence 306 
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ttgcaggagggtagaaaagcacctgggtcgggtgcagactgcggagcgggccctaccgtg 

TGCGCAGAAAGAGGAGGCGCTTGCCTTCAGCTTGTGGGAAATCC«3AAGA 

caactcaac tgttcgttgcttccagggcc tgctgatttttggaaatgtgattattggt 

TG 

TTGCGGCATTGCCCTGACTGCGGAGT GCATCTTCTrTGTATCTGACCAACACAGCCTCTA 



CAC«3CGGTGGCGGTTTAGCCCGGCGCNAAATCACCATTATTCCCCTTT^ 

A GGC TTG TTAA TGC TTTCTTTG TAATTAGGC T AT ATC TGGT ATCTGTATAATATCTTCA 

G 

TTCTTCTTTACCAGGGGTCTTACTCTGTTCTGAAAGATGGCACCTCAGGCGGCTCCG 

GCGCTGGACACAGGAAACTCCTGGGTCCCCGACTCCGGCTCTCCTNGACCCCCTCT^ 

TTAACTCCGC1TGTTTCTCTACAAAATGGCGCCGGAGGTGCCCCGCGTAOCT 



TGGGGNAACCCG£GGNGGCGGTCTTGGGGNCAACA 

ACCAGCNGNCI 1 1 1 1 1 NGGGGNGCCAAAAACCCGAGCAGCCGAAANCNGGAACNGCCNCA 
GNNGTGTNCCNGGNGAAGAANGNCNANCCCAGAGAGGGCAAAG^CCC 



TAAAGGGNNCNCCNTCTAAAAATNT 



^ACNGGNAGAANAAAACCCACCAACCGCTTTTTANTATCGAGNGTG 
AAGATGGMAAAAAAAAAAAAAGAAAAAAGAAAAAAAAACAAAACCAAAAACAAAAAAACT 
TTACAACCACAGCTAANGCAANNNNNNCCyWGGNTCC^ 
NGCAAAGCCCANNhWNNNCCAAGCATCCAAANGV^ 
^ TA ™GGG*GCAGCNNANGCAG<XKK*GCCAA* 
NGCCCGGG 
Sequence 310 

CACCGNGGACAAGAGGAGGNGGTNCTTGGGGGGNGNAAAACCCGCNCCGCGANGCAAGAG 
G CTCNG CACAACCACT ACTNTN CAGAAGAGCC GGG NCCNGN CC C CGGGAAAAAGAGNGCG 

Sequence 311 

CCTGAGGAAAAGCTCGCACCAGGNGGACGCGGATNNGGTANGGGGG<3TAAAAWACCCNCC 
CCAACAAG CC GC GGGG CAAAANGNCC NCGT AOfT CGGC C GCTCGAGAACT AG CGNACC C N 

Sequence 312 

CCCG CGGTGGCXjTTTC^NGGCCAGGCACTrGGAGAAAGTATAGCAGCAAACAATGCCTAT 
TTTTNACAGGAAACAGAACjANATACCCAGAAAAATGCCCTGGCAATCATCAAATCACAGT 
TTTC CAACATCAAT AAAG TGTTT AACTC CTCATTTGAAAGA T GG TGTTCC TGGATTGAA 

ATTGAAGAATrAATAGAGAAACTTCAGTCTGGAATGGTGGTAANGGATCAGATTTGNGAT 
GWGAGAATATCTGACATAA TGGA TGTATATGAAATGAAACTATCCACATTAGCTTCCAAA 
GAAAGCAGGCTACAAGATCTTTTGGAAACAAAAACTCTAGCCCTTGCACAGGCTGATAGA 
CTGATTGCTCAG CATC GCTGTCAAAGAACTCAAG 
Sequence 313 

CCGG G CAGGCC C TTAGCATT AGATTGAGTT ATGTTGC TAG GAGATN TTTATTCATCAGC T 
GATCATT AAG CATATGG GGCTT AC TTGGCCCC CCTA TCAATTTG C GTCAAAA T AAA TTAA 
TTGTAGACCTGTCTTGTTTrATGAAAAAGCAATGTGATAGTCTTTAAATTTATCTTTCTA 
AACAAGACACAA G TTTACACATTA CCCAGC AC AG T AA C CCC TCTTGGTATTG TTT AC C T A 
AAAGGAAGAAGTGTAGGAAAAACTGATATAAGTAGAGAGNTTATTTGGG 






G^GGAGCTCCCCGCGGTGGCGGTCGAGGTACGCGGGGGGTCCTG^ 
TGCAGGAGGGTAGNAAAGCACCTGGGTCGGGTGCAGACTGCGGAGCGGGCCCTACCGTGT 
^^^^^^GAGGAGGCGCTCAGGAATGCATGAATTG^TTAATTAAATGTCGAGAGCTGT 
AGATGGCTTTTCTCAAGGTGCTTCAAGTGCAGAAGCCCAAGTGATTGACCCACAO 

^**^^^CCAGAAAATCCTTMGGGAGTGCCTTCAGCTTGT GGGAAATCCCGAA 
^^TGGCCAAAGACAACTCAACTGTTCGTTGCTTCCAGG GCCTGC TGATTTTTGGAAATG T 
GATTATTGGTTGTTGCGGGATTGCCCT 
Sequence 315



C T AAGCATA TGGGGCTTAC TTGGCC CCC CTATCAATTTGCNGTCAAAATAAATTAATT 
GT 

AGACCTGTCTTGTTrTATGAAAAAGCAATGNGATAGTCTTTAAATTTATCTTTCTAAACA 
JCiACACAAGTTTACACATTACCCANTTACAG 

GGAAGAAGTGTAGGAAAAACNGATATAAGTAGAGAGTTTATTTGGGCCAAGCATGA^ 
TACAACCCAACT GTATGGAGACAAGTTGGCCTGAACAATACACATTCTTATTAGCAACAG 
^ATAAGTAGGNTTTCAAAGAAAAAGAAGAGGCAGNTCCTAA 
Sequence 316 

TCGNCCGGGCAGGTACAGAGACCTNCTTACTTACCCGC^rrmT 
GCGAGCGAGAGCGGGCCGCTGGCGTTGGAGAGCGACGGCGG^ 
AACAACNCAGAGTAACGCGGGAATGAAGAATNTTAGGC^ 
GA^TTAAGGGTffTTTACGGAATAAAGGATGATGTC 

ACAGAATGGAATC TCAGACC TT GTGAAGGTGACTC TGACTTC TGAGGAAGAGGCCC GTTT 

GAAGAAGAGTGCAGATNCACTTTGGGGGATDCAAAAGGA 

Sequence 317 

TTTCGCCCGGGCAGGTACTTGGAGAAAGTATAGCAGCAAACAATGGCTATAGACAACAGG 
AAACAGAACA T AT ACCCAGAAAAATGCCC TGGCAATCATCAAATCACAG TTTTC C AA CAT 
C AAT AAAG TGTTTAAC TCC TCATTTGAAAGATGG TGTTCCTGGATTGAATATTGAAGAA 

TAATAGAGAAACTTCAGTCTGGAATGGTGfTrNAAGGATCAl^TTTGTGATGTGAGAATAT 
CTGACATAATGGATGTATATGAAATGAAACTATCCACATTAGCTTCCAAAGAAAGCAGGC 
T ACAAGATC TTTTGGAAACAAAAAC TC TAGC CC TTGCACAGGC TGA T AGACT GATT G C TC 
AGCATCGCTGTCAAAGAACTCAAGCTGAAACAGA 
Sequence 318

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGT ACTTTTATTGATGTTGAAGATGAGAAATCT 
C C TCAGAC TGAAAG TTGCACTGACAGT GGAGCAGAAAATGAAGG T AGTTGTCACAG T GA T 
CAGATGAGCAACGATTTCTCCAATGATGATGGTGTTGATGAAGGAATCTGTCTTGAAACC 
AATAGTGGAACTGAAAAGATCTCAAAATCTGGACTTGAAAAGAATTCCTTGATCTATGAA 
CrrTTCTCTGTTATGGTTCATTCTG<KiAGCGCTGCTG^TGGTCATTATTATGCATGTAT 

AAGTCATTC AGTGA TGAGCAG TGG TACGGGTGGGAATAGCACT ACACTG TTCATC TAGC C 

^GTAGAATAAGTCCCAGTGAACTGATATTCTGCAGAATCTTCACTGTTAT 

Sequence 319 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I t 1 1 1 1 1 1 I 1 1 I I 1 1 I CAAN 

TT CAG TTTCC TTT AA T GAC CCCC A TC TCCCTGAA GGGCAGG TGCAGGCAGC TAGGTGATG 
GCAAGAGATGTTCACTTGAAGATCTTGCCCTGATTGAAGGCTTTGCCC^ 
GCCCCCTCCCAGGAAAAGTACCAGACATGAGCTGCCTCTTCTTCATTTTCAGCCAAAGAA 
AGGGC AC GTTCAAATGAGG TCAGAG TCA T ATCAT AC T GC T GGGCA T AG AAGCAACACAGC 
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CCCAGATTGTTAAAAAGCTGGCCGTTATAAATGCCCATCTGCAGCAGCCGCCTGTAAAAC 

CGGAGAGCTATTTCTGGCTG^TCAGAATAGAAGTGGTTG 

Sequence 320 

ACCCNCAGO^GACGCTCGflAGC«Xa3C«^ 
GCTCACGCCTTCCAGAGCC^CATCATNT>^GGNCGAAANAGAAG^ 
GNGNAGGAGGCCNGCAGGNACC 
Sequence 321 

CAAGCGGAGNNAACCGAAGAGGCGNACTTGGGGGG^ 

CCNGNGNCCA GCGCNGC«3GT^ 

CA ^ G «£ACAAhCC»*GCMCAC*tlGCAGC 

^ G ^ W ^^^ ^C^GGGAAGWKSCKAAGCNACCN GN CAANAKAGCANACAAAN NGAAACN 

GGGGGNGG^GAAGGANCNNAGAAGNNGGANGCCAGGAAANGGGANGAAGACC^ 

CA*GNNNCAGAACAGAGAAGACCCC*GGNAA 



CTCCCGNGACGAAAACACAANN

GGGACAGCCTGGACTAC 1 1 INI 1 1 IUACACAAACAAACCTCCCCGCGRAWNCTCCTGGGC 
CA 

Sequence 323 

GCGATTGGAGCTCCtt^^GGTGGCGGCCGAGGTACCTTCAATACTTAAAAATAGTCTTCC 

ACAAAAATACTTTAi 1 1 C t GATCTATACAAATTTTCAGAAGGTTA I II I C I I I ATCATTG 

CTAAACTGATGACTTACCATGGGATGGGGTCCAGTCCCATGACCTTGGG 

| m 1 1 1 1 1 1 1 1 n 1 1 1 GGAAAGCTCTGCCATAAACTTCT AGCGTGTGCCAATGGTCACC 

GCCACACTCGCACCAGGTTGTCCGTGTAGCCAGCAAAC^GAGTCTGGCCATC^ 
^GGCCAGGGAGGTGCACTGGGGTGGTTCTGCCTTGCTGCTGGTACCTGCCCG 
Sequence 324 

GGTGGC GGCCGCCCGGGCAGGTAC 1 1 H I 1 1 1 1 1 Ml I I I I 1 1 1 M lAAWGGGGACGT 

AATAAAATAAGCATGTCTCCATCCTTTATTCCTAAACATTTACTT ATGACAAATGTAACA 
ACT GACAGAAATTTGAAAAAT ACCAGACAC TTC TTAAATGATTTCCCTTGGTTCAAAAT 

TACCCCTTCTTGT 
Sequence 325 

ATTGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCAAGTTAAAAGCAGAAGATGCTTCTG 
^J^E^** G CATTTAATCACTCTCAAGTTGAAGGCAAAG T ATC CTGCAGAATCACCAGATT 
Al 1 1 TGTGGAl I f rCCTGTTCCAl 1 1 IGTGCCTCCTGGACACCTCAGGTAAATTCTCCT 

AGAGCTCCTTAAT AAGCATTTAT AGTCAGTTTTTGGCAGCAAT AGAAT(^CT AA T 
TCTGGGATGTTATGGATGAAATCGATGNGAAGACCTGG 
Sequence 326 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTTTT 



G T AATT AACTCTTCTC 1 1 1 1 1 



GT 



TTTTT 



I 1 1 1 1 1 IAAGGGGA 



T AAAT AAAAT AACGC ATGTCTCCATCCTTT ATTCCT AAACATTTACTT AT GACAAAT GT A 

ACAACTGACAGAAATTTGAAAAATACCAGACACTTCTTAAATGATTTCCCTTGGTTCAAA 

ATTTACCCCTTCTTGTTTTCTCTTGCTTTrCAGGTAATTAACTCTTCTNTT^ 

AACTATGCAGTGCAAGATTCCTffrGTAGTCTTTCCAAGTGGAAGGGTATAAAAAAAAACA 

CTTTATATTATGCC^GGTGAGGWGTCAGAACCCTGGCATCGGAAA 

Sequence 327 

GCTCACCGCGGTGGCGGCCGAGGTACTTAAAACCAAATAAAAAGTGAC^ 

TTT AAAAGGATTTCCGAGCTCACAGTCAGC TTGCGAG CC ATTCT CCCGCGTACCAGCAC A 



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AACCGGGCCAGCCTCCTAAACTOCTCATTTACTGGGCGTCTAOCCGCGAATCCGGGGTCC 

CTGACCGA 

Sequence 328 

CGCGTCCGCCCATCTGAOTGTCACAGACACTtXTGGGTTTGGAA HI I U I I G TTCTCT 
GT 

C TCTTTGATTTCCTGGAAGACGACACCAT GACAATTTCAAAGAAAAT AGAACAAAATGAA 
GGAAAAAGAGGCTCTG TCTTAGCACATTCCTGTGACCAGCCTGCTGTCTGTGGCGTGCCC 
TCCTGGCCCGGCCTTGGCACATGTTCGT^r^TTGTGGTTGTTGCCTGGACAGGCAACTCTG 
CTGCCTGTGCCAGGGGTTGTCAAGGGC 



GGTGGGGCT 
Sequence 329 

^l^ACTTTACAGGATGGCATTTAATACAGATATTTCGTATTTtXXXXACTGC > 1 1 1 1 ATTT 
GTACAGCATCAT7AAACACTAAGCTCAGTT AAGGAGCCATCAJICAACACTGAAGAGATCA 
GTAGTAAGAATTCCATTTTCCCTCATCAGTGAAGACACCACAAATTGAAACT CATAACT A 
TATT TCTAA GCCTGCATTTTCACTGATGCATAA 1 1 I 1 1 C I I ATTAAATATTTAAAGAGAC 
AGTNTTTTCTATGGGCCATCNTCCAAAAACCT^^ 
T A 

GCATTTTTTGGGGGAATTATACCTTCCCACCAAT L 1 1 I I I UAK TN T A T TT TT CCTTT GG 
A 

CCTTAAATCATGAAl 1111! ICAAAATTAANAAGGTTNNAAAGNTTTAAA 
Sequence 330 

AX3CTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATNGTTCAC 7 CACTTTCAAAGCCAGC7 
GAAGGAAAGAGGAAGTGCTAGAGAGAGCCCCCTTCAGTGTGCTT 

TGGCTTG T T AGAAGG CT GAAAGATCGAGGGGCCGCCCGGGCAGGTA C I II I I I I N 1 1 1 1 
IHIII IGGCI I ICI 1 1 GCTCCTTTCTTATGATCAGCCACATTTCTTCGACCTtXITTCTC 
CTTGATCCTCAGAATCTGAGAATTCTTCATCACAAGCTATCCGCTTGTCTGATGCTCG 
AA 

TAGAAATTCTCTTGTCTGGATCTTCTCCATCTTCATCTCCACTGTCTTCATGAACAGCA 
T 

CTTCTGGAAT AGCCTGCATC TGGACACCCAGG TGCATGAGGTAACATGCGCAAATTTTCA 
AACAAACCGCTGGTTTATCTTTTC 
Sequence 331

CTNCCGCGGTGGCGGCCGAGG TAC TAGCAGTTGCCAATGAAGGAGGCTTTG TTCGATTG T 

ATAACACACGAATCACAAAGTTTCAGAAAGAAGTGCTTCAAAGAATGGATGGCTCACTGG 

AATGCCGTCTTTGACCTGGCCTGGGTTCKTGGTGAACTTAAACTTGTTACAGCAGCAGGT 

GATCAAACAGCCAAATTTTGGGACGTAAAAGCTGGTGAGCTGATTGGAACATGCAAAGGT 

CATCAATGCAGCCTCAAGTCAGTTGCCTTTTCTAAGTTTGAGAAAGCTGTATTCTGTA 

cc 

TGGCCG 
Sequence 332

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTGAC TTGGCAATGTAATGACACACAC GT 
TAGTGTGGGGCACAAACGTGGAATATTAGGAGAGAGCTGGTTCCAGCACCAAATCCAGAG 
TCACTCGGGGAAGGAGGTATGGTGGCAACACTTTATGCTTAATATTCAATTCTGCTCCAG 
T AGAACA TGGTACCT 
Sequence 333 

CGCGGTGGCGGCCGNTCGGGCAGGTACGCGGGGACTCTGAACGTGCTAAAATGGGAAGGG 

AGGCGGTGTTTTGCTGATCTGTTAAATTCTTAGTGAAGTTTCCTTGATTTCCAGTGGCT 

G 

C TGTTGTTTGAGTTTGGTTTGGAGCAAAAGTGAGGTAGTCCTAACATTTCTGG GACTGAA 



TCCAGGCANGAAAAAAAAAAAAAAAAAAAAAAAAGGTACCT 
Sequence 334 

CCCCGCGGTGGCGGCCGAGTTTG A 1 1 I CI I GCAGTCCTGAGCGATGGAGCCCGGGGGTGC 

CTGGTTATTGTCCGCTTTCTCTCTCAGATGCTTGGCTT GI I T T T CAAGAGAAC CTm I 

C 

GATATTCATTGCTCCATCGATTGGATCCAGTCCTTGTTCAG^ 

TAAGGCTGCCTGAAAGCCTTGAATCCTTGCTAAATATTCCAG T1 GI I F1 GAAGGTTGT 
AC 

CTCGGCCGCTCTAGAACTAG 
Sequence 335 

GCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGACTGCTAACAACTTTCAAATTCTT 
CTACTTACTCCCTCTTCTTCAGCTTGACATCTGGGAAA 

GCCTACCTTTGG TGCCAGAGGGAAGCTCAATCCATGCAAGCCCCAGATAATAT ATGAGAA 
CCTCCCCAACCTTACCCTACACXXXTCACCTCCCAATCCAA^ 1 C IC C I 1 1 C CCTGC 
TTTCTCAAACCATGTTTGGACCTGCTTGGAAGCTCCCTCTGCTCTCC^ 
CA 

TTATGTGAGTGATACATCTTTTCATATi 
T 

CAACATCTGAAGCAAATGTTGGGTGGGGGGGTAI 
C 

Sequence 336

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCATGAAGGAGATGGCCCCTTTOGGAGC 

^gcagagaatcactgagatcccaatggaaacaggaggttcagccagaggaaccgacttt 

TAAGGGATCACAGAGCTCACACCAAAGACCAGGGGAACAGTCAGAAGCCTGG^ 
TCAGGCTCCCAGGAACCTGCCTCAAAACACAGOT^^ 

GGTCTGGACAGCTGGGCCCCAGGGACGAGCCATGCGTGACAACAGAGCTGTATCCCTCTG 

TC^GCAAGAATGGGATGTGCCCAGGCCCTGCACAAAGGGCCCTCTAC^ 

CAGAGGAAGGGACAGTCACGTCTCGCTGGCAACAGGGTGITGCCCt^ 

GACCAAGACGCTCCTGGCTATTTTTTAAGTAGTTCTCAATTTTTATGGGNAAAACTNCA 

A 

GACCTTimiCAGCCAGNAACAGCCCCAGATTCTTACAGG 
CTTGGGAGCCAANGGGT7TTTTT 
Sequence 337



ITATCATCAGCC 



CCCTCTrGGGCTTGGCAGATGCCAGTGGATCCATACAACTACTCCGCCTGGTGGAATCTG 
AGAAGAG CCAC GTGCTGGAGCCATTG TCCAGCC TTGCCCTGGAGGAGCAGT GTCTGGCTT 
TGTCCCTAGATTGGTCCACTGGGAAAACTGGAAGGGCCGGGGACCAGCCCTTGAAGATCA 
TTAGCAGTGACTCCACAGGGCAGCTCCACCTCCTGATGGTGAATGAGACGAGGCCCAGGC 
TGCAGAAAGTGGCCTCATGGCAGGCACATCAATTCGAGGCCTGGATTGCCGC TTTCAATT 
ACTGGCA^CCAGAAATTGTOT 

Sequence 338

NAAAACN CCCCCCGGGATAGAAGNfMTTTTTTKrcAGGGCACANANTTAGAANGCAGNNG 

CGGGGAGCGGGC NCACAAG N GGCAGGGAAGG GAGAC N CCACCN G N GGGGGGNCC TGGGGG 
CCCNGAAACCGNACAAAGGGGNGGNACACTGGCCGCCGGGNGGCGGGACGGAANNGAAGN 
AANNTAAGAAGGGGGAI^NCCCCCGGGGGGTGNAAGGGAAAANGGCGAANAANNCAANGC 
NCAAAAWCNGAAANNCCCGGGWNAACGCCWGAAGGGGGNGGGGGGNCCCGGGGGAACC 
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Teble I 



AAGGGGGNGGNAAAACNAANGGGGGCAAAAAAGGGCCNGGGNNNW^ 
AAAAAGGGGGGNAAAANCCCGGCXX>GGAACJ\AAAAAAAGGCAAA 

ACNGGGAN NCC N NGGGGAGGCCAAAAAAAAAGGGGGGGGAAAAAAGCCCCGGGGGGGGGG 

PGGGCNCHNAAAAAAGGAAGGGGGGGGGGCCGAAAAACNGCCAAAAA 

«X*>»rr>*GCGMNGGCTAMCHAAAAMG^ 



GC TC TAAGT AGGTCATCAAC TCTCCTCCAT< 



rTAATTCTGCGACAGACTCA 



CGCGGTNGCGGCCNTCNI 1 1 1 IUI 1 1 1 1 I 1 1 III 1 1 lAATAGCTGAAGATTTAGATTTAT 

TTGAAAACACTTAGTCTAATTTATATTAGGTGCAGAAAAATCACATTCA^ 

A 

TTGTAGAAGACACAGATAAG FGTGTTI G TCACATTTTCACACAAATATAATTTGATNTT 
AATTAAGGGATGATGAATCN CAACCCC TTGTTAAT AAATGATTTNTTCTC TCAG T AANT 
GCAAGAATCTMTTTTGNGGTTNCO 

TTTTaMAAATTTTAAGGGAA 1 1 1 1 1 1 1 N Ml 1 1 TAAAGffCCCNfrrNCCCTTCCCCTTTTT 
TGGGCfaTTTCXXXCNGNAANAAAAAAAATTTTTNCCCCG 
GGGGTAAAAAAACCCCCCNTCTKNGACNHAAAT 
NAANAAN 1 1 1 1 1 1 1 1 NCNNNGMHAAAACGCCNCCTTNTAI 
TT 

Sequence 340

CAGCGCGGTGGCGGCCCXXXXGGGCAGGTACGCGGGGGAGCGGGCCCT 
GAAAGAg GAGGC GCTTGOCTTCAGCTTGTGGGAAATCCCGAAGATGGCG^ 
AACTGTTCG X I GC T T CCAGGGCCTGCTGATTTTT GGAAATG TGATT ATT 
Sequence 341 

CAAGAAATTTTGTAAAGTTGAACATOGCCATCT^ 

CA CGCAGC TTCCCAGATGCAT/GATATCCAAGGTCGACAGACCTGGATT 
CAACCTTTA TGCA CTGCGTATTGTATTTTCTGCATAAGA AGAACACA 
GTAAGTGT ATTTGCCCAGTAGTGACATTGCCTACATATAGGCAAGTGTTATAGTATACCA 
ACTTAGTATAl 1 1 T T CAAGGAGAGCTAAACCACC7TTTG TAATG TTTGGTTTCTCACTG 

TATCTTCCTTTCCTATAATTAArTTATTTTAATCTACAAATTG^ 

TCAATATTTT ACAAAATATTAATTAATGTAATTG TTCCCAATTATTAGAAACTTTTTT CC 
A 1 1 1 1 1 UAAAATGTTTGCCAACTTCACACAAGTGTGTAAAAATAGGGCTCT 
Sequence 342 

CCGCGGTGGCCGGCGAGGT ACAGGTTTAGTC TGAATGCACTG TCATGAAA TTT AAACTTT 

CATTATAATACTGTTTTAAGAACTTACAGCATCTGCTTTACAAATGCT 
G 

TCGACACAGCATCTTTAGCCAG 1 1 1 ICI 1 1 1 GGAAGTTCATC TGATGTCATCTGGAAAC 

GAGTAGCACATTTGCCTGCTCT GTTGGTGGCCTCACAAGCAAGGCAAAAGCATTATGGCA 
ATCTAG GGTT CCAGAATAACCATAAACATTAAGTGTCACTCCTTGGAAA^ 

^lEE^^ZZT*^^?^?^^^*^ 4 ^^ T GAAATTGCAATGAAA tgaact a tcacttctcca 

L 1 1 IU. I IG I LX I ATrrTTAATAAGACAAAGAACATCACCATATTAAGTTGAAGT ACCT 
G 

CCCGGGCGGCCGCTCTAGAACTAGGTGGATCCCCCGGG 
Sequence 343 

^CCCGCGG T GGCGGCCGCCCGGGCAGGTACATCA 

GTTT AAAAACTCATTTrAACCACTTTTT ATTCTTTGTATTC*AACCAATCACTGGCAATA 



TTCTTCGCGTAAA1TAATGGAAGGAAATGAGTGTCTGAGTTCTC 

tgaggataaagc tttcc tggagataat ataagtggtggcaggaagattt gggagccagat 

GATACTCTTTTCCTCTTAGAGAAACTCTGTGGAAGCTCT^ 

TTCTAGAC GCTGGC ITCTTTCTC1 AGTAAACATGTGGGCOCTTT AAAATGTTGAACCA 
AA 

ATGTGCTTCAAATATAGTTTAAGTT ATAAAACATTTATGGGGGAGTATGTATGTGCCAA 
C 

tagagaggcttcagagatgaagaaacagttcttaccct agtgttgcttagaatctagtag 

TAGTAAGTAATAATTACTAACATATGCATTTACT atat aggcaatactagggt aaatatt 

ttacatagattacctt atttagtagctctt agctgct aaaaaaaaaaa 

Sequence 344 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTALI 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 

GG 

GGOAGTTAAATAAAATAACCATGTCTCCATTCT^ 
A 

atgtaacaactgacagaaatttgaaaaataccagacactt^ 
t 

TCAAAATTT ACCCC I ICI IGI 1 1 I C 1 C 1 IGCTTTTCAGGTAATTAAC I I I 1CILII 1 1 IA 
GTTTGAACTATGCAGTGCAAGATTCCTCTGTAGTCTTTCCAAGT 

AAAACACTTTATATTATGCCAGGTGAGGTGTCAGAACCCTGGCATCGGAAAGTGGTTGGC 
TCACGGGTCAT AGGGTAGT AAGAAGAATTTACAGAAGACAGT AT AGGTTCGAAAA 
Sequence 345 

AGGTACACTGCGGCGGGGGCAGAAAAGCTGCAAGGAACAGAACCAGCAATGCAGA 
CTCGAAGGGCCACCATCATCCTGCAAAACACCAAGCAQGGCAG ICIC1 1 A TGCTGTGGCT 
CTTCTCAAG<^TGTCTCA>AGGGCTCCGGTGGTGCTCTCCTGCTCTATTXXXrrGCTGTG<^ 
AAATCCTCTAAAAACAGCGTTTTGCACAGCAGAGAGCAAAGTOC^ 

GATACGT GAGCTCAGTTTGOCAGCTAGTGATGAAGTCCAGCTGTTGGCAAGTTGGTCCCT 
GAGGCCTTGTaGACTGACCTG TGGCAGAGAGCT CCCTGGGTCCAGCATCTGTTGCCCTCA 
CCCTTGACACATGCGGACCCTCCCCAGGC 

GCGATTGGAGCTCCCCGCGGTGGGGGCCGGGGTACAAGAGAAGAAAGACCAGTCC 1 1 GCT 
CAAACACAAGTCTGAATGCTCCACTTTTTCA A I fC TC TC TI CCATTCTTCAGTAAGTCAA 

c 

TTCAATGTCGGATGGATGAAACCCAGACACATAGCAA 

CTCTGCTGGATGACGTGAGTAAACCTGAATCTTTGGAGTACCCATTCCCTTGATGTCTAjC 
AATATCACCTTTCTTATAGATTCCCATATATGTGGCCAAAGCAA^ 



AAAAGGCCTAGAGAACATATATCGGGT( 



TGTTCGTCATT 



GGCGAATTACTGGAAGATG 
Sequence 347 

AGCTCMCGGCGGTGGCGGCCGCCCGGGCNGGT ACCACNGCCCAGCT AATTTTTTT ATGTT 
TGT AGT AGAGACGAGTTTCACCATGTTGGTCAGGATGGTCTCAAACTCCTGACCTCAGGT 
GATCTGCCTGCTTCGGCCTCCCAAAGTGCTGAGATTAGAG 

CTu I l II GCTTCATCX1ATCCCTT AATTTCTTTGCT GGAGCA TTTT AAAGCAAATA TCAG 
A 

CATACCCTTTCACCCCTCACACTTtAACA^ 

G<iAACTGCTTTCAATCGG>GTTCCATCCTCCGCCGCAGTATGCCCT AACGCAAGCGTTAT 
CTTCAGAGCTACCACCAGGWTTCCGAAAC TTTTTC^GGNGGGAGGCGCTTTMGCCACCACC 
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TNGCCGGGNNAACGGNT>IGCGTfMAACCAAACCTTTGAACGGCCAGNCCCCCCGNGGTAC 
CTTNGOGCCGGTTTAAAAACTAAGN NGGGGA TNCCCCCCGGGCTGGCAGGGAATTTCGAT 
ATTCAAGCTTAATCGATACCCGGCGACCTTCOAGGGG 
Sequence 348 

ACTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTT GACTGCT AACAAC TTTCAAATTCTT 
CTACTT ACTCCCTCTTCTTCAGCTTCACATCTGGGAAAACTGATAGGGAAGCCTAGGTAG 
GCCTACCTTTGGTGCCAGAGGGAAGCTCAATCCATGCAAGCCCCAGATAATATATGAGAA 
CCTCCCCAACC^^ 

T^ATGTGAGTGATACATCTTTTCATATCTTCTTGGTGTGTGTGTGTGGTATCATCAGCC 

CAACATCTGAAGCAAATGTTGGGTGGGGGGTACCTCGGCCGCTCTAGAACTAG 
Sequence 349 

CCCGCGGTGGCGGCCX3GAAGGAGGACGACGGTGCTGTGCT GTGTATGAAGAGGCAGTGAA 
^^^CTGGCAACAGAGGCCATX^GGAAGTGTTACATCACCTTTTGCTT GGAAAGATTTAC 
TAAGAAGTGAAATAGTGGGTTCCTTAGAGGGAAGAGGTTGGAAAAAACCATGAC TGTATT 
GAGGAA GGCAC ATGAACTGAAGCTTCTGTCAGAATGCCAATACAAGCAGTTGAG TGTTTC 
GTTGCTGTGTTATAACTTCCTGAGGGAAGCTCTGGAAGTGGCAGTAGCTGGAACTGAATT 
GTTTAGAGACTCTGGGA GAATG TCGCAGCTGAAGCTGCAGGTGCTGATCGAGTCAAAGAG 
CCCTGACATAGCCATGC F J T T T GAAGAAGCCTTTGTGCACCTGAAACCC 
Sequence 350 
CTCCCGCGGTI 



CCCAGTGTT 
GG 



iTACCCGTGCTAAAAGACTTTTAGTTCGGCTCT 
1 1 1 f 1 1 CGTCGATTTGGGCACAGAGTTTCCTGG TTCAC G TGGATGTGA 

ATCCTTTACTCCAGATCGCCAGGCA G TT M I G 1 1 1 1 1 I T 1 CCTGCGTTGCTGAGAGTCT 
G 

^^TTTATTCATCACAGCAGGTGGATCTTAATTCCATATCCCTGAGGCCACTGCAATGAGG 
CAGAGG^aTGTGCTCCCTCATGAGAAAGGACTGGAGACCGCCCGCAGAAGAGAACGTATC 
CATGTACCT 
Sequence 351 

CCCCGC^GTGGCGGCCGCC^GNNCrrGGTACTTATAATGGCNNNN^NCNGGNTGTGAAT 

GGATTACANTGTATCTTTTCAGGGAAACCTATTATTATCAATGTGACTCCACNGGGGGAG 

TCCATGGTGATGATGATGAGGAGGAGGATGATGATGATGAGACACCTCTAAACTTGGAAC 

AAGTTTAAGACTTTATGAGAGAAGAAAAAAAATCACCAACAAGAATTGTT^ 

TCATAACTATCCT G TGTTCA I 1 1 1 ft 11 T I TATAAACAATAAGAAAAAGTTGTTGGATTT 

1 1 I 1 1 I AATGATTTC 1 1 1 1 1 1 GGGGGAGGGAATTTTTGTTGCAGTTTTATGGTGGAAAA 

GCAAAAACCAGAGCCAGGTGCATAATCTTGTAATCTGTGGATATCCCTGGAGCAGGACTG 

ANCCT 

Sequence 352 

NCCGCGGTGGCGGCCGCCCGGGCAGGTTGGTAACAACGCAGAGTCCCGGGAAGCAGTGGT 
AACAAC GCAGAG TCC C GGGAAGCAG TGG TAACAAC G CAGAG TCCCGGG AAGC AGTGGTAA 
CAACGCAGAGTCCAGG GAAGCAGTGGT AACAAC GCAGAGT ACCCGGGGAAAAAAG G CAAA 
TAGAATGAGAACCATATTATGTACCT 
Sequence 353 

CTCC C GCGG TGGCGGCCGAG GT ACAC CGAGC TTTG TCTCC TGG C CCCAAATCTCC TTTTC 
C TT AC TTTG GGCATTAAC TGC T GTTGAGG TC TCACA G C C TGATGGTCATTATCC C TG A 

GGCATAAATCAACAGGCTGTATGAGCATTGTGTGAGATTCTACATGAGGGAGAGCATTTC 



AAACCCATGACAGATGAGAGAAGTT AG TACAC TC TCAC TGAAC TGGGGATGTTTGAC TTA 
AAATGATGGACAATAAGATAGTGAGCAGTAAGTGTGCTCTAGGCTAGGCT ACGAGAGGCC 
ATGAGCTCCTCATCTCTTCTCTGTTCTGAGCTCTCTGATCCACCGCAC^ 
GTGCATT CTCTGTGCCTCTCCTGAG TC TACTTT CT GCATCATTGGG TTCTCCCAGCTC 
AC 

TTC^TAATGTCCTCCTAGGCTGCATTTSGAATTTGTGTGTTGTCTAGAO 
ACTGTCATTGCCTGTGAGGGAGACCAAGCTTACCCACCCAAGGGCTTTTG 

c 

Sequence 354 

TGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 II 1 1 1 I I 1 1 1 1 I I H I It 1 1 

GC 

CTTT AGAAGG TTAAAA TGCCAAT AT AAAGCTAAAACAG TAATCATCAGAGACAGCTCTAA 
T AAGGCTTTGC TACTGTTTTTACT ATATAAATC TTTACG TG TTAATGGAAAGAAAATTAA 
TTCATTXrrGTTACTCCATTTTmCTCTCCATATTGTATGCCrr 
G 

GGCAGAAAGATCATACAGTTAGGAATGAAGACATCAGAATGTT^^ 
AACTAGATACTATTATACTACTAAGAATAGCAAGAATC^ 

CCT AGCTCACACAAATGAAACGCACATC TC CAT GAATGCTTTCT AATAAATGCTTCCAGG 
ATAGT ATCATAAACAAAGTCAAAATTAAGAAAAATCAC 
Sequence 355 

GCTCCCGCGGTGGCGGCCGGAACCGCCATCTTCNAGTAATTCGCCAAAAT^^ 
AAGG GAAG GAGGAGAGGCACCCGATATATGTTCTCT AGGCCTTTT AGAAAACATGGAGTT 
GGTCCTTTGGCCACATATATGCGAATCTATAAGAAAGGTGATATTGTAGACATCAAGGGA 
ATGGGTACTCCAAAGATTCAGGTTTACTCACGCCATCCAGCAGAGAATGGAA 
T^CXTTGAATTGCT ATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTT ACTGAA 

AATGGAGAGAGAATTGAAAAAGTGGAG CATTCAGACTTG TCTTTCAG<^AGGACTGGTCT 
TTCTATCTCTT GTACCT 



GTTGAGC T CCC GCGGTGG C GGCCGAGGT ACCTGACTGTGGCTCAGATCTGCG TCGCAGCA 
GCX^GAGAAGAAATCACTCCAT ATCCGAT GAGAGGAAGGGTGG CACAGAGATGGTGTCT A 
CAATTAGAGACATTTCTGACTCCACCTTAGCCT AAGCAAACTTTATGTACTGAGTAACA 

TTGAAGGTTGTC TTTTAATGGTGGGGGG TGTTTTTTCC TTTTT AAAC T ACAGTGCTTGC 
A 

CAAGAGAGGGAG GGAC TCAGAAAAGGTTAG GGCAGGTGAGGGAGACAGT AGAT GGCC TGG 

GATGACTTGAGTCCATCATACTATn3CTTGGCAGGTGT(^TCCCCCATGTTTGATTCA 

AA 

TTCCATGAGTGACCTACCTTTCCCCAGGAATGGGACTGAGAGGGTAGTCTCCAGCAACTC 

AGTCTGCACAGGGCTCCCCGTTCAGGCTGCCTTT 

Sequence 357 

TCC CCGC GGTG GCG GCCGCCC GGGCAGG T ACCATC T GAC TTGGCAATG T AACGACACACA 
CGTTACGTGTGG^GCACAAACGTGGAATATTAGGAGAGAGCTGGTTCCAGCACCAAATCC 
AGAGTCACTCGGGGAAGGAGGTATGGTGGCAACACTTTATGCTTAATATrCAATTCTGCT 
C CA GT AG AACATGGTAC CACCATTCTTCCAAGTTCAAAAATT AT C TTTGATTCATTTTG 

TCCCCATTCCTCTAATATGTCACCAATTCTGCTGATACATTCTTTGTAATCTCTCCATC 

ATTTT AATCTGTTATTCACC TGA GC TAC ACAAACATT CAT C TG C ACAAG GAG TATTCCA 
C 

GTGCTGAAAAGACAGAGGATTAAGCCC TCCTTGTGGAGGC ATTCACAG TCTGGTTTTAAT 
^^GAAACCAACAATTATAATACACAGGGATAAAAAAAGTAGAGGCACTTATTGCATACC
TGTACCT 



TTGACTCCCCGOSGTGGCGGCCGAGCTACTTTTCTAGCAGTCTGT 

AGCTGAAAACACTGTTTCAGCCC(XTCTCTGGTGACCTCAGCCTTCTCCAGGTGTATCTC 
TTOTGATCTTGGAGACCAGCAGCCACAGCTGCTGCTO^ 

TGTGGTGGGGGGCAGGGGTGTTGGAGGAGAAGTTGAAAATCCGTGTGTTCTCTGTCCCTC 

TGCTCCTCCATC7TAGCTTCTGGAGGAGTTAAGGCACCAAG<JGCA 

Sequence 359 

< ^'"3GCGGCCGCCC<MKK^GGTACTGGTGTTGTGATCGGAACG 
TCATCACTGCTGCTCCAACTGGATTTATTACTOCGGGAATG 

AGGAAAGGTGCTGAACTGCCAAGGAAGGCATTTCTTGTGCCGTOTCTGGAACCGTGTATC 
CTTACT *CATCACTGAACGACACCAAGCAC^CCATGCACTTCTGGGTCCAACCTTGGCCC 
CTGGAGAAAGACACTGAAATrTGG<XATGCAGGTCTACrTCCCGTA^ 
TTANNAANTGTTTN NGCCC CNTTT GAAAAAAGGG NTTTTAAAANC NAAAANAAAANTTT 

NTTCCCCCCGGGGGGGNNGGNMTTT f J I TAGGGGGGAAAANGGNGGTTTTANTCCCCCCN 
NWGGNAAANCCCCCCCW NI I I I I N I I I N I G GGGWNGGGAAANATTTTTTWGGGGGTGCN 
C»XX»NGNNTTT><MWiA>iAAJ^NMAAAACCCCCCNTTTTN N I T 1 7 T TT AANANACGCNCNNN 
AANMGGGGG G1 rTTTTTTTTTTTTAA 
Sequence 360 

TGGCGGCCGAGGTACCTACTGAAAACTAAACACGCCAGAGGAAAT^^ 
ATTGATGAACT ANTAGCATAGAGCCAAACAATCTTTrCAAGAGGGTGTTTGTGAGATATG 
GTTGACCAGTGAAGACACGGGGGCTTATGGCAGAGATATTGGCACCAATCTNCCCACACT 
CCTGTGGAAA CTGGT TGAAGTGATTCCTGAGGGAGCAATGCTGAGGCTTGGCATGA 
TCCGC CCT ATAT TTT AGAGCATCTGGAGGAAATGGCANAAATCCTT AATCACCCCAGAGT 
CTACGC 1 1 1 1 U I GCACAT ACCAG TCCAGTCTGCCTCCGACAGCG TACC TGCCC 
Sequence 361 

GATTGAGCTCCCCGCGGTGGCGGCCGAGGTACTT'AAAACCAAATAAAAAGTGACATTTGA 
ATTTCTTTTAAAAGGATTTCCG^GCTCACAGTCAG^ 

AGCACAAAC CGG GCCAGC C TC CT AAACTGC T CATTT AC TGGGCCG T C T AC C C GGGAA TCC 
GGGGTCCCTGACCGA 



GAGC TCCCCGCGGTGGCGGC CGAGGT AC GTATGCACAGCCT CACAC TC T AT AAAT G T ATG 

TGTCCTGAATTTCAGAGCTTAATAATGAATTATGGAACTTGATAATGATTGGATCAGGCA 

GACAACACCTGATCAGTCCTAATATCAGAAAAGAGACAAGTAGACATTATGTGCTTCCTG 

AGGTGAGGCAGTAGTAAGGAAACAACATCACACATGTAGCAGTCTTGGGAAAAAAAATGT 

AACCrTGTATCTCGTAATGAGGAAACAATCAGTAAAAAAGTCTAGATTGTGGGACArrCCA 

CAAACTTGCCTGAACTCTTTAATAATGTCAGTGTCATGAAAGACACACCACACACACACA 



ACTTCTAATCAATTAAAGGAAACAAAAGAGAATGACAACTACATATAACGTATAATTCTTG 
ATTGGATCCTGGATTTAAAAATAAACAGCTATAAAGGATATTTT 

rrAAAACCAAATAAAAAGTGACATTTGAATTTCT 
TTT AAAAGGATTTC CGA GC TCACAG TCAG C TTGCGAGCCATTC T CCCGCGT AC CAGCACA 
AACCGGGCCAGCC TCC TAAACTGCTCATrrACTGGGCGTCTACCCGGGAATCCGGGGTCC 
CTGACCGA 
Sequence 364 

TftCCGCGGTGGCGGCCGAGGTi 



AGAGTCCCGGGAAGCAGTGGTAACAAGGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAG 
AGTCCCGGGAAGCAGTGGTAACAACGCAGAGGCTTrcAGCACAGCC^ 
CTGAAAACTCCTTCACCAGCCCCCTCCACAGGATATAGAAGACTT AGATCACT ACGAGAT 
GAAAGCAGAGCCCATTAGTGGGAAAAAGTTGGAGGATGAAGGAATTGAAAAAAAAAAA^ 
AAAAAAANGTN C CTGCCCG 
Sequence 365 

TGACTCCCCGCGGTGGCGGCCGAGGTACCAAGCACTGGGTAAGGCAC^ 
TAGACAGTAACCCTCAAGGAGCTAGAGAACCGGATGGGAGACATGAGCGGTAATTAACTC 
ACTTGTTCCCCAGAGTTTCTATTTG nTTNTTTI CTTTTTCTGTGACTT ATTTTCCTATT 
j^CTTTCCTCCATGTAATrrTCACTATGGCCCAACTAATATAAACACCT 

GGAAAAAAAATTCTTCCTCTAATAACTTTCCAAATTTGTGGAATATTTATTTGTAATA^ 
AGTTATCAAGTTATGCTTATAT AAGCATTAAAAATTCTC C TCC TTTGAC T ACACACACA 
A 

C CACAGTG TGGTTCTAATCNATG<3GAGATATCAAGTAATTTTTTAGTAACCT GAATTTT 
G 

AGGGACATTTCTCTGTTTAAGCATGTATGCAAACTGATATGTAATCCTG^ 

tcaai ii 1 1 1 rcn 



CTCCCCGCGGTGGCGGCCGAGGTACTTTGCATCCTTCAACCCAATC^ 
ATAGTCCCACACCAAAGTCAGGATCAAATGATTCCTGGACA^ 

GTGAGAGAAAGAAGCCTTTGCACCAGTTGGTGCT GGAAGTTCTGGATATGCACCTGGAT A 
AGTGAAXXCCCCTCCGTCACCACACACAAAC^TTAATTTGA 

AGCTAAAACCATTAACACTTCTTGAAGGTAACATAGAATATTTTGTAATGTTATGATAG 
G 

CAAAAGTCTCTTAGGACACACAAAAAAATTAACCATAAAAGAAGAAAATGGCTGGGTGCA 

GTGGCTCACACCTTTAACACCAGCATGTTGGGAG 

Sequence 367 

CTC CCCGC GGTGGCGGCCGAGGT ACATTG TGATTCAAGAGAAAAGTCACATGCAGGTCTG 
AGCTCCTCCAGCAGGCCTTATGTAATGCTAAGATTTrrGGGGAAGATGAAGTTGAACTGA 
TGAACTGGCTGAATGAAGTGC^TGACAACTGAGCAAGCTCTCAGTCCAGGATTACAGCAC 
TGAGGGGC TATGGAAGCAGCAG TCTGAAC TTCGGGTT C TGCAAGAGGACATCTT ACTCAG 
GAAACAAAATGTAGATCAGGCTTTACTAAATGGTTTAGAACTACTTAAACAAACCACAGG 
TGATGAAGTTTTAATAATTCAAGATAAATTGGAAGCCATTAAAGCAAGGTACTGCCAGAT 
ACCGAATTGAG CAT ACCACAAAAAAGTTCT CATTTTG TGTCC TCCCATN CCATTCT CCT 
C 

ACTAACCAAAG 



C TCCCGC GG TGGCGGC CG CCG GGC TGGT ACAAT G TGCC TGGCACC TT ACAAGACACAAAT 
ATGCTCTTATAGGCT GGGGAAAT AAGAAAATAT GAATGAAGCAACCCAGGTCTTGAGC CA 
AAGAATTACCTGGGGTCCGTTGAGTTCAAATCTGAAAATTTCTGTCT7TCAAGGTCA 
TCGCCCACAAAC 



CTCXCCGCGGTGGCGGCCGCCCX3GGCTGGTACGCGGGGGTTTCCGG 
CATGGCGTGCTGGGGTGCAGGTGGCCGAAGGGGGCGTTACTGTTGCGACTGGCATCCGCA 
TCCGGCAGATGTAGaTGGAACCAAAGCCAGAAGTTACGCGTCACCCTTGC TCTACAGCC A 
AACATGCAGGA CTC TAG T AACCCGCGAAATGATGGGAT AGCGTTGCAAA TCC TT AAAAGA 
GTCTTAACGGAGAAGGAAAAATGTTACATTGTCAAAGTCCCAAAGCCTTTC^ 
GCAGGAACAATTGTTCAAAGTTTCTTTGGAACATCAAGGAAGGAAATCCAG 
taagtgcaatgggggagtcattaaggattttgtgtagatacagcaaaaagacaacaatct

TCAAGCCACAATGGCCCTCACCAGAACCCAGC 
Sequence 370 

CCCGCGGTGGCGGCCGAGG TACTTAAAACCAATAAAAAGTGACATTTGAA 1 I I C I I II A A 
AAGGATTTCCGAGCTCACAGTCAGCTTGCGAGCCATl 
GGACAGCCTCCTAAGCTGC TCATTTACTGGGCATCT ACCCGGGAAT< 
CGATTCAGTGGCAGCGGGTCTGG 
Sequence 371 

CCCCGCGGTGGCGGCCGCCCOGGCAGGTACGATTATTTTCAAACAAGCCTACGTCCCTGA 
CTAACCGACTGGAAGGTGTXiAGTGGCACTACAAATTCACAAAAGAACrGTAGCCTCAGAT 
AAJCAAAGGAGAGAAGGTC^ATGCAATCACTCATGCATXSCTAGT aattctcaaaccttc 
(j r J I i LAGAAACGA7TGGATTTTCAGATAGATTTGCAGTAAGAGAATAACAAGTCTTTA 
T 

TTTTTTCATCCCAACI I C I 1 I C HG CACAI T f I I CI I C I AGCTATATTTAATATCTGTTC 

tccccacacacttgctaatctacatttcacaatcttcttcactttcac ittgtci G CAA 

A 

GGAAATCTACCCTGGGACAGAAHAAGCAICICI 1 1 1 1 1 1 1 ICCCCCTGACCCTTGGCA 
TT 

TTCCTCTCCCTTCAACTT 
Sequence 372 

GATTGAGCTCXXGW<CGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGATGTCTCTTGTC 

AGCIUICI 1 1 CAGAAGACCTGGTGGGGCAAGTCCGTGGGCATCATGTTGACCGAGCTGGA 

GAAAGCCTTGAACTCTATCATCGACGTCTAOCACAAGTACAAGAGATAGAAAGACCAGTC 

CTTGCTGAAAGACAAGTCTGAATGCTCCACTTTTTCAATTCTCT 

A 

GTCAACTTCAATGTCGGATGGATGAAACCCANACAGATAGCAATTCAGGAAATTTGACTT 

TCCATTC 

Sequence 373 

GATGAAATGACGGATGAAGCAGTTGGAGACTCTGCT^^ 

AACCCTTCAATCAAGAAAAGACCTTTGATCAGGAGAAGACTTCTCGTCTCATTTCTGGGG 
ACACA7TCAGGA TTTCTCCAAAGCAGGT GAAGG TACCTGCCCG 
Sequence 374 

TCCCGCGGTGGCGGCCGAGGTACGCGCCAGTCACTAGCAGGTCCTTGTOAATCTCCTCAC 

GGAGGCACTTGCGAGAGTTAATGGGCAGATGGAAGGAGATGGGAAGGACCAATCTGGGGC 

CGAGCAGGAACAAAAGCAGCAACGCTAACGGAAAAGGGCCGCGCCGGGCTGGTGGGCCAG 

ACAAACCAGACATGGT GCTCC CC GC GTACTCCTT AT AC TT ATT AAACACAAAATTAATTG 

TAAAATAGCCTCAGGCAGGTCCTTCAGGAGGTATCCAGAAGAAGGC> 

AGCTGATGGCTCCGCCTGGGTTACTGCCCCTGTAGACTTCCXGTGGGACAGGATTGGGAG 

GTGGGAAGGACAGTGACATGGATGATCCCGGACCCTTTGTAGGTCT AGGCTAACGTGGTG 

TGNTTTGNGTCNTTAGCTTTTTAACCAAAAAAAAGTTTAAAAAAAGGTTAAAN^^V»^C^ 

N 

TNNN^NNNNNNNTNNAAANNNNGGGGTNCCTTGCCCGGG 
Sequence 375

TCCCGCGGTGGCGGCCGAGGTACCTCAGCTGTTGATCTGTGGAGCCTAGGAATCATTTTA 
C TGGAAAT G TTC TCAGGAATGAAAC TGAAACAT A C AG TCAGA TC TCAG GAAT GGAAG GC A 
AACAGTTC TGCT A TT A TTGATCACAT ATTTGCCAG T AAAGCAGTGG TGAATGCCG CAA TT 
CCAGCCTATCACCTAAGAGACCTTATCAAAAGCATGCTTCATGATtMTCCAAGCAGAAGA 
ATTC CTGCTGAAATGGCATTG TGCAGCCCATTC TTT AGCA 1 I C C 1 1 I I GCCCCTCATAT 



GAAGATCTGGTCATGCTTCCCACTCCAGTGGT AAGACTGCTGAATGTGCTGGATGATGAT 
TATCTTGAGAATGAAGAGGAATATGAAGATTGTTGTTAGAAGATGTAAAAGAGGGAGGTG 
TCAAAAAT ATGGACCAGGTGG T ATC TC TACTTT GTTCCAAAG 
Sequence 376 

GGTCACAGGTCTCGAAAAAGCGGGTGGTGCAATGCTCCATGGGGATGAGGGGAf^ACCGC 
^GTGGAGCCAGCTCGGTGTGGGAGAGGTACCTCTAAGGTGTTCTTCCT ACCTAGCCTAGT 
TTTTTTCTACCAACCTAGTTCACCTAGTTTCCTGCCTAACCTCGTTA 
C 

GCTGC1TCAAGAATACTAAAGCAACACTCCTGATATTAAGCTACTACTCAGTTTTTGTG 
T 

GGCAAAAACAGNAGATCACATaXATTTGTCTTTTGNGri CTCTI GGCTGNTTAAGCANC 

AA>iAGTTTAGCACTTTAATTCATTGCTCTACCAAATGGTTTAGTTTGGAAATA 

G 

AffGTGGACAAGAAGf<nTTTGNTTT AATCCCTTCAAAGCCAATTNAACTTGGTTTTTGGT 
T 

TTAGGTNGAGGAAGGGtXARGNANTNGTTCAAAGGTAGGt^ 
COCN 



GCGGTGGCGGCCGGACGGAGGAGACGGTGCTGTGCTGTGTATGAAGACGGCAGTGAATG^ 
CTCTGCCAACAGAGGCCATGTGGAAGTGTTACATCACX: rTTTGCTl GG AAAGATTTACTA 
AGAAGTCAAAT AGTGGGTTCCTT AGAGGGAAGAGGTTGGAAAGAACCATGACTGTATTCA 
GGAAGGCACATGAACTGAAGCTT CTGTCAGAATGCCAATACAAGCAGTTGAGTGTTTCGT 
TGCTGTGTTATAAC 
T 

Sequence 378 

TCCGCCCGGGCAGGTACCAGGTGGTGAAACCAACTGCT GAACGCACAGCCTACCTCCTGT 
ATTACCCCCGAGTGGACCTGCTGTAAACCCT GTGTGCCGCTGNTGTGTGCGCCCAGTTGC 
CCGCTTfKJTAGGACACCACCTCACACTCACrrCCCGNCT 

GAGAAACTCTTTCTCCCTTTGCAAAAATGGGCTAGAATGAAAAGGAGTATGCCHTTGGGG 

ttcgtgcacaacacagc TTCCTGATTGACTC taactttccaaatcaaaattcatttggt 

T 

g aaacangac ttgtttgc ttggatttt agnaaaat acacaaaaaccc cat aattnc tgaa 
acaaattgcttgant cc tggagatnaaggaaag ntgggatttngattcc ccaagtcctca 
ttgcttaagtaggaataaaatccttgcacct^tgcnaacaaccaact^ 
tgaaaaantgaaaattttaa^ncttntccttt aa a a aa a agaaaaa 

Sequence 379

GAGGGACTGCTAGCCAGCCAATAAAATATAAACTCCATTTGTCTTAGTTATATAGAACTG 

TGTTTCCAGCTrAGAAAAAGTCAAACCAATGACTTNTAGAACAAMCTACTCTCATTTTT 

T 

A TTCAGCCTCT AGAACATGGAAGC TTT AAAAG T GAATTGGCTAAANAOGCAAGACC TT C T 
GAAAGTTAACATCTTAATGATTAAAAACAGTAAGTACGCACAACCGAAGCCGTAGAGTCA 
CAC TTGCAACAAAAGGTT ACAAN T ATTG C T AATGGGG C TCTG TCCGGTNCTGCTTGTCCA 
GCTGGACCATC T ATTTCATCCC TCCTCC TC TT GAGCTGTCATTTTAATTGC 



N CCGAGG TACGTTAGCTCATTTTCCCTTAAGCGGG TTGTGACGTACGNTGAAATTGCAAA 
CGCTCAAACTTCCAACACTTGCGTATACAC TTGT AAACCCAGCTTTGN N AAGTGAGACAC 
GCATCAAAATCATGATGAACAATTGACCGGCTGCNTNGCAGTCAAGCAGTTGGGTTA 
Sequence 381 

CX^GGTGGCGGCCGAGGTACACCATGTGAAGACTGGACTTAAA 

AGCCGAGAGAGAGGCTGGAACATAGCCTTCCCTTTGGAGGTAGCCTGGCCCGGNGGGCAC 
TGTGATCTCAGACTTCCAGCCTTCAGAACTGTGAGACAATATTTT A t igm i aagccac
T 

TAI 1 1 1 1 IGGTACCTGCCCG 
Sequence 382 

WGGCGGCCGAGGTACT H II I I I I N TWTM I l ll l l l ll I GAGACGGAGTTTCACTCTTG 
T 

GGCGCAGGCTGGAGTGCAACGACACGATCTGAGCTCA^ 
TCAAGCTATTCTCCCTCCTCAGCCTCCCAAGTAGCTGGGATC 

CNOCCNGGCAAATG I II 1 1 1 1 1 GGATGTTTAAGNCi*GACGTGGAGTTTCTCCATGTTGGC 
CAAGGC I GO I LI I CAAACTCCCTGAOCTCAAGGGNGATOCAOCtfrGTCTOUSCCTTCCAAA 
GNGCHTGGGGATTTATAGGCRATGGAACCAATNAACG^ 

AT ACANNACTACCATGNAGTTAAATCTGCKANT AN NATTGG AATCCC 
TCCTNl mil l AAAATGA * 



CTGCCGAGGTACTCACAGTCACMC*AAT^^ 
TTCTTGGCTnACAGGTTCCCAGGKCAAGAGCTTTACCC^ 
ATNATCC^AC^f AAACNACACCTCCT^^ 

GCCTTTGGTGMAACXrrTCNTCCCCAA^ 

CANNNGGTNACCGAAAWGAATCAATTTCAATTTTCCAATATGCNTCC^AAGGGGTTTTTC 
CCACTT ATTNCACACCTTCTTGA*GGGCCN NGAAl^GC 1 1 ICI 1 1 CAAATATTAANCCCC 
NC 

AAAATTGGTCACCCCAAATCCTAATTTCCTTTC 
C 

TTTTTCCCTTrTGAANCCTGGAAGAACAAGGTCTTGGAATt 
CM 

NCTGC T AAAAAAC T AAX NNMGGAA 7 ^rcCCCCCCCGGGCCTGCAA^30GGAAA.TTT CCNN T A 
WTCAAAAGCTTTAATCTNATT ACC CCNTCCAACCC TTCCAAAGG 
Sequence 384 

AGACTGCAGGAGATGTGGGCCGTGCCAAAGAGATGGATGAGACTGTTGCTGAGTTCATCA 
AGAGGACCATCTTGAAAATCCC^ 

TTTTGTCTGAAAATCAACTGCAGACTGTAAATTTCCGACAGA GT AGTTC 
AGCACTTGATCXATCTGTGTGAGGAAAAGCGTGCAAGTATCAGTGATGCTCiCCCTGTTAG 
ACATCATTTATATGCAATTTCATCAGCACCAGAAAGTTTGGGA ! G 1 1 1 1 rC AGATGAGT 
A 

AAGGACCAGGTGAAGATGTTTGACCTTTTTGATATGAAACAATTTAAAA 
Sequence 385

GTACTCCGTCTC^GAGGANGGGATGCAAATCTTCGTGAAGA 

CACCC TT GAGGTCXiAGCCCAGTGACACTATCGAGAACGTCAAAGCAAAGATCCAAGACAA 
GGAAGGCATTCCTCCTGACCAGCANGAGMGTT GATCTTTGCCGHGAAAAGCACGCTGN GA 
AAGATGGGMGCCGCCACCCTG TGCTTGNACNTANCAACAATCCCATGAAAGGAGGTCTAC 
KCCTGGCACCCTTGG 
Sequence 386

CTTI I GAAGGCCCCGNTCGCCCGGGCAGGT ACTCCCTGAT AAAGGGGAATTTCCaTGCCG 
TCTACAGGGATGACCTGAAGAAATTGCTAGAGACCGAGTGTCCTCAGTATATCAGGAAAA 

agggtgcagacgtctggttcaaagagttggatatcaacactgatggtgca^ 
ggagtcctcattctggtgataaagatgggccgtggcagcccacaaaaaaa 
aagccacaaagagtagctgagttactgggcccagaggctggggccctggacatgt actct 
cagaatgtrrgtcatatgcttcttgcaatgcatattttttaatctcaaacgtt^ 



CACGCCTGTAATt 
TTTCAGACCACCCTGGCCAACATI 



AACCATTTTrCAGATATAAAGAGAAlTACrTCAAATTMGAGT 
A 

GAATTTAAGTTAAAAAGT GGTTTGGAC TTGGGAACAGGACTTTTATACCTCTTTT ACTG 
T 

AACAAGTACCTCGGCCCGCTCTAGAACTAGTG 
Sequence 387 

TCCTGTATT UCC 1 1 1 1 1 AATCTTGCTTGTTTAAGNACNTTTCAGGGATTG TCATCATTG 
A 

TCATCTGTAAAATTGTCAAGNAjCTAAGGTCCTAAACCTTAATC 

ATAAAAATATCACCAACATAATATANCACGG 
ACTAACCCCT AAACCTTCTGCNT AATGAATTAACNAGAAAT ANGGGGGGCAAGGAGNGCC 
ANAGCTAAKACCCCCTN^ACCAGACGAGCTACT'fTAAGi^ACAGGTA 



rr act aaaaatacaaaann gg 

GTGTGGTGGCGGGCACCTGTAATCCCAGCTACTTGGGAGGCTGA^ 
AACCTGGAGGCAGAGGTTGCAG«SAGCCAAGATCACGCCA 

GAGGGCAAGACTCTGTCTCCAAAAAAAAAGAAAAAAGGAAAAAAGC C 1 nCTTG ATGCTG 
TTCGCCATTTCTCCACTAAAACGC C I GC 1 1 1 IC I I AACTCCACACCGAACCAACCTGA 
AA 

TATTTTGGCNAGAATGCCAACAAGAATTGAAAGAAAAGATGCTTT ACAAAAATAACAATA 
T AAAAAGCAAATT ATATT ATCCCTTTT ATC TCCATTCTT ACATTAAAAAAAAAAAAATCG 
GCCGCTCTAGAACTAGTGGGATCCCCCCGGGCTGCAG^ 

Sequence 390 

ACTACNCCGGG C1 1 1 1 CTCAGGCGGNGGCATGGCGGGACAGCAGGATCCGGTGCANCGGN 

AGATTCACCAGGACTGGGCTAACCGGGAGTCGGCCGCTCTAGGGGN 

Sequence 391

CGCCGAGGT ACGCGGGATGGGATTTC TGACCATTTGCCCTGCCTCTTGCAAAATAGGTCT 
AATGGCAGGATGGTGTCATAATTAAGGCTACCAAGACTGCCCATTGTTCC^ 
GTrCATAATGGGGGCAGACAATAGTGCAAAAAAATTTTACATTrTATCTTTAGW^GTC 
A 

GGGTCAAATTGATTTCCATGGTTGAGGATGTAGCGAAGTGTGGAATCAGGTGGAATAGGT 
GGAGAGTTGCCCATAGTGGTTTGGAAAAGAGAAGAGGACTTTGAAAAGTGGAGGGCTCAT 
TAGGTGACCCAAATTTTACCTGGGGCATCCCCCCTTTAGGGCCCCAACTT AGTCTGTCAG 
ACATGTCTGACCTT AGATGGGTGCTGGCACCACTITGGAATGGTTCCCTGCATCACTGAG 
G^CTGACTTAAAGTTTTTCTATCTCACTTAAAACAACOCTITAAOGCTCTCAACTTAG 
G 

CAATAATAAA II IX 1 1 1 1 CATGAA TTCCC TTCA 
Sequence 392 

AGCGCGGGGAGAGGCCGGTTTGCAGTATTGGGCGCTL 1 1 UJGCTTTCCTCGCTCACTTGA 
CTCGCTGCGCTCGGGTCGTTCGGCCTGCCGCCCGAG 

GGGCGGTAATTACCGGTTTATTCCACCAGGAATCAAGGNNGGATAAACGCAGGGAAAA 
ACATGTtffACTCA AAAMA flGCCAACCKWAA^ 
CGTTGCTTGGCGG 1 1 1 1 1 1 CCAT AAGGGCTCC 
Sequence 393 

KATTGGAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGACACAGGCACTCCTTTG 
TCTGGTAGAGAGGAGGAGGGGAAATGGAGCTATTCCAGGATACAAGGGATGGCACTOAGG 

iTCAGC 
TTGGGGAGCTAjGGTATCAGAAACCTGAAGGATCCAGCCCGCTTTGTCCTACTAGTGTCTA
TAAGTCTCTGTCCTGAGATCCTGGGGCTCCTCCTAn I C 1 A GAAGGGATGAGGTGCCATC 
AAAAATAACTTGGCTGGTGTAACAGTTT AGAGAAGGAAGTCACACCTG TAGCCT GGCTGG 

caggcaggtggacatgaggctgagaagggaagccagatgtcagaacatact AGGCT AGCA 

TGCCTG 
C 

Sequence 394 

GTGGCGGCCGAGGTACCAGG CTGGCGACAGGTGC TACCAGGAGTGGGCTGAGGGGAGAAA 
AACTATCTOCCACTCTTTTGGCCCAGGCAATGTCAACGACTTCCACATTGCCTGGCCCAC 

AG^GTGCAGTGCAAATCAACCAACAATTTACTGGTGGAATGGCAATCAAAGGAAACAGTT 
AAACACCAAACAATTNCTTAAAGCCAAAAAATATTTTTCATGGAGTTG^ACATTTTTCG 

A 

GTGIGJ Mill ICAAGTGTAAAAGCAGTGACAT7TTGTTCAAACAGAAGCAGCATCTAGG 

AATTCTGGCACTTGGGGT7CTAAGGGGGTTACAGGTATGCCATCATGGATTCTTCTCC 

C 

NGGGGCCGGGCCCCCGG N GGGGTT ANCC TTTCCATTTTT NANCAACC TTTT AAAAGCC CT 
TGGGGAGGGNGGGGTTTAANGGGGAATC^XTTTNAAAATTTTT^ 
CCOOCATTAAGNAATTTCCCAAGGTTTTTNAAGCCTTITTTAAACCCCT^ 
GWAAAAAGGTNGGAAAAAAGGGCCA KI M I N I I A CCAAAGCGNGGGGGGAGNGGAAGGG 
CCAAAKTT3GGAAGGAAAAATTAAAANGGGCAAAACCAAGGAATTANATTACCGTTCCAAA 
AAAGCNTGGGGAAACGAAGGGGGGCAGGAAAATTCAAGWAAACCGTTGG7CCTTGGGCCT 
TATTCAAGCCm 1 1 IUGI 1 1 1 1 1 1 1 IGGACCTTACCTTAAAAGGGCCCCAAACCCCTT 
T 

n 1 1 1 AATTTCCCTCCTTGGGAATNGGGGTTCCTGGCCAAGfO^CCCCAAAAGGTTTCCAA 
GGGAAAATTTTTrAAGGGCCCAAAAAAAGGGGAATTTTTCCCCCAAAA^ 

AAGGMAAAAGGGAAAANGGCCCCCNNCGAAGNAAAGGGGNCCTTTTGGGTGGGAATTGGG 
AAAACCCCCAAAAAAAAGGAAAAATTCCNTTTTTTAAAAAAAGGGAAAAANGGGGGGTTN 
TTN CC TTTCNAAAAAAATTGGC CCAATTTN GGTTCCCAAGGG TN AAGG N AATTTTTTTG 

G 

GGGTTNAAAACCTTTGG<3GGCCAANGGGGGGGAAAAAAAACCCTTTTGGGTTCCTTTGGG 
GGGGNAAG 



TGGGGGCCGGGCCCCCGAAJ^GGTTACC^XXGCGGGGGGGGAGGCCTTTTNTTNCCCTTTG 
GGCCCAGGGTNTTNCNTTTCCCCAAGNCAANGGAAACCCCC 1 1 I CU 1 1 NCTTTGGGTTT 
^^GGAAAAAANGGAATGGGGTTCCCGGGCTTGGCNTTTTTTGGGGTTANGGGCCACCGC 
TTCAAGTTCCTTGAAATGGTTCCCGGCNCATGCTrrCCCCGGGGCCCGGCTTCNTAAGNA 

ATCCGGAT AACCCCG GTCCGAACOCCTCGGAAAGGGGGGGGGGGGGGCCCCCNGGGGTAC 

CCCCAAGC I If 1 1 Tl GGTTTTTCCCC Mill AAAGTNGGANGGGGGGTTTTNAAAATTT 

T 

GGCCCGGCCCGCCTTTTGGGGCCGGTTAAAATCC AATTGGGGGTTCAANT AAGGGCCTTG 
GGTTNTTTCCCCTTGGTGGGTGGGNAAAAAATTTGGGTNTTAANTTCCCCGGCNTTCCAA 
CCAAAANTTTN C NCCAACCAACC AAAACC C AATTT AN C CGGAAAGGCC CCNGGGGGGNAA 
GGCC CAANTT AAAAAAAAGGG TTGGGTT AAAAAAAGG GC CC CC TTGGGG GG GGG GGTTG G 
GCCCCCNTNAAAAATTGGGAAAGGGTTGGGAAAGGNCCCTTTAAAAAACCCTTTCCAAAC 
CAAATTTTTAAAAAAFfrTTn>IGGCCCGGGTTTTTTGGACCCGGCCCNTTTCNAACCCT 



GTGGGGGGCCGGGGCCCGGGAGGGGTAI 
AGGTTNTCTTCCCNAACAAGGGGAAGCCC TTNT7TCNTTGGGTAT7TTGGAAAAAGGAAT 

CCCGOGAATGGNGTGGCCNGGCCCCGCATCTTANGGAAACCTANGT^ 
GGGGGCTTGCAAAGGGAAATTTCNGAATATTCAAAAGCTTAATCGGAATNACCCCGG 
GNA CCCCTC NGGACGGGCGGGGGCGCCCCCGGGGTAACCCCAANC M I l l ll ll IG GTTTC 
CCCCTTTTT AAAGTNGGAAGGGGGGTTTTAAAATTTGGGCOtGCCCGCCTTTTGGGGCCG 
GTT AAAATTCAATTNGGG GTTCCAAT AAAGGCCTTGG TTTTTTCCCCTTGGGTGGGTTGG 
AAAAAAATTTNGGNGTNATTNCCCGGCfrTTTCAACCAAAA 
AANCNCAATTTAACCCGG*iAANGNCCCC03GGCGGGG^ 
TTGGGTNNAAAAAAAWGGNCtXCTTGGGGGGGGGGGGTNGGCCCC^^ 
*ACGGT TGGGG AAMGGGCCTTTAAAAACCTTTCCAAACCCAANTTTTT 
GCCCGGTTTTTGGMCCCGGNCCfnTTCNAACCCTTTG 
NCCCCAAAANG TTTCC TGGGGGGGGGGAAAAAAA 



GGGGCCCCC 



TCCCCCAAAAGGGTTTNCC GGGGGGGGGGNAAAAA 



GCGGCCGGGTACAAAATTTAi 
TGTTACTACX1AGATCCAGTTCTTCTTAGGACAGTGTGGCTCAAAGGGATGAGACCTTCCA 
GACACTGGTATCTGAGCATCTGTGGCCTGCCCCTGAGTTGTCAAGATAATTTCTTATCTC 
TGAAGGAGTCCAGACAGGAATGCTTCCACTGCTGGGTGGGTGCTCGCCCX 
TAAGCGCCCGGCTCACCCCCTTGCT AGCACAG GG TGTCTTACACAGTTT ATGGGAC TTTT 
CTGTGAACT ACCTGAGGGGAAGAACCATGTNCCACTCCCTGCTTGCTCCTOVAATATT^ 
A 

Sequence 399 

CNGCCGAGGTACNCGGGGAGAGAGGAAAAGAACACAGATCTCGCATGGTTCA^ 

TTTTTAGGTCCAGGAGTAAGATATATCATACfKSAAAATGAAAATTATAATTCTTCTTGG 

A 

TTCCTGGGAGCCACATTGTCAGCCCCACTTATCCG^AGCGTCTCATGTCTGCCAGCAAT
AGCAATTGAGC TT ACTTC TT AATC TTT AAT AATGG GTCAACTTTTGCCACTACAAACTT 
C 

AGGGGC^CACTrAATTCATGGANTCCACCTTTCTCTGGGAATTTTACAACAGCAGCAGCA 
GG C TCAAATTCCAGGAC T C TC CCAGTTC TCTTT ATC AGCTCT AGAC CAGTTTGCCTGGAA 
CTGCTCCCAAAATCAGAATACCCTTAACCAGGGAAGAGGCGAGTTrrGGNCCCAAAGGGA 
GCCCCAAGGCAAGGGCCAAGGTTNGAATCCChfTTAACNGNhfnT AAAAACAACCCGCCTT 
T AAGAACACAAACCCAGG N C CCCCAN GACAC CG TTGAATGCCC CTT ATTGTTATTT C TTC 
CC 

Sequence 400 

GACAGACAGTGCTTGATGTrCATAAAAAATACAATGCCCTGGTAATGTCTGCATTGAACA 
ATGACGCTGGCTTTGTGGCTGCTCTTGATAAGGCTTGTGGTCGCTTCATAAACAACAACG 
CGGTTACCAAGATGGCCCAATCATCCAGTAAATCCCCTGAGTTGCTGGCTCGATACTGTG 
ACTCCTTGTTGAAGAAAAGTTCCAAGAACCCAGAGGAGGCAAGAACTAGAAGACACACTC 

GTAC GCGGGGGCTAACCAGGCCAGTGACAGAAATGGATTCGAAAT ACCAGTGTGTGAAGC 
TGAATGATGGTCACTTCATGCCTGTCCTGGGATTTGGCACCTATGCGCCTGCAGAGGTTC 
CTAAAAAG 
Sequence 401 

CGGTGGCGGCC GGTTGCCTTGTATG TC AC GAGCAATT AGGAG A GTCAC GAGGATGAAAT A 
GATGAACCCGACCATGCAGTTAATCACCAACATCAACTACTAGCCAGACGGGATGAACCA 
CAGCGTCACACAATACAGTGTT«TGTTGTAAGTGTAACAACACACTGCAGCTGGTA^
GAAGCCTCACGGGATACTCTGCGACAACTACAGCAGCTGTTTATGGACTCACTAG^ 
GTGTGTC TC G TGGTGTGCAACTGCAAAC CAGTAAC C TGC TATGGC CAATTG TGAAGAGAT 
GGGAGTCTCCCCGTATTGCCCAGGCCGGTCTCAAACTCCTGGGCTC^ 
GCCCACTTCCCGAAGCCCTAGGATTACGGGAGTGAGCCACCGCACCC^GCCAGAAAAACG 
JTTAAAAATTTGGAAAACCTTAC I 1 1 M 1 1 AATGAGCATTTTTGCATCAAGGGGGTT AC 

GGGACATTAGGCT 1 1 i I HI 1 1 
Sequence 402 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATATXXTCTGTGG 
CTGCTCTCAGAGTGTGCACTCTGCCCACAAGCCAGCGCTCAAACTGGAAAAAGTATGTCA 

atgtcctgaatxitgggaaaacctttagccga^gttcttatcttgttcggcatc^mvagaat 
ccacacaggcgagaagcctcacaagtgcagtgagtgcgggaagggctttagtgagcgctc 
caacctcact gcccacctacgaactcacacaggggagaggccct atcagt gtgggcaatg 

TGGGAAAAGCTTCAACCAGAGTTCCAGCCTCATTGTCCACC^ 

AAAGCCTTACCAGTGCATTG TCTGTGGAAAGAGATTCAACAACAGTTCCCAGTTCAGTGC 

TCACCGGC 

Sequence 403 

A1 ^GGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATTAAGT ATTAAAATGAGGATTGAA 
CTGGGGCAAACAGGTTATTGTGAAAACAGTCAATATGTAAGCTCCTTCAAGGGAAATCAA 
CT ACTGTTCCTCAAGATT AGAAGATGTNCACAC TCTTTGGATT ACCTCCC TAAAGGAGGA 
AACAC C CATTAATTTTCGCTTATGGAATCAATATGGAG TGGAAATATGAAATGAGGAGA T 
GTTTT AGAAAGCAGGACANATCTACCTACCATT AC TGGAATTAAAATGT ATCC TCT GGG C 
CCACTCCA7TGATTCCGATCTGAGGTGAGGAGGACTAAAAGCAGCAGCAGGTT ACAGAAA 
GACTGAAT AAGATGAAAGT ATGCT ACG T ATG TC T AGC TGGGGAAGGGGGGATCTGGAAAA 
A 

Sequence 404 



ACAGTCTACACAGCNCGACCAAAGCCCCCGCGTACCCAGAGGAGTCGCTGGTGATNGGGG 
AGCT CAACCCTGTTNAGTAGCTCTGCTCATCAAGTGTCTGGAGAAGGAGGTTGCGGCATT 
GTGCAGAT ACACACCC CGNAGGAACATCC CTC CTT ATTTTGTGG CTTTGG TGCCACAGGA 
AGAAGAGTTGGATTGACCAGGAAAATTNAGGTGACTTCTCGANGGCTTTCCAGCTTGGTC 
VI III 

Sequence 405 

ccgcggtggcggccgaggtacgcggggggcggcggcggagagagctggctcagggcgtcc 
gctaggctcggacgacctgctgagcctcccaaaccgcttccataaggctttgcctttcca 
acttcagct acagtgttagctaagtttggaaagaaggaaaaaagaaaatcgctgggc^ 
ttttcttttgttctttgccaaagtcgt^ 

tttagaggtgctatcixxagttccttgcactcctgtraacaagcacc 

cagcagcgatagcagccgcagaagagccagcggggtcgcctagtgtcatgaccagggcgg 

gagatcacaaccgccagagaggatgctgtggatccttggccgactacctgacctctgcaa 

aattccttctctaccttggtcattctctctctacttggggagatcggatgtggcactt 

tg 

cggggtntgtgtttcttggtaagaactcnatggaaacaggcctcctt 

Sequence 406 

tccccgcggtggcggccgaggtacagttcacagtgc ttgatgataataaatggttattt^ 

ACTGGTTCATGT ATTTACTATATCAT ACTTTTTTTCATTAGAGTGTGCTCCTTC T AC TTA 

TGTAAAAAAAAAGTTACCTC^GGGAGGTCCTTCCTGAGGTCTTCCAGCACACGGCATTGT 

TATCATAGAAAATGACAGCTCCATGTGTGTTACTGGCCATTACCACCTTCCAGTGGGAAG 



GA TG TGGAGGTGGAAAGCA TAC TGATGATTTTGTCCCCG tggaggcct aag c taatg tgt 
g tg tttg tg tc ttagc tttcaacaaaaaaaag ttt aaaaaggaaaaaaaaaaaaaaaaaa 

A 

Sequence 407 

gtggcggccggtgtgctcatcgtagcctcgggtcgggggatc 



gtacctccxitgggtgaagtctctacatagctctcaggaaccttcggaaaggcatccaact 

C7T7TACCAAACTTAAAGTTTTTTTCCGA7TCAGTC« 

c 

TCTTCCTTCATATAGTCATGCTTGTGTTATGGTCCCAGCCTAOCGCCATGTTTTACAGA 
A 

tgacacattacctccacactctcccggactaggi ngtcaacagggccacagggttgcttt 
ctgtcttrggtggggcaggggagttgacagggatgagggtccaaggaattaagcatggaa 
tgaca*gaaaacangggaaagagttaccctgtcacatagtaggttaa^ aagggt 

TTGCAAGTAAGAGGNMTTrcGACCCTTrCNCTTGGCTGAG^ 

AGC TTTT ACTGGGATTTTCAATNNAAAAAATTAACAACAATGTCAAACTNGG 

NATTGGNTT AAAGCCTTTTT AAGATTCTTTTTT AAAT AACA TTTTTCCCCGAAAAAAAAA 



Sequence 409 

TTTTNGGGGGGAGTTAAATAAAATAAGCATGTCTNCATCCTTTATTCCT AAACATTTAC 
T 

TATGACAAATGTAANNACTGACAGAAATTTGAAAAATACGANGACACTTCTTAAATGATT 

TCCCTTGGTTCAAAATTTACCCCTTCTTGGGTTTCTNTTGCT^ 

A 

C TC TTC TC TTTTT AN G TTT GAAC T ATGCAAG TGCCAAAGGATTCCNCTG T AGTC TTTCC 
A 

AAGGGGGGGAAAGGGGGTNTATANAAAAAAAAAAAACACCTT 
Sequence 410 

GGGC AGGT AC TGTGCAG TAG TAAC C CATAATTC T AAAT GAGGATTATGGATTTTTCTGGA 
AGATTCI II I MO 



TATATGTATAT 
Sequence 411 

GGTACGCGGGGTGCTGGGATNCAGGt^GAGCCAGTGCGCCCAG^^ 

TATT AGCTG ffrCTGGACTGNGGGGCTCCTTGGGCAGATGCTGTATTATGGGGATAAGCCA 

CACACTTTNTGAAC TG GC C C GG T CAG GGGGGACAT AN CCATTTCCTGTGCC CCC CATCAA 

NACCCACCTATTCTGAGNGTNNGCTCCTCC^CTGCTTGAGTNATGGCCACANATCTTGGC 

TCGGNNCTCCTAAGCTGCATGNTGAATTCCTGGGACAACAAGACTGGCTTGTGG7TCCAT 

TCTCCAGATCCTTGGGT 

Sequence 412 

GCCGGGCAGGTACTTaGAGTTTTCCAAGTATGTTCTAAGCACAGAAGTTTCTAAATGGGG 
C CAAAATTCAGAC TTGAGT AT G TTC TTTGAAT AC C TT AAGAAGTT ACAATT AGCCGGG CA 
TGGTGGCCCGTGCCCGTAGTC«>GCTACTrGAGAGGCTGAG& 

CCCAGGAGGTGGAGGTTACAGTGAGCAGAGATCGTGCCAC TGCACTCCAGCCTGGGTGAC 
AAGAGAGACTTGTCTCCAAAAAAAAAGTTACACCTAGGTGTGAATTTTGGCACAAAGGAG 
TGACAAAC TTATAG TT AAAAGC TGAAT AACTTCAGTGTGGTATAAAACCGTGGTTTTTA 
G 

GCTATG TTTGTGATTGCTGAAAAGAATTCTAGTTTACCTCAAAATCCTTCTCTTTCCCC 
AATT AAG TGCCTGGCCAGCTG TCAT AAATTACATATTCCTTTTGGG
Sequence 413 

GCGAGGTACCTAGTCTANATGAGTTTGATGCTTACAGTCAAGGCTATTAGCAAATATT^ 
GGAAAAGTAAAGCCTAAAGAAGAAAAGAGGGAATGAATA G 1 1 PSTC I A GAGATAATAAAA 
GGAAGGTGAATTTTTAAAAAGACAAAAATAANGCTAGAAAAGACTGAGTGGAGAAAGCCT 
ACAGAATTTCAGAAAGCTAAAGAAATTGGAAATTAGATTGAATATAGATAGAAATGGGAG 
GACAATGCAGCCAATGAAAGACTCTGGGGACTAATAAAGGGAGAGOCCTGTGGTTTGGAA 

agtgtcccttaatcagcctgcagtgctgcaaaacagaaacccagag 
Sequence 414 

GGTGGCGGCAGG T ACGCGGGA TCCAAGA TGAATGTGCAGAGAAAA T AAAGAA T CGAAAGT 
eATAGTCATGAGGACAGAATAAAGACATTTTATG OC I I TTIGTTIT G n 1T G 1 1 T1CCTT 
TTTGTGGAGAACAGGG I C I C I C I ATATTGCCCAGGCAGGTCTTGAACTCCTGGGCTCATA 
CTGTCCTCCTGCTTCTGGCTCCCTAAGAGCTGGGATTaCAGATGTGAGCCACCATGCCCG 
GCCAGAATAAAGACATTTTAAAACT AAAAAAAAAAAAAAAAAGAGTTTGCTTTGCATTAA 
TCIMI rTTTCTTTTm 1CGTTI I I AH ITI'I AGII1 f I ATTTITTf 1 GAGACGGAGTC 
TCACTCTGTCACCCAGGCTGGAGAGCAATGGCATGGTCTCGGCTGAC^^ 
TCCTGGGTTCAAGTGATTATCCTGCCTCAGCCTCCTAAAGTAGCTGGGATTACANGTGTG 
ATAAAAGACATTTTAAAACTTANGGAAAANAAAANAAAAN 



Sequence 415

ATGCCCCCGCGTAi 

ATTTCGATATCAAGCTTATNGATACANGTTGACC 
TCMAG 

Sequence 416

CCCCGCGGTGGCGGCCGAGGTACGCGGGGCTGCGGAGGACCGTGGGCACGCCAGGGTCGG 
TGAAGGATCCGAAAATGGC TGGGCGAAAACTTGCTCT AAAAAOCATTGACTGGGTAGCTT 
TTGCAGAGATCAT ACCCCAGAACCAAAAGGCCATT GC TAG TTGCCTGAAATCC TGGAATG 
AGACCCTCACCTCCAGGTTGGCTGCTTTACCTGAGAATCCACCAGC T ATCGACTGGGC TT 
ACTACAAGGCCAATGTGGCCAAGGCTGGCTTGGTGGATGACTTTGAGAAGAAGTTTAATG 
CGCTGAAGGTTCCCGTGCCAGAGGATAAATATACTGCCCAGGTGGATGCCCGAAGAAAAA 
GAAGATG TGAAATCTTG T GCTGAGT GGGG T GTCTCTCTCAAAGGCCAGGATTGT AGAA T A 
TGAGAAAGAGATGGGGAAAGATGAAGAACTTAATTCCATTTTGATCAGATGACCATTGAG 
GGACTTGAATGAAGCTTTCCCAGAAACCCAATTAGACAAGAAAAAAGTNTTCCTATTGGG 
CCTANCCACCCATTGAGAATTA TT AAAT 
ATTACCCATTCTGGGCCTTTAAAATATTA71 
AAG 

Sequence 417 

GGCGGWCCI fTTTTTTm fTTI 1 1 1 ITTT1TTTCAGAGQGAGTTTTGCT C1 1 1 1 IGCCC 

GGGCTGGAGTGCAATGGCACGATCTCGGGTCACTGCCACCTCTGCCTCCTGGGTTCAAGT 

GATTCTCCTGCCTTAGCCTCTTGGGTAGCTGGGATTACAGGCGCCCAjCCACCATGCCTGC 

CCAATTrrGTATTrrTAGTAGAGATGTGGTTTCACCATGTTGGTCAGACTGGTCTNGAA 

C 



GCGACCGTGCCCGGCCATCAGTTGTATTT>rrATATAGTAGCANATGAACAATCAAAATGN 
GATTAAANAAAATGCCNTTTTT AAT AGCC TT AAAAAAAAAAAAN TTNTT ANT GAA T AAAN 
TTTAANCCAAAGG AGGGG N CAAAC C TTTTC CN TGGGAAATT C CAAAAC N CNTNTTTGGNA 
NGAATTCAAAGNAGGNTGAAANC CC N NCCC C TTTTTN CGGN GTThWiAAAAAA NANATTT 
TTTANNGGGGGGNCCCCNCCCAAANHATANTTCCNCNGTGGGGGGCCCTCTAAAAAANAN 



rrAAAAAAAAAANNTTNTTTTTTTGGGNG 
Sequence 418

cgcggtggcggcccgaggtacgcgggattttgaatgaattctcaacaaaatgtgct agcc 
actggggacgcaaaacaagtaagatcx^gttgcaagam 
g ttggca tggagac taaaattctcaggaaaatgagatccgtgtt agattagaag tcctga 
tgtgaaatgggaggactcaggaaggaggatcgtctttacctgaggatttctagccagagg 
tcccagatgcgtgggctgagaactxagcgataagggggcgttccca^ 
ataagaacagaggaggcagcaggattgcacaagccccaggcacagtggcagttaggatgg 
ctggagagtagt^tagttctatgggttgcccaaaaaatgtgatgtgctt catgttttct c 
tgactcatggatctggtagagaccatagacatgatataggactaacttgcccatttttca 
canagaggaaaccatcctt atgac ttacc tt a> 

A 



CTAAAACTAGGGGGATCCCCCGGCTGCAAGGAATTCGATTTC^^ 

NCACCTCGAGGGGG 

Sequence 419

COGCGGTGGCGGCCCGAGGTACAGTATATTGACCTTAAAAATCAGTAAAGCAGTCATGGA 
AATAACAGG TCG TGTATTATTCATGGGCACAAAC TGAC TCATGGCTGGGGAAGAAGCAGC 
CACCTTAGACCAGATGGACAAGCCAGATACTGCAGAGAAGTTTCTGGGCTTTTCGGGGAG 
CTCTAGATTCAATTCTGTAAAGTTATGATGCAG I H I C I CCTTCCTCTCCTCTCACCTN 
C 

T>rrGAGCACAGC7TTCAACAAAAACTTTGCATACCCCGCGTACCTGC^^ 

CGAGGTACnCTCTGAGCATTGGCCTCTGGCTGGGATTATGCTTCAACAGT^^ 

AGGTCCCTGGCTCCCTrCTGTTACAAAGTC^GGGAATGTGAATTCAACCCGTGATATTCT^ 

TTGTAGGTCTCTTGGTATGTGTTTGCCTCAAAAGGAGGCTTCCCAACTAAAAATTCATAG 

CAAAGAACTCCAAGGCTCCAAGAGATCCACCTTCTCATCATGCATCCAC^^ 

TCANGGGGCANGGAG7CCAAGGTGCCACAAAGAGNGGTCCTTCTGGGAAGATGGAGCATG 

TACCTCGGGCCCTCTAGNACTAGTGGAT 

Sequence 420 

GAGGTACGCGGGGGTCGGCGCCATTTTGTCTCGGCAGCGGTGGCCCGTAGCTCCATCGCA 
TTTTATGTTTCTGGCGAGAAGGGAAGGGAGTTTTCATCAGGTAGATTGG I M I IGT 
Sequence 421 

GGGGCGGCCGCCGCCTN CCCG TGAAAGACC TCCTGC T GGAAGACC TCCAGGATGGAGAAG 
TGAGGCTGGGTGGCTCCCTGCGAGGGGCATTCAGCAACAATGAGAGAATTAAAAACTTCT 
TCAGAGTCAGTTTCAAAAATGGATGCCAAAGTCAGACCCACTCGCTACAAGCCAATGACA 
CTTTCAACAAACAGCAGNGGCTTAACTGTATTCGTCAAGGCAAAGAAACAGTTTTGTGTG 

ctgccgggcaagctggggtgcttgactccgagggatcgttcctaaatcccaccaccggga 

ACTTGTAGTATGGACACNAGTGAGGTCAGCCTCGACTGTGAGCGCATG<^^ 

CNGGAAGCAGGCGTGCATCTTACCTGTACCTGCCC 
N 

Sequence 422 

ACTTCC^GCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAACTGGGGAATTCTGGCCCTAC 
GTGCATTCACAGGCAATGATGGGTTTGTGTGTATGGTGTCATGAGATCCTCTACCTCATA 
ACAAAAGGaCAGTGGGT AGAC T AAGGC AG T AGC T CAAAGG GC TTT G C AAAATTTT AAT AT 
A TT AAAACAAGAGGCATC TGC T AGAAAACATTC T A TTGT AT ACAT AC TGAAAACCCT AT A 
AGGT CC TGGATAATTTTTG TTT GA TT ATTCA TTGAAGAAACA TTTATTTTCCAATT G T G T 
GAAGTTTTTGACTGTTAATAAAAGAATCTGTCAACCATCAAAAAAAAAAAAAAAAAAAAA 

agtacc 
Sequence 423

NCCCGCGGTGGCGGCCtXJAGGTACGCGGGAGAAGGAG^ 
TGTGAAAAGCCCACAGCTAAC^TCATACTCAATGGTGAAAGACTGAAAG^ 
^^CTCATGAAGAAGACAAGGAGGCTTGGTTTTGTGGCTTCTATTTAACATGGTAATGGGA 
AjST TC i AGCC AAAGGAAGTAAGCAAAAAAAAAAATCGAAATTAGACAGGGGGAAGT AAAA 
TTATCI I ITTGCAGATGATATGACTTATATGTATTATAGAAAACCCTGGGCCAGGTGCA 
A 

TGGCTCTTGGCTGTAATCCTAGCACTTTGGGAGGCCGAGGTGGGTAGATTGCCT 
ANAAGTTTGAGACCAGCCTGGGCAACACGGrTGAAACC^ 

AAAAAAAAAAAATTAGCCCGGGttnOGCGCAT^^
TGGtSAMGCTGGAKGNTGCANTCAGC^ 



AAAAWTTTTAAAAAGGAAACTTWGWATTCCC O I IICIIIII t ATTGCCTCCAAAAAAAAA 
Sequence 424

CCG^GGTGGOGGCCGAGGTACTGCCGAGCCGCTCCTCCCGCAGCTGTGCCGCCTCCTTGT 

CCTCCTOC I LATTG TCACTGCCAAACAGGTCAATG TCATCATCC TCGTCATCC TCTGC 

TG 

GTGTGGCTGGCTTCCAAGCTGGTGCCCGTGGGCTACGGTATCCGGAAGCTACAGATT 

TGTGTGGTGGAGGACGACAAGG7GGGGACAGACTTGCTGGAGGAGGAGATCACCAAGTTT 

GAGGAGCACGTGCAGAGTGTCGATATCGCAGCTTTCAACAAGATCTGAAGCCTGAGTGTG 

GGTACCTGCCCG 

Sequence 425

CX^TCCCGCGGTGGCGGCCGAGGTACTAAGTGGTTAAGGATGGAAAAGAGCTAAGAAGTGA 

CAACAAATACAAAATAAGCTTCTTCAAGAAAGTATCCGGCCTTAAGATCATCAATGTAGC 

GCCGAGTGACAGTGGGGTATACAGTTTTGAGGTGCAGAACtXTC 

CACAGCTT CATTGCAGGTTTCAGGTTGGTTGATTTCTTGGGC mTCCTT CA TCATTAT 

A 

ATAATGTAGTTCCTGATTTTCATAAATGTATATGGGTTGTTACATCTrCTATAGGATAAC 
ATGAGTOCGACATCTTCTGAATCAGCAAATTCAGAGGCAA 

; TACT GAATGTGGGAAAGCCTTTTGCCAGAAA 

TGTGGAAAAACATTCTGTGTGAAGTCAAACCTCACT GAACATCAGAGAACACaCaCAGGG 
GACAAGCCCTATGAATGTAATGCATG TGGGAAATCCTTCTGCCACAGATCAGCCCTCACT 

gtgcatcagagaagacacacaggggagaaaccttttggatgtaat^^ 



GGGATGTAATCAATGTGGAAAATCrrCTGTGTGAAGTCAAAACTCATTGCA^ 

ACACACACAGGGGAGAAACCCTATGA 

Sequence 427

CCCGNGGTGGCGGCCGGGTACCTT ACTTAGCAGAGCACTTTGCAAACAT ATTACTTATTA 
GC^GAGCTCTTTGTAGACCTTCCACATCTGGCTGTnCAGATCn^ 

CTCCAGTT ATATTCAC TGGAGAGCAT AATCXCACACGGGTT ATTT ATAAATACAGAGCC T 
CT GATTGGACGGTCTCCTGCCAAGAACTaGTAATACCCTT GTTTT AAAATC ttcacaagg 
TAAAACTT AAAAAGCC AACC AAACAAATTGCTCTCCATTCT AC TTTTAATTGGGCCAAAC 
ACCATATCCTACAGTAGTAACA rtSTTTTI W5GAGAGTGTAAAAAACTCTGTTTACATTT 



CCTCCTCGTGGGTTGATCXiAAAATGTATAAAACTGACTGCrrTC 

GAAAGAGTGAGCTGCTGG TACC TGCCCGGGCGGGC CG TCT AAAACT AGGNGGGAT 



GGCCAAATGCAGAAACGTCCCACATGCCCACCAGGAGCAAGC TTCAAAATGTTCAGCTTG 
CGGGGCAffT>*NGCACiAGAAATMXlAGGGATGTTCC TGAAGGCCTNGATGAT ACCANTATC 
CTCATTATAAG^TGAATGCACGGGGCCC*nTGCGCTGGATACCGGCNAACCGGNTTCT 
TTMTGCCTNTGHCAGCTC TCATTCGCTGAGAGGCATAGACCTTTTTGANGATCATTCCAA 
NGCTA TA AGTGf nCTTAAGGAGCAAAAACCAGCTTCCTTG G I U IN I U 1 1 GAAGWCCTTCA 
ACTTT ATCTn^AA^TACCAAAGGGAAGGTNCAGGAAACTTTCCTCAATAACCGAMGGAC 
CTTTAGGACATGAACCAGGTGNCT GGNT AGGGGCTGGACGCCAGCCGAGGGCAAGAAACA 
NAATGGCCGATANCCGTTTTTGGGGTTCCCGCGGTACCWTTOMCCCXSGG^ 
TCTAAMAAACCAAAGTGGGAANCCC 
Sequence 429 

CGCGGTGGCGGCCGAGGTA C I 1 1 1 1 1 1 1 I I I I I H 1 1 1 T 1 I U I G ATCTCAACTGCTTTT 
A 

GCAAo ITG1 GAATATACTTGGGCTTTCTGTCTTTCCCCAAAAGCAATTTGGGATTATT^ 

CCTCCI 1 1 1 1 1 1 1 C IGCATTTCATCATAAATACTGTCATATTCATACACAGTAGCATCTT 

CTGCAAGGGCCTTCTGGATTTCCAGTTTGGTCTGTTTCATGGCCTGCT^ 

TT

CCCTCTGAAGGGTTTCACTCACAGAGGT CTCATCATCATCATCAGAATCATTCCCAAACA 
ACTACCTGCOCG

GTGGCGGCCGAKKrTACAGACAAAACTACAGACTTAGTCTGGTGGACTGGACTAATTAC^ 
GAAGGATTT AGAT AGAG T ATTTGCACTGCT GAAGAGTCAC T ATGAGCAAAATAAAACAAA 
TAAGACT CAAACTGCTCAAAGTGACGGGTTCTTTCTTGTCTCTGCTGAGCACGCTGTGTC 
AATGGAGATGGCCTCTGCTGACXCAGATGAAGACCCAAGGCATAAGGTTGGGAAAACACC 
TCATTTGACCTTGCCAi3CTGACCTrCAAAC«rrGCATT^ 

AGAGAGT AAACTT GAATGGAATAACGACATTCCAGAAGTT AATCATTTGAATTCTGAACA 
CT GGAGAAAAACCGAAAAATGGACGGGCCATGAAGAGACTAATC^TCTGGAAACCGAT^ 



CAGGCACCCGAAAGAACTTCCCCAGTATGGTGGTCCT GGAAAGGAC 
Sequence 431 

GG TGGCGGCCGAGG T ACCAAAACAACAGCCC TCCAAACAATGATGACCAG T GGAAAAACA 
^TGGAGTCACCAAAACCTGGGACAGGCTCATGCTCCAGGACAATTGCTGTGGCGTAAATG 
G TCCCTGGCC^ T< ^ < ^ AA ^ MTAC ^^^ Tl ^^ TTCCGGACT GAGAATAA TGATGCTGACT 
GCGGGAGTTCAAGAAGCTGGTGGTCAAGGAGGAGGAGGTGGAGGTGGC^^ 
GCAGAAGCTGGAAGTGG T CA T ATGAACT ACATTCAAG T AACACC TCAGGAAAAAAAAGCT 
ATAGAAAGGTTAAAGGCATTAGGATTT(XrrGAAGGACTTGTGATACAAGCGTATTTTGCT 
TGTGAGAAGAATGAGAATTTGGCTGCCAATTTTCTTCTAC^^ 
TGA 
Sequence 432

GCG GCCGAG GT ACCACTGCTTCCCGGGACTCTGCGTTG TTACCaC TGC TTCGCGGGACTC 

TGCGi rcn accactgcttactgcgttcccagca i rrcm i ctcttctcgtttcctgt 

A 

GATTCCGGCTAATGGTTrCCCCTGGCATTTGACTTCGTGATGTGTAACTGAGTC TC TT 
CC
ATGTAGGAATAGATCTTTGAC



TGAAGGGGGAAACGCATTCCAGAGCAT1 
TGCCCGGTAAATCCCGCGTACCT GCCCG 
Sequence 433 

gnggtggcggccgcccxsggcaggtacaaatct acctccccaccaaatgtccttagagggc 
caaagatggcu i igi 1 i cttcatgataacatcgcc 1 1 i c i hi m l i h i i g agacag 

G 

GTTrCATTCTGTCACCCAGGCTGGAGTGCAGTTGTGCATTCATGGCTCACCACAGCTTGA 
ACCCCCAGGCTCAGGTGATCCTCTCACCTCAGCCTCCCCAGTAGCTGGGACT ACAGGGGC 
ACACCATCAAGCCCCGGGTAATTTTTGAAATTrTTATAGAGACAGGAT^ 

CTGGGATTACACGCATGAGC0ACCA\CACCCCAC C TGTCT ATTTTACAvVn^T^^TTTi^SG^ 
CTCI MIT JCCAGCAGTCATGAAGCTGGCAAATGGCAGAACTGGAGCTAGAAACTGCTGA 
CTCCCTrrATCTTTTCCATAGCACCCCAAGC 
Sequence 434 

^CGCGGTGGCGGCCGAGGTACTTrrCTAAAAGCTCATCCACTCTATCATTTAGATATCCA 

ATTTT(^GAATGTGCTCAACATTGGCCACTCCATCTGCCATTCTTAAGTCTCCTTGGG 

AG 

TCTCCCAGAAGAATTATG TTAC T ATTGTCTTTTAG TT GATTGAAATATTCTGTATTCCTC 
AAGGCACCATCATGTTTGTTAAATACATGAATTAGTTCTCCTTTAAATCCT^ 

C CCT ATGAAAAATATAA^TCTTrrGAACAGGCTTTAAAAATTCTATTTGTTGGATTTTCA 
TATTTTGGAGCTC rr AATTGATXTTCACT ATTATTTCATGATATTTGTAAATACATCTTT G 
ATACTAGAGATCTCAAAGCACTTAAGTCGATCACATTCACCATAGCTAAGAAGGGCTCGG 
AGAAGT AAATGATTTTTTAGATACT ATTTTAAA 
Sequence 435 

CCCG<XX3TGGCGGCCGCCCGGGCAGGACGCGGGGGTTGCTCAAACCGAG 
GCCATCAGCTCGCTGCTTAAAATTAAACCACAGGTTCCATTATGGGTCGACTTGATGGGA 
AAGTCATCATCCTGA 
Sequence 436 

TCAGGAATCCTGGGAAGCACCTCCAAGTGACTGCAGATCTGGAATAATAAGTGNGGGGTA 
GATCTGCCCAT AGAGC TCACTTT AGACC GGCCT ATACTCC TACAAGGAATTG N GGT AGGG 
ATC TTNTACTCATCC TT GG CACAATAAGAATGGCCAATGCCCTTTCT AG TT GTTTGGGGG 
AAGGTCTTTGAA GGCACC ATTTNCCCCCATCCCCCTGGGGGAAGAAATGGGGTCCCT AAG 
GTAACGCCANGGI II I 1 1 GGGGGTTNAATTTGCAAAAATCCCCTTTTTfKSNGGGNTANNA 
CACAAATGGGCTNGGCAATTTNTTTNTTTNra 

kill I AACCGGGGTTGGGGGGGGGGCAAAATTTTTTGGGCCAffNTTGGCAATTCNCNGGG 
AMAAAAAATTrCCCA AWSGGGC CNGWNGTTCAAimTCCTT^ 
TCNCCCCCCMGTTTMTTTTT TTGGANCCCTTAAAAAAAACCAI ITTTTTGG 
GG 

Sequence 437 

GGCCGAGGTACC 1 1 1 1 I AGAAGAGAAAAGAATCTTGAATTGTATATATTTATTTTGCTT 

ACAGAAAAAAAT GGTTTC G TAAAT AATTT GCC T ATTTTGGTT AACATAGCACATGGAGAT 
AATT^TCTGAAAGTTATAGGGCACTGCCACTGCTGAATCAAGAGCATGCCCAATATTTGA 
GGTGGCTCTGATTTCCTGGCAGCTGAACTCGGGTAGTCCAGTGGCCTAGCTGGTCCTGCC 
CG 

Sequence 438 

CGGGCAGGTACGCGGGGAGGTGCCGCTGTTGCTGCTCGTGTTGAATCTAGAACCGTAGCC 



AGACATGGGACTGGAGGACGAGCAAAAGATGCTTACCGAATGCGGAGATCCTGAGGAGGA 

ggaagaggaagagg aggaatt agtggatcccctaacaacaagt gagagagcaatgcgagc 
agttggagaaatgtgtaaaggcccgggagcggctagagctctgtgatgagccgtgtatcc 
tcrrccgatcacatacagaagaggattgcaccggagggagctctttggact^ 
gccgaggggaccgatttgcgtgggcccacaaac>rrctttaaacaacctt ggaaataaaat 
gtg tgggacttt aaatttcaccccaan gttctttcan t naattcc tggggggcattcaag 
aaataattttctcttttattggggttmtttggggaatnhltaagccctt cggggcccgg 

CT 

TCTTAAGAAACCTTGNTGGGGGANTCCCCCNCGGGGNCTTGNCAAGGGAAATT^ 
Sequence 439 

CGAGGTACTCTGTGATTTCACCTAGATTTGGAGAAGGTGAGGGAGGAAAGGCTGTCCTNT 
TTGATCCCAT ACCATGCAGGGGCAAATGGCT GCCAGGAT AACAAAATAAGAAGGAAAGAA 
AGAAAAGTGGGCCAGGCGCAGTGGCTCACTCCTGTAATCCTAGCACTTTGGGAGGCCGAG 
GTGGGCAGATTACTTGAGGTCAGGAGTTCAAAAGCAACCTGGCCATCATGGTTGAAACCC 
CGCCCCACCAAAAATACAAAAAATTAGTGGGGCGTGGATGGTGTATGCCCTGTAATCCCA 
G TCTACTTTGGGAGGC TGAGGCCAGGGAGAAATCNGC TTTGAACCGAAGT AGGCAGNAGG 
GGTNGNCATGTTGAGCACGAGTATCGTTGCCCACTT GCACTCCAACCTGGGCCGACAGNA 
GTCAAGTACTCTGGGNNAAANAAAAAfMTAAACCAGGAAAAAAAAGNGAAGGN^^ 
GGGGGGAAAAGAAA 



GGGGCGGGCGAGGT ACGGGGGATGTC T AAAAT ATC TTGTAAAAAGTGTT AAAAT AAACAA 

AGCCAGTCAATTAAAAATTTTGACTGTTATTGAGAAAACTCCAATGAQGGAAATAATAAG 

ATCTATAAAGGTCTTAAGAAAAATATAATTTGAAAAAAACATGTGGCTGAGT GTGGTGGC 

TCACGCCTATAATCCCAGCACTTT GGGTGGCCTAGGTGGGCAGATTGCT CGAGTCCAGGA 

GTTTAAGAGCAGCCTGGGCAACATGGCAAAACCCTGTCTCTACAAAAAATTAGCCAGGTG 

TGGTGGGACACGCCT 

Sequence 441 

GCGGTGGCGGCCGAGGTACATTCTAGCTTTGAACTCAGTGTTTAAAAA7TCAATCTGGTT 
ACACACTCTATCTTCTAGAT CC C TTGAG ACACTG TCTTC CTTGAANAAG N NCCAG GTGAA 
ATGGCATTTCAGCTGTGGAAGGATTTTCTCCAGGGAATTCTTGGTGACCTCACTCATGVVC 

GTCTTCCTGTCTGCCCCCGCGTACTGTTCCTGCAGGTTAAGGCAGGACTGGAACTCCTCC 
ACAG CTTGCAGAT AGTTTTCAGATTCAAGAC T AAC TT C TCC GAG TTT AAG AT G TGCCTGG 
GCAGCATAAAGCTGTGCTTCTTTTGTTTCTTGCCTTTTAAAAATGATCTTTGCTAAAT 
C 

AGCATATC C C AGGCAAGC TC TAG GTTC C CAATC TCC TCC TCC TCATTTTC TT G AAGAGAC 

TTGGTTTCAAGGACTGAATCATTTGGCAT 

T 

Sequence 442 

TGGC GGCCCGC CCGGGCAC GT AC TTTT GCTGCTGAG GAA TG GGAATCAAAAGAACG T AGT 
CTCC TGGT AACC ACCTCAGATC TC T ATT ATT AGGCT AGATGTNGNGCN N GTACTCCCCCA 
GCTTCTTGCTC N N N ACCCTGCACTGTAAGTTGCCCTT CT ATT AGCAGCC AAGGAAAAGGG 
AAACATGAGCTTATCC^GAACGGTGGCAGAGTCTCCTTGGCAATCAACCAACGTTGCTAT 
GAAATATGCCTCACACTGTATAGCTCATTATAGGACGTCAGGTTTGTTGAAAAAAAGTGN 
GGC AAG ACA TGATT AAT GAATCAGAAT CCTG TTTCATTGGG TGAC TTGGAT AAAAG AC TT 
TTTACTTTTANAAAAAAAAANTGTCAANAAANANGTTCCCTNGGCNCGGCTCTAAGAACT 
AGTGGGATCCCCCGGGGCTGCAGGGAAATTCCGNATATTCAAAGCTTATCCGATACCCGG 
NNGAACCCTCCGAGGGGGGGGGCCCCGGGNAN
CGCGCGGTGGCGGCCGAGGTACATGAGAGACACTTTAAGCAGGCTCACAGGAATAGAGTG
AGTGCGGACTGAGATTGTTTAAGCTATCTCTGAACCCATTCCTACTG^ 

TATTGGTTTCTAACTACTACCACAGACACGGATAGCTC^CAGGTTCC^ 

GCGTTGTGGTC^GGGTTCATCGCCATCCTGCTCCACGCTGTCATAATC^TCACGCATGCG 
CXSCTGGGGACCCCTCTTCTATAAGGGACATA^ 

CCCATTGTTCGTATGAGGTGGGTGGGGACTCCAAAACCGGTAGCTCCTGCCCTACT AGGC 

CACTCTACCCCATT 

Sequence 444 

CTAAGTCAGACAAGCCAGTGCAAG 1 1 1 1 II I I II I CCTTTT4NTTTTGNGCTTACCTTCT 

GCTT AATGGAATTGTTATGGCTAAGCACATA .jAAGGCCAAAAAAGGAGTTTTTCAAACCC 
AGCAAATCAAGTG CTTGG ATTCTGAACTGCCAAAAGAAAACTGCACTTCCCGTCTTAAGT 
AAAACCGAAATGAGTTTTC TTAGGT AAATGTATTCATCAAGCCCAG natatagaaaataa 

aacccaggttawt ggtghagccgtttaggtt^cctgcatcattttccagggaaagat^ 
aaccaaaaatacxagtncgcagncgaggactcacaatgtgt^ 

ggcctggtcccacn«xgtnc<k>taatgcgcaaattccgggg« 

TTTGGWAAAAAAACGTTGGGAAAGAAATTGTTACOrrTNGCCT^ 
Sequence 445 

tccccgcggtcggggccgccgggcaggtactttactaaaatgactgcattct^ 
cttcagtct atggttcaagtcacrtaaagatt(^tttttgttgagtccttatgagaaaca 

G 

NAGTATGAATCTTGACGGTTTCTGC<XGTC^WGGCAGAGCTCTCTGACTTGGGTGTA 
TGCT ACCAGGCTGGGTTCAAGTGAGAAGTTCTGGTCAGTCTT CTGTGGGTTGAAGGTTCA 
ATATCAATTC TGTTTCAAAGCCTTTGTGA TGCT ATTTGAATCTTTGC TCGGT AT ATGC C 

CC CAGT GGGTCAAGTCTGGGACCTAGGTGGTGAGCT ATCCCATAAGTTCATTCTCAAACC 
GTCTTT ACTGCACTGTTTAGGGTCAGATACWCATTATATATACNACTTTGGGTGAGCT 
CA 

GGAG AT AAGCTTT ATGGGC TTTG GTG TTTTGATTT AT AAACAGGAG TTT ATN GAAC 
TGCCCAGGTTCCTTGGG 

GGTGGC6GCGGAGGTACGCGGGGAGACACAACTTCCTGGGCTTAGATATTTCAGAATATC 
ACAACTAAAGTCTTAAAAATTTCTGAAGGCTGGACACCGTGGCTCACACCTATAATCCCA 
GCACTTT GGGAGGCTGAGGCAG GCAGATTGACTGAGC TCAGGAGTTCAAAACCAGC C TGG 
GCAACATGGCGTAACGTCGTCTCTACAAAAAATGCAAACATTrGCTGGGCTTGGTGATGT 
GTGCGTGCAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTAGAACCX^TCW5G 
TGTAGGCT^^AGTGAGTCATGTTTGCACGACTGCAGTCCAGCCT GGGTGACAGTGTGTAT 
TAG TTTG TTTTCATGCT GCTGATAAAGACATACCTGAAACTGGGAACAGAAAGAGGTCTA 
ATTGGNCTTACAG
Sequence 447 

CGGCCGAGGTAC GTTTTGTGACAGGCAATAAAATTrTAAGAATTCTTAAGTCTAAGGGAC 
TTGC TCCTGA T C TTCCT GAAGA TCTCT ACC ATTT AATT AAGAAAGCAGT GNGC TGGNC GA 



GAGCCCGGGC TTCAGCWGTTTTGGGGTTCCGATATTAATAAGACCAAGC TGAGTCCTCCC 

TCCCAATTGGAAATATGAATCATCTAGAGCCTTCTGCCCTGGTCGCATAAAATTATGT 

CT 



CACNAAAAhTTAAGGTNCXXDTGCGCaSNGGCNGGNCCGCT^ 

CCN CCCCGGGN C TGCAAGGGAAATT ANGNATT ATCCAAAGCC TTATTCGAAT AACCCGTC 
CGAACCCTCANAAGGGGGGNGGCCCCCGGT AT AC N CCAAGCTTTTTT GGTTCCCC TTTT A 
AGTGGAGGGGTTAAAhTrGGCCGCCGCTTGGGCGTAAAATAAATGGGACNAATAAGCCTGG 
7TTTCCGTGNGGNGGANAAATTGGTT>rrrCGCGCCTT^CCA 
TAGCGAAGCCCGGGGGAGCGAATAAAAAGTTGGTANAAAGCCGTGGGG 



CGGN GGCGGC C GAGGT AC TTTTTTTTTTTTTTTTTTTG TTAG TG TTTTCTGA TG TCTTTT 
CTAACAAATCTTTGCC TGCC CAAAAG TC TCAAAAACATTC TCACG TTTCT AGATTTTT A 
G 

C TTTAGC TTTTG TGTTTGGGAC TATGATCCATATTTAGTGAA TTTATTTTTGGGGGGGC 
A 

GAGTCCATGTTGCCCAAACTGGTCTGGAACCACCACACCCAGCTAATTTTTGTGAATTGC 

GGGTACCAGCACACCGGCGCCGTCCTGGACTGCGCGTTCTACGATCCAACGCATGCCTGG 

AGTGGAGGACTAGATCATCAATTGAAAATGCATGATTTGAACACTGATCAAGAAAATCTT 

GTT G GGACCCATGATGCCCCT ATCAGATG TGTTGAAT ACTGTCCAGAAG TGAAT ATGATG 

GTCACTGG 

Sequence 449 

C G GCGGCCGAGGTACAAAAAGCAGGGGC CCAGCCC GAG C TGTTGGC T ACATGAG TATTTA 
GAGGAAGTAAGGTAGCAGGCAGTCCAGCCC TGATGTGGAGACACATGGGATTTTGGAAAT 
CA GCTTG TGGAGGAATGCATGTCACAGGCGGGACTTTTTCANA 
ACATTTTGCACATAAGGCACCAAACAGCCCAGGACTGCCGA 

AGCCTGCTTTGGTACCTGCXXGGGCGG<XGTCGATCTCCTTGTGTTCAAGCAACTTCTTG 
CGGTAGTCCTGAAGCGCCTTATCTCTAGGGTCCGCCATGATGAGAACCCCGCGTACCTGC 
CCG 

Sequence 450 

NGGTGGCGGCCGAGGTACTCCCTACGGCACTAGTCTACAGGGGGAAGGACGCTCTGTGCT 
GGCAGC GGTGGC TC ACATGGCC TG TCT GCACTG T AACCACAGGC TGGGATG TAGCCAGGA 
CTTGGTCTCCTTCGCGCGTCAAGAGATAGAAAGACCAGTCCTTGTGAAAGACAAGTCTGA 
AT GC TCCAC TTTTTCAATTCTCT C TCCATTCTTCA GT AAG TCAAC TTCAATGTCGGATG 
G 

ATGAAACGCAGACACATAGCAATTCAGGAAATTTGACTTrCGATTCTCTGCTGGATGACG 

TGAGTAAACCTGAATCTTTGGAGTACCTGCCCG 

Sequence 451

CGAGC GGCCCGCCC GGGC NGGT ACAAATGCGTTT ANGAAATG TTAG T AT AAGGC TGA TCT 
GGAC CCAAAC T AAAACAACGTT AATCCT CTT CAAATCT AATTT AAT ATAGGGAATAAGAT 

tattgaaaaaaaa1 1 1 t 1 1 1 cctgattttctttttgctgaagg i if f i t i gtagaaacca 
tggtaaaaagggaaaagaaacx:tttgactggcgggggcagggggaatacaaaaaaaaaat 
cccttgm 1 1 t i aaaatatacttgaatatcaaactcagaaagagttatttttgtgaaaga 

GGCAAAATT GG T C TTGAGC TG C TTCAGTC TAT GTC TGAAGG TTTT ACT GAAATTATGG 
TC 

CAGTTTTAGGAGAAAAATTCA(^GAAAAGTCAGATTGTAGATTTTGAGAAGGAAACTCTG 
AGGTGGTGATTTTCTCCAAGGTCATGGTTATGAAGCTGAATGAGGGCCTGAATTGCTTCT 
TCX^CAGATCCCAATTGAATGAGCGCCATTTTGCGATCTTT^ 
Sequence 452 

GGGGC GGC CG CTAA TG TNAGAAGTT AAG T NAGAACC TATA TT GT ACGAG GAAC AAAAGC C 
AATCAGTGTCCTTTTTGTC II 1 1 II I ACATAAACTTTTACT ACAAAAATTNATATATGGA 
TTTT GAATTTCCAG TCAAAC C AAATTGT AAAAC T G TTTCA TTT GGTTC T AT ATT A TGT AT
ACATAATTTATCTATTATATATTTACATTAAAATATATGCATATATAATGGATTTAATTT
CCTTTNGGNACCCCCATATNTAGAAGMNTCTTCATAA^nTAATAAATAATCTAGGGCCAG 
CATTATGTTTGCTAOACCTCGNTTTGGCTCAATACTTAAAGTTAAAAGTTTCTGTCTTT 
T 

TTCTTGGACTTGAAACTGCCTANAGCGTCAGCCTCTCTGTTATT7NTNTCTATTTNCTT 
T 

TTCCCCCATCAGTCTTT TAGCCACTTGAAGCCAAAATTCTTAGTTTCTGTCCTAGTNGA 
T 

Sequence 453

CACGGCTGfWAGGTAATCTCAGNTTtXX^TGTAAGGTTCGTTCGGCTCCAAGNCTGGGCC 
Sequence 454 

NGAAGGOGGACGCCCGGNCAX%TACGCGGGGACCTTl>IACGGGCGGGGGGAGCTGAGGCT 
CCTGMCG*nATCTNTGATCCTTGCACOCTX^^AGGAAGCT 
GAGGWCTNCACATATTGCAGAAAAGAAACCACTTTGG^ 
ACCGGTCACTTTGGAAAACAGGGGTGGGGAAGAAGCTX^ 

AGGGACCAANTCT AACCCAGGTGAGGNNAACCNTGGTNGATGTAAAGCCGGTtXjCTTTGG 
AGGACAGAATCATCTAAGTGGGAANAAGATACAC T AGGAAGGGNGCTGGGGGGANTACCA 
TCAA<SAGGGAGGNG<3GGATNACCTTCAGGCCGGGGGCTTNCGGWGGGGATGAAAGAAGGA 
ATGG<>*CCGGACAGGTTTGNGGGTNGGAGGGTATGAAGGCTTGGCNAAATGGTGGGGAAT 
TTTCGTAAOrtTCOGOCCXSGGTTTTTACA^ 

AGGGGAAATTTTCGANTAATGCAAGGCTT AATANGAATTACKCGGGGGGACACTTCGGAG 



>GCC GCCCGGGCA GGT><CGCGGGGAGGATCTCTGTCT*nTGTTCCCTCA 

(XAGTTCCTCT6CATTTGAACAGCTGTNCCCCCACCCCTCAATACCGTTTAGAGCAGAAG 

OCAGCAAATACTAATCGGTCAGGGACACGATAGAAACT ATTTTCGGCTTGATGGGCCACA 

CAGGNCTTCATTGCAAGCTCCTCAAATNTGCTGTTTGTAGCTAAGGAAAGAAKCCATTAT 

ACCKTGTGTNAAMCAAAAATGAAATATTGGCNTGTGTGCCAATAAAAAACCTTATTNACA 

AAGATTAATOGAGTNGGGCI^GGATATGACTTCACNAI^ACTGGTTAGT^ 

CCCTGGMTMCTAGYIAGTTAAAAATCCCAAAAACTNCTTATTAGTCCCTCCC 

Sequence 456

CGGCCGAGMACAACATGACATTTTT AACCAATCCAATCT AAAAATGTTGCCAGAATCCAC 

CTGTGGCCCNGAATCGNG7>1TTGGTTCCTCTTTCTACTCC^TGCAGANGACCAAACCTG 

TCCCGCTGCCACTTTGCTCACTGATATTGGGAGGAGGGCAAGGCCCAGGCGAAGTTCCAC 

TAAAAATG OCCCAG GAGAATAGGCACCMGGCTGGCTTGCCAAAGGGTTTMGGGTTTTA^ 

GCTTTCTGI 1 1 1 1 ICI 1 1 ICCCCGACAGCACAAAGAAMGTAAAGGGGCAGTTAATTGGAC 

AGAGTGTTATTTTAAAACATCTCTAATTGTAAATGMAATGTGGTTGGTTTGGGTTTCTA 

C 

TGCAATTGGTGNGAAGCCATGCCGGNGGGGAAAGAAGAAACmGACCCCAAGGNTAATTG 

AAAATNGGGAGNCCCCCTTTC 

Sequence 457

NCGAT ATT AC TGTGC GAGAGG T AAAGGAT ATAG TGGC T AC GATT AC NGC C TCT CT 
Sequence 458 

CCCCGCGGTGG CGGC CGCCGGGGCAGGTACACGACAAAACTACAGACTTAGTCTGGTGGA 
CTGGACT AATTACTTTGAAGGATTT AGATAGAGTATTTGCAC TGCTGAAGAGTCACTATGA 



GCAAAAT AAAACAAA T AAGAC TCAAAC TGC TCAAAG TGACGGGTTC TTGGTTGTCTCTGC 

TGAGCACGCTGTGTCAATGGAGATGGGCTCTGCTGACTCAGATGAAGACCCAAGGCATAA 

GCTTGGCAAAACACCTCaTTTGACCTTGCCAG^ 

accaacattaagtccagagagtaaacttgaatggaataaccgacattccagaagttaatc 
atctggaaaccgattttcagtggcgatggcatgacagagctagagc tcgggcccag 

Sequence 459 

G Gt^G CCGCCGGGGWGGTACGCGGGTCGTGM^TGGTTAGTGAAGGCTTTQTAGCTGAGC 
AGTTTCTAAATAACACACCCACTCAACTGACATA^ 

CGGTTCAGGAAGGAGAACTTTGTGTGTTCTTTCGGAATAATGATTTTAGCA 

aatacaagggtcaactgtatttgttggtaacggaccaggggtttcttactgaagagaaag 

. TTGTTrGGGAAAGCCTACACAAOGTAGATGGTGATGGAAATTTCT^ 
ATCTTaWXTTCCTTCAGATCCTGAAAC^ 

AGGATTATCTTATGGCATTATCTCTACAACAAGAACAGCAGAGCCAA^ 
AACAAATCCCGGAAGGAATCAAGTGATTTGGAACTAGCAAAGAAAC T 

GGCGGCCGGGTACGAATGTGCAAATTAAAGCATGGTAAACTGATATTTACATAM 

AACCAACAATTAGTTTATACATTGTCAATGACCTTCTAAGATATGTCATGAGTG 

A 

AGAATATCTTTCCCCCAATGGAGAAGGTATTCAGA^ 

ACACACATCATAGGCTTT ACCTGTTTGACCACTGCCTCAAATGTGTGAGATGTGATTT 
TA 

TGATCCCGCGTACCTGGCGGGGCGGCCGCTCGAATAGACTTCAGGGAAACAACACGTCCT 
GAAAGAAACATGATTCCCCTCAAGCCACAAAGGATTTTC TCATCAAGTGTTTTCACCT CT 
GCATTAGATTTGGACACAA GAAGA GGAGAGGATTTACTCAGGTAAAAATAG 1 rCTCTTAG 
TC TCTTCCTCT AGTTACTAATTTTT AATTTAAAAATACAATT AACTG^ 

AAGTCACAAGACAGAAAT AAGCT AAGTTCTCTCTTN C TTT AGGGAACGCTGGTGGCAATT 

CACCA 

Sequence 461 

GAGTTTGAGAAAGCTGCAGAGGAGGTTAGGCACCTT AAGACCAAGCCATCGGATGAGGAG 
ATGC TG TTCATCT ATGGCCAC TACAAACAAGCTACT GN GGGCGACNATAAAAACAAGAAC 
GGCCCCGGGGATG TTGGACNTCACGG GGCAAhKXXiCAAGAfnTGGANGCCTGGGAANGAG 
CTGAAAGGGACTTCCAAGGAAAGNAN0=CATGGAAAAGGCT^ 

NAAGAAGCCTAAAAGAAAAAAAATACNGGGAfn' AATGAGAGCACMTGGATTTTGGGNTAC 

NTGTGCCCCATGTGTTTTATTCCTAAACTGG^ 

N 

ACCCGNTGGAATGG TTGGGGAAATCTCTGGGGAAAAATAANCCAG NTAAAACCAGCT ACC 

TCAAGGGCNTGCTCACCCATACCG 

Sequence 462 

AGCCCTCGCGGCGGTGGCGGCCGAGGTACGCGGGATATTGTTCCTGATTTGCCTGATGTG 
TGGACGGATCACCAAGCGAGTGACACGAGAGCTCAAGGACAGGCT ACAAT ACAGGT C AGA 
GACAATGGCTTaTAAACGTTTAGTGTGGTCTCAGGATGTGACAGGCAGTCCAGCCTGACC 
. TTTCTGCACACTCX^GACAAACrrTCCCAGACA^ 
GCGTGGAAAGTTATCACATTAAAAGATGGAGGATTTAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAGTACCTGCCCG 



GCGATNCCCCCTGGGAAGCTCCCTCGTGCGCTCNTCCTGNNCCGACCCTGCCGCTTACCC 
GGATACCTGTCCGCCTATTCinCCCTTCGGGAAAGCCGTGGGCGCTTTCTTCATAAGCCTC 
ACCGCTGTAGGNATCCTCAAGNT03GGTGTAAGGNNCGTTCGCTCGAAGGCNGGGGCTGG 
NGGN GCAC N GAACC CCCCC G NN C AAGAC C CGACC C G G TGGCGCC TT AAACCCGGAAAACT
AATNCGKCfTTGGAGGTCCCAAACCCCGGGGNAGGACACC^
AGGCAGCCAACTGGGGTAAACAAGGGATTAAGCAG 



1 1 1 1 m n 1 1 ii r 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 

GCCNCCACATNTTTATTGCATACttCA^ 

ATTNGXiAGACCATGCCCACTTACAGAATGCANCCGN^^ . 
GGKTGGGGMGCAAGATGAGAMAAGTTTCANCCCC^^ 

AAATTNTOCTGACAAGGMGGAACCGGGCTTGNGCT AGNGMGGNCTGAAAKAATTCCTGGC 
AAAMCGTAGGGGGAGATTAAATCTCGGAATTGACAGCAAGTT^ 

AGAGGGGTCAGCCTGTGAAATTTGGTGCCTGCGGGAAC TT C TT G ANGCCCCAA T GWGGGG 
" GCAiXWCTrNGAGANGATNGGGMTAAATTTAhtGC^GGGGATNTTT^ 
CCAAOCAAAAAAGGG 




CAGACATAAGTTCAGAGGAA TC I G I C 1 CCACTGTAGAAGAACAAGACAATGAAACTCCAC 
CTGCTACTTCKAGTGAGGCAGAGCACsCCAAAGCGGGAACCf^ 

AAAATAAGTCTTC TGAGGAAACCAAAAAGGATGAGAAAGATCAGTCT AAAGGAARAAAAW 
TTTTATNNMATTAAAAGT ACCTCGGCCCGCTCT AGAACTAGTGGGATCCCCCGGGCT 



AAGTGCATTrAGGTGACATGTTTAAGTTAACTTGACTTC^TTGAATGACCTAGTTAGTA 
A 

ACTAGTCAC TAG TAA ttcgg TCACCAAGCAAATCAAGCCTGCAAGAAAGGAAGCCAA T AT 
TCAAAATGCCATGTTACCATCTAAACC 



TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATTTT A ' I IG I C 1 ACCTCTCTGGACTTG 
CTCCCAGCATCCGGACCAAAACCATCAGTGO>CAGa>^ 
TTGACAACCTTC TGGTTTCAGA T( 

GATGAAGGGGTCTTCGACAA n T TGTTC I CAAAATCAGAGATAI 
CCACTGGAAATAACCCTACTTGCCCCCGAACGTACCTGCCCG 
e471 




TGGCGGOCCGAGGTACGCGGGGAGGTOGGTGCGCGCTTCTCCCGA 
AGTCAAGCGCCGGC G l rCTCTGCCGTCACCCI I I CC I TGC 
* e487- . . 

TACTT ITITIlll III NIIITIIliriGAGACAG 

TCTTGCTCCATCACCCATGCTAGAGTGCA<rrGGAGTGATCT 

CCTTCTGGGTTCAAGCTAT TCTCC TGCCTCAGCCTTCCAAGT AACTGGGATT ACAGGCAC 

ATGOCACCACGCCCAACTAATTTTGTATTTTTAATANAGACA^ 

C 

CAGGCTGGTCTTGAACTTCCATCAG^GNGATCTGCCCT 

TTACACGCATGAGCCACCGCOCCTGCCTGATTGNGTTCCTTTCT 1 1 I IGI 1 1 
CT 



GT 

CA CAAGTA GOGC \ U I GACTGGGTTCTGOCCGGTAGGAACGTAAGCAGAAGTGAATGTGTA 
TCACTTCTAATGGTGTGGGMCTGCCMAAACCTTCTAAAGGGGTATt 



TTGGAGCTCCCCGCGGTGGCGCTCGGTGTGCTGNGCTCA^ 
**6ATCGGOIAGTGCTCTGACTGCGTGGCCGACAAI<NGCTGMCGNAGAAAGAAA 
OCTGAAACATGACA GMGAGTG NTGWAAAGTGTGGAAATGCC 1 1 C H AAACTTNATNAAWG 
THAAKTCAAANHACAI I 1 1 1 1 1 1 1 CAAAAANATAAATTTAGAACT AAKTGNACCTT 
Sequence 469 

CGGAGGAGAATGGTATCACTCAGGCTCTCAGAGTGACACTGAAGCAAGACACTCATGGGG 
TAGGACATGACCCTGCCAAC^SAGTTCACAAACCACTGGTGGAATGAGCTCTTCAACAAGA 
CTOCGOCCA ACTTGGTAOTGOAAACTOGGCAGGATGGaCT 

ATCTTCTTT AGAACT AAGTTCAT CTT AAAAATTT AAGAAGGTGGACATTT CAACACCA T 



ATAGGATGGAGCAATTCTATCTCACCTCAGGCCTAGCA 
CAAAGGGCTTCAGTAAACCACTGGAG 1 1 F OCI I CATTAGGATTCCATCOCAGGATATCCA 
GAGGACAAGAGGCTGGCCAACTGCAGGATTAGCCTATC A 
CACGCAAQaGAAAGCI raGGTOGGATCTCCTQATCCCGCGTACCroCCC 
G 

Sequence 472 

GCCGGGCAGGT ACT ATGGG TGTAG TGWT ACT ATTACAGTTAATNCW I CC 1 1 1 GTAGTGCG 
CTGWT AAATGCAGTGAGGATTGGAGCACTGTCGACTGA G TCTCT GT GC 
Sequence 473 

CAAAAT AATT ATAATGTATT AACTCAT AC TGCCTGTCTTTT AT AGGGGAAAAAAATAAC 
C 

T>ITTTTATTTTAAAGTTATAAGGGGGNTTACCTTIiTAGI*GTGCr^ 
AT 

TAGCCTACCCCATTITGGTCTGGAACAGAAGACTT^ 

TTWACTARTT AAGGCAAGATCATGC I ' I N f G I CAGTTKACCCCAMTGKTTGGAATACCGTG 
NAGACCGA7 CGT GGC TCG^*CTACAGCCTCX^TGTNCCX^AGGCTTCCiAGCAGGT 
Sequence 474 

GGCGGCCGCCCGGGCAGG T ACGCGGGGGAGC T GAGCCGGTGGG T GAAGCGGCGGCCACGG 
CATCCTGTGCTGTGGGGGCTACGAGGAAAGATCT AATT ATCAT GGACCTGCGACAGTTT C 
TT ATGTGCCT GTCCCTGTGCACAGCC 1 1 1 GCCTTGAGCAAACCCACACAAAAGAAGGACC 
GTGT ACTTCT AAAATTGCACTTTATGTTTTGTAGGCTT GGAGCTT CTT GATT ATGGGTT 
T 

TTTCGTT ACAAAATTCAACAACAGAATCAA T ACTTTGCAT AAACATT ATGGATGCTTTTT 
CTGTTTGT ACCTCGGCCGCTCT AAAACT AAGTGGATCCCCCNGGCTTGCAGGAATTTCGA 

GGT 

Sequence 475 

TTGANGCCCTCOCCGCGGTGGCGACAGGGTTACATTGGT AAGGGTGACAGTT AGAAGGGC 
AAGTCCTTTT ACTGAAAT AGATGAGAGGTTTTAGATCTGCACAAACCTTTTTCATGGAAG 
TCCAACTTTGCTCCTGGGTAGTTTAAAGGACGTAGTCCCATGTACCT 
Sequence 476 

NGGC T ACACGCT AGGAACCTTGCAGCTT ACAGTGACAGAGC TCCCATTCACGAGGCCACC 
ACT CAT CT CGATTTCTGGATCTCT AGGGAA TGAGT AGAGCTCCACCTGGATTCCCTTT 
TC 

CAGTTTCTTATG TCCACAAGTCACTG T GCACaGaTMjGAGTGTTCGTTCTCAAAACTCAC 
AGGGCTCAGGGTGATGCGTGGAAATTGGGTCGCCTTCAC^ I rCCCCGCTTCA 

GAGGC^TTGTCTATCTGGGTTCTCCACKXJaGAAAGATGGGGAATTCAC^
T ACCATG TCAACAATGACTGAAGTCTTCCAATCTGAGC caggcaaattccnggnggg tcc
^GG GGGG AGAATCTCAAACAGAfTAAAATGGGTTTTCTCTTGGAACAAATTAAATTTCCCA 

ccrcm 1 1 ini rGNi rm icccc 



NGGNGGCGGCCGCCCGGGCAGGTAC I M H II I H H 1 1 1 II III H H 

TAT^A^AAAATGATTrTTTTAAGTTT GAACTTrATTGGAAGGAGTCCCTCTAATTCAC 
Aa,i 1 1 l^TCCTAGATAAATOGGTAAGAACCACATATGGAAT ATAAAGCATTGATTTTTT 

AAAACCACATAGTAGCACAGTTGAAAGAAATGC^TTCTCCAGGGTCTTAGAG^ 

AGGNGGCATCTTAGGGWGGGTCCTAACGAAACCCAAATTACCAGGTCTCATGGGTTTTCC 

TTTTGGGTTCAAGGATTAGAAAGGAGTCAGNGGTT^ 

GGGGTAGGAATATTGAAACC TTTOCTAC TT AGTCCANCAGGTTTTACCTGGTTCAAGGGT 
GGGHCCCCCAACCAAGGTTC HUH lATCTTTCAAGCCCCCATTCTTTGGCCCTCTT 

GNGGGGGGTGG 
Sequence 478 

TCCCCGCGGTGGCGGCCGAGGTACCTGCATCAGGGATAAGAACCCATTCCCCTCCCTTGT 
TCCGGTGTGCTCTCGCCATTOCACCATCCATGAGACGCACTCTTGTATAGAAGTAAAATT 
GCCTTOCTGAGAAAAAAAAAAAAAAAAAAAAAAAGTACCTGCCCO 
Sequence 479 

^^CCCGCGGTGGCGGCCGAGGTACGC6GGG<WTXrrGGCCTGCATCTCAGCTG<>CCGC^ 
TCAG NGTAAATAQAGCTT AAAGTCATGGTTTGGCTG CATAAAAATTTTCTAACTTGGGT 

NAATATTTGTAG NTGAAGTATCTGC TTTCATTTTTTTCACG TTATAAATAAAAAT AC TAT 
GCTGGNCGGGCGCGGTGGCTCACACCTGTAATCCCAGCACTTT GGGAGGCCAATGTGGGT 
^^^^^GAGGTNAGGAGTTCAAGACCAGCCTAGCCAAGATGGTT5AAACCCCGTCTCTAG 
T AAAGAT AAACAAAAAATT AGCT GGGC 



GC GGTGGCGGCCGCCC GGNCAGGTACAGATGCAAAC GGAGG TGT AGACTGNGCAGC TGCC 

AAAGTGGTGACAAGCAATCCAGAGGACCATGAAAGGATCTTAATGCAAGTCATGAACTTG 

AATGTGCCGATGAGGCCTGGCATTC TTGTCCAGAGACAGAGT AAGGAAG TGTTGGC CACA 

CCCTTAGAAAACAGAAGGGACATGGAGGCAGAAAAAJU^^ 

CTN 

Sequence 481

ATG TTTTGTGG CCAAGG TGAGGGCTGCAAGT G TTTT CTAAGGGTTGAAACATCANAATAA 
AGGTATGGTGGCAAGTCCTCCTTCTGCTAGGCTGGCTGGCAAGGCCCTATGTCTTGACCT 
AGGTGGTAGTTACAAGGGTATTTTATTTTGCCTTATAATAATTCACT AAACTATGTTATT 
TGAGTfMGATTTTTATGTNGTGNGNCNTTTTAATTT^^ na 

CNAAANGTTGCNCTCNGNCTCGGNTTNTAAGTAAACCTAAGGTGGGA 
Sequence 482 

CTGAGAGATCCCC TCAT AATTTCCCCAAAGC G TAACCATGTG TGAATAAATTTTGAGC TA 
GTAGGGTTGCAGCCACGAGTAAGTCTTCCCTTGTTATTGTGTAGCCAGAATGCCGCAAAA 
CTT C CATGCCT AAGC GAACTGTTGAGAGT ACGTTTCGATTTCTGAC TG TGTTAGCCTG GA 
AGTGCTTGTCCCAACCTTGTTTCTGAGCATGAACGCCCGCAAGCCAACATGTTAGTTGAA 
GCATCAGG G CGA TT AGCAGCATGA T ATCAAAACGC TC T GAGC TGCTC GTTCGGCT ATGGC 
GTAGGCCTAGTCCGTAGGCAGGGACTTTTCAAGTCTCGGAAGGTTTC TTCAATCTGCATT 
CGCTTCGAA 



G C GGTGGCGGCCGAGG T AC TC TTC AAAATTG TCAAGG TCATGAAAGACAGCAAAAAGTGA 
AGAATTC TT ACAAACTAGAGGAGACAAAGATTGGAGAAGAAACAATGAC TGGC N GGGCAC 
GGTGGCTCATGCCTGTAATCCACTTTGGGAGGACTTTGGGAGGCCGAAGAGGACAG^ 

TCTT ag^ttgggagttggagaggagcctg^ccaacgtggagaaaccccatccct ACTAAA 

AATACAGAATTAGCTGGGTGTGGTGGTGCATGCCTATAATCCCAGCTACTTGGAAGGCCT 
CGGCAGGAGAATCACTTGAACCCGGGAGGCA>4AAGGNTTGTGGTGAGCCAA 
ATTGCACTCCAGCCTGGGCAACAAGAAGGCGAAATTTCTGTCTCAAAMAATAAAIMACAA 
AAAAAATAAGT ACCTGCCCGGACCGGCCCGCTTCTANAACTAGTGGGATCCCCCCGGGCC 



CGGNTCC CCAAGCTTTTTGGTTC 
Sequence 484 

GATGTGAACAAATGTGTCATTGCTCTCCAAGAGAAAGGATGTGGATGG 

AGCTGGWGCAATTCGAGGCCGGGCAGCCCGGGTCATTCACGTAGTCACCTCAGAG^ 

CATCGAXXX3GCCGCCCGGGC*GGTCACAAGCTTTAT^^ 

GGCAAACAATGAAAGTAGAGTCGCTCAGAAACACGAAAGATCATATGT GT GTCATCACAG 
CATCGAGAATTTAAATCATCTGGAAGTTCCTGCTAAATTAAAGCATACTGTGCCMKAGCT 
CCCCnCTAATCAAAAAACGCTTGTCCrGGT4GAAAAATTTGCATGNGGGfrTT ACAGAGAGA 
GAGATCAACCAGGTGAGCVVAATCACAAGACTCTTA^ 
ClACGAAAAAAATAAATTAGCCATAATrTGGTT 



TCCCGNGGTGGCGGCCGAGGTA C 1 1 1 1 1 1 H M H 11 » I H I M 1 1 1 GGGGAGGATACT 

TCAl 1 1 1 1 ATTTTATATCGTGAGGTATTGTTTGGATTGTT ACAATGAACTTGCATTTCTT 
TTGTAATGAAGAAAATAATACAGAGGAAATAAGAACAACTAAACCTTTGGCCTGGGATTA 
TCATCC GGGC T GGGAAATTCATGTTGGGATGGCAAGGTTTTT ATTGATAACAAGGTTATT 
TTTTGGGGTTTATTATTGCAAAAAAAAAATTGTTCATTGGGAATTG CCCTCCTATTTGG 

CTTGGGCACCnGCCCTAAGGGCGACTTTTCACCAAGGG™ 

T ^CCAAAC CAGG CXCTATTGGAAGGGGTAAATCAATTGGGGTCCCC>AGGTTTTACCAA 
GGAAAGCCCTTTTGG GGGG N GGGGGGAAGAATT ATTTGGG CTTTGGGATT ATTACTTTCT 
AATTTTGGCCCACCACCATT IIHI I GGTTGGGGCAAAGGACCGGTTTCCGGTAATCCGG 
GCTTGGGTG<V\TTTCAC^TTGC^TCAAAAGGAAGCTTCTCATTGGGGCCAAGGGAGGTTT 
CCCT AATTTGGTTGGCTTGGNAAAG GGAATTTCAAAATAATTCAAAAAATAC TT AAGAAA 
TTTTTTNCCCCCCA 
Sequence 486 

TGGCGGCCGCCC GGGGAGG T AC GCGGGA G TG T GGATN G AACA GAAAATTGGAAATCAT AG 

TCAAAGGGCTTCCCTTGGTTCGCC^CTCATTTATTTGTAACTTGACTGGGGTTTTTTCT 

G 

CTTAAAAATTTCAATTCTCGTGGTAACAACCGCAGAGTAGAAGGAGAGGGTGACTTTACC 
GAACTGACAG CCATTGG GGAGGCAGATGC N G GTGTGGAG G TGTGGG CTGAAGGTAG N N GA 
CTGTrTGATTTTAAAAAGTGTGACTGTCAAGNTTGT ATCTGTTGCTTTTNTCAATGATT 
C 

AAJ^KSNGATACAAAATGGGGCTTCTNTCAWTCATTTAAAAAGGAAAAACGCCGACCATCCT 
TTCTAAGGArrCTCTGTGGGAAAAATGGACTGTCAATTAA^TGGCGGGGTTTT 
Sequence 487 

CCCCAGGGTTCAGTCCT(^AGGGGCCATCCTGTCXCACCATGCAGTGCCCCTAGCTTAGA 
GNCTCCCTCAATTCCCCCTGGCCACCACCCCCC AC TCTGTGCCTGACCTTGAGGAGTCTT 
TGTGTGCATTGCTGTGAANTAGCTCACTTGGTGATATGTCCTATATTGGCTAAATTGA 

CCTGGAATTGTGGGGGCAATCTATTAATAAGCTGCCTTAAAGTTCAGTAACTTACCCTTA 



PCT/US00/33312



Table 1 



GGGAGGGCCTGGGGGGAAAAGGGTTAGAATTTTGTATTCAGGGGTTTTTTTGGTCT 
TGCCCGGGGCCGGGCCGCTCTAAGAACT AGTGGGATCNCCCWCGGGCTGCAGGGAATTCG 
A TNTCNAAGGC TTAATCGAT ACCCGTTCCGACCTCGAAGGGGGGGGGCCCGGTACCCCAA 
NCTTTTTGGTTCCCCTTrTAAGTGGAGGGGTTA 



TCAATA II J G I CI I ATTAATA J 1 1 1 ICI I ATTTTATAATGCAATTACAACWGWTTTAGGA 
GAO*AAACAATATAAACAAAAGAATGTTA>ATAGG 

^^^^GCAANGGAAAGTCCATAATAANTCTTATTCC CCCCCAAATATTAAGTTTTATT 

CTTT>K3CCAWn'AGAGACCCAAAAAATAGCTT ATTGGGGAAAAAAATTANGTT ATTTAAA 
AATANGCCTTAAAAACCACCAAGGAAAAACCCTTACCAGGGCNT ATTAAAAATTAAACCA 
ATTAAAAAATTAGCAAGGGTTTAAAACTTTTTAAATGGGNGGGATM 



NGC»^CCGAAACCTGGTGAAGCCCTTTGGGCGATrG^ 
AGCTGGCTGCOXXXX>TCCGGGCAAGC*GGGCCAA 

^GCACCCAGT AACAGCAC<>GGTATTTCTTCTGGGTAAATGAAGAGC CTTTC GAAAAAAC 
1 1 iu 1 1 *»CCCTCAAAGTATTTACCAT AAATTCTCTTTAAAAG TG GACATGGTTCAAGAA 

CAAGNGGGCTCAAGAAGTTTNGAAAGTAA-^AGNAG^ 

TTGAAGTTTTGNTATAACTTTTGAAGCCCTCTGGCCCCTTT^ 

G 

GGAGGAGGTCCAAAI 1 1 1 1 1 1 ICI I I I NGTTTNCCAAT At 



NCCGCGGTGGCGGC CGAGG TACCTGATTTTATTrCWACTTTTCATC 

AATGGGACGATTTTGCTTTTGTTTCTTGGCCAGGAATCGCTTAATCCTGA^ 

TG 

AGAAGACATGGCGAGCAGC^GAGTOVAGAACACACCAC^ 

GGCCCCGCGTCCTGCCCG 

Sequence 491 

ACTCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAAAATAAAAAGGAGGCTGGTGGGAG 

AACTGCTTGAGCCCCAGACTTTGAG^TTACAGTGAGCTATGATCAC^^ 

GGCCTGGGCGATGGAGCGAAACTGTCTCTTAAAAAATGGCAGGGAGTTGGGGAGCTGGGC 

AGGTGCAGTGGCTC>TGTCTGTAATNCCAATACCTCTGG 

ACTTTGAGCCCCAGGAGTTTGAGACC^ 

Al I r I FAAAAAAfTTAGTC>TTNCTTACrrGGGTGC>nTCCCTGTNGTNCCCCACTTCTTT 
G 

GANGGTTTtJNGG^ CCAAGG 

CTTTTTHATTTTTACGCCTTGGCTTTTAAACCTTGG 

CTTTTAAAAAAAAAAAAAAAANGGGGGGNGGGGCNCNCGCCCTN I HI I H I I IGGCCCA 

ANCHCCCCMNA 1 1 1 1 1 1 1 1 I I T 

N 

Sequence 492 

TCCCGCGGTGGCGGCCGAGGTACATGAGAGATAATG1TATGACAAGAATAGTTTCTGCAA 
CATTAAGTATGGGTCAAAAAAAGAAGAAATGGGCCAGGCGCGGTGGCTCATCCCTTTGGG 
AGGCTGAGGCAGGTGTATCACAaGGTGAGGAGTTCGAGACCAGCCTGACCAATATGGTGA 

aaacccatc tctact aaaaaaaacacaaaactt AGCCAGGCAT GGTGGTGCACGCC t g t a 

ATCCCAGATACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCGGGGAGG TGGAGGTTG 



CAGTGAXXCCGAGATCACGCCVUITGCATTCCAGCCTGGGCAACAGA^ 

cccaaaaaacaaagaaatgactttagacaaatggcttgaatgaaattacaaagaggaggt 

gcattaaaaaatcccagcagta^aawcttttgaagaattaaaatg^ 

ataataaatgttcttttt 

Sequence 493 

pccgcggtggcggccgcccgggcaggtao^gggggtg 
' ' 1 * ggaagtttgtggcggagttctgtgatatgagcaacaatggaccagaagattttatc 
tctagcagcagaaaaaacagcagacaaac tgcaagaatttcttgggcagggcctggggaa 
TGC H I M I ^tctcatattagtgcctgtgatggcatctttcatctaacacgtgcttttg 

agatgatgatatcacgcacgttgaaggaagtgtagatcctattcgagatatagaaataat 
agatgaax^gcttcagcttaaagatgaggaaatgattgggcccattatagataancta 
aaaggtgnctgtgagaggaggagataaaaaacta a 

Sequence 494 

cgcxsgtggcggccgagctactcatggttgctgt aaattaaggcagccgttctgcaggg TT 
ttgcttac-ccaggctcctctgac^tctggctatc 

GC 

GTACCTGCCCGGGCGGTTCG 
Sequence 495 

AGATCTCAAGATCTGGACTTCTGTTGAAAAATTTTCCCGTGAGG 

AAGATGGGAAAAAAAT ACAAGAACATTGTTCTAC TAAAAGGATTAGAGGTCATCAATGAT 
TATCATTTTAGAATGGTTAAGTCCTTACTGAGCAACGATTTAAAACTT AATTTAAAAATG 
AGAGAAGAG T ATGACAAAATTCAGATTGCTG N CTT GA TGGAAGAAAAGTTCCGAGGTGAT 
MCTGNTTTGGC>CCAAMCTAATAAAAAATTTTCGA> 

CNGH CTTGAAAM CHTTTT AAAAAAAAAAGAAAAJ^GGTTTAu^AANNGTAAAAGGGGNCCCC 
CNCCCi i 1 1 1 1 lAAAAAAAGNNGAAAAAAAGGGGNGGGGGGG 



CGCGGTGGCGGGCCGGCCGGGCAGGTACCGTGAAAAGGGCACTTCTCGTT GAGAAGGCCT 
GACAGTGTC GTT AATGT CC TGC T GGCGCATGG TGAAAATTTCAGGGCAACAG T AAAGCAC 
i u 1 1 1 AATTTCCCTTCTCCAAGC CCAAGCTTTTGCAGGTAAC TGGAGCGCTTCCTC 

TTGCATAATAGGCAGTTTCAATAACTGGGGAC 
Sequence 497 

CCGCGGGTGGGGCCGGCCGAGGGTACNNNGGAGGCCTC^TAANGGCNGGGNATCNTWiAG 
GNTGGTATMGNACTGNTNANAAAGCCNNCATGG^ 

CAATTGWNGCNGCGAAACAGGCAACAGANTCTGNCATTATAT AATAAGGGCGTGGTACX3G 
TTGGGGAACCCCGNANGANTC Nf>fT ATGG TCC TT GNTTN GCAAGCN NT GCA TTTT AAA TCA 
GACGA(XGTNAATTTTJTTA>ICCCCAANCCTTN ttan aataaatcggcaatcgcgcaatat 
CTCATCATTNANCHACTGTGGACGACTTGACAATCTTAGTGGCTTNATGGACTTATTGCA 
AAAC TCGAGAAAGAACAAACCT AGG GG TGCGCCCT GACCTTCGGAATAA TT CG TAAGC T A 
TATGTGAGAAACTAGCAACAGGGCGTTTCATTTATG^NAANGGGACGCGAANTGGANGA 
TAATT A TGT AANAAGNGG GCCCC T AC GANTTT GGCC CCT AGACGCC CAG GGAAACC GCGG 
GGC N CCATGCATN AC NCAMAC TTAN GG NAG GGGT ANTT C TCCN CACAC N CNTCN TTTTC G 
ATTTGGANAATANGCTGGGAATNAATCCTACATGACCTGTGATT^ 
GCCGGTACWGNNCCCCCCCC^GGGGGGGGGGGGGNCCCCCCG 
i i 1 1 1 1 1 GGTTTCCCCC 7 1 1 T I NAGGTTGGAAGGGGGGGTTTNAAATTTTGNCCGGCC 
GC 

CTTTTGGGGCCCGGTTAAAAT
itacacgggccttccacttcagctgact
gaatttaggcagttctggccacttcagtttccgcacccaggcctcctgacccatggtatc 

TACGATGAGATCC 



GTGGCGGOOGAGGTACCTCAATTGATGATTTCTGGTATGACCTAGCAAATACACTGCTTT 
C ACTGA AATTTCAGTCTTGCAATCTGCTTTGGG^ 

CATTTTCCCATCACTGGACTTCCAG O tTSm rCAATTTTTCACTGTTACAAACAAGQT 
G 

GCAACATTTATCTACAAACCTCTTGGATATTACACCGT AGGNAAGCTTTCTGGGTT ATT 

CCACCTAGTGAAACCTTGCTCAAGTTTGAAGGGGGTAMTGTTGGGAT>ICTTTCATC^ 
TT 

taattaaaattatttaccaaccatgttgaaaaagccccgaccaatggtcaagggactgng 



NTANACAAGGGTGGCTTGGGGGGACCTCAAGTTTTGGGGGTT^ 
TTTAGI^TTATTAGCCAAGGAA^^TG1TCTI^ATAAAAAT^GGGTNTT CTTGGATTT AGG 

AGACCAANGAAGTTAGGTTNGGGGGGAAAT 
Sequence 500 

CGAGCCGGGAGCCATTNANAG1TGTTAAAAGCCTNGGGGGTGCOCT AAATGAG TGAGCCT 
AACCTCACATTTAATTTGCCGTTTGCCCCTCAAC TTGCGCCCGCTTTTCCAGNT CGGGGA 

aaaaccttgtccn ttgcw cagcttgcattaaatcgaatcggmcccaacngccgocggggg 
gag^sagmgctggatttreccgttatttggggcggcttktr^ 

CAACTTGHACTT 



ACATACTAGCNNGGGTAGCATAAftAGKTGTTAAAGCCTGGGGG T GCCTAATGAG TGGAGC 
TT AAAC TTCACAATTAAATTGCCG^^^TGCTGCTCCACC TGCACC TGC TTTNCCAAGAT 
CT 

GGGGAMAACACNTGNCGTGCCCAGGCCTGNNATTAAATGCAATTCNANN NCAACCGCCGC 
WGGTGGGAGNAGGGACGGTNATTGCCGTTAATATGGGGGCCGCTACTTTTTCCCGC 
Sequence 502 

. NACAAACATTACGAGCCGGGTAGTCATAANAGCTGTAAAGCCTGGGGGTGCCNTAATGAG 
Sequence 503

GCGGTGGCGGCCGAGGTACI 1 1 1 1 1 1 1 U 1 1 1 ! 1 1 II 1 1 1 1 ATGAATT ATTT ATTTTCTT 
TCTCAGAAAAGGATGCGCCTCCACTTAGCAAGGCTGGGCAGGATGT^GGNTTMT GNATCTG 
CCCACAGACGGGGTGGTTCTAGACGGCCGCTCTNMAAC 
Sequence 504



ACATACTTANCCCGGNAGCATTAAAGTGTAAAGCTCTGGGWrrGCCTAATGAGGTGAGCT 
AACTCACATTAATTTGCGTTGC TGC TCAC TGCCCCGC TTTCCAG TCGGGAAAACNC TTGG 
TCNGTGCCCANGCATGCATNTAAATGNANATCGGCCCAA 
Sequence 505

CACAAGATACGAGCCCGGGAGCATAAAGTGTATAAGCNCTGGGGTGCCTAAN 
Sequence 506 

CGGTGGCGGCCGCCCGGGCAGGTACTCGTCTTGGTGAGAGCGTGAGCTG<rrGAGATTTGG 
GAGTCTGCGCTAGGCCC GCTTG GAGTTCTGAGCCGATGGAAGAGTTCACTC^ 

cccgcggttgatgcgtgcttttcgcaagaacaagactttcggctatggaagtccccatgt 

TGATGGATCCTGAGGCTTGAAAAAAAACTGAAAGAGAAT AAAATATC TTTAGAGTTCGGA 
ATTATTGAGAAAAATCAAANACTCCC NAGTTTTGATGACCTGNGAAGGAATATTTCNGAG 
GGACNCC^GCCCTTTGGGGNAAGGA^CCTTGACTCTATCTTTTCA^ 



ATT CCTAGT AACAGGC CCT^AAA GACTMAAWAGCAAACTTr^^ 
TTCWTTTTTATTCC C 1 1 1 1 II IH I ATT>HTTiTTTAAAAATAAANAAAATAATTTAATT 
*T AAACTTGGNACC TTTTCC TTAAATAAT ATT ACCTTT CTNATTGAAAAGG TGGGAAAA 
N 

GGGAAAATTTCC 
Sequence 507 

GGCGGCCCCGGGCAGGTACGCGGAAATCCCCTAACTTCCT^ 

TTTAGGTTAGATNGAGAAGTGTGTATGTGTGTGTCT G T G IGTGTTGCTCWGCACAGTNGA 
TG AACTGT AAACATAAATTGAAGATATTGGAAAANTACATNAAKTTATGGACCAACATGA 
'nTTCATTAGGACTTCCTA TTCANAGAGTATCAGTTTMACANNTTGGGTA TT AGWT 

A 

CTAGTATKAAACATTTCAGATACTTGCACTGATTTTCTGGTGGANTAAAAGCAANGGCTT 
NTACAAGTTKT AAGCATGTCTTKTANGNCT ATGCTTTGGAATACCAGCT AATAACCAAT 
C 

AACAAGW GCAGWA GCCTTAAWGTGGTA I rTTTTTBCl I G ACCCTAAAAAACATGGAACCT 
^^^t^^TTTCTNCAAAAAMTTGOCTTAACCAAATGGAAANTAGGTGGGGGGAAG 



TATCCGCTTCACAATTCCACACAACNATACGAAGCNCNGTTAGCATTAAAGTGTAAMAGC 
CCTGGGGTTGCCCTAATGAGTTGAGGCTAACCTCACATTAATTTG^ 
NTGGCCCCGCATTTTCCAGTTCGGGGGAAAACCNTGA 
ANATNGGAATTCCNGCCCAACCNCNCCGGTTGTAGGAGGGNCGGGT^ 
GGNGCGC TTCTTTCCCGCTT 
Sequence 509



CCNANGTACACTCCCACCACCACCMCATGG I C I C 1 1 1 C ATATWWCTCAAWWMTCAACMTG 

*nrcCTGf«GCTTCATAArmTCCThnTMCAT^^ 

CT 

C^NCTNANGCTNTNNANTCAATNCANTTNNCCTTAATNNAAATCACAA 

TACNCANNAANNTMTOfU^TTCANNNCC^^ 
TCANCAAAAATCACATCCACCATTNCtlATCCCNC^ 

AC T ANTN NATCCCCC N N NC TNCAANAATTC NAT ATCAAN CTT ATC NAT ACCCTCNACC 
TC 

NAANNNNN NCCCNT ACCCAACTTTTKTTCCCTT 
Sequence 510 

CGGCCGCCCGGGCAGGTACTCTCTGAGCCAAGGACATTCTCATTTAAACAGTTTAAANA 

^^TGGGNGCNGGATCGGGAAAAAAAAGAAATATACCCTGGCAGCCGCCTGCCCGGCCGGA 

A AGCGGA NAGGGACNCTAANATCAGCAAATTCNCCAGTTTGGATCC. I rGT C CI 1 1 I C CGC 

CCTTTTCCCCCCATTAAATCCANAACCCGTCACATGATAATTAANAAAANGGTTCAGTTC 

CTCCTCCTCAAACCACTTCCNGTAAGAGGATCCCCCNCNTACCTCWGOJCCTCTAAAACT 

AGTGGATCCCCC«3GC<rTGCANGAATTCNATATCAAC^ 

AGGGGGGGCCCCGGTACCCAACTTTTTTGTTC 

Sequence 511 

GGGGGAGGGCAGNAAAMCAAACCACAGCNCAC><GCAN 
AAAAAAAAAAAATNNNTNhlTNCCAAACANAA^ 

NNAAAGCCCNGGAACCAA7 1 T TTTNTGGGGGCGGGGGCCCCCCAAAGGGCGGGAAAAACA 
GCCACGACCCACGGC NCCAAGCNCGAACAGAGAGCNGGGGGAGACGCNGCCAAAAGCAAA 
A CGGCGGGCAAANCNNAGGGAGCAANNNGGGGCGAAAAGNN*<AACGGAACCANNANGAAA 

Sequence 512 

AGCANCACCGCGGNGGCGTTTGCGGGAGAAACNGNGGACCCCCCGGGCTGCAGGAAN NCG 
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ANANNCNATTTAGGGNGAGNNAAACCGC 
Sequence 513 

^AGNCACGGACGAGAGCAGATTANAC NTNGGGGGCNGNAAAACCCCAGCCCCGCGCGGNC 
ACAGCCCNAAAGGCCAACCCP f 1 1 IGGAGGNGCNNGGGGANGCAAACNGAAAAANAGCNG 
G^^AAAGMAGGAGNWGAAGCCAAACAGCCAAANM^^ 

TTGCNANI 1 1 1 1 INANGGGGGGGGMANCACACCCCCMGAANAAAGNCCGGGCNGNCGNCC 
CNGAACGAGGGGGGGGGGGGGGGGGCNGCAAGA^ 



iTACCTCCGAAATCTTAGCTTCAGT 
ill fCTCI uCCACCCACTCATTTATATryrT i < y f ^ATT^n C ft GTC TCTT CAGCW i G 

OA ' * ■ ■ • ... 

CAACTCCTCCAGCTCTGCCTGATAGAACTTC^ 

ATCACACGTCTTTTC T CCAAAGAGTCTGTTGGCAGTTCT AAGCAAGT ACGCGGGGTAAGC 
AGGAACTGAAACCACAGAGCTTCAAAAAAAGAGCGGGAC^^ 
GCTGAACATGAATCCACAGATCAGAAATCCG^ 
CGACAACTTTGAAAACGAACCCATCCTCTATCGTCG^ 

ACTGAAAA TAAAC AGGGGCCCCTCAA AlllC CI I I G GGACACAGGGOGTCTTTTCCAGGC 

CAGCTGTATTTrCGAGCCTCAGTACCTCGGGCWSGT^ 

Sequence 515 

TTCGCCCACCCGAATGATCACCAAGACACACAAACT 
AAAGAAGAAGAAAGTGGTCAAAGAACCACAGACTCGAT 

TTACI rTGCTGATGI I ICICCI 1 t AAGAGCTACATCCCCCTCT AAGAGTGTGGCCCAT 

GCAGGCACCTGAGATGCCTCTAGTOAAGAAAAA 
GGCGGCCGCTCGACGTGGTCGCGGCCGAGGTACAACTG^ 

ACAGCTT CGAGCTCTTGGCGCCAGAGTCCGATGCACTCCTGCAGATAACGGTCATTTC^ 
tttccgggagaacctctttcgaaaaacaacccggat GAGACT ATCT GGCAAATTGCAGCC 



*TTGGA0CTCCCGGCGGTGGCGTTTTGCTCTTGTAGCCCAGGCT GGAGTGCAATGGCAGG 

ATCTCAGATCACTG<>ACCTCTGCCTCCTGGGTTC^AG 

CC 

CAGGTAGCT GGGATTACAGGCATGTGCCACAACGCC TGGCT AATTTTGT ATTTTT AGT AG 
AGACTGG I I I C> CCATGTTGGTCAGGCTGGTCTCAAACTCCCGACCT C*GGT GATCCGCC 
CGCCTCGGOCTCCT AAAGT GCTGGGATTACAGGCGTGAGCCACTGCGCCCAGCTATACTG 
T AT ATTTT AAGGAAGTTCCAGCATGTTGCATCTTCTGCATTT ATCCCT AT ATCA 
GAACAT AAAGTT ATCATGGTGTTGGGTAAATTAGCGAAATTCAACCOCTTCCT AAGGTTT 
AAGGGGAAAAGGT AH 1 1 1 A AAA A ACAACTTAATNAAAACTTTACCCTTCTTATACAAGA 
GTGGATTTCCCCCTT AATTAGGGATGCATGGTTGATTAAACCTCNAGATACAGCTTTT 
TT 

GCAGTAATGGGGGGGNTGGGT 
Sequence 517 

GCGATTGGAGCTCCCCGOGGTGGCGGCCGAGGTACGCGGGTGTTGATCCAGTTCTT 

TTCAAGGAGAAGGATTTGGACGTCAGAGTATGTCAGAAAAACGCAC^ 

ATGCCAGTCAATTGGATTTCGTTAAAACACCGAAAATCAAAAAGCATGX^TTTAGTAGCT 

gacgagactaaactcaatacagtggatgactagaaagcaggttctcccagcagaga tgtg 
GGTCCi i ccctgggtctgaagaagtcaagctcattggagagtctgcagaccgcagttgcc 

GAGGTGACTTTGAATGGGGATATT CC n I C CATCGTCCA 
e 515 



AAACCCCACCCCCCAGGGGG 
CCCt^5GAAAAW3ACNNCCCCCAACCAGGGGANAA 
GAGGGAACAGGGGAACCANCACTTTTNTNTTTTTGGGGGGCA^ 
ttACAAAAAAAANANCCCCCCOCCAGGANGGGGGGGGGCt^^ 
AGACCCAAAGAAAAAAAAC 
Sequence 519 

GCGATTGGAGCTCCCaXGGTGGCGXXCGA 

TAATGTTTGT CATAATTTTAAATAAAGTGTGTGGGTTTCAGAATAAAAAAAAAAAAAAAA 

AAAAA W CAAAAAAAAAAGTACCT 

Sequence 520 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATGTTGAATAAATGTT TT T T TCC 
CTTTTAATTTTTCTGl 1 1 U UL I AGTGCATAGAATTGAACTGCTTAGGGAGTTTGAGGCT 
G 

CAGTGAGCTATGGTCATGTTACTGCGCTCCAGCCTGAGTGATGGAGTGAGAACCTGCtrrc 

AAT TAAAA AAAA A AAAAAGAAAGAAAAAACAGTGCAGTGGGCTCAnSCCTGTCATCCCAN 

CAGTTTTTGGAAGCCAAGGCAAGAGGATTCCC^GaAn TT OkAGACCAGCCTAGGCAACCT 

TAGCAAGACCTTGGTATCTTCCCAAAAACCTTTAA^ 

TGGCTGAGATGAGAGGATTTGCTNGAATCCAGGAANGTGGAGGCTG 

TTNGGGCOICAGCAKTTCCAGGCCTGG 

AAAAAAAAAAA 

Sequence 521 

cgcagctagctgcgagaattaatgtgaattgSS^ 

cgcacttgcggccccgogi i d i cccggggctaccgcc » l> i cm l>agccgtcgcttccaaa 



caagactgtaagtttt^tgaggcctcccgagcttcctgcatgcttc 

CTGTGCAGCCTTjCAGAACTGT AAGTCAATTAAAC C I C I H I C 1 1 1 A T AAATTACCCAGT 

c 

TCAGGTAGTTCTTCACAGCAATGTGAGAACAGACTAACAACAATCAACTCAT AA 
CACAAAAAAAATAGGTAAGTTCAAAATTAACATATTACCACATCCAACTTCTTT attctt 
GAGAA/V^.AAAAAAG'TTT'AAAATI 

TCAGGGCTC>CTGCAGTCTGGCCATATCTCAAGCAGGTC 
Sequence 523

TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACG 
GAGTAAGAGAAAGAAAGAAGTGAGGGGATGTAAACTCGAATAAATTTCAAAGTGCCTCCG 
AGGGATGCAACGGGGCAAAAACTGAACTGTTCAGGCTTCAGATTGTAACTGAC^ 
GGAAAAATGAGGTTTGTGTGATTTTGCT AAAATGCATCACCAACAGCGAATGGCTGCCTT 
AGGGACGGACAAAGAGCTGAGTGATTTACTGGATTTCAGT 
GAGCAGTGGGAAAAATGGACCAACTTCTrrGGCAAGTGGACATm 

AGAAGACAGAAGT AGCT CAGGGTCCTGGGGGAATGGAGGACATCCAAGCCCGTCCAGGA 
Sequence 524 

GCGATTGGAGCT CCCCGCGGTGGCGGCGGAGGTACGCGGGGCTCTTGAGGAGTGAGACTG 
CAGGAGATGTGGGCCGTGCCAAAGAGATGGATGAGACT GTTGCTGAGTTCATCAAGAGGA 
CCATCTTGAAAATCCCCATGAATGAACTGAGAACAATCCTGAAGGCCTGGGATTTTTTGT 
CTGAAAATCAAC TGCAGACTGT AAATTTCCGACAGAGAAAGGAATC TGT AGTTCAGCACT 
TGaTCCATCTGTGTGAGGAAAAGCGTGCAAGTaTCAGTGATGCTGCCCTGTTAGACATCA 
TTTGTAAGTGCTGGAGTGCAGTAACGCCATCTCAGCTCACCGCGACCTCTGCCTCCTGGA , 



PCT/US00/33312 



TTCAAG TGATTC TCCAAC CTCAGCC T CCC GAGT AGCTGGGaCTA TAGCAGTGCAC CACC C 

ATATATGCAATTTC 

A 

Sequence 525 

AATTGGGGGGNAACNACMGGCCCCCACGGNCCCNCNGGCCAGNGCACCCATTrrTTTNGN 
GGGNGAGAANNCWGGCCACCCWGACCCGGAGAGGAAGGAGACMGTTTTTMAAGNNGCCNC 
GGGCCACACNCNAAAAANCGACCCGCAANNNGCACCGACAAACANCGGNGNGCNAAAACA 
NAACNNGAACANCCCGAGGAAACCGCCCNAT" 



NGGGGGCNN 



AACTTAATGTCl ICCI I 1 1 1 1 1 1 1 I J CACTGGCTTTTTCATANATCGAGACATGTAAGCA 
GCATCATGGAGG TAAGTTTTT GACC TTGAGAAAA TGTTTTTGTTTCAC TG NCC TGAGGAC 
TATTTATAGACAGCTCTAACATGATAACCCTCACTATGTGGAGAACATTGACAGAG TAAC 
ATTTTTTNGGGGNAAGAAGAATCCT ACAGGGTCATGNTCCCTT CTCCTGTGGAGTGGGGG 
GGNAGAAGGGGTATGGCCCCAGGGWNGGCCATA7TACTGACCCTCTACAGAGAGGGCAAA 
GGAACTGCCAGTATGGNATTGCAGGATAAAGGCAG 
Sequence 527 

AGGTACTCACAGTCACGCTCCTC TGAACCATCCTTGGGC TTCATGGGGT7GGCATTGAGG 

AACAAGGACACACATTGTGTCAG<3TCACGAAGATCATTCAGTTTCCATATGCTGAAGGTT 

TTTCCACTATTCACACTCTGTGGCGTAACCTTCTTGAATATAACCCCAAATGTCACCCA 

A 

ATC 1 1 1 I CCTTGATCTGAGACAGTCTGATCAG 



Sequence 528

AAGGAhtAATTTTTTGGGGGGWCAAAAAAACCCCANC^ 

ATCTTNGGNAAAAGAGGGAAANAGGCCCCAAAAAGGACAAAAGGGN NCAN NCANAAAAAC 
AAANNNCCAAAAANCCGGCCAANAAKAN NNCAAAANNNNCCCCCAA I II I NII I II I IGG 
GGGGGGGAAANGGGAAGN NACCCCAANGNACGCAAAAACNACCCAAACAGGGGGGGG 
Sequence 529 

CCGCGGTGGCGGCC GAG GT ACATTGT AT ACTGCAGTGTC G TC TACATGGCATTG GACAG G 
ACATAATGTAAAACATAAAAGTGCAATTGTTACAC TT ACAT ATGAT AGTGGAATGG CAAC 
CGTGACCAATTTTTGGCTCAAGTTAAAATACCAAAAAC 
Sequence 530

CGATTGGAGCTCCCCGCGGTGGCGGCGGAGGTACTTGGAAACCCATTTGGATTAATTAGA 

GGTCTGTCTGAAGGAGTTGAAGCTTTATTCTATGAACCCTTCCAGGGTGCTGTTCAAGGC 

CCTGAAGAATTTGCAGAGGGG TT AGTGATTGGAG TGAGAAGCCTC TTT GGACACACAG T A 

GGTGGTGCAGCAGGAGTTGTATCTCGAATCACCX^TTCTGTTGGGAAAGGTTTGGCAGCA 

ATTACAATGGACAAGGAATATCAGCAAAAAAAAAAAAAAAAAAAAAAAAAGTACCTGCCC 

GGGCGGCCCGNTCTAGAACTAGTGGATCCCCCCG 

Sequence 531 

ACATT ACNAAAAGGAGAGG N G GCCCAGN N NAAACACNC NGAAN CCANCCN NGCCCNGAGN 

AACAAANCACNGGAGAACAAAAACGAAAAACAGCAGGNCCCN CNN NNAAANCCAAN N C AN 

ACAAAAANGNCAAAGNAGAAGCAAAAGCCANGNGNCCCGGCAANAAAGCCNCCCCAAAAG 

CAACAAAGAGGNCNGCCCAAACCNCCNAAAAAAACAAACCCCCAAGANGAAAAAAAACCA 

AAACCCCNAANGNAAANGAAACAANCAACCGGGGGCCCCCCAA 

Sequence 532 

TTTTTT ATT CAATTTG CGATN GACAG N N NT AG N TT N AAATG TTN G TAACAC TCTT AGAN 



N NC T GGTTT GTTCATTTGACATTJGGGGCTG CACCAATTTTT ATTACAAAAATCAAAAAA 
G 

TAAAAATTCTTACAATATTTGGAGAGTATAACCACTAGTTGCCTAGACAAAAGCTAATT 
T 

c tacaaaatcaaaaac TT aatgcag ttttattaagagag tcaaaattctctcag ttaac 

T 



aaagaaaatacatgtcaacagtcaggtaaatattttgacctgacaggttgtacaaat 

ggattttcactacatataaaggaatctgttacatgggggtaaaacttccagagaccaagt 

aggaagnggtggaataaaaaaccaataaatncaaacgccaccccaggctgg 

Sequence 533 

ccagctgctngcctgcaaagangagcgtcctnngggggggggnaaaaccccncccnancc 
nggamcttggccttcacantnn<^tggggggcactgggcgccacctca 

ctgccaatntotnnaccacggggcacgcacagggacac 
caaccgccccnncccgnggaccngcccg 
Sequence 534



tggcgncaccaataagc^taaaactgaagcctgattagtggagtgacaaactacntgaaa 
tggtgaaatgagggatctctttgagaaattaaagatcacntttgggattacntcattc 

TT 

CCAAGGTTTCCAAAAGTCTCATTCTTACTCGAGCC1TCAGMGAAATTGAGGGACTAA£AG 

atcaggcagacaaattgataggacagaaaaatctcctgactcgaaaacggaatattctga 
tacggaaaggatcgnctctttcaggtaagacagaagaagtgggcctgaagaagctagagg 

ACC TGCCCGGGCC GGCCGCTCT AAAACCAGGGGGATC C C CCGGG CTG NAGGAATCNAAAT 
CAAGCCTAATCGAAACCGNNACCCNCGANGGGG 
Sequence 535



CTCAGNGGh«3GGAGAGANCCCTGCTTACCCACCAACCAGAACGNGGCCCGCCNAGAGGCT 
GGAACNGAGAGAAAGAANCNGGGGCTGGCNNAAGAAAAKANAGACANNCNCACAAAAGCC 
N AGTNCATNTTTNNTTNCCGNNGGGACCGNNCACCCGCAGAAANAN NNCACAANAGGCCG 
CC GGhK^AACGGGGGGGAGCACGGACNGTCAGGNCNCNGGGAAGGGGGCAGCGCAACCCG 
CAGGGCNNCNCCCCCCCNGGCCNNNGGAGAACCAGGGCCCNNCNCAGGGGGCCNAGGGAC 
CGCCAGGCNGG N NCAGCCAGGAAGGCCAAAANCAAGAGGGAGAAGGAGAAAGGN G N AAAA 
AAGAAAAAGGGGAGGNGG 
Sequence 536 

GGGGA*JCCC GCGG NGGCAN ATTGGGGGGGAACACACAGCAAAGANACGN NACAGC CTGAG 



CCN TNCCTCCATCTGCGCAAC AAACGC NAGAGAANCAATCCTTGGGGCAGATACTGGGGC 
TGCCCTCAAGGAGCTNNhtATAGAGGNCAGGGGACCTTTGNCGCTNTTTNNCTAGGGGANC 
Sequence 537 

GG NCCCCCCGGGCTGCAGGAAN NCGAN ATNTNCTTT AGGGNGACC AAAACCCCC 
Sequence 538 

GGC ACCCCGCGGNGGCC CTNGGGGGGACAAC NCCGCGCCCGCCAGNAACAGGCCACAGCC 
CAGAGCTCNNTCGGGGGCNNAAAAACCCGGACAAGCNGCANGCGGGGGGACAGGNCTGCG 
GG NCNTGGAAACACTGGACNG GATGGCACAN GAACC AGAACTCCGC TCCGNTTGGCTGCC 
CAAG GANCCC ACC AACN CATNCTAANCAGCG ANCACNG AGGAAACGCNTTTTAN N CC GAG 
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GKAC MAN N N CAN AGAACAGGC C NAC C GCAAGGGCANACCAAGAAAGGGGGGC GN AAGGAN 

^^MAGGGGGNAACAANGNACCANAGGNCNNCAAANGNCNGACANMCANC 

CNCNAAANGGCCNNCCNTNNCACAANMNCNN^ 

CAAAGACANGGAANNACC GGGGANAN NAGCAGAACCAAACCGGAAAANGCAN GGAGGGNN 
CAAAAACACCACCKACAGGAAGGAANAACCCAGAGGAAAAAGGCCGAAAGAAAGAAACCG 
AAANANAAGACCNGGGCCGAAAAAGCNN 

ANNNCCCCCNNCCAACCANNAAACAGGGGGAAAAAAAAAAAANCNG 



GCGATTGGA GC TCCCCGCG GTGGCGGCCGCCCGGGCAGGT A O I I ICII I II I ATAGTTTT 

I I Ibl 1 1 ITCT GAI I I 1 1 1 I H 1 1 1 GGTTTTTGTTGTTTTGTG 1 1 1 )1 1 1 I CCI II I II I 

1 1 ICI I AGAAAATCTGAGACACGTGAGGCCAGACAAAGCAAGGCCGGGGCTGATGG 
CCTGGCTGCCTGGTGGTTGATGCTTrTGCTCCCCCTACC 1 1 1 1 1 I II 1 GAGTTTATTCT 

G 

ATTGAI 1 1 1 1 1 1 ICI I GGTTTCTGGATAAACCACCCTCTGGGGACAGGATAATAAAACA 

GTAATA 11(1 lAAGAAGGAAAAAAAAAAAAAAAAAAA 
Sequence 540 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTT^ 
AATGATATCCAAACCATCAGCTACTTGTAATCTT^ 

Tl^GGTGGCTGCTGACTGATCAGCGTGGTGGTTGCT GAAGGTTGGAGTGGTTGTGGCAA 

III CI \ AAAATAAGAGAACAGGCTGGGTATATTGCCTGATACCTGT AAATCC<iAGCACTT 
TGGGAGGGTGAGGTGGGAGAATCTTTTGAGGCCAGGAGTTTAAGACCGGCCT 
TGGTGAGACCGTGTGTCTGCAGAAAAATGAAAAGAAATTGGCTGAGTGTGGGGGTGCATG 
CCTATACT ACCATCTACTAGGGAGGGTAGGATGGAAGGGTTGCTTGAGCCCAGGAATTCA 
AGGNTGGGCCACTGCACTCCACCCTGGATGGCAGAGTGAGATCCTGCCCT 

ATNA 

Sequence 541 

n 1 1 1 1 I I 1 1 I I 1 1 1 II 1 GTTAAAAGACACAAGTAGTGATATATCAACATCTGTTTAACT 
CGTG ACCGTTTCI HI 1 1 1 CAACTTCTTTTTTCTTTTCAGTGCTTTCTTCTTCCATTACC 
TTTTCCTGATTTCCACTTTCAGTTTCCATTCGTTCGCTATCTTCTG^ 

GCTCCAATCTGCGAAATACGGCACTCTCTTrATTGACTACTGCTTCTCT CGGCCCCCGCG 
Sequence 542 

GCCGCCCGGGCNGGWACAAAATGTTAAAGACb I I G I U G TATNTGTAAGGCTGGTGTATT 
CAGAGAGCATl^TCTCTTATTCCTCACTn'CCACCCCCGT ATTTTGTAATGACCATGAT 

AATGT TTWTAC HIT TGTNTAATGGGGTGGGGTGGAGTGGGGGCTATCTGAGAGTCANCC 

TGAGGTCTTTAGAGGACCANCTATTGTATCACCTTGGATACTTGAAGTTT 

Sequence 543 

CAAANACTTTGGCCANAhrrAAAATNGfrrGGAACTAKAGGTTTCT^ 

GG TT AAAGAAGCCAAACGGTNGCTTTTN GGGGGAANGCCAN GAAAGAAAANAAGGGGGGA 
GNAAAAAAGGCCATGNCCATTCMThn'GCCCCTTGGMAATGGAAGCCCCANGGGGGGG NAC 
ACCAAGCNAAANNAAGAAAAGG CCCCACCTTNATT C TTCAATTTTT AAAATT CC TTTT A 
A 

CCAGAACATTCTTCTTTTGGCAACAAGNGGTCTTCCCCT^ 

TCAC CCATTGGAAGAfrrGAGAGAGTNCACTGGGAAAAGC GGCC CACCTT ATTCAGTCCCC 
TCCCCTTTCTTGGGGTKfTGGCAACCAAAAGfrTTNTTC C TGGCGGGGCG TTGGGGACCCCG 
ThrrrrCAAACCAAGTAAGGAAGGGGCCTTTTAATTTTTGGGG 



AGAAAAANGCAATNGG TAAGNGGC C TTTC N TTGN GGGN GAATNAAGGGGCCCCACG GAAA 
AGCTTTTTCCtCTTGGAATTGTACCCCGGCCGGNACCTTT^ 
CCmANAAGGACCCCCGAAATOTTGGNTNGGGCCCCCCC 
Sequence 544 

TCCGCGGTGGCGGCCGAGGTACCAATACTTACTTACAAATTTAATACTGCTTCAAGGTAT 
TTAATCTAAAATTTrACCAACTTTGATTTGTCTGGTTAGGATATTTrGTTTTAGTGGATA 
TGCTTTAATTCGGATCAATT AC TGCAGT AAATCTCATCC C TAAGCATGAAA TGTTG TCA 
A 

CAAATAC CCAGTTCCATTTAGTTATCAATTAGCCCAAATAAGAGATACAAAGTATAAC AG 
TGACCAACCTTGTACCTGCCCGGGCGGCCGCTCGACGACTGACATAGACTGAAAGCAA 
AGAG TGC TG TG TTTGTT GCTAT ATCCCC TCCAACACC TAAGGCAATGCATTTCACATC 
TT 

GCTGAGAGCAGATAACCTCAATACCTGGGAACTAGAAAAT 

AGTGAGGGGTTAATTGCCGCCGCCTTGGGCGTAATTCATGGTCATAAGCNTGTTTCC 



AAAAAGTTGTNAAAAAGCCCTGGGGGGG*GCCCCTAAATGGAGG^ 

CAACCATTTT 

Sequence 546 

GCCGGGCAGGTAC C TGATGCAGGGAATTGAAGCGAGACCCAAAACGGGCAACCCAAT AGG 
ATGGCCATCTGCCXX1ATTAATGCCAGCTTGTCCAAGTGTAATTATTAACAGTGCCCXCTT 
TCAC TC TCCAAAGAG TN CC TTG TNCAAACAGNTT AATTG TGGAAGTCGCCTTCAAGATGA 
CTGGGCGGGTAAAGGAAAGTGGGAGTGAGGGAAGCAGGGTAGGTGGAGGGTGTGAAAGGG 
AGAGGGCCTC^TCTCAGGGTGGCTTGGACCTGCACCAGCATCGGCCTGCATGAAATGTGC 
TCC TA»CTCTTGCCCAGGC TGAGTATCAAAGAGAAGCAAGAAATCTAGAT AAAAAT NCAAA 
TCCAGAAACA 
Sequence 547 

GCGGCCGAGGTACAGGT AAGCCCTGGC TGCCTCCACCCAC TCCCAGGGAGACCAAAAGCC 
TTCATACATCTCAAGTTGGGGGACAAAAAAGGGGGAAGGGGGGGCACGAAGGCTCATCAT 
TCAAAATAAAACAAAATOACAAAAAGTTAT^ 

ATTTTI I GGCAATTTACCAATAAATTTTTT ACCACCGAAAAAGCCAAANTGGCCTTANT 
A 

CACCCCTTCNCCCC NTGNTGGTGGGGACCTTTTGGGGGAAGGAAGGGNACCTTGGGGG NC 

CCAATTTTCCTTCCCTTTTAAGAAAGAAGGAAAAGTTGGGGGGGGTN^ 

TAAGTGGAATNGGGGCTAAAGGGGGGAACCTTTTCCCCTTGTTAAAACCAAAA^ 

TTTCNTCCAAT7AATTTTTTGGGAAAATTGGAACCTTAATTTAAAAAAAAA 

ACCCAAAATTGGG TTGGCN AAATTCCAAAAAAGGTTCCCNTCNG^ 

TGGTGGAAAAACCI 1 1 T 7 fGGGGGGGGGGAATNGCCTTTCXGGCCTTCCCCAAAACNCNG 
NAACTTGGCCTGGTTCCAACCCTTTrCNACCCCGGTTTNNCCAAGG 1 1 1 I I 1 1 1 1 AAAA 

TCGCCCTGGGAGGTTCCAAAAGGCCCCAAAAAAAAAAAAAAAAAAAA 
Sequence 548 

GGC GC CGGGCAGG T CC C TTTGT AAT AT CC TTT AT AATAAACCAG T AAATGC TGTTT CCC T 
GAGTTCTGTGACCTGCTCTGGCAAATTAATCAAACCCAAG^GGGGGTTGTGGGAAC 
AATTTATAGCTATTCAGTCAGAAAAAAACAAGGTAAGACAATCTTGGGGCTTGCGACTGG 
CATTGGAAGTGGGGGACAGTTGTGCGGGGCTCAGCCTTCAACCTGTGGGATCTGACGCTA 
TCTCTGGGT AGATGAAGTAGAATTGAAC TG GGGGACAC C CAGC TT GG TGTCCAC T GCAGA 
ATGAATTGCTTGCTTGATGTCTAGGGAGGCCGCAGAATTATAGOKGGGAGGTGAAAAGCA 
CTTCTT AT AT AGCAG T G GCAAGAGAAAAT GAGAAGGAGCAAAAGC TGAAACTCCTGA T AA 
ACCAATCAAGATCTCATGAGGCTCATTAACTATAACAAGAATAGCATGGGAAAGACTGG 
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549 

NACCCTCTCAGOCNCCCTGTAATTGCGCN/^CTfrTGGA^CGCTOCAAOGATTGTCGAGT 
CGTATAGCGTCTATGTACATATAGCATrmCfMTAGTCATTGGTGTAGAGATAGAAAATG 
CTTCGTACA TG TCAATGGGAGAATGGGTGGTACCACT ACAGCGGAACT ATCCCTAAGTCC 
ATCCGCCTGGGGCGAAAGGAAGGAAAAAAAGA 
Sequence 550 

ntatci ran I GCCTCATGNGGGCTACACCNACGCT AGNNAGCCCAATGAGACGTTACGAG 

cgcgcaagtnagaaacnagatttcatagagcgcttgt1gggagagggacattcgcaaacc 

gcgcgtttaagttactggtagatattgagtanntaaggncgttggggaaacgcaaccaaa 

tacttxtagagccttigccgnaacaagntactacanttgttcngggggaacgaaggtgcc 

ccggntcaacccffttggcccccaaanagccccaagnc^ 

mnnctt aacngaaccacattgggcgaamggmncgcna^ttggncccc^rrggtttttatcnn 

NCANT AACC CNANCNAAATGtraXSNCNTCCATAGGNAAATC 

NGATA 1 1 1 CC 1 CGGCATTT^NTGGCCCC^^TTTCGCTTTNTNTAA^^CGCCA^^TAOCT 
NT 

AGCNCCC TTTT AGGCAACAT CCTTTAAAAACGGHGCGGAGCGGTG TCCCCCCAAGGGCC T 
TNCCCCCCCAAANGCCCCTTTTGGTGTCGAATTTGGCAAGCC^ 

AAAGGGGGGGG TTGGGG^MCTCCGGCCCCNACCGCCCCTTTGGNCCCTTGGGTAAAAC 
TO>AATNGGGGGGAI«3GCAACNAAAGGCCCCCTT<»^ 

AAGAAGNACCCCAAGGNAAGTGN NCCCACCGGGGGGTTHANAAANAAAACCCCCCAAAGC 
CACCCAACNGGAACTT ACCCCTTAHAAACTTTTTGGNATTAN G TTNTAACNAAANNNACC 
CGNCCAAAATTTAAANAAAANANAAGGGCGGATTTAATTTTTTAAAA 
TTNGGGGGTGGAAAGATNTAAAACAAATNTTAAAA 
Sequence 551 

AGTGGACTNTGTGACCTTGAA*SAAGTCATTTAACATCTCTG 
T 

CTACAACTAATATATAGTGGGTGAG GTG 1 1 C 1 1 I C I 1 1 G TTCTGNTACTWGGATGTGA 



CTCTCCNTTTGGAGATGAAACCATGGCGTAAGTAATATAAAGACTTTTCCCTGTAGTT 
AT 

C TT ACAGAC TGGAGAGAGTGCT AG TGAATGCTTTTGTCTTCAATGCCCATCTCTTGGAAA 

GGAGAATATCCCCATCATCACAACTGTCATCACTCTGCCGCACGTGATTGTGGAGAATAT 

CCCTCTCCNTGTGAATGCCAGAATGAGATTCATTTACAA 

Sequence 552 

GGCCGGCCGCCCGGGCAGGTACTACAATGATTCTGAAGCACAGTGTATTCAGACAGATAC 



ATGTTTTTATNTTAAAATCATTAAGGGAAGGGAACTACTTT 

13 

CCTTTAAATGNTTATCAAAAGAGTTTTCTAACCAAGGNGTAATACCCTTANTTCTTAAC 
A 

I II Nil II I CTTTATGTGNT AGTTGTTTTCATGCTACCTTGTGTAGGGGAAAACCTTTAT 
TTACAAGAC HCAT ATTTAKAAAAGGGCTANATTTTT AAAAT ACTCAANATTAATATT AAA 
AAGGTTGGCTCCTNGAATTANNAGCCAAGNAAAATTANTATTrTACGAGTTTT^ 
T 

CCCAAC NANGAAAAT AGGCCATTTCCCATAAACCCCAACCTCCC NANAAATGNAACC CCA 
AAGGGGCCAATTATTTATTACGTT ATTTTTTGGGGAAGGGGGAAANTCCAAN N GGGGGGT 
T 



CCAGATTACACATt 

CTTCACACCAAACACTAGCTCAAGCACTGACGTTATTCTACAGGACT ATGAACCTTCATA 
TCCACATTTACAGTCCGGACAGATAAAGGAAAACAACCCAAATCCAGGAGGCAATATAAA 
AGGAAGAGAACAAAACACACATTCATACACT CACACTT AAAAATAGGGGAAGACCAACAG 

gggaacttttc g ncrcTTOC tgg gatgtctacttaaaaatcccatgtgggtacct 

Sequence 554 

ncggg tggcggccgaggtactcttgagattgc TTTAAATTTTGTATTGAAACAACAATAC 

ATTTrGCACTGTAGTAATGGGAGCACTAACTCTTACAACAGTTAGTGAA rCGTTTI AAA 
G 

AATCAGTTCAGTGTAGACA 1 1 1 IG AAAAGAI IG 1 1 fCCTGTGCTCTACGATAGCTT AGT 
G 

CAATGTGCA C I H C TGI 1 1 1 ACTTGCCATTTTCCT GCTCTG I II I C TCTGTGACATGAAG 
C 

AACAGAAACTGAGATCAAAGTTAAGATTATATCCTGTTTGTAGTATCAGATATTTTTCT 
G 

TGTACATTrACATTCAAGTTTGATAACACTGGTGGTTTCATTTCAATACAAATTATGCT A 
GAGAACTGACATTTTCANACATGGTCATATATATGCTATTTGAATTOCTTTATC 
CCAGAT CTTGGATTGTGAAr C T C n GATGATAGATGTGCAGCTAATTTTGTCOCGAAA 
CT 



GGGTGGCGGCCGCOCGGGCAGGTAGAAGACCATGACACCGCCCAAAACACTTCC TGCAGA 
TGTTGTCGTTGGAAAACTGTCGTCTTACAGAAGCCAGTTGCAAGGACCTTGCTGCTGTCT 
TGGTTGTCAGCAAGAAGCTGACACACCTGTGCTTGGCCAAGAACCCCCATTGGGGGATAC 
AGGGGTGAAG tTTCTGI G TGAGGGCTTGAGTTACCCTGATTGTAAACTGCAGACCTTGGT 
GTTACAGCAATGGAGCAT AACCAAGC TTGGCTGTAGAT ATC TC TCAGAGGCGCTCCAAGA 
AGCCTGCAGCCTCACAAACCTGGACTTGAGTATCAAGCAGATAGCTCGTGGGATTGGTGG 
GATTCTCTGTCAGGGCATTAGAGAATCCAAACTGTAACCTAAAACAGCTACGGTTGAAGA 
CCTATGAAACTAATTTTGGAAATCAAGAAACTTTTTGANNGNAAGTGAAAGGAAAA 



CGGGTGGCGGCCGAGGTACCCATCTCTGCCCATCACCGCTGGAATTTTGATGACCTATTG 
GAAAAGATCTGGGACTATCTGAAACTAGTGAGAATTTACACCAAACCCAAAGGCCAGTTA 



GAGAGCCCGGGTGGCGGCCGAGGT ACGCGGGGGGGGAGTGGCACTCGCAGCTGCAGCAAA 

TCTCAAAAT AAAGAGGCAACGGCC TTT CTCTTCC TC TCCATCTCTC T AT AGCACACCTT 

T 

TATTTC 1 1 I ICI1CI1 III n AAGCCTCACGAAAGATTTTACTTGTAGATCAACTTTCAA 

AATGTAGGAAGTCAGAATGGGTGACATCATCAGAAAAATATGTGGAGCTGATCACAAGAA 

GTGAAGAACCCAGAGCACNGAAAGGGG TTGTGACTCCTGGGCCCAGGGAG TTGACAGCG T 

CTGGGCTTCAGAGGAGCCJ^GCCGCCTCGGAGTTGTCTTGGAAGTGAGGCTCTGCTGTAGT 

CCTGTrCTTCTGGCTCTAAGATCTGAATGTTGTGAGCACTAATTTGCTNTTTOCTGGA 

GG 

G TAACCCCAGTTTGG TCCACAAGGGC TT 
G 

Sequence 557 

GAGCCCGCGGTGGCGGCCGAGGTACTGGATGTGAGGTCTGCGAAACTTCTTAGATTTTGA 

cctcag tcca t aaaccacac t atcacc tcggccatcat atg tg tct actg tggggacaac 
tggagtgaaaacttcggttgctggcaggtccgtgggaaaatcagtgaccagttcatcaga 
ttcatcagaatggtgagactcatcagactggtgagaatcatcagtgtcatctacattcga 

ggcgcttgccttcagcttgtgggaaatcccgaagatggccaaagacaactcaactggttc 
gttgctttccagggcctgctgai i m ggaaatgtgattatt 
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ctccctctgttgcxxagtctggagtgcacgtggcatgatcttggctcactgcaacctcca 
tctcctgogctcaagcgattctcctgactgagcctc^^ 
gcctgocaccatgtcccggct aatttttgtatttttagtnaaanacgggg tttcacca 

TA 

TTGGTCAGGCTGCTCTCGAAATCCTGACtrK^A^ 

gctgggattacaggcccgagccaccgnacctggcctgtattxxcgcgtacctgcccgggc 
nggccnctnttagaact aggnggatcccccgggctgcaaagaatttggatattaaagctt 

AATOCNANTNCCGTOGACCTCTAGGGGGGGCCCCGG 
Sequence 559 

CGGGTGGCGGCGCCGGGCAGGTACGCGGGGGGTGGCTGGCTCCGTTTCCTGC mTGGTl 

CTTACAGTACTCGGCGTAGGCCTTAGGTGGGT 

« ' » 'U>IG GCTCCTi ayaXCTTTTCTCCCCTGtT« 

agctgggattttttactgtc tcctgaagaatttaacacaaacatggatatcagaccaaat 
catacaatttatatcaacaatatgaatgacaaaattaaaaaggaagaattgaagagatcc 
ctatatgcgctgttttctcag m GG 1 catgtggtggacattgtggcttta 



c tcagtctc tgccaggggaagattccttggaggaggccct gcagc gacatggagggagc t 
gctttgctgagagtctctgtcctctgcatctggatgaotgca c 1 1 1 i c ctttgtgtgg 
ga 

ggagatcctcaaagtaagcccctcggtgactgggctgctggca 

agtatctttattgaggatgccattaagtatttcaaggaaaaagtgagcacacagaatctg 

ctactcctgctgac 

t 

Sequence 567

GTTTTGGGGGAACACCGCGGf*GGCGNTTt>IGGGGTANACCGGGCCA 
GGNC&AGGNNNNTNNNTTNGGGGGGTTTAAAACCCCNCGCCC 
NCANNANTTTTTTAI 



GCTCCOCGCGGTGGCGGCGGAGGTACI 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 Il l I U ATCGGCA 

A 

GCGACGCTCATACANGGCNTAGCGCCGGGAGGAACCCGGGGCCGCAAGTGCGTTCG^ 

GTCNATGATCAATGTGTCCTGCAAT 

Sequence 561 

CATGTGCGAAGCGCTGTGAAGAGTTGTTGCCTTNCAAGA 
CAGCt^CTGTCATTAAAACTCCCCTGGCGTGAAAGATGACATCCTTAG 



GCTTCCTTTGNTATTTT 



^NNfeSNNANNAAGTTTTTTTTT 
CNTAAAAAAAAAAAAGGNCCCCNAGNGGGG 

GCGGNGGCG GI 1 1 ICGG NCGAG O X I LTCI IGNCCATCTTCTOCOQCTGCTGAAATTTCT 
NTTGCGGGCGCTGNAANCCCAGGACCCCNCCCCCCGCGTACGCT GGATAGCCTCNT GGCC 
AGAAAGAGAGAGTAGCCGCCGAGCACAGCTAAGGCCACGGAGCGAGACATGTOGGCCCGA 
ATGCTGGCAGCTTCAGGAATCCCGGCGIiACCTGOCCNNTGCGGTCTGTTCG 



NNNAGCCGGGTATTCANCCTCTACTTCAAAGGCGCGTAATKACCGGTT^ 
^CANGGGGGAATTAACCGNCAGGAAAAAAGANACCATTGTTGT ATGCCAAAAT ACGGCNC 

atgctaaaaattggncatgtggaaaccccgtttaaaaaaaaag 



acaaaaacccaaaccocagagagcagnaatgncagaagangcanggagaacagcagaanc 

tnacaccgcngcflctctgaaggctoagaacacaagncaaanacatnnaact^ 

ccgctgagagaacacggggaaaaattniecantttagagangnccac^ 

aaaggggaagggcaaggcggngagacaacgacgnnanngnngggaagacnggggaggggg 

nggagaagagccnnggnngccagaannccggncggaggnc>cgaggcggngacc 

ggaccngcccgggcggncggncnagaacnaggggaaccoc 

Sequence 570

CAAACCCGCCNGGGANGGAAGGAAGGAAAAA 



CGAT AAGC TTGATATCCGAA I fCCI I U CAGCCCCGGGGGGGGATTCCCACTTAAGTTTTC 
TTAAGAAGCCGGGGCCCGGCCCCGGGGGCCAAGGGTTACCCCCCGGGGGGGGGCCCGGGN 
AAAAGTTTGGGAAAAAAAAAAAAAAAGG G1 1 1 1 1 1 M 1 1 lAAGGTNGGGGCWTTTTGGNA 

NNCCCCNGNCCANAACCCG 





ACAGATGTATGCACGCACACACACTCTCACTCCTAGACTGCTAAAAGCAAAAAAAAAAAA 

GTCCCTGCC 



NGACCTC»GCACTNAGCAI*C»«CACrACTrAGGGGGNGTTAA^ 

GNAGAAACCNCNGCGCCATGAG^riNTCAAGNGGAGGAAGAAGOGACCCGGGCANGCTGM 
GCGCAAAAGAAGAAAGANGAGGCAGAGGGCCAAGNAAACCGNNAGCNNGWNGCACCGNGG 
AGGCNTTKTfiGWTTTGNNGGGNGGAAN GCMM*AGGCCCNNGGAAGNANGAACNAAGAAG 

CG 

Sequence 556 

attggagctccccgcggtggcggccgaggtacgcggggggggactggaggacctgtctgg 

TTATTATACAGACGCATAACTGGAGGTGGGATCCACACACCTCAGAACAGCTGGATCTTG 



CCGGGGGGGGGGGGGGGGGGAAAAANCCCCCCMAAAAAAAAAAAAAAAAACNACNCCCCC 

CCCCCCNNGGNGGGGGGGGGGGGGN 

Sequence 571

CGGTGGCGTTTAGGGACCAAACGATAGGNG II C I G 1 1 1 AAGTAGGGACCTCTCATGGTNT 



CTCCATCTGTACCTGGGCTGAAATGGAATGTGCAAATGTAGCOCAGCCTGGTOCTTGGGT 

gttgccagttgattgatga^tgggagccaaag tgggatttncttkgacct AAACGGGCGA 

TGATGAAATAAATCGAGCGGGCGCCCGGGGAGGNACATCTGTGAATGTGAATGCCAAAGC 

GAAGGCATCCCTGAAAGTCCCAAGTGTCATGAAGGAAATGGGACATT^ 

TGCAGGTGCAATGAAGGGCG 

T 

Sequence 577 

TGNAAWCCCOOGCCACGGAAAA G GNGGCGCGWGAGCCAGAGCTCCAGCAGCCCNGGGAG 

GGCGGGGCCCGAGGCANGGANAAGNGCGAAGG A AAACCAAGAACAGGAGCaGA^ 

AAANACAAAGNGAAANGGGGCCAGNCAGCaTGTCAGAGACNGACCACAAAGCCCCCACNN 

CCaCnGAAAAAAAAGGNGGGAAAACACCGGAANNAAAGGAAGACCCAAGCAACNWGGNNN 

CNGGCAAIIGAAAGCAGCAAAAwV^GAAAANGAGGCCAAACCAANGGCAAWAAACAOOG 
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Sequence 573 

GCCGGCGGCCGCCCCSGGCAGGAACANAGCACTNAGGNGflXSNCGG^ACWCGGCANGGGAC 
AGGACANAAAGGAAAACANAAAAGANGCAAGGGGACACGACACANANGAAAGGNGAAGGO 
CAACGNCGACCAAACGGGGGNAGAAGACAAAAAACCAAAA 
Sequence 574 

ngggnggggttmt^^ 

gnaaaggggkaaaaaaattttttngaaaggggggggggggggggggaai 
aannaanggggggngnggggggggggnnf 




ACCCCACNCACCCCGANNNCNGGANCNCNAGGGAANGAGTTTNAGCNOCACCX3(G 

^CGCCCCAGAAACNHANGNCCAGAAGNCACTGGGCAGAGAMAAGAGNGNNCGGNCMCAA 

AACfJCACAGGGCMCAGGGTTNGCGTGNTTTTGGGGGGGGGGANGGG^ 

AAGAGGGC*IGGNKANCCGGGNNCNNC*4GGAGAAAGANGG^ 

ACANGGNAACANAACNGAGNNNNCAAflN^KSAGCAGNAANNCGGGGG^ 

Sequence 576 

TGCTGCTACACAAGACCCTGAGACTGACCTGCAGGAOGAAA^A 

TCTTGCCATCC 

Sequence 577 

CAGGTACAGAGACCTCCTT ACTTACCCCCCTTCTCCTTCGGCTGGAGCTCGGCGAGCGAG 
A^^CGGCGCTCGCGTTGGAGAGCGACGGCGGCCCCCGCGTAAGCAGTGGT AACAACGCAG 
AGTAACGCGGGAATGAAGAATCTTAGGCGGGTGCACCCAGTTTCCACCATGATTAAGGGT 
CTTTACGGAATAAAGGATGATGTCTTCCTTAGTGTTCCTTGCATTTTGGGACAGAATGGA 
ATCTCAGACCTTGTGAAGGTGACTCTGACTTCTGAGGAAGAGGCCCG7TTGAAGAAGAGT 
GCAGATACACTTTGGGGGATCCAAAAGGAGCTGCAATTTTAAAGTCTTCTGATGTCATAT 
CATTTCACTGTCTAGGCT ACAAC 
Sequence 576 

GCGATTGGAGCTCCCCGCGGTGGCCCGCCCGGGCAGGTACCTCACAACGAGTTCAGTCAG 
TAGCAGAAGGATCTTCTCTCTTGTTCCTGATGATTTCAAGGTCCTCACAGTCCTGATA 

C TGGTTCTTCCCGAAAC TCCCAAAT ATCTATGGAGAGC TGTTCT AGCTTTTGCACAG GGA 
ACCAGTGGACAGAGGTATCATTAAACATGTCCATGTATTGNGAAGTCTGAGGAAACTCAA 
GCTCCTCCAGTCCTTTTAAAATCTTTGCAATGTAGX5GATAATTTTTCTGCAGAATCCTT 
G 

CCAAGAACCTCTCCTCAAGTCCTTTGAAACTGTTCCCAATGATGACCATCTTAGAAAGGG 
CATCTACTGACCA6TTACTCCATAAAAGATTGTTGTACCTCGGCCGCTCTAGA 
Sequence 579 

ATTGGAGC TC CA C CCG C GGTGGCGG CCGAGG T AC TTTGGACAGTGAGGGTTCGATCCCAA 

TTT7AGGGGTAGGGTTGGGGGTGGGAGTGGGAGTGTGGGTTGCCAGGAGGAAGAATGAGT 

C TAC TTTN GANACAATT AAGT CATGGN CCTC TC TTTTTTT N 1 1 I M I M 1 1 11 H 1GGCT 

ACNTAGACNTCTTTCTCATGTATTGTTACTAGAACAACTT^ATAGGGTTTTATGGTTN 

G 

GGGAAAAGATTNN TAAAAA ATGGACTNATCTCTATTATACAGANNTATAATATAAAAATG 
ATTTAAAGGCTATAI 1 1 I ICAGCATGTAGGTAGCTNCNCTGTCANCCTGTTGAAGAAN 



TTCCTATTTAAGCTTATAGGATGAAAATATATAATTAAAG 
Sequence 580 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCAAATGCTTCCCTGGTCTTGATGAT 
CTCTTCCAGAGTCGATCTGAGTGGC C I 1 1 I CTGCAGCCTCCCCTTCTTTCTCTTTGAA 
TG 

GAATTAAACCCAATTTGGAAACAACATTGACCCAGTCAAAAGCTT CTAATGGTTTCTTT 
T 

TCTTCCTtXAGTTTT AGTTTGCTTTT A7TAAAAAAAGAAAATAGTGCATGGCCATAGCT 

CTTGAGTTCTCTTATTGCAGACTAACCATCAGGATGGTATCAAAGCACAAATACTTrGGA 

GGGGAATGCGTTGAACTGGGGCAAGTACCTGCCC 

G 

Sequence 581 

CGTTGCGCTCACTGCCCGCCTTTCCAAGTCG*K^NAAACCTGGTCCGTGCCAGGNTGCAT 

tAAATGAAATCGGCCCAACCGCCGCCGGGGNAGNAGGGCCGGTTTTGCC 

CGCCTCTTrrCGCTTTTCTCGCTTCACTTGACTrCGCTGC 

CG 

GGCTTNGCNGGTCGNAGGCCGGGTANTTCAAGTCKTNAACTT CAAAAA 
Sequence 582 

NTN GAGC TC CCC GCGGTGGCGGCCGAGGTACCAAATTGT7AAATACTCC N AGGCC TTT AG 
GAACCTGTGACTGANTNCAT AAATAMCAGAN CCT ATATTG TGATG NTGG TNAAAGGACAN 
GTGCTCANCTTCCAATTACA 
Sequence 583 

ACCCTCCTGGAACCGWAATAAGTTNinGGGGGGGGTNA^ 

GGACCACAW>ANCMAAACT>UAGGNCTAGCTCANAGAA^ 

AGCTGCTGCTTCCCCTGGWGGAA£ATNGCCTGCT>ICCTGATA 

AAAGATTfKjTTNGGCAAAGCCGACANCNACN CCAACNACAAGAGACACTAAAGNGCN NGC 
NGGGGGGGCTNC CAGGGGAGANGAAANGGGAAGNC GGGCN GCAG CAAC NCNGG N CAAAAA 
AAACACCAANNNCNGGGGCNCAANGGCAC NAANCAGAACGGCNCGCCCN NNGGGANCCAC 
AGCNAAGAACCGGCC 
Sequence 584 

TTGG TTATACAACATTTGTTTAATAAATG CANTTTN CAAAGC T ACACANGACTT AGATA 

TGAAGCAGAAAAGGTGGTTTTACAGTCXXTGCATTAAGGTCTAATTCTTACT ACCCTGGC 
CT TTTC C TT CC TG TGTGC TTGTGGTTGGTTC TTT 

TCTCAGGCTTTNTNATTCTGATGC7GAGATAGTTCTGTTCACTT AGCAACTTGGGACA 
GT 

GACACAG GGTTTG TTCTGTAC AAGCAGGTTATCCAAGAGGCATCCAT ACCC T GGG TTTT C 
CTCCAACCATAAGGAAAATTGATGCAGCTGITTCTGACAAGGAAAAGAAGAAAACATACT 
TCTTTGCAGCGGACAAATACTGGA 



AGGTACCTGGGCCACCAAACACAGCTGGACTCAATATATGGGGAAGGTAAGTGTCCTCAG 
1 1 I 1 1 GGAGAGAGATTACCCTCTTCCAAAAGAGTGCTTGATTCTGGTAGTCCAAGCTGTC 
TCCGTCTGGTGGCACCCCAATTTCCCCTGCCTAGACCCACCTCCTTTCCTCAGCCCCCTT 
CGCCTGCCGCTGAAAAGTGAGAGCGGGCTCTTGCGTCCCCCGCGTACCTGCCCG 
Sequence 586 

ggggggn n aaacccngaagangcggn nn acgccn n n c agagccacan natttttggncga 
aanaggggnccagnnccgaggaaggnggaggaggncngnaggnaccnngggcggnnnaga 

ACNAGGGGANCCCCCGGGCNGGAGGAATTTTNNAl I I I I I I I AGGGGGGNGGGGGNNCCC 
NNGGGGGGGG 



WO 01/42467 






^^^^^GCTCCCCGCGGTGGCGGTCGGGTCAGCTTTAAAGCATCATAATGAGT AATTATA 
GG TGAAT AATTTTACAGACAG TC T ATATTCTAGGAGG CAGCTGTAGGCGTTTTAATTGGA 
AATAAGCATTCTGAGATAATGATAATAGCAGTGTAGAAAAATGAAGCTAAAAAAATTCAA 
AGTGTTGAGAATCCTCCTGTCGTTCTGGGATTTTTATTTTAATCA 

ATCGTGGTTTT7TATTAGGTAGATGCCCTGGATAATCCTTT 
CAACTTAATGAAACCAATATCCTTCGCATAC 



GAACACCGAAGAGCCAGAtnWTTAAGGWCAGAGAAANC^ 

ANAAGAACCGGGAAGGGAANGAAGGACAGGGAAGAGACCAANGAC^ 

<^TANGAACAAGCAGAGGCAGAAGCCAGGCAXX»*GGNCNA^ 

GATG^NAAGCNGf^NAAGGAGGAGAACCGCCGACAAGf^^ 

GGNNACAGAANNNGGCGAGGCCNHAGNACCCCGGCCGN^ 

GGCMGGAGGAANNCXaAMANCAAGCNNAANGAAACCGGCGACCCCGAGGG 



GCAGAACAGACTTCCAGCCGACCAATTm 

ACC TTTCCAC TTTTTGAGGACAN TGGCCAGGGGC NC TGGGCTACCCGATGAGAAAGCAAA 

NCAGCACAGCATTJCCGAANCAGGGGAAGAGAGG^ 

C*XGAAGTGTNCCA<>GGOICAACNCTAIWCCCN^^ 

NAANNCGNAAACACTGGAGGAACGNAAANC**CGGGGAAGCAGNC^ 

Sequence 500 

GCGGWGGTTTTTGGGGGGCAACACGCGGGAC^CANGC^ 
TGGGGGGAGAAAAACXXCGO^X^GAACGCC^ANCACC^ 
NAAACAAAAGGCCCANGCCCGGACOkCf4GCCCCGG^ 

GGGN N NAANN GN GGCCGAG MGGAACAACCAT ATAANAAATTNCCNCGGG NGGGGGGGAGC 
CGAAGAANMAACNAAAAAAAAAAANCCCNWANANGGGGGGGGGGGA^ 



ANACCGNCGACCN C GAGGGGG 
Sequence 591 

C GCCCGGCAGGT AC TCAGGTTTTATC TCTGCAC TC CAAGT AGGATGAAANGAT AAGAGCA 
AAGGCTCATG TTTGCCAAGTCTGTC C TTTTG T AACAAAAAAC^CAGCAGCTTTATCAAGC 
AGAATTCCACCTGTATTTCTTAACTTGCCAGAGCTGAGTCTCATGGCCACCCT^ 
AGTTGGGGAGGTATTTTTAACAAGGCACATTATCATCTCCCCCACCCAAAGTGGAGCTAT 

TGCTAATGAAAAAGATACAATGAGATGTTTATGAAATTATCTGTAGCTATTAATGTCAG 

G 

MM IGAAATTTACTGACCTGGAAGAATACTCATAATGCAATGTCAAGTGAGAAGCAGGA 
CAAAGA 



TTGAGCTCCCGCGGTG 

C 

AATTAAAAAAAJ 1 1 I I 1 1 I 1 1 GTAAAGACTGGATTTTGCCATGTTGTCCAGGCTGGTCT 
G 

G<^TTC^TGGCCTCAAGCAATTCTTCCTCCTCGGCCTCCCTAAGTGCTGGGATTACAGGC 

ATGAGCCACCATACCTGGCCACTTCTTCATTCTTGT7GGCTT7GCGTNCCCGA 

AA 

TTGGKGAGAAGTTCCTTCGGCTGGGCTGAGGACCCGNGGTCATGGGTGGATCTCATGGAG 
AGAGGGCNAGGACAG 



Sequence 593

gtgnattgagctt^ccgcggtggcggccgcccgggcagg™ 

CAGGGCCTGCCGCCCCATTATGATGATGTCGAGGTTn 



cgaggtacaggtgcgattctggatgacaaaagaagatgcttacttcacagaa^ 

^CATTGGGAACAGCAACCATGGCAGCCAATCTCCCAGGAATGTGGA^ 

tgggagtcatttta^ 

1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 aaggaggttttattgttttagtaattjttaacataacttaa 
aataagagaggggaaatgacatctggagatct aggtatgtggcccattgcaattgagcac 
atttcttgggtctgttrctctatctctaagggcagtctcaaaaccccagc 

Sequence 595 

TCACGGGTGGCGGCCGCCCGGGCAGGACATGGCCACCAAGTAAGAATGGW 

GACAGAAGGCTAAAACAGGAAGGTAATCTTGTGCACCTGACAAATAGAAAGAATAAAG^ 

TCAAAATTGAAGGCANGCTATAANAGTATCAAAGAAATTTC 

TTTGGAAGCACAAAAACTTACNGTTAACTGCTTNCCCA^ 

AAGAACANTTTGNGGCATTNCTAAAMTTTAG^^ 

TANAATNGGGAMACACNACCTACC AI 1 1 1 f 1 1 H I CTAAATCCNAAATTTCTCCCCCCC 
TCC TTCCCAGAAANAGAGAAATTTTGffTNAAACCTT CAATNT 

i TAC T ATTT AAGAAAAGAACAAGGTTAAC 
TAGCCAATTAAAAATAAGTTCAT 

aaaagcacttgaaattatatatttaacctgaaaaaaaagttgctaaaattccaat^ 
tgtaaatatctttaacttgcxraacx^^gctatccccaaaacagtgtaagtggggcaaaa 
tgttcaaaagaaaaatcatcxagtgcacgtaagatggggcagccaagaaggcta 
ccttgngccgcgtaccctcgggccgctctagaactagtg 



CCGCGGTC 
TC 

tgatgtttattttaatg<>xcttagtccacacagttggtataaaatc^ 

G 

caaaaacaaaaggtctggagtcttagcatcagaagggcaccatatatacatctacag 
gnggccaatacaagtcattgccagacagtccttggaggcacaga^ 

CAAGCTCTAGGAACTTCACGGGTCCCAAGGGGTNTAGACCWCTTGTT^ 

agccgtaaaaaaaaatgtggggaagttgatgaaggcttttatgatt^ 
gcgtacctntggc 

Sequence 598 

rcac **cgtccggggaimn'agtagaaaggcxxtgggtgttctaaaataaggctc7cctggc 
ccacggctgactgtcttccttgtgtctctacagttggaccgtgactctt^ 
gnctaccxxagcctgatcctctctgataatctgcggcaagtgcggtacagtt 
caggaccrrgcctgacaaoccxxiagaggttcaatctgtttccctgt^ 

TGCTTCATCGCCGGGAGACATTATTGGGAGGTAGAGGTGGGAGATAAAGCCAAGTGGACC 

ataggtgtctgtgaagactcagtgtgcagaaaaggtggagtaacctcagccxxccaga^ 
g^ttctgggcagtgtcttrtgtggtatgggaa^ 

atgactggcctacccccccg^ggncccccggttccaccggggtggggggat 
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ATAGAGGTTCTGACTCCTCAGGAGCAAAAAACATAACCTGAAGAGGGAGGAAGTGGATTT 
GGGGTTCACCATTTCTTGGGGCACACTTCATTGAAA^TGANACTTCTGAAGAGAAGGCC 

acaagatacaaagacagnccatnccagttgaatcctgtcttccaagaacagaagaaaatg 
atccaggccc^ggaatccataacactggaggatgtggctgtcgacttcacttgggaggag 
tggcaac tcc tgggcgctgctcagaaggacctg taccgggacgtgatgttggagaact ac 

AGCAACCTGGTGGCAGTGGGCTATCAAGCCAGCANACCCGGATGCAC^ 

ACAAGGNGAA 

Sequence 600

agtjtgagacaatggcggaaggctocatggcggctggcl'ici i ccagcccttcatgtcacc 
gcgcti cccagggggcccccggcccaccc t gcgciatgccgag tcagcctcccgcaggcct 
ccctggctcccaagcco:tcctncxtggcgccatggagcc^ 
atccgagcatgggcggmccaa7gcagagggtgacgcctcctcgtggcatggccagcgtgg 
GGCCCCAGAGCT a tggagg t GGCATGCGACCCCCACCCAAC tccc tcgcccgnccaggcc 
TGCCTGCCATGAACATGGGCCCAAGGAGTTCGTGGCCCGTGGG 

ATAAATAAAAATTAAAAAAATCTGTGCAATAATTT 
TAAAATGTGCTCCCAGGAATAGACACAAATGTTTTGAGTATCTTTTAAGCTGCATTTTC 
C 

TTTAGTGATGCATTTGTCAATTGCACTGAATTTAAATCTGAAAGTCAGAGGTGATTATT 

G

ATAGTACTTTTGTATnTGATATGGACAGTTTATTCATTTGCATACAG 
TCCCAGCTGATTAAAAGATAGTCAAGAAATTCT GCAATATAGCTGCCAAAATAGACAGCT 
ACATTTTTATGATATTCTCAT C I 1 1 rCTGOT rTTTTTTTCTTTTTTTTCI 1 1 AGCTATTT 
TACTTAAGCATAATAGCCACAATAGGACATATAAAAGATTATAAATACAGA 

TACCCCCATGCCACCCATGAGCTGTAGGGACCA 
ATNTCTNCGCTCGAKCTGGCCAAG 



ggggtcttactgaagatcttttagaaaacttaaatcctgtcacaggatatttagacatg 

T 

G T AGAATGTAGCTCAATTTTTT aaaaag t aactgacc tagagggtg^aagttgaaactga 

CACA TTTTCAAA TTTAAGA TT A TGCTT ATTTTGT ACAGAAAACAA T G TTT AAAGACCAMA 
GGCAGNATCTTGTTGTAWT GTATATAAACGCTAACACCAGGAU 1 1 1 1 1 1 AAAAACCAMAA 
ATTTAAATTTATTTTTAhKaCTTTTAATTGGAAAGGNTTTGGTTTTTTO nTTTCCTTTOC 
GAAACCCTGGGAGTTATTCAATTAATTTAATTAAAAACAGGGTNAGT^ 
C 

o " " 4 w J i ' J ^" r 

GKATTTTGGGGAACC 



CTCCCGGCGGTGGCGGCCGAGG TACCCAAAT ACCACTTCAGGAAA TC TGGCCAGATCACC 

tgaatccaaatgttctattaattcaatacacgttatcaagtcaaatccaagca 
gtctctctccacaacggagccatgatacaatgltmtggtxaaattcagatcccgaggt^ 
cagaaaatcccccaggaaaggagctaacgaatcccctctccatcg taatttatcctcatt 
aa t a tct ac tccaacaagcaa ttcaa tgcatgga ttgac ttttagcagccttaagagtga 
agt atcaccacatcccaggtgtgcaaccttcttaggctcatgttgatccactaaatt^ 

T 

AACGAACTGCTACCTGCCCG 



TTCGCCGGAATNTGMACTCCAGCTCTCCAGAGGATGNAAAAGGCTAfTTGGGCAAAGTTTT 
TGGGATGCCATTC TAN CTCAT AACCCACCCCANT GAAAC TNCAACCCNATTTCNCAAANA 
NAACNTTAAAATTGGGCTTGTWAATAAA^CCNNGNGCCGGCACAAAGGGCCGGCCCAM 
CCAT 

Sequence 603 

GTCCGGGAAAAATTACC TGTCTTGACTGCCATGTGTTCATCATCTTAAGTATTGTAAG 
CT 

GCT ATGTATGGATTTAAAGC G TAATCAT ATC TTTTTCC TA T C T ATC TGAGGCAC TGG TG 
G 

AATAAAAAACCTGTATATTTTAC I ITG1 I G NAGATAGTCTTGCCGCATCTTGGCAAGTT 
T 

GGAGAGATGTGTGGGAGNCTAGGAAAAAAAAAAAAAAAAAGCCCTTTTCAGTTTTGTTGC 

CACTh<5TGNTATTGGGACCCGTGTTAG^TTTGTATGCCAAGAATTTTTCTTGAAAAT 

GG 

AAAATGNTTTTGfrrmAGNACCGIOAGNATTO^ATACNCCGGTTAA 
TNGACCAAAAAG TC TTTGG C J M M II 1 C TTGGGTAATTG NT7T CCT AAAN G N T GG TT A 

T 

NTTGGTGGAN CTTTTTTT AACC TG GTTTAAT AANTTT AAAATN TGGC C C C AAA TT AA TT 
A 

NAGGTTTAAAAAATNATTAAGGNAATTTA 



GTTTAAAAATAACATTGGAATACAGGACTTGTTGCCAATTGGGTAATTTTCATTAGTTG 
T 

n I G 1 1 1 GTTTTGATTTGAAACCTGGAAATAGAGTAAAATTTGACTGTTTAAAATGTTGG 

CCACCGGTGCAGGTCTNTCCGNTCATCAAGCT 
Sequence 607 

CGGCCGATGAGAAGAAGAAGX5GGCCCAAAGTGACCGTCAAGGTGTATTTTGACCTACGAA 
ITGGAGATGAAGATGTAGGC^GGGTGATCTTTGGTCTCTTCGGAAAGACTGTTCCAAAAA 
CAGTGGATAATTTTGTGGCCTTAGCTACAGGAGNAGAAAGGATTTGGCTACAAAAA^ 
AAATTNCATOGTGTAATCAAGGACTTNATGATDCAGGGCGGAGACTTCACCAGGGGAGAT 



TTTAGNAACT AGTGGGAT< 
TCTGGANTACNO 
TTTNTTTGGTKTCCCNTTTTA* 



ATTTCCI^TATTNAAAGCTTTTAT 



TTGAGCTCCCCCCGGTGGCGCCCGAGGTATGCGGGAGCTGAO 

TCGGAAGGTCCTCTGCAGGTCCCCCTTCCGCTCTGCCGATCGACTTCCGCCTCGGGCAGT 
CAAGATACTGCCAAGGAAATCTGATGTGGAAAGGAAAATAGAAATAGTGCAGTTTGCTAG 
CCGGACACGCCAACTC TTCGTTCGATT ATTAGC TTT AGTGAAATGGGC t aataatgctgg 
CAAAGTGGAAAAATGTGCGATGATTTCAAGCTTTTTAGATCAGCAAGC 
GGACACTGCTGATCGCCTGGCGTCGTTAGCTAGAGATGCTCTGGTCCATGCACGCCTGCC 
TAGTTTT 



CCCGCG TCCGAGACAAT ACAAAG TT ACA TTTTT GG ACCA T ATT AAAAC TGCAA GAAGAC A 



cgcggtggcggccgcccgggcaggtagttcggccttgccgttagcttgtggagaacgtgc 

TTCTTATTCCTGGCAGGCTTCAAGAACAGCTGCACATGTGCC GCT AACTGACCGCGTTGC 
CATTGGCGACCTGGACTCTGAACTCAGGTTrATTCTAAACCCAGTGAGAGGTGAGGGGGA 
GTGATGAAAGGGGATCAGCTGTATTTGTGTGTGTGTGTGTGT<yVGCACCTGACAAATCTA 
TGAAACCCGAGTGAAAGGAGAAATGTTAGATTCrTTATTATTTTATTATATTTATATGGA 
AAGCTCGAC TC TCCCTTTGG TAAGTCCGAAGCA
Sequence 610 

GCGCGGTGGCGGCGGAGGTACTGCG I IT I IT 1 1 CC TATT ATAAAAGTGATACTGAAATAT 
GCTAATTAATATATTAATTTTAGTTAAATGCTGCTAATATGCATACCTCTTACTTGAAGG 
i m 1 1 AATATGTTTTGATAACT1TAATAACTTCAGGGTGATGTCTGTATAATTTTTAAAG 

tgcagctctctctaacaaatgtgccx:tacaactcctgattaaaccggc 
caaaaaaaaaaaaaaaaaaaaaaangtacctgcccg 



GTGGCGGTCGAGGTACTT ANGAGAAATTGGCATGCTTTGCTAATWTTTATGCAGAGGTAA 
CCATGTTGAnl NACAT ATGTANT G TTGAGAGGNA TG TCT AATTTTATGGTCNTAGGAAAAA 
TTAAAAGAAAACTGCTGC 1 1 11X1 UAAGTCTGAAATAHAAATGTTTACAACTTGACNAGG 
ATCCATTTGGTGGCT AGMCTCGCCTTCX^GGGNGGMAAAGAGAAf ATGCGAGTTCTGn^ 
TATGGACTHITCACANAAGCTA 
TGCNCACTTAATTTTroCCWTCCTGMCTTCWAAAC^ 



CCAGI 1 1 1 CCAAGAAATTTTGTAAAGTTGAACATGGCCATCTAGTCTTGCCTT AAAACT 

TTCTCACCACACCGACCTTCCCAX^TGCATGAT ATCXAAGGTCGACAGACCTGGA TT AGA 
ATCCACTCTCAAGCTTCTCATGCAGTGCGTATTGTATTTTCTGCATA 
^^AGAACACAGTAAGTGTA TTTGCC CAGTAGTC^jCATTGCCTACATATAGCCAAGTGTT 
ATAGT ATAGCAACTTAGTATATTTTTCAAGGAGAGCT AAAGCACC n T T G T A ATGNTTG 
G 

1 1 i^iuACTGTTATCTT CCI I TCC T ATAATTAATTTATTTTAATCTACAAATTGACATAG 
GGCT AAAAGCTTCAATATTTTACAAAATATTAATTAATGHAAATTGGTCCCAATTATTAA 
GAAA C 1 1 1 1 1 1 1 NCATTT 
Sequence 613 

AGGAAGNCCACTTTT GANGAGGC CA TTNAAAAN CNAACGG NNATGANCCCCCCACANNNC 
ACTCNGAGGGGGAGGTi 



AACCAAAGAGCCANANGCCCCGGGGOia'ICCA^ 
CGGGCCGO*>AGAACKAAGGGC«AACC^ 

AAGGCCMNAACCGAAAACCGGGGNCGGACCCCCGGAGGGGGGGGGGGC3CCCGGGGGAACC 

CXXAAGCCMfmNNGGGGNCCCGCNNNAANGGGAAGGGGGGGAA^ 

CANGGGGCGGNNAAAACAAAGGGGGG»«AAAAAAANGGCCCGGGGANA^ 

GAAAAAANAGGGGCNAAAANCCCGGKNNCAAAM^AANNCCCCCACCCA^CCANWSACC 

CWAGHCCOGGCGMCCCAAAAAAAAAAAGGGGGGAAAAAAGHCOCGGGQGGGGCG 

Sequence 614 

CCAGAGNT AACGAAACATTCTTT ATAAAGGTTTGAyACCCKCKGTT^ 
T AATTTAATT ACAAANGGAT AAAT ATGGT AACGGGT A TTT ACAGAAGGAAGGGNG TT ATT 
ACGGAAAAAGCTAACGGCACGACGTrTATTTTTCCGCCACAATCTTTCATACA^ 
ACAAANTGAAC TTGCAAAAGCACTAAAACAT CACAT G T AAACCCAGCT AACAGAAAAAT A 
CATTCACAAGCGTTGMTGGTGGGGGTGNGNATMGTGTGMGCTAAGGGNCAAT^^ 
GAAACAGAAGGGAGACTNT GGCACGGCTCAATTCTTTCCAGNCAiANAGMTACATGGAAGG 
TT ACAANCAGGG tgccccanaaaaaaggnacaccact ah TCAAT ACCCNCCAA t ACAAAA 
AGAAAACCAATMTTCTT CCNCCANTACCT AAAAAAGGAAACCCGGGGTAAAC 
Sequence 615

CGOTGGCGCCCGCCOGNCAGGTA l I > I NII IN M I) II I U 1 1 l AATTTTGCATGTAT 



NGCCTTNATCAAACT ATAAGCTTGNGGAGTGGCCAATATACTCCATTGNGATTATACACTG 

ATTTCCATCACCTGCCTTTTTACTATCAACTCTTATTAGA 

Sequence 616 

CGGCCGAGGT ACTG TGCCCTCTTTC TT ACTAGGTGACCGAGAGTGGTTTTGACTCCTGTG 
GG TGCTT GAAGTCATTCTCAGGGGTC TC T ATGACCTTTTCCCTCCTT3CAGTTCACTC T 
AG 

TTTCTTCTATTTCA TCATCCCGCACTGC T C TT AGCATCGAAG TCAC TGTC TGGATCTGG 
G 

TNTCTACTTTCACATCAAGTTTGAAGAATGCA 1 1 rCTCI fG MGGTATT CTGT I 1 1 1 IGAA 
CTTACTrCATTGGAGAAGCCC CT TG A I rTTTCTTCCTl I A TACCAGATCTGGCTTCACG 
A 

Sequence 617 

GTGGACGAGGGCAACCCNACTAGCCTAAAAGCCCGTOACACTrG 
CGCTTGCACCX^TCCGAAAGAAAAACGCGGGCT 

CCCAACCGTGCAAWrrGATGGTACCCCAAGCtXAAGCG ^ C Ml G GNAA 

AGCTGGCAACC«3GACrrGGG«XrrrCAAhVkC^^ 
CGANTAOCACTNAGTATTTGGCCATTC^^ 

WCCGCG At WTTGCITTGGWGGGWGCAAAATTCGCWCGCiNG AT 
KTTTTTAANAAAAAAA 

Sequence 618

COGCGGTGGCGGCCGAGGTACTGGGACAGTTGGGTGCGTT ATGGATACATAACCTGAGGA 
GCCCX3GGGGAAGCTGGCCTTGGGTGTTTT ACCTCAATCATATATCCACACAAGTGCTTCT 
CTTGACATTTCTCGAAAATGGGAGAAGAAGAATAAAATTGTTTATCCT CCACAACTGCCT 
GGAGAACCTCKGCCAGCAGAAATCTACCACTGTOGAAGACAAATAAAATATAGCAAAGAC 
AAGATGTCGTATTTGGCAAAATTGATAiCGAGGAATGTCTATTGACCA^ 
TTGGAATTCAATGACAAAAAAGGGGOCAAAATAATTAAAGAGGTTCT m AGAAGCACAA 
GATATG>GCAGTGAGAGACGATAACGTGGAATTCAGGTC£AATTT^ 
CACCTCGGGACCGAGGCCAGTGCCTGAAACCCA 

GGGATCATGGAGAAGGTTTATTGGCATTATTTrGTGAAAGTTGGTGGGAAGGG 
CTT CACCTGAGCCCACAAAAGACGGCAG TTTGCCCATGCCAAAGAGT ATNTT CAGCAGC T 
TCGCAG CCGG ACCATCGGTCACACTKTTATGATGAGGGAGAATTNA 
^^^ATATTTTGGCATTTATTTrCTAAAAA T AAACCAAAAATTGGAACCCAAAAAAAAAA 
A 

Sequence 619

TGGGGGCCGCAGGTACCT ACTATGTGTCAGCCATGGGGGGATACAAAC^TCTATAACXX^A 
CAAGACCCT CAGTCTTGTAGTCGCCTGACAGCCAGCCAGCTACAACATAATGTGGAAAGG 
ACAATGGTGGGAAATGCACTCAGGT CTTCCT AATGCACAGAGTATGCT CAGGCT GTGACA 
TCNGAAGAAAACAGATATTTACCTTAACACGGA^ 

GGGAGGAAATCCAG TT T T AAAAGTCTTGATTT AAAAAAAAGAAAACAC TTTCTGTGGAT A 
AAGATAGGCTGCAGGAAATGT AACCT ATGAAATTTTCTCAAATTAGCTTTCAAACACACA 
CAAAAAATTGCATTTGTTTGAGGAGCAGAATGTAACTTATATTAAAGAATAAACTACTA 
T 

TTAGT AT CTGAG TGGAAGT AGCTGCCCGGGCGGlcCGCTCTAGAAXn'AGTGGGATCCCC 
Sequence 620

GCCGCCGGGCAGGTACATTCTAATTTTTATGAGACATAGATATGTATTTATAAAAAGATA 
GATGCAAAGAGAAGAAATrAACTTAATrCTAAGAGCCAAATTTACTCAGA^ 
frCACCAAAATTAACAGCCAGTTTTCTTGA ITTTCTTCn GAAGAAGAGATTGGTGTTGC 
T 
ATGGTGAGATATACTAT6GCCTTGAGAGGCAGTTTCAACTTGAAAA6AAGAT6CAGGTTG

AGCAATC GGAGAGGAC TTCAAAGAAGCTGATGAGCTCTCCCGT GGACTTACTTTGACAAT 

GTTCGMGAATCTGGCTGGCTAGTCTGAACTGGAGTGGCTTGAGAACTCTGGGCTTCCTT 

ATTCTCAAAGnCTTTTTGGTTTGCAACCCTTTTTTTAGTAACCT 

T 

GATTGTGCACACCCCCTGGTATTCCCCCAGCCATGGGCATGGTCCCAGAATATAAAGTAT 
GATGGAAGGGCTTCCAGG 



GGTGGCGGCCGAGGTTAAGGACGCCTGCOCATGACAGAGCCTCAGGAAATCGCGATGACA 
GTTTACAGCAGGAAAATCCGTGGAGACAGGAGATCCCGAGAAGCGGCGATGTTTGCGTAG 
AACCCTGTACCTGCCCG 



TACTTTGTA 

TCTGAACTCAAAACAAAGTAGTTGTATATTTTAACATTCAAAATTGGGATTTCOCAATG 
T 

TGTGTATAGTATATAAACATGTAAAAATAGGTTGTATTTTACTCTATGTATGATGCTAAT 
CAATGAACACTTT ATrTATTTTACA GAGAAAACTTATCTGTGAACTTTACTATATATCTG 
MTATTTTACCTTTATT 1 1 1 1 1 1 II I AAATAAAAAAGGGGTTT 
Sequence 623 

AAAAAGCT GGCTGTGTTATGTAAATAAAGCCCCTATAGTAATTAAAATTTAAAAAAAGTT 
TTTTAAGCTGGCT G 1 1 1 I C CTACCACTTCAGAGTCCTTGACGCCGTAATTTAGGGTCC 
CC 

TTCAGATTTGCAGACAGAAACAAACAACAAAACAGTTAAGCAAAACTAACAATGGTCACA 
CAAATTATACAATTTCTGAGTGCTCTAAGTGCATTGGAAGAAAGCTGAAACTCCATAAAA 
ACATCACC TGCCTTCCATCATCATGAAAGCAGGAAAACTT GCCTTC TTGTTGGGAGCAAG 
TAAAACTCGAAAAAAAGAGGTGTTGTACCT 
Sequence 624 

ccxacggtggcggccgaggtacggcggggagccgcctggataccgcagctaggaataatng 
gaatanggaccgcggttctattttgttggttttcggaactgaggccatgattaagaggga 

Sequence 625 

CTC>CCGCGGTGGCGGCCGCC^GGGCAGGTACAAACTTTGATCTTCTTTGAAATGTGG^ 

GTCCACTNGCTTTTCTGTTTCrrGTCACAGTAGCTATAAACAGCTGTTTAAGGATATCCT 

T 

ATCTAAATTTCTGCCAATGAGGACGAATCGAT7TGTTCTCTCAGTGTCATCCTTCCAGC 
T 

CACTGGAG TCTCC TCNATCATAG AGCTGAT CCCGCGT ACCTCGGC 

TACGCGGGGATGAGTCCTAGGAGGCGCTGG
CTCTTTGGCGGCTCGGAGGAGCGGCTGCTGCTGC7GCTGCTGCTGCTGGTGGCCCCTTTG 
CAGATGTATTGCTGTCCTrGAATATTAGCCeATTTGAAAACGCCTGGGAAGTTCAGCCAT 
CAGTATGTCAGT ACCTCGGC 
Sequence 627 

CCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 1 1 1 CTTCCAGAAAAATTCTCCTTGAGGAA 
AAATG TCCAAGAT AAGA TGAATCACTT AAT AGC G T ATCTTC TAAATTT GAAAT AT AATT C 
TG TTTGTGACCTGTTTTAAATGAACCA^ACCAAATC^TACTTTTTCTTTGAATTTAGCM 
C C T AG AAACACAC ATTTC TTTGAATTT AGG TG ATACCTAAATCCTTCTT ATGTTT C T AAA 
TTTTG N GATTC T AT AAAAC ACATCATCAAT AAAAT AGNGGG CAAAAAAAAAAA N NAAAAA 




Sequence 628

GGNCGCCGGCAGGTACGCGG^iGGAAGACGGAGGCGGGTTCTACAAGAGACGTAGGCTGTC 

AGGGAAGTGTTTATTTCGCGTCCGCTTCTGTnXTCCGCGCCCCTGTGCTGCTCCGACTC 

ACATACTCGTCCAGAACCGGCCTCAGCCTCTCCGCGCAGAAGTGCCGGAGCCATGGCGGT 

ACCTNGGCCCGNTCTAAAACTAAGTGGATTCCCCCGGGCTGGAAGG^ 

CNTATNGATAC 

Sequence 629 

ATCTTCTTTTCGGCCAGTGGA 



GTGTCGGAACCA<aTTGCTGTGGTGTGCGAGGACAG1TCTGTGGACCATt3CCTGCGGAACC 
GCTATGGGGAGGATGTCAGATCGGCATTGCTGGACCCGGATTGGGTGTGTCCCCCCTGTC 

g tgggatctgcaattgcagct act gtcggaagc 

tacatagtgtcgcgaactcaaatcggcatttagat 

AGATC CAGTGG TTTAAACGGCACGTTTTTGCTTAT AAAAAAAGTGCAAAAAAGATGTGGT 

TTACAAGTTAAAGtrTACAGAATCCCTTTTTGCTGTAATTGCAC^ 

G 

GCAGAGCAGATTCGTTTAAAA C I II G 1 1 1 U CTTAAAAGCTT ACAGTGTTTGGCT AATT 
C 

TCCTCCCCTTTTTACAAGACGGGGGCCGGAGGGTGGACACTGGTGGCAGX5TTAAGGGATA 
CTG7CACTTTAAGAAGCCTGCAGATTGAAGTGTAAACATGGAGAAATTAGGGGCTGATTT 
TTTAAACTGTGTGAGATATTAACCAGCCCGCCCTGTTATAAAATCAGGAAATCCAAACAG 
CGATTT ACACCGATT AACACCCCCTTT AT AT ATTTTTT ACAAAAATACAC TGAG AAAAT A 
ATCAAACGTTTTCATC TC TCTTGTCTTTTTTT GTTTTTT AAAAGT GTCAAAAGTCT ACAT 
TT AAATAT AAAAAATT AAAAG TT AAAACTCT AGCCCTTCAGTGAAGGAGAC G T AAAAT GG 
CGTGGGTAACAACAACTACCAAAAAAAAAGAAAAAAAAAAGAAAAAAAGGAAAAGGAAGG 
AATAAAGAAATAAAGGAAGTAAAAAGAAAGGAAAGAAAAAAAGG 
Sequence 631 

ATTGGAGCTCX;CCGCGGTGGCGGCCGAGGTACATCAGCTTGCCTCAAGTCT GGAAAGAAA 
TTGGCTTGGGCTCATCAAGTTGAAGGGACCACCAAAAGAGCTAAGATTGCTTGTAATACT 
CATG TGGCCCCTAGGATGCACCGACTGGT AGTGATGAG C CAGG TTT ACAAGCAGACACTG 

G CT AAGAGC TCAGACACTC tggcg ggggcacatgtaaagattcatc GTTGCAACGAATC t 

TTT AT AT ATC TGCTCTCTCCCTT ACGAT C TGTGACAATTGAGAAGTG CAG GAAT AGCAT 
C 

TTTGTCTTGGGCC CTGTAGGGAC TACACTTCAC C T CCACAG TTG TGACAATGTTAAAGTC 

ATTGCTGTTTGCCATC^TTTGTCCATCTCTTCTACAACAGGTTGGATCTTT 

Sequence 632 

AGGTAC CACAC TGAGGGCAGTTTCCAGC TC CTCTCACAAAC AGT AAATCT ACACAAC TTT 
CACAGAGAGTGTGTCCGCACACATTCACCATCAGCTTCAAGGAGGGGTTCCGATATTTGG 
T G GTC TT ACAC CGAGGGCAAC CCTGATCG TCCATGGCGGTTTC C CTC C T ACAG AC TCTC G 
CAGGCGC C TGTTT CAGC CAG AGC CACCT AGAAGCC CCC TCCCC GC G TAC CACC A CACTG T 
C C CAAATT ACCTCTTCATT AC C CAAATCAAAGAATC TTTC TG TTTTCCC AATCCT CAAA 
A 

GGAATGAAGAAAAACCAAAGAGCAAACTCAAAAGATGATTTTTACCATAAACCTCAAATG 
TGGC TTAACAAGTACCTGCCCGGGCGG 
Sequence 633

GCCCATTGMTGTTTGTTTGTTTGCTTGAAGACCAAGACGG^AGTTGGGCCTCT^ 
AGTGGCTGCAAGAACTGGGATTCCCTCT<2CTTCTCTCTCTTCCCCTCTC 
TGCCCGGGCT 
Sequence 634 

GAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGG TAC TGAAAACCAC TTCCAGAG TCT AAAG 
CAGCTCAGATGTTATCTCTGGGGGAATTAGTGTTCCCCTCATTTAGCAACCTCCATACCA 
CAAG<5TCTCTGTCTGTAGTTACTGGGATTATCCAGATACACTATCAATGATACAAATTC 
A 

TAGGAGTATTAATGCATTTCTTTAAACACAACTT GATTAAGAAGCAAATATGTTAAGCA 



:taaattacagttagacacttcagtatcttctctttacatgtgt 
atataaattagtaagaacctg<>tccaaagcaatgtagtgtgtgtatgtatctatatat 

A 

tttattctaactcagc^cttcagaagcctttttgagttac^ 

catctgtagagg t aaaatttctatatt accaagctc cagaggaatatgatatttt acagg 

cacaattttctggctgtagtccctggggcatttcatttgctg<k:ctcca 

ATC TGTCTCTTA 
TAATCTTTCTATAGAAATGGCCACAGCATTATAATATTCAAA^^ 

c 

TGAGGATTTTTAhKjAAAAAAAAATCAAAGGACTTGCCAAAAGGATAACTACATAACAGAT 
ATGACAATCTACAGGACAAAAAGACAACATGTCACCAAATATTGTTCATACAACAGCGTT 
AATGGAAAACAGTAAAACACCTTTTAGCAGTGTX3CATGTTAAGTCTTTTAGTAAGATTA 



AATAGCACCCATC 
AAAAAAAGTCAAAGAAGCATTTCCCTTTGAATTCAGTCCTAAAAATATG 
ATTAATTTCAAAATAAG TATCTTACAAGT GTTTCATGAAACATTG GTTTT 
Sequence 636 

GTGGCGGNCGAGGTCTAAAGGGCAAGGTTCACCACT ACAAAAAGGAAGTTGTCTAAAAGC 
AAGAATTCAATTAACNGCTGGGTAAGAAAAGTCAAAACACTAATGAGTTGTCCATGAAGC 
CAAC T GCT AAGAACGCGCTCAACT AT ACCGCCGACA TT GAAGAC ACT ACGCAC GAAGC C T 
TACTTGGCGAGTCTGAATTTCTATTAACTAAGGGCAGAGTGAGGGAGAACAAAGAGCCTA 
C TTCC G T AACATTTT AG T ATCCAGAT AGT ACC TGC CCGGGC CGGCC GCTCT AGAACCT AG 
TGGGAT CCCCCGGG CTGCAGGGAATTTC KT ATATCAAAGCNTT ATCGATACCCGTC CGAC 
CTTNGAGGGGGGGGGCCCCGGTACCCCAGC I I I 1 1 G TTTC CCTNTT AAGNGAGGGG TT AA 
ATKT GCCGCCGC TTGGGCNT AATCATTGGGNCATAGG CTTGTNTTCCCT GN GG TGAAAAA 
TTG^^^TAATNCCGCTTCACAA^^TTCACCACCAAACCAATACGGAAGNCCGGGGAAGCAA 
* TAAAAGGTNNTAAAAGGCCTTGGG 
Sequence 637 

AGCTCCCCGCGGTGG^GGCCGAGGTACAGGAAAGGGAAGCACAGTTTGGAACAACAGCAG 

TTGGAAGACAAAGG TTCAAAGTCC TT GAGCT AAGAACACAG T CAGATGGAATCCAGCAAG 

CTAAAGTGCAAATTCTTCCCGAATGTGTGTTGCCTTCAACCATGTCTG^AGTTCAATTA 

G 

AATCCC tcaataagtgccagatatttccttcaaaacc tgtc TCAAGAGAAGACCAATGTT 

GGCCTC GC TGGG C TG T A TTCCTT ATATGATGCTGA GACC TT AATGGACAGAATCAAGAAA 



CAGCT ACGTGAATGG ggatgaaaatct aaaagatgattctctt c cttcaaatc caataga 

till ICI I ACCAGAGT AGCTGGCTTGNCTTCCTAATGATGATGNATTGAGAATTCAGCT 
T 

CTTT 

Sequence 638 

CGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGAAAACTAAACCTTCATTTACTG^ 
TCTTCTGACTGTGGCTTCCAGATGCTAGTTTACAGAACAA^ 

TATGCTGAGTTGACGGAACAATGAGTAAA<^TAAGGATATTACTGTGACTTTGAAATTCT 
GAAATTGTT CTTTCTTAACTTTT GCATT AAAATCACATTT ATTTTATAAAATAATGAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
Sequence 639

Ca^CCGCGGTGGCGGCCGCCC03CNChK^TACATGGCX;CT^ 
TCTTAACCACGGCAGGTCACCGAGGCCTCGGAANTCCCT^ 
ATCAGCTGCT AGACGGGT ACCT 
Sequence 640 
AGACGATTG 
TGAGACCTO 
CACCATGGAGTACCT 
Sequence 641



AACATTGCTGTGGATTCCCAGCTTTCAGACAACACATGACTAAGAC 

GAATTCTGGAGTTATAAGGCAGAGGTCCCCCATCTTCCCGAACTGGCCT ATTCCGCTAGA 
AGCAAC^TGGCTGAACTCAATACTCATGTGAATGTCAAGGAAAAGATCTAT GCAG TTAGA 
TCAul IGI I TAATGAAATAGTCCT 1 1 1 iGA 

TAATGTGGATAAAGCCGTGCAAGCCTTTGTGGATGGCAGT GCAATTCAAGTTCTAAAAGA 
A 

Sequence 642 

TCCCGCGGTGCCGGCCGAGGTACTTGGAGAATATTTCCAC^TAGCCGAT 
TGTTGACAAGAGAAAGTTCTTT GGCTGTTACCCTCAATGATAGTGAGGTCCATTGCCGTC 
TATTAAATGGAGATGATTCCATCTTGTCTACAGACACTGAAATACCTGGCT AAAAGCCGC 

CTGCCC 
G 

Sequence 643 

rcCGCGGTGGCGGCCGAGGKACNAGAAGCTCACTGGCTGTGCTAAACCAAA 
AGCGCCAAAAGTGATTTTATACCAAGGG NCCATNCATACAAATAAACAAAATCCTATCCT 
CI ICT I ICI ATATNhTTNTTTCnTACATTTCTTATACAAATAACAGAATGCTTCATTTTAT 
TCACTTCAATAGGACAAAGTCCTTAAAGAAAGACTGAAAAGAGCTGATAATCAAAATC 
AAATTTT ATGCTTATTTTTGGGTTAGNCGCT ATCAATTTTCTGACATATTAACATAGGCA 
GGAAAACATTCTCAGTAAATTGAGCATTTGAGTCTACAAATGTCTTGAAGCACT CTGGCA 
AGTTACATGTATCCCATGTTGCTTrTGGNTTCCCATCTC^ 
T 

GCAAG N I I I I INI II I I I CGGGCAGNCTGTGAATTTTCAACCTCCTTTTT 



gagctcxcgcggtggcggccgaggtacaccctctggcctctcccaagcaagcagtgaggt 



cgcaagcagctgctacttcttgggcttctctagaatattgaggaatttcccccgtgtcat 
ctctctggactcatccagcc^cagctgataggctaggttctgtaggcctcgaaccttctc 
CATCAAATTAGCCGTGGTGAGACTCCCCAGTTCTTTCAACATGTCGATG TCATCACGTTC
TATCTCAGCCAT0CA7TTGGGTGGAGAACTAGTAATAGGACTTTTGAAGGAAGCTGCAM 
TTCAGCAACACCTGGTAATTGTT C TGGCCAAAGATCTGGTGAGGCACGGTCAAGTTTTTC 
AAAACTTAGCAAAGATGCTTCCAGATCTGTCCCCGTCTGTGGGAGACGCCATCTTTCAAC 
CCATGTCACGTCOCCGCGTACCTGCCCGGGCGGCCGCTCGAGCCAGGAACCGTAAAAAG 



CCGCGGTGGCCGGCCGCCCGGGCAGGTACTTCAGGGAGGCCTATATATTGGCAGGCAAGG 

AATGCCA GGACT GCCACCTGCTGCTCCACCGTTAGCCTCACTCGTGTGCTTACTCACT^ 

GACTGCCTTTTTGTCTATTTCTGGGAGGTTGGTAGAATGAAAGGGATGCTCCAAGGCAAG 

CAGATGGCCTGTCCACCTCCTATATATTGACAGTtKX^ATlGAGTGTAGAGTCrrTGCTACA 

AGAAACAAAGTT^TGAGAAATGCX^GGCTTCCTGTTAC^^ 

TACTCTATCCTTTAGACCAGAAAC I U N CI I CTAAGCACTTGCCTACOGGGAAGGTT 
GA 

GGAGTCTTGTTTTACCGTACC 
T 

Sequence 646 

TCNCGCGGTGGCGGCCGAGGTACCGGCCAACCCTGGTCOOC TTCTTGTrGGGCACTGTGT 
ATGGGCGGAGAAAATCCAWC ITGTTCT1 GCTGATGACGCAAAGGTCAATGTTGCTTCCGG 
AGOCCAGGTTCACTGAAGATTGOCANMTGCCGATGGCTTCGCTC^ 
TMCTNCTCCTCCATTGTCTGGCCTAAACTTTATCTTCAAATACAGACCATTGCTTGCTC 
A 



ga aggg ataggatgtcttgtattagggtnggagagtcccaggtaaatctatggtactncc 
cccctraafwacctnagnntctogcaacccaa 
aaggcattctgncttttnagcmatccgatttg^ 
tggtccagrrgnnagaaggggagttgggagcntccag 

Sequence 647 

^*GGTGACAGAGCGAGAGAGACTCTAAAAAAAAAAAAAAMGAAAAAGAACTGTTGAGGGA 
TACACAATATGTCAAAATATTAAAG C I I I M l I I AAATTGGGAACNCTCAGGATATTGG 

ATAATTAATTAGGCAATGATNCAAAGATGTTTTGTTTTAAAATTCANAACCCNCCAAAG 
G 

TNNAAACCNTNGNAANAATTTTTTGGGTITCGCCGCCCCCCN N II I H H I INTWNNGC 

TTTTTTTTTTTTNNNNGCC 
4AAAAAAAAAAAAANANGGGGGGGGGGG 

rATTTTTTTTT 



GTCCCCCCCCTCKI 



TGGCGGCCGCCCGGGG 

h ah urn i i iiiii i iii i 



rTTTTTTTTTTTTTTTTTTTTTTTTTTTT 



1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 M II 1 M 1 n I CCCNGGGGAANNCCCCNTTHTNNGGNNTTT 

CCCCCNNNGGCNCCNANANGTNAANCCNNCNNANCCNNGGGGNNGGGGNCCNCNNNNCCC 

NWiNG NGNN NNAANNNGGNT>tNGWGGGNGGGGNAAAAAAGGGGGGGCCNAI4GGGGNGCCC 

NCCCCNTTTN NC TGGGGGGN NAAAAAANGGG NCCCCCCCCCCCGN NAAATTNGGGGN N N T 

NAAAAAAht^NGGGGNCCCCCCCNG 

CCCCCCCCCCCCNGGGGGGG 



TTGACTCCCGCGGTGGCGGCCGAGGTaCACGATAGGAAGAATGTATATTCTGTGGTTGTT 
GGGTGGAGTGAATGTCTATGAGGCCCCTGACTTCTTTCAT7CAGGAACACAGATTCAGAG 



CTTCTGCTGTGCAGTAGGGGGCATCAATAGTTC A I I I I C I III I A TTGTCTGCTACCAT 

CCATTGTATGGATTCAACCTAGTCTCTTTATTCATTCTCCCAGGCTT7CCACCAGGCC 
AT 

ctc1 11cacttcx3ggggcacct1tcccagggagatgaagagacacaggttggcctctgct 
gggactcgacatgtctccccgogtacctgcgcg 



TTGACT(XCGCGGTGGCGGCCGAGGTACTGAGTGGGGAAGAAGGTAAGAAACACGTTGAT 

taacaccctgtgttctggcaggtgggatcagw 

caaggatgtcccitnxjactgcagaagaggtggaatttgtggtgg 

gttctcxiaagatgaatcttcaagaaataccacctt^ 

C 

CTCt^AAGGGAAGCAGAAAGAGTGTTTTGGAAGGAATC^^ 

TAAGCAGCACAATGAGGAACACAGTGGTGACGAGCT ATTGGA lU ll tj l CACTGTGCCATC 
AGGTGAACTTCGTCATGTGGAAGG^CACCATTATTCTACAC A T TOTGT 1 1GCCATCAAA 
TT 

GGACTATGAACTAI 
CAATAATAACTTAAGTCCCTT 
Sequence 651

GACTCCCGCGGTGGCGGCCGAGGTACTGCGTTATGGAGAGGTGTCCAI 
TCCTGGAAATTAACATTGGCTCCACtrrrCCAGCAATTGCTGGACC 

TTTTGAACAGCTTTAATCAGCAAGTGAI IOI11 1 ULACTGCAGCCCTTCT ACCGCTGGAG 
GAGGTGG GTCOCT OCTGGGGGTTGTTATGATCCCT GC 1 C 1 UCATGACGGTAAATGCCACC . 
TGCTACCACTTTTAGCC 1 1 I fCCI I GAGAAAATGCAAATTTATCTOCTAGCACTTAATC 
A 

AAGAAGC TTTGAGTG T AAATTGGGATTCTCTGGCAACAGAGCAGCAG TATGAAGAAGGAA 

CAATGTTCTCAGTCTTCTGACATTCCAGCTGCTCAACTCAAGACGTCTCA^ 

T 

TGGCAGCCGCAAAGCCTGGAAGACTGCTTGCAGCCCGAGCAGTTTCCTCCTGCTGC 
GCGTACCAGTGAGGAAGGA 

Sequence 652

TTGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGGCCAGGTCTCA^ 

CTCCTGGAGCTGCAGGCGGCGGGAGGGGCTACAAATGCTTGACTCAGTGATGCAGAAC^ 

TTCAGAGTTAGCTGGAAGCCACAGCCCTGCCTCTTGATGCAGCCTGGATCCAGGC 

GAAGAGGAGACCCCTTCCCTCTTGTGGGGTTTGGATCCTGTGT^^ 

CTCT ACATCAGGGATA TCCTGGACATGAAGGAGTCX^SCCAGGTGCCAGGTCT ATTTTTG 

TACCT 

Sequence 653 

TCCCGCGGTGGCGGCCGCCCGGGCAGG TACCTGTGAAC TGAGGAATTAT AGAT AAACC TT 
AGGTCAAATCATTTCGCAATTGCATTGGTGGTATTGAAAAATGATGAGATTTCTCTGACA 
GAGAGC TTTG TCC T AGTTTTTGTTCTTCAT AGGTCAAAAC TGGGAATA II C I L 1 1 G TCT 
G 

CAAGATAAAGTGTTTGTGCTTCTATCACCATATGCATGAACATGTAAGAATCAGATACAA 
TTTCTGC TTCATCAG TTTCACATG TTCATGTTG TCACT GAAAAAAT GCATC TAC TGTTT 



TAGCTCCCAAGGAGACCCCAAAT( 



iATGGAGTCTTGCTCTTGTT 



CCCAGGCTGGAGAGCAGTAGCGCGATCTCAGCTCACTGCAACCCCCACCTCCTGGGTTCA 
AGGTGATTCTCCTGCCTCAGCCTCCCCAGTAGCTG 



GACTCCCCGCGGTGGCGGCCGAGGTACCTGTTACCCTTTCAAAAGTAAGTTCTCCATCCC 
ATAAAGCCATTTAAATTCATTAGAAAAATGTCCTTACCTCTTAAAATGTGAATTCATCTG

GAATACTCCTTGGTCAATTGTAGGTA I fCTTl 1 1 G GTTrAATTTTTGCCAATGTAATTAA 
AAAATOQTATGTCATTTTTAAAATTTGTAI fTCTI I CA TTACAAAT AAGATTGTTATGTC 
AGTATTGTTATTGGCTTTTCG TATTCC TCTT AACG TGAACCGTC TG TTCATTGTTTTTAC 
CTGTT ITCGTTTTAGCAAGTAAGT ACCTCCCCGGGCCGGCCGCTCTAGAACTAGTGGGAT 
CCCCCGGGCTGCAGGAAATTTCGATA7CAAAGCTTAATCGATACCCGTOGACCTCGAGGG 
GGGGGCCCGGTA 
Sequence 655

TNCCGCGG TGGCGGCCGaGG T AOGCGGGGGAAG TCGGCCA T GGACTGGAAAGAAG TT C TT 
CGTCQQCQCCTAGCQACQCCCAACACCTGTCCAAACAAAAAAAAAAGT 
AAAGATGAAGAAATGGATTTATTTACAAAATATTACTCCGAATGGAAAGGAGGT AGAAAA 
AACACAAATGAATTCTATAAGACCATTCX^XXSGTTTTATTATAGGCT GCCTGCTQAAGAT 
GAAGTCTTACTACAGAAATTAAGAGAGGAATCAAGAGC IUILI 1 1 CT ACAAAGAAAAAGC 
AGAGAACTGTTAGATAATGAAGAATTACAGAACTT ATI 
ACAOCACCTATGATTGGAGAh«3GAAGCCGATGATC>ATTACCAAAA 



CGGTGGCGGCCGCCCGCCTGGTACGCCCAAGGCATTTAATGCCCACAGTAACAGGGCTGT 
TTGACAGTGGCAGAAGAGGACGGGACT AAAG TTACTTTGTGC TGAGAGGGGGAAAGAAGC 
ACAAAG nTGGTCTGl I GCATAATTGAATTTTTAACACTCTTATCCACAAC^AACACTTT 
1 1 U> I b I CC I GCTGTGTAAAAGACATTIAGATATATTACAGATTTTCAAAGAGGTGAGCAT 

Sequence 657 

ATTGG^GCTCCCCGCGGTGGOGGCOGAGGTACATTOCAATOAACA A 1 1 TCI I C ATTCTGA 
TCTDCTAGAAGACAGCAAATACCGAAAAATCTACTCCTTTACTCTT AAGCCTCGAA 



CACGGGTGGCGGCCGAGT ACCTTGTGGGCATTAGGTCANT MTTGTTATACAC TTTCACAA 
AAGATTTTATCTTTGATCTCTTGGCGATCT ICTTCTIGCCCATGGCAGCTGTGACTTTG 

GGGGGTAGCGGTCAATTCCAGCCAGCAKAGCATGGCTTGTAGGGGCKATCT GAGGTGCCA 
TCATGAATGn U 1 1 AAOGATNACAGCTTTGCGTCCGGAGTAGCGTCCAGCCAGGACAAGC 
^CCACHCTTCCCAGGTITCATGAACTTGCCCATTTCGGCAGCAACCACCCCGGGGCNCTA 
CaGCAAAAAAGGCCCCCCGCTG T ACTCTGCCCCGGGGCGGGNCCGCTTCT AAGAAACT AG 



CTCCCGCGGTGGCGGOOGCCCGGtaCTGGTACGGCCAAGGCATTTAATGCCCACAGTAACA 
GGGCTGTTTGAGAGTGGCAGAAGAGGACGGGACTAAAGTTACTTTGTGCTGAGAGGGGGA 
AA^aAAGCACAAAGTTTGGTCTGTTGCGTAATTGAATTTTTAAO^CTCTTATCCACAACA 
A 

ACACTTrTTTXSTGTCCrrGCTGTGTCAAAGACATCAGATATAT^ 

T GAGC ATCCTTTTACGAGCTGGGCAGGTGGGGAGTGGCGTGGTTTTGATG 

ATTTGG1 rGAATGAACGCTAAGATGGCCAGACNCACCTCTTNGATCTCAACTCTGCAGCC 

TGGG 

Sequence 660 

CCGCGGTGGCGGCCGCCGGGCAGGTACTATGACCT GAAGAGGGCAGAGGGCATCACTGTT 
GGTCCGGTCTOCACCTGGGGAAACTGAGGTTGGACAGTGT CTCTGTGGTGAGGAGCAGGG 
CTTCATCCAGTGCCTCTGTCCCCACCGAGGGGACTATGGGAGACATGG^ 

caacaggtga^actggagccagctgaaaactgggagaccgacgcagccaacaaacaatgt 
cggtctt^tgtcttggcacctgcaggaaacaagctcctacttccagaaaaagtgctcctgg

GACTCCAGX^TACCAGGCATCTGGGTAAGCTACAATGCTTAACCACTT aacacaatcagg 

aagcaacagccatgcattcggggaaaggaacttcagtgtt^ 
CTAAC 1 1 1 IC 1 1 iggtacctcgggccgntcta 
Sequence 661 

ttggagctccocgcggtggcggccgacgtaggcgggagacgac M il l ictcacgatgaa 

tgtcaccccagaggtcaagagtcgtgg^tgaagtttgctgaggagcagct 

tggatggactcaaggcaaaggcct 



GAGCTO 



TTTTATTTTTTTATTTT 



TTTTT 



IIIIIIIIUII 



TTTTTGGNCfUNANAAACNAGTTTTTTNAATTNATT^^ 

GANAANCO*C>fKNGAGGGCTNT KGGGGNGTNTCCHGKGGCNNGGGGGKKAGGGG TNGGGG 
NCT>K5GGGNGG^TTTRAGGGGCCCNNGNCCCN>^ 

gggcmggnggaattcgati 



TCCCCGCGGTGGCGGCCGAGGTAl 
TCCAGAGAACTGCACATCATGGATCTGTGGCAGACCAGGTGGtlAGAGACAGACCCAGGAA 
GGAGAGCAAGGCCCCCGCGTACCTGCCCG 



TNCG<K3TGGCGGCCGCtX^3GGCAGGT ACGCGGGGGCGGTATCTGTATCGGGCC^ ACTGG 
CTT>4ANGNGCNN^TTCCCTT(^ 

AKCCNCTN 1 1 1 1 1 1 NATCNCGNCAGCANCTT AAAATGCCTGGGAAGATGGTCGTGATCCT 

TGTSAGCCTCAAATWACITrGGATAATGTrrGCAGC^ 

C 

CACCOCAGAATCTAGATATCT TGCTCAGATTG GTGACTCCGTCTCATTGACTTGCAGC 

ATGGGGAAAGGTGACCNAATGGAGGGGGACCACATKTTACGCTTGACA^ 

TTAGG 1 1 1 1 1 GGGGACCGAACCACTCTTAACCTGGTGGCCAAGCAACCTTGGNGGAATCT 

ANGGMAAATTG 



TPCCCGGGGTGGCGGCCGAGGCT AAC AAGGAAAGCCCCTGGAGCTCCTGTAATAAGAATG 
TGGTTGGAAGATGCAAAC TGTGGATGATCATCACCTCCATTTTCCTAGGTGTCATTACAG 
TGATCATCATAGGCTT ATGTCTT GCTGCAGT AACTT ATGTTGATGAAGATGAAAATGAAA 
TACTTGAATTATCATCAAACAAAACATTCTTCATCATGCTGAAGATTCCAGAGGAGTGTG 
TTGCTGAAGAGGAA TTGCCTCACC TGCTGAOCGAAAGGCTCACAGATGTGTACCT 



GGGTGGCGGCCGCCCGGGCAGGTTTAATCTCAGGTCTCCCTCATACACTTCTCAGCCTCA 

GCACCTAACCCTCACACAACACTCCAGTATTGATGCAGTCAATCTTGTATAACATTTTT 

T 

GAATGTCCAATGTGCAAAGCACGATGTTGGAAATTATACAGAGGTGAATAAGACAAAAAC 
TCTTGCTCTCAAAGATGTCAGTC 1 1 1 1 IC1 1 1 GCAAGGAT AACACATGT AGAGT AAAAT 
G 

CAT AAAGGGGACT AATTTT AAATGTACCT 
Sequence 667 

GGCCGAGGTACTGGAGAGTCGGCTTTGACCATGGCCTCAGCTCAGCTCCAGGTTTGGAGC 

GGAATAAAACAGGAGCTAGCAAGATGTCTCATCTGaGCTTCCCAGTGCCCAACTTATCTG 

AGGCCTGGGGCTGAAGCCAGCGCTGACGGAT 
Sequence 668



GTTGGAATTTTAAAGAAAAGAAAGCaCAAAGTAGCACTCAGATGGCCTTTTTTTGTAAAGT 
GAAGTCAACCTAATACTCTGGTGCTTACTTTGOWTCTTTTCCATAAGTCAAGTATTA 

6 * 

TQTT AACAAT ACAQTT AAGAAGT AAGGA TAAACCCATCAAGG TCCACAGCTAAATAACCA 

GCAGATTCCCAGAAAC7TTATGTATT7GGGAAAAGTAAAATATACAACAGACATATCCCT 

GCCCTGATTAAGAGGGTAGATAAAAACAAAACATAAAACAATTTTACTTGAGATAGTAAT 

AAGTTATTTGAAA 

Sequence 669

^^^AATAAAATCTGTCTGTTACAGCGCCAGACTGAAGGACGGGTGCCTGTTTTCAGCC 
ATGAGGTAGTCCCTGACCATCTGAGAACCAAGCCTGACCCTGAAGTGGAAGAACAGGAGA 
AGCAACT GACGACAGATGCTGCCCGCATTGGTGCAGATGCAGCCCAGGTTGGACTGAGTC 
ACTGCCTTGCTGCCCCATCCXX^ATCCCATCATGAGAAGCTAGGCATTACCATTCCTGTCT 
AGTAGGGATACATAGTTGGTTGCGCCT AAGTTGCTTCTGGCAGAACCCAAGGAATAAATT 
TCTCCATATCGTTTNCTAGTT ACCCTAATCTCTGCACAAATTTTGTGTGTTACAGAAGC 

GATCCAGAGCTTGAATA 
Sequence 670 
TNCGGGTG 



TAAACTTTTAGGGT 
TAGAGATATCAANGGGGCTAATNTTTTATN 



TGCCTATTTGCAT 

GNCAGACCATGGCGG^ATGTAAAATTAGCTGCTTTGGTGTGGGCTGCAAAAATAACAGCTA 

ccattgcaaa^cgaaaatctttcattggcaccccttactggatggccccagaagttggaa 
gcagtagagaagaatggtggctacaaccaactctgtgatatctgggcagtaggaataaca 
gcaattgaacttggagaagttcagccacctatgttttgatctccacccaatgagggctct 

CTTCTTAATGGCA AAAAAG TAATTTrCAGCCTCAAAACTAAAGGGCAAAACAAAATGGGC 

atcaacattcataatttttgtcaaaatagcacttatcnaaaaaaaaaaaaaaa 

Sequence 671

GCTCCCGCGGTGGCGGCCGAGGTACGCGGGGTCTTCTCATGCTCCGTGATGCATGAGGCT 
CTGCACAACCACT ACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGCGA 
Sequence 672 

AGCTCCCGCGGTGGCGGCCGAGGTACTCTTCTGCACTC 

TCi 1 1 i i CGAACCTATACTGTCTTCTGTAAATTCTTCTTACT ACCCTATGACCCGTGAG . 

C^ACCACTTTCCGATGCCAGGGTTCJGAC/JCCTCACCTGGCATAATATAAAGTGTTTT 

TTTTAT ACCC TTCCACTTGGAAAGAC TACAGAGGAATdTTGCACTGCATASTTCAAACTA 

AAAAGAGAAGA«3n/f\T7ACCT<SAAAAGCAAGAGAAAACM^^ 

CAAGGOMATpATTVAGAAGTGTCTGGTATTT^ 

QTCATAApTAAATGTTrAGGAATAAAGGATGGAqACATGCTrAXTTTATTT^ 



Sequence 673

GGATTGApQTC^CCGCGGTGGCGGCCGTGCCTCTTAATCATGGCCTC^GTTCCGAAAACC 

aaatacgaaccgcggtcctajtccattattcctagctgcggtatccaggcggctcg 

TGAACACTCTAATTTTT7G 
iTACC 






AGCTCCCGCGGTGGCGGCCGAGGTACTGAAGCCCACCAGTGTCCGGATGGAAGTCTGCAT 
CTGAGGTTGCTCAGTGTCCCGGTCATTCATTTACACATTTTAACTTGCATTAAAGAGCT 



TTGTTnrTGTGGCCTAGACTCTTTTGACTGATCTGAAAATAAACTGG 1 1 1 1 1 1 ICAAAA 
AAAAAAAACAAAAACAAAAAAAAACACAAAAGCTGCATGTCTAAAATTACATGGAGTT AG 
TGTCTATTCTTTTTCCGCTTTTGCAGCAACTT ACACAGCATTTTTAACACCTTTTTTTTC 
TA GI rTTTTTSI I C GGI 1 1 I G III I CCATCAGGAATTTGAGTTCTCTCT AACCCAGCTT A 
CTGTGGGACATAGGAAAACTCAGTAGAAATACCTTTGGTGATCTTGTTGAGTTTAAGTCT 

GA tc ttggatctt aaac tca 

Sequence 675



*^cc<^ggtggcggccgcgagagcac^tgatgaccacgcgatcgtccagtatgagtgggca 
ctgctgcagggggacccgtcagtggacatgaaggtaacgcatgttgtcactgctggcagc 



tcxxgcggtggcggccgcccgggcaggacgcgggaaggattctgtaagtatgtagcagtg 
tttcttaggtaaaagtctctttttgctactgaaagggaaatggtctctaaacactggtc 

A 

ctgtagcaggtaaacactactgtaacgtggagaaatgagcttcatgctgaggtagtggtt 

G€CTTANAGCTGTTNTT*I*TNCTGNANAAAI4C^ 



CCCCCCCAN NTTTTTTGNAATI 

CCC C CCCC NCNCGGG N NNAAATTNTNTTNAAANACAT1 
TCCNGGGGGGGGGGGNNGGCCCCCCCCCCCTN 
Sequence 678 

GCTCCCCX3CGGTGGCGGCCGAGGTACTTGTGGCATGACGTCGATGATCGAGTTCANGGCT 
NTCTCCAN CTN GGNCNACATGATGCCCACGG N CTNGCCCCACCAGG TC TTNTGAAAGACA 
G NTGACAN GAGACATCCN CGCGTACCTG N CCG 
Sequence 679 

hrcCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTGTTGTGATCGGAACGTGTCGATCCCCT 
CTTCTCATCACTGC TG CTCCAACTGGATTT ATT ACTC CG GGAATG GTCTGAGGGGGAAAA 
CCAATG TGTTTAGC GTGC CTGCCCACCTGCG CCTGAG CACAAC T ATC C TGCAATCTGACC 
TGCC C C TC CTGCACAGGAAACCACCTTC CCC TCCC AATTGATGGTTCAAACAC TGC CACC 
GCTGACTGCCCTGCATCTGTGGGTCTGTAGAACAGAAAGGCAGAACAACTTATTTTTTAG 
GATTTAACGACAACCGGTTGAAAAAAACCGGTAGGGGTGTCNTGCTCACAGAGAATAAAG 
ATTTGTAGAAAAGGNGCTGAACTGCCAAGGAAGGCATTTCTTGTGCCGTGTCTGGAACCG 
TGTATCCTTACTACATCACTGAACGACACCAAAGCACCCCATGCACTTTTTGGGTCCAAC 
CT 

Sequence 680 

N ATTGAGC TCC CCGCGGTGGC G GCCGAGGT ACAAGGGGAGGT AATGATGGG AGC TCCAC T 
CCTTGGACCACCAGCTGGTTCTGGACCGTATCCCCATGAATCTGTTTGAACGTAAGGAGG 
AAGTC AAAAAAGTTC TTATTT AGGGTTTCTTTGAGATGTGGGGCCACTTCC ATTCCCA 
CC 

CGGCACAGGTAGGCACGCGCATACACCGACACTAGTGGGTCTCCGATCCCTCTGATCATG 
CATGTCAACCGGGGCAGGCACTCTGAAATTCCCGTTTTGGAGAGGAATTTGTTACATTTC 
AGGATGGATGCC TC CACG T AAAATC TTGGAATGAGTTC CC TGATGGAGGCAATCTTGAAA
AACCAATTTAGGCATG I 1 1 LC 1 1 GGCCGTGTCATTTGCATTCTCTGGAGAAAAGTGAT 

GGTAAGACGCTGCGGCTATCCACACACATGGAAAAGATGC 
Sequence 681 

gcggccgaggtaccctaatgtagtagtaaatttaagg<x;tgtcgaggaaattttaac^ 

TCCAACAGGTGACTATATCAGGAAGGAGAAAACCAAGTGCTTCCTGCTTCACCTT CTGCT 

GCTTTTGGGACTTTTTATGAAGCCTAGGTAGNCTNAGGACANGACCCT^ 

TCACTGGGAGAGGAAAACCACCAGGCTTCTCAGCTATTGGCTTGGCAACTCrrGG 

CTATGGCTTCGATCAGGGGCTCCAGGCGCTGATAAGTGGCCTCAGGCCAGGNAGGGAGGA 

TTCGGNGTAGCCGGGATTGGGGAGCAGCTAGGTNCAGGGAAGGNTG 

CCAhTTCCCCCAAAACCACC^TTTGGCCGCNATGGA 

GNT AAGTTTCTGGCATTGCCTGGCCGG^^^TGGGATGCCTTCTTCGGGACTGGCTCCCAGG 

GCCGAATNTTTTTCAGGGTCTTGCAAGCCCGGCT 

Sequence 682

TTGACTCIVKCCGCGGTGGCGGCCGAGGTACTCTTO 

GCTCGGCTT GCGCCCGCCGGGTTTCCGGAGATATGTTGTATTCGGCTGGGTCGAGGGTCT 

CAGGCAGAGTGCGCAGGCTCGACGGCTTATACTTTGGGAACGACATCTTGGCGA 

GCACAATTGCGCCTG<X*X^TTCTGAGGCCCTTTGT^^ 

ITACGCGGGATGGCACATGCAGCGCAAGTAGGGTCTAC 
AAGGACGCTACTTCCCCT ATCATAGAAGAGCTTATCACCTTTCATGATCACNGCCCTCAT 
^TCATTTTCCTTATCTGCTTCCTAGTCCTGTATGCCCTTTT^ 

ACTAACTAATACT AAGATCTCAGACGCTCAGGAAATAGAAACCCGTNTGGACTATCCTGG 
CCGGCCTTATCCTAGGCCCTAATGGGCCTCCATCCTTACN ^TTTTTTAAANAANANAAA 
NGGGGGAANGGACCCtOCNTTTANAAAAAAAA^ 

W>KjGGCCCTNGGGGNTTTTT AAAAAAN NGGGG GANCCC CCGGGGN GGGGGGGANTNTTT 
TTAAAGNTTTTTT CCCCCCCCCCCCCCGGGGGGGGGGGGGNCCCCCCCCNTTTTTT 
Sequence 684 

CCGCGGTGGCGGCCGAGGTACCCCATGCAATATANTGGCTCTACAATCCTCAGCATGTTA 

ATCGAAGCCTTGTTGAGCTTCACAAAGGTTCCATTGAAGATTTGACNGAAGGCGAAGAAG 

CTGCAACACCTTTCGAACCTTTCGGCTCACTCCATTGATACCTCTGATTCTGATGACAAA 

CGCCAATTTGGGTTCTGCAGGTACGAGGACATTTTGCCCCGCGGCTTGTTGGGGTCTCCT 

TTACCCATGTTGACAGATCCGCGTCCACCCGAGGGTATTGGAGGGTATTCTTGCCTGGTG 

CCAGCTTTTCCTCAGAGTCCCGCAGAGCGGCCGCTCT AGAACTAG 

Sequence 685 

CGGTGGCGGCGAGGAC I 1 1 1 I I IUI I I II II I 1 1 1 H I II 1 1 1 G AGATGGAGGTTTCG 
G 

NTCTTGTTGCCCAGGC TGGAGTGCAAT AGAGCGATCCCAGNT CACT ACAACCTN CGCCTN 
CCAGGTTCAAGCAATrNT>TCTGCCTCAGCTTCCTGAGT AGCT GGGATTACAGGCATAAGC 
AACCATGCCCAGCTAATTTTGTATTTTAGNANGAGAT^^ 

GGNGGGTTTTGAACCNCCCCCCNNNGGGGGNCNCCCCCCCTGGGCTCAAAAAAANGGGGN 

GGTTAANT ANGNGGGGGGGNGGMCNNATATTCCNCCCCTTGTATAAAAAAAAAANANCNC 

CCCCCNCCCGGNGGTGTGGATATA/MTATTTNTACATTNTATNTTTTNTCCWCCCCCC 

NC 

GGG 



TGGTTTTT 



M H III I III I I IN I M I I 1 1 1 1 I I 1 1 1 1 | I I 1 1 1 1 I 1 1 I N I A AAACWGAAAT 
TCTT 

CAN NC TTTATNAAAAAAGGNCNT AANGGG CC TN TTATT AAAAAAN GNNTAAAANCCCCAN 
*^"nGNGGGCCCCNGGGGNGGGCAGGGWTGAflANCCCTT AAAGGG 
Sequence 687 

GCCGAGGTCGCGGGGGCTTACGATGGCNACAAGTATGGCGGCTGCTAGTGGTACGATTTG 
AAACGTGCTGAAGAGTATCX5AAGAGCGGAAAGAACAGACCCGGAATGCCAGGGCCGAGGT 
GTTGCGCCAGGCT AAAGCCAATTTTGAAAAAGAAGAAAGGCGTAAAGAACTT AAGCGACT 
TCGGGGTGAGGATACATGGATGCTACCTGATGTGAAT 

iTACACT CGCCAGCGG TTTTGCCACAAGAG T ATACCGG A 
ACAAAGGAGACAN GGCTCATTT ATAATCTGACGCGGCCACCC TCCT GCT GCGTTCGGTTT 
CCATTGGCTGGGACGGGACCTCACCTTCTGTATTTGTCCCGACTGGCT AGCACTT AGAAC 
TTTTTAAAAGAGGCAAA 
Sequence 689

CTCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAACTGGGCACTGGATAGGTAG H CC I » I 
GGTGGTCAAGGTGGCTCTACCTGTCCTTGAGCTCTCX3TGTCACTCGCTTGGTGATCCGTC 
CACACATCAGGCCAATGAGGAACAAT ATACAGATGCT CCCACTGATCACAGAGAGAATGT 
AGTTCTT AGATGGAGACG TCATT ACTTGCATGG CAAGATCAGAGAAGCC ATCTGC TGGGG 
CCACCTAGAATGACACAAGGCAATGTGAT1 
C T AGGTCT AACTTATCCC TCTCAGTTCCTAG TTT AATTTAT* 
G 

gaggggcaggagataagaaaaatt 
actataccccacgtctactgcaccaaaattaggaa 



CCCGCGGTGGCGGCCGCGTTTGTTTCTTG^CGTCCTGAGCGATGGAGCGCGGGGGTGCCT 

GGTT ATTGTCCGCTTTCTCTCTGAGATGCTTGGGTTGTTTTrCAAGAGAACCT 

A 

T ATT CATTGC T CCA TCGATTGGATCC AG T CC TTGTTCAGAAAA TT G TTTCAAGGCACTT 
A 

AGGCTGCCTGAAAGCCTTGAATCGTTGCTAAATATrCCAGTTGNTTTGAAGGTTGTACCT 
Sequence 691 

GCG G TGGC GGCC GAGGT AC T ACAGGAAGAAACT AGAGGAAACGGGAATTTCATCCATG TC 
CTGTGTATCTGCTGGCAACAGGTCAGAACCGGCCAGTATGTTATTCCCTGCAGGCTGCCT 
AGGGTGCTCT<XTCAAACAGATCACCTGAGCCTCCTGCATCTATGAAAGTTATGACACAG 
CAACCAGTT AC TCAG AG TC TGATGAGAAAAACA GATTTT AGG TTTGGGAAAT GGGATT AC 
TGTAATTTACACATCCAAATGCAAACTGGAGCTCTGATTTGAATTCTACCCTGGGGGAAG 
AAC TTT GATGC T AACCCAACAAGGTACCC TGCC CGG GCGGGCCGCTCT AGAAC T AGG N GG 
GATCCCCCCGGGCTGCAAGGAATTTCGATATCAAGCCTTaTCGATACCCGTCCACCTCGA 
GGGGGGGGGCCCCGGGTACCCCAACTTTT 



gag tgactccccx5cggtgc03gccgaggtacaccaaatgtgacatcctttc^ 
attactrcatacgacattgtcaaggaaaggactagaagaattttttgatgacgc 
ctgggggcaagaaaaagtaaaatctggagcagcatggacctgtcagcaactaaggaacaa 

AAGT AATGAAGATTTACACAAACTTTGGTATGTCTTACTGAAAGAAAGAAACATGCTTCT 
AACCC T AGAGCAGGAGGCCAAGCGGCAGAGATTG CCAATGCCAAGTCCAGAGC GGTT AGA 
TAAGGT AG T AGATTCCATGGATGCATTA GAT AAAGTT G TC CAGGAAAGAGAAGATGCC CT 
TAAGGCTTCTrCAGA CTGG TCAAGGAAAGAGCTAGACCTGGTGCTTGGAGAAGAGACATT 
NTTTNGGAAGAATCATTTTGGCCCAAGGTTCAAGCWfrGGGGTTATTCGTTTGGC^ 
TAAATTAAAAAGGATNCCCATT 



Sequence 693

TCCCGCGGTGGCGGCCGCCCGGGCNGGTACCTCAGGGACATTTAAGA6TTGGACGGTGCA 
AATATATTCCAAAAGGGTGCAACATGACAGAGTGTATCCCCCTGC rrcTGTn*n G TAT 
A 

TT7TTCCTACT 
Sequence 694 

GGTCTCTG TTGGGGCTCCCCTTTCCTGAAC TTTGGCCAAAGACAACAGGATATTCTTGGG 
GGTmCTTGTTGTmGTTGGCATNNTnCTGTGCCTGTTGGTGATTCCAGCACAGN 

cc 

AGWGANCCGNGTACCTGCCC 
G 

Sequence 695 

GTGACTCCCCGGGGtGGCGGCCGCCGGGGCAGGTACTGTATAATGGAGGCTGACCAGAGC 
AGTTTAGGAGATTQTAAAGGGAG GI 1 1 I OIU AAGTTCTAAAAGGTTCTAGTTTGAAGGTC 
GGCCTTGTAGATT AAAACGAAGGTT ACCT AAATAGAATCTAAGTGGCATTTAAAACAGTA 
AAGTTGTAGAGAATAGTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 



NCCGCGGTGGCGGGCGGCCGAGGTACAGCAGGGTGCCTCATGCAAG^GAGGACTGAGTGG 
A 1 1 1 1 CCT7 AGGGATATTTATGAACCTTAAAGCAGGAGCTT AAAGGGAATTTGGGCCATA 
TTAACCAC TTAGGTCATGATAAATGATTACATTTTTGGACATTTTGGTGTCTTAATXrrC 
A 

GCAAGGGTTGCACGATAAGTTTTGACATGCATGCATGGGAGACATGTAGAAATTCTAGTT 

TCTAATTACTTCTAAATTCCTCACATAAGGAGTTTTTGCCTXrrGGATG^ 
G 

GNCCTAGGGWGATCTTTGCCCCTTTTATACTA; 
TGGGGWKTN/ 



NNMGAATTANAACCCCI 1 1 1 1 11 1 NGGNGGGAAAAAT1 
CCCCTN1TTTTTGNGGGGGGNANAAANCGCCCCOCTCGGGGGGGGAAAAAAAAAA 
Sequence 697 

CGCGGTGGCGGCGGCCCGGNCAGGACGCGGNGANGACAGCGNCAGGCGCTTGATTTC^ 

GAGTCCGGGTGCCTCAMCTGCCCAGNGCCCACGTTCGTAAGAAGGCAACAAGW 1 I II 1 1 L 

CTCTACAGAAGGATTTTGGAAACAfnTCGGCMGNTCCAAATGATTCTGATCGCAAATAC 

CTGGAAGATTGGGCAAGAGAAGAATTCAGAAGAAACAAANGTGCCACCGAAGAGGATACA 

ATCCGGATGATGATTACTCAAGGCAATATGCAG^TCAAGGAGTTAGAAAAAACACTTGCT 

TTAGCAAAATCTTAACTATAGCATTATTCTGAAGGGA 



AMCCTCAGCGCGGTGGCGGCCGAGGTACGCAGKCCMCCTGTAGGGATCNGTNTTGTTCNT 

GACNAGCCCTACGGTAATGCAGCCCGGAGCTTGTTTTCCGTAGCTGGGGAGAATCTTCTG 

TCCTTGCTGTTCATGTCGTGGAAGAGAGGGGCAGAG7CTTGCTCTGTCACCCAGGATGGA 

GTX5CAGCGGCGTGATCTCAGCTCATTGCAACCTCCACCTCCTGGGTGCAAGCGATTCTCC 

TGCCTCAGCTTCCCAAGTAGCTGGGATTACAGGCGTGCACCACT ACATCCAGAGACTGGG 

ACTACAGGCATGGATTTTCAGGTTTATAACATGGCAGAGTGAATTCTGGCAACACACTGA 

GTGATGCTTGNCAATGGCGACTATCAGGAATTTAAAACAAGATTT . 



CGGNGGCGGCCGAGGTA l I U I I 1 1 I I I H I I ti l I A GTGTTTTCTGATGTCTTTT 

CTAACAAATCTTTGCCTCCCCAAAAGTCTCAAAAACATTCTCACGTTTCTA^ 

C TTT AGCTTTT GTGTTT GGGACT ATGATC CAT ATTT AG TGAATTTA I I 1 1 I GGGGGGGCA 

GAGTCX^TGTTGCCCAAACTGGTCTGGAACCACCAGACCCAGCTAATTTTTGTGAATTGC 

GGGTACCAGCACACCGGCGCCGTCCTGGACTGCGCCTTCTACGATCCAACGCATGCCTGG 

AGTGGAGGAC TAGATCATCAATTGAAAATGCATGATTTGAACACTGATCAAGAAAATCTT 

GTTGGGACCCATGATGCCCCTATCAGATGTGrrGAATACTGTCCAGAAGTGAATATGATG 



PCT/US00/33312



GTCACTGG 
Sequence 700

cggccgacttga: 



iTGGCACCATCTT G TCCCTGACCTCCG 




ACCGCGGTGGCGGCCGAGG^ 

tcagai mrrcrrn ia ggtcqaggagtaagatatatgatacgaaaatgaaaattataat 
ncttcttggattcct gggagccacattgtcagccccacttatcccacagcgtctcatgtc 
tgcagcaatagcaatgagttacttcttaatcttaatm^ 
cttc*gggcccacttaattcatggattct>cc7ttct 

CAGGC TCAAATTCCAGGACTCT CCCAHj I I C I CI 1 1 A TCAGCTCTAGACCAG 1 1 fOCT GGA 
CTGC TCtXiAAATCAAGATACCCTT AACAGGAGAGGCCAG TTTTGCCCAAGGAGCCCAGGC 
AGGCCAAGGTTGATCCCTTAJ 

CAOGGGGATGCCCTATGTA I FCTGCT I lAAAATGGCTTA 

caatacctatncaggtttacatgggc 
ccgcggrggcggccgcccgggcaggt actgcaagcaacagttactccgacgtgagatcat 
caagaacacgtagagaaaoccagctgtaatcatgcatggagatacacctacatrgcatga 
atatatgttagatttxx^accagagacaactt^tctctactgttatgagcaattaaatga 
cagctcanaggaggaggatgaaatagatggtccagctogacaagcagaaccggacagagc 
ccattacaatattgt aacc tt7tgttgcaag t g tgactctacgcttcgg 7tgtgcg tacc 

T 

Sequence 702 

GCGGTGGCGGCCGAGGA C I 1 1 H 1 1 1 1 1 1 M U II 1 1 I I I ATGAATTATTTA \ 1 1 I CM I 
CTCANAAAAGGATGCGCCTCCACTTAGCAAGGCTGGGCAGGATGTGG7TCTGCATCTGCC 

Sequence 703 

GGTGGCGGGCGCCCGGGCAGGTACAAGACCTTGACACGCGCAAAACACTTCCTGCAGATG 
TTGNGGTTGGAAAACTGTCGTCTT ACAGAAGCCAGTTGCAAGGACCTTGCTGGTGTCTTG 
GTTGTCAGCAAGAAGCTGACACACCTGTGCTTCGCCAAAGAACCC 

GGGTGAAG1 1 7 CTGTGTGAGGGCTTGACTTACCCTGATTGTAAACTGCXGACCTTGGTGT 
TACAGCAATGCAGCATAACCAAGCTTGGCTGTAGATATCTCTCAGAGGCGCTCCAAGAAG 

cctgcagcctcacaaacctggacttgagtatcaaccagatagctcgtgg^ 
ttctctgtcaggcattaagaagaatcgaaactgtaacctaaaacacctacggtntgaaga 

CCTATGAAACTAATTTGGGAAATCAAGAAGCTGTTGGAGGGAAAGTGA 
Sequence 704 

CGCGGTGGCGGTCTGCCCAGATCCATGAT( 

ANGATATCCCTTCAGCCTGAvGGGAATTGATGTTGATGAACCCGGAGGCATCAGTTGGCTC 
ATAATCACCCTGCACGTTCATX3CTCACCAGCTCCTNATTGTNNAGAGACAGMCNGGGACT 
CCCGGC CGAGGATG T ACCT 
Sequence 705

CCGCGGTGGCGGCCGAGGTCCGACGCAGCAGGCTCCGAAGATCATACAGACGCCATTACC 

AGTCTTGGCTCCCAGAAACCTCTGCGCCCCGCGTACCTGCCCG 

Sequence 706 

CCCTTAGCGTGGTCGCGGCCGAGGTAOGAGTAA A 1 1 1 1 Li ATT ACCTTT AATTAGGCAATG 
TTTCTT AGATAACCAT AAAAC TGCAAAAGCAATTTTT AAAAATG TAAAT AGGACTTCAT C 
NAAAAGTAAACGCTTCAAAAGATACTACTGAGAAAGTCACAGAATAGGAGAAAAATCTGA 
TGAGACTTTATGTCTAGAGTAATGAA1TCTTGTTAACGAATAACCAACCCCCTTTTAAAA 
ATGGGCAAAAGATTTGAATAAACATTrCACTAGAGACAATAAACAAATGGCCTTAAGCAC 
AAGAGATGCTCAACATCAGTAATTATTAGGGAAATGCCAATCAAAACTACAACGAGATAC 
CCTATATCCACTAGTATGGCTATAATAAAAAAGAGTAACAAACCGTTGAGGAGGATATGG 
AGAAAC TCGAGCCC TGG TCAGGT G T GGTGGATCACACC TGT AATTTCAACACTTTGGGA 
Sequence 707

C CCTT AGCGTGG TCGCGGC CGAGGT ACCCAT A TCC AAGGC TTATTGCAAC TTTT AGTC TT 
GCCC CTGCTACTTACACAGTCCAGAATCACTTGGGTGAGGATTCCAGTAGGAOGGTGGCA 
TTTT AGGATTCAGAAT ATT AACCTATAAACCTGTCATTTGATTCTTGATTATTAATGTCT
GGATCGCCTG TGGT AGGGGTG t aatcccaggaaggcattaaatatatttgaattaatgta

TATTTTGAGAATAAAAGGCTATTTCTAGAAAATATTACACACTTGTCTT ATGTTAAATAA 

AAA TTTGCT ATTT ATTGAATATCCCTTACCCACCC TTC TTCCCAATGAAGATCTT ATGCA 

TAtXrrT CACTCGAAGGTTTAAGATGTGACAATC 

TTCTCTGmATATTTTO4AACCGCCAiiAGCAAG«^ 

T 

Sequence 708

CGCTTTCXjAGCGGCCGCCCGGGCAGGTACATC L 1 1 IT G CATCCTCAAGAGCCCATTCTTT 
' TCATX>TTCGGAAGCAACAGCGGCAGTCCCCTGCCCA^ 

ATATT^TTGCTGGAGTGATCTATCAGGCACCAGACTTGGGATCAGTTATAAACTCTAGAG 
TGGTAAGTGTCTTCACATTCTTTAAGCACT AAAGAAAACTTTTAATTAGCTACCl I G C1 1 
CCAG TAATCAAACT AGAGCTC^TCTGCCTTGTG TAAGTTGCT A TAAAGT ATTGACT ATTA 
GAATGTCrTGAACTTTGGTTACTGMGAGCCAAAGTC^ 

CTCAATTATATAGTAATTTANGTTCTGAAAAATAGGTTCTGGCTTtGCATATGT AATATT 
TTG TGAGTArrTACTTTGGAAAG TTTGGTC^ACCTAATGGAT AAATTT AGAAG TTTATTT 
TCCTT ; .v 

Sequence 709 

CCCTTAGCGTGGTCGCGGCCGAGCTACAAGCATG ACTTTTCTAG 
AAAGTTGTT AGACTAATTTTTCAACAAAAATTCTTTA T l GTCTI GGTAACAAAAGAAGCA 
TACTAAAAATTCTCAATAAGGCACAGTGTCTKTAGAAGCTnW3CA 
CTAATTAACACGAACTTGTGCTCTTATTTCy^G^ 
GAAGATGCAGAGGAATTTTACAATGAATTACTTCGATCAGCTGCA 
TTGX3GGAGACAATTACAAAGATNGTTTTA' ■ 

Sequence 710

CCCTTTCt>GCGGCCGCCCGGGCAGGTACGOGGGCTAATCC^ 

CATGAOCTCATCACTTOCCAGAGGCCTTACCATCTAATACCAATAC A I I GtJU 1 1 1 A GAAT 
TTCAGCATGAGAATTTGGGGGAGACAGTCAGACTCT 
ATTTAAGAGACACTTAAAAATG^TCAGAAAGGAGAGGATGAAGGCT^^ 
AGCGTTGAACATGGAAAGGAAGTGATGACTGCAGATAT1 
ACGCT AAGGGCGAATTCCAGCA 



LMI IIII I III W I H I I I I llMII INGAT 
AGCCATATACCAAATAAATGTTCTGTGACTAGGGGTTATGGCACAATGGGTATTGAGACA 
CTAAAAACTCTGCTTCAGGCTTCCATCCTCTTAATT^ 
TTTCTGATTGACATtrrTTTGGTAGATTATCGGGTTT^ 

TT TAGA ATGA 1 1 1 1 C I II 1 1 G 1 1 CTGGGAAAAAAAATGCATTCTAAATCAMATTCACTAA 

TACTTTGATTCACTTCCAAGGAT 

Sequence 712

CCCTT AGCGTGGTCGCGGCCGAGGT ACTT ACAAAAATnTTAACATTAGGAGGT AATTAT 
AAGT AGATTCTGTGATTAGGACTTCATTCATGT ATCTTTTGCTACATAAACCTTTGTTAG 
ATTAAATGGAAGACACCTGCTAGGTGATACTTTTTATAAAACATATGAGTAAGTCATATA 
TCTTTGTT AAATTTCT GTATG TTCTTTrTTGT ATAAAGATGGAGAGAAAGGATGGAGTGA 
^^^AAGGACCCTAATAACATCTCTGTTCAAATTAATTACTAAGTGATAGAAGT ATTGAT 
ATGCCATTAAAGATTTGCCAATTCtATTT 
Sequence 713

1 1 1 CGAG CGGCCGGCCCGC GCAGGTACTGACAC^U3GACTCCAGGCCACACATATCT 
TCTTGAAAGCCC T I FT CCTGTTTGAAAAAAAGATGGTTTGTATTTGATAGAGCAAAAGAA 
GGCCACAAAATGAATrGTCTTCTTGTGGGCTGTGTTTCAGAACGGCCGG 
TGCTGACCnGAAAGACAGAAATTTTCAGAmGAAACTCAACGGACCCCA^ 
TTGGCTCAAGACCTGGG TTGCTT CATTCAT ATTTTCTT ATTTCCCCAGCCT AT AAGAGCA 
TA1TTGTGTCTTGTAAGGTGCCTGG 
Sequence 714 

CCGGGCAGGTACATATGCACT ATTT AGAAT ATGACATTAATCAACCACTAGAATTAAAAT 
GAGG TTATAAATCCTCAAAA TCACCAGAAG TAT AAATTT AAATGAAAAACCCAGACCACA 
GAACAAAAACAGAAAT ACGAAAAAATAATCACAAAATATTAAAAACAGT ATATAAACACA 
GTGACAGAATTaGGACTAAACATATCTGTAAAACAAT AAATGT AAGGGT AATCT CACCAA 



^ATGAAAAAGACCTTCAGATCATATTTTAAAACAAAITTTAAAAACTCAACTGTATGTTT 

ATGGAAGAGAGAGATTTAAAAATAAAGAGACTCAGAAAGCTGGAAATAAAAAGAAAGTGC 

AAAGAAATAGCAAACAAATACAGGCATAAAAAAAAACAAAGATCCCAATAGTACCTCGGC 

CGCGACCACGCTAAGGG 

Sequence 715 

TGG TGT AAAATTTGAAAACCATGAAAAAAT AAAACAAT AAAGGATC T AGATGCTAATAAT 
GTGGTTAGTTAACATGTTGACCATTTT^AAAGCAAAATAAGTCTTTGATC 
TCATAGCAAGATATAAGTATTTAATCTG-GAAAGACGTGGATTTGAAAA 
* T GTAAAGAACAGATTGCTAGATT ATT ATT AATAATATCTtTA TAAAT ATT ATATTTATC 
AATAATGGGTACCTGCCCGAGCGGCCGCTCGAAAGGGCN 
Sequence 716



iAX 1 1 1 CGAGCGGCCGCCCXXSGCAGGACAGTGGTGTGATCTTGGCTCATTGCA 
CCTOCTGGATrcAAGCGATTCTCCTGCCTCAGCCTC^ 

CTCCCACCATGCCCCGTGAATTTTTGTAT! TTTAGTAGAGACAGGG 1 1 1 CACGGTGTTGG 
CCAG«TGGTCTTGAACTCCTGACCTCAAGTGATCTGCCTA 
TGGGATWGGGCGTGAGCCACCATGCtXACCTCCTX^ 
GGCATTTTrATGCTGATCTAAGTGAAAACCTGGATATTTT 
TAGTTCTACCTATGACATGAGGGTGATCTTTATAATI 



1 1 ■ 1 1 C i CCAAAGTTA1 ITC1 



TAAAACAJTGCTTAANGGGAGAGTTTGGXSGGA^ 
GGATTTTTCGGGT 
Sequence 717

CCCTTAGGGTGGTCGCGGCCGAGGTACTAATCTAAATGCT AGACAG TTCAAGT GTAGCTT 
TGGAGACTT ACAGATAGCCAGCT AGAGAACT ACCAATGATGATATCCATCACGAGGAGTT 
TGGTGGCCA GCCTCC AAGATGGTCCTCAATGATCTTTG^ 
GTAGTCCCCTtrrCTCAGGGCATTAGGGTTG^ 
TGGTATGTCACTTCTGAACTTAGGTTATAAAAGACTATGACTCrr^ 

TCTCTGTC 1 C I V I UATCTTACACTCTAGTGGAAGCTGCCATATTGTGAACCTCATGGAAG 

GCCCACAGGGTGAAAAACTGAAGCATCTAATCAACAGTTAGCAAGAAACraAGCCTGNGA 

^CAACCAT^jTGAGTGAXIOOOGGMAAGATTrTCCAGTCCCAGTCA^CACTTGAHATAACC 

GGCAAC^X^TTAAGCTGACAGCTT AACTGCNANC TGATAAAAGACACCCTTGGGMCAAAAC 

CATTNGGAACCATTCATACCCCA 

Sequence 718

GATATCTGCAGAATTCXJCCCrTAGCGTGGTCGCW^ 
CATT3GCGACGTCTCTG*>TGAGGGACCCACGAACCAGCT 

GGAAGCATCAAGTTCACAGCAGTAATGCACACTGTGGCAGGAGMTGGCTTGAACACGAC 

*GGCGGAGGTTGCAGTGTGACGAGATTGCAjCCATTGCACTCCAGTC 

GAAACTCCAJCTGAAAAAAAGGAGAAATTCTITTATTTT^ 

T T ATG CATTTTCCAACTATGT ATGCA TCACAAGCTATT C 1 1 1 1 I C I G AGTTATAGCTACA 
GTTTTCCTACTG I I G I CTNCATGCCATTTTCATTTCACATGGTACCTTG 
Sequence 719

CCci ITCGAGCGGCCGCCCGGGCAG<^ACTTNNNTTTTNTMNN1 1 1 N F I KTNKNGGAGAC 
AGGGTCTCGCTCTATCACCTAGACTGGAGTGCACTGGTGC^ 
TCACACCCCACCC TCAAG TGTCAATCCTCCGGCCTGAGTAG 
CACTAAACCCAGCTGT TTAAT ACACCATTTTTAACCCAAAA^ 

ACAGTAAGTAGATTACATrTTGTAAACAGACAAAGCTTACAAA G ITTTC1 C A AATATGAA 

AGTCAT ACT AAACTGGGAGACTGTTAACTTCTTGATGGGGTTAATCTCTAATATGAAGCC 

''^AGTCATAGCTAACTACAAATTAGATATACAATGCCAAAAATKTTCAAAAATAACATTT 

TTTGOOCTTAATGGATTACAAATGCTAACCNACATAAAGACCCTGGGAAAGGGTTCANAA 

TCTKCTCATTACATACTTTCAAAATATCTTNCCTTTACTrTCATGAAATGG^ 

TCTATGTAAGTGATGACNTGNCCG^ 

GGCCCT AACTT AAAATGGG Hill UAAAMCCTTTTCCAAATTNGGGTfnTTGG TTTGGAC 
CCCWTTNAAANCTTTTTTAWCAATTW 1 1 1 N 1 I 1 1 A AACCCCT 

aaaanaaaaaangggccxttgggtaaccccx:tttttggg 

Sequence 720



cccttagcgtggtcgcggccgaggtacttgaagaacatggtaaaaatatgttcacaa
TATTTTATCTTAGAAATGTATTCAGTAAAAAATCTCTTTATTCAACTATCCTCTTGATTC
AGGG GAAAA AAGGATTAGCATGGGAGATAACAGAATAGGAAGTTTAGGAGATAATGAGAC 
TTC TGTTTTAGTAAAGTAAATAAGCTTTAATAGTTTTTTGGTCATGTATTCAGTTTACCA 

gccttgaagatatttgtaggaaattttaaaagtttctctatttcatcccccatgataaaa 

A ^ ATATA G^TAAAAGCTGAATTGAACTTTCTTCACAGCACACTGAAAAATATCTTCTA 

TAGCATTAATCAGATCACAGAATGCATATTTAAACCAAAATTTGACT AAATTATTTTTTA 

ATTATTTAATTTTTTTCTGANACCGGAGTCTGGCTCTTGT(>ICCCAAGCTGGAMTG 

GGCNGGAACTNACTTATTGGAAACCTCCG<XTCCTGGGTCAAGC^ 

^^f^^'lS^^^*^* TGCCTGGGA TOGCAGGOCTGTGCCAJMCCTTCC TGGCC CCAislAG N KCCGG 

OCTTTTGGTTnT^^ 

nil 

Sequence 721 

GCAGTGTGATGGATTCTCANAATTCCCCTTGACGGCCGCCGGGCTGGTACGCGGGGTTAA 
CTATGI ITTCn I AACAGAAAGTTCTGTTTTTGT viATCCTTTTAAAAATAAAGCTTCACG 
GAAGGTATGAGAATAGT ATTTTTGAACTTTAAATT7CTCATTACCAGAAGACCATGTGGT 
AAnCTCTGTATACAGnAGAAGAGCACGGAAACnGAAGGCCTAAAAAAnAGCTGACC 
TTGTT AAAAATGTTGGCGTGAGGAGTATATTATTACCTATCTTTTTTTATTGTGTGTGTG 
TGTGTGTGTGrrrrrAAACTAATTGGCTGAAATATCTGCCTGTTTCCCTCTTT ACATTTTT 
CTTGGTTCTTTCCTTATTTATCTTTGTCCATCTTGGAGATCTACTGTAA^AGTGAATTTT 
TTTAATGGAAAACCAGTTCXX>AGTTTTACTCTCAGTGGGTTTNGGG^ 
nGAGAGGCCAACCAAGGTAAGTCTrCATGTCAGTNGTTTGGTTGAAGGAAACGAGCCT A 
TGAGGGTCAGTTmCCCCAAAANGGAA 
Sequence 722 

NGCCCTTAGCGTWO'CGCGGCCGAGGTACATGAACCTATTAATAAACCAnTCATGCrTCC 

CAGTTTGGCAGATGTGAGCAAACTATGTATAGGAATTCCAAAGGTAACTTTTTCCTnCA 

TTACm ACAGAAATACT GTCAAGTCCAATAGAGAGCACAGACTTGGGAGGCGGATTGGG 

TG GGTTTG AATCTCTGCTCTGCCACTTTTATTAATCATGTGAGTTGAGTATGTGACTTAA 

TCTCI I I I AGCTCAATTTCCCCATCTGTAAAATAGGAATAATAAAAATACT GACT7CAGA 

GAGGnTGTGAGG ATCAA TT AGAC AGTC^TGTTAAGTCTGTAAATTG'rTTCTGTAATGGG 

CAAGATAGCAAATATTTTAGATTnGTGGACCATGCAGTCTTrATCATAACTGCTTAACT 

GCX^TTATAGTGAGAAAGCAGCCACAGACAATATGTAAATGAAAAAGTGTGTCrrCTGTTC 

CAATAAAACTTTATT7TCAAAAACCAGCTGGCTTGNCACATCTGGCCTATGGGCCCATAA 

GTTGGCCCATCTCTAATGTAAAGAAAGGACTTTANCCCAAAGCCACAACn'GCATAGTAA 

TGCCTTAAAAAAATGGTAACATCTTTACTGGTATTAAT AATTACT ACTGCATCT ATTACC 

AGWAGCCAATTGGAGTAATGAATCCATGAATGGTATAATGGTAAAATACTAACCCTTT 

Sequence 723 

GATA TCTG CAGAA TrCGCCCTTAGCGTGGTCGCGGCCCGAGGTACTTACTrrGTTGCTCT 
TTTrCTAAGTTTTAAAGATGGATGCCAATCTCAGGCTTCTTTTCGTGTGTGTATGTGCGT 
ATGTCCATAAATTCTCTTCTA ATTAC AGTGTAAGCCACATCCGACAAGTTTTGATAGT CA 
CAGAACTGT ATCG TCACAC TATTTTTTAATTTCAGT AAGTTCTT CAC TGATC CC TGTGT A 
ATTTAGAAATGTrrCATAAnTCCCTACATTGGAGGGGAAGATAGTTTTGNTTTTATTAT 
T AATTTC 7 AGCT GT ANTTGAGC TCTT G T CAGAAAATATGGTTT ATTTT AAGTC 
Sequence 724 

C CCTTTTN AGCG GC C G TT N NG GGAGGT AC TCCTCAGCTTGTGC TGCCCTTCTCGAATGAC 
TCGCGTTTCCTGCTTTCATCACTACACCTCCCACCGCTCTCCATCACCTGCTCTGCTCTT 
AT AAGGATCCAGAGAAATGGAATAATC TT ATTG C TGATC T A TG TAAACAAGTTGAAGAAT 
CGTCTGAAAGAAAATACAGTGTGTCTAAACTGGAAAAGTCCTGTAATAGTTTGTTCATGA 
GCA TTTGCACAG TGGAGTTACTGTTCATCATG GGGG TAG 
Sequence 725 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAACTATTCCTAAATATTAAACACTGGTCAACT 
AAAATGC ACAuA^TTCATGAATTGG^TTT GCACTC AAACAAAAAAAAAT ACCATAGGC AG T 
ATCAT TTCT ACCTTTGTAAGAGGCAGGAATATTCATTAGACTCTATGCnGACTTTTCAT 
ATGTATTTTAACACTGTAGTAGGCTATCGGGTCTAGTTTAAGCTTCATTTCTAAACTACT 
CAAGAGCTCAGAACTGAC AAAG ATCACAAGAAATCAACTATTAACCTCTTGCCTGAAGAC 
ACAAAT G AAAT ATTCCC T ATTTT A CAAAGCAAATT AG ATTC C AAGA TTTT C CAAAGCCAT
ACT CC TGCJMSTTCACTGGGnTCAAACTTAAAAATCAT

C^TTTCGAGCGGCCCGCCCGGGCAGGTACTCACTTAAATAAATAATTG^ 

TTATCTGACAATTAAAAAAAGGTATATGTGAAAAACCTTAAAAAAAATCTATTTCATTAC 

^^GTTGAAATGTTCTGTGCTTAATC^^ATACATCATTTAAATTCTTTTCACATTTGGACA 

ACAGAAAAACTGAAATCTATGGATTCCAAGCTGCAAAGTATTTTATCTAAATTGCAAATC 

AAAAAAC 

Sequence 727 

GAT ATCTG<^ GAATTC«CCCTTTCGACCGGCCGCCCGG 

TCTAI 1 1 1 1 1 GGATGAAAAAACAGCAGCACAAAGAAGTTCAGTAACT GGCCT AAGGCCAC 
ACAGCTTGTC TTCC TGAAGAC TGVjACCCAAACCCAGGCAGTCATAGAACATGCTGGTCGC 
TATTGGGCCGCTTGC TCT ATGGGGGACGGTGCTC^>.GGAACACAGCAATGCGGTTT AGGA 
TTC CAGGACCTGGGGCAGC TGCTGCTTCTTT CTTAGTT CTCGACAGACCACTGAGTGCAG 
I 1 1 I I CTAAATCTTnCCCCACTTTGATATGTGGTC^ATAAAACTGCTTCCACACGTATA 
ACCC>CTGTGAAGTTTAAAATGATTTCATGTTTGGGCAAATTCCTACTGAATGTTAAGCT 
AGATAGGAAACAAGTTCTGACT AACACAAAATGLMGGGCTGAATGAAGAAGTCNT ACTTT 
TATAAAGGAATTTTN CCCTTCC TCACCAAATC 
Sequence 728 

CCCTTAGCGTGGTCGCGGCCCGAGGTAC 1 1 1 1 1 1 1 1 1 1 1 1 1 I f T I II 1 II I IIGGTAGAG 
ACGGGACCTCACTGTGfTTGCCCAGACTGGTCACAAACTTTTGGGCTCA 
GCCTTGGCGTCCCAAACTGCTGGGATTACAGGGATAAGCCACTGTATAGAGTATGAAAA^ 
TATTTAAAAGAATCTTCCAAAGGAGGACAGCAGAAATGAAAAT^^ 
G>AATCCTTGACACAACTGGTTTTATTCCCAATGCCTCTTAAAAAGAATCGTTCCATGGGT 
GGCAGGAGGGGTGTTTTCATGGTGTGATGCACCGTGACTTGTT ATTN AAGATGTAAGTCC 
AGTGGTCCATCTATCACG TTTT AT ACCTTTCGAAAAAAAAAAAAA 
Sequence 729 

TCTNGATGCATGCTCGAGCGGCCCGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTT 
CGANCGGCCGCCCGGGCAGGTACTTATCAGGATGAAATCAGAATCACAGTTGGCC^ 
CCATAAGGGAAGGGTATTTGGAGAAGAGTCAACX^GCACTGATGCCTCTC^XrTGCCCAG 
CAGCACCnGGATTTrcCTGGCTnATGCCTCXrrGTTrCCCCTGGCTGAGT AACTGCAGG 
CATTAGGrrCCTCTACACACGATATATTACAGGGAAATGGCAGCGATGGTCTGGAAGGGC 
AACACTGiGCCTTCTTTCCTCCTGAGCACTAAAATCCTAAACATGCAACTTAAAAAAAAAT 
TCTAAATGTGAACACCACCTTTCAGT 
Sequence 730



AATTGGTAAGATGATTTTATCTGACAATTAAAAAAAGGTATATGTGAAAAACXrrTAAAAA 
AAATCTATTTCATTACATGTTGAAATGTTCTGTGGTTAATCCAATACATCATTTAAATTC 
TTTTCAC^TTTGGACAACAGAAAAACTGAAATCTATGGATTC^AAGCTGCAAAGTATTrT 
ATCTAAATTGCAAATCAAAAAACATCTATAACATCTTGTTGGGGATACAAAGTTCTCCT G 
GCTG 

Sequence 731 

CCCTTCX^AGCGGCGGCCCGGGCAGGTACTTTTCTGAAGAATACATCTTCGTTCAATGTGG 

TCGTATTCTTAATTTTTTCTATAATATTGCTTGTAATCTTTAGAGTTATGGTTTCATTTT 

TTGACTATTAAATTTGAAATTGTTGACATCAGCAGTTGACTCTTCTGTGTAGATCATAAT 

t t i i 1 aatt aagaagacactctgaagtgttgaactataattgtagagtaaattctaagtg 
gaggatatcgtaaattcttttttgtcttggtattgacatgtaaatgttaacatatgtgaa 
taattcagtccccgattgtcacaggttctatgtctttacctcctttcaaaatactttctt 
t aa c aaat actttg acaaatttatt aaccattt at aagacaag ac ttaccaaggtggtgt 
tcgtttatgaatctttaaatgttttccaatacttaagatacatcaaaattataggacttc 

TCAATTCCATCCTATTGTTACCAGAATATNAAA 
Sequence 732 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTTTCTTTC1 I T 1 I M II M I H I T IGAGATG 
GAGTCTCGCTGTGTTGCCCAGGCCGGAGTGCAGTGGCACAATCTCGGTCACTGCAAACTC 
GGCCTCCTGGGTTCATGCC ATTCTGCCTCAGCG TCC CAAGT AGCTGGGACTACAGGTGCC 
C GC CAC CAAGC CC AG C T AATTTTTTTC TTTTTTTTGTATTTTT AG T AN ATAC GGG G TTT C 
ACCATGTTAGCCAGGATGGTCTTGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCCTNC
CAAAGTGCTG GGAnACAGGCGTGAGCCACCACACCCAGCCTATTCXTTTACTnCTTAA
AC I I I C nTCACTTTACTCT A TGGACT CACCC TGAATTCTTTCCTGCTCAAGATCCAAGA 
ACCCTCTTTTGAGGTCTTGGATCGGGACCCCTTTNCTGTNACACh^KACTG 
GGCAGACATATGAATTTGCACCCCCGCTTGGGTCrrCAATNTCCAGGGGATGA^ 
GAGGNAAACCGAGGGGAAAA 

GCC TTAGCGT6>GTCCCGGCCGAGGTACAAAAC TA TG TGAGAACGTATACTACTTC TCGGC 
CACAAC TACTATTTTTAGATATTCATAAAAT AACC TC TGATT G TGTTTTCACATTGCCGA 
TTCAGTT CT GTCCCAATCTTAT AATTCTGATTAAATGTTC TGGCCTCAAACTAATTTTTA 
AAAGGCCACTAACTCCAAATCTAGGAACAAAACACTCTGTAAAGACTCTGTAACTTGTAT 
AAAATTAACTTGAAAAATTCACTCACTCXXAATAAAACTATGATTTATGTAGCTCATAAGA 
GGGTGAATTTTGAATATTTAC TCTATGAAAAAAGCCTAAGCAATTCAATAAAAACTTGAT 
AACTGCACGTTTAAG TTTGCAGCATCTTGTACCT 
Sequence 734

W3CCCTTTCGNTTrNN«5CCCGGTCAGGTACTTTCTCT 

AAAAGAAAAGATCAAATGCAATAGAT AGOACTGTAAT AGATTTTGCT ACATTAAAAAAAA 

TCCATTTGAATACACAGTGAACTAAAACACCAGAGTGGCTAAAAAGTCCCTICATGGATA 

TTT"ACTTAGCAGAGAGCTCTTGAGAAAGACCCAACCAATAAACCCCAACCAAAGCAAATC 

CAGCTACTTCTCTAGCTGAGAG^GTGGAATGACTCCAAAATATTGTTTCAAGCTCAAAAA 

GCCTAAAACAACTCCACX^ 

Sequence 735

ACAGGCTGGGAATTTCCCAAATCTTACAAG^TCTCGT 

CGGAGTATCTCCGTCCTrrCACACTTTATTGTAAGCG^GGAGAGCAG^ 

TTAACATTTCATTCACAGGATCTGAGCTCAGCCAAGTCX;TCAGCCATTTTCT aatgagga 

TCACTTTCTTCCGGTTCCCCGTGACCTGTCCCTC^ 

CCTTC^uACATCCACTTTTCCACAACATTCTGTCTATGATACCTGC^ 

AGAAGCTTTC TC TCCAAGCTCTTCCCTTTCCTNTCT GAGCCTT CACCCGAGTC 

Sequence 736

CCCTTTCGAGCGGCCGCCCGGGOSGGTACnGTCT 

TCACTGGTGGAAGGGTGCTTGATCCAGCTTTTGCTTTrrCCATGAGGAGGACTCTG 

GAGTTTCCGCTTTT ATTTCCTC TGAGGGGAAAAAAAAGAAGCATACATTANAAAACTGGA 

CvAGOAGAAAGACTGAGTAATTTCTTAAGTTCTATAAACTCATTTGGAACTTCT 

GTTGGAAAGAATGCAAATTTAATAAAAATTAGATGCTAAAATTGTTTCATCTAAATTTTT 

TAATTTACACAAATAACATAAAACTATATGAATAGGTACCTCGGCCCGCGACCACGCTAA 

GGG 

Sequence 737 

NAl I I I I 1 1 1 I I I 1 1 1 1 1 1 I 1 1 NGTrTTGAAAACCCTnATTCGGTTTCTCAGTAACAGT 
GATGCATTATAGAAATTCTTGTCTGC TAAACTTCATAGCAAACCGATCCCAGTCCTCACC 
T^TTGTGTGGTAGCCCAGCAGCAGAC^AAGATAGGAATTTTCTGCCCCCTAGCAATACTG 
nCATCCCATCAGATGGCCGAAATGCCAGTC TGAATCATTTCCTCTGGGTAGATTCNACA 
TTGAGGGTTGATTGGCTGACCTAATGTNTTTTCCAAAAAGGAAAATTTCAAOWjTTGC^ 
CGCATTATTCATGAATGANAATTAGATNTCATATCAAAnAAAAGAAANGAAAAAGCACC 
AGAfKlACCAGAACTACATAAAGCATCTCTTTACTACAAAAAA 
Sequence 738 

CCC TT AGCG TGGTCGCGGCCGAG G T AC T A TC TGCTC TGAATT AAAATTTAGAACAAAAA T 
CACCTG CCGTG CCACTACACATGGACATAATCAACTGCTAAATTATGATTTGTTnCTTC 
CAGTTACTTTTCCAATTATTTTTACATATACAAATATTTTCTT 

GGCACTATTCATTGTGT AGTTTTTTTGT AACTTATATTTTACCCTAAGCATTTTCTCGTT 
GTCTT AAATTATTAATWGAAAATTATTCATGGCTAAATAATGCCT AGGCTGCCATGAGTC 

ttnctccttctataaaccgtgtc^gcattcttttatatatatctttcagcacatctgga 
atgatttctttggaataaaatttctaaagttcgctggatcgaaagaatcagggatt^ 

AGTGTTCTTTCAATTTGGCAAAGTArrrrTTCAGAAACA^ 

AAACAAATTC Mill I ATGGNGCATTTAAAATCTACCTCCTTGTAGCCATATGCNNGGGA 
AAAATGGAATTATTTGGNCAACCATGC TTTCAGA T AC TTGGAAGAATTGGTCCTAATTNC
TTCTTTATGACCTATTCTGNGTTCCTGGGACTNTACATTAATCTTT^CCATGGATATTT
ACCATTGGAAAGGG 
Sequence 739



CTAGCAATGGTCACCCCTACATACnCGGCACAC^TCTTTCAAGAACAGGACACCATTAC 
CACACCCAAGAAAACCAGCATTTAATGAATnAnC^GGAG7>ITCATCCAAC>TACTCAA 
ATTTCCAC^GCTGTTCCGAAAGTATCCTTCAA7TCTGGATCC^TTGATGG^^^CACAGGTT 
GTATTTGGCTGTTACATCTTTTTAGTTGTTATCCTTCAG^ 

TTTCTTTC TTT ACAATATTGAC TCCTTTGAGGAACCGGGGC TGGATGTGGAGCATTC TCC 
ATTCATCTGATTGTTTO>TGTGACCAGATTCGGGGTCAO 

TTCACAGATGACCATGT>ITTGGTTATTAGGTAACAATAGATTCTCAAAGTAGAGAACTGG 
GAAATTGACCTTTGTC^TTCACAAATAGAA I I I I I I II I GAAAATCTAGAATTCCT CAN 
GAATNAATTGATTTCTTTTCTNTTTTCTTTTTI 
Sequence 740 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACAnGTCTGCATTTTGAGATT^ 

CTTT CTGGTG TTGATTTCTG TTT AATTAT AC TG TGATC T ACAAGCAGCACT GTATTATTT 

CCATTCTTTT AAATTTGTTAAGGTGTGTTTTATGCTCAGAATGTGGAGTGGACTATTTTG 

G TGAGTGTTCCATATGGAC TT AGAAGAATG TG TTTTCTGCTGTTG TTAAATGAAGTAG TC 

T ATGTATGTCAATTATTG TTTGATGATTGATGG TGTTGAAATCAGTTATG TCC TCACTGA 

TTTTCTGCCTGCTGGATATGTCCATTTCCAATAAAGGTGTGTTAATCTCTATCTATAATA 

GTGGATTTATCTATTTCTCCCTGCAGTTCTATCAGGTTT^ 

GnCTGTTAAATGCATACACCATTAAGGACTGnAGGTATTCTTGGGGAATTGACCCCTT 
TGGTTTCTATGTAATGCTCTTCTTTATCATTGGATAACTTT CCCTT GCTATAAANGCCTG 
GTCTGNCTGGGAAAAAAANACACAGGTNGNTACNTCTTCCCTT 
Sequence 741 

CCCTTTCGAGCGGCCG<X:CGGGC^GGTACTT(^GG7TAGAGATGACTT CAATATATGTCG 

CAGACCTCCCAAGGTGAGCATCACACAGCACTTATCATAATCCGAAGCAGCTCCACAGAG 

GCTAAGATGAAAACAAAAATCTCAGGAAATTTATGTTTATAAAAATGATACTTGCAAAAA 

AATGAATGGAACCATCTCCATTGCTTATTTAGAGTGnGACTCACTGAATAAGATTTTAA 

AnAGTCAATAGTAnGGATGCCTCTATATCTGCATATCAATAGGCTCATAAACAAGGIT 

GCTCAAAGAACTGCCCATCAACCACTTGGTTTCATCTTTGGACACCACACTGGTTATCTT 



NCTTCTTCCCTNCCCTGGTTTGGTTAGGA 
Sequence 742 

CCCTT AGCGTGGTCGCGGCCGAGGTACAGGTTTCCCnXSCCTCAACnCTCATCCTGGCT 
GATGAGAC TGTT AC TTT C C TTC TTGT AT AAAGAGGGCAAC TTTCATGTAGAAATTTT ACC 
TCCTACTTTT A AGAAAA AGGAAAATCAGAGTGCTTTAAAGGAAAATCAGAGTGCTTTTCT 
TGCATCTGCTATTTTTCAAGTGTCTTTAACTCAAAAAAATCAATATGCCAAAGTGGCATG 
TTTGGGGGTATCTGGTTCTGAATTCCTTCAGGAAAGATAGAAAGCAAAAGCAAAATAATA 
GGTTTAAAACT AAAAATATCCAGGTGCGGTGGCTCACGCCTATAATCCCAG 
Sequence 743

CCCTTTCGAGCGG CCG C CCG GG CAGG T AC TCC TCCTTGG CAGCATCAATCAGGCAGGGC T 

CAGCCCACACCCGGCTCCTAAAGACAAGAGAGCAGAGAAAGCAGAATGGTGTTTAGAGAC 

CATCGCAG TGACCT GATCCTGAAAGCACCTGT AGGAAATTGGCCTCCGCCAAG TGAATGT 

GACAATGCAGTCAGCCACAGTGACGGAGTGCAAGATCGGATCACCACACAGATCCAAGAG 

ACCX^TCACCaCACCTGAGAAACAAGAACCCAAGACAGCCTCATGGA 

T ACGCAGTT ATGGC TTCACT ACTGAATGCGATCTTGCAAAAG 

Sequence 744 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTG TI ll l ll M GGGTAATTTTCTTGAGT 

TAGAAATGTAGTTAGAACTGTGACTAACGGCAnGCCTGGAATGTGCTACAAACACGATT 

AGATATTCATTT ATC TTCCTCGTATT AGACTGCTTGTAAGAGACT CAGTGTTT AGACATT 

CATTTCTCTTCCnGTATAAGACTCCnGTATAAGACTCGGTGTTCATTTATC 

^^^^^^^CA^TATATGAGTTTTTAACCATTGCAATGTGCAATAAATAAATATAT 

CTGAAGT AGCATT AGCCTTC TAGTTTTAAATAATAA 

Sequence 745 

CCCTT AGCGTGGTCGCGGCCGAGGTACC H I III I II 1 1 I I 1 1 II 1 1 1 1 CGTCAAAGTC A
CTATTTGGGCCCTAACATAATCCTGCTCAGAGCGACGGAAAAAAGGCAAGCCTTTTCAAA
CATAAC TCTCTCT ACAAGCCAGCTATTATGGCAAGGGAAAAAAGAAAGCATCTAGATAAA 
TATCTATCAAAATTAACTTTAAGAGAAATACTCTCTTTCCTTAAAAGCCCTTATT1TTTA 
AGACACTAGAAAATAAGTTACT ATAAAAAG TGGTGG TC TGGGGGCT AAAAACAAAACAAA 
AAAAATCCTCTTTT CTACATTTTTTAGTTTTCT 



TTACCTt^VATGGTTGAGATATTTTGAATGGATTTTTCAAGGGGGGGAAATCCTTATTATA 

ATAATAAAOCAAAATACTTAACAGAAAATTGTCAGCTATTCTGACAA^AATAAACATTTT 

GACACACTTTATTTCTTTTGTCCGTTTCTGTGGTATCACTCATTGTCGTTAAGTAAGTAA 

AGCTTITTATATTTAGGTAAGAACTGATTrTATTTTTTAAATTATATTTTATATTTATTA 

GCACAGAAGAATAATGAGAGCCACATTTTAGTTCAACT 

Sequence 747 

CCC 1 1 1 CGAGCGGCCGCCCGGGCAGGTACTC 1 1 1 1 GTTTAGGT ATTTCCCTOCTGCTGTG 
TCCAGGATTGCTGTGTGGTGGTGATGAGTGCTGGGAGGTGAAAAATTAAAATAAGCCATT 
TACCAGTCAGCATCCCAATTAAATATTTGATGTAACTGTGATCTTTGAGCCAGGCTTATA 
TATTCATTTTCAAO<^GAGGAGTTCCCCATTTTAAATAGAGGCA 

TQGTTAACTCCATCTGGCTTGGGT l l I I CIG I 1 MC C1 1 I CM I G CTGAATTAGAACGGG 

TTACTCTGAAGAGTCCAGGTCTTACAGTGTGGTT 

Sequence 748

CCCTTGAGCGGCCGtX XaXSG CAGGTACI M I IMMI III ir iil l ll MMr niTin 
NCATNCAANAAANATAATmACNCTTATTNTTTGAAANANAAATTf^ 
TCTn7TAATTNAATNCCANAATACNTTCThn>tANCCCTATGCCCTFMTACTAffT ANCTTG 
ATGGTTAGCGGGTAAGTAGGTAGTAGT AMAANANCANAANGGGAAATTNGGGGAGCAAAA 
ANGGGANAAAAAMAAAAAA 
Sequence 749

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACTCACT ACATTACAAAATAGTCTCTAACA 

TAAAATTGCCTTAATAACTATACTATTATAGAATCTGATAAACCTTACATTATTAAATTG 

A TTATA AAATCTTCTTGGAAAAACTTTGGTATGTATCTTCAGAAGGTrrnTAAAAATAA 

TATTTTAAGGGCCTGTAAAGATTCCATTCT ATTAAAGCACAGMAGAATAAGTAATGGATA 

TTCAACTGCATACAGAATATATAATCAAAAAAACAATTTATTATTGTATTTGTAGAAAAT 

CATTACCAGAGTAAGCAAAAA 

Sequence 750

CCCTT AGCGTGGTCGCCGCCGAGGTACATTTGATTGTGGCATATTCAACT ATGATTTTAG 
ACAAGATGTGTGTGTGTGTGTGTGTGTGTGTAGACAAAATAAAATTGAGAAAjGAGAAAAT 
CTATTCTACAATGAAATTCAATCTCTTACTTAGCTATTTTGAAATTGTGTCCCAATaCC^ 

cattaac agagcca aaatgaaatttaaaattatggttatactattattcacactaggtag 
ggtcaggtt TT TT t gnctgaattaaatggctcctttacnctagctacttangaaccactt 
nccataccctnaagctagagtaata 
Sequence 751 

cccttagcgtggtcgcggccgaggtacatttgattgtggcatattcaac 
acaagatgt g tgtgtg tgtgtgtg tgtgtgt agacaaaataaaattcagaaagagaaaat 
ctattctacaatgaaattcaatctcttacttagctattttgaaattgtgtcccaatacca 
cattaacagagccaaaatgaaattt aaaattatggttatactattattcacact aggtag 
GGTCAGGI Mill I gtctgaattaaatggctcctttacgctagctacttaggaaccactt 
cccataccctcaagctagagtaatagatacctgacc 

Sequence 752 

CCCTT AGCGTGGTGGCGGCCGAGGT AC 1 1 II 1 1 1 H I H I 1 1 1 II I I U I I GGGAGCCAT 
G GCAATC TTTT T ACACTTGATTTTAGCCAAAAGGCCAAGAAGCAATGAAAGCCATGAT AA 
TC TTTTT ATGCAATGTTATCANGT AAAAAAATGGC T AAAGT ATATT AGCATTT ACCCGAG 
TGGTATTCTTTTATAGAACTC^GCTACTAAAACC^GGGAGAGTACTTGGTGTATTTCTGA 
AACACTCTGCGAAGTTGTGGAT AGCTTC T GG TGG T AAGGATGGT A TTGAACACG TTT ACG 

Sequence 753 

ATAAAATAATAATGCATCTTACAGGGGaS 



TGACATATrTTTCCCATCTTCTTATTTC^ACCATTTGACTGGTTGTGCAG 

TTGCACTTTTTTAAACAATTCACACTGACTGGCAGTCTTCACC^ 

CATCXCTTTAAAATCATTTAAAAACATGATTTTTAAATTT^ 

NGTTTAC 1 1 1 f IT ACTTTTATTTATTTCCT 

Sequence 754 

ccctt agcgtggtcgcggccgaggtac m m 1 1 iii m i ii f t1 1 ggtggggagctgtat 
^^^"^cccagggctgtc^aaacaaatatccataaattgggtggattagaacaacaaam 
tt^attntctctanagaanaacgtittcttgccactccctggct 

AGTCCTTGTCXrrTCCCTGACTAGTANCTACATCATTCTCATTTC 
TGGCTGTCArrrCACTGNGTGCTTGTCTCTGGGTCTTCAAG 
CTGGTCATTGGATGTAGGGCCTACCCCAATC 
Sequence 755

CCCTTAGCGTGGTCGCGGCCGAGGTACATGTTGGAAGGGTTTTTAAAATC 
GTGCACAGGCCAAACCCAAc 1 ? i CAGGACATGGG 1 1 1 1 CAAl. H C I t>GATGGTATGATGG 
GG T GA T AGT AGGGT AT AAAAGTATCCTGAGAAGTTGAAAGCAGTG TGTGAATGGGC TGTT 
CI 1 1 1 C I CCCCACAATCXTTTCCCATCTGCTGACAGTAGACTTAGCACCTCACAGATGCT 
TGGGCCTGGAAATGAAGCCATGAAAATGAAGCC^CAGCXTTCTTG 
GGTCCTCACCCACAGCACATGGG 
Sequence 756 

CCCTTAGCGTGGTGGCGGCCGAGGTACACAAAATATTAAATAGGATATTTATTTCTAAGC 
CAAATTTCAGAAAACAATTTACAAAC1 1 1 1 1 1 1 AAAGTATAAACATAGTGTATGCTTACT 
ATA AAAGGA AAAGT ATAAAACATTACTCAAGTATATATAGAAAATGAGTGGGCTGCTGAT 
CCCCCTCT ATATT ATCTATTGCTGTGTGACAGT ATTACCACAAATACAGTAGCTGAAACA 
ACACATTTGTTTTCTCACAG I ITCTG1 GGGTGAGGAGTTCAAGCATAGCTTGGTCCTCTG 
CAAGCTT ACAATCCAAGGGTTG 
Sequence 757

CCCTTAGCGTGGTCGCGGCCGAGGT A C I IC I 1 1 1 1 I » I I 1 1 I I I II I H I AAATQAGTAG 
GAAGAGATGGT ATCACAAACACAAAGCACAGGTTACTGTCTTT aaaaatttgcgttcttc 
TATTCT CCAATGGAAGTGGGAACAAAGAGAAAACCCCTGTGTGTCCT AGCACAATATGGG 
CATTTGTGTGGATTTAATAAATGGGCATTTGGATTGTTGGGAAAATGTG^ 
GGCTATAGAAACACAGTTrGATACGATGGTGAAAACTTGTCTACAATG A I G M ill ICAG 
AAATGTTGGTGTGATTAGAACAAGTCAGCAATGATGATGACAAAATATTTACATAATGTT 
ATAGA TGTGG CTTGCTAATGGAAATACCTATCTGAGGCTGTTTAGGAATACACAAATTGA 
GAACCGTTTAGTTCAAGTTTGCTTTAAAACAGTGGTT1 I C I GAACCCTTTTT AT G TT CCG 
NGACCT ATGATTAGNAACCATC TT ACCATTTT ANAATCAC TGCTTT AAAAAGTNGTNTCC 
GTACCTGCCCGGGC 
Sequence 758

CCCTTAGCGTGGTCGCGGCCGAGGT AC TTGTTTTAAAACAATGTT GGAAATGAGGAAAAT 

GAGCAATATCAACA7TTTATCXTGAGGGACAGGGAGTAGAAAACAAGCCAGAGGCTGCTA 

GTTACATAGTTCAGTCTTAGGGATGAAGGGATTTATGTCTCTCCTCCCTCAGGTACGCGG 

GGACT ACACTGG T G TCTGACTTTTTTC C T AGAGATTTC TCCC TGAAAAATACAAGGGCTG 

TTGGTGAGAGCAGACTTGAGGT GAT AAT AGTTGGCCTCT GGTC T ACAAAGATTTCATAAC 

TCCTTGGAAAGCTTC 

Sequence 759 

ccctttcgagcggccgcccgggcaggtactccgattgcc tctcccatgcttctctgcttt 

ccaaagaaaaaactgaccttgtatagatcctgtcagctgattgcagtgctcttaacttct 

ccattgtcagttgttcagtctgaggagttaggtataaacccagagtggtattctcttttc 

TGTTGTGTTTGGTTTTGCTTACATATTCAGGAGCTGCTCTT^^ 

TATATG t lllll jctgtttctagatttaaaaatattccagaagcctggcctcaagataga 

TAATATTTTACTTTT 

Sequence 760 

CCCTT AGCGTGGTCGCGGCCGAGGTAC I M I I N I I 11 I I H 1 1 H I I M I T A AAAAAAT 
ATCCTTNAThiAGGNAAAAATTTTNNTTTNAATTAACNGGAAAGTTT^ 
TGTTAAATNGATTTNAATGCTNTTTTTGNATTNGTN^^^ 
NCGNG/4AATTGGGTNNTTTAATNGGGN G1 Ullll I AAA 
Sequence 761
CCCTTAGCGTGGTCGCGGCCGAGGT ACAGATATAAAAAGGCTACTATTCC^

TCCTGGAAACAAATGTCTATCAAGAAAGCAAAGATAATCTAAACAGCAGCATATTCATAG 

GAT GACAAA CTATTCAACCATTATAAAGAAAACCGAATCAAAAGCACTGGCTTATTAGAC 

AAGAGTTTCCCAAACTATCATGCTAAAACAGTAACAGCGAGCTTCCAA^ 

1 1 1 I I 1 1 H 1 1 1 1 1 1 CCAAACTGAAAGGAGGGTGGGGAAAACAAACGCATCATATGTAAA 

GCACTGAGTCCAGCCT 



CCCTTCGGCCGCCCGGGCAGGTACGCGGGTATGGTTTTACG 

AAATTATCATGGTrCTAATCrrACATGTTAAC A l I I C C 1 1 GTTATGT AGGGATCAGACTT 

GTTATAACATAATTCCACTTTATAATTCAATGAAGAAGAAAGTTTTGTCT 

TATG TAATA TTTCATTATTATTACCATATTGATATTCTCTATATAAAAAAATTTACATAT 

TCTAGTTTTCAGGTAAAAGCTGTTGTGAACATTATTTTTTGTCTAGTGT 

AAAAAAAAAACAACTG 

Sequence 763

CCxrTTAGCGTGGTCGCGGCCGAGGTACGCCTAAGGGANG><KNGGAAC TCATKAAAGAGAC 
AAAAMSTGCffTTTTTCinTNNAAGCCATTjCTGTGG T GG 1 I UGGCGCAATAAAATAGTTGG 
GGCCCCCGANTGOCAMTGACI IUCII INTKGTNGGMAACNAAATGGCCCATCANGTTGGA 
CNCAOCTQWOCAWTTCACAAAOACCTTGWCCCCATTC 
AAATAAAAAAGTGTTGACCACTTXXrrTGGAT^ 

CCTGGAAA A I I GGI 1 1 I AAAAGGGGGGGMAGWTNGGGATCCAAAACCCTGGCCGGCGAAA 
ATAAGAT ACAATCCGTANCTTGTTHGGGAAAfiTTCAAATTTTAAl ' I G 1 1 CCCCCCAAGWA 
TTNGAATTANNAAAAAAAACCCC^AATTTGGGGGAAGGRAAAAAAANGT 
Sequence 764

CGCCACTGTGATGGGATATTJTGCACAAATTCCCCCTTAGCCG 
CGCtSGGATTCATTTGAJGTGGGAATCTCAAAGCA 

TTCATTAAGGGATTAAAAATGTATAAGGCCAGCACCGTGTAAACCTTCG 

ATTTTXXTCGAAAWCCCATAATTGGTAGGTOATGTC 

AGGGGGGGT AAAGTrNGAATTCCCCTTGGGGCKAAGTTCC AACCCCAfn 

MAACWTTTTNGTTTTNhlAACC I I IA ANGNC C I I lll l l I GAAATCCCAAAAAA 

AAAAATTCWTTTTAAC C I I U ltl I AAATAAAGGGGGAAGGCCAAG 1 1 1 1 I I 1 1 ICAAAA 
ACTTCC CCTT A AAAAAAATTX5KTTHGGAAAATT AAWTAAAATTTAAGGTTCOWGGWTTT 
AAAAAAATTTTCCACCCCAAGGCCC TTACCC>*CCAAWGGGGNAAAATT AACCAAGGGGGA 
ACCI I III MKGAA 
Sequence 765 

CCCTTAGCGTGGTCGCGGCCGAGGT ACAGAAGCAA I G tl 1 1 1 1 GAAAGTTTTCT ATCTGT 
GGMTTG II GAATCCACAGATGCAGAACTCATGGAAACAGTGCCCACTGT ATGTCACAATT 
TCAGAAAATCAGTATTTCATACAATCANGCT AA T AGCXTT AATTTGTTGAGCACAAGAAAA 
ATACACTGAACCAATTCTGATTATTGCANGAX3AAATGATTGGCA 
GAATGAAGGGCGGAHAGAATTTCACATGGATTCAATATAirrcrrCC^ 



Sequence 766

CCCTTAGC GTGGTCGCGGCCGAGGT ACAGAAGCAATG HUH GAAAGTTTTCT ATCTGT 

GGT1 1 G rf GAATCCACAGATGCAGAACTCATGGAAACAGTGCCCACTGT ATGTCACAATT 

TCAGAAAATCAGT ATTTCAT ACAATCAGCTAAT AGCCT AATTTGTTGAGCACAGAAAAA T 

ACA^TGAACCAATTCTGATTATTGCAGAGAAATGATTGGGCAGGATATTGGGAAATAC^A 

TGA AGGGCGG AAAfMXATTTCACATGGATTCAGTATACTCTCCGT 

TTGATC1 1 I I IGTGGTTTAATGCCTTAATTTATTG<SGGCCCCTCTCATANGTTTGGGGG 

Sequence 767

CCCTT AGCGTGGTCGCGGCCGAGGTACAATCAAAGGAGTCTAATGGAACCAAGTAGC AAT 
GTTCCCG AAAAC AAACAAACAAAAAACCCCAAACATTTTG C I G 1 1 IC1 1 rOOCC TCTGTA 
TTTGCT AACTTT ATCATGAC TTTATTCTTAAAGCC TATCACTGGTCTGCTTTT ATT AAT A 
GATTAGTGGAAATTTTCACCTGGCCT ATT AGCaCCTT ATAAAGAAAT AGATT AAGAGT AG 
GAAATATATAGATGAAGATGTACTGTATAGAAGTTGTGTAAAATCAGTATGAAAGTTCAA 
TGTTGCrGTTCTTGCTCAGTGGATTTTAAAGAAATTGAGTAGTTCCTATGTGGATTTTTT 
TTTTTTCTTTTCTAAACTG 
Sequence 768
ccc tttcgagcggccgcccggg^aggtacatatacattatgtaatnnanaagcgtgcatg

GG^^TGAAAAAAAAATTTTTNNTNTATAATC^JNGNTAC^ATATATACAATAAAACACCTA 
AAACGGAGAGGCTTGCCTTGTTTNTCCACAAATANGTTAAATACCCA^ 
ATGGATTGGTGGTTATGGTAGGAACACCAAGACNAAAAAGCCAGGCCGGG^ 
TAATTNNGGGCCAGTACCACCACNATATAAAGGCCACCAACCAAAAAAAG 
CCAAMAAAWAAGNCAACCGCCCCAAGTTWAAATWGT I HUI I GGGGAATTGNCCCAGTTA 
NTTCCAAAANGGAATTTTTGGTNCCCANTTANTT AAGGAACCAATTTAAATAATTCCCOC 
AiQGTTTAWQQAACWACCITWGTTNAAATTAAGGI 1 1 1 1 1 1 1 1 1 I G GGGTrNACCCCTTTC 
CGGGGCHOOGOCWGHAACOOCAWMOCGTICCIWTAA^AGGMSGHGGOCOOAAAAAT 
Sequence 769

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTT ATTTTTTTACTAAGGTTTTGTTTTGGAGA 

CTTGTTTGAAATAAAGTGATCCTT^TTCAGGATTTAGAAACAAAAGTTATACTC^ 

C TAGGGATT AGGAAGGCT AATG TGAAC TGATCAAAAGT ATGAATT ATGGAATGCC TTT AG 

AATAATCAACTTTTAGGTAATTTGATACTGCTATAATTTCAAGC TTAGAGAAAAGTTGT A 

AGAATGGCATAAGGAACTCCTATATATCC . FTATCTAGATTCACT AAATGTTCATTTTGT 

GCCAI 1 1 G I G I IATTCTTTGTCTCATCCTAGCCCAGTCAGCCTAACACCACCCAGGGGAT 

AAACCAGTAGTCTGATA 

Sequence 770

GATATCTGCAGAATTCT5CCCTTTCGAGCGGCCCGCCCGGGCAGG 
CTTTTCAACACTTCCTGGCAGGCAGGCAGCATAAC^ 

CACTCTGCAACICI I ICI rCTGAGCCAGGCTCCCCTA CTOTCI 1 1 1 CATTTATGTCAACG 

CAGGGGAAGACCTCAAAGGGCTCTTGCATCCCAGTCTCACTTCCCAAGAGAGG 

CCCTCCAGGATGTGGGGACAGGAACTTTGGCGCAAGCCCGGGGCT 

AGGAGGGCTAAATAGTAGAAAGGAAAAGTCTTATTG<rrGATATGTTTGCAAACTG<iGA^ 

AAGATAGCCrrCCAGTGTGGAGCAAAGATGCIUCI ICI I CAAAGAGGGCAAGGGCAGCTTG 

GATTTTGTGCCTTACANGGTCNGTATTATATAATAGAGTCATGCATATTCAIITA 

GGCGAAAAGCTATATATATTTATGAAGGOGAGCCAACTACATGGGCAATG^ 

CATGTAACACATCXATGTTCACTTT AGGGGCA 

Sequence 771

GGATATCTGCAGAATTCGCCCTT AGCGTGGTGGCGGCCGAGGT ACAAATAAAGTA 
AGGGNGGNAGAATT<GAAAANGAMGNCTN NCANCTTGAfTN NCHTTTGGGAAATTGGGAT AT 
CCTTTGGGGAAATGTAGTAATCAGTATATTCTGGGNAAAACATTAGTTAGAAGAATTGAA 
NTAAATAAAATTTCCATTGAATTTGGAATATGTTGTCCATTCTC^TGTAAACTAATGCT 
ATCAANGAT AAAGTANGAAATACCACATTTCAGHAAACAAGC TTGGAAGTAGKACAAGGT 
CCTTCATTAGNGCCNTAGCCTTGGMAAACCCTTAATAAMCCTATKTA 
ANTTTTTAAATTTATMACTCCTGG 
Sequence 772

TGCAGAATTCGCCCTTAGCGT GGTCG CGGCCGAGGT ACCACCAAT AATGAGGCCCACATT 

GTGTATGCT AAAAAAAAAGTGMTTTTKN TTN T I CC 1 1 GGGCC T ACAAGAACATGTTTCT G 

TCCXSCTAAGGAfiAAANTWAAGAAAAACAATGGCCCCC^^ 

AMCXn TAAAACtn CACAfiGCOANGTTGNAATTTTAA^ 

NNCA NI I III 1 ICCCC C CCCAAA M AACCCAACMCCCCATTTACC^ 

I I r I CCKT T GGAA TTNAAIXGC CWAC C TT CC TTTT AAANAAGG^iA^iAAGGCCT^<NACC><A 
AGCCTTrCTTmCCCCCCCAATTmC«C^ 

GGGGGAAACCCCCCACCC 1 1 IGCGCCWTTTTTTGGWGGGTTCCCAAGGGGGAAAAAAACC 
AAGGGGCCWATTTAhlCCNAAAACCCAA TTCC CAKGGANATTGGTTT 
AAAAAATTWGGGGGCCCCWACCCATAA I 1 1 ICI ITAAAAAAAANGGTAAAA 
Sequence 773

CCCTTAGCGTGGTCGCGGCCGAGGT ACT ATCATCCCCCAAGGCCTTTTACAGTCTG^ 
A TCAAAATTGAAAGCAAAAAT AGGATGACCAAAGGACTACT A TTTHACTCTCTTTTCAGN 
AACWTCIffACAATATGTATG AAAACC TAAAATATCCACTWTATGGGATCATCAlWGGGGG 
GAAWKTAAuWTGTTGCCC KTOt r 1 1 KGKAAANGGGGCATTCANGATGTATTTGGGATGTN 
<><CANGGNCCTG«GGCANTTTT ATNTCAAGGATG^O^AGGGG^^r^WCATTAACCTGAACCA 
AGTGGAMTGACAMSNGTCTTCNCMTTATAAATACCAAKGGGGCCC^ 

cang ccacccc aattggaacc TT ATG GGGG GGCC ttnggccnttttttanaaaamaacca 

AAAAA 1 1 1 1 1 1 1 CTTAAAGGGGGAACCTTTTACCCGGNCC TTCCTTN I T TT GGGGGG 
Sequence 774

CCCmCGAGCGGCCGNCCGGGCAGGTACATATACATTATGTAATTAAAAAGCGTGCATG 
TGTATGTATTAAAAATAATGGTATATAAAACAAAATTACAATTATaTACCAAATAAAAAC 

cacmctaaacgccannagggcatgcttgtttattcccaccatattagnt aataacccaaa 
tagataattaaantggaattgggtg 

Sequence 775 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I II 1 1 1 1 1 II » 1 1 I I 1 1 1 1 1 G AGAGGGGTCATC 
CTCCAAT CAT7A ACTACTTCTAATCTTCACTGCTACACAGAAGTTTCCAATATTTAGCAA 
CAGATGGCTTTGCTTTTACCTTATAGATGAGGCCAAAGCACCAGGTAGGTGGAAGGTTCT 
TGTATCGGTTCGAACCCCNACAGCGCGCCAAC 

CCCTTAGCGTGGTCGCGGCCGAGGTA C lllllllllll lin il TT Mil f 1 GGNCTGCC 
GTGGAGAGGATG GATGG GAGGGGGAAGAACGAGAGCTTTGTTTAGAGGCTGCTGTAWTAA 
TCCAGGTAAAGGCTTTTAATCATGTCCTGAACAATGATCAGCAATGGCAATGGANATGAC 
AGAACANAATTrAANAAGGAATAAAAAAGGCTTSCTGACTACTTGGATGTGGGTGANG 
Sequence 777

CCCTTA GCGTG GTCGCGGCCGAGGTACTGCAAGCCAAATGCAATGAACAAACCAAGGTTA 
TTGATAATTTTACATCACAGCTCAAGGCTACTGAAGAAAAGCTCTTTGGATC7TGNATGC 
^CTTCGGGAAAGCCAAGTTTTCCGTAAGGGTAAAATCGGNAAANTGAAAGNAAAACCTTT 
AAGACCAGNCAGCTTTGAAGGTCAGCCTTGAGTAANAGAGNAATTTAATACCAATTTTAA 
G^GGAATTrGGAANAAANGAAAATGGCCTTGAAAfUSGGTTAGGCCAAAGGGGCCTTAGG 
GTTAAGTTCNCTrrAAGOCC AAGGA AAGGAAGGCCTTNCCCATGGGGGGGGGAAGH<VAAG 
NAHGNCCTTNAAAAAG'GCCCTITTAACCCTTAAAACCC CI 1 1 1 1 I C AAGGGGGAAAAAAA 
^^T ^T^GGAAA GGTTWGNAAAGGGTCCCAHGGTTTCCANAAGGTTWGGAAAAAAGTAA 
AGGAACC IT 1 1 rTGGGGGATAAAAAAAAGGGAAACCCTTTCCCAAGTAN I 1 I M I MG GG 
AAAAAAGG 
Sequence 778



CCCTTAGCGTGGTCGCGGCCGAGGTACTGGTTATCAGGATAATACT AGCTTCACAGAAGA 
AGCT GGGAAGTATTCCCTCCTCTTCTATTTTTTTGGGAGGACT ATGTGAAGAACTGGTNT 
TAATAAAAACTCCTTATTAAGGAAATTTTTT AACATACC AAAAAA TAG T AAGAATAGT AT 
CATGAGTTCCTGTGTTGTATTCCCGCCTAACTTCAATAATTATCAATAGTCCACCATTCT 
TATTTTACTTATACTTCCCCTCCCCAACACCTTACTCTTTTGGCGGGGGCTGAAATTATT 
TTAAAGTAAATCC CAAGACAT ATCATTCACCTTTAAATACTTCAAATGTATATCTTCTAA 
CAGGATAAAGGAC 1 I I 1 1 1 1 I 
Sequence 779 

CCCTT AGCG TGGTCGCGGCCGAGGTAC TACGAAGCTGCAG ATCATT AC GCTGATAT GAAT 

GACTGCTTGAAAGAACAATGACTCTGGCACAGCCACTGCTTTrCACCCAGGAAAGCAGTT 

TTTCACAGAATGGCTTTGATTTATACTTTGCAC^CCATTGAGAGAATAAAAAGAAAATCT 

AAAAGTTAGTCTTAGAGCATACAAACATTCTATATACTATTTCATCAACTTTATGTGATA 

ATGATATATAATTTATATATACTGAAATTATTTTCAGGATCCACTTACTGTGCTTAAACC 

CGAAAGTGAATGATTAAAGAGGCAATGGAATTATCTAATGTATCTTTTATAAATTAAGAA 

ATCAA 

Sequence 780

CCCTTTCGAGCGGCCGCCCG GGCAGGT AGAGACAGTGTGATGGATGATGCTGCTGGTTGT 
AAA TTTCATCG TGTGTGTCTAA I Tilt 1 nCCTGTATGAATGGGGTAAAAACAAAACANN 
AAC Mil IT TTAGGAAGATTGTAATTTTGC^GTCATGTTTTTNGTAGGNAATGAGGGGN 
ACTCGTTTGNAGTCTTCACCTAAChK^TCCCTGNGNAGTTTfn"GAAGTTTTGGAAAGNCC 

attgaaannattgtgttgccccccaatg»«:ccttggaccngccttnacagtccx;ncnctt 

NNGG ATTC TTGC AAGC GTTGTC 
Sequence 781 

c cctta gcgtggtcgcggccgaggtac i i i ii 1 1 1 i 1 1 i hi i 1 1 1 1 1 1 1 g gcggatgag 
tcttttaatagaaaaacacacgtgcaacagtatcaanacacattttttngcaatcctgac 

AGCAGCTGAAC TTCAGTTCTTCACCTTGGGGGGTGGCCTGT ACATATCAAAATCTATCAA 
ATTGGACCCTCAACTATGCAI I I I ICTG NGTGCAAGTTATATC tcaatt acaaacaaaca 
AAAACACAAAACCCTATGGTTAAC CCAAAACCTAAAC TATNACCAAGAAATATCAATTGG 
GGTTATGGCATGACCATCCTCCCCAAGAAAATAAAATGCTTGACAGATTCTGAGCGGGA 
Sequence 782

CCCTTTOBAGCGGCCGCCCGGGCAGGTACAAATAAATGAGTTrGCAGTGA^ 
CAAATTACCT(^AGTGACAGATAGTAAGAAAAGCTT>rrTTGAGCAGGTGGAGGTCACTGA 
ATCCCCTACT ATGCACTTATCAAGATTrrACTTACTTTAATTTACTGGAAATTGATTlTT 
T AAAAAATG ACTACACTGTAACAAGGGAAGGGATCTGG G 1 1 1 1 1 1 Hal I GTTTTATTCTT 

g ttttttttaag t agttcaaa ttctgaaac tgtgatttaaaaattttttacag tcaagca 
ttctgattttgaacataactcccttccctttctgtgtaacaaaggtctctctgttatctc 
ttaaattt 

Sequence 783

(XXTTAG<X5TGGTCGCGGCCGAGGTACTCTTC^CTGTC7TTGCX>TGAAACTTrATAACA 
TGGCTCTCCAGGTGTTGAATCTGGTGCCCTCTCACCCTGTGCTCAGGGAACACATGGCGG 
CAATCAGC^TGTGAGGCGCAGAGGGAGGGCAAGCTCCCCTTGTGATATTTGAGGTATCAG 
CTGACTCAAGTCTCTCTCCCTTCTCTCCTTATTCTCATGCTACCTNTCCCAACCATTGTC 

TTAACTTCCCTGGCCAGGATGCCT gccatattaaatggagaggaggcagtttctaaatgg 

CTTGACTTTGGTTGAAGTCTCAACTCAGGAAGCTCTGAAATTAATCCACCC 
Sequence 784

CCC 1 1 lCGAGCGGCCXiCCCGGG<^GGTACTACTCGATTGTCAACGTCAAGGAGTCGCAGG 
TCGCCTGGTTCTAGGAATAATGGGGGAAGTATGTAGGAAGTTGAAGATTAGTCCGCCGTA 
TTTCGGTGTACCCCTGGGAGGTGCCAGTCATTGAATAGATAAGGCTGTGCCTACAGGACT 
TCTCTTTAGT(>NGGCATGCTTTATTAGTG^ 



Sequence 785

CCCTT AGCGTGGTCGCGGCCGAGGTACAAGAGGATATGTGTGCATTACATGCAACCACTA 

C^CCATTTAATATCTGGGGTGTGAGTATCCGTGGGTTTTTGGNATCCGTGGGGGTCCTGG 

AACCAATTTCTCCTGGATACTGAGGGATGACTGGA7TACTGTGTGTTTGTGTGCTTGTTT 

TTAAGCTT(^AAAAGATTATGTGATCTAGGAGTTGTTAGATTTTATTATTGGTCTTAAAAG 

ATAAGCTTANATGTTGTTACTTTTTTGGAGTTTTTAGTTTACAGT^ 

GCAGCTTCANAC CACAGGAGACATNAAGCAGG TTT NAATTTTCAAI<K3AAAGGGNTTT ADA 

AGGCAAAAATATTTTGATTTGGTTTAGA 

Sequence 786 

TGAATTCGCCCTTAGCGTGGTCGCGGCCGAGGTACTAAAACTAAAACTGAGCAGTTTAAA 

ACATTCATTTAAAGGGATATCTAATGTGTTTATTATTAACATAAATAATGTTTTTATGAA 

AAATGTAACCTTNGTrrTCCAAAACAAAAATGTTTAGGGCAAGAGTAAC^ 

TTATTGCATCT CAGTTGAAA AATAAATGGCAACAAAAATTCTTATATCTGCTTCTGCAGT 

TAATCTGhfTCATTTTG [III GGTTGAANTATATTGAAGGAAATCTGTTCCTCCACACAGT 

TTGTGTAGTGGGAAAAAGGGGGGAC 

Sequence 787

CCCTTTGAGCGGCCGCCCGGGCAGGTACGCGGGATTCCTGGTTAAGCAGGCATTGCTTTG 
CCCTGGAGCAGCTATTTTAAGCCATCTCANATTCTGTCTAAAGGGG I MM T1GGGAAGA 
CGI_m 1 1 CTTTATCGCCCTGAGAAAGGATCTACCCCCAGAGGGAGNAATCTGTAGNACAT 
TCTTTGCCT A CTT^^T ACTTTTATTTAGGCTNTTCTTCCCTNCAATTTCAATTTTCCTGT 
ATT ACCACCC TTTTTTCC CC Mill T TTGGGGGGGGAAGA 
Sequence 788

CCCTT agcgtggtcgcggccgaggtacctgcaggcctcctacacctacctctctctgggc 

TThrTATTTCGACCGCGATGATGTTGGCATCTGGAAGGCGGGAGCCACTTCCTTCCGTGAA 

ACTTGGCC gtagggagtaagtcgccgagg gtctncnagncgttctttncttgaaggatgc 

ANAANACC CATGGCGTrGNGCGGACCGCGCI^CTTCTTTCCATNGGAACATTCAAAGGNN 
AGNCNCAAGTTTTG^TAGTAfn"GTAANTTTGGGNGGGTTAAAAAACCTNCCCAANGNAC 
CG<^CCTATTGNAAAAAAGNCCTTGNCTCCAANTGNGGCCCCCCTTGGGGTAAGTNAAAA 
AAAAAAGTCCCrrGTAANCCCCAAGGGGCCCCCCCTTTTTTTGGGGGAATTTCC 
Sequence 789

^ C1 ^CGAGCGGCCGCCCGGGCAGGTACTTTAATTTCTTTATAATTTGTTGAGCTATTT 
AAAAAGATAATCCACAATCTCCTACCGCCATTAGAGCACAGGAAAAAAAAATTCAAAAAT 

aaaggaaaaacatggctcatatatctacagaagtcacaaaaatactatagggcacatata 

cccaggcct cagcgg tgggaagaaaacatacaaccaccgggcaaaatgtttgaacactga 

agacgggaattttttagggcc 
CCCTTAGCGTGGTCCGNNGCCCGAGGTACTCAAGTCGCCCTTATGGAGCCCTTGAT^
*^^35^^ A ^^®^ < *^*®^®^^ A ^ AA ^^^^®^^CAGGX3AATGAAGT AAGTGTTTTTT 
^IGJT^^T^GTTATAACACCTGATTAAGAG^ 

ATTCCCACCCTCTCCAGCAGAATTGGCATTCT GCG TCC TTACCGGC TTTCTG TCACGTGG 
ATTTCCGCCTGTTT<XrrCATTGCCTCATGGAAA 

GGAGCTGAGCC^TTTGAAACTGAACCTACAATCTGAGGTGGGGGGTAATCT 

agtgctagatggtgaaaaaacaagtaggactttcggctgatgggtagaaacaaggacctt 
fkstaaagaatattcatgtgctcaaaaaggaataacttcctggctaattcttgcgt^ 

TCGTTTTTAAAATTAAT^^ 
Sequence 791 

GATATCTGCAGAMTTCGCtmAGCGTGGTCGO^^ 

TTCCTAGACCGATTCTAGTTTG^ 

GCTGCAGACACTCTGTGCCCCCCTGCATGCTGG^ 

GACAGAGACGATGTGGCCTTTGTCCTTAAGAATGAGGTTTGAAAGCCTCAGTTCTrrCX^T 
GTTAGGTGArmCTTGCAGCTCTTGGTATCTGCAGAATTAGTGT^ 
TTAACAGCTTTATATCATCAAAGTTTTAACAGTACCTGCCCGGGGCGG 
GG 



CCCTT AGCGTGGTCGCGGCCGAGGTAC 1 1 1 1 1 1 1 1 1 H I II 1 1 1 M 1 1 1 1 IT G AA 

GCTGAAGGCCACAGTAGCT AGC TAAAGGCCACACCACTGAACACTAAAACTrTAACCTTTA 
CTGGCTACTTTGTAHATAACATTCACAGCTCACCATGAATGCAGCTGCAGTCAACTAACA 
N^ATGAAGTTACCACTGTATTAC^TGGTTATATTAGGGACT GCTTNTACCTACTGGAGG 
CTGGGGAGGAATGTAACAGCACAAGCCATAATGAAGTTTATATACAGGCTT aatataaaa 
NAAAACCCTACAATGAACTCAACACAATTAT 
Sequence 793 

i i > 1 1 gcagaattcgccctttcgagcggccgcccgggcaggtaccatgcagggatagctg 
^fiV" ' ^^^cctcctcagcccctatctgttcagtgcactgaacaccagctgctctcttcc 
tctctggctcccatggcagccatggtctgttgcagagagaagaggattgcctgttccctc 
tttaagggaacctccgttttgctttctggaaccac 
Sequence 794 

ccctttcg*gcggcccgccc6<3gcaggtacg^ 

aatgagaaatc^tgagagatttractttcaattttattttaatttgaaagagcatatg^ 
ATCTGGAATA 1 1 7 1 1 aacatatagccatactgtttatttaaatttgtaataatagaaata 

GAG T AATTCT AC TG TTGGATTTT AATTTTT AATCATATT AAAG TTTAACTGGATTTT ATT 

TTAGGACTAAAATATTTAGGACTAAATAAAATTTTATTAATTAATTrAGGACTTTTG 

AAAGATATTTCAGAAGTTCAGTGC*TATCAAAAAAGC^ 

GAAAACTTC^TTGGCTAAAAGTGTCTTCTGTAATACTGATAGTGAAGAAACTG 

ATCCCGAGATGTGTTTGATG 

Sequence 795 

CAGCACTCAGAATCATTTCATATACCACCTTTGGjCAAACATGCCAGACCTGCAGTAGACT 
GAACCAAGCTCTCXCAAGCTCTAAATTGATTAATTTATTAGTTCCT agaagaaagagatt 
ACATGTTTA ICI I * I igttacagaagaaactttgaatagcagtitgaaaatttggcagggt 
ggaccacctaacttgacagtgtattattgtgtct gttttgaaggaataaaatggaattat 
ttataaagttttcatttgtattagaga 

Sequence 796 

ccxntagcgtggtggcggccgaggtac^ctatctgacctaatcctcaacacaaactaagg 
caggagacac^ggctg^^ 

CAATCTGAACCTATATTATTTTTAAGACCTCACGGGATXIACTGAAAGATGAGTATTATTA 

gttggaattttagggatgagaaaactgaccctcagggagaataac^ 

CC A ^^ A *GTG^CTGCTGGGATT^^ 

ctgacctctgacagtcgtgataaaaataat 

Sequence 797
cccttggccgcccgggcaggtaccgaaaaatgattttgttatatatatttaccacaataa
aaaagttttaaatttattataggtgacactgtttgctcactgtaggtcaggtattttt^ 
gm i i ii i 1 1 ctctttattrtattrttgaccaatggattcacgtcaccaggtgatttttt 

AAACAGCTTTATTGAGATATATATCACGTGCCATAAAATTCACCCATTTAAAGC 
TTAAATGTTTrTTAGTATAGAGTTCTGCACCTCTTATGACAATAAATGTTAGAATATTTT 
CATCACTCAAAAAGAAACCAGTATCCATTAGCA 
Sequence 798 

CCC TTTC GAG CGGCCGCC CGG^iCAGGT ACAATT1TTATGTTTACAGCTGTAACCCCTGAG 

TTATCAAG AGATG GAACATTAGATATGATTTATTCCTATTTAAGATAATAGGACATTGCT 

T GATT ACATTTTCAGAAGATATTTATCCAAAGAAA I '! I T T 1 1 1 FTT A ATCTAAAGGAAAG 

GTTTrGATTCTTAT6AGAAAAGAATGAGATTTCTTTAACTG<3AAAAT^ 

ACAGTCCATTG TGTAG TGATGTTGGATCAATGAGGTATCNCTAGGGTGTCTGMAGAAGTA 

TCTATATATTGCTTTTTAAGTTCTTAT 

Sequence 799

CCCTTTCX3ACCGGCCGCCCGGGCAGGTACCATGT AGCTCTACTTTTCCATATACAGAGTT 
GTTTCCTAGCTTTCTGCTAATCTAACTGGATTCCTCTTCCCCATTTCCTCATTTACTA 
TTATAATGCACATCACATAATAAAAGC TT AAAAATGGGCTTTCACAGTTACTGTTTTCTT 
^^A^TAATTGTGAGAGAGCTTTTGCATCATTT ATTATC TAATCATGATTCAAG TGAC T 
AGGCTGTAGCACCCAAGAACCTT GCCTTAAAACAGTTTATTTTACCCAATAATACTACTT 
TGCCTTCTTACTT AAAAATCTCCCGTGCTTAACCCTTTTG^TCTTTATTTTGATTTAAGC 
ACTTGACC 



CCCTTAGCGTGGTCX3CGGCCGAGGTACTNTCTATTTTTAACAAG 

AATGTGACAAACTTACATAGCCAGCTGTAAGATAATTCTrTCAAATGCGCAAGTAACCTA 
ACAGATTTGTGCATGTCAGCCAGT AATTTCAAC^TACATTATAAATATGGCCAATTTTCC 
CAAATTCTAAATGAATGGAGATAAAATGCTATATAATAAATATGTTAGAGCACCTTTCTT 
®*^^*CTTKTAAAAGGAAAAAATAAAAGACATAATTATACTCACACCACCAGTAAAACC 
TCTGGTCACCTG7TTTGGGTTGTGGGAATGCCCCCAGCAGCCGAGAGACCTATATT 



GATGGATATCTGCANAATTCGGCCTTAGCGTGGTCGCGGCCGAGGTACTGATTATTCTCC 
TGC TTAGG GAGAAGCGGAAGAAGGCC C TTGGAAC TG TGAG TTTTGCATTCCAACTTGCT A 
AT TCAACA TAGATCCTAATTCCTTAAATGGTTGTAATTAGAAATTCTCGTGAACTGTATT 
GGi i M 1 GTCAAGCAATCTGTTTGGGGAACTTGAGCAACTGGGGCACTGCTGGCTAGGGT 
GAAGTTT ATTTAATTTGGTTTTATGACATTCTTCATCTTGGAAATGGGGTTTTC^AATAT 
TGCTTTCCCAAGCATCATTACTTATTTGCTGGTrTTTA 
Sequence 802

CC CTTTG AGC G GCGCC C GGGCAGG T AC GAT AGGCA T G CAATT AAAGAAGACCTGCC T CAA 
ACATTTTCTGTGTGACCTGAGGCANGTCCTTTTATAGCTATAAACTAGGGACAATATTTG 
CTGTCAI MM 1 CTACAAATGTCACAAAGAACAAATTTGAGCCTGTCGCTGTGAAAGAAC 
TTAGCAAATGAAAGCATCCTAGGGAGTGTTTTAGATATCGATATTTTTATCCAATTAACT 
TTTCAAAATGAGTTTATTTGCTCACTGAAACTGAAGTACCTCNGGCGGGACG 



^ c ^^CGAGCGGCCGCCCGGGGAGGTACGCGGGGGGTTC>iGCTGTCrrCTTACTTrTAAC 

cagtgaaattgacctgcccgtgaagaggcgggcatgacacagcaagacnagaagacccta 
tggagctttaatrtattaatgcaaacagtacgcttgggagtcctcagcag<3gggatcatt 

CACAGTGAGGACAGACACAGGTGAACCTATGGGTCGTGGAACAAAAGTT^^ 
GAAAGAAGACCAflACTGAGTNCCTNGGCCGNGACCACG^TAAGGGCGAATTCCATCACAC 
TTGGCGGC 
Sequence 804 

CCCTT AGCGTGGTCGCGGCCCGAGGTACCTTGCACAGTG<XrrTTT AAA TTCATTTTGCTG 
GACAGTTGGCAGGCT C TTTCAC TT GAGAGGCTT AT ATC TTAACGATTT AGAATGGAGAGT 
TTGG^TCAAGCTCCCT GTGTG TGGTCTGTGCTTTCTATACTTTTATTCTTGGTATTCCAG 
AGTCTGGAGGCTTCTC I 1 1 1 1 AAAAATTGCTAGGCTCCTGCCAAATGTTATAATTTGGGG 
ATGTGAGTTCACTAAGAAATCAACTGACAAGAGGCAGATTAATAGGAGAAATGACATCGA 
AATTTATTAGCATGCAGGGGGAAAAAATrGATTACCAAATATCCCAGTAGGGTAGAGATG 
CTTATAT ACCOCACC TCTTAAGAGAGAGGGAAAGTGGATGA7TTTAGGGGAATAGTAAAT
AC 1 1 1 1 lATGGCAACTOkCTGGGCTTGAAGAATATAACAAAGGCCTGGGACAAAGTCTGT 
TGGGOCCACCAGAACAAGACAG TGGTTT ATGACAAAAGTCTGTTGAGAATGT ATTGAACA 

GAGGGAAATNGT 
Sequence 805

CCCTTCGAGCGGCCGCCCGGGCAGGTCCGGGCAG6TACT ATTACTACGTTCATTGTTTOC 
AGAiGGGGTGAAACGGGGCTTTGGAGAGGTTAAATAACTTGCCCAGGGTCACACAGCTATT 
AAGT13GTAAAGCTGGGATTTACATGAGCCCAGACAAAGAACCCAA<GAAGCT AAGCTATTC 

tcttgtaatacctcc>acataggaggcaagaagtgaggtattatacaggttgaggagata 

TTGTTCTG 



ATAGGTT^GTTAGTCTTAGAGATACCC^TGAGGTCACCTACTCAAAATGGGGCTCAGAGT 
AGCC 1 1 U ICCCATTCTTGTCCAGTGGGCGCAGCTACAGTCTTCCTGGCCTGGAGTGACTG 
GAGGCTGTCCCCACGTCCCACTTCAGTGAGGCATTCATGTGCACCCAA^ 
CTTT ATTTGCCrrGGAOGGGAAGATTCTCCAGAACCTTG TT AAGATGCACAGTGTGG TCC T 
CGGACTGGCAGTGTGGGCT CGGCAGTCCCTGGG 



CCCTTAGOGTGGTCGCGGCCGAGGTACACATATATACACACATATATAGATATATACACC 

cacatatatatttgctgacattttaatgtgaagttttagtctgggatataaaatggaatg 

TATGACATCCTCAAATGTCTGAATACTGTTCACTCCTATGTTTTACATT^ 
AGCAAAACATTTCAGTTGAGGATTTTATTAGAAAATAAATAATCATTTAGCCATATCT 
AAACCAGAATAAACAATGCCATAAAGCCTATAGGAAAATGCAGGTCAGATTCATAAATAT 
TCATGTGTTT ACTTTCAGTACAGGGAGGAATTTGAAGTAGAT AGAAACCGACCTGGATTA 
C TCCGGTXrTGAACTCAGATCACGT AGGGAC TTTAATCGTTGAACAAACGAACCl TTAATA 
GCGGCTGCACCATCGGGATGTCCTGATCCAACATCGAGGGTCGTAAACCCTATTGGT 
Sequence 807 

CXXrTTTCGAG<X3GCCGCCCGGGCAAA 

ATCTCAGTGGGAACGCGGACAGGAGAGCCCCAAACCAACACATGCTAGGGCTCTCTAGGC 
CCTTTCAGGCTAGATCTTGACGAGAGAAGAGTAAAGATCTTTCTGAGGTTGGT GCAACTG 
AGGA AACGAAAGTT TCGGCCTCTGCTGTCAGATCTATGAAAGGAAAGAACTGTGAAGTTG 
TCCCCTTTTGI MIC II IGACTTAAAACAAAAGAAAATCACTGGAACAAAGTCTTAAAGT 
AATAACAGAAATG TCAGAAAAG TTGAACATCTTATGGGCACATGCGG TGAGTT ACGCTAA 
CTTATAGCATCCACTGAGATTAGCCCGCATAGGATTCTTCCATGTTAGAGCT AAAAGGA 



CCCTTAGCOTGGTC GGGG CCGAGGTACTATCGCCTACCTATAAGGCATTTATAATGTGGT 

GGGCATTGTGACACTTTTCATATATTATCTCATGAAATCCTCACAATAATTCTGAAGGTA 

GCTGGTATTTTTATCTCCACTTTACAATTCTGAGGCTTACAGAAGTTAATTCAGTGGC 

AGGGTCACACAGTTTACAAGTGCCACATTGGTGAATATAAAGTAGGAACTTCTAAGTTTC 

ACTCTCCCACTrCCCTAGTTATTTTCCTAAGGCATGAATGTCTGGGAAATAGCATGCATC 

AGATTTTCCACCTCTTTAAAACTCTTCAGTTCATATAATTTAAGGGTGTGACTATTCATA 

GATACCTTTGAGCTAATCTTCTGGGAGCCAATGTAACCGCAATGCACACT GCAAAACAAT 

GCACGCTTMCTCTG T AAATT AAAAATGCCAACCCGAG C TTT GGGAAAAGCCCATCTTTTG 

ATATGAACAATTAGGGCAGTTTAAGTTTTAGAAATNAAGAAAGTC(^CTGGTGCTGCTTT 



CM 1 1 1 IC I mcr 

GAATATTGCATACCTATTAGAAAAGTCT7TTAACAATTAAAATTGGAAAATGACTGACAA 
ACTTACACTATTTGATTTAAATAAATAAATAAATGGTCACATGATAACAATCT CCTGATT 
GATATGCTrrATTTAACCAGGTTCTCAAACCATTGGATGTGAAAACCAAATTTTACAATG 
CAMAGGTAAGTGTTGAGTGTTTAATGGGATTrCATATTAAACATTAAGATCGTATTTGAC 
TAAAAATCTCTTATATACATTTCTAATACTGAAGCAAATCGCCAACGTGACTGT AAATT A 
TTTGAAAAAATGAGAAATTTCAGTTAAAATTGAATAATTTTATTATAGGTCTCATAATCT 
TTTrCAGCTTACATGGAATCAATGTGTCTTGATTTTTATTCTCGGTAATTTTATAAGGCC 
TT CATCTCCTTTCGGTTAAATGATTGCCCTC TCATTCCATTTAATGGNGG TTG TT ACACT 
AGCAATCTGTTGGAATATFrACATGTGGGTTCGGGATTTTCCAAAAATTGGAATTANTAG 
AACC T ACCGC TGCAAAATAGATT AATATTCACATGGGAAAAATCCT GGNCAAGGGGAANT
TTCNNCATTAAATTNTTTNCAGGGGAGTCCGGTTGGCCANCCA 

TNGGGGGAATGGCTTAAAAGCCCTTGGGAAAAACAAATTGGCCAAAAAHGGGAGTTACCT 
TTTAATTGAANAAN 1 1 1 M 1 1 1 1 ACCCTNAAAAANGGGATAAAATGNACTTGNCCNAAAA 



Sequence 810

CCCTTAGCGGCCGCCCGGGCAGGTACTCCA1 1 1 CM 1 1 1 ATTCATATTATTTCACCAAAT 
AATATTCCACTGTGTAGATCTATCACATTTCGTTTAOCAGTTTATCAGCTGGTGGAC 
TTGGCTGTTTCCATTTTTTGGCTGTTATGAATAATGCTGCTATGAGTCATA 
CCTCTTACTCAAGAAACAGGTTCTCCAGAAACTAAG 

TTTTAACAAGTCCTCCAGTGGATTCTGATGCAATGCTAAG^ 

GAGAGGGTTCTCATCCATCAATGCCTGATTAACAAAAACTATCCCAAATGACT^ 

AACCC 

Sequence 811

CCCTTAGCGTGGTCGCGGCCGAGGTAGAATCATTAAAACTATGTTGTAATACTGTTTGTC 
TTTGTATCCATTCTGGCGTGTCTCCATACACTTCACT AATATTTGATATACCTGTTTTAT 
ACCAAT ATAATGC TGCT GCTGT ACGTAGAAGCTGT AG TCACCATATCCTCTATTTG TTCA 
ATTATTTTTTCATCTrCTGGCACAXn^AGGATCTATAACAATGACAATATCTTC 
TTATTATTCAGC TTAATGAAGGAAGTATTTGACTGGTGCAGCA<»CACAGAACTAAGAGG 
AAAACAAAAXXrrCTGAATAACCCCATTGTTCCTCTCTAGTTATTCCTGGCTCAAATGTTG 
GTTTGTTCCCCCCGTCCTGCCC«5GCGCC^ 
GGCCGTTACTAGGTGGATCCGAGCTCGGAACCAA 
Sequence 812 

CCCTTAGCGTGGTCGCCGGCCGAGGTACCTAAGAGTTATTAATACTATTTCAGTAAAAAA 

AAAAATTTAATAAACCCTGTGTGATCCCATTGTAACAGAAAGGCTGATGTTrTCTGTTGT 

GAAATACAAATGCAAGGAAAAAATCATTTCTTTGTTTCAAAGGATGCATTTCTTCCATM 

AGAATAATTTGTATTTATTTTTAAGGGTTTATTTTAACTTATACATCANCCTATN^ 

T ACATTTCAAAAT GATCTG TGCTCTTT AAATT ACCAAAAGCAA 

Sequence 813

CCC TTGAGCGGCCGC^iGGGCAGGT ACATGTGCATAAGAGGGAATGCTTCCCT ACATT AC 
TCCAGAATACAAAGCTTCTTTCTGCCTTTCTCATCCACATAATGGAAGACACTTCT^ 
TGAAATACTCCACAIOTATrrCAGTTCTCACTGGTGAGTCTGAATATAAGCTCTATGAGA 
GCAGGGACCTTGTCAGrrCTTATTCACAATATCCCCAGCCT 

AGTAGATGCACAAAAGGTGTTTGCTGAATGAATGGATGACTGAGTCTGTGTGGGGTAATG 
ATAGGGCTAAGGATGGGACTCTAAACTCAGG I \ IU C 1LMUI GGGTTTCACAGTTTACTGG 
TCTTAAGAGGAGAGTTTCCTAAACTTGCCTTATGATAAAAACCACCTTCAGGATTTGMTA 
AAAATTACCCATTCCTGTAGATTCTGAGTCAGTGAGCTGAAGTGGAGCTGATGAATCCT 
Sequence 814 

rnTTTTTTTTTTTTn TTT 1 1 1 1 1 II I U M H I I I M H H I II I I I T INT TT7NNCA 

AN WATT AAAAT AAAAAATT ATTTTT AC TACAAACAGARAAACGAATT AAACT AN NANCC T 

AANATACTTT>rrGGAATTGAAATGATACATTATATATACCTATNANGATAATNGNKTATA 

NCXSNNhlCTAAACTACAAATTAGTCATAAAAANGACTTNTGTNCTATATC^ 

GGTATTAAAATTGANTATNATAAGACAATA 

Sequence 815

CCfrfTT CGAGCGGCCGCCCC<y^A<>GT ATAAGT ATT ATGTATCCATAAAAATTAAAAAAT 
CTTTAAAAATGCATATGGGGGTGAGTAGGTAAAAGAAAAGAGAACCAAGAGAGCTGCAGC 
CGGGGAGCACAGC TT GCTTT AAACATGAGATC CAG CTCAG TGATCATGCGGGGGAAAAG G 
CCCGOCATTGC TGGAACTCCTAATATTT AAAAAGATGATGGAAAC TTGAAATTTTAT ATT 
T AA T C TT C TCATTTTT AAGTGTTGGCAA TGT ATT GAAGACTTTGAAGCC TCT C TGC T GG T 
CAAACAAGATGTATCTGTAGGCTGGATTTAGTCCACAG 
Sequence 816 

CCC TT AGCG TGG TCGCGGCCGAGG T ACAAC TGT AAT AGC TATTGG TC TTCAAG TGGGTTT 
AGA TTT GG T GACAT CAG TTTGAT ATTC TC TT AAAGGAAAT AAAT ATTCAAGAAC TGATT A 
TGTTCTAACATGATTATATTCATGGTGTTACATAGGCCTCAA 1T1 1 nC ACAGAAAGATT

TTTGGAAI>GGACTGTGAAGTGAGGCTTTTTAAAAAATTATTTTATAA^ 

GCXTGATAACTTAGTCAAGGATATACTGTCTGTCTCACTACTTTGGACTTATATGGCTTC 

AGATTAAGTCATCCAAGAAACATACAT 

Sequence 817 

GAT ATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCCCGAGGTACATG TAATAGACACT A 
TGCTACAGCAAAAGC r nTCTT ATTOTCTI f AAAATTTTCCTGGGTGCATAAAAC TATGT 
GGGTAACTCTTTCCCAATTTTTAACTTTTACATTACAAGTCATTTTCAGAGT AAAAAGTC 
ATTTAACAAAGGCAGATAGAAAGGCCTCAAATCCCT^ 

TTCAAAAGGGAGAAAATTTCTTT AAACTTCATGGGAAAAGTATTTTTAACATAATAGAGA 
GGCTTT ATGCAGTCTTTGACAAGATGAT ACTTTTGGAAT Al 
TTTCATATTATAAA 
Sequence 818



ANTTAANI 1 1 I 1NAAN NNTNNI M 1 1 1 1 1 II 1 1 1 1 INAATWI UN! INNTTTTATTAAAA 
ACAAANGAAAAANTNACt I 1 1 1 1 NTCCAAANANNCGGCCTGAuXAAAACWTAAAAAACAAT 
GC NNGGA TGGANTCAAANTAAAAAA t 1 1 III 1 1 CCTAOGGAAAAAAXAAC 1 1 1 I II I G GT 
I INI II I 1AACAAAAANNTAGKAAAATTTCNNT I N I I ri A AAAAGNTAAAATNGGNTTTT 



Sequence 819

CCCTTAGCGTGG TCGCGGCCCGACCTACAACTGT AAT AGCTATTGG TCTTCAAGTGGGTT 
^A ^^^^ SGTGACATCAGTTTGATATTCTCTT AAAGGAAAT AAATATTCAAGAACTGATT 
ATGTTtrTAACATGATTATATTCATGGTGTTACATAGGCCTCAATTTTT^ 
TTTTGGAACAGGACTGTGAAGTGAGGCTTTTTAAAAAATTAT^ 

ACCCTGATAACTTAGTCAAGGATATAL rGTCT S T C l CACTACTTTGGACTTATATGGCTT 
CAGATTAAGTCATCCAAGAAACATACATACATTCTAAATGGTATATATTGGGAATATATG 
CCCCTTT AAAAGAATCAGGTCAGAAATGCAATAACAATTAGACTAGACTGTTGCCCGTGT 
TAGGAGAATGTGTGGGTCATCCTAGTTACTAATTACTCTCACTrCAAGATGGAGvA 
CCAGTTT AACATAGTCTT AAAG TT 1 1 C 1 1 AAACCCAAATAATTTATGA 
Sequence 820

CCCTTAGCGT GGTCGCGGCCGAGGTACTAGAATTAiGTTCCAACTACTGCT GGTGATAAAC 
TCACCATCTACCTTCACTTG TTTTC TCTTAATTCTCCAAGAAGT AATCAGGTGAAT AAAG 
AATCATCATt>GATAATATTCrCCAAGATTCrmAAGAAATTAATT^ 
AATGATTGCACAATTATAGGATAGAAATTACTATCTTGTGCTCTAATTCAAATTGCTC^ 



ATCTTTGTTTCTTTCCTT AAAACTT AACA 



0CCT TAGC GTGGTCGCGGCOGAGGTACTGGAAACCAGACCTTACTT AAGCCCACCAAAGG 
CAAGGTTTGGGCCT GCCACAGCGGATTTC^AAAAGACAAAGCAATGCAAGCCAC^ 
AAAATGCCCTAAGT GGCT ATTCAGGT AATATATAAAAGT AAGACCAGGCT AATT AGTAT A 
GAATGGGGTAAACCAGAGAGCAGAAAGCCCTTCTTTAAAATGAGCCT ACCACTGCTTGGC 
CTC-^^^TGAATTTAGACCCCATCTTCTGATATTTCAGGAGAAAGT AAAAATCT AGATTT 
TTATCTAAAATCTTTTTAATTTTTAAACAGTCACCTGAT^ 

Sequence 822 

CCCTTGAGCGGCCGCCCG<KX>GGTACAGAGC^ 

CCATACTiCCAGCCTOCCACTT GATACTGAAACACGTTTG T GAATTCAT GGCCGATGT CT A 
ACTTCCCTCACCACCTTTCCGATATGGACAG^ 

ATTGTGCCTGTCCTCCCTTGACTGTCATGCATATAATCAGCAIC 1 1 1 CC CACT AAGTGAA 
GG<>CCCAGA<rrCGAGCACAGGAGCACAGCACCC^ 
ACCATCAGCAGGGAGATTACACTTGTGTCATTT 
Sequence 823 

CCCTT AGCGGCCGCCCGGGCAGGT ACCAAGACTTTAGAGGGCAAAGAACAGAGGATTCTT 
G^GAAAGGGGACTTGAAGGTGAAGAGATAAAGGCrGGTa 

^^'^'^GTGTTCCTGGGAAGAATCTTGGACTCAGGCGTGGGCAGCTGGATGCCTGGGTT 
CCTTAGGCTTCCTCCAGGCAATGTACTTGGCTCTTTCTCTCCCCGCGT ACATAGTAAGTG 
TATGAT AGATGTTTGATTTGTAAATTACAAATATAAATTATCACCCCCATTTCCATTTAT

TTTCTTGATATATCAAAATGTGTTG 

Sequence 824 

CCC TTAGCGTGCTCGCGGCCGAGGTACCCCCATTATAGTAGGGAGACTGA^ 

TTACAGGGTGAATCAATGATAATGATCTTTGCAG C 1 1 IC I G GAGTTAAAAAGCATCAAAA 

TTGGGAGATATTAGATGATGACATCTAAGTATTAAAATAAGGAGATATTAAATGATGACT 

CCT AGAAATGAACCTGAATAAGGACT ACCGCAATGTG fBTG G TCI GGGAAAGGACAGTTC 

TTTTAATGGCTGGCTGACCCAGCCTCAATTTTCTTGCAGCTTCGCCGACACGAG^ 

ATCTGCAATTACGAAGCATCTGCCAACCCAGCAGACCATA 

Sequence 825

CCCTT AGCGTGGTCGCGGCCGAGGTACCTCT CATGGC 1 1 1 H G G1 T C CAGCANTGAGGGC 
ATTGGTGAGAT CAGT GGTAAACTGTGCAAG C I IICI I H I ATCATT AGGAAATGTGAAAC 
GTNANGACAAATTTTGAGTTTTAACAAGGACAAAAAGTTCAAAGAA 

AAAAAAGGGTGGCTAGATTTATCTTGGGTGATGGXGGAAATGAGAGAGt^ATGCTCTTGA 
AAGGTGGTCTGTGGATCTGTCTGAATAG. AAGAGCACAGTRAGTATGC^TTGCCGGAGAA 
AACGTCCTT GAAGCTGCTT GTCTCATGTGTATGATGTG 
Sequence 826 

cccttagcgtggtcxacggccgaggtactcaaicaagcagctgacttatgt^ 
t tgtgataca ggaactgtttccagagctcaataaggt acgcgggaaagtgaactcagtt a 
cctctgtttgg tgt gtgtatcacttgcagatx^tgtctaccaccttttcagtgacatcct 

AGAAGCTTCTCTATTACCACAGNAA^^ 

TGAGCATC III II I C T GATATAAACCAGGGAGGGAAAAAAACAAAG TTCCTTCACTTTGA 

AGGGAATATTC 

Sequence 827

CCCTT AG CGTGGTCGCGGC CGAGGTACATATATGAAAAGCCAACATTCTAAAGTAGAGGT 
TCACTTAAI I 1 1 1 1 1 1 1 1 1 1 I CAAGAGAGGCTTCTTGGTAGTTTCATCACACAGTGGTTT 
TATTAGGGGATGT AAGGATTACAGAAACATCGTATTTTTTAAGATATAGT ATTTTTTGAA 
TATGATTTGAATTAATATAGAAAAGTGCATTTTTTCCA C> 111111 1 AGGGAAAAGGAGAT 
ACTT CACCAGCAGGATAAAAAGGAACAAGAGGGGAAGCGGAAATAAAAATTCCAGAAAGA 
TGAAAAATTGTTGATGTAAGATGGAGGCACATTTT 
Sequence 828

CCCTT A GCGT GGTCGCGGCCGAGCT AC AAACAAGC HIGH AAACT AACCCTTGCCATCC 
TG GCTA CTTT ACCCAATT AACCACCCT AGCCCA^XaACGTTTTGCTTTATCACATGTTCAC 
A GTTT GCT A TTCTT TGTTCAATCTTGT AACTGACTGCAACT GCTTCTGTGGGTCTCTGTT 
TCTTTATGAAGTTTCCCAGGCCATACAAAACri G TG T I AGCCT ATCTTCTGTCAGTTTAA 
TTGTGGAACTCAGCCAGGCCCTTAAGAGGATGGAGGAGAG 1 1 1 1 1 CCCACAGCAGTTCTG 
AATGGGATGAAGTGAAAAATAAAATC TCCCCATTGCCACTACACCACCTCCTGATGAGTC 
TTGCAGCAGAAATACCGTTTAACTGTTTCTGCTTTTA I M il l 1 C I G ATTATCATCCAGT 
TTTATATAT TTCA TATCTGGGGGCTTTGATAATTATATATACAT A C I TI 1 I TG AAATTAT 
TT ACTTATTCTTT ACATTGAAAAGGAACTTGCTTTGTAAATCTAAATTCCCTTTNCCTTC 
TACAl 1 1 II 1 1 1 
Sequence 829 

CCCn 1 CGAGCGCCCGCCCGGGCAGGTACTCACAAGCAATAACAGATTCATAGATCAGTT 
GACATTGGCTGGTCTCCAGGACAGGAATGTGGCC^^ 

GCACT GAATCTGT GTCTCC CCTGTT ACCTACTrTTGCCAgltjAAATTTAAGTTTT AAAAT 
ACT TTCAGA ATG T A TTTTTACT ACTGCAAGTTTTTGGTCTTTAAAATGTCAAGTAGCATC 
TCTCICI 1 1 CTCTCTGTCTCTTTCTGTTTCTCTCTCCAG TTTT M TTTTTTTTTT AATTT 
CCATATGGGCTAAAGAATCCAAATATTTTAAAAATCTGNCTCTL' 1 1 H C 1 1 C 1 CTCATAA 
AGTGAATTATTCCTC I 1 1 1 I IG I I H ATGTAACTCTATATATTCTTA G I 1 I I IC 1 1G AAA 
TCATTGT AATGCTAACTTTGTTGTTTCAAATATCTrGGTGATTGCTTCATTATCTCTTCA 
ACAAAAAAAACCTTTAATT 
Sequence 830 

CCCTTTCGAGCGGCCGCCCGGGCAGGT ACAAGCCATTGAATAAGCCTCTTCCTTTTTTTT 
GCTCAAACATTCCACATCCTTGTGGATTCCCCTGCATTGTTTGTTTTATATAACATTTGA 
TATTTGTTGTAGCTTGT AT ATGAACA T AA I 1 1 ICI 1 1 AGAGGTAGTCACTGTTCTCTCCA 
GT ATGACCCAGG 1 1 1 C 1 1 GACT CT GAGT AA TGCACCTTCT AT AACT ATCT AAATTTC TAT 






PCT/US00/33312



tgaagctttttggattatgagtatgctgactt7tcacgattggctggtgcatgtttagac 
ittaaatgtcatatccttcatgtctcaaagccaaaatagtaacatctcatctcagaacaga 

GCTGTGACCACATGCCAATATATGTGTCACAAAGTCTACATATGTTACATTCCTTGGAAG 
TCT CCTTAAATG TTTCACA 
Sequence 831

^I TG tSS GGCCGaxx ^ CA ^ 

AGGCAAGCTGCCTGTAATTGAGCTT GCTTGTATATGACCAATCAACCTTTGCTTGTTGAA 

GGGTTAGTTATCTAGTTTC^rrCrmCTTTTTTGGAATTTGGTCrrr^ 

AATC 1 1 IC 1 AGTCT AGAGCATGTGAACAGAACAGAAGGAAAATCAGGACTCAGTTTACTT 

AATTT AAGCAAGCATTGGTTGCTGCAGTTCAGGGGAGG7TAAAGTTGCTGGGC7CCACTC 

TCTT ATTAGCATGGATGCTT AAGAACTT CAMGGGTTTGGAGGTCAGC TTGAACAGCTG TT 

TTTTGCACTCTCXXrTGGTTr rrAGTAG CCTGAGTCTATAAAAAGAATACCACT CGGGTAA 

AAGCTAATATCC TTTAANC CATTTTTTACC TTGAT ACCATTGCATTAAAAAAG NATTATT 

CAATGGGCTTTCATTTGCTTTTTTGGGCCTTTTTGGCTTNAAA^ 

AATTGCCATGGNTTTAAAAAAAAAAAAAAAA 

Sequence 832 

CCCTTAGCGTGGTCGCGGCCCGANGTACCCT AGGCAGGGACAGTCAAGAAAACTTCATGG 
*I , =^^ TGT ** A ^ T A<3CGAGAAAGAGGAA^ 

CCCCTACCCCCAGCTT AGAAGGGCAGGAACCTGCTGTCCAAAACAGGAAATATAGGAAAT 
CCAGCTTGAGAAACTATCCACT 



CCCTTTCGAGCGGCCGCCCGGGCAGGTAC17 I U I H 1 1 » M M 1 H 1 1 1 H I IGGGNCA 

T^TS^S^h! lul 1 A TTGTTACT ATTACH TTGCCTGGCTCATATATATAAAG<^TTT 
AGAGAGACTGTTCC^ 

ACTTACCTAAATATTACTAGTTAAATTCAAAGTAAATGAGTCTGTATCTTTAAAACTACT 
TGGCAGTAATAATTTTTAAAAGTAGATTTTTATTGCTTTTCTTGXACTAACTAGTGTTCA 
T^^AACACAGGTAGTTTTATTTGTGCCTGGAATTAAGGAGTGAGAGACATTTGTAAAATG 
TTCACAATC^ACGCCrrGTCCCATTTTAAAATCTCACA^ 

ATTCAC^^TAAGAAATGGTATTTGGTCATTCTCTGAGTTCAATCTGTGCTCTAGTAAA 
TATAACTTGNGAGGAAAAAGTAAAAAGGNCAAGAGTCTAATTCATTTTCAGTTm 
Sequence 834 

CCCTT AGCGTGGTCGCGGCCGAGGTAC 1 1 1 1 1 H 1 1 1 H 1 1 II H H I II M IGGNTTTT 

TTATCTGACCACTTCCAGGAACAAAGCCAGGGCTCTCTGGGCACXTGAGTATCCATTCTC 

TTroTATCATCCATTCCATGTCCAGAACACATTCACATCCATGCTTATAGTTCCTCATTG 

CCTGAAGCCTGCTGGGTGGGGCATAGTATGAATACTTGCCCTCATCATCCCCATTTC^ 

GATGCATAAACAGAGGCCAGTCAGTATGCCTGCAGACTGTGGATAGAGCCCGAAGCCTCA 

< ^TTAGGCAGCTTGCATCCAGCTG TG AGTCCCAGCTAGG GGAACT GAGTCAG C CTCCATC 

ACTCCGTGTCTCGGTTTTCTGACCTCTCAGGTGGGTATCATGATGCTGGCTTTGGAGGGT 

C TTGCGAGAGTTCTTGGC TTTGCC T AGGAAAT AATTCA 
Sequence 835 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I I I H 1 1 H 1 1 1 1 1 1 H 1 1 1 ) 1 1 H I U I I M A 



cccttagcgtggtcgcggccgaggtacttagcaaagagactt acacattagtgaaaaatc 

taaaatcagccttacgtgggatctgcccaaagtattatttgcaaaagtatcattttcagt 

tttaactjttagggg^gcagcgtag^ 

ITt^ra^GGAlTTG^ 

gatgtgttacagaactgagcagc^tatcattagatgtgttacagaactgagtcctactta 
CMTMTTAATTTAATTTCAATAGCGATCCCCACCATTTATGTCCTAGGC^

t tggtc t c tgagcgaaaacacagccttatctgcaataaaagcctctgctttgctttggca 
tgttttt acaatcccgcgca 

Sequence 837

CCCTTT CGAGCGGCCGCCCGGGCAGGTAC 1 1 1 1 n 1 Fl 1 1 1 II II I H I I I I I G CAAACT 

TTAA TAGGTTTTC TTAGCTTGACAAC TCATTCT C T ATATTCAC NAACATCTCC TGAC TTG 

TTCCTTCAGTGGANATACCCTTTTCTAGCCAGAGTTGGCAAAAGTAGCAATAGCATGCAT 

TGGCTTGTTTGANAGGCCCTGGGTGAGCCTTTGTTGCATAAAGTAGGAGG^ 

TCTT GGTAGCATATGCCTTCATTATAAGTTTGCCTCTTTGAAAGAATATTCAAAGACCAA 

CACvAAAAGAGAACATTTCCAGATCCAAGAGAGTGTATGTAGAAACAGTGACAAGTTAGAA 

AATCAACTT AGGTATCAGATAGCAGCCACAAAATATGTTCTGAGGAAAAATTCATAGCAA 

TTTATAACAGCTGAAAAAAAGAGGGAGGATGCGGGAAGGTAGATTTTGTC>GAACT^ 

AGAC TAAGGATTT ATTGCATATTTTTT AC TAATTAAATG 

Sequence 838 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTACAAAAATAATGAAGCCAGCTAATTACC^ 
CAGGTT ACAACTTT ACAAAGAAGTGAAGCAGCAAAGAGCTGAAGCAGAAATGACATAGGA 
AAACAGCAGCAAAGTCCTTGAGTCCCAACAGTCCACCTCAAAGACAMCATACTAAAGAA 
CAAAGGCCCCTAATCCACCTCCTCACCCGCGTAC I I II I I I M 1 1 I I I 1 1 H I I I H I NC 
TTTC TG TTTCAAATTCTTTATT AT ACATCATGG TTGCACAATTTGAGGCTGGTTAAA 
I'^ A I T S£J TTT ^--^^AATCTCrrTTGAATATTTTCTGGCTT ATTACATGCAAATGACCAT 
GAAAATATTTGGCATTTTAAAATTCTGAAACTCTGAATAGGCACTTGCATGAAGGAAAAC 



TGGTCTTGTAGTCGGTTTGGAATTTTCTAAGTCAGGGTGG^ 

°IS ATGTGCA *^ TGG * AC *^ TCTC ^ 

CCrrCTCAGCATCTGGCAATGTCTCTTACTCAATCTTGGATCAAAAGGGCGTTGGAGGAG 

GAGGCTGGGAGGGAAATCCAGACAGTTCTCCGCCTCTGACATCAGGTCCAGCTGTTAGCA 

TC G TGC TG TGGG TCCCTGAAC^VAGAAGCAAAGTCAGGACT(^TTTGGCCAGGTAGGTGAG 

GATCCAGTGTTGGGTGATTCTGATCCATGCAGCCCrTAGAGGCGACACAGACGTGAACTG 

GACATTCT AGGAAGAAAGAGC C GACTGC C GGGT GACC TGTC T AGTTCACATCCACTCAC C 

ATTTCCCTCCTCGTTCCTATTCTTAGAAATAAGACTCTGACGCTCTCTTTTATACAGGCT 

AGTCCCCTATAGGCATGTCATGGTGATTATTTGCAATCCTNCTGACTTTCCTAAGAAGAG 

ATCANACTTAGCAGCGTTAGTC 

Sequence 840 

GTGGTCGCGGCCGAGGTACAAATAAATGTATCTTGGGTTAAGTGCTATAAAGGAAAAGAA 
CACKSTTCAATGGAAGGAAAAATTAGAATTGTTGATACATGAATGGAAGTAAATGACC^ 
ACTTCCAACTCTAAATCTCTGTCTCATTTCACCTCTTTGTAAATAATCATTGCTATTATG 
TTAAATATCACAACTACTGTCATTTCTTGTTrACCCACTACATTCTAAGCTTGGTGCTGA 
CATCTTTGTATTTATTATATAAAATTCTCAAAATTACTCTGCCCGTTAGGCTTTCTTATC 
ACTTATTTCAAATGCAAAAATAAGGTCCAGGGAAGATAATTATGTNACTTGTTCATGATT 
GGAGAGCTAATAAGTGTCAGAGATGAATTNAACCAAAGTTTGGTGTGACAAAAGCCTCTG 
GTTTTAAGCAAAAGGGGAAAAAAAATTCTCATTAACTCCAAGGATTATCATCAGGGAGTC 
CAACAGGGTTCCCAATTTGGGAACTACCTATATTCAATTATCATATGGCAAATGGGTCCC 
• CTTTTGTTAGATGGAGAAGGGCCAAAAAAAAAAAAAAAAAAA 
Sequence 841 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTTAAAAATGTATGTGCTGTTCTAATGCTACT 

TATTATTATTCCCTTCCTTTGTAGAATGTATCAACACTAAAAGTGTTTAATCCTGACTAT 

AACAATTATTTGTTAACTATTAAAGGGGTAATTATACTCTAAGCTTCCAGTTTTCAGTTA 

AAACAAAAATGATTAATATGCCTATACAGAACTTTCTCC^GCACTTGGTAAGTATTTTrT 

AAA GTGAAG TC T ATTCAGAC TGC AACCAG T AAAC T ATTT ATGC TT ATAATTTTTCTCAC G 

ATGGATTTCTGTTCCTTTGGTGCATTGGTTGTGTTTATTTTATGT^ 

CAAGGTGG GAAAAA TGACAGTGGTTTAGAAGATAAGAAGCACATGAATGTAAAGTAAAAT 
ATGTGGAGATTTTTGGCCACTCTGTAAACTACTATCTGAAGTAGTTTTAAATATTTTAAG 

Sequence 842
CTCACAGGCTCAAGTGATCCTrcCACG*CAGC^

GCCACCACACTTATTAATGTAGATTrCCTTTGTAGATGTAGATTTCTT^ 
AGCTTTTCACA GCT A GTCCTATGTCTGCAGTTTCTCAGAATAACCAGCTCAAAATATGCC 
AGAGAAGTATATTTTGGGGTGGCATATTCTA 

3J?T^ TC ^^^ CC< ^^^^^ TAGG ^^^^^^ GAA ^TTGCT CTTTCAGTCCCACTG 

°^^rr^ TG ^^TG GGACTAGAAAATGCAANGGAGGGAGAAAAAGGAAAGGCTTG 

GTGNATGTCAAAGATTTTTAA 

Sequence 843 

CCCTTAGCGT GGT CGCGGCCGAGGTAC H I H 1 1 1 I I I M H 1 1 1 1 GCCTATTAATTGAT 

AAAGTAGAAAAGAACCTTTAGCTTTAAAGGT ATGTCTTAAT AGAGCAGTGCTAAGACAGG 
TGGTT AGGT ATGTGAATGCATGCCACTTAGAAAAGAATATGAAGGAGAAGGGACCAAGAA 
GGCAGATACATTGCCCCTGATAAAGAAGTC^ 
GCCACTAAAAATCTAGGAGCACAAATAATGAAAG 



G^GCGGCCGCCCGGGCAGGTACAAGAGAA^ 
TCAGGCTCTGTGCTATACTTACATTATC^ 

* A 5Z*^£i A *" AA * G ^* G< * AAA ^ TAA *'^ C^^TTTACTTGCCAAAGGTCACACAGTTAATAC 

ATGGTGGAATCAGGACTCAAAATCANGCCTGTGTGACTCCAAAGTCCAGTGCT 

CTTT^«AG^AACCTTCATAATACCGGA 

ATGTCCCTGAGTGANTCACAAC 1 1 1 1 I C 1 1 CAMCCAGCTTTTTTCATT 



CCCTTAGCGTGGTCGCGGCCG^GGTACCAGGAAATTGGTTTG^ 

TGGA CCAAT CACTGTGGCCAAATACATGAGGTATCCTTATTGGCTC^TTCTACTAGCAAC 
AG^TGGTTTAGAGAAGAGTGTATCACAGAGAAATGGGGATCACTATTATAGGCAGATTGA 
ATAATAAATGTTCACTCTACT ACTCAATAAATATTTGTTGAACAAATCAAAGCTGATCCC 
■ i 1 1 i « CAAAATTTTTAATGTGACTCnTAGGGGATGGTGGATCCAGGAGAGAAGATTAGT 
GCCAOU3TGAAAAGAGAATTTGGTGAGGAAGCTCTCAAC TCCTT ACAGAAAACCAGTGCT 
GAGAAGAGAGAAATAGAGGAAAAGTTGCACAAACTCrrCAGCCAAGA^ 
TATAAGGGATATGTT 



CCCTTCGAGCGGCACGCCCGGGCAGGTACTTTATTTAm 

actatttcacaaaacaaaatgtagcttt cttaaaatttgtagttaaaatgttttctttgt 
tttcccaataaaatgtaaagtttaatatgtgatggct aaactcctagggggataaggagg 
cgctaggagaataggcaggttggaaaagggtagtcgggacttgtccagattc llu mi GG 

TAGTCTGGGTAGTCTGT ATATTTACCATATGGGC T ACAAGACACACACACACACACACAC 
ACW>CTCACAC>CACACACACACACACACACACCCTTGTGAGC^ 
TTGATGGTGCATAGTTTGGGGAGTGGGTAAAGGATATGTTACTTTTGTA 
Sequence 847 

CC CTTAg C GTGGT CG CGGCCGAGGT ACTATGGTGTGTGTGTGTATGTGTGTGGTGTGTGT 
GJGTTTT^AGTTTANCCTTTTG I I 1 1 1UI till IGGTTGGCAGTAACCCNATTTTAATGA 
CTAAGC I 1 1 1 AAAAATACAGTACTGATCATTCTATTTCCCCCTNT ATTGATCCCCACCTC 
CAAATATCTC^TGAAC^ACCGACTAATCACGACCCAACAATGACTA^ A 



AGTGCTGAAAATCTCATGCACTCTAGCTATGAATGCAGGTCTACTTGAAGCAAAACTCTT 
CAATCT AATTGI 1 1 1 CTCAATCTTTGTAAACCAGTTTTAAGAGTCACCAGAAATCTGTAG 
TTTAAGGGACCAGATACATTTCTTGGCTGAGCC7TGTAGGACCAATATGCTGGACCAATT 
CGGTAAAATACACCATAAATTATGACTGCTTTATCTGAATGCATGGGACACTTGCTACGA 

TGGTGATCAATTCAAATCTCTTTCCTTTGCANCCCAGGTTTGGTCAGTCTGGCCAGGAGT 
GCAGATTATGACAAAAAACAAAGCTAAAAGACCTGAGCCATTAAGGTTACAG 
CCACCGAGTTAAACAACCT ATTTAAATGCAAGACTATTGATTGGAAT 
CCCTTAGCGTGGTCGCGGCCGAGGTC^jGCCGAGGTACAAAAGTTCTGAAATAACACTATA
GGCTTAAGGAATAAGGGACCAGAAGTAGCCTGGTAGCCAGTGTATTTCTGGCTTT ataca 

t tcctta ggaaaaaaaaactttatagatgtatttaagtagaattaaggtttacacaaatg 
An i i 1 1 gagagagagagtccctaggacctaaacattcgttctacggagatagggtcaac 
acg<^gatatttatttagcagcatggtctgcagaagtaggaggaggtgaccagatgtgat 
ggattatgcctgtaattccac 

Sequence 850 

CCCTTAGCGTGGTCGCGGCCXiAGGTNCCACCTAACAAATTGGAGGAAATGAAAAGACGAA 
TC^ACAACATTTTGGAGAAAAAATTTATTCTACTTCTAGAATTTC^ 
TAGTTCTTGGTTTGGTAIMTGAAGTGAAATCAAAATTGGATATTTGGAACATTAAATATG 
GGAGCAGAGAATCTGTGGAATTATTGCTGGAfK^ACTGGCATAAATTTATTGAAGAAAAAG 
AATTCCT AGCTCGACTTGATACTTGrTTTCAAAAATGTGGAGAAATTTATAANAATTTGG 
CTGGAGAATGTCAGAATATTAATAAACAGTATATGATGGTGAAATCTGATGTTTGTATGT 
ATAGAAAAAATA TATATAATGTGAAG TCCACTCTACAAAAAGTGCTGGCATGTTGGGCTA 
CTT ATGTGGAAAACCTTCGCTTACTAAGGGCTTGCTTTGAGGAGACNANGGAAGGGAGAA 
ATT AAA 
Sequence 851

COCTTTCGAGCGGCCGCC CGGGCAGGTACCTATATTCTATGCAAAATTTATAAAATAATC 

CTTGAACATGAAAACTCATCTTAAAATTACACGAATTAAGTAAGCATGCAATACAGACAC 

TTGCAGGATGCCTGGCCTC TGGGAACTGCTCCTGTCTCTGTGTGAATGTAGAAGTGAGGC 

TCAAACTC1 C1CI I AGGAAAATTTfCCCTTCCCACTGGCCATCCATTTCTG^TGACTCAA 

CAATTCCCACAGAGGAAATGGGAATAGTATCATCAACTAGC^GTCCTCCCATGCCAACAG 

ATTTGGGGTCCTTATCTAAGTGTTTCTGCAGCCCGGTCTTCCCTTC^ 

TGGCTCGTTAAAATGA ^ 

TAAAAAAAAACAAC TATA TAT GATTGAAAGAAATTTGAGATATTTTTGTTTC 
Sequence 852

ATGGGATGTGCTTGCTTCATGTCCCCTCCCCTCTGCXrTGCTCAACCCTACACAGTCCTGT 

CTGGTGACCGTGCCAAAGTCCTTCCTGCCTTGCAGAGAGGCCTNTCTTCGTCGAACATGG 

GCCTCAGGAAAGACAGCCTGAATGCCACTACCCCAGGCTTGTGGAAGGTTCTGCATCAGT 

GTGGC ATTGTTGCGATAGCCCTCAGTTGATGCTTGTTTGTGGTGTTGGGAGGCAGGAACT 

ACTTT AGGAGGGTGGAGGGGTGAGAATGAAAAGAGGACTTGCCCTGAGCCACCCAGCTGT 

GGTCACCTGATGGC 



GGNCGGGCCGAGGTACGCACATACATACACACTAACGCT CAGCATAAACTTTCCATTACA 

cttagacaatgacttgtggaggaaaaacaaggataaacaagagtctcaagaacttaagaa 
aaacatcagagttgattatttagcactttctcaggattctaaggcaatangcct aanttc 

AAAACGTGAAATTGTTCT CTATT TCCCATTAGTCATTAAATGAGATAAATGACAAGCTAT 
TGCTGCTTCTCCATTCTGTTTTCAAAGAACATTACAAAAATAAACCAGTGMGTTCTCTAA 
CAGTTCTAAAAACAGWTTG 
Sequence 854 

CCCTT AGCGTGGTCCCGGCCGAGGTACCAGAAGCAAGGCAGTTTAGGGACAAAGGGCATG 
AGCTTAGAGTCAGATTTCCTAGGTTCAGATCCAAGCATHACTACTTA 1 1 I I CI I IAAGAA 
CTTGGGCATCTGTAAACCAGGGATAATATCTTCTTCAAAGGGCTGNTGNGAAGATTCAAC 
AAGGTAATACATAT 
Sequence 855

CCCT T AGCG TGGTCGCGGCCGAGGTACCTGGGACTACCCACCACCATGCCCGGCTCATTT 
TTGTAI I I I I AGT AGAGACAGGGTTTCACCATGTTGGCCAGGCTAGTCTCAAACTCCTGA 
CC TCAAG TGATCCACCTGCC TT GGCCTTCCAAAGTGC TGGGATTAT AGGTATGAGCCACC 
GCACCCAGCCTTCAA (I I AATTCTGATAGAGCACCATCTACTACATGCTTAATA 

ttatccataaacagacatgtctgagcacagaagatcatgttaatgaaagattattgaaag 

GTACCTGCCCGGGCGGCCGCTCGAAAG 
Sequence 856 

CCCTT cgagcggccgcccgggcaggtacagaaaaaagcataatgaatacaacaactagca 
tcaaactcagtgtatataagaatggctaagtgaccattagtcatgtgaaaagcttaacaa 
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CT ATTAAGCTCTT ATTTTCTTACTAAA^ACAATTTTAAGTTCTTTCAAGGCTATAGTTA 
CGCTTTACATAAGAGGCCCT ATTACCCACTAATTCTTAAAATTTCT ACCT AC TTAAAATT 
TCTTTAGACATTTCCAAAGGTTAGTAAAGGAAGACATAAGATATGCTTACTTAAATCCTT 
GCTGGTTCCATGOCTCCCCATACAT 

CAGGTACCATGAAATAGGACCTTCTACGGTTTAAAATAAA 



> TAGG TCAATGAATGCCTGACTCCAG TGACAGACCAT AA 
TTATCCAAATCTCTCATTTATGAATATGGAATATAAATATGCTAAATTGAT7ATGTCATG 
AATAGACI FCTTTI 1 1 OCAT AACAATGTTTGGAG TTTC TCACC I I rCT CCTHMOCTTWTT 



CCCTTAGCGTGGTCGCGGCCGAGGTACAAATGTX5AG I T C TT C T C CAGACCATCAATATAG 
A 7TGGATTTATACACTGATCGC TGTG TC7CTCCTTGGT AAT AACCTT ACCCCATGTTGCA 

acaaaca tggacttg tt acaacat cccagag t gaaa tctgaat g tggtcaagaaagtt ca 

GAAACAATAAGAGTGATGCAATGCATACCACAACTCAGGCCCAGTGCAAAAGTCAGGCCC 

CAGCCCTTCCCATATAAGGGACTTGGTCATTTGAAAAATC^^ 

ATAGGGACCTGTAATCAATTAGAATATTC 

iTACTGCCTGGACri U AGGTG G 1 1 1 A AGTTGGCAG 
CTACATCGAAGGACTTCTGAAAACCTCAAGTGACAGTTACACCTTTG 
CAGCTGCCCTTTTCCCTCAAAACATGGATAAT^^ 

GCGTC 1 1 1CCIC1 AAGCCAAGTTCTCAGTCCACATTAGTCCATACTTGGCTACAGAATTG 
ACGTTTGTGGCCACAATCCTACT AGAAATGACCTTTGGGTAATATCCTTATCTTGTTGAT 
CTAGTTAGGGTCAAGTAAA 
Sequence 860 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTATGCAGAAGGAAAGtiAATTGCAGATGGAAA 

AAGCTGAGATGCTATAAGGAATTACGGATrTTATAAAGAGATCACCATGTGGGTG^ATGT 

AAAT AT AGATGAACAATGAAGCATAAACAAAATTTT AAT ATC TT AGAGGC T AAAATATTT 

AGAAATGAAAGACAACAATAGCATATAAGTTAAGAAAGGGGGTAAAAAGAATCAAGAGCA 

TTCTAAGGTCCTTATATTAOjrGGAAGGA 

AAATTAACAATGTATGCTGCCATTTC 

CCCTTTGCGGCCGCCCGGGCAGGTACCAGCACAGGAATTGCTGTATGTTTG > 1 1 1 IAATT 
ATCGGTTTTCACTTGGAGGGGCCAGTTCTCT ATATTTCAATCTATTTTCTAT ATCAGAAA 
TGAGCAOGCATTTTAAAAAATGGCTTTCATTGATGGAGAGGTAAAA^TGAAATGGCTTT G 
TTGT ATTTATATTATAAAAGGCGATTTCCCAAATCT AGAATTTATTACT AAAAATCAAGT 
TTGCATTCAGGGGAGGAGTATGATTTGCTCAAGCTTAC I H i ll I ATAGGTGGGGTTTTT 
ATATTTTCAATGTGATTACTCAC 



GGAATAT AATTTGCAAGGAAT AACG I F B I rGAAAAACATCCTCTACAGACAATATGAACA 
ATGCCTT AGTCATCT ATTGATTA TGAjCAATATACTCTTGAAGAAATTGTTTTCGGTTC TG 
GTTTCTGTGGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 863

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTACACCTCACCACCTGGGTC 

TTACCAAGAGACAGAGTAAAX^CATGCTTTCTCCTATCCAAACCAGTCTCTCCT GTTCCC 

TGCTrrGTCCAAAGCCAGTTGCAGGAATTrATGTCTTAAAGTAAACCATCGTATGATAAT 

TTGCCCTGAAAATGTGCCTATTAAAAAAAAATAGGATATGATGGGAGGCAGACATAAACA 

TTCTGGTCAATTTATTGGTGTTATTATrTATTTCAGTTAATAAACTGCCCTTTCGCTATG 

CTTCACTTTCCACGTGTTTAGGCAG 

iTACATGCTCTAAAATGTAAGGATTCATTTATGAG 
AGAGTGAACATACTGCTTGTAGCTAAAACATTACAGGAGACCTTAAAAAGGGGTATAATT 

gg tccc tatgt gaaatgaacct gacat atttttataaatt atttgtgcatgact atcttt 
tgktgatagcactaggaagacttnt aacgtttaaatactttatttgccctcaattactat 
ttaaaagtcctataattttaagtaattttacagctgacaaagataaatat ttttt icttt 



tag rrmc I aatgtcttggaggtaaagtggaaatggcctgttttgacacataatttcta 
gaacttggagttaattttgatcagttccattttgggt 

Sequence 865 

cccttagcgtggtcgcggoccgaggtacatgttactgggtattaaatgcgttcatagtag 
ggtattaaa tgagcaaggtccccatcccagaaaaatgtgcagtttgtccaa tgggaaaga 
tgcakagacagtttcagttaatatactaagtgctaaagattgggatgtgcacaagaagct 
ggaggtaaaaattctggaaaactgaacgtgaagtcaccactaggcaagctgcctgtaatt 
gagcttgcttgtatatgaccaatcaalx, rrrocTTGn gaaggattagttatct agtttc 
ctcc rmcrm i i ggaatttggtcttttaaggic 1 1 gataatctttctagtgtagagc 

ATGTGAAGAGAACAMAAGGAAAATCAGGACTCAGTTTACTT aatttaaagcaagccattg 

GTTGCTGCAGTTCAGGGGAGGTTAAAGrT GCTGGGCTCCACTCTCTTA7T AGCArGGATG 
CTTAAAGAACTTCAGGG 



T AGATATAGGATAGTGATACNTTGAANAGGAC TATGAAAAGGGACAGTAGGGCTTAGTGG 
AAAAAGTTTT AACGANNTCTACNGTTATTGAATNAAAWT ACATATAGCGNGATTCTT ATT 
ACTTGAAATTAGGAGGAGAAAGAATTTTTTGAGGTAAATTNGAAAAGACATAAAATAGAC 
TA 



CCCI 1 1 CGAGGGGOCGCCCGGGCAGGTACGCCGGGCATGCAGCCAGGCTAGACCGGCTC 
A 

GCCCCACTTCAAGACAAAATCTCAGCACCCATTACTCACCATACATATTTATGCAGTGAG 
CTGCATCATGACCAGCTATCATCTTACGTCATA G 1 1 T M T I C TCTGGTAGAGATAATTAA 
CTTATTATGCTTGATCAGTTAACTCTTGCTTAGAAATTTAAAAAA 

AATTCTTTGTAGAAATTTTTGAAAATAGAAATATTTGAAGTAGAAAGTT AAAATCACCCA 
CAATTCTGCI tTTOT TAAGATTTGAATATGTTGTCTTCCATGATATATAACAAAATTTGT 
CTGGGTATTGCATATGTCGCCTTTCC TTCTTAATATTGCATTTTGAGCATTTAACCNGAA 
CACTAAATATTCTCCCT AGAACATATGGATTTTGAATAATTTAGCTAATTATAAAAATAA 
CTTCCCTAATGGTCCTTTGGGCILI 1 1 IAAGGI J 1 1 GCTGGTATATGTTCAGGGGATGAA 
CCACTTAAGGCTCTTTOACCACCATACTGNCCATACTGCCATACTGGCAT 



Sequence 868

CCAGTGT GATGGATATC TGCANTTTTCGCCC TTTCGAGCGG TT NTT ^KK>GCAG^r^T^^rTT 

CNNCCTTTCTGTGKT ATTTGTGGCGGNATGTTGNATACTCTCTACCATGGGGATGAAGAC 

ACAAGAATTATGATAGTTCATTGAAAAAGGTTGAGAATTCAGAACTTGTCAGTTTCCACC 

AATAATGGGAAAGATACAATATGACAAAGTTCAGTTGCTTAAATGAATCTAGGAATGAAG 

AATCTAGAAATTATAATGGAGAGGTGATTAGGAGTTTAAAATGGTTTAT 

Sequence 869

CCCT TAGCGTGGTCGCGGCCGAGGTACATTAAATTAAGCATACT AAAGAAAAAAGGAATG 

TTTTCTTAGCAATTTAAGAACTTGCTTAAAAAGAAAAAAAGATCAACCACT 

G^ CAAAA A TTAC CCACAAGATGAAATTCAGTTAAAATTX^CAAACACTGTGGAGATGGAAA 

GCCTTGTATTTTAGATGAAAGGATTTATGGCTGGAATTAAAAGAAATTAAAAGGGAGAAA 

AGTGGGTGAATGGAAAAGATTTAC 1 1 1 1 I G I 1 1 1 IA AGTGTTAATA6CCACTTTTTGTCC 

AGTCTGNATCTC^TTTCATTAGTCTTTATATATATATATACNCACACACCCCNCACGTAT 

GTTATATATACATATAATGGTrTATGTATTATATATGNGGATATATACACCTTATATGGT 

TATATATATGGG I 1 1 I II 1 CMNGAGCMTT, ATATCATGGTGAAATGAGTTCAAATGGACCC 

TGGCCCGGGCNGGCCGhTTCGAAAAGGGCNAAATrCCA£C>jCACTGtKX;GGGCGNTT ACTA 

GTNGGATCXXXAGCCTCGGGNNCCAAWNCTGGGCGTAAhK^TNGGGNAATAGGTGTTTNC 

CTGGGNGGAAAATTGGTNTNCGGTTAAAAATTCNCCCCAACATTCCCAN 

AACC TNC NCCNN NTTTT AAAAATTTTGGCCN NNTTTTTGCC CCGCC C TTTTNANAAAATT 
TTGGGGCCCCCCCNCCNI 1 1 1 1 1 
Sequence 870

CCCTTGGCCCGCCCGGGCAGGTACTAATATTCTTCAACAGAATGCAATAAAATACGAGCT 
ACAT AAATGCAAAC TTGGTTCAAAGGT AGC TATG I KIM I AAAAAAGGTTATTATAACA 
GACAAAGGAAATGCAAAC TTATCC TTCGAAACCCTGATAATTGG TAAT ACCAAATAAC TG 
G T ATC T AAT AAATA T ACAAATCAAGAGAA T ACC TT GC TAGCT AAATTAAAAAAAAAAAAA 
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871 

CCCTTAGCGTGGTCG GGGOC t3ACGTACAAXjGGCTTCTTTGGTGATAGTTTCT 
TAAATACTGTTCTGTTATTTTTGAAATCTGATCAAGAATrGACACAA 
TATTTATACTTATGCCTACTTTTAACCTTTT AGGAAAACTTT ATGAATTGGAATATTCT A 
AAATCCTGAAAT AATTTGGAAT ATTC T AAAATTCTGAAGAGAATATGAACGGATTG TTGG 
AATGGAACTTTTACCCGATTCCCTCAGACTAGAG TGTTCATACGACATTTTGCCAAGAAG 
TTCCTATAGAGGCAATATCACTTTTAGGATGGATGGGTCT AAAAGGATCATATTTAAGTT 
TCTGGTTATTCATGGMTGCACTCACTTTAGAG 

TATT rpTCTCroCI AAATAACCAGTATGGAATTATAGAAAGAAAGGTGGGGAGAATAGTC 
CGTGTGA TCTWCTGGGCAOCATTAAAGCCTGTTCCATCCAGCCCCT^ ITTTOOTCT 
gCTT] GUCTT TCAAQQCCCA QAAGA CATTTMCATTCC^ 
. CCCCTCTCTTGCCTXCATAI tfO l I IG CAAGMGGGGGCCAGAAl 1 1 1 1 1 IGGATTCCCNT 
TAAAAATGGACCTTGGtKrrNTTTTANCCATAAWXrTGTGA 

ccccTTyncccccccGGGGGG^ nrtm ignaaaaaaaanh 

CAGGTACA bl rCT S TOI 1 1 1 » CAATTGAT ACATACTAC 

ttatgtaagaaaaatgagtaaaaatagagggccacacaggcaacaggcattaggttatgc 

ACAGAGAAGGAAAAACTTCAjGAGGTTGTGC TGCCATC7TtTTGGAACAAACAA£iAATCTAC 

acgaacagaaacatgatggaagaacaagggttagtt actccaacgaaaaaacatggcagc 
aaaaaaaaccattttgaagccaagcttttgatttaaccatgaatgaaa^^ 

TA TGACCAAATACAGAAATTAT 
Sequence 873

CCCTTAGCGT GGTCGCG ^^rCGAGGTACTTGTTAAAATTTiAGATTCCTGGACCCACCCTAG 

ACCT ACTGGATCCAAATCTCTGCAGACATGGCCTT5GACATCTTCATTAT AACAA^ 

ACATAGATTATTTTGTCAGTGtSCCATGTCTTGC 1 1 FG C1 ICI G 1 G GAAACTACTCTCCAT 

CTTCTGGAGTGGAATGTCCCCCATTGCTrATCCACATGGT^^ 

GTCTCAGATGGC^CCTNCTGAACTGGGCCCGAGCT^ 

CCTCCCTNGGAGCNTCACTTXrTCCCATC^ 

Sequence 874

GAT ATCTGCAGAATTCGCCCTTTNCGTGGTCGCHTTTCGAGGT ACTGAGGATGACTAGAT 
GACAAATAATAAGAAAAAATGGCATTGACTTTGTATAGAACTT AATAATCAGATTTTTAA 
AGAGGTTACTCTATTCTCTT ATTTGAGAGATATGGAAACT ATCT AGGCCT AAAGACTGTA 
AATC TGCCTGGAATCAGAT AGTTGGCAGCAAAATCAGAAAT AGAAAGCAGTT ACTCAACA 
ACCAACAGTTTAATTTAAGAAACATTTGACAAGCATCTCCTT^ 
AGATGJOTGAATATAAATATG^ 

CCCI rAWGTGGTCGNhnrrTMGAGGTACTTT AAAAATAACAGAGTGTGATTTAAGAATAC 
TCAGACTAGAGCCTTCAGTGAGTTGTCTGAGGGAAAGGAGTGAAGTCAGGACTT AGATAG 
AAAGATTACAAAGAAAGTCAAAGTAAGCAGAGGAAAAAGATACX^AAATGACA^ 
AATAAGCAGT AAGGGAATAAAGAAAACAAAGTTG TGTGTGTGT GCATGTATTACATGAT A 
^TC^^GGAAAAAGAACTCGCAATTTACTAAAGGAATAATTCATGGTCATACCAATTTC 
TGTGTCCAAAACT AAC TT GATT AGT ATCAGAAGGAAAGTCAATG TTT AAACAGTCCTTCC 
CACATCTGCTACTTGCATAATGCCT ATGCAACTGTCATAAATTAAGAGT AGAGAAGGGCA 
CAGGGCC 

CCGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTT AGCGTGGT CGCGTTCGaGGTAC T 
TG*nAAAA TTCAGATTCCTGGACCCACCCTAGACCTACTGGATCCAAATCTCT GCAGACA 
TGGC CTGG ACATCTTCATTATAACAAGCTTCCACATAGATTATTTTGTCACTG^ 
CTTCCn I GCTTCTGTGGAAACTACTCTCCATCTTCTGGAGTGGAATGT CCCCCATTGCT 
ATCCACATGGTCCTCGCCTCCCTGATACTGTAGTCTCAGATGGCACCTC^ 
CGACCTCAATCACrTTCCCAGACCCTGC^^ 

gggcaaaggagccaagtttgggcaacaaatccctatgcatttagaagt AGATGGGGCTGC 

ATTACAACACACAAGCACTCAAGGACTCTCTGTAATATCT GGACTCATAGGAAGGTGATC 



ttcttggtttgttctgaagncatgg nccatctnctattctagaattanagagttcctgga 

AAATTCTT ACCANAAAAAA t nCCTTl I GGNTTNGACGC TT AATTGAGGffr AATTTCTAT 

TNTGGGCAATNTCAAAGNNATTCAAffGAAAAAAAAAAGGG 

Sequence 877

CGCTT AGCGTGGTCGCGGCCGAGGTAC IIIHII I M l A Ai mi l lHT ll lH I AATA 
GAGATGG<3GTCTTACTATGTTTCCCAGGCTGGCTCGAA^ 

TCACC TTAACCTCCTGAGTAGCTGGGACT ACAGGTGCAWACCACTGTGCCCTTACTTCT A 
TTtTTrACTTGACAAAGGAGAGGAAAAAAAAAGGAAGTTTAGAGAAATTAAGT AGT AAC TT 
GTCCAACTTTACCCACAACCACTAAGTGGTAAAGCTGGGGTTTGAA 
TTAAATCTCAGTAACTGAAAATCACT ATGGAGCACCTTT AGGT 
Sequence 878

GCCTTTGACCGGCCCGCCCGGGCAGGTACATXaTTTGTAAAAT^ 
TCAAACCAACATTTCCATTTrATCTATCTT AAAT ATA I U I TC CTCT TC TTI ACGCCTAAT 
TTCTT AAACTCCC AOAtillllli ICIGTA '.GATCTAGTCATCTGT AGCACTTCT CACAAA 
TTAAGCTCT Cn ATGC OCCAAACAGTAACGAAAGAG G T C TCTT A GTTGGACAATAAGCAG 
T GAAAaATATTTCTTATGGGACAAGAAATT AACATTATTAG TCAAAT GTTGATGCCGG T A 
GGCTGAGAAATXaATTCTXiACTTAAAAGCCCCTGGGTrTT AAAGCTCTCTT AGAAAAACAT 
TAGT 

Sequence 879

CCCTTAGCGTGGTCGCGCCCCGAGGTACAAGGAGCT AGATCATCAAGGAAGGTCAGGGCA 
GGGTTCACAGGATGAGGGCACTTTGCCAI IUU I tgTG AI I rGGTCAACAAATGACACAG 

gttatttacaatcttgaccttttggaaaagatacagcagctaatagccta 
gaggtagaaaacaagtgccacagtaga 



cccttagcgtggtcgcggccgaggtacatacaatagagtattattcagccttaaaa 
tgaaaaaatcctgacatgctaaaatataaatgaatgttgagaacattatgctaagtgaaa 
tgagcccatctaaaaacccaaatactgtatgatttcacttaactg^ 
acaa attca ta aaaa cagaaagtanaatagaggtttqcagggactgg<^gttacttgata 

TAGAGTT1 C>ATTTTGTAAGATAAAAAAGTTCTGGATATTGGTTGC><>GGAATATGAAT 
ATACTT AACACT ACTTiAACTGCACACTTAAAC^TGGTT AAGATGGTAAATTITGTTAGGT 
G TTTCTT ACCACAATTT AAAAAAGAAATTTT AATTAAAGGAATT AAAAAATTTACAAAAT 
ACTATTCA TCAT TGJ<GTTTCCAGTTTATATTCAACCACAGCAGTATTTCAGGTATAGT aa 
TTAACTTACTTTCA 
Sequence 881

CCC ; 1 1 UGAGCGGCCGCCCGGGCAGGTACCACTGCACTCCCACCTGGGTGACA 
ACCCTGCCTAAAGAAAAAATTTAAAAAAATAAAAAATTTAAGAATTTTCTATGCCCTTTA 
CGAGGCCAGCTTAATCAGACTTCT CT AGGCCT AGGACAGGCTTAAGATCAGTT AA TTT AA 
AACACTTCT GATGTTTCTTGAGCATTGAAAAGTTTT ATTCTTTCTGCTTG T G T T T T C AAT 
CTTTTTGTGTTT GTCTTTTACT AAGGCT AGAAACACGT ATTTGGTTTGGTTATCTGAAG T 
TT AATTGGATTCA 7 fGTGT 1 1 ATAGTATTTATCCCTGTAGTGTTGGAATTACCAGTCACT 
TACATTCATAlTTNAGTrrTTTGCCT 
Sequence 882

C*-C 1 1 icGAGCGGCCGCCCGGGCAGGTACI 1 1 1 1 C IT GAATATTTCCAGGGCACAAGAT A 

ttcttatacaga aacctc agaatggaaaatagctaagacataagcagtgtttcacagaac 

CATCCATCAGTCTTTTTTAGGATGTAGCAGTCTTCCATGTATCACTTAACC^ 

tcttacoxatctttttgggcagggggtggtagaatttaaaatttaccat^ 
g^tgatagtaagcatagaattttgggatgt ci 1 1 1 1 1 1 1 ccttgccctaaaccttcaga 
GTTCTGCCAGGTGATT caaatgttt aagatcccat aatctcgcct gt gtgct caagcgaa 
cactaacactttaaaaagtgggaatgaaaaaatctgaactggttgaattagacacagtat 
ttggccccatctttcaatttcag 

Sequence 883

ccc ttagcggccgcccgggcagg tact caaaaattt aaat agccatc t aaaaaca tc t CA 

GGTAAAAAATC I G I CCCCTGCATTTGAAACCAAAATT A1 Mil I L I LACTAAAACACATT 

TTATTTAATAGTGAGGTGAAArTACATTAGCCCTCTTCACATTTATTTG^ 

TTTT AAAAAACTT AGATTCTTTT AAAAAAAT AAA TT AAAGAAAAATGACAT CA TT CAT CA 






PCT/US00/33312



GATAGCCAGCTACATCTGTAGTTTGATCATTCAGTTTAACCGTTTTATCACTGTTGATAT 

GAACATTGAGTACCTCGGCCCGCGACCACGCTAAGGG 

Sequence 884

CCCTTAGCGTGGTCGCGGCCGAGGT actttgatacatgtaaagtgcaaggcaccttgct a 
GAGAGCATANGAGCTATACTAAGATATAGAGTCCTGCACAAATCCACAAAATAACATGAA 

tacaaagtgtcctaaaagtcatgccaaataaaacagancatataactgggcagagggatg 
gagagtcacatgctgcaggagctgagcgttgacatggtgtt atgggatatgaacttgaga 

T^l t tiAAGT AGAAC TGAGAGATTTCTGGAAAACTANATGTATNAACAGAAGCAflGAGGAA 

taggagatggtttggaaaacatcaagcagctca ui i i cmu gggtggtccaggagaaaga 
agctcaaacaacatfcagtgataacacttaaaannatcaaaaattt 



CCCTTAGCtyTGGTCGCGGCOGAGGTACAATAAACAAGACAGTGCCTGCTTGTGACCAGGG 

GCTGGGC CTGTTT^ TAGCTCTTTTCCCTGCCTTTT GTCTTCAGAGTTGATCTGCTTCTTA 

9^ C ^J?"9 AC " " 1 CAGAGTTTGCTATCTTAGAAGCAAGGATCATTTTTAATTGGTTTOT 

TTACTTCAAA GTCCC ACTCATCAGAGGCAGNTGTTTCGCTT ATATTTGGCTCAACT actt 

TNTCTGCTTGGTTTAGT AACACTAATGTTT ACTAACATTAAAATGAAACCAGTTTTGCAG 

CTAGCATCTATTGACCAAATATAATTATTTATTrcAAACTGTATATTCCAAAATTTAAAC 

ATATTCA ATGCTTA TTGAACATCTAAACATATANCCTTAATGAATAANGGGAA^TATAA 

CCATC TGGTTTTTGGATCTGAAAGCCACAACCCACCTGCTAGAHTANTTTGGGGAAAGGC 

TTTTTAWTTCCAAGTTCAAAGGNTGAATTCTCCCGAGGGWNGNNGGGGGNCTTCCCTTCT 

MAACCAGCAANAAAACCTNGCMCCA^ 

NGCATTTrACTTTOCTTTTTT 



CCCTTAGCGTGGTGGCGGCCGAGGTACATATGGCTCGGGAAAGGGGGACTGGATTAATAA 
ATTCTGGTAATATAGTAAGGACAAAATAAATGTAAAAAAGATAGAAGTAAATGGAGAACA 
TCAACATGAACGCGTGCT CCm GAGTAGAAAGTAATTTTTCTGCTTTX5TCACTCAAATA 
GCTGGCAGACCTGACATCACCCTGCCTCTGCTTCCAT GCTCT AAAACTTT CCT GGGCCT C 
AGATTTGGATGCTAATATGATTTTCCACTTAGTGGATAAGAGCTCCCTGGAGAAGGGCTC 
ATTCTTGGATGGACAACAGAATTAGAGCCTGAGTTCTAAGAGCTTAATAAAACAAAAG 
Sequence 887 

CCCTTCGAGCGGCCGCCCGGGCAGGTACCCGATGAAAGTTTAAATCT AATCAACAGTATT 
ATGCACTGGTTGAAGAAAACCAGGATTAAGACGGAGGATAGTCAGCATGGAATCTAANAA 
GGGA AAAGT CCGKTAACTATATGTGTTCATNAGA7TCTAAAGCTGTT AAGGGAGAAAGAC 
CCTGAGTCT AATGAATATAAACTTTAAATTTAAAGAAAAACATGNTCTGTTATAGAAAAG 
TGGGCTTTTAANTTTTGTAAAG 
Sequence 888 

CCCTTAGCGTGGTCGCGGCCCGAGGTACCATTAACCGTCTTTTAAAAAATTATTATTAGT 



CCCTTAGGGTGGTCGCGGCCGA GGTAC TAAACAGGCCAGATATATTCTCTCATTAACTTA 

TTGCCTAGCAGAGAAGACCAACATTTTTAAAAGTTTATACATATAGTTAATTTCTATTAT 

GATTATATGATACAAATGGAAAGTGCTATGAAAATGTGGAACAAAAGAGAATAATCTGTC 

TGAACAGTCAAAGAAGACTTCTGGGAGATGACATCTGAGCTAAAGGTTGAACAAGGAATT 

GGAAAAC^ GCTG GCATGTGCAAAAGACTTGAAWACTGAAGGAGTTAGCCTTTAAAAAAAT 

GAAGAAAGTTCTATTTGGCCAGAGCAGAGTrrCAAATAGTGCCTCACAGGCCACGTTAAA 

GACCTGAGGCCTTTATTCTAGGAGAATAGGGAGCTGCTCAAGGAATTTAACTTGANAAGT 

GACAAAGATCAGATrrGCAATTGCCTrrCAAGGTGGTAGGTTACAAGGGAGTTGGGTCTC 

TTGACCCTTTGCAAATTATACCCCAT1TCTTAACCTAAGAAATGGG 

Sequence 890 

CCGTTTCGAGCGGCCGCCCGGGCAGGTACTTGCCTTGCAAAATTATATTACAAGAAGAAG 
CACACTTGTTATAGAAGTGCTGAATTGTATGGAACCTAAATCTGTCAAGTTACCTGTCTT 

TTTAAGGCTAGGCAAGTATTGCGCTGATGAGCCAGGGACTGCCCACCAATTGGCAGGCCC 
ATTGGGTGATAAATGTCCAAGGACCTCTAGGCTGACGACACATTTTTCATCATTAATCCA 
GTCTATTGTAACCAGGGCCACTCACATTGATTCGGACTAGGGGGCATCATCTGCTGTTAA 
AGAGGG TGATGAC TC GC T AAAAATGAG GG 






Sequence 891 

ccctttcgagcggccgcccgggcaggtaccacttcatggctaagcatgtg 
ccggtcttcctgggcttacatctttgctttgcctcttctttcctgtgatgagtcttgggg 
taggcct caaaggc tgaatc ttcaatataaat acaacagtgaatgaacaacaaatggtta 
ttttaaag^tctatcttggatggctatttaamcactaaacccow»7tgctcacctgt 

TGACTG^AACAAACAATAGTCC^TTCTTCATGCGGGCATGGTGAGGGTTTTAACCCCGCA 

TTGTCCACAAAGACCGCTTAAATTATAGTAGATGCTCAGCAAATCTGAGC7ATTATTTTT 

ATCACGACTGTCAGAGGTCAGATCAGGCTTCGGGGTCAGACACACCTGGGTTCAAATCCC 

AGCAGGGCCACTTACTGTTGGAGCCGGGGCAAAGTCAGTTATTCTCCCTGAGGGTCAGTT 

TTCTCATCCCTAAAAATTCC 



CCCTTCGAGCGGCCGCCCGGGCAGGTACTACAGAACAGGAACAATCTGCCATGTGTGTTT 
ACAACTTCAGAAAGCCCTGGAATGACAGTTGCCAG^GCAGTTCTTTGAATTTGCAGGTCA 
GAATTA GTGGATGATGAAI HIM I CACACATGGTCAACTCTGTGCCAGCTGCTACAAGA 
TGTTGGAACAGGTATATTTATTTATTTAATGATGATCAATGATTCTTCCAACATCAGGGA 
ACATCAGGGAAATCAGCT AGTATATGCTCTTTTTGAGGATTTTCAGCTCCAAATCCTGAA 
AGCATTCATGAAACTACATAAATTACTTTTGTrAAGCAAATCATCATAAGTAAATCCAGT 
CATATGAAYCTGGAAGGATTTGCTGGTGGGCACT AACACTGACCACATGTTTCAAGTGTG 
GGCAAGTTrACCATCCATCACGGATTTTGTGCTTGGTGAATTGT AGGGAGTGAAAGAGAG 
AAGGATGTTTGGCCCAGTTGTCT T T TT T ACCT ATATCTGAAATTCTT ACTTAGTCAAAGA 
ACAAAACATTTAGACATTTCATTrCCTTTTGGGGGTtTTAAG 

TGTATATT rrAGA AGAAAATTGTTTTTATTATATATAATTTATTAAAATTCNGGNGGAGA 
AGACCAAATTTrATCCTG^GNAAAANATTTAAATTTGAAGNTTAGGTTGGCTTTm 
ACCCNCCGGCCNAACCCCAAC 
Sequence 893

CCCTTTCGAGCGGCCGCC C GGGCAGGTACTAGCATTAAAAAAGT CCTACAAATTATTAGA 
GAGAAAATACAGGTTGC^CGCAAAGCATAAAGAATGAGAATGGCATAAGACATCTTAACA 
GTGCCACAGAAACTAAAAAGTAGTTCTGAGTAAAAATGAACT ATTTACCCAGCCAAACCG 
TTAATTAGGTATAAAGGTAGAGTTAAGACATTTATAGACATACAAGATATTAAGATTACT 
GAGTCAATTGATATTCAACAGGGGTGCAAATGGAGAAAAAGTCT7TTCAACAAATAGTGG 
TGGG ACAAATGGATAGCCACATGCAAAAGAACATATATATAAGAGCTAAAACCATAATGC 
TTTT AGAAGAAAAT AT AGGG TTTAT CTTCAT GACCTTGAATTTGACAAAGGATTCTTGGA 
CATGAC AC CAAAAGCACATG CAACAAAAGAAAAATTGGAGTGAT ATG 
Sequence 894

CCCTTAGCGTGGTCGCGGCCGAjGGTACAGGTCACACAGCACATCAGTGGCTACATGTGAG 

CTCAGAC C TGGGTCTGCT GC T GTCT GTCTTCCCAATATCCATGACCTT GACTGATGCAGG 

TGTCCAGGGATACGTCCATCCCCGTCCTGCTGGAGCCCAGAGCACGGAAGCCTGGCCCTC 

CGAGGAGA GAGA AGGGAGTGTCGGAC^CC^TGACGAGAGCTTGGCAGAATAAATAACTTC 

TTTAAACAATTTTACGGCATGAAGAAATCTGGACCAGTTTATTAAATGGGATTTCTGCCA 

CAAACCTTGGAAGAATCACATCATC 



CCCTTAGCGTGGTCGCGGCCGAGGT ACAGGTCACACAGCACATCAGTGGCTACATGTGAG 

CTCAGACCTGGGTCTGCTGCTGTCTGTCTTCCCAATATCCATGACCTTGACTGATGCAGG 

TGTCCAGGGATACGTCCATCCCCGTCCTGCTGGAGCCCAGAGCACGGAAGCCT GGCCCTC 

CGAGGAGACAGAAG GGAGT G TCGGACACCATGACGAGAGCTTGGCAGAATAAATAACTT C 

TTTAAACAATTTTACGGCATGAAGAAATCTGGACC^GTTTATTAAATGGGATTTCTGCCA 

CAAACCTTGGAAGAATCACATCATC 

Sequence 896 

CCCTTAGCGTGGTCGCGGCGAGGTACCTTGAGCTGCCTCAGCACTCTTTTGCCATTCGTG 

C T AG AAACAGC CAAAGCC AGA CAACCAAATT ACAGATGCTT AAATGTTAATGCCAGACAC 

CAAGGCTCCGTGAACTTCCCTGTTGAAGATCT GACCCCGACTACTTGAGGACATGAAACC 

TAACTGTGCAGCTAATTACACCTTCCAAGGGCAATGACATCGGGTCCTATGATTTTATTC 

AGGAAAGCAATAAGGCAATCGGGGTCACTGTGAACATCATITGAAGGGAAGTAACTrCTT 

AGCTTTATTCCACAAATGGTCTAT 

Sequence 897 

CCCTTAGCGTGGTCGCGGCCGAGGT ACCGGTGTAGTGTATAGAATGGTTTGTATCAAACT 






Table I 



AGATCTACATTACTTTACTAGAAATATAGGGCAATAATAAAATTTCCAAAOCCAAACTGA 

*cgataat atatatttctttagaaagtc tcagaaaacc CA TTCCTGAATGACAAAACGGA 

GAGATAACTTACAACT AGGTGATATCTGAAGTTAAATTTTCTTGGTTATCTATTTCAAAA 
ATTCACAACTATTCTGCACTAAAATGTTTC^TGGGTCA^ 

aatcccaacacgttgggaacctgaggcaagagg 
Sequence 898

C^CTTOGAGCGGCCGGCCCGGGCAGGGTACCNCGG 

ACCTTATGAACCATTAACTTGGGAACCCCGGCAAA^ 

AGT^A>*GA^rMACCNTTTAAACATTTGGT>n^ 

CCQTT^CCTTTICCAAAGGCCGGAAAAAMGChK>tAAC NI i II M 1 1 1 1 1 1 1 H 1 1C 
Sequence 899 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGACAGATGCCTG 

TCAATTTACI 1 1 ^GCTGtSAGAGATAGAAGGCTCTCCTGCAGCCTTTTCGTCTTCGGCTG 
TCCGCTGGTAAGAAATCCGCCACACAAGAAAGGACTGACATTTG 
AGAGTTGAAAGTGAAATAAAGGATAATAATCTTTGTCTT A I U I C I I I GTTTTAATGTTT 
CCCAACTTACGTTAGGACAATGTCAACAAAGACAGATGTGCCTAATA^ 
ATGTGTTTTCTCATTCCTATC 



CCCTTTGAGCGGCCGCCCGGGCAGGTACATT^ 

TGTATAGGCAGACCAGACTGGTAGGGAAGAAAAGCAGAGATATCAAGTGGGGGACATGTG 
TTTGCCCTGGGGCTCT ATTGGCCTGGAATTTTGTGGTAGGAGGAAGGCACAAAAAGTAGA 
CTGGGATTACAGGCGTGTGCCACCGCGCCCGGCCT AAAGTGTGTTTTATAATAAAACCTC 
AATCTGAAACATrrrAATAAAACCTTTAGATGACTAGATTTATGTTT ATTTTGGATTTAT 
GTTTATATGAATAAAAAAAGAAAAAAGACGAG 
Sequence 901 

^ ^ttAG GGTGGTCGGGGGGGAGGTAGGT ATGAGATGCATTTGAAAACTTACCTTCTTTA 
T *V?l[ '^"CTGTTGCAAI HUT \ CCATTACCTGGGAATAGCTGCTTTGGACGGCAAAC 
CAAGCAATGCCCTTTCACAGCTGTGGGATGAATGGGGAAAGAAGTCTTGGTAAGGAAGCA 

CAGGAACTTGCAACCCTGATAACTTGTGCC TGCCTGTCTGT AGGCCTTTAATGATGTTTT 
ATTGAATTTTGG 



CCCTT AGCGTGGTCGCGGCCGMK3TACTTCTATACAAGGCAAAATGAACTCTAAGTAAAA 
AAGAAAATCACACTTCTAAACACAAATTAACCATTTCAGTATTTAATTGCTCCT AAAAGG 
TGTATTCTACTTCATTAAATGTAAGAGAAAAGGTTACCTACATTACGCAGTTTAAGAAAC 
AGGATAAACTTT AGCATATAAACCAGTCTTGATTACAATTTCACACTTTCAACCATCTTA 
TTTATACCTCTACATTAGATAATCTTTAAATTTCCATCAT^GGTrTTCCCATGGTTAAC 
CTNCCATATAAAATTTTGGTAATCCTGCCC 



CCCTTAGCGTGGTCGCGGCCGAGGTACTGGGTGAGAGGAGAGAGCTCATGTGACCCGAGT 

C TGGGTGG TCTCAGGCATGG TAT AAAGAAC T AGGCCAACCAAC TGCACTAGACATAGAAA 

CTAGCTGAATAAACTCATCCACTCCGATTTCATTTCAGGTATCTCATGAGAAACTAGAGG 

ACAAAAACAATTCCAAAATTAACAAAAGAAAGTTTACTCTAGCCATCAGTGOCAATGAAC 

*^*^^GACTGCCTGAGAGTTATATTAACAAAATAATTAATTCAGACGAATTAAGGAATT 

AAACCAGCTATGGGAAATATACACTCTATACTTAGATGCACATT 

Sequence 904 

c TTTCGAGCGGCCGC C CGGGCAGGNACTT AAATAAAAT AAAATTAAAACAAATCATTT 
TAGAGATAAAGAGT GAAGTTACTAGAAAAAGGTGACTAGGACTCTGTTTATGAAGAAAGG 
TTAGTArrTAAATCATGAAAAAAAGTAAGAATACTTAATTATTCAAGTAACTTAAAATTG 
TAATTCAGAATGGCTTTTATGTATCTAAAACAATCTGGGCTGCTATAAAAATTCAGTCA^ 

cttctaaacttccaaacacaaaatagttatactcagtctaagaatatccgacctaccgtg 
caggaccagagggc TCATCTC 
Sequence 905 

CCCTTTC GAGCGGCCG CCCGGGCAGG T ACTT AAATAAAAT AAAATTAAAACAAATCATTT 
^^ TAAAGAGTGAAGTTACT G^^W-^TGACTAGGGACTCTGTTTATGAAGAAA 
GG TT AG TATTT AAAT CATGAAAAAAAGT AAGAAT ACTT AATTA TT CAAG T AAC TT AAAA T 






tgt aattcagaa tggc ttttt atg tatc t aaaacaatctggggctgctataaaaattcag 
tcaacttctaaacttccaaacacaaaatagttatactcagtctaagaatatccgacctac 
cgtgcaggaccagagggctcatctcttgccgagcttaatacagttt 

Sequence 906 

ctgatatcagagactttcttggaagtaggtttcataggatt^g^ 

t gggcgaagaaagaagggtcagttgggtggtgcattgaaataagtggttccaaaag 

ctaggtcaactttttaactggctagtgaaaatgagattcctcaggatacaaaagcaagga 

gaagacaggaataaatcaggactccaacaggcagaacaggatttatttagggcatgcaat 

gtggagggccctaatgggaacatgacagtgt 

Sequence 907



CTAAAGCCTCCAAACC TTGCTTG TTTTGTTTAAGTATCCCT GGGGCTCATCACAGGGCCT 
GTTGAAGrrCTTTTGAAATGAATTGAAGAATGTGAATAATAGTTCTAGTTCT^ 
ATGGAAAGCTAATAAGGTTTATGCTAGAGGCTCTTACTGCTGGGACTCTCTTCTTGTTTT 
TGGTTTTTAGGAAAAAAGCTAGAAAATCCAACTTCAG^^ 
TTGAAAGTATG T CAAAACAAAAACTGTTAA 



CCCTTAGCGTGGTCGCGGCCGAGGT AGCTATGAGATGCATTTGAAAACTTACCTTGTTTA 
TATGT7TCTTCTGTTGCAATTTCTTCCATTACCTGGAATAGCTGCTTTGGACG 
AAGCAATGCCCTTTCACAGCTG TGGGATGAATGGGGAAAGAAGTCTTGGTAAGGAAGCAA 
TTCAGAGAACATGGAAGCATCTCATGGCAGCAGTCACAA I I I IG IU UG CGTAATATTTC 
AGGAACTTGCAACCCT GAT AACTTGTGCCTGCC TGTCTGTAGGC CTTT AATGATGTTTTA 
TTGAATTTTGGT 
Sequence 909 

CCCTTCGAGCGGCCGCCC GGGCAGG T ACCC TC TTCTCAATTTTGCT ATGAACTTAAAACT 

GCTCTT AAAAAAATA I 1 1 II 1 1 1 AAAAAAGGAGGGNGTTATTATCAGAGATCCCATAGAC 

CTTAAAGGATAATGAAAGAATGCTATGGGATAACCTTCATGCTAAAAACTTCAACAACTT 

AGAAGTATGAAATGAATGAACTNTCTCCAAAAAAAATACAAGTTACCAAAATTGACATGA 

^^TAACAGAAAATFH>IGANTAACGCTCTAACTATTAAAGGAACGTGAAGTTTGTCAAA 

AGCTTCGCCAAAATAAAATTCCAGGACCAGATGG 

Sequence 910 

CCCTTTCGAGCGGGCGCCCGGGCAGGTACTCAATGGGGTAGGGTGTCTTGGGATCTGACT 

GTTTCTT AGACCTT CAATGCTTCTTGGCTTTCCTCACTGCTAGTTATAATTCAGTTTTCT 

CAGGTCTAAGTCATTCATCACTCTTTTGTCTGCTTTTCAGCTTCCAAAAATTCATTGCTA 

TTATCTCCTCTCCTGTTTTCCCTATTGGTGTGTTTGTMTCTTrrTCTrrAAAAAM 

TTTGTGG 

Sequence 911 

CCCTT AGCGTGGTCGCGGCCGAGGTACAACCTAGCCAGCTGCACAGCAGCTCTCCAAGAA 
AAAGGTGT ATATTAGACAGATT CAATTATTCATC TTGTGATTATGAGTAGTAACCAAATT 
GTCTATGT AATTTTCTTATGGTGAACTACCCAAAGCAAGGCCTCACCTTAGGCTACCAGC 

ttgactcttaagtggacagaaagagccaaaggctaaaaggtttgtgagaaacct CATGAG 

CACTGAGTGTTCTAGTTWiAGATGAAAACCGGTTTCAGGTATGAAGCAAGAGGCAGTGCT 

AATTGGTAGAAGTAATTACATCTT 

Sequence 912 

CCCTT AGCGGCCGCCCGGGCAGGTACAACAGAGCACAATGCTTAGATTTGG 

aataagatgaaagataaattatgattttgttcaagtgttaaaataaaactaagacactta 

AGGACCACAAAAATTTAGACCAAAGTATCTTGTAAATTCTACCTGGTGAAAGTTTGATAT 
AGCACACATATGACTTTTCTATATTATTTTCTGTTTTGAGTTTAGTAGTAAGCAGATGGT 
TTGT ATTTTCTTT AGTTGCAACTAAGTGATCAGTTTCATGATTTCTCTT ACTATGAAACA 

rrrrmTTTTi icttaacagttatctt 

Sequence 913 

^ C ^^T c G£ GGCC GCC TGGGCAGG T ACCACAAAG TT ATTGCCT ACATCCAGG TC AAGA 
AGATCTrCTACTGTATTTTCTTCTAAGAGCTTTTACATATAGGTCAATGATCAATCTAAA 

attaagagttgtgcaatc^ttaactctagctttagactggtatacTaattggtttgtata 
cgaactgggttaaaggcataggacacatgcaggctgtgttx^tttcacvkgcagggctctg 



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TAATTAGGCAATAATTACTTACCATCATACCTAGTGAGGCAATATGGGAGAAACAAAACA 
GGCCAT ACAGCTTCACTATT ATTCCT AC T 
Sequence 914

NNCACCCCTAGCCTGGWCGCGGCCGAGGTACTTGAGGACGAAGCCACAGAGCAAGCGCTA 

AAAAAAAAGTTAACTAGAACCTTACCACTNTTNCACGCACCCCAATTHCATAAAATGTAT 

CAGNAAAAAAAAACAATNATCTAAAC^VNAAAAAAGNAAAGAAAAANNATNNANCACATAC 

GNAACNGGGTGTCAACTAGGNAACNGACCT ATANNAANMNAGGAAGANAGNGNCTNCCTT 

CCTCAATNNNCAGANNNACGGACX5GGAGGCTCAAAAGGCCCGAGAGGCTCNCTACAAGGA 

GAAAG 

Sequence 915 

CCCTTAGCGTGGTCX3CGGCCGAGGTACCAGAAATGGTAAATATATGAGTAAATATAACAC 
ACTTTTTTCTn I AAAATTTTATTTAAAAGGTAACACTTTGCAGCAAAATAATTAACAAT 
GTATTGTGGG7TATATAGT ACT AAGATGTTTGACATAAATTACATAAATAATTGGAGGAG 
GGAAATAGAAL, n S TGTT G TI UAAATTSGTTTGATATTATATATGAAGTGGTATATTATTAT 
TTCAAGGTAGCCTTGATAAGTTAAAGGTTACATA7TGNAAACCCTACAATAATCATTACA 
AAATAAAGAGATATAACAGNAAG 
Sequence 916

OOCTTACCGTGGTCGOGGCCGAGGTACTTCATAGAjGGTOCAG^CCCC rTGOGTCTGGCAT 

tcctttggtctataattca<jTaaactctgctaaaaaggaaacqagactagcttgctgtg^ 

COCCI I AAGCGACCCAGGGTAGCTTGTGATGGTTCAGATTATGA 1 1 IGI rCTAGA GCT I I 
ATGTTGAGGAGTTTATCCTATTTGNCCCCTNCCCTT^ 



GTACTGCCTGGCATGCATCTTCT CGATGGTCTGTT 
ATCTTG TGGGAATGACATTCGTTAAG I IC T TI rCTGTGTGCATCXXACGCAAATAAACAA 
TGTTTCATCAGCAA^TGAATTGCCGTATACTCATCAGACTCT 

GATOACTGCAGTGGGTGACG C I G I fTGTI I ATGACATCACTTGAGAACAGAGT AAAGTGA 
GTTTCATATTTTCCTGAGTCTTGAATTCTCATTTTAGACATCTC 
GCCATOGA6TATTCTAAATGAGC 
Sequence 918 

CCCTTAGC GTGGTCGC Gra 

tcatggcactatgaaatgtctccx:tgattccctccgacctxxx^ 
tggccxttgggaagaaat accagatgg t atgcat atggcttt gggaagagc tttgagcag t 
ggtcacttotctttttttaatgcatttcaaaatgtgttttc 
taattccctcaaataaatg 

Sequence 919

CCCTTAGCGTGGTOGCGGCCGAGGTAGAACAATTTATGC A I TCC1 1 1 AGCAAT AGTTGGA 
CACTTAfiAATGTAAAACTGTTCAAACAAATTGGTATATTGGAGTTTGGGTAGAAAGAAGG 
GCCGTTGGAAGAGGAGGAAAAGAGGGTGAGATGATACATTAATATAAATTACTGAAAGGT 
GGTGTTCACATTTAGAAI H I HIII 1AAGTTGCATGTTTAGGATTTTAGTGCTCAGGAG 
GAAAGAAGGCCAGTGTTGCCCTTCCAGACCATCGCTGCCATTTCCCTGTAATATATCGTG 
TGTAGAGGAACCTAATGCCTGCA 



CCCTTAGCGTGGTCGCGGCCGAGGTACTCGCTA 1 I 1 C I AGTTCAAAATCACAGATTTTCA 

GATTGAAAAAATTTCAATCGACTTATTTTTCAAATGAGAT^CTGGGACAAAGAGAAATT 

CCATGACTTGCCCAAGATTACCTACAGTTTAACTGTCAGCGGGGCTTAAAACCACAATCC 

ACATCTCCTGACTCCCAATCCTTTCACTTAAAACAA^AAGCAAACAAACAAAAAAGATT 

TCTAATAAAGTGGAATAATTNTAAGAAAGGCAAGTATCACTATTTTAC 

Sequence 921

CCC TT AGCG TGGTCGCGGCCGAGGT AC TCACATGT AAAC TTCT ACTTTCCCCTTCAGATT 
ACAGCAACCATCATGCCAAAGCTATACACTCTCAGGGAATCCCTGTGGATTTCACTGATG 

ACCACTTGACCAACTA 

GTGCTCATC 



1TGACTGGAACATAGTTTGAAAGGGATGGAAATTTCCAAAAGGTGTTAATAGACACATAA 



AGAT T I T T AAAT ATT AAAAAAAAGAAAAAGAAAGA 
Sequence 922 

CCCTTAGCGTGGTCGCGGCCGAGGTACATACAGT ATGCACTOCCTTCTCTGTG1 ITTTTG 
TC T GAG TTGATGATTTGGAGC TCAAAGAGCT AGCGGAGGGAAAAGC TGAAGCCATTCAAA 
CACATAATGAGAATTGGAGATGT^AAGAAGGCTGAGTTCTAGGAGTTGC^AGAACTTAG 



GAATCAGCTACATGCTGTTTAI 

actgaaaactgatgaaagajggagggagacaggagg 



ATGTGACTGAGGAGAGTATGCC 



cagggacagctcaacatgggcctggacagtcaaattccatc^ 
gagaccatccagtgacttcttcctgaagtgctgcg^ 

tgctgagtgatgggacgacagngtggagct actgt*gctctccaagccgcngcccacgacc 

agcctgcctgagaacgaagccagc 

Sequence 924

tax 1 1 pcgagcgggcgcccgggcag^tacttgocttgcaaaattatattacaagaagaag 

cacacttgttatagaagtgctgaattgtatggaactrraaatctgtcaagttacc i g i c 1 t 

tcaggtccgtctccccacctcccagacctcattatattatcccg^ 

tttaaggctaggcaagtattgcgctgatgagccagggactgcccaccaattgg 

attgggtgataaatgtccaaggacctctaggctgacgacacatttttc 

gcctattgtaaccagggccactcacattgat 



C OCTT AGCGTGGTCGCGGCCCGAGGTACCTACTGTGTTGAGCCCTCTTCCATC A 
GTTTCG1 CAGATCCTAGGAAGTGTCCCTGACGGAGAAGTTTTACAAAATGAACTTCGAAC 
TGAAGT ATCCCGATTGAAACGGAGATC T AAAGATC TGAATTGCCTTT ATCCCAGAAAAAG 
ACTTGTGAAATCTGAAAGTTCAGAGTCTCTTCTTTCTCAGACAANTGGTAATAGT AATCA 
CT ATCATCATCATGTGACATCCANAAAGCCACAAACAGAGCGGTCCTTACCAGTGACTTG 
TCCATTGG f I CCAATTCCTAGC 
Sequence 926

CCCTTAGGGTGGTCGCGGCGGAGGTACCCAAACACAAGATTGCTAATAGACTGCTAATAA 
TAGAACTT AATAAATGAAATAAT1TATTTCATTTATTGTTGCTTGGAATACAGAAAGTGC 
TTAGTAAATATTGAATGAATCAACAAAGTACCTCCCAATATAGAGAAATCACTTCTGAAA 
AGGATAAAACCAAGTTGATCC TATTCAATCGAAGGCATCTTTTGGGGCTGTTACAGTT AT 
TTCCTrrATTTG^AAGAAGGAATATGATATACCTACTTTGTTCCAAGTCACTGCTTATAAT 
GTGCT AATGG T ACCT 
Sequence 927 

CGCTTAGCGTGGTCGCGGCCGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTC 
ATGCCTTGATCGCGAGAACTGGTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATG 
TCATCTCAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGAGAACAAAGATTTGGGGCAC 
TTTCAAAATTAATGAAAGGTTTACAAATGACCTTT^ 

CAAATATGGAAATGGAATTGAAATTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTT 
TGAGTCGGTTCAGGTCACCCAATTTCGAA 



CCXnTAGGGTGGTCGGGGC<X»AGGTACAAGAAAGAAAACAAATACCAAGTATTTACAGAT 
CCAGAGAAAGTTCACAAGAATGGGAGGATGGCAGTTCCAATGCTTTGTAAAGTCAAAAAT 
AGCCACATTGCAAAACAAACAAAAAAAAACGAGAACGTTCCCGAGTGTGCCTCCAAAACA 
TAAAGGAGAAAATCATACAGAAAAAC^CTGATGTAAGGGTTGGAACTTGAGCAACCAGCTA 
TCCAAATAT

GCTTTCAGAGCCTACAAGTAGTCCCCA 
Sequence 929 

CGCTTAGCGTGGTCGCGGCCGAGGTACTTAAGCAATAAATCTGAGCAATTATCAGGTTAT 
TTTATTGCATTrCTAATGAGTTCTTCTAAAAAAAGTCAATCAATTATCACTGCTATATAT 

gttctgt gtgta aggagtgcttgagagtctttaattgtaacatttattaaataagaataa 

GAGGACATTTTT AAAGGAATT AAAGGAACATT AATTCC TTCATAAATG TAT AG TGC TT AA 
GCTCTGCTTTAAAAGGTCTrrCCATGTGCTCTTGGGTAACCACTTAGCGCTGAATTCATA 
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GT AT AAA TATCAATAAATGTTGCAATCACAA 
Sequence 930

cccttaccgtggtcwcggccgaggtacgcgggtg 

tgaaatctgggcccaggaaggacctctcccatcc7t ggagcctcct cattctcctgtctc 
tcacnngtccodccacctctaccatgatgtc^ 

tggcttgcgcocttcwtctgg c1 1 1 1 c i cxxx^u>ctttgctccacttctaacai 1 1 1 i c 

ttccttcatctt4acatgaaagggacaangggt^^ 

a*<gnnccccacatcctxxjctntgc«:ttctgac 

Sequence 931 

caagagatcctnccac ctgagtct cxct aataggtagaactacagg tgcacaccaccacg 
cctggctaatttaaaaa 1 I I I I f 1 1 atagawacaaggtctcactatgttccccacwctgg 
taaagtatttttaaatttcagacatgaataatgatgcaaatca 
attctu mci gttaccttattcaaggactaa 

Sequence 932

CCCT TACCGTGGTCGOGGCCGAGGTAC T ITI rrTrTTTTTTTTTTTTTTT ITTTT TGNAT 
TTTTAGTAAACACGGGTTTTCGCCG I G 1 I A GTCAGGATCGTCTCC A T C T DC I gacctcct 
GATCAT COGCC 1 1 GGCC I UXAAGTGCTGGAATTACAGGCATGAGCCACCGTATKTGGCC 
AfiANAAATTTTTT AATATAAATTTTTTCAG TTACCACTTAAAGGGAAATATGATT AAAAA 
AACTAAATAAAGAAGAGCTTTAGT AAAACCATGCCCTCTT GCT AATCTATTAANAGTCAA 
ATCTGAA
Sequence 933

CCCTTTCGAGCGGCCGCCCG^GCAGGTACAGTATGTTTCCACTT ATGGACAGATAATTAC 
GTAGT AAACATAGAAACACACGAACTGAAAGGACAGACACCAGTATCAGAACTAAGTCAC 
CCATGGGGAGGGACAGAAGGAAATAGGATGGAAAGGGGTTGAGGGACTTC GTATTT 
GTGATGTTTTAGTTCTTTAAAACAAAAATCTAAATGA^ 

agaaaacatcaaaatgtcaacaatacttaaacctgag tgttggg tgcctgaatgttat at 

TGGTCTCTGCA 

C«HTTCGAGCGGCCGCCCGGGCAGGTACCCAGTATATGAGCM 

GGATAT AGGGAGTGGATAGCT ATT ATT AATTGCAGATT ATTTTGGAAGGAAAAACACACA 

GAGAATTATGTATCTTTCAGTGTAAATGTTAGTTCTAAAAACAATCATA^ 

GCTGCAGTTATAGAACACAATTCTGAmCTGCCTCACCCC^ 

CATTTCCTACGTTTGATCTGATAGTGTT ATTAAAAATAGCTG TT ATTTTTAAT AGCT ATA 

CTAAAACATAAAAATGTTTAGGCCAGGCGT 

Sequence 935

CCCTTAGCGT GGTCGCGGCCGAGGTACCT AATTCATAAGATAAGGATTAAATGAATTAAA 
ATATATAAATCCCTTAGATAACAATGCTAGGCATATGTTAAGCACTATGTTaGTATCATC 
AAATG TTGTTG TT ACTGTTATGGAATTTATCACAAAT ATG T AATTATATG TTTCGTAGTG 
ATTATTCATCACCOCTACTGGACTCTAAGGTCTGTGAGGATATGTCTATTT^ 

CTGTATCCTCAACAACTGCTGGTTGTCCCT attgtaggtgttaggtattaagtgcatgat 

AGTGAATACATAAAGGTT 



CCCTTAGCGTGGTCGCGGCGGAGGTACTACAGATTAA^ 

atagagaacagaaacaggc i cm gatttcaccatggtcaatgctctgatgtgccaaaca 

CAGGAGGTT GTGGGAACATATAGACAGTGACCAAACTTTTAATGAATACAGGAAGATTTT 

CTGGAAAAGATGACATGTACCACACAGCTGACAGACGaGTTTA^ 

GTGATAATAATCTTTTTATCATAAAATTTTAAGTGTGGTAGAGAA 

AAATGTTGAATGAAATGTGTTAT 

Sequence 937

CCC IT T CGAGCGGCCGCCCGGGCAGGTACACTAAAAATAGAATATAAGGGAGTGAAATCA 
AATCCTCGCTCACTTGAAGAAATAACAGTCTGTGGGGAACTNGG n GTTTCTCAGGTCAC 
CT CAGGGGA CAGATGCTCCCT AAGGTGCAAAA G AATGAACTGGTGCTGATATATGACTGA 
TAAG I II C I G I AACGGGCCACTGACGATTTCAATTCCCAAGGAACATAAATTACCTTTTA 
GCCTGTGTATTTAGACACAAATATGCAACCTGCAAAt. 1 1 C 1 1 C 1 Ga GGACAGATGTCAAC 



TACTT 1 1 ICAI I i 1 1 II 1 1 1 1 ACAGTCAAA 
Sequence 938 

CCCTTAGCCTGGTGGCGGCCGAGGT ACCAAGT ATACTT CACCAGAT ATCTATAGAACATT 

CCACTCAGGAACAGCAGAATCCACCAGAATATATATTCTTCTGAAGTGTATGTGGAACAT 

TCTCCGGGATAGACCATATGTTAAGTCATAAAACGAGTTTCAATAAATTTAAAA 

ATATCATACCAAGTATGC T C TC T G ACCAGAATGGAATGAAATTAGAAATCAAT AACAGAA 

GAAAATTTGGGAAATTCACAAATATGTAGAAATTAAA^^ 

TGGGTCAGAAAAGAAATCACAAGG 



CTTCCATACTCTTTT AATTGGATATGCCAGTGTG7>frCAhrrAATTTGCAGTGGCTGTAAA 
ACTTT G^GAAATTTTGTAGCTTTTAGAAACCACATACCTGTATTGCCTGATTGCTTATT A 
AGTT^TCTCTTAGAGGTTTCCAAAGTTATCAGTTTGAG^ 

ATGAAAATTTCAGTGGTGAC>^ATlATAGAATTTATCATTCAATTCAGTCTTAACTAGAA 
ATAATTGCATATAATAAAACAGG 1 fCTTGACTOI ICTTTI 

ITACTGCCACTTCCATTTTGTAAGTGAAGCCCAGA 
G A AGC AAAGAAATGTGCCCT AGGTCACATAGCT AGTCGGT GGCAGAGCTGTGATTGGCAG 
GTTGGTGGAATGCCTCC AAAGCCCTCGACCTTGCCACT ATACTTGACGCATCTCTAGAGA 
AGAGACAGAAGTAGCCAGGATGAAGGTCTTCAGGTTTAAGAAGAACTATGAAAAAGCAAA 
AGAI I T I IGT 1 1 ICGTGGT I TT TTTACTATAAAGGAAAACTTTAAATAATAGCAAGAGTG 
CTATAGGTAAGATATCAGA 

Sequence 941 

CCCTTAGCGTGGTCCCGGCGGAGGT ACCTCGTGGTTGAACTTATTTGGGGACAGAATTGA 

GACGGAAAAATTTGATATCAAAG G AAGTATCAAAACCCTTGATGTGGTTAAGAGCATGGA 

TAGTGAAACTAACCTTnGATGTATGGTGAGAGAGCAAAAGAGAAAGGATTGCA 

TGGAATGTAGAGGATGAACATATTGGTAAT AATAATACTGGTGGAATTGTT ATTCAGGAA 

AAAATAGCAATTATTCCTGTTCATATCTCAAATCATTGTA I G I I G I 1 1 ATTTAAAGGGAG 

ACATGGTAGAAGATATCAAATATAAAAA 

Sequence 942

CCCTT AG CGTG G TCGCGOCCGAGGT ACATGAAAATGGCTG TTTTTCCCCACATT AGTCAG 

CTCTGGATTTTGCATGTGTGGGGC 1 1 1 1 ITI I TGATAGTTA 1 I IGI Mil lATTTTAAAA 

ATTrATTTTGCCAACCCAGTAGAGAACAGCTGAGCATCTTCTCATGTAT^ 

CTGCATTTCTGC fj SCI I ATTGGCCATGTATTTATTGGCCATTTGCCGTCTGCTGTGAAAT 

GTCTTAAATTTTTTGCCX^TTTTTCrAGTG^^ 

TCTGATGAI 1 1 1 X ATTGT 

Sequence 943 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTCAGGAGATACATTCT 
GTGTGTTCTATAAATGTCAATITAATOCAGTOGGCTTATGATTTTCAGTTCT 
ACTGATTAATGTGTATATACTAGTT CT G T T ACTAAGGaGGCATGTT AAATT AATCCC7 AG 
CTGT AATTGTGCATTAGTTTGTC I C 1 1 1 1 GAGCTGTTCTAGCTTCATAAATTTTTGGAGC 
TGTTAGGTGCATATACG 1 1 1 AGGATT ATTTTGTCTTCTT GGTGAACT AGACC T 1 1 1ATCA 
TTAGGAAAC 
Sequence 944

COrrTAGCGTGGTCGCGGCCGAGGTACAAAAATCAAC 1 1 ICCI 1 1 1 1 ACTATCTGGAAAT 
AGGAAAATGTTCCATTCACTATGGTGACAAAACTGTAAAATAGGAATATArrTCTGAGGA 
AAGT AT AGGTATTTACAAAT AGAT AAACTATATTCTT AGAT GAGAATACTTAATACCCAC 
TTT AGAAAATT AAAAATGAATT ACAGCTTTTT AAAAAT AGATT AAGCT GGGTG TGATGAC
ATGGCACCTATaGTCACAGCTACTCAGAAGGCTGAGGCAGGAGAAGCACCT gagcccagg 
AGTTTGAGGCTCTAGTGAGCTAT 
Sequence 945 

CCCTTTCGAGCX3GCCGCCCGGGCAGGTAGCTGCAAG t ccaaagaggaccaggagga tccc 
CGCCAAAAGAAGGGT AATCGATGGGACAGCAAAGTT ATCAG TCAAGT AAGGCAGAAAT GC 
TrGAATGAATAAATGTATATAGATAGAAAGTAGAGACCTTGATAAAGTCAAACTCCTTGC 

ctttacaagtgtgtgttcagcagccatgcaagggagatgcccatctggcagtgg 
gcaaggtgtcagagccctagtggcagggagatggcatccacatatgagggagggtgacat 

GGTGCT AAC TGGGCATCT AGAT AGGGCAGGG 
Sequence 946

CCCTTTCGAGCGGCCGCCCGGGCAGGT act gcatatttaatgaattattttataaattgc 

TGTTGTGAAGCATTTGTGAATGACCTGCCTCCTAGCTTTCAATGCT ATTGCCCAGGCTNG 
AC TTTTATTGCAACTGTTTTATGATACAGTTTTGCATTGTATGTGTTTACTT^ 

agcatttcct gggaggtttctttttctggttatgaaaataatatatgcttatggggaaaa 
attggaaaatagaaacnagtatct agaagaaaaatcactcataattccancaccctgtta 

ATACTTTGTC ITTTCI lACAGTTTCTAATA 
Sequence 947 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAGTAGATGAGAACTACTTAm 

AGCA TGCT A T AGAAACAAAA TATGAGT AATTC TAACTGTAG TTATG TT ATATT AGCAT AG 

TGAGATAGTAACATTAATAGAATTCCTTAGGTGGA AI IICII IA ATGC 

Sequence 948

CCCTTTCXAGCGGCCGCCCGGGCAGGTACTGCATATTTAATGAAT^ 
TGTTGTGAAGCAmGTGAATGACCTGCCTCCTAGCTTTCA^ 

CTTTTATTGCAAC lb I H I A TGATACAGTTTTGCATTGTATGTGTTTA C 1 1 1 1 tAAAGAA 
GCATTTCCTGGGAGG7TTCTTTTTCTGGTT ATGAAAATAATATATGCTTATGGGGAAAAA 

ttggaaaatagaaacaagtatctagaagaaaaatcactcataattccagcaccctgttaa 

TACTTT GTCTTTTCTTACAGT 
Sequence 949 

CCCTTTCGAGCGGCCGCCCaSGGCAGGTACCAAGAACT AAATTGTGATACGATAGGTGACT 
TATGAGT AGCACAGAATGT AATAGGCCCATCTCTACCT AGTTCTGGTCACCACACTTCTG 
TCAAGGTAGCTCGGAGAGACGGTGTCTACTTATTCACCACATCATGAGATCACCTCAAAC 
TGAGCAGGCAGGCAATGAAAACCGTGAGC MUM I ACATTAACTTTCTGAAAGTCATTT 
TTTC TTATTCCACTTT G TGCCTTTTTTT AAAAGCTGCAGC TTCATGGAATTTAATCCTGG 
TATTTAAAACACT 

iTACTTGGT AGGTTGATCTCTTTCATTCTCATGGT 
TTAATTACCATCTATTCACTGATTACroCCAAAACTGTATCTATAGTCCAAGACTGTTTC 
T AAAAGGTCT GCACCCACATATGCAAATAAAT A 
Sequence 951

CGGCCGAGGT AC TCTT AGGAAAGAGTAATGGGGTTGAGGATGGTTAATTTAGCCCATCCT 

AACTTCTAGTGAGAH I T I T 1 CANAATATTTTGGATGGTTCTCTCACTTTNGTTATTAAG 

CATT AGGGAAGAAGATTC TGCAGCC T AC TCAGGT GAGCCAATCTCATGGCATTGAACANA 

NAAKATATGTTTTCACGTCTTTAACCAflTGTTTTTCATAGTGNAAGTCAGGCCTTTCTCX 

TTTGATCTAAGTGGAACCAAGAGGTTAGATACTCCCTTTNCTTTAGTTATATAATGGGCT 

TCATGTAACT 

Sequence 952 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTCTGTAGGTCTACAGGTAAAAAGCTATTACG 
TTGCAAACATTATAACGTAATGTAAGGTCTGGATTACATGCCTAAAAATCCAATGATTCT 
TGGAATOATCAAATCTGTTAAGACTGAAAAGAATACCAATGTTTAAATATATCTATAAAA 
TG CAGGTCAAGGGGC TAAGAAAATTGCAACACT AGAAAAC CAACAAACTTAGGTTGTTCT 
AACATACATACACAAATACAGGAGGGACGTTTATGGGTCACATCTGCGAAACATTTTTTC 
CCAAAAAGCTGAATTTT 
Sequence 953



COCTTTCGi 



GGCATTTTCCCCCAAACOUXXATACTCTGTAGAT^^ 

ACAGCTCTTCCCCATTCCCTATCTGAAAGCAAGGAACCACTGGCTTTGGTCAGGAAACAG 

GCATACAACATCAGATGTGATTATAA 

Sequence 954 

CC CTT AGCGTGGTC GCGGCC GAGGT ACCAGATGTTG TAAAATTT ACTA T AATT AATAGGA 

GTGCTATTTTGTTNCTCCTACATTTACTCCATAGTAAGCTKTTGT7TGAGAAAAAAAATG 
CCAGTTTGGTGCGTAGTAGATACGCAGAGGCCTGNGAAAGGGACNGATGACNCCATTACC 
C C AT GGGT ACAG AATG T ATAATG CTTCCCCTC TCAAACTTGGGTT G N TTTGGNTTTTTTT 



TACA 



PCT/US00/33312 



CCCTTAGCGTGGTCGCGGCCGAGGTACCTTTAAGCCAGATTCATGGTATGAAGGCAGCAG 
CATAGCACCTCCATTGACCCACATGGGGGCCTGOCTTGGGCTTCATCAGGCCTTTGGAGT 
CTCAGATCCCTCACCTGTTAAAGGAGAGT AATACTACCCACTTACC 1 1 1 1 I GGGTTGTTG 
TGAAACACACATAAGACAGTATTAGGAGAAGTAAGGTCTC^GGG 

GCGGCC CCTAGGTAGAGGCCTGTTGAATTGGATGACAGTGAACTTTGCAGCATTTCCTAA 

CCTCAGAAGTTCAAGA 

Sequence 956 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTCTGCTTTATTCAGTCTAGGW 

tggatgtg tgcaggtgacataatttcaggggataagg taaaaattagatgaagcccaagc 

AAATATTCTTAAAAAGAAAAACTT AGGATTTTTTTTT ACAAAAGTT AACTTAAAATGCAT 

tatctagaataatgttataaatcaacgtatagagacgttagtgaatagttcxxttcatta 

GGATGTTGAAGGAATATGGTTTCAATATTCAACAAATGTCGTGATGCCTATAAATTTTTC 

TACAAACAAGAGTATTGT 

Sequence 957 

CCCTTAGCGGCCGCCCGGGCAGGTACTTCAGGAGATACATTCTGCTAGTTTGGCGTGGTG 

TGTTCTATAAATGTCAATTTAATCCAGTCG<3CTTATGATTTTCAGTTCTA 

GATTAATGTGTATATACTAGTTCTGTTACTAAGGAGGGATGTTAAATTAATCCCTAGCTG 

TAATTGTGCATTAGTTTGTCT C I I I I CAGCTGTTCTAGCTCCATAAATTTTTGGAGCTGT 

TAGGTGCATATACGTTTAGGATTATTTTGTCTTCTTGGTGAACTAGACCTTTTATCATTA 

GGAAACTGTCCATATAACCAC 
Sequence 958



CCCTTTGAGCGGCCGCCCGGGCAGGTACTCCATAATAT^TCTTTTAAATGGGCAACTTC 
TAAATATTGATACAACCATTAATAATAATGCTTATAGGGTAAAAGAAAATTTTTGAAGCA 

ctgaattcactaacctgggtcatggtccaattttgctcact ACTTCATATCTTTT atgta 

GATTATTOCTATAAAGATGTTCCCTAMTTCCACATCAGTTTGTAAAGTCAATGGATTAA 

attattcaaatgtagctatttaacggtcagtaacaatgcctagaaacctat 



cccttagcgtggtcgcggccgaggta c i i n 1 1 1 1 1 n f m i n 1 1 1 1 n n 1 1 l 
cagtcttgctattttaagtccaggctggactcaaactcctrgaa^ttgctcaagcaatct 
tcccacctcagcctcccaagtagctgggattacaggtgtgatgtccagcttaggttccag 
ctnttaaaah^nttgtcagtgtggtgggcgaggtgggtc^vcatacacatataattataag 
gtaaaaaatcacaactactacaagaaaggtggaaacatttatgagaaaaccaaagaaggg 



cccttagcgtggtcgcggccgaggtactccagcctgggtgacagagtgagaatatgtctc 

aaaaaaaattatcagcanaagataatatagaccccaaggctaaagggaacc^ttatcatc 

tctaggcctgaaagcctaggagagggtgctgtatggagaggactgcttctgacagaggga 

tatagccaaccttggtggcctaatagagaggaaagtagggaatagcttcaccttccttct 

ctaatcttctgctagtatccctattaatttagcctaattagaagctggaaggtaggagag 

cctccatgggccaaaaaagctgttgtagagaacatggatcx:ttgaggggggtaaatgggc 

agataattctagccacagattg 

Sequence 961



aaaaaaaattatcagcagaagataatatagaccccaaggctaaagggaaccattatcatc 

tctaggcctgaaagcctaggagagggtgctgtatggagaggactgcttctgacagaggga 

tatagccaaccttggtggcctaatagagaggaaagtagggaatagcttcaccttccttct 

ctaatcttctgctagtatccctattaatttagcctaattagaagctggaaggtaggagag 

cctccatgggc>aaaagctgtgtagagaacatggatcctgagggggtaaatgg 

Sequence 962 

cccttagcgtggtcgcggccgaggtacttgagaatatgattgtaaatttgatcagcagct 

ACAAC ATTTCAATGATGCATA 1 1 IT IT T J I CAGATGCATTCCTTTGATTGAATTTAAAGT 
c aag c TT G TGC TTCTGGATGG TTGCTTTGTCAG T GAACAC TTGGATTTGGAAAAT acagc 

acctgggttggttttgagagaaaatggtttcaactttataattacagttttaacc^ 
aacaacaaaattaggatggtagtgaaatggaactaaatcaaatgcaaggttttagtttaa
tanaacaatctcatcctttaataatctttaaaagaagaacaact aaataacccaatnaca
aaatttgaaaattagggtcaaacct 

Sequence 963 

cccttagcgtggtcgcggccgaggtacttgagaatatgattgtaaatttgatc^ 

ACAACATTTCAATGATGCATAI U III I T F CAGATGCATTCCTTTGATTGAATTTAAAGT 
CAAGCTTGTGCTTCTGGATGGTTGCTTTGTCAGTGAAC 
ACCTGGGTTGGT7TTGAGAGAAAATGGTTTCAACTTTAT AATT ACAG TTTTAA 
AACAACAAAATTAGGATGGTAGTX5AAATGGAACTAAATCAAATGCAAGGTTTTAGTTTAA 

TAGAACAATG TCATCC tttaat aatctttaaagaagaacaac taaataacccaataacaa 

AATTGAAATA 
Sequence 964 

cccttcgagcggccgcccggccagctacactgcat 

CAGCCTCTCAACAGAAGGCTGTTCTATCCACACT ATCACAAGACCTGGTGGAGTTGAGGC 

**CTGCTGAATTACC^TACAGGGAAGAATGAATTGAAGAAAATTCCCATGC^AGATAGG^ 

TCTTAAAAAATAAATTTACACAAGAAAATCAGCACTGTAAAGGTAATTGATAAGCCCAAT 

agaagggaaacctatacaaagaaatagaaataact AAGCAATCTGAAATGGACTTTAAAT 

AATGATG 
Sequence 965 

CCCTTCGAGCGGCCCGCGCGGGCAGGTAGA^ 
CCAGCCTCTGAACAGAAGGCTGTTCTATCCAC^ 

AACTGCTGAATTACCATACAGGGAAGAATGAATTCAAGAAAATTCCCATGCAA 
TCTTAAAAAATAAATTTACACAAGAAAATCAGCACT GTAAAGGTAATTGATAAGCCCAAT 
AGAAGGGAAACCTATACAAAGAAATAGAAATAACTAAGCAATCTGAAATGGACTTTAAAT 
AATGATGTTTACAATTCTCTAAGAGGAAAAGGAG<^TTANCATCAGTGAAACAAAAGTAG 
GGCTATAGAAAAAACAATACTT ATGAAAAAACCAATTGGAAATTTTTAGATGGAAAAGCC 
TGAAAGT AAAAAATTCAACACATGGTCTAAAAGAATAAACTGCAGACAGCTTGAAGGGAA 
AATTAGTTAATTTTACCNAAGAAA 
Sequence 966 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGOGGGTCAAAAGGATGAAAATGTT^ 
AGAATGAAATTCAAGAAAACTTAAAGGAAATAAAAACTATTTAGCACCCAGTGAGGTAAA 
AATCGCAATGTCTGGTGTCCAGTCAGTTACCAGGCATGGAAAGAGAGAGAAAAACATGAG 
CCATCATGAGGAGAAGAATTAGCAGAAACCAAACCAGAACTGACATACATACCAGAATTG 
GCACACAAAAGGAT ATTAAAACAATAACAACTGCGTTCCATATGTT CAAAAAGTT AGAAA 
CATGAAAGA 
Sequence 967



agaatgaaattcaagaaaacttaaaggaaataaaaactatttagcacccagtgaggtaaa 
aatcgcaatgtctggtgtccagtc^ttaccaggcatggaaagagacagaaaaacatgag 
ccatcatgaggagaacaattagcagaaaccaaaccagaactgacatacataccagaattg 
gcacacaaaaggat att aaaacaataacaactgcgttccatatgttgaaaaagttagaaa 
cutgaaagatacaaaaataaaatcaaacttctaaagatgagaaactgtagtgtttggagg 



CC^TTCGAGCGGCCGCCCGGGCAGGTACGCGGGCGGTCTGTGCCCCATCACCATTTCTAA 
AGCACCCTACCCTC>TGGCAGTGTCCX^flAAGGAAGGGGTTTCCATGGTAACCTCAATGGA 
T ACAGTCAGCTGACGTCTGGCACCGCC TGTGCTGGTGTGCGCTAGCCTACTCACTCCCTC 
GGCCCrCCCrCAATCCTTTCAACTATATTTATTA G 1 1 C I C 1 1 1 A ATGGAAAGTATATAAT 
CCCTTAATGTCAGACCTTGAGTGGGCACTCAGCTrTATTAATTTATTTAGGTAATAAAAT 
TTACCTTCCTAATTAAATTCTCAGTAAGTCCTGGGAAGCTGTATTATTTTAAACATKTTG 
CACAATTGT 
Sequence 969 

CCCTTCGAGCGGCCGCCCGGGCAGGTACGCGGGCGGTCTGTGCC^XATCACCATTTCTAA 
AGCACCC TACCtrrCATGGCAGTGTCCCAAAGGAAGGGGTTTCCATG GT AACCTCAATGGA 
TACAGTCAGCTGACGTCTGGCACCGCCTGTGCTGGTGTGCGCTAGCCTACTCACTCCCTC 
GGCCCT CCCTCAATCCTTT CAACTATATTTATT AG TTCTCTTT AATGGAAAGTATATAAT 
CC C TT AA TGT C AAGACC TTGAG T GGCACTCAAGCTTT ATT AATTT A TTT AGGT AAT AAA T 
TTT ACC TTCCTAAATT AATTC TCAAGTAGTCC TGGGAGC TGTATTTATTTTAAACAT
Sequence 970 

t^^^AACCTTTTTCTATTCTCTGCAACAGTGGATATAGGATTGGAGTTA I HI I MCT 
TAAG TTTTTGG TAGAAAACTAGCCCAfrrNGAAGTCATGTGGGTTTGGGATTNTTCTTTGT 
AfK^NAGGmCCTAATTACTAATHAGCTTTTGAAAATAN 
Sequence 971 

CCCTTAGCG TGGTCG CGGCCGAGGTACCAAGATTATGATAGCCTCTTAAAACAAATTGGA 
GGTTATAACCTTTTTCTATTCTCTGCAACAGTGGATATAGGAT^ 

TAAGTTTTTGGTAGAAACTAGCCAGTGAAGTCATGTGGGTTTGGA \ fTTCTl I bl AGGAA 
GGTTCCT AATT ACT AATT AGCTTTTCAAAAT AG TTATGAGAATATTCAGGTTTTCTAT^ 
CTTCCTGTGTCAATTTTGTTG TCTTTTTCT ATAAATTTGTTCATCT ATAATTTTAATATT 
TTTGGTATAAT J CAAAAATAATCTTGTATTTATTTTACAAGGACAGGGATCTTTA 

Sequence 972

CXgTTAG CGTGGTC GCGGCCGAGGTACTCCAGCCTGGGGGACAGAGTGAGACCCTGNCTN 
AAAAAANNTTrrrTTGNNTNTGANNNf4l4GANTAA^ 

AGAAATTAGCTCATGATAGNCAGCTTT ATATTATNAATTATGTGACACTTTGGATATTTC 
AAAAGCACATTCACAAAGTGTATTGTCACTTAAATACCTCAAAATTTCCCTGTTATACAT 
GCAGATCATTCCCCATTCAMCGCTGGGTATGGGACTGAACTGTGTACCTTGCCCGG 
GGCCCGCTT(XiAAAAAGGG<^GAAATTCCAGCNACAC T GGGGCGGGCCGG TTT AC TT AG T 
GGGATTCCCGAGNCTTCGGGTTACCCCAA 
Sequence 973

CCCTTAGCGTGGTCGCGGCCGAGGTACTCCAGCCTGGGTGACAGAGTGAGACCCT 

AAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAAAAGAAAAGAAAAGAAAAAACAAGA 

AATTAGCTCATGATAGCAGCTTATATTATAATTATGTGACACTTTGGATATTTCAAAGCA 

CATTCACAAAGhKITATGTCACTTAAATACCTCAAAATTTCCCTGTTATACATGCAG^ 

TTCCCCATTCAGCCCTGGTATGGACTGAACTGTGT ACCTGCXXGGGCGGCCGCTCGAAAG 

GG 

Sequence 974 

«XTTrCGAGCGGCCGCCCGGGCAGGTACAAAGCTAGAAGCAGC^ 
TACAAACCCNAHACTGTCTACACCCAGACTTTATTCTTCTACAACCAAATTCCTCAAACA 
CACAATCTTGACCAGTAMCAGTTGAAANGGGAGTTTAAGGTGGGGGTGA 
Sequence 975 

CCCTTAGCGTGGTCGCGG<^3AGGTACGCGGGCTACCAAA£XTGC^ 

NGGGGCMAANAAAWGNNNTTNMCCNANCCTCCGAGCAGTACCATGCTATATTGGTCACTG 

TAGCTCTGGTACATAMTTTTNGAAGATTGGGGTAATGTGGATTCCTCTAGCTTTGTTAAG 

CTCTGTTGTTTTGACTTAGTATTACTTTAACTATTAGGGCTTC M I > I I GGTTMCATATT 

AAATTTGTAAAATAAAATTT 

Sequence 976 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAGCTCTCATTTGTCACTTrTCAAG 

CANGCAGGCANCATAACTGGTCCTGCTGGCTGATGCAGACCACACTCT 

TTTGAGCCAAGGCTCCCCTACTGTCTTTTCATTTTATGTCAAGGCAGGGGGAAGAGCTCA 

AAGGGCTCTTGCATCCCAGTCrrCACTTCCCAAGAGAGGCACTGAGGCCCTCCAGGATGTG 

GGGACAGGAACTTTGGGGCXAAGCCGGGGCTGTCX^GAAG^ 

TTAGTTNGAAAAGGGAGNAGGTCCTTT 

Sequence 977 

CCCTT AGCGTGGTCGCGGCCGAGGT ACTTT AAAAAGT AAACAAATTTAAC TGAAGCAT GG 

CTATTAGTT AGTGATTCTTTGTAGATTTTCTGGAAAGT C I rGTTTGTI I GTATTAAACAT 

TAACTCTGCTGTATGCTGTAAATACACTGCTAAGATCAATATTGAAAAACGAACAATAAT 

*^CAATTCATATGGAC CTTCA AAATTAGTCTTATAAAATTTTTATGGATATTGGNATTAT 

CCCAAGCCAACCTGACTTTTGAGGACTGACAAATAATATCTTAACTTT AACCCAGGGGTG 

GATTTCTTGCCATTTieCCTTTTGGMTTT 

Sequence 978 

TCAAAAAATGAGC TAGGAGTAAAGTATCTGCTCCAGAATCTACCCCCATCCCAGAAAGAG 
CAACOCAACTGTGTCCTGAGTGGCTCTTAGAGTTTAAGACTCTGAATGAATGCCTAAATT

TANAAAGGGTGTGGACCAAGGGATTTTNGGTTAA7TjTATT>^ 

CAGGATTTCAAGT 

Sequence 979

COCTTTCGAGCGGCCGCCCGGGCAGGTACC TGGCAGCAGAGT AGGCACT AATATGTGTTG 

AATGAGTAGGTGAAATAAACAAAAACCTAATGGCGATGGAATTTTATGGAAATAAGTAAA 

CTTCATTATTGCTGAAAATACCGCAGATAAATAGAGGGAGGCAGTGTAATAGAGTGGAAA 

GAGCAG T AGACCAGGAG TCAGACAGTCGAGGATCTCATTCTAAATTTGAAGGTGAA TAGC 

CATGTGGCTTTAGACAGGACTUTGAACCAC C M G 1 1 M CTTATCTGTAAAAGGGGGGAAG 

TCATAATAGCTACTOCTGCCTAACTCATAMGTTGTTGAGAAAATGAAGTGATT 

Sequence 980 

CCCTTTCGAGCGGCCGCCCGGGCAGGT ACA TT ACCTTTT ATG T ATGC TGGAAT AAGAACT 
TGTGTCTACATGCATGTAGAAACAATGGAAGGATAGGCAAGGAAAATGAAAAAAAAATGA 
TAACCTATGGGGAGTGATGGCCACT AGATGAC TGGGGACAGGGGCTGGTGAGTGAGCGCA 
ATTATCTATTTAAACAATCAGAAATTKrrCCCTAAATTACAAGTTTCT 
AAGAAATTCCCGACAAGCTCTGCAAAA TAAGTTCTGTCAATCAAATCTTACATGATGCAT 
TAACTGAGCTATTTTAAAAT ACT ACCATGGAATTCATCnTTAAAGGGTGACCTTTGTAAA 
AG 

Sequence 981

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTATTG TTGAC TGGCTAACAGAGGACCAATTA 
ATAAGCCAAAGAAATGGC TCTTTAACAATGAACATTTCT GCCATCAACTGACAGATCCCA 
' GGAATAAATE — " " " — " 



TGCCCACCCCATAGAAGGGCTATCCCTCCAGGTCAGGTTAGC^ 

CAAGTCAAGGAGGTGATGGTTTGCCTTTGACATCTCTACCCAGACCAGACTCCACT 

AAGACTCTCCCTTTTTTCATCACTGCCCTACCTAGTTAGGTTGGTCCTGC 



CCCTTAGCGTGGTCGCGGCCGAGGTACTT AGATCAGATGGATTGAAACATGACAGCCCCA 
TTTCATCTGGCCGGTTAAGGTCCTCATGGAATGAAAAACACTTTCGGG ATG 
AGAGAGAGAATGGG ITTCTI IAATTGCCAGATTGTCTGAACACAGCCTCAGCTACTTCTA 

CA TTCAG TGCCAGCTCTCTAAAGTGTGGATTCTGGATTCTGGT AGAAGOCAGTAAAGAAA 
CGI T f I TCTCTGGAGTGGAAGCCTAGTAAGATTTATTT 



CGCTTAGCGTGGTCGCGGCCGAGGTACAGTGACATTTCAAGACATGGCCCAATGCAC^^ 
CAACTTCCCAAAGCTGT AATTCACGAGATTCCTCAGGGTCCTCT AAGCTCCTTGAGGGCA 
GAAACTT ATCTTTGTATTACAGCTAGCCTTCAATCAGTAGGTGTTGAGCTGA I II 1 C II I 
TTCTTTTTTAAACTCAGAAGTTAAGTTCXAGCTTCAGTGGCTATGCCCAGATGGTCTGAT 
TCTGAAGGACAAGAGAATTGAGNTGGCATAAGCCCTGTGCTTGGCATGTAGTANGTTTCT 
CAGTAAAC TTTANCTGGCGGGA 
Sequence 984

GAATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTAGTAAAGATGGGG 
ATGTTGGCTAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCAGCCACT TCGGCCT CGC 

TTGGAAGAAATGAATCCTAGTTITGGTTCAGAAACTGTCAACAG 

TGACT ACT AAATTTCAAGCAAAGAGAGC TGAGTTGGGGGTAAAAGCAGGGC TATTCCCCG 

CCTTCAGACAATGCTTGTCCCTTATCAAGGGCAGACTGCTGTCTGG 

Sequence 985 

CCCTTAGCGTGGTCGCGGCCGAGGTACTT ACTTA A 1 I 1 1 1 II I M l M III 1 A GTAGAGA 
TGAGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCTG 
CCTCAGC CTCCCAA AGTGTT GGGATTACAGGAGTGAGCCACCGCACCCAGCCTGTGTGTG 
Mil II I AC TTAAAAAI1 1 I » AAATTTAAATTTAAATGTTTAATTGACAAATAATTTTAT 
ATATGGGGTATAATGTGATGTTTTGATGTATACATTGTTGTATACGTTGTAATTGTATAC 
A TTGGGG ttg t at acattggga TGT AT ACCATTGAAATTATTTG na tccagaaaa ttaa 



cccttagcgtggtcgcggccgaggtac^tggaatacataattttgaaatggagtcagg^ 
ttrcctaatgatccattttgtaattcacctaacagctgagggaaggtccagagaaggaag 
AACTCAAGGTTAGTAGACAAACTTGATATTGAGTTGCACTGGCTOC^ 1 1 1G GT

CCCCTAAAGAGTATTTATCATCTTAGATTCAGCTT AAGTTGTGGACAAATATCAAGGGGA 
AAAGTA7TTACAGTTAACGTTGGAATCACACGGTTTTCCGGGGTTGTGCC 

tcaactttggtggtttctaaagagggaccgattattag I roci 1 1 CA CTAAGGAAGGGGA 

AG 

Sequence 987

CCCTTAGCGTGGTCGCGGCCGAGCTACCTGGCCTAGAAAAT A1 rTTTTTTTI IGAAATGG 

A*nCTCACTGTGTCGCCCAGGCTGGAGTGCA^ 

AAAAACAAAACAAAAATAAACTTTACTCAAATATCACTTTC 

CTTCAATCATTJCGCCTCTTCTAACTKTNAjCACCA C 1 1 1 11 ICCACTACGGCACGCATTAC 
ACGCCAACTACTCACCAGTTCACGTTTTCC^ 

TTCCTAAAGAACCAGGACTATGTTCTACTA G rC TTI Ul AGCCACTGCACT 
CCCTTTCGAGCGGCCCCCCGGGCAGGTACTCCTGTTTCTA 

TTGTCAAATGTTGAGTGCACAGATTTATTCATTGCAGC A rTTGGTTTTI CATATCAAAAG 

ATGGGAAACAT7GTGCAAACAATGCCCATCAGTA£TGGATTGATTAAATAAATTAGGTAT 

ATCCAATAATTGAATATTATGCAAGTATATAAAAAATAAGAATCATGAATATGGAAAGAT 

TTCGAAAAT ATATTGC TAAGATT AAAAAAAAGGAAGGGGCAGAAGAAAATAAGTTGGGT A 

AAAAAAACGCCAGAAATGTTTAC TAAT AATTATATTTAAAAACTCATAGGATAAACAAGG 

AAGGGTAATGAAATAATTAAT 

Sequence 989 

CCCTT AGNNTGGTCGCGGCCGAGGTAC I T 1 1 1 1 1 I I M l I 1 1 1 1 1 1 II I III IG GTAGAN 

ACAGGGTCTCACACTTTGTTGCCAGGGC I GG I C I NGAATTNCTTGGACTCAANCAATCCT 

CCCGTGTTAGCCTCCCAAATTGCTAGGGTTATAGGTGTGAGCCACCCTGCCCAGC 

TTTATTTCAGATGTTCAAAACAACAAACAAAAATAACA^ 

ATACGTGTTAAATGAGAAATNGTTCAGGGCTTTTATAAATTTGTGACCT 

CTTAMTCC IllUCr CCAT AAACTCT AATTNCAAATTTTACT ACCACAGCAAAAAAGAGG 



GGCTT AGCGTGGTCGCGGCCGAGGTACCTGTGATTGTC T G TG 1 1 GA GACTATTACAGAGC 

TCCAAAAATTAAAATAAAAATAATAATTrrACAGAAATACATATTTGCATTGGAATATTT 

AAGAAAGTTGAGTTTGGATGCCACAAGATTATTGGAGTNATAGGNA^ 

CTCACACCTGTAATCCTAGCACTTTGGG 

Sequence 991 

CCCTTAGCGTGGTCGCGGCCGCGGTACCCT AAAACTTAAAGTATAATAATAAT AAAATTA 
AAAAAACCAAAAACAAAGATTAACAGAAAAjCAAAACANCAAAAAAACT^CCAGGATATAC 
ATTGAGTCATTTGCAGG TTTGGGAGGGGGGGAAATGC 1 1 T H I GTATT AGGAGAAAGGGA 
AGCTTTCATTTTAAATGGCTAT ATTACTT AAAG TTTGCANTAAAT ATTTATTACTTTC 
Sequence 992

TGCTCGCTGGACAGAGGGCAACCCAACACTCTAGCCTAAAGCCCCGTGACACCTGC 
GGTGCTTGCCCACGCNTTGCACCCG TTCCCGAANT AAAAAGTCGCCGGTCCTANAAGGCG 
NCXsAGNTCTTGGTNGACCTTTGNGCANCCCCACCCGTTGCCAGTCTTGA^ 

ATGGAAC 

CCCGGTGGGTAT " 
T 

TGGCCGGGGCNCAAAAT1 
G 



ATTjCAGAATTCGCCCTTTGAGCGGCCGCCCGGGCAGGTACCCCATCA<»AGTG 

GOC TTNCC TGTATGTAAACCTTACCTAATACTTTCAGTCACCACTCTTT 

TCCC TTTT AAG N CAAAAAA^GGGANG NAAG T AAGTTGG N NATTTGGN GTTTCAAAGNG NC 

CAATTGNCTTTTGNC 1 1 1 1 1 TCACA 

Sequence 994 

CCC TT AGC G T GGTC GC GGCCGAGG T AC CAAGTTGTTC TCAAAC TTTCATG TTTGT GT AT A 
CAAATCAGCTGAGGCCTTCAGTAAACTACAGATTC^^TGGCCTGGCCXrrCAGAGATTTTG 
AC TGAACAGG TCTGAGTTGGGACT AGAAATA TGGATTGCTAATAGGCACCCTGACAATTC 
CGATGTAGGTGGTCCTTAGAACATATTTTGAGAAATATATTCTGTAGTCTGGCAGATAAA
GAATTC 1TAACAAGGAGG TCCTGCCCGGGCGGCCGNT CGAAAGGGCGA 
Sequence 995 

CCCTT AGCGTGGTCGCGGCGGAGGTACCATCATC TGTTTCCCTCTrGGTTATAAATCTTTA 
ATGAAAACGGATTTAAAAAGTCACATTATGA TGCTCGAAGCTCTGACCTCTCATCACAAT 
GAGAAGCAAAAGACATGCCATAAACATGATATTTCCCACAGGAACGATATTAGAATTATG 
TGATGCAA TCTCATCCAAGGTCA TGGTA TCAAACCAGACACAGC T AAAAATG T A TCAT AA 
TAGCAAGGATACAGTAGCAAGGATGGGCCTCAATAAACATTTAAAGTGGAAAAATTCTTC 
TCTAACTCATATCAAG TACCTGCCCGGGCGGC 
Sequence 996

CCCTTT CGAGCGGCCGCCCGGGCAGGTACCAAAATAGATAAGGATCC I G I III 1 1 GAAAT 
GAACCCCAGTTGCGCCTTAGGGATTGTGAGTTGGCTCATT^ 

II 1 1 f I AIT C T C 7 AAATTTCGGGACCT GATGCTAAGGAATGTGAATATACAGTTAGGTTC 
CTGCGAACCC T G I G M GGTTCAAAAAGGCTGGTGGAGGGAAATTTATGACACTAAATGCT 

tatattagaaaagaggaaaattggccgagcacggtggctcatgcctgtaatcccagcatt 

TTGGGAGGCCGAGCCAGGTGGAT 

cccttagcgtggtcgcggccgaggtacttgggaacaatagct 
aattggaatrragacttttcaaaattaaattcacttacttct attcccaactcaatctaga 

ATATTATTGCTGATAGTGAAAAGACC^GAGAGATGAC^ 

taattgtttttactcacacctgtngatgtcactttaaa

AAACT ACTCCAATTTAAGTAATGAGTTAGAGCTTTGGCAACCATTAAGGC I C I C I 1 1 IUC 
CAACTCTAACAATATGTG GTAA TGTCTTCCCTGACTTCA 

aaggttatatttaaaggg rn I C I ACATTTTTTTGGGATATTTAGCTC C TTGNAATTTAG 
NfiTT AT ATGTCT GGAITACAAAAACATATHATATTCAAAGAATTT^AACACTT AGAGGT 
AGAAGTGAAATTACAGGTTGAAGAATTATTTAAA 



CCCTT AGCGTGGTCGCGGCCGAGGTACGTGTTTTACTTGGTGCT GT AGGT AATGCTAATT 
CATGATAAATTTTGAGAACCACTCTAGGOTAGTATGTT^ 
AACCTTGAGAAATACACTTTTAATCATGACTGAGCACACAC 
TTAGACGTTCCATGAAACAATGCTT ATCTTACAGTGTGTTTTCTGCT CTGGTATTTTTAC
TTATATTCTATTA AATAG ATATGTGTGTATAAACTTATTGATATAAAAATTjTGGTCATGA 
TCCACT AAAGTGATTTT ACAAGCCACT AATGG 
Sequence 999

CCCTTAGCGTGGTCGCGGCCGAGGT A L 1 1 I II H 1 1 H I M 1 1 1 I H I I H C I II I 1 1 1 1 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 NAACTTGGGTTKTCCTTTrTNATNATTCTGN 

**^THAMAAAAACQlAAANCCTGTT>^T>rrAGGGTTTTNATGGMT ANAGTTGHANAAAA 

CTTGKhnTTTGTNAGTTTHAANAAGNCCATTTHAATGAGTKAA^ 

AANCNAACAAANCTGMAAAAAAGTAGGGGNGGGGG^ 

CCTTCANTANCATGAGAGGG 

Sequence 1000

GGCnTCGAGCGGCCGCCCGGGCAGGTACTT AACTGAAT ATTTATTTAAAAAAGCATT AAT 
TTATCT ATCTATATAACTAAATCTATCAAATATTCTTTAAAACACGAACCAAAGTTAATC 

tgaaactcttcctgtgaaaaaagtcatgtattatatgccttcaacacagaat^ 

Ai 1 1 OC I GTGGCATTATACTATGCC C n I G I C AT ATG C 71 T T 1 1 I C CCATAQAGCATTTT 

1 1 cccatagaacttt gt attcctccacttctaocac c 1 1 1 l i i i gaagaactcttattta 
ccatttcttggactaaattaggaa
Sequence 1001 

cccttag cgtggt cgcggccgaggtacccagaat atggt atatc tc ttcattt atttagc 

TCTTT1 1 AA A 1 1 IG U 1 1 GGTAATATTCTGTGA I IIIIHIIHMII MGGTATGGAGG 

tctt acatcttttgtaaaatttattcctaatactttggattttgacattatcataaaaga 
aaatt a tttca ctgacttttccagtttgct gctgggctaaacatatamttaattwtttat 
attttaatcttgtatcctatnactttgctaaattcatata 

Sequence 1002

CCCTTTCGAGCGGCCGCCCGGGCAGCTACTACTTG^ 

tcaaaatgtcattggttataaagtggtc>tcagacc>tgcagactattactaatattggt 



tatgttttagtttattgcagtgaaaatacaaaatttaaaagttattgtag 

gtgataattgtcaaggggttag tatggtctgagcatggttgatggtgctctgtcattctg 
gtatt a aac aacctgccaaa itstcl i g attacatgtcctaaaaaagtgaggggaagaagt 
gtaggacaaatgcaaaataaaataacacatttagctatacttttaagtatt^^ 

Sequence 1003 

CCCTTA GCOT TjGTCG CGGCCGAGGTACAI C I GI 1 1 C I G AAAGCATTTTTCACTGAACCAA 
rTTTCTATA CCI 1 1 1 I I I I GIA I IL I 1 1 I CG I 1 A G C ! 1 1 I G I I I A TA1XSGTTGCTATATT 
TTTCAAGCCTCATACCAGTCATATAAAACCATGATAAAACTTCATCAAAGC^ 
C>AATTTCAATTATCAAGT AAAATTCTAAAGAAAAATTTTTTACT agtttggaaatagat 
CTACATGTTTGA I 1 1 II I I lUCTTCCTCCt: I CCI I IG I I I I I I G1 1I I FCTCTCCCC1 1 1 
CCT AAAAAGTTAATGGCTATCATTATCTTCACCAAATTAi i TGI 1 1 GGT ATACCCATAA 
Sequence 1004



CCCTTAGCCTGGTCGCGGCCGAiGGT ACTCCTGAACTTAAAAGTTGAACAACAAAAAAAGA
AGGAAAATGCGTTAAT ACCTT ATTGTAA "HATTA 1 1 1 1 1 1 G GAAGACT ATTTTTT ATATT 
CAGAAGAAGTGTCAGAGTCAGCAGAAAGGGATT ATTTCTCCATTTAC ACAACAATGGT 
TTTAAATGACTX3GATAGATAGAAATCTCTTTCAACTTAACT 
TCTCTGTTTCAAGTTAGTTTTCCAAAGGATTACTGACTTT^ 
TGTCAGGCCTTAATGACATATTTCTCCTCAAATAAAGGATACAACATGC 
Sequence 1005

CCCTTAGCGTGGTCGCGCCa3AXOTACTTCGG^^ 

TCXrrCATAGCACATATGAGATGTAGCCATAAAATAGATGAATTCTTGAAATAMGGAATAT 
AACACTTGACT ATTCTGATTCAGNAGAACATAAAAAATGTTCTAi 
CACA TTT A TATKT ATTTCT ACAAGTHAACArfGAAT ATCTATT AGA 
Sequence 1006



COC7 1 1 I SGACCGGCCGCCCGGGCAGGTACATAO 1 1 C 1 GCI H jCATTGGTCCCATT ACAAT 
CCTGT CT AAATCCTGAAGTAAAAATGAATACCATAGTGAAGAAATTACTTGTGCATGTGA 
AAGAGGCTGGT(^AACTCCTTAATTGCAACAGGGATTTGA 11 CI ICI ACTAGTAGTTAGG 
AAAGGTTGCATT AAT ATTCAGT AGTTAAAATGTGCGATTCTAAATTTTTTGT AATTTCCC 

atgagagaataaattttttcaaaaatattcccact 
tctgtgaagatggcaaataaaatgac 

Sequence 1007 

NTK7T>IGNNHAAT74CNCNNTTAGCGMGGTCGAGGGGCGNGG>04^ 

TAATACTTTAAAATGTGTTAAGATATATGATTTAAAAAGCATTGTHAATTGT^ 

GT G T CG TCT ACA TGGCATTTGGACACrfiACAMT AATTGT AAAACA TAAAKAG T GCKAA TT G 

ttacacttacatatgaatagctgaaatg*<gcaacagtggacgcaan rrTTTTWgncrro 

AAGTTTTAhrrAATTACCCCAAflAANACCT ATTTAACH AGGCTG ATNCT AACKTGGGCGAT 
ATTT AATGGJ4TTTCTT ATT AATTTGGACCKAAAAANTC T 7 7 77 GGAATTAANCTTGGGOI 
ANTTCGCAACCAAAACCAAATTTTAAT 
Sequence 1008 

CCCTT AGCGTGGTOGCGGCCGAGGTACACTGGCTCACCTCTCAGGG C 1 1 IGCTOC1 IGGG 
ACGCTATTCAAGCTCAGCATCACCTGTCTCACA1 C IG I V I GG GATCCTCAAACCTGACCT 
TTGT AAATTTCCACT AACTGAAGATTGT AGAGGAAAAAAAAAAACATC TTATCGAATTCC 
TGCTCTTAT AGCTGATTTTA GCT AT TAGGAAAACATCCCAAGTTGAGCTTTTC^ ATTCCT 
AGA ATTTCAGAI [TC TTTCCTTTI fAAAAATTTTATCTOCI 1 1 1 AT AGTAGT AAAAATAT 
1 1 ICC 1 1 1 1 1 1 1 1 1 GGAATGGGAGGTCTT AAGCTCAGTGTCAAAAAT AAAATCATTTT 
Sequence 1009

CCCTTCGAGCGGCC GCCC GGGCAGGTACCTTCTTGCCTACAGCGTTTAGC I LCG I ITGTT 
TTGCATAAAGATL I G 1 1 M C I GACTTCGCATGAGGGGTAGATGTTCAGCTTATTCTCACT 
ATGTAAATTACTTAGTAAATAATAGGAAGAGATGTTGAAATACAAACTTTCTGCCACCAG 
ACCTTCACTCT ATTGCAGTCATTTTCTCCCACTCTCCCCCC 7C7CI CCCACTTCCTCTGA 
GGATTACCTTC CC C fC TC l CAKCATTCCTCTGTCAG I GGG 1 1 1 1 1 1 1 1 I U.I I IGG CATG 
CAAACATGCTCAAGTCTGTCTTATA 
Sequence 1010 

CCCTT AGCGTGGTCG CWTMTC GAGGTACTCTTTrc^GATGAAAG^ 
ACCTGTGAGTATGTGG TTTTTGATCTGTGACT AAACTGTCCCCATTTCCCAG rTTCTCTG 
CTCCGTCAAATATCAACATTTTACCAGGTTTCTCTGTTGTTGCCAAACCTGTCATTTTTA

tttggtgtggcttcttgggaaacttccatggcccatttgatgggaatcaaacagtgaaaa 
caaggacagatgcaccagaggtggcati^aggaacaaatgggtcataagaacttaccttgg 

CACCAGCCCCAGAATGG T>4AGGAGGAAAGGCACTNTAAGGTATCAGAAGGT AGAAAGGAN 

AGGTTGGATNATAGHAATGGGGGAAAGGG 

Sequence 1011

CCCTTNTNI^GGTGGCGGCCGAGGTACTGAGACACTGGATCCTAAGAAAATCAGAGTTAT 
AGCTAGTGGCAGTTATCAAGGGAATGCAGAGGTTTCTGTATTCTGAGCATGTTCCTGTAA 

taggatagataggcgatgtggcagcaacaact cccaattcgtaatg tcttaaaacaaaaa 
caagttttatttcccatttatgccatgtttccagcagagtttctcagagggctgtgctcc 
atgcatttactcaaggtctgggaatgatcatggctacact atcttgcagccagcatattt 
ggaacctgttgccactctgatggcagcagagaacaaaagaaa 

Sequence 1012

CCCTTTCGAGCGGCCWTTTWNGGCAGGTACGGG C 1 ITTTTS1 1 CTTGTGCAGTAACAGTG 
AGGGCATGATTAGCGATCTTTGCCAGCTGA-rGTCI IG> IGGACACCTGCCTTGTTACCAC 

GCACAATCTGAAACTCTT AGNGATACCAACAGCAACAAAAATGAAAGCAGTTATGGTGGG 
CAAGCATTAATCTAAAATTTTTTTTAA 
Sequence 1013



GTCTCAAATTCCTGACCTCAAGTGATCCTCCCCCGTCAGCCTCCCAAAGTGCCAGGATTA 
TAAGCAGGAGCCACCGCGCCCAGCCTAI f TTSTI I CTTAAATTTTTTTG 1 1 1 » ICAGTCA 
CCAGAATTTCAGCATGCAT AAATCACAACGGTTAACAATTTAGCATCTTTGG C MC I 1 1 1 
CCTGTGCACTTACGTTTTTATGTAGCCAAGATCACACGTTGCAT^ 
ACAGCGTCTAAGTCATCAGCACTCTATTGTGATGATTTATCTTAAAAATATTCCAAGCGA 
TCATTTTT AGTAA CTGTGTAATATTATATCATAAAGTTAAAACATAATTTGTCATTCAAT 
TGTTGAAATTTTTAGGTT ACGTAT A nTTCTCTI A TAAAT ATGTAAATATGTTTAT AAAA 
AGTTATATACAGTTTTTTATAAATCTTTGTGCATACTTT ATACTGGTTCCTTAGCATAGA 
GACTGTGGGAATAGGATTTCTTGAAAAAANGTAAAAAGTGTGAGTATGCATATATACCTG 
GT ACAT ATATG TT ATT ATT AT AAAN GGT AAT ATTCTTTTTTTTTTGGAGAAAGAANT CTC 
ACTGKACTT CAN NCTGGGGTAAAAGTGAGACCCCTGTCTNAAACCAACCGGAAAAAAAA 
Sequence 1014 

C CCTTCGAGC GGCCGCC CGGG CAG G T AC TTATTCAGACAAGAG TTC TGAC TCTCATGCTT 

GAGGATAAGATT ATACATTTCAGT ATTACATTGAAGATATTTCATTTTT AAC CAGACT AA 

CTTAGTATATTGTTATTTTTAATGTGACGAAAGAAATATTTTCATAGAAGCTAATGCTGA 

GTCTTTTGATAATTTGCCGTATCTTAGTCAATCCGAAAAAATTTATTTrCTACTATTTAC 

AT ATTATCC T AGTGGATATTACATTAC TT ACT GAAGCCTTTGGTTCTATGTTTCATCT AC 

TCAGACTTAATTCAGGAAGAGCTTCATCCAGATGTTTTGTTTATTTGTTTCTCGATTAC^ 

TGTATGAGATTTCAGAATTTATGAGATCATAGGTCAAGTGAAAGGTGACAGTTGAGAGGT 

CAAGTAAGAAGCTAAAATTTGTGAAACCAAAGAAATGACAGGACAGTGCCAAATGAAAGG 

TCAAAAGTCAAGTGACAGACTCAGTACCTCGGCCCGCGACCACGCTAAGGG 

Sequence 1015 

TTGTAGTTAGCCAGTGAATTTCATTTTTGAATTTTTrCTT^ 

CTCTTGCTCAGGATGGTCTCGAACTCCTGAGCTCAAGCAATTTGCCGGAGCTCAAGTCTC 

AGCCTCCCAAAGTGCTGGGATTACATGAGCCATCGCACTCTGCTGTTTCTGAATTTTTTA 

AACAAATAAATATGAAGCAATCAGATGCCAAAAATTACAAAAGAAAATCAGTATCAAAAA 

TTTGGAGTTTGAGGCCAGGCACGGTGGCTCAGGCCTATAATCCCAGCACTTTGAGAAGCT 

GAGGCGGGCAGATCACGAGGTCAGGAAATCGAGACCATCCTGGCTAGCACGGTGAAACCC 

CGTCTCTACTAAAAGTACCTCGGCCCGCGACCACGCTAAAGGG 

Sequence 1016 

CCCTTAGCGTGGTCGCGGCCGAGGTACTATTATAATAAGTTAACATATTTCCCCT ATATG 
CGGAAAATGCTGACTATATCTTTTGGTTGCTTTGGAACACT ATCTCCTCACAACAGTCCT 
TGTCTACAGAAATGGGAAAGGGAAGGACACATTTTGGTTTCTGCAACATGGCAACATTCG 
T AAAACCAGAAATGATG TG TGACAAG AAAC T AAAG AAC T GGAC G AAATT CAC TTC CATTC 
ACCCTGGTTAAAGCTTCCTTGAATCAGAGATAAGAAACAACATGAAAAATCTATTCCTTT

TAGAAAACAAGTCTlTAACCGAGAGGTTGGTTTATTrrGAAAAGGAATTAGACTCTGGGC 

CCACATACCGCTCGTTCAAAATATAATGCTGTGGTTTCAACTCXTGCTAAATGTTGCTGT 

GAC TTTT AAGCAGAGAACTTCTAAAAGGAAGT AACCT AGGGAGGGGCTGAT AT AACTCAG 

AGATCAAT AATTCATTTT ATTGGAAAT AGGAG TAGTAGT ATGAAA TGCT AGCANACTGTT 

TCATTTGGAGGGAGGCATTTTCTA 

Sequence 1017 

CCCTTAGCGTGGTCGCGGCCGAGGTACAATTCAACTATCATTCTGGTT 
GGAGACTGGCT ATAAGGT AGAAATATGGTTT GGGGTCTTGGATATAGTCATGGGTTGCTT 
TGAAGGACTGGTGACAAAGTTTGGACTTTACCTTGCAGACAG 

fl I I MGAGCAGGAG TGCACGAATCAAAGGAAATTAAfTTTAAAAAAATTT AAATT AAGG 
CT AGCAGGATTCAGTTTTCAAACTGGCCAGCTGTGGACTAAATCCAG 
CTTGTTTGACCAGCAGAGAGGCTTCAAAGTCTTCAATACAT^ 

GAAGATTAAATATAAAATTTCAAGTTTCGATCATC TTTTTAAATATTAGGAGTTCCAGCA 
ATGCCGGGCCTTTTCCCCCGCATGATCACTGAGCTGGATCTC^ 

GCTCCCC GCTGC AGCTGTCTCGGTTCT CI 1 1 IC TTTTACCTACTGACCCCCATATNCATT 

TTTAAAGATTTTTTAATTTTTATGGATACATAATACTTGNNCCTGCCC 

Sequence 1018 

CCCTrGAGCGGCCGCCCGGGCAGGTACGCGGGTCCCTTATTTTCTGG 1 G l 1 1 A CTTGGGA 
TGCATCAGTGAACAAAACAAAGGTATCTGTCCTTATGAAATTTATATCATAGCAGAGGAA 
GACTGGAAATGAAT AAAT AAATAAAGAATGGAG TTTGTTGGAAGGTAATAAGTTCTGTGG 
AAACAAGGAAAACCAAGGCATGGAGGTTTGGAGT GCTAAAGTGAAGGTGTGAGAACAGAT 
TGCTCTTGCTCAG ) 1 1 rCTGTCI HI ITTGTI I AGGAAATTGTCATTCTCTGTATGCTTC 
ATT ATAATATATACAATAAATATGAATTGTTATAATTTAAGATAAATTAT ATAAAT AT AA 
ATTATAA 

Sequence 1019 

CXXTTTTCGAGCGGCCGCCCGGGCAGGTACTT AGTTACTXXT^ GCCCATAGACGTGTTTGA 
CXTAGAAAAATTTCTTATACGCAACAGATATTCATAGAAATATATATTAAAATAAAGCTT 
GAAGGGTGAATTAAATAAATATTTACTT GGAAGCT AGAGTGGGTGAATTAAACAAATATT 
TACTTGGAAGCTACTTTATAGCCACTGGGCTGGATTTCATATAGAGAGTTCTTG^CCCTTG 
GGAGTTNTACAACTGCTTAACAXJ rT TO TCI ATGGTAGAATACA 
Sequence 1020 

CCGTTAGGGTGGTCGCGGCCGAGGTACCTAATGCTTTCAGCCCAGGAGCAGAAAGAGAAG 

TGGGCTCTTTGCTTTGAGAGTCTCTGAAAATTTTCAATACCCrrGGGACAAATTAAATGAG 

GTAGATCCTTCTTTGAATTTGTTAATAAAGCATGCTTGTTTTGTCTCCATAAAACAGGCT 

TTGACCATTAAGGTTTATATTTTAAATGGGTAAATTTTATTGTAATACACTAATTTTAAG 

AAAAGAA1TAACTCATGGCTTAAAAGCAAAAACCAGACCTTGGATTTCACCCATAACTTT 

AAGGCTGGTCATTTTAACCCTGATTTGACACACTCTTATTATGGT'GTCTTTTCTC^ 

TT GGCT AAAT ATTTCTGACCATCATAGCAATC TTTTC TATAAAGGAAGCAGGCAAGAGAG 

CTAGAGTGAAAATGTTAAAAACAAAACAAAAAAGACAGGATACTGGCTACCAGTTTTTCT 

TAATTAAGATGATCT GTTTTCGCAATTGCGT AAATT AGAATAAAATGTTATTTAACTCAA 

GGATATTTCTTCACTGAAAGAAAACT 

Sequence 1021 

CCGTTTCGAGCGGCCGCGCGGGCAGGTACTT ACAGTCTT AAGATATCCATACACCCC CAC 
ATCCGTCCTTTGTGCTAGAAGATTACTGAANATTTAATTCCATTTATGTCATTGGATTTG 
T AAAAAACCCC TT C TGGATTCAAAGATGAAGGC C TCACTT ACTTTATTTTTGTCATTTTC 
ACAGACCCCTTATGTAAATGCCTCAAGAGTAAAGAATCTTGCTCAAGTGATTTTTGTATC 

ATTCAACAAATACCCCCCAAGGGACCTCGGGCCGGGGACCACCGCTAAGGGCGAAATTCC 
AGCACACTGGGCGGGCCGGTTACTAAGTGGATCTCGAGCTCGGTACCAAGCTTGGCCGTA 
ATCATGGTCATAG 
Sequence 1022

CCCTTAGCGTGGTCGCGGCCCGAGGTACCGTGTGGGCCACTAATACATAAGCATCTGTGT 
TGGCTGGGGGTAGGTGTAGGGGGTGCTTGGGGAGAGATTTAAACAAACCCTTTCCTCTAC 
TTGCAAC^TCTCTTAAAAGCTTGTCATCATGTTACTTCCTATTTCTTTAGAGTTCATTTG 
TTTAAGACGGAAACGTGCTTCATCTTGTTCGCTTTTTCTGCATC 
TTCTAATTANCCCCAACACGGAAAAGAATGTAACAGAACTGTCTTAGTTGTGCCATAGAG
TTAGAATCTATCTATTAACATGTTTTAGGTKATAACAAGAAAAATAATAAAAACAAACCT 

attatgagaagctggccatggcaataaattttgaaagattaccaggaaatataaaaggaa 

MG 

Sequence 1023 

CCCTTCGAGQ5GCCGCCCGGGCAGGTACATATATTTCAAA 

AATGTTTTCACTCATAATTATGTGTTCTTCCCACTTCTATATTCTCTATTnSGGGAAATA 
ATCC CATC^ACCACCCAACGGCCCAAACCAGGAACC TGAAACT AACCAT ATTTCCC TCCC 
ATTGCACATAAATTAACTTCTAATCCTACCTAiCTrATCTTTG 

CAG TGGC AATACTTAGGGCTTWCCTrACTrTTT ACCAGGACTATTACTAGAGCT'NCCTAA 

ATGCTTTCTATCTGTAGGCTTACTCTTCTGCATTTCTAT 

Sequence 1024 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCACAATGGAAAGATGATCTTCCTGCATTGTGA 

AGGTTGTTCTCATCAACCAAGCCTGCAATGACT AGACATTCTAAAGAGAAGAGTGATGGC 

AATGGAAAGAGCACACATCCGCTTGCCAGGTCACTTCTATCAGTTGATGACATGCCATAT 

TGTTATGGCTAGGTCAGCTTTGCACAAGTATGCACATGCAAAATAGAACTTGGGAAAAAA 

ATCTTTGATTTGGCGCTTTACCAAGTGGATCAGGTGTGTCAGAGTTCA^ 

GTCAGAGTTTAA 

Sequence 1025 

CCCTTAGCGTGGTCGCGGCCGAGGTAC IIGII I C IUCCCTrCGGACCACTCTCCCCACTA 
GACAGCTGTATGGCCGGC TCCCTCACTCTCC TCAGGTC T ATCAGAGGGTGGCCAC TGACC 
TCATTGTCTCAAAGATTATATAGAACACACACGCACCCATGCACGCA^ 
CATCCGCCTGGTTC CGTGCACTATTCCAGGACC T ACAGCAGTGCC T AGAACACAGAAGAT 
CCATTAGCAACATTT GTTT AATGAATTTATAG TGCC T AAACCTGCACAACTCTGACTTT G 
CCTTGCTATTAGAAAATGCAAGGCCAGGCGCGG TGGCTCACACCT GT AATCCCAGCACTT 
TGAGAGGCCGAGGTGGGCGGATCACTTGAGGTCAGGAGTTO^AGACAAGCCT GGCCAACA 
TGGCGAAACCTNTTC TTT ACTAAAAAT 
Sequence 1026



CCCTTAGCGTGGTC GCGGC C GAGGT AC TGAGGCT AATGGT CTT AGTTGGGAT AAGGAGAG 

TGGGGAAGGGGCAGGGGGAGATGATGAAATTCATTTATCCTCTGTGATGCT ATGGAAGAA 

CAATTAAGATCATGTTTCCTACTTGATTTTAGTTGCTAGTCATTTCTTAATCTAAGCACC 

CCCTATAATTTACCTATGTCATCATGCAAAATCACCATCGGTAATAATGTGGGGGCGGGG 

GAAGTCTATACAAGAATATTAAGGCCCT GTGCGTGAGCATGTCTATAGTTAAAGACTTAA 

TGAGAAAGCATCAAATTGTGGTGCAAACAGCTGAAAG TAGAAGTAAAT CACAACG T AAT A 

AGATGCAACTTTGGAGGAGCTGAAAGCAACAfl^TACGTTTTTTATCCAAAAAGGAGTAAA 

AGAAAAAATCGCNACGGCAGTTCCTTCAGATAATCAACNGATGATTTCATTTGA^iAACCA 

TAATTAACTAGCGTTGTTTGTAAAATAA C TT1 1 1 1 CATTTAT ACNTTTTAATGNTT ATT A 

A 

Sequence 1027 

cccttagcgtggtcgcggccgaggtactaattctrrtcctctttcctagaccgattctag 
tttgttgccttccctttcctcggaaacccgaagtttgtggatgctgcagacactctgtgc 
ccccctgcatgctgggtgcctggccaggtgccagggcataaagacagagacgatgtggcc 
tttgtcctt aagaatgaggttt gaaagccccagttcttccatgtt aggtgatttcttgca 
gctcttggtatctgcagaattagtgtgaatgcttaaaaaatattaacagcttt 

Sequence 1028

cccttagwtggtcgcggccgaggtactatgggtgtagtgttactattacagttaatccg 
tcctttgtgtgaagctgttaaatgcagtgaggattggagcactgtccactgaatctctgt 
gcaagaacttactcggtgttggcaggggtntccnggtgtctggctctgatcttggtcgctg 

GATAGNCGNCTGTNTNTCTTTAGGTGCCCAAGGCGACGGC 
Sequence 1029

AATGGGAATTGCTTAAGCTGTCCTGAAAAAATNGCCTTTAAC^TCT 

TGTGATACATAGAAGTTGGGAGGAAGATGTCGGAAAGCCCTAAGAGAGCTACTTGCCAAC 
CCCACCA T NAGG TC TNCC TCAGT G TTCCT AG TCAGGACAGACGAGGC CGAGTCTGAAATT 
ACGATAAGNCTTTGAATGCAGCATAAACAGACC 
Sequence 1030 
iTACTTTGACCTGTATGTAAACTCTAGTTACTTT GG
TCTTCTCAGGCTCrTGAGTCTTTCACAATTAAAGTAGTCTTTGAG 

CCTCATAGCTATGCTATTGGCCTGGACACTCAAGGGAGTATAAGCTNGAGGCAAACATGG 

ACTCATTTGTNTTTCTAACTTTCAGGGGATTATTTGMCCATCATrGCCTGATGTCCAGTG 

TCT 

Sequence 1031 

CCGTTAGCGTGGTCGCGGCCGAGGTACCATTGTTTTG TTCAAAATCACAATTTAAATAC T 

TCGTGATTTTAGAAATAATTGGAGCCACXX3TTTTCACCATTAAGG 

ATACATTTGGCACTGTCCATAGGTTTATGGCTTCGAACT7GTTTAAGACGATTC 

TGAGAGCTGATTTGCCATGGTTATGAAGCTTTCAGGATATAAACTATAAGAATGACAAAC 

TACAGCAGTTGAAAATGTGTCTTCAGATACTCACTTGCAACTCCC^ 

GATTGAGAAATGAGGATCGAGGGACCAAATCTGGCTTGGTC^GTAAGAGTGTAGGTAACA 
Sequence 1032



CCGTTAGCGTGGTCGCGGCCGAGGTACAGTGGTGTGATCGCAGCTCACTTjCAGGCTCAAC 
CTCCCGGGCCCAAGCAATCCTCGCACCTCAGCGTCCCCy^ 
TTATTTATAAAACAGGTGTTGGGCTGGACTTGACCCGTGGGC^^ 
ATTCTGT AAGCTT AACATGTGTT AATTACT GCAATCTGAATAACAATGCT ATGATATAGA 
CACTGTGTTCCTTTTAATAGACAAAGGAACCCAGGCACAGAAGGATTGACTAATATGACC
AAAGTCAGACTGCGAGTGAGTAGCAAGCCTGAGCTCTGAACCATGACAGTTCACATCTTC 
CACGACAGCAGCTTCTCAATG C1CM I G GAGGGACCAGAGCCCAGGCAGTAGCAACGGCT 
ATGAGGTGGTGAGACATGACCAGCAGATAAGCCCT GGGCAATGGTCCAGAGCT GGAGGGA 
GTGGAGAACTAGCCATTTGTGAG I 1 1 G 1 CaAACAATCCCTGGGGGAGTCTGGAAATTA 
Sequence 1033



AGTCAAAGATAGTTCCAAAACTTTT GAACAGAACACTGGATGAATACTGTTTACTGAGAT 
GGGGAACACTTAGAGAAAAATGCATTTGGAAAGCAGAAATACGATCAAGACTTCCAIT^ 
TGATACATTAAGCTTGGTATGTTTAATTCATAGCT ATATAGAGGTATTAAATTGGCAGGA 
CAAAATCATAGCT AGAGATAAAAATTTAGAGTTCACCAGT GTAAAGATGATATTTGATGG 
CACAGGATGGAC 11 1 C II C TGGGATTTGAGTATACATAG 
Sequence 1034 

TCGCCCCGGGTCCGMGNACGCGTGGGCAGGCATTANTTN N NGCCCAGTTT ATGAGTG TGA 
GCVttACCACAGTACTGATTACTGTGAAGCTG^ 

AAGATTTTCTGTTCAACAAATTGTTCA I HUH IGC CCGTNI ITTCTI I NTGAGTAATN 

CTTTGTATATTCNGGATGTTGATCATTATGGATTATAAAA 

Sequence 1035 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCATTTAACTGAGTGAAAGCTTTACAAT^ 
GGGTT ACTCATTAGCAGGACCT GGGTTTTG I \ 1 1 1 AATCTCATTAACCCCTTGTTACCCA 
TTT GATAACAAAGACTT CAAGGAAGAATTTGCTGAAAAATCTC TGGGAGACAGTAAT AGC 
TTCTTGGGCCTGACTGATAAA C r n 1 1 GCCTCCAGCAATGGAAATGT GGGAAAATTCCAG 
AT GCTAAATGATCTGGCTTGGACCCAGCAGGTT GAGGT AGTGGAGCCTTTCGATTGAGGC 
ACAGCCCAGGACTGCTGCAAGGGAGAGGCACAACAGAT 
Sequence 1036 

AGTCGACCACGCGTCCGCTTCGAGCGGTACCACGAGG^CGCACATATGCTGGACACTCAG 



GTGTCCTGGGCCACGCTGGCAAGGAGAG 
C 

TGTCCTGTCATCTTTGGCCACGTGCAGGGCCACGAGCGGAGCCTGCTGGTGTCCACGGAC 
GAAGGGAATGAGAACTNCAAGGCCAACCTGGAGGAGGTGGCTGAGGTGGTCCGTATCACC 
AAG^AGCTGACCCTGGGGAGGACCGTATAGCCCCAGGAC ATCNNCGTCCTCACGCCCT AC 
AACGCGCAGGCCTfTTGAAGATC^TCAAGGCCCTTCGGCGAGAGGGCATCGCCGGGGTGGC 
CGTGTCCTCCATCACCAAGAGCCAGGGGAGCGAGTGGCGCTATGTGCTGGTGAGCACCGT 

cccgcacxtgtgccaagagcgacctgnaccancnggccaccaagagctggctcaagaagt 
ttctgggcttcgttgtggaccccaaccaaagtgaacgttggctttcaacgcc 
angggctctgnctgatcngaggaccacgttcrrnttgcgcttgttgccccctttggccgt 
aancntnctggackttttgcgaggntta 
ATNTGCCTTT CTGAAAAGNCTTTTCACCT1GCAAGMTGCCAGG
ACTGGGANGGGAAAGTTNAGGGCCCGCC 
Sequence 1037

COCTTTCGAGCGGCCGCCCGGGCAGGTACCATTTAACT^^ 

GGGTTACTCATTANCAGGACCTGGGT rtTGTTn \ AATCTCATT AACCCCTTGTTACCCA 
TTTGA TAACAAAGACTTCAAGGAAGAATTTGCTCAAAAATCTCTGGGAGACAGTAAT AGC 
TTCTTGGGCCTGACTGATAAACTTnTGCCTCCAGCAATGGAAATGTGGGAAAATTCCAG 
Sequence 1038



CCCTTTCGAGCGCCCGWICGCOCAGGTACTTTGAC™^ 

GACACACAGCACAATTGATTTAACACTTCCAA7TTTGGAACTATTT3GATAAATAATGATG 
GGATTTAAATAAAGCAATCCGATTCTACTATTACAGCATAGG G fCTC I I G I A GTCCTCTT 
AGTAAAAACTATTGTGACACTTCC 1 1 c m C I CCAAATaTTCGGCCTGGAAAGACCT AAA 
TACAATQCAGGQATTGAATGAAATTCACACA I M I N I I CCT ACGGAAACAACAACCTTT 
C I TGC 1 1 A TATTTAACAAAAACTAGTATAGA 1 1 U X 1 1 1 ATATTAATAGTTATATGGTAT 
Mill I C I CAGAGTAGAAATCAGGTTTATAGGCTAAAGAATATAGGCT AATTT 
Sequence 1039 

CCCTTAGGGTGGTCGCGGCCGAGGTACTT AOATCAGATQGATTGAAACATGACAGCCCCA 

TTTCATCTGGCOGGTT AAGGTCCTCATGGAATGAAAAACAC 1 1 1 CGGGGACT CTCCTATG 

AGAGAGAGAATGGO 1 1 1 C M I AATTGCCAGATTGTCTGAACACAGCCTCAGCTACTTCTA 

GGAATAAGACGAAGCAGTGAGGAAGTTGCCAGTTGAGTGATTCTTGG 

CATTCAGTGGCAGCTCTCTAAAGTGTGGATTCTGGATTC7GGTAGAAGOCAGT 

CGI 1 1 TCICI UGAGTGGAAGCTAGTAAGATTTATTCTGTGGTGATGAAGOCATCTGAAAC 

CTTACAAGCAGTGTGGTTGTATCAGCATATGGGAGCTGACTGGCTCAGGAC I ! IGGAAGC 

CTGCTTCTCTGTGOCTCAflCCGGAACTCAGGTT ACTCAGT AGTCATTTGCT AATTTCTGA 

GAACGCANCACTCCTGAAGGGGATAGAAAGGATGAACAATACCC 

Sequence 1040 

CCCTTTCGAGCGGCCGC CCGG GCAGGACTCTTATCAA C I GTTTl A TAGATGAGAAAACAT 

AGATTCAAATGTAGATTATTTTACTACAGACTGAGAAACGAATTAAACTAG 
ATACTTTCTGGAATTGAAATGATACATTATATATACCTATAAAGATAATTGGCTATAGCT 

tcctaaactacaaattgtcataaaaatgacttctgtcctatatcaattagaaactggtat 
taaaattgagtattataagacaatagaatgt 

Sequence 1041 

TTGGATCCAAAGTTGGTCAAGTGTGCAGTGTTTA^ 

TCCTGGCCTGAAATATGTCACTAGTTAGAAACATTAGAAGCTTTC^GGTAAATAAATATA 
AAAAACCAGTCAACCGTATTCTTATTTCTTCGTCAGAGAATt^ 

CTrCCTGCTGQATTCTGGATGGGAGTTGTTGAACATATTAATCTCATTATTTTCTGTAGA 

GGAjCAGGTTGTCCCCCCTTCCTCATTAGCG 

Sequence 1042 

CCCTT AGCGTGGTCGCGGCC GAGGT ACCC TGCTTTGATT ATTTCCGAATCCAGTGGG TAG 

AGAAGG TAAA GGCAAGGGCTCACTGGATATrrTTAAATTGTAGGGATGTCCTTTGCTCTG 

GGTCAATTTTAGGATCAAATATAAAAGCACCTATAGCTCAGAGTATCTTCTAACATAAAA 

cttctgagataccagaaattttccaaaacatggtataaacagtatgaaacactgggtaga 
taaaagc tttctct aaatcttaaagtgc tcaaat atcatgacctgattttttagtttt ag 
aaatcagatatttttctattccatatcttaaacttt 

Sequence 1043 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCGT1TGTCCATGGCTATTCCAAATACCCCCAT 
GTTTATTTAAAATGTATATATAATCACTTACATAAAAAGAGGTATGCTTAAATTCTCATG 
ACTCTATGGTTGGACCTCTGTGGTTGGAGCAGGCAATAGAAATGTCTGTAATTCATTTAA 
AAAAAAAAG TGAC TTTCC7 ACC TTT AGATAG TGAGGACAATC TGTT AACTCTTTGTG TTG 
ATAAAAGCAAACATTTCAGGGCACGGTGAAAGAAATCTCTACCATGTATAAGGTTATATA 
TAT AC CAGAAGCAG TGGAG TT AGGAC CAAATT AAGATTTGA 



ATTGTTACATATAATTGTTGTATAC 



CAT AACTT ACTA I II 1 1 1CI M l I > A TTTTT AT ATATAA I fTTTTTI I G GTTTGTTTGTT 
TG TTTTTT AAT AAAC TGTTATCACTTA 
TGCCCGGGCGGCCGCTCNAAAGGG 
Sequence 1045

CCC TT(X*AGCCGCCCCCCGGGCAGG TACTTTTCCTGGGTTGTGAATCTTGGAGGTTGCCC 
TGTCAGaCTGGTGAGATCCCAGTTTAGCTGTGCT AGCT AAAGCAAGGaGAACAGAGAGAG 

ggat agatacttttgcttagtaaa a rCTTTcm GAGGGTAGGGACTGGAGTATGGAACC 
ttttcagaggaatgagaggggcttgtgacgaaagggtagaggag 
aaatcttacacaactaactaatgtcataaggccgaggatgagaaagtagca^ 
ttcatgctcatcacacataaagcattcc 

Sequence 1046

ccctttcgagcggccgcgcgggcaggtacagcactttcaaagtagt^ 

TCCATTTAACAGCAACATTCAAATATTTCCGATTCt G CT^ 

ATACATAGAAATATAGGAGCAAACAGAGCAATGCAGGCGCTCTATGATC^ 

CAT AGATCTTAAAAGGAGAAGAATGAGGGATTTGCCT ACAACCCACAGCCAATCTATGTG 

GAGACAAAGGGTGACTTCTTCCTTCTATTACGTTCCTTGAGG^ 

ATGACCTCGAATCATAATTTAATATCATTCTA 

Sequence 1047

GCCTTT CGAGCGGCGGCCCGGGCAGGTACATTATTGGTAGT ATCTCAGAATCCTGCTTAG 
CTTI rGAGATAAACCAAGinCATGATATTTTGGGTAATATGGCCATAGGTATCATGGAAGA 
TTGAACTGCCCAGT ATTTGCCTTTTTCAATA TTT ACTTTGTAAGAACCTGACACTG T AGG 
TCCTCACCACACCAAAACCTGCAACATAAACTTCA^ 

aagctaaacaaaacaaaaaggaaaaaaccgtctatatacaatcaccc re cr re i CTACAT 

TTAAI I reCI I CATTCAAAATAAGCA 
Sequence 1048 

CCCTTTCGAGCGGCGGCCaSGGCACGTAGAACACTTTAAAAAGTG 

GAATATCTCAATAAAAACATTT1TTAAATAAAAACAATTGCCAAAGGCCTG 

GAACATAATTCAAAATAATTTATGGATCAAAAAATAAATCATATAAAGATCTrGAGAACTA 

GAATGTAAAAATATAGAAAAAAGTCATAACAATATTAGA>4AAAAATTTGAGCTGGATAAC 

^AAATAGTACCTCNGCCNCGACCACNCTAAGGGGGAATTCCAGCAGACTGGCWG** 

Sequence 1049 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTATAAACAA^ 

AGCC AGAAGAAAAGGGATCTAAAGTAGAGAGAGAAGATAGGCTGACTCTCCAGTTGCAGA 

^^gattatcagctcatcacacgaccgaaactctctggtgatttgctatccacatccat 
ggcgttrggtggccgtaaagartgtaacggcct^ccatcct c ttg g tr a aaatggcaggtg 

TGTTGACAAGAACTGTCTTAGGTACCTCG 
Sequence 1050

CCCTTTCGAGCGGCCGGCa3GGCAGGTACCTCTCATCTC 
TTAAGAATACTAACAAGAAAAAATCCAAACCCCCAATAGAA^ 
ATACCCTTAAACACAAGAATTGTATTATTCAATGAAAGOVATACAAGTAAA 

taccttggctattttttcaatgtacctcggccgcgac^ 

Sequence 1051 

ccctttcgagcggccgcccgggcaggtacccatctcttccattctg 
taag^tgtaacttgtagcttgtagaatgaatgatggagtagaataaataagaaaggaat 
atatcattaaatgcacaggttaaataaataaaaatctattaataaagagoctaaagaaag 
aaagatgacatttcagcacatattgggtgaaataagttgtttagtccagc^ 

TTTTAGTGGATATGTGAATTGCCTATTAAAATGCAAATTrTAAATTAGTT AATGTGGGTT 

ggacctgagtctgcgtttccaacaagctcccaggtgatgt 

Sequence 1052

CCCTTTCGAGCGGCCG<XXGOGCAGGTACGCGGGTATAGCTATATA^ 

TTTATGTAAAATTTCCAAAATGCTTAATATGGCAGTATAATAATTATAACTAGATTTACT 

TCAAAACATAGACATAAAGAAGATTACATGCCTGTAGAAGTTCATTGAATTAGGAATCAC 

ATGCTATTTATTTTAGGAGATATCTrCTTAATTAAATGTTTGACCCATGTGAAGTCATTT 

AACAGATCTGTTACGCATTATTCACATATGCAAAATAATCTATATGATCTGAATACCATT 

TCCATCTTTAAAATTACATATTCC 

Sequence 1053
TACAATCAAAAAAAGACAAAAAAGAAATGGTGTT
AAAAGCCACAGTAAACATAAACCTGATATGAAGTATAAAACCACACACACTTTGCTCTTC 

atccggacaatgcccaaaattatactgaggtattggggtgggctgataccttc^ 
gagagagggaccatgttcaggaggtgtattcctcgatttaggtggtga 
ttttaagacagggtc tgactctgtcacccaggctggaatggagtgacgtgatctcggctc 
actggagcatcaacctcctgg 

Sequence 1054

cccttcgagcggccgcccgggcaggt acaatgaaaattacaaaatactgttgagagaaat 
taaagaagacaaataaatgaaaagagacggaacatgttttcgc ttg taaaac tcagtagg 
ATTAAGAIUCI II I Ci I C CCACGACTCT atagctttaaagcaatcaaaatcanactggtt 
ttgtct gaacgtttttgaataagtcaatggcttattt^ 

TGCCAAAGANTAGGCAAAATATTTCAGAAAAGAAGAAAGArrGAG 
GACTTCAAAAGTCACTAGAAGAACGAGGCCAGACTGCC 
Sequence 1055 

CCCTT AGCGTGGTCGCGGCCGAGCTACCCACCACGTTCATG TCTCCTC T AGGGAAC TATA 
AAGTTATTAACACAAGAACC C TGTCI I A TTCATCACAGTATCAGCCACAGGCGCTGAGAC 
AGTGCTTACACAGAAATGGCCCTTGATAAAATATGGGCTGAATGAATGAACATATGAATT 
TGAGACTTTGAGAACTAAATT AAAAGTTATTTCT ACT AGCATTTTT AACACAAGAACT AT 
TGAGATTACTTATATATTAGTAGTAAAATGTTTGCn IATTCATTTTGATTGCAAAACTT 
ATAATGAACTCAGTGAACCTT6NCCCACCTTTTT 

Sequence 1056

TGT ATTAAAATTCTG T AT AGACATTACGT AGCCTCAGAGTTGAATTTGGACTGCCCTT AA 
AATAAAAAATTC TTAAATCTTT AGTGTGGTGTCT ATTAATTTTTATGATGATTTACAAGT 
TGGAAATGATTACTTTGCAAGTCATAGTTTACTtTGAAGTTAATA 

AAGGAAAAATGCCATATATGGCATT G TT C TT A ACAGCTT ATGAAATTTGGAAAACGATAT 
TTTAGAAAGCI itctci IGKTGGCTGGAATGAAGTGGAGACCCTGCT 
Sequence 1057 

CCCTTCGAGCGGCCGCCCGGGCAGGTACAGC TTG TTCAGGATA I ITCTTCTATTTn C CT 
TTGAGTTCTTGTTCATATTCTAGTT AATTTCT AGTAGTTCTT AATGTATTTTAACCAATA 
GA l l rTTGTCTTCCT ICTGCTTATGTATTCCTCGTAAATG G I I I I rGTGACTTGTCTAAG 
TATAAACAACTTT ACTATTAGCTXJTAAAATTTTCATTTTTAGTATGTCATCAATCTTTT^ 
TTGTGMlTrAGTATGATTAAATGGTTTrrCACTTGGAAAGATATTGAATAGTCT CA 
TTGAI rTTTTTI l A AAGTCA l 1 1 I C A t 1 1 1 1 1 
Sequence 1058



iTACT ATACCAGAGTTAAATTGCCTGTGTTCTTTT 
ATTCCACTTrTCCAACTTTAAAA 

AGT AATGCTTT GGAAAGT GGAATT AAACATTTCAGAA 1 AA I AOCT G T T AGGCCOGGCTCA 
ATGGCTCACGCCTGT AGGGAGGCTGAGGCAGGTGGATCACC TGAGGTCAGGAGTTCGAGA 
CCAGGCTGGCCAACATGTT AAAACCCTATCTCTATTAAAAATACAAAAATGAGGCATGGT 
TGGCAGGTGCCCGTTGTCCCAGCTACTTAGGAGGCTGAGGCAGGAGAATTGCTTG 
GGGAGGTGGAGGTTGCANTAAAGCTGAGATTGCGCCAGTGCACTCTAACTTGGGCAACAA 
GAGTGAGATTC I G T C T CA AAAAATAATAATAATTAAAAAATAATAGTTGGTAGATTGAAC 
ATAGAAAACACGTTTTGTAGATAAAAAHT GGCCAAGTMTTAGCCAGCTTTGACAATTTTT 



Sequence 1059

CCCTT AGCG TCGTCGCGGGCCGAGGTACTTTAACAAATTAAAAACAAATTTr AATTT AAA 

ATATTTTAGAAATTTTACTT AAT ACATTT ATTT AATGAAGGGTGCTrTT AAGAACTTT AA 

ATCCTCAGGTAAACACCACCACCTGCAAAGTATTAATATCAACTTTTT 

TGCTATGT AT AAGCT ACTGAAAGAAGACAAAAATTAATAAAATGTGTCCCTCC T C TT AGA 

TATCTATAATCTAGGAAAATGAACAGATTCirrTCAGAGAGTAAAGTCCATAAGAACAGG 

c^tcagatcttatcttatttaccacxacatcctgagaatggagcacagtgcctgacacata 

AT AGA TGCTCA TAAT AGATGCTCAGGGTTT AT AGTGAGT G^ATAAGT AAAGAAATGAGTG 

agcaaatatctcttaaaaagaacagacttttaaagtt aacaagcaagtgatgtgttattc 
agtagcaaataagattgtttcctaatgtcataattcaatttt 



Sequence 1060

CCCJTTGGAGCGGCGGCCCGGGCAGGTACAGTTACCAAAA 

GCTTTTTGCATTTTAGTGGATGCAAATTGTGTCTTAGTAAGAAGAACATA 

GAAAGAT AATGTTGAAGAAAATAACAAAGCTT AAGGACTTAAACT ATTACCATGAAGACA 

TGTATAACT ACAGTAATTTTAAAAACTGT1 H I I 1 G CATAAGTATAGAGAAATGTACCTC 

GGCCGCGACCACGCTAAGGG 

Sequence 1061 

CCCTTAGCGTGGTCGCGGCCCGAGGTACTT ACGCTTTATGATCTTGAATATTTTCAGNGT 
MT AAGGAATCT C I I LCI IC I I IGATCTCCACTGCATGAAGAAC 1L1G1 I G CAGCTGTTAA 
CAAGGAAGTTTTGAAATACAAAGCCAGAACCTJGCC 
GGAGATCCATTTTGAAGAAGGTATAATGGCAACC 
Sequence 1062

CCCTT AGCX3TGGTCGCGGCCCGAGGTACTTT AACAAATTAAAAACAAATTTT AATTT AAA 
ATATTTTAGAAATTTTACTT AATACATTTATTTAATGAAGGCTGCTTTTAAGAACTTTAA 
ATCCTCACCTAAACACCACCACCTGCAA*GTATTAAOT 

TGCTATGT AT AAGCT ACT G AAAGAAGAGAAAAATT AAT AAAA I GTG1 CCCTCCTCTTAGA 

TATCTATAATCTAMGAAAATGAACA 

Sequence 1063 

CCCTTTCNAGCGGCCGCCCGGGCAGGTACACAAATTCTAGGNAAT^ 

GTCTAGAAI 1 1 1 M 1 1 U M I IATGAACCANATCACATTTCTGGACATGCTAACCATTAAA 

ACGG»<GAAGCTTCAGCTIGGTTGT I ATTCTTCCATTAAACTG T n CAGAAACATTCAGGC 

GGCAGATAACTCATTTGGATTGTTAAGAAACACCAGGTTTTCCAGA 

CTCATAGAAG I go I C I II C1ATATGTATGTTATGHATGATGTNAACCATAATATATATGGN 

TAAAATTTTAGTAGGAGTTATCCT I IGCT1 1 1 1 AT AATTTCCAGTTTTNNCGMNAACGTA 

ATTCCTTTTTTCGGAITCATTrrrrAGGTAAAAATGGTTCCC^ 

AAAT AAAG TCTT ACTTTTGAGTCTTTTAAGNCGTNNATTTTKGC^ 

GGAACNAGAAAGGTMfTTAAANCCmAAATTTTTGGAAATTA^ 

ATGGAAAGATTCITCGACCACCNMGKTTTTAfrrAAAAAACCNTAA 

NNAAN GGGG GGGGCMNgTAC CCC*^ 

AAAAWC+4AAAAKTGGCCTTNGGGGGCTTMT^ 



Sequence 1064 

CCCTTTCGAGCGGCCCGGCGGGCACGTACTTACT ACAAGCAGCAAAAGGAAGCTCTAGAA 
CAAGGAATT AAACACAG T GTTTG TTTCCAATCGCAGAAGAGGCCATGAGCACCAT ATGTG 
TGTCAGGCTTATCATCTGAACCAAAGAAAGGCCAATCCTTCACCTTrCTTATGACTCTTA 
TAGGCTGGAATATTTGACTTGGCCATAAAGAAGTTAATATCTCACACCTAGT AGTATTCA 
GTGACACAGAAAGGGAAAGAGAAAGGATGAAGAAAAGAG^AAAGA 
AGATAGAAATtTAATATTCTTTCCAAAGCATAAGAACAATTAAAAATATA 1 1 fCTCI GMT 
GMAAGTGGAGGATGGA 
Sequence 1065



CCCTT AGCGTGGTCGCGGCGGAGGT ACATTGAAACAATATAGTAG I C I I CCCCTTTACAA 

AGCTGAATrAAAGTAAAAGTGTGTGTTGGGAATAATAGGGGAATGTGGATTGTAGCTGTT 

TAATAAAGATTTAGATACATATAAAATTGCTTAAGGCC^^ 

AATTXCAGCACTTTGGGAGGCTGANGTGGGTGGATCACCTGA^ 

ACCCTGTTCAACATGGTGAAACCCCATCTGTACCTGCCCGGCGGC 

Sequence 1066 

CCCTT agcgtggtogcgggcgwggtacccacatgatcccaaagaggaggggccctgtaga 

AACAAGAACCAACCAACAfiAAAGCAGTGNCTACAGGCACCATGACAA^ 
AAAGTGCATCTTCAAATAGCACACAATTTTCCAATTTAM^ 

gogaamaaagcattaxttagatacaactgaatttctcaaaagtatattawcacagcctac 
aaat aaa tcctt aaat gta 
Sequence 1067 

CXCTTAGCGGCCGCCCGGGCAGGTACCCTCCGTGA C I I » I C AGGGTCTCCTGGTTGAATG 
AATTTCCANAAGGATTAAAATGTGTGTTCTTATTrGTGCTT^ 

7GTGTrGGAGGCTATTAGAATAGCTGAGAGGGTAAAACATAAACACATACGTANGAGCCT 
GACATAAACACATAGGTAGGAGCCTGCCATAAGCACCGTAGGTAAGAACTAAAAGGGTGT
GTTTCCATrTCANGNGGTCCAGMCCTTCCTTNCATACTCTNAGATGACAAAAACACAAAG 
TTGCTGGAGCTCACACAACTAATGACTAAANCCAGAAAGTTTGGACATGGAGAAACATTT 
TT 

Sequence 1068 

cccttagcgtcgtcgcggcccgaggtact atattagtgtagcaattttccaaaagccatt 

CATCTTAGAGGGCTAAATGATTTTACCTT ATCAATTCCTCCTGTGAAAAAATATCTCTAA 
AGAGGTTTTCTGCTGGAAAATArTGTTGCTGTCACATTGATATGCCAACAAAAGCTAAGC 
AGGGAA<nCAGGCCAAGAAATATCTNCCTGCAAGAGAAGGCATCGCACATGTATCTCTCC 
^TGCTATTTAAAATTTGCATTCTGCAACATAGAACGGATAGGCCATGCTGCAGAAGCCAG 
GTCCAGCAAAACTGCTTTC7TTGCCCNTT ACACATCCTTTTTGGAGAAGATGCTGGTGAA 
AGCAGCAACTACCATCTGCCT CCTG TTGACT7 AAGTGCAACAGGTGGAAGGGANGAAGGA 
AGGGCATCGCAAACATCATTCTATTATCTCAACCTT GCTTTTCTCGG 
Sequence 1069



CCCTTAGCGTGGTCGCGGCCGAGGT ACCCT GCTTTGATTATTTCCGAATCCAGTGGGTAG 

AGAAO GTAAA GGCAAGGGCTCACTGGATAri TTTAAATTGTAGGGA t U I UC I I IGCTCTG 

GGTCAATTTTAGGATCAAATATAAAAGCACCTATAGCTCAGAGTATCTTCTAACATAAAA 

CTTCTGAGATACCAGAAATT7TCCAAAACATGGTATAAACAGTATGAAACACTGGGTAGA 

TAAAACCTIT CTCTA AATCTTAAAGTGCTCAAATATCATGACCTGATTTTTTAGTTTTAG 

AAATCAQATATT TTTCTATT CCATATCTTAAACTTTCATGTTAAATTCTAGTTCTGACAA 

TGTAGGGTTCTATTTTTTTCAGGTGATTGTTGGGAGCGTATAGAAGCATATATAAATATG 

GAATATO rOTTTCTTTT TTCCCCTTCTGAAAGAAAGTCAAQCCTCTAATCAAATAGATTG 

ATGCTTCAGAAACTTAACAGAATATTATCTGCAATn'GCCATAAATGCATTTTTCTTGGG 

GAAGTTTCC^TGGTCAAAATTATTAGTCATTGCAAAAGAGAAAAGTTTGACACCTGGAAA 

TGCAGACCCTTTTGCTT 
Sequence 1070



ccctttgaccggccgcccgggcaggtacattatattaatgaaatttatct AGTCCTTGCA 

AACTTGTGCCTATTGATTTTCATTAGTGTAAACT AAAGAGAGAAACTTCACACTGACATT 

TATAATTGTAAGAACTAAGAACCAACCATCAGCTTTTCTATGCCAATCCATGCCC7TCAG 

GAAQTT CTTGA GGCCTTGAG<ytTGCTAGTTTAGTAAATTGCTTACT 

GCTACATTTTTGGAAAGANGGAGAATTAAGTTTTTGGTG 

Sequence 1071

CCCTTAGCGTGGCCGCGGCCGAJGGTACC^AAACTGAAAAAAGATTGTGTATC 
ATTTCACATAAAATGTATTTTGATAAAGTAAATTCCCAAACCATGGTGCTCAGAGGTTGT 
AACAGTCCATGTAAGTT GAAGAAAAAGAGTTATCAATCAAT ACGTGACTATCAATCATTT 
ATTTAATCATTATTTAGTTTTCACATATCTAGAAATTTCAGTAGAAGAACCAGCCCTTCA 
T AAANGTGGCCATT CCCTAT ACC TGCCATCGATTACATT ATTTT ACT 
Sequence 1072 

CCCTTAGCGTGGTCGCGGCCGAGGT AC I rTTTTTTTTTTTTTTTTl 1 1 GAGACGGAGTTT 
CAC TCTTGTTGCCCAGGC TGGAG TGCAATG GCGCAATCTC AGC TCAC CACAACCTCT GC C 
TCCCGGGTTCAAGAGATT CTCC CGCCTCAGCCTCTTGAGTAGCTGGGATTACAGGCATGT 
GCeACCATGCCTGXnTAATTTTGTATTTTTAGTAGAGACAGG 

GGCTGGTCT CGAACTCCCGACTTCAGGTGATCC TCC TGC CTTGGCCT C CAAAAGT GTCAG 

GATTACA<K3CGTGAGCCACCACGCCCTGCTTAAGTTTTAATAAG^ 

TTACGACTGGCAACTTAGGTCTCACAAACACAGAAAAGCTTGTCTTTAAGTATATTGTCT 

TTGAAAAGTrAATACACTCTCTAAATGCTCCATTTAAAATGATTTACTTTATAAATGCAT 

G CACTGAGAGAAAAGAT ATTTGAATGATAT ACACCAC AATG TT AAATT AAC TG N GATTGT 

TTCTAAGTATTGGCACTATGGNCAAT/TTTCTTTTTCTTGGTTATGCTTTTCTGAGTTT^ 

AAAC 

Sequence 1073 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTATTGTATCAGAAAAATGCTAATTAATTrTTT 
GCACAT AAAGGGCA TTTT AAAC TTGGTTTT ATTC TTT G TGAT AAAT ATGGATGAT GAATG 
GTAATGTTAAACAGAATTCAAAAGTTATCAGTTTGGCTAGCCAGACACAGTAGTATATGC 
C T A T AGTCCT AGC T AC C C AGGAGGC T GA GGC CAG AGGAG CC C G GAAGTTC AC G TTT A GCC 
TGGG CAGCATAGTGAGAC AC TG TC TTTT AT AAAAACAACAGCAAAAATG ATC AG TTT GGG 
ATAGTAAGACAAATGGCTrTCTTTTGTTAGGAATTTC TCTATTTAAAG GACTTTT AGGCC 
TaGAGTGGTGGCTTACGCTTGTAATCCCAGCACTTTGGGAGGCCAATTGCAGGAGAATCA

CTTGAGGCCAGGAGTTGG^SGACCAACCTGGGCAAAGTANGGGAGACCCTGTCTTTNCAA^ 

AAAAATTCAAAAAATTAGCCCAGTGAGGGGGGNGCTTGCCTGWGGGTCCTAGCCACCTGG 

GAAGGCTTGGGGGTGGGAANAATTACTTGGGCCCANGAATTTGANGGTC 

TTGATNCCGCGTH*ACCGAGTAKAAGACCCTTN 1 1 1 I N I I NAAAAACTTTAAANTTNAAC 

NTTTTTTA 

Sequence 1074 

CCCTTAGCGTGGTCGCGGCCGAGGT ACTGG^TCACTCTGCCCCA 

AAGATCCGACTGCT AGGAX3CCCCGGCTTCTTCCCTGACCTGCCCGTCTCCT ACACCCTCT 
GGTCCTGCTCCACACTGGTCTAATAACTGGTGTTCCACATTCCTCTAACGTGCACA^ 
AGTCCTGCCC«»TGCTrTTCACCTCCTGTCCATTCCTCTrATAACG 
Sequence 1075 

GATATCTGCAGAATTCGCCCTTCGAGCGGCC*XXXG^ 

AACTT AAAGAAAATGACTAGATACACATCAAATTAAGCTGCTGAAAACCAAAAACAAAGA 

^^aaatttttgaaagcagctagaaaaaaattacacaccacacagaggggaat 
cattacaaagatttttcaccagaaatcagagaagtgaaaagacagcta^ 

GAGCTGCTCAAG<3AAGCAAGCATCTACTCG<WSAT™^ 
GGAATGAAAGTAAAATAAATACATTCTCAAAGAAAAACAAAGA 
ACTGATCTGCTAGAAAAGCTAAGGTCAACATTAjGGCTGAAAGGAAATGCTGCAT 
GAATGAAGAAAGAGCAATAGAAACAATAAATATATAGGAAAACACAAAATAC 
Sequence 1076

CCC 1 1 1 COMCQQCCGCCCGQOCM3GJACTTCACJGATTrAJQGCAMBtTCMCCMTCCA 

TCAGTGCTCAAAGCTCCTTGTATTGTCAG<^TGNfm»4CATTATTTGTCACT 

AATT AAAC TGCCAAC TAGT AGCA I MUM! 1 GTGTCTGATAGATTCTTCATGCAGAAAGA 

ATAAGTAAAATGAGATGGGACACAAATCTGAGTATAGCATTGTCATTACTTTTTGCTGCA 

CAGATTACTrGCAAGAAATATTCTAGTCTGGCGCATAACA>MkATCCACAAATTCCAGATT 

TAAAAAAGTAGGTCTATATAAAG<XTTATTTAATATTTGGTATATTrTTTAGG 

TTGGCMG*INCCTTTATNATGCCAAGGCATTT^ 

ATAGGAAGAAAATCTTAACATTC^CGTGGTGGATTAAGAAACNCCNCCCCAC^ 
TTGGATTAANGG+K>TrTATTAAGTAAAAG<rrTACCGTmAAGTAAG^ 
AATNTTTA 
Sequence 1077 

CCCTTAGCGTGGTCGCGGCCGAGGT ACAGAGTAACCATGACTTACTAGGTGTTATGATGA 
AGGTGTATGTGTGTGTATATGTGTGCATGCATGTAGATAAGTGTGTGCATTTGCACACAT 



GGACTC TT AAGATTGTCTTGTAATTGTCTTTTTG TTGTTG TTGAAAATTAAGGGTGTATA 
TTAAAGGTAGTTTTTACCCAGATCTTATATGTGTGATAGCT CACGTCTGTAATCAGAAAC 
CT ACTGTTT AATGGCCACCXAATTGCCATTAGCTTCCTAGAGGGTGATTTAATAAACTAT 
CTTCTTTAAAACTCATTTAAAATTAAGAGACATGTTTGCATACAATGGATTAATGACGTT 
TTC ACACTAACCC<^AAAGTCTGCTTGCACTTT CTTTTGT AGGCCT AACATTCATTTCAT 
ATGCATTGATTATTATrGTTGAACTTGCATTAATTACATCG^ttSCATATATGGACATACAA 
TGTC^TCTGCAGAAATTAAGGATTTTTTA 
Sequence 1078 

GAATTGGG CCCTC TANATCNTTCTCNACCGGNC GCCANTGTGATAATTCTCCTNTAATN N 
GCCGCCCGCCCNGGTACAGACTTTW5TTCX:TTTGCTTTT AI H I 1 1 1 M I I I 1 1 1 GCATN 
GATATGAATAGTTTCACTAATTCCATTCATGGTTCTGTAAACNTTCTTAAAACTTTGTTT 
TAT GGGA TTATCAGAGTAACAAAATAATGTAGTCCCTTTATGGGACTATAAGTAACTTAA 
TGC TTTTC TTTCCCT ATTTCAT ATCC CC ATATTTGGTGCAAT AATTT AATTCA 
Sequence 1079

CCCTTAGCGTGGTCGCGGCCCGAGGTACAGCTCACATTCATGGGGAGGAAAATCAGGGCC 
TGTCTTTAGATAGGAGATGTATCAAAGAATTTGTG^CAT ATTTT AAAATCACAGCACTA 
Sequence 1080



TAGGGAG TCGACC ACGCGTCCGCTGCCTCGCCCAATGGGCTCATAAACAAAGTGGCCATG 
GTGGCAGGGATAGACTTTCTCAGCAACATGGACTTTCACTCACCAAGGCAGACCTGGCTA 
CAG C C AC TGCTGAGTGCCCCATTTTCCAGCAGCAGTGCCCAACAC TGAGCCCTTGAT ATG 
GATCATTCCTTGGGTGATCACACAGC TACATGGTGGCAGA TTGATT ATATTGGACTTCTT
CCATCATGGAAAGGGCAGAGGTTTCTCC TCCC TGGAATGGACACTCCAGATATGAGTTTG 
CCT ATCCTACACGCAATGCTTCTGCTAAGACTACCATCTGTGGATTCACGGAATGC 
Sequence 1081 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCGATGTGGCTGACATTTGGCTGGAGTCTGCT 
AAGATGTTTTC TTATNCTGGATGGAC GCAGACC TGTAACACCCTGTTTTTCATCTTCTCC 
ACCATATTTTTCA TCAGCCGCC TCA TTGTTTTTCCTTTC TGGATTTTATATGGCACGC TG 
ATCTTGCCTATGTATCACCTCGAGCC I MCI N I CATACATCTTCCTCAACCTACAGCTC 
ATGATC7TGCANGT<XTTCACCTTTACTG<KK3TTATTAC^ 

^"rcTATATTCATGAAGAGCATTCCAGCATGTGAANGAGTGATCACCAAGGATTATGAAAA 
GCAAGAGGAAGAAGGANMAAGAAAGAAG 

Sequence 1082

CCCTTT CgggC GGCOQCCCGGGCAGGTA CI I M l 1 1 I I I I 1 1 1 M I II 1 1 INGCTGGTTA 

ACAAATATTrTAATTCATTAAAATAAAACTTAAAAATTCATGCTTAGTCTACACAAGTTT 

AACTT ACTTT AGTCAC TTAGTGAATTG TGAATTGGCTCCCATT AGTGGTCAGGANAATG T 

ATTTGCTGTANAAACCAAATAAATCAAGCTATTATCGCCTTGTGAGTA^ 

CCACGCTAAGGG 

Sequence 1083

CCCTTCGGCCGCCCGGGCAGGTACTGGGAAGTGCAC TTGGACGAACAAAAAATAAAAAAA 

AAAAAAAAAAAAAAAAAATTAAAAAAAAANGGAAAAAAAAAAAAAAAAAAAAAAAAAAAT 

NN NTTGGAAA AAANAAAAAAGGAAACANNHANMGCGGGTTTT^ 

AAA I MMII lAANNANNCCNTTKAANNNTNNffTGAAAATGTGANhnTTNNNNNGAATI^ 
ANC NTNNNTCTT WTTfTGGNTGATTTTTTATGTGTTCCAAATNG T f 1 1 1 T I T T ANNGAANA 
AAAATTTTTTTTTNNGAAGmAHAChrTNNATTNAAAN KATTT ATNCNTN NT AAAAATTN N 
AANAAATTrTAAATNfm^AATGGNNTTNAAANTTTTAAAATTT 
Sequence 1084



CCCTT AGCGTGGTCGCGGCCGAGGTACACATTTTTCTGAAATGTCCCCCGTGATTAAGTT 
GTGAACAAATGAACATGCCACATGTCAACAAACTGAACAAACATG GATTG TT AG TGAC TT 
ANAGGTGCAGGGAGG<^TAGA<»AGAjGGCTACCTGTCTTG^TCTGCCAATCTCCTGTGTCC 
CACACTGGCTACAAAAATACAACCACTGGGTAGGTAGGGCTC^TCTAGAACCAAAATTAG 
GAATAAGGATTG^GAAGAAAACTCAGCAAGGGTGATGAATGAGTTTCAGC^ 
AGTTAGC TGAAGAATGAAT AGGACACAGTGGAT GAAGGAACAANGC TATTCCNGGGACCT 
TTTGAAG 
Sequence 1085

CGGCCCGCCAGTGTGATGGGATATCTG<^GAATTCGCCCTTAGCGTGGTCGCGGCCGAGG 
TACCACCTAACAAATTGGAGGAAATGAAAAGACGAATCAACAACATTTTGGAGAAAAAAT 

tt attc tacttc tagaatttcattact acaagtgc tt agttcttggtttggta.gatgaag 

tgaaatcaaaattg^^tatttggaacattaaatatgggagcagagaatctgtggaattat 

tt3ctg gaagactggcataaatttattgaagaaaaagaattcctagctcgacttgatac 

cttttcaaaaatgtggagaaatttataagaatttggctggagaatgtcagaatattaata 

aacagtatatgatggtgaaatctgatgtttgtatgtatagaaaaaatatatataatgtga 

agtccactctacaaaaagtgctgccatgttgggctacttatgtgcaaaaccttcgcttac 



GGGWCCGNTCTAAAAGGGC 



cccttcgagcggccgcccgggcaggta 
aggg tc tcgctct atcacctaaactggagtgcac t gg tgcaatct cggct cactgcaacc 
ttcacaccccaggctcaagtgtcaatcctcccgcctgagtagctggaaccacacgtgcgc 
accactaaacccagcrg tttaa tacaccatttttaacccaaaacattaagaaaaatatag 
gaacagtaagtagattcattttgtaaacagacaagcttacaagttttctca^ 
gtcatactaaactgggagactgttaacttcttgatggggttaatctctaatatgaagcca 
cagtcatagctaactacaaattacatatacaatgccaaaaatat 

Sequence 1087 

ccagaagggcagacttcaacccagaaacaactgtgaattgtgatg^ 
gtatctgaacaacgaaattatacttatagactactttcttttcacagaacaaatgagctt 
tcttggcttttaacaaaattatcattgaaaactacaaaattaagatca^

GC ATTCA GAGGGTTAATCTTTTGTTTAAAATCCTTCC^AAAGTCTrAAATGTGTTTATAT 
GCCTTTTGGAAAAAAAATTTATTTTATAATCATTTNGGATTTACAa 
TAG T ACC T CGGCN C GC GACCACGCT AANGGCGAATTCC 
Sequence 1088 

CCCTTNCNAG<X3GCCGCCCGG<X> l GGTACATCC^^ 

TCATCATT"CGGAAGCAACAGCGGCAGTCCCCTGCCCAAGTTATCCCACTAGCTGATTGCT 
ATATCATTCCTG<3AGTGATCTATCAGGCACC^ 

GTGGT AAG TGTCTTGACATTCTTTAAGCACTAAAGAAAACTTTTAATTAGC TACC TTGC T 
TCCAGTAATCAAACTAGAGCTCC TCTGCCTTGT GTAAGTTGCT ATAAAGT ATTGACTATT 
AGAATGTCTTGAACTTTGGTTACT GTGACCCAAGTCGGTGCTCAAAGTATATTTCATAG T 
CTCAATTATATAGTAATTTAAGTTCTGAAAAATAGGTTCTG<^TTTGCTATGGAAATATT 
TTGNGAGTATTTACTTTGGAA 
Sequence 1089

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACATATCCCTATCTACTATGTAA 
GCCAAATGAAATGATGTAATACAATGAACTCCTCAGAAAATAAGCTCTGTAAAATCTCAG 
ACTGCCTGTTTATCATATGCTAGAGTAAACTTACATTCC > I TCTTSI I A GAGAAAAATGA 
TGGT AAAATCCA TGCATTAATCAAAACTAAAAACATGAAAAGGCAAGCCAAC TACAAGAG 
AAATACAGTTGGCC CTTGAACAACACAGATTTGAACTACATGAGTCCGTGTACCTCGGCC 
GCGACCACGCTAAGGGCGAAT 
Sequence 1090 

CCXTTTTCGAGCGGCCGCCCGGGCAGGTACCGTGCAGAAGAAGCT ACCAAACAGCAAATAT 
GGAAATAOTCAG I mTTTTTTTI I AAAGCCTCAGTAGAAGAGTGCAGAGTTACACTGTC 
CTGTTTG^^TGCCCCCCTCCCXXTTNCGACCTAAGT 
Sequence 1091

CCCTTAGCGTGGTCGCGGCCGAGGT^^ 

AAC TAGA AAGGCGTTTTTCGI MG1 1 IGI 1 1 GTTTTAAAAG GGTCCTTAGTGCCTCTTAC 
Sequence 1092

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGCGATCTAAAGTTGGG 



AGAAAAGGG<3ATTGATTTTAGTGGAAGAACAAGAATGTrCTGAAATTGATTGTG 
GT ATAATCCTGTGAATATACTAAAACATTGA6TTGTGCACTTTACATGAGTGAATTGTGT 
GGTATGTGAATTTATATCTCAATAAAGCTATTTTTAAAACGAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAGGTNCCTCGGCCGCGACCACNCTAAGGG 
Sequence 1093 

CCCTT AGCGTGGTCGCGGCCGAGGTACCAGGTCACCTG 

TTCCTTGCCATTGAAGAACTCTTTAACCAGTTGCT GAATCTTTGGAATTCGAGTCGAGCC 
ACCAACAAGAACAATTTCATCAACCCGCGTACATGCTAAGACTTCACCAGTCAAAGCGAA 
C^**^*TACTCAATTGATCCAATAACTrGACCAACGGAACAAGTT ACCCT AGGGATAACA 
GCGCAATCCTATTCTAGAGTCC 
Sequence 1094 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATGCCAAAGACTTCGCCATAACT^ 

T AATTACACCT GCTACTGTTTCACTTAGTGGCACTTTGCTTAACCTGTT ATACACAGAAG 

GGGTTGAGAAGAC AAAACACT G TT AACTTCATT AT ACC TTT GACAAAGT AAT ATTATGTG 

ACATGATGTGTTTTCCCCAAAATATTAGAGCTGCAGATTTAGCTGATTCMTTTATGGGA 

CAATTTGTTA TGTGATCT AACAATTTGG CAT AT AAT C T AGAAAGCAGC TTT ATGAT CAAA 

AATTGATTTT ATA T AT AT ACAT AT AAAT 
Sequence 1095



CCCTTAGCGTG GTCGC GGCCGAGGT AC 1 I I II I II II i 1 1 1 I I I I > 1 1 H H 1 1 1 II I AC 
TTCA AAATAA CAl I I 1 1 ATTATATAAAAATGTAAAAATCCAGCAAAACCAGAAATACGGA 
ATATATTTTTCTGGGCTTTCACATTTGTTGATTTTTATTCGCGATCTTTr^ 
TTACACCCTCATCCCCATTTCCAGTCTGATTATACAAGN^ 
GAAT AAAT ACATCAAAAAGAAGAGGCAAAGCTGTGAAACT AAG TTGCA 
Sequence 1096
TCTTGTAATCTTTACATACTTGTAAAAGCATTTCTCAAATTrcAGCTTACTTTCAA^TA

aagttcttactgtctaatatgctctctttaaatttattaagtattttaaaaataccctgg 
ctctttatc tagtttcaa tc taagtatagaaaagcattctctgtaaggctgtcttaaaaa 

AAAGAAAAAAAAAAAAAAAAAAGTACCTCGGC CGCGACCACGCT AAGGG 
Sequence 1097 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCT 
GAGAGTCGAGTGGCT TTTGTAAAGAAGTnCAGC TACA T^ 

CCATCTTT ATCACCAGAATGAGGAAC TCCTGGAAG TTAACTGCACCATCAGTGTTGATAT 

ccaactctttgaaccagacctctgcalcc 1 1 h i cct gat atactgaggacactcggtct 
ctagcaatttcttcaggtcatcc 

Sequence 1098

ccctttcgagcggccgcccgggcaggtactaccattcc^tagaatggaatattacccgat 
gaaaaaataaagttgaacacatgctacaacatggatgaac i i reel I A TAAGAACATTGA 
aaagaaaatgccaaaagaaaaatgagttttagctcaa ai i r 1 1 i r aagaggcctagcctg 
ctcaagatatcctgttaaaaaaamamaaaaaaaaaactcttcccatatctaaggtgaaa 
ataaaaaacatttttaaaagtt><aatataaaagaatgaaat^ 
tatacagaaattatattaatgggtgg 
Sequence 1099



CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGGAG^ T GGCCC 

GGGATGAAGGCCGTGTTGGGGCTAAACCACACTCTGGAATTCrrGT^ 

TGTGTGAACTTGAGCAAGCCATTCACCTTTCTTAAGCCATTTTCT^ 

CCTCACCAAGTATTCAACGAGAACATGTAAGTGAAATGCTTCACAAAATX5CCTGGTAAAT 
AATAGATGCTTAGAAAATGGTAGAGAGAGAAAAGAGCAGTCTCTGCCtrTT^ 
CGGCCGCGACCACGCTAAG . 



GG<X5NCCCGGGCAAAAATKAT^TTGGGGGG*3GGGGNCCCC 



CCCA OGTTNGGT TTGGGANTGGQGGNAWTT AKTTTCTTf GGCCAACOAAAATTTTCCCCC 
Sequence 1101

CCCTTAGCGTGGTCGCGGCCCGAGGTACTTGTTGGCT AGGAGCTGAGCTT ATCAGAAGAA 
ACAACAGCATTACAGGAATTGTCTTATATGTGGTCAGTTGTAAAGGTGATAAAAATTATT 
CTGTAAATCTTGAAAACCTAAAAAATTTACGCAAGAAAAGACATCACTTGTCTACTGTAA 
CATCCAAAGGCTTTGCCCAGTATGAGCTCTTTAAGTCCTCT GCCTTGGA7GATACAATCA
CAGCATCACAAACTGCGATCGCTTTGGATATTTCCTGGAGTCCTGTGGATGAGATTCTTC 
AAATCCCTCCACTCTCTTCAACTGCAACTCTGAATATTAAAGTGGAATCAGGAGAGCCCA 
GAGGTCCTTTGAATCATC TCTACAGAGAAC TGAAATTTCTTCTTGTTTTGGCTGATGGTT 
TGAGGACTGGTGTCACTGAATGGCTCGAGCCCCTGGAAGGCAAAATCTGCTGTTGAACTT 
GTCAGGAATTTCTGAATGACTTAAATAAGCTGGATGGGATTTGGTGATTCT 
Sequence 1102 

GATATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCGCGAGGTACGCGGGATTCCC^ 
GTTT 1 C T 7 CTAG AAG TTTT ACAGTTTTACGATCT ACATTTTTGGTCT ATGACCCATTTTG 
AGTT AAATTTTG TGTAAGGTATGTTAT ACATG TGGAAGTTCA I TTTT TTGCATGT AAA T A 
TCCAATTGTTTCAACACCATTGGTTGAAAAGAGGGTAT 

GCCTCAATTAAAATCAGTTTACTCT ATCTGCATAAGTCTACTTCTGGGCTGTCTAGTCTC 

TTTCATTGATCTGTATGTCTGTCX^TTTTCCAATACCACT GTCTTT ATT ACT GTAGTTTC 

ATAGTAAACCTTGAAATCATAATTCTATAGTAAGTCTAAAAAAATCACACAGGTTGGAAA 

TGGACAATTAGTATGCTAA14ATCAGAGCAATCTTGTTGGTTCA>IAATGGTT^ 

AAT ATT AGCN CAG TGNNCTT CACATGCC TCATTGATGATAAC TGGAGCTT AATG TGAA 

Sequence 1103 

CCCTTAGCGTGGTCGGGGGCGAGGTACTTTGTTAGCGTCTGCGTGTGTATGGAAAGTTGA 
GAAAAAATGGCATGAAAAGATCATGATTGGATTTTCTTTTAAACCTGCCCTT CTGTAAAA 
AATAGTTTATATATTTTTAAATTAGTAGGTATGTGTGGCTTCCTTTTTTCCTAACATTCC 
CAGCAAATTTTTGCTGCTAAGACT ATCACTGTT AAAGTGAAAATTACAGGGAAAAATGTG

ATGAATATACCGTAACTCAAAATGTGATATTTTCTTAAAATCACTCTTTTATGCTTTAG^ 

AACTCGTTCGTCTCCACTTTGATTATTaGTGTAAAGA^ 

TGTAAAATTTAACTCCTTGTCn 1 1 ACTTGGGGGACCGGGGCCCCT GGAGGGCTTCCCT A 
CI 1 1 U-OCACTATGTTAACAGGT AAATNCTGATTTTATGCCTTT AGTTTGACTTATTTTT 
^J^CHAAATATTAGAAGTTATTGCTTTT AAAATGTTTAATGT GGGACTGAAATTTTGATCT 
TTTKNTTNAGAAATCTATGAAGTGTATTCA^ 
TATTTTGGHAATTCTGAAATTGWTTTGGCATCTGG><CCAAAAACCT 
Sequence 1104



acttgcaatgttttg^cattaagagagagactatacattgag 
tctagcctgttgtccaaaactgctt ataaaatttagcaact aattttcacttttgacaac 
tatttt aattct agaaaat aggttt at aaagatttrcttaaagtgtt atctatcc ttcca 
atgacttattataaattttagaatgtatttctatagggtggaaaaatctcctt^ 

^^GAACAGTTTTCATGyVAGAACATGTTAGACCATGTAGAAACATGGGTACCTCGGCCG 
NGACCACGCT AAGGG 
Sequence 1105 

CG^AGTGTGATGGGATATCrrGCAGAATTaSCC 

I 1 1 1 1 1 11 1 1 1 TTTNTNI IIIIMI J ATATGGCAATATTTATATTT ATTTTTGCAATTCC 
TTGGATAAAAAGCATTTGAACAATGTTTGGTAAGGMGTT ATTCTCATAAAAACTTCTTTN 
AAAATGAAGGTTTTTNTATTTTGCACAAAAGTTAAA 
Sequence 1106

CXXAATTGGGCCTTTNGATGCTGCTCGAGCGGCGCA^ 

CCCTTAGCGTGGTCGWMTTWGAGGT AGMACCTGCATGCTGTTT ATGCACACAGAGATTTQ 
AGAACCATTGTTC T GAATGCTGCTTCCATTTGACAAAGTCCGTGAT AATTTTTGAAAA^ 
GAAGCAAAGAATGGCGTCTCTTTTTATG TTCAAGCTTAT AATGAAAHTCT G TTTGTTGAC 
TTATTAGGACTTTGAATTATTTCTTTATT AACCCTCT GAGTTTTT GNATGT ATTATTATT 
AA 

Sequence 1107 

GATATCTGCAGNNKTTCGCCCTTTTOAGCGGTCGNCCG 

CTAGTATTTTGAAAAAGTATAAAAATGTGGCCGGMCGTGGTGACACATGCCTGTAATCTC 

AGCCACTTGGGGAGGCCAAGGGX>NGGAGAATCGCTTGGAACCTGGGAGG 

G 

CAGTGAGCCAAGATGCAGCATTGCACTCCACCTGGGCAACAAGAGTGAAACTCTGTCTCA 
AGGGTAAAAAAAAAAAAAAAAAAAAAAAAAGTAC I I I 1 1 M 1 1 H m I III 1 1 I MGGG 
TCATTAGTT ATTAATTTTACNCMAGTT AAGACTTGAAAAAATGAATGATATTTAAATCAT 
TG TGAC7T ACTGAGAAGX^AAGAACCAATGAG TGAGCCCAAAGGAGTCT AC T ACCCAT ACC 
TATTAAGGGTAGGGAAAGGGTTTAAGT 
Sequence 1108 

CCCTTTOG^GCGGMCGTTWWGGGWGKIWCAATGAAATG TCTTTT AAAAAAAGTTTGTTGT 
AATTGTGTATGTAATTCTGACAGTAA 

CCCATGTTCTGGA TCTGG GGACTGCAATATTACAGAAATATGCAAAAATAAGTTTAGTGC 
TCAGAGATAAATAATTTTNCTTATTTXAATGCATCAATGCGC^ 

AAGCCAACCACTGCTATATGCAAATAAATAAAACATTTGACAACACTTTTATAATCAA^ 
CCAACATTATACAAAAAATGTGTGGCACCGTGCACATACNTGTGCATATGTGTATGCAAT 
GCCT ATTTAAGAAAAAAGGTGTCTTGATGAAAATGATTTTGAAAATAGTCACTGACACAC 
ATTATATACAAAACCTITTATATAAAAAA 
Sequence 1109 

CXX;TTAGCGTGGTCUjCGGCCGAGGTACATTTTGGGCCTTTAATCCC^TCTAAA 
CTGTT AACGAAACTCAAAAACAGAAATACCTATATTTTCTCGCTAAATCCAATTGTTACC 
T ATGATGAGT AAAGACAC T AGATC TGCAGGTC C T AGT ACAATCT AT ACAT AAAAGGC C TT 
CAGATTTGAGGCACAAAAAAAAAAGGGCAAAAAAAGAAAAAAAAG 

ACACATTTCCTTCTTTTATCT(^AATATGAGAAGGAATCCTTTCTAACTCTAATAACATA 
TT AAC AAGAATT AAGAACAC GATTGTCGGGGAAC TCA GATGTTGGCAAAGCTTAAAAAT A 
AAAAAACAAGGGCTGGGT GCAGTGGCTCANGCCT AT AATCCCACACTTTGGGAGGCCGAN 
GCAGGAG GATTG CTTAAGCCCAGGAGTTTGGGATCA^ 
TATNCCTATCTTNTNCNAAAATT^ 
AGCCCCAGCTACTTANGA^IGCTTAAATGGGGAGGA 
7TGCNTGAGCCTTTGATCAAAACTTTACTTTAACCCTC 
TT^TiftAAA^AAMA A AAAAGGfiVUS AA AAAMA^IAAAWGGGGAI 



CCCTTACCGT GGTCG CGGCCGAOGTACTGGCA 

TTATTCCACCAGATACCCTAKATC A r C I C TC1 CAAGTTCg1aGTTCCa5IuS^^ 
GCAGGGGCAGAATGCTCCCAG ICTCTTI GCTAAAGCATAGCAAAAATCACC ITTGCT O CT 
CCAGTTCCCAATAAG TTCC TC A T C ICI U 1 1 G GAGACCACCTCAACCTGGACTTCATTGCC 
ATATCAAGATC^GCATTTTGGCAAAGCCATTCAGCAAGTCTCT 

C CCACAI 1 1 IU-1UICI IC1 1 C I GCA CCCTTCAAACTATTTCAACCTCTTCCTGGTACCT 

AAGTTCCAAAGGTACTCCACATTrTCAG^ 

CT^CCTXX»>IGt3GCGGGCCMTCGAAGGG^ 

Sequence 1111



CCC7TAGCGTGGTCG«3GCCG^GGTACTrrt^ 
JCTTpCTATGTTGCC**^ 
C£TCAGCCTCCCAAAGTGCTAGA^ 
ATTTTTATAAATGAAGTTTTATWGGAACA^ 

GGCTTCGT ATGCCATATAGCAACAGAATATATTAAACATTTACT ACCTGGCCCTTTGCAG 
AAAATUT 1 1 GACAGCTC^TGCTGMATAAACATAAAATCTGCCAAAAAATGCTGATATTAC 
CXX^ACATGGAGAAACACTGGA ACCtrCTCTTCAGAAATCACATGCCAATTTAAATATTACT 
ATX^AGACAAATACACTCTG A I I I I I M I I C CTATTOCCTTTCTTTT A I II ILII IIII G 
AGAOMC«TCTTGGCTCCCXTG*CX>AGCT^ 

Sequence 1112
GCGCTNGTGTTTC>ATCCCTTACGCNCCGCAGCCN^ 

TMCCTGCT ACAATGGGATGGCCTGGGGGATTAATGGAACTTTGCCGGGACCAACTT ATGA 
TAAGTGGGAAAGCACTTTAGG<XrrGATCCCATATAMGTGGTG^ 

AAATGCACACGGAGGATAAMCACCATNTGACACTGGGGGT GGTNCAG ttggagc TCT GGA 
AGGAAAAGNCT1CC1 GGGGTGGATCTCTAACAATATTAATAOCTCMGCCGCAOCCGCT AA 
GGCGAATTCCAGCACACTTGCCGGCCGTTACTAGTGGATCG 

Sequence 1113



CCCTTAGCGTGGTGGCGGCCCAGGTA C 1 1 I I II 1 1 1 1 1 I II C 1 1 H 1 1 1 H It 1 1 G AGAC 
AGAGTCTCTCTCTGTCACTCAGGCTGGAGTGCAGTGGCATGATCT CAGCTCACTGCAACC 
TCCACCTCCTGGGTTCAAGCAATTCTCCTGCCT^^ 

GCAGGCACCACCACACCCGGCTAATrrTGTATTriTAGT AGAAACGGG G 1 1 1 C TC CATGT 



TGCTGGGA TTACAGGCGTGAGCCACCGCGCCCAGCCAC TTCTGT ATTTTT AAAAAAGTGG 
TAAGATTTGAGT ATT ATA CTGGG ATAGAACTGAAGTTGGGGGCTT AATTTGATCT ATCAG 
CTT ATTGAAAACAAGGACCTTTT AAAAAAATGGTTTTGTT AGGTT GGAAGAAGTGAAG TT 
TTAATTCCGTCATTTAAKTTAGCCMAGTATGTTGA \ I I 1 1 III IGGNGAAAGKGTACCTG 
CCCCGGGCGGGtMGTTCGAAAMGGG 
Sequence 1114

CCCTTAGCGTGGJCGCGGGCGAGGTACCACAMGGAC 

TGAGGCAGGTCTTT GTCAATTT AAGTAAT CCTG TCAGATGGTG T ACCAATCTTGT AACTC 

acgacaaagcact gttgctgagat act gtgatttattttcctt AATGGGCaGTTTTTTTA 

tatatatacgttccattttcagacaggtggtgctttgagt^ 

AAACATGGATCTll I 1 1 1 iatttaactccci 1 1 rcTTCi mctaaggtgcttaatttccat 

GCTrGACATCGTACCTGCCCGGGCGGCCCmCGAAAGGGCGAA 
Sequence 1115

GTACAGAAGGGTTTCACCATGTTCACCACACTGGTCTCAAACTCXTGGTCTCAAGTGATC 
CATCTGCCTCAGCCTCCCAAAGCACTAGGATTACAGACTTG^ 

ATCACTTTATATT TTCAA GAAGGTGGTGAGGG IGIUI I G GTGCCTGGGGTCTCTAGCTGA 
AGAAAAGGGAAATTTTTCTATCTCTGGTAATGTCTTTATGGATATAAACCTCA^ 
GGAATAGCT ATGGAATGTATGCTTCTGGTTAACTAAAAATTAACCAGTAAACACTCTGTA 
NTAACCATTACAGAAAATACTTCTGCTTT AAAAAAGT acctgcccmggcgggcogctcga 
AAAGGG 
Sequence 1116

TNTCTGCAMAATT CGCCCTT AGCG TGG TCGCGGCCCGAftG T ACCA T CCCAAGGACACAAG 

TTTCCAGGCAGCAGCCTNCAAGAATTTTOTTaGAGATGTCCC^ 

TGTTACATCTGGACTCTGGATTGCAAGTGTAAGGAAGA^ 

GGAACAAATATTGGCAACAGAGCCCCC^GAGGACAGt I GTC C C 1 1 ITCCAACAAGTTAAG 
TGQAAAATGC 1 0 1 1 GCXATGGGAGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 1117 

TTTTAAAAMMCAI 1 1 1 II II IMCAGGGGGKGAAAAAAAAAAAAAGGGGGGGCCAKTTTTC 
ANCCTTGGAAAAAATGGMKrrrTAAAAAATT 
NANMACC Wt X iC 1 1 1 1 1 NAAAAATA 

'"III lAANI 1 1 1 1 1 rAAAAAACOCMCCOCMCNNCCATTTTTAAAAGMGGTTXTNTTTT 
NAAAAAAAAATAAANATTGGTTTTTTAAAAAAAAAAT^^ 

CCAAI I 1 1 UNI 1 I AAAAAAAAAAACC CGNfrr TTTAAAAAAAAAGWGGGGAT 1 1 1 TI C CA 
. NM TTTAAA OOGGGAAAAAAAAA G GGWl I II 1 1 1 I G GGNNAAAAAAAA A AGWCCCCCCCCA 
AAATTTTTGAAAAAMA 



AAAAAAAAAAAA>IGGGGGGGGGCCCCCCCGG G 1 1 II M l I IT AAAAAAAAAANATTTTTT 



CCCTrAGCGTGGTCGCGGCCGAGGTAC 1 1 1 1 1 1 1 1 I I 1 1 1 1 Tl 1 1 1 11 1 f T1 1 A AAGAAA 
AAGTTGGCCCAGCOCCAGGGAAT AAA TTTT GACTGTCT AAACAACCACAGACCAAGGGCC 
AAATCTCGCCCTCTGACTGTATAAATTAAGTTTTAC^ 
TTATCCATTGTCTACATACWCTmAGGCTCGATGGOl^^ 

TTATCT AGACAAAAAGCCTAAAATATTACCGTTTGCTCTTT ATNGAAAAAGTTTGCCATT 
CCCTANTCTAACGGTTAWATTCTGACTTATCATGTTATCCTACCCCCCCCCGNGTA^ G 
CCCGGGCGGCCG 1 1 1 NAAAGGG 
Sequence 1119

CGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTTCGAGCGG 

AA TATG GAAAGGTAAGATCCATACCCAAAGTTAGGTAACTGTTTGAGTTGCCCA 

T AGTTTAAACACT GTAGAAGT ATTAR^GAGATCCTTT AGGGAATGATGCAAGTGGCATTTG 

AGCTATTCATTT ANAGAAAAAGTTT AAAAACATGCNGTCT AAAANGGAAGAGATNGAGGC 

CATTNGAAAAAATNTTCTTAAGATTAACAGCTGGTTATCXXACTGGCrrAA^ 

TGWGGCANAAACCAC CGTK ITGGCTAAACAAAGWGGG^ATGGCGTTTAA^ 

GGGCAAGGCTAAANATTTTGAACTTAATCCTA 

TCTTCATGAAAG I N 1 1 1 1 1 NT T CACACTACCTAAACAG»TTTATACATTTGCTTTTATCTG 
gagggatggaaaaaccaaaw rTTTTrm TGCCXTTTAATCCTTAAAATTGAAACTAACT 

Sequence 1120 

CCCT TAGCGTGGTCGCGGCCCCAGGTACACACAT C TTTTT GAGATCCT ACCTTCAGTTCT 
TTTGAGTATATAGCCAGAAGT GGTATTACTAAATCTTACGATATTTCTATTTTTAATTTA 
TTGAGGAACCACTGTAG 1 1 1 1 1 I^TAGCAGCTGCAGCATTTTACGTTCTCACCAAGAGTG 

cacaag^gttccgagcttcccacatcctccccaacacttgttat^ 
attgcagccat^tagtgggtgtgaggtgacatttcattgkggttttgat^ 

CTAATGAGGAGTGATGCrGAGCATCTTTrCATATGCTTACTGGTCAT^ 
TTGGAAAAATGTCTATTCAAGTCCTTTGACTATTTTAAAAATTG 

CGTTGGTG^rTGACTTGTAGGAGTT><CTTTCTATA7TCTGGATATTAATCCCCCTATCAGA 
TATATGATTTGCAAAAATCTTCTCnAATTCCATAAGGGTA 

* " " " ATG>WTAGAAAGNTTTTAHG>TrTTGAAANA*tCTAAATTAT(>iGGTTTTA 



CTTTTGGGGGGCTGGG 

Sequence 1121 

AGACGGGGTTTCACCGTGGT AGCCAGGATGGTCTTGATCrTCCTGACCTCGTGATCCACCC 

ACCTTGGCCTCCCAAAGTGC TGGGATTACAGGCGTGAGCCACCGTGCCGGGCTGAAAAAT 

AACCCTTTAGATATCTACAGC7T7AAACTGTGTGCAGTCATGAAAAGCAGACATTAGAAG 

TCATTGGCATTTAATAAATTGCAGTAAAATTATACAGTAAATACATTACAATCATTAATA 

ATAGGCTTT AATGAGAAGAATTTAATAAATAATCATTAAAAAGACAGGAGAATTTTATTC 

TGGTCTCAATATGGTWGCTGCTCTI CTT ATCAAATCTATAATAAAACT ATNTGACTATNA 

TATAGATTTCAGGAGCTAAAAAAAGCCTTATATTTTCAAAATTAAAGAACI^TTTTAATT 

TTGCNAAATCAATNAGCATTACTGAAGTTTAAGGAAATTTTGGAATAAAATATATGGCAN 

TTANATNOCGCCTAAAAAGAATGNAATCTTAANGATTNCTTTTTGGCTCAGGGGCNTAAA 

ATTCCA 

Sequence 1122 

NGCCCTTCGGNTTTCCGGGCAGGTACGCGGGGGCGGCTCGTTCAAGA 
CCAGTTGCCTGACGAGAGCTCTTCAGCAAAAGCCCTT GTCAGTTTAAAAGAAGGAAGCTT 
ATCTAACACGTGGAATGAAAAGTACCTCGGCCGCGACCACGCTAAGGG 
Sequence 1123

CGCCAGTGTGATGGGATATCTG<^GAATTGGCCCTXrCGAGCGGCCGCCCGGGCAGGTAC 

CTTTTATCCCTCAAAGGACCCTTCTT GGGTriT GAATGGAAGOTTTA 

TGTTTTCTTTATTTTGCCACTTCCATCTTTTTTTGTGGCCCTCGATCCTATTTTTCC^ 

ACTCCATGCTTGGTTGGCCCTrATAAAACTTGTGCCCAAAAGATTGTGGATTAGACTTTC 

CGAGGACTTACCTGTCCTAGGGGAGTAGGCAAGCACTTCACTAGGGAGGGGGTGGGGGAA 

^GGAATGAGAGATGA^AT 

ATTTCAAGTCT ATTGTTTTTAG TAACTTCATTTATAN NCC TCCATAACAAGTT AGAAGGA 
TGTATCTGCT ACCATTTATTCCTATAATTTTAAGAAAGNT GGGGCTT GACATTATACTCA 
TTTAGTGAGAGTANATGCCAAAAAAAGTGGAGGGG 
Sequence 1124 

C^^^^CG^NCGGCCGCCCGGGCAGGACGCGGGTAGGGCAACTTGGATGTATGC^ AGGG 

TTCGCAAAAAGT AAACAAAAATA CAAGGGAAAAAAATT ATTGACAATGAACTGCTTTGG T 

AGTGATTTGTGA 1 1 1 IUI 1 1 1 1 1 CTTGATTAGTAACCAACAGCACAGCCACCAAGAAATT 

ATGCACATGTGGGACCACGTCAAGCTGAAGCGTTrGTGCCCAACAAAGGAAACAATAAAG 

AAAATAAAAAGGCACACTAAAAATTACAAGTTTGGGATAAGGGATTATTTTTGAAAAGGT 

ACCTCGGCCGCGACCACGCTAAGGG 

Sequence 1125 

C CCTTAGCGTGGTCGC G GCCCGAGGTACAGAAAAAGACACATTTAGAT AAACTGAAGCAG 
ATTAAAGTGACTTTATAAGAGAACATCTTTGTTTTTATGTTTAATTTCAAGTATGGTTAA 
GCACTAATTTAATTCAGTGCTTTCTGCTTATTCTGTTTCTAGTAACTCTTACAGAAACAA 
G TGTAGTCA GTAGCCAAGATACATCCATGTCAGCCTATATATGACTTACTAGGAGGGCTT 
AAGI tin I AAAA GAGATGAAAAATAAAGAGAAGGTCTAGTATTTTCCTCCCACATTCCA 
^C^^TCATTTTATGTGCCCCCTTTGGGTGAGCACATTCCATGTTGTAC^CCATTGATCA 
T AGTAGTCAGAGCATGGAGCTCTGGAGTTCAGAAAAANT ATTTTATTATTGGTGGTATGA 
CAAAAATAATTCCATGAAAAAAAAAAAAAAAAAAAGTACCTGCCCCGGGCGGCCGCTCGA 

Sequence 1126

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTACT b I Id M I A AAGCTGGAGAAGCCTC 

TATGGCTTATrCCCTTAGAAGCAACAAATGAAATGATGTATAAAGCATCAAGTCAAAGAT 

ACAGAGAACTGGAGACATCCACTAATTGTTATGACAATCAAAGAAGTCATCTCCGTAAAT 

acctaagggttgtctaaggctataaaggtcaatttgaaagccagttagggatccacccgt 
gtttca taaa agtgtcttacactcatgtttggctttcaagaagtgatatgcctactaaag 

CTGTTATTTT GAGACTATCCCGCG TACCTCGGGCGGC GACACGC T AAGG GCGAATTCCAG 

CACACTGGCGGNCG 

Sequence 1127

CCCTTTCG AGCGGCCCGC NCGGGCAGGTA L I IIII H I M lllll ll l ll l l ll IGGGCT 
CCAA TTC C ATTTT AATTTTG I I Id 1 1 G T T T GTCTTTCCTCAAATATACAGTCCATCACC 
TTG»=TCAGTG^T^ 

GCCCATCTTNTGTGCTCTGTGGTACTNCT ACACACATAAATTCAGGAAATGACTAGATGA 
GCCTGAGTTGGCTTTAfrrATTAATGTGCAAATACAGTTTTCTATACCAACAAACGC 
Sequence 1128

CCCTTTCNNTNNTGCCGCCCGGGCAGGTACTATCX5ATTGGGTCGGGGGTGATCTATTATC 
ATTGAGTAGGGAAACTT ACTAGGNTAAATAGAAAGTATATANAATGTATTTGGTTATAGA 
TATGTGAAGGAAAAGGCATANTTATATGGTCATCCA TGCTGGGGAATATTTNGNAGNTNT 
N 1 1 1 1 G T I GAGAGAAATMGNNCATfrrTGGATCAATAGMATTAGACAAATATCTTGNGCAT 
CAAGAGACCTGGAAAGATG 
Sequence 1129 

GATATCTGCAGAATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGT^^ 

GGCTAG TG T AATTCAGTC TTTTGAATAAATGGAAAAAATAAATTGTATGTTATTTTTATA 

CAGAAAAAAAGGCCTTAATATCATAAGG 1 1 1 II I I A T AGCCCTCAAAACTGATTTTTAAA 

T GGAGG TAGGCAACTGAGAAAATAAGCATTTAAATTAGTTTTCACCCCAAAGCCCCCCAA 

AATTTTGCTTACAAAATT AGGGT ACCTCGGCCGCGACCACGCT AAGGG 

Sequence 1130 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACITtNl I f l I m TT 1 1 1 M l TT rT C CTTTT 

UM iii ii i ui iii n iii niimi N in ii nA GAWAA CM i i i iiin iii A Tw 

GNNA WMNMA ATTTTTffrMCWGGGGGGMTTAAAAA A I lll l l l l I N NNNGNTTTCCNNWT A 
NTNNATT TTAANGWW WGGGN NWTWTTTN W NCCCTTTGWTNTNGGCNAAAAAAAAAAAAAT 
lllllll NTTAAAAACCNTAAANGGCTTCCCTNAAR*NAAAAA^ 
AAAAATAAGGNAAANAAAAANTTTTT 
Sequence 1131

CO CTTTCG AGC GGCCCG CCCGGGCAGGTACCCAGAGGGAGAGGCT AGCAGTATTTTTAAA 
TTGGTTTCTAAATTTTTTATAGCTTGATGGTAGATAACACATTTGCTTCA 
CTGAAAAACCAATCCTCAAAAGACCTCTCAATTAGAATTCTT AAATGACAATGTTTTCTT 
T ATCAT ATATTTGAGAGATTGATTTAAAGAAAAATAATGCTTGACT ATCTGAAATAATAT 
TTTAACCCT ATCATAAAATCTCTGCCTGGTAGAACAGCTGACT GTGGAAGGGTAAAATGC 
AGAGAACCAGTCATTGGGATCTCCCTTCTCTACTTTGTACTGAAATCTTGAAOCTGTAGA 
ACATTACTTATCACTGTGTC C TTTCTAATGGGGAAAATAAT AAT AAACACTT GCAGAGT A 
11 Tl T f AAAAG TTTTTAGC TTT AAAAAAAAAAACC 
Sequence 1132 

GATATCTGGAGAATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGTACATCACATGGTGAAA 
GCAGGAGCAAGAGGGATAGAGGTGGCATACACTTTTAAACAATCCGATCTCAGAAGAGCT 
CACTCACTATTGCAAAGATAACTCCAAGCCGTGAGTGATTGGCTCCCATGACCTGAACAC 
CTCCCAC CAGGTCCT ACCTTCAGCATTGGGGG TGACAAAGCAACATGAGATTTGGGCAGG 
GATAAATATCCAAATTATATCATTCTGCTCCTGGCCTCTCCCAAATCTCATGTCTT CTC A 

cattgcaaaatataattatgccttcctaacagtccccaaaagtcttaactcattccgact 

TTAACTCAAAAATTCAAAGTTGGCCAGATGCAGTGGCTCACACCTATAATCCCAGCATTT 
TGG 

Sequence 1133 

GAT AT C TGCAGAATT C GCCCTT AGC G TGGTCG C GGCCCGAGGTAC TGAAACT ACAGG TGT 

GAGCCACCATGCCTGGCTTAAACATTTGTTTTrAATTAGCCAGGCTTGGTGGCACACATC 

TGTAGTCCCACCTACTCAGGAAGCTGAGGTGAGAGGATCACTTGAGCCCAGAAGTTCAAA 

GGGGCAGTGATCACTCCArrGCACTCCAGCCTGGGTAACAGAGTGAGACCCTGTCTCGCC 

AAAAAGAAAGAGGTTAAGGAGGAGAAGACTCTAACCAAAAGAAGTAACTGATATTATTGA 

AAATTATTTGATAGGAATCGCAATTATTTTGGATAACTATTTTTACATATTGTAAGCCAA 

CCAAATAGGGTCTTAAAAAGTTTCAAGACCAAATGATTCATGTTCTCTACTTCAGCCTAA 

AAAAAAGTTAAAGAATTCTTCAATTACCAAAAGAACAGTTATTCTATANTT ACAAAAAGA 

CTTGAAACTTTTCACCTGAATGCATCTCTTTGTTACAAAACCTTTAAAGGAGGTAGGGGG 

GAACTTCATTGATTCATCAATGCTGNCTGGTTTTTTAAACCCA 

Sequence 1134 

A GTGTGATGGGATATC TGCAGAATTCGCCCTTTCGAGCGGCCCGCCCGGGCAGGTACTTT 
N II I I I I II 1 1 I I M I I I rNANGAGCCTCTGGTTACGTTNNCTTGTATATTTACTTTCTC 
ATCCrrTCTCTTTTCTTACCTTCCTCrTTGACTCCTTATCTTTCTATGCCAACCCTCTCT 






AAAAAGTCAGTATGTAATATAGTTGCTCTTTTATTTAAAAAATTTTAAGATTGATATTTG 
CTT ACTATCATGTTACGAGGCTTTATTTATATGTGTATTACAAATATATTTGTTAACTAC 
TAGCAAA TATTT TATGTAATAACTTCGGTATTTTATTAAAATCCTGTTTTTAAAATTCTG 
AAATGTCATTTT AAG T AT AGGAGACAGG TGAAATTG TTCAAGG TT ACTAC T AAACCAGGG 
AATAAGGGAAGCTTAGATTCTTGGNCTTrrTTGAAAAAAGAAAAATTTTA 
Sequence 1135 

CATGCTCGAGCGGCCCGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTAGCGTGGTC 

GCGGCCCGAGGTACAGAGGAAATGGGACTTTGCAATTATATTTTTCTAAGT 

TTGGTCTCACTACCCACATCACCTGGAATGGTTACCAGGCCTCAAAGGACTGCCGG 

GCTAAACAGCTGATCCGCTCTCTGAAGCCAGACAGTCTT ATCTGGGAGGTCCTTT ACAGA 

TGCCACTGTTGAGGGCCCGAAGCTGAA>fAAAAGTGACTCX^TCCTCAAGTAGTCCTTA 

TTCCTTTTGAA<X^AGCCTTGCTGTTCTNGGGC«K^TTTGT 

NNNGGTTCTTTAAAAANAAAGNGATGGGGTCCTTTTAAGGTAATTGAAATAAGGTGTT^ 
ATGGTGTTAATTGGGTGATGATGTACCTNGNGGCNGNCTGGATAAAAGC 
Sequence 1136

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACAGATGAAGATGTGTT AAATATCTCAGCAGA 
GGAGTGTATTAGAT AAATGGAATTATGATATATATGATATACAAACTTTTTTCT ATTTAA 
AAATATATTAATGGATCAACTTTAAAATTGTTAGTTGCCAGTGA I C 1 1 1 1 H G GAAAACA 
AAAATGGGGCATTrGTTGATTTATTTATTTTCCGTCT CTAATTAGTT ACCTCAG TTTGAT 
TGAAGCCAGTGAAGTTGTGCTTTTCCTCT ACTTCTACTTCCTCTCCCCGACCTTTTTCTG 
CCCAGTGTAGGGTGTATTCTTAAATTCAGACAGGGGGAGGATTCTTTCACATATNAC^ 
GCTACCTCCCAATCTGGGC^SAGTTTTTC 
ATTCCTGAAT AAAGGCCT AGT A 
Sequence 1137 

CCCTTTCGAGCGGCCGCCCGGGCAGGT ACAACCTTGGCTCACCGCAACCTCCGCCTCCCG 
GGTTGAAGCGATTCTCCTGTCTCAGCCTCGCCAGTAGCTGGGATTACAGGTG 
CACGTCCTGCTAA I r 1 1 7 GTGTTTTTAGT AGAGATGGAGTTCACCATGTTGGCAAGACTG 
^^ C ^GAACTCCTGACCTCAAGTGATCCATCCGCCTTGGCGTCTCAAAGTGCTGGGATTA 
CAGGCATGAGCCACCGCACCTGG<XXTGTCAGGGTTTTCTTAACAT^ 
TGATTCTGACAACT GTCACAACATTTTGGGCCAGGTAACTTTTGGT GGCTTGTGCCCTGT 
AAGATTTTAGGAGCATCCCCGGCTTCTACCCACTAGATGTCAATAACATCC 
Sequence 1138

CGCTT AGCGTGGTC GCGGCCC GAGGT ACAAAACAGAACAAGTCTCAGTTTTCAGTGCAAC 
ATTTGAAAAAAT AT AT AT GCT GGAATCT AAT AATTAAAAGGAATTTT ACC TATTATGAAA 
CATATTACA 1 IF T I I AAGTTAGATAATCANK3TTTCAAAAGGAGTATTCAGGTTATTTAAC 
I I I G 1 1 II I AAATGGCT GCATG AGAAAAAAATGTCTA I 1 1 II I H I A TTAAAATATTTCA 
TCACTTGTTAAAACATA I r I TT GATCTGAGTTTGGTAAAAGTATT ATTTT ACCTGCTGTT 
GCCCTGCCCGGGCGGCCGCTCAAGGG 
Sequence 1139 

CCCTTAGCGTGGTGGCGGGGGAGGTACTATCTCGAATGAAGTT AAAAACAAATTAGAGGG 
AAAAGGTCAGGTTAGCATGTTTTAGAACTATTGGTAAACT ATAATTCATGGGACATTATA 
TAATCAAAAGATTAATATTTTAAGCACTAAGTTATAAAGGGTTTACACCCATGAATAAAA 

agattagcatcacitactatgaaccaccattccatgaatccatgtagctgaacactccta 
atgaaaagtttaattatccttcaacctgtagttgaagaactcagttcatgttcattgaca 
gatttccattacagac^^tatattgatgttactttctttgacactatattttatatag 
gatatattaaaattgaaaacttaatgctgtttagaaggctattaatatactattaatttc 
tgaaagctttgagttttc tgaaaaggc ttttaagatcaaaatttc tgaaacactcgacac 

ATTCTTCCTCACCCACATTTA 
Sequence 1140 

CCCTTAGGGTGGTCGCGGCCGAGGTACCAGATTATGGACtCTGCTTCTGGTGTGGGTAGT 
AGGTGGAG GGTAGCCAGGAGGGCTTGGGGTGGGTCATCACCTCAC^TTTTGAGATGGGG 
TTTT ATTTT GCAGATTCATGCATTGATCAGAGGCCX^TTTGACACTC 

tttcttcctgttcgtgtttcggtgtccgcccgactatcccatccacccacctcgggtcaa 
actgatgacaacgggcaataacacagtgaggtttaaccccaacttctaccgcaatgggaa 

AGTCTGCTTGAGT ATTCTAGGGTAAGAGGAGACTTTTAAG TAGCCAAGTCCGG TTGTTAA 
GCAGATAAITACTCTAGGTCAGCCTTTATCAACCGGAGTCCCTCATCTGAACTACAGAAC 
AGAGAAAATGATTGAG TGAC TCTTC TCAAATC TCCTTCAGGATGGT ATGTGACTAGTA TC 

^J^^ A< ^TGGANAGGGGGAGAAGTTAATTTATTACAGTGGTAACCTTTAGAAGTGGTCN 

CTTAAGANTGTGGGCCCTGAACCATCTGGGGAACrTGTAGCCCAGCGNGTrrcrrGGGGCC 

CTTATCTTAGACCTACAAAAAGAAACTTTGGGGGTTGGGG 

Sequence 1141 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I M I NI 1 1 M I M l I G ACGGAGTWTGGCTC 
TCTTGCCCAGNATGGAGTGAAGTGGCACGATCrCGGCTTACTGAACCTCC^CCTCCTAGG 
TTCAAGGAATT CTCCT GCCTNAGNCTNCTGAGiWAGTGGGGATTACAGGTGCCCGCCACCA 
TG TC TGGC TAATTTTTGTG TNT ATAGTANAGAC GGGGNTTT ACCATG TTGGCCAAGGC TG 
GTCTTGAACTCGTGACCTNANATGATCGACCTGCCCTGACGTCCNACAGTGCTGGG^ 
CAGGCATAGCCACCGAGCCNGACNAGGGCNNTTTTAWCAAGGAAA^ 
GGCTGTTGGTGTGCANAKAAimtATACTGTGmACATGTTGTGAAACCTGAAN I II K III 
GNTN N GATTTN GT ATGANGAATGAN N N NCGGAC NCAANCACCCNT AAGGGGNGAAATTN C 
AGACANANTGGACGG GCNGTT, ACNTATNGGGATCNf#IATKTT>(GGTAACAAAANNTNAGG 
CTGnWfnACNTGGTGnlAANGtSTCATGTTACATTGT^ 
NNATTTNTAKANANCATACTANIWNNGNGGCTT^^ 
AAACCCCCC^KXCCWCCCCCClmTNNCCCCCCCC 
Sequence 1142



CCCT TTCGA GCGGGCGCC CGGG CAGGTACTATTAGCAACTGTGATGATGATGATTGTG^ 
TCTT ATTTT CAT ATCTTGGGTTTTCTTACAGTGAAATATTrGTTGTGTTATTTTCTTTGT 
AAAAATAAACCATGTTTGCATCTTGGTCTTCTTTCCATTTGG^ 

GATTCC TCCTAGTAAAATTGCATTTTCTCCCTAGGAGTACCTCGGCCGCGACCACGCTAA 
GGG 

Sequence 1143 

CCCTTTC gagcggccgcccgggcaggtacctacacacatatatgcatatatggtataatg 

TATCAATATTT ACAGAGACCATAGT AAACACAGGACAAAACCAGGCATTAAGAGATGCAT 
GGGAAATAGCATTTAAATGGTAAATATGGTAAAGATTGTTTTATGGTTTTTGGGTTTTT^ 
1 1 I I I I AATGATC^TATTTTTAATGTTACTTTAAAATAGATTAGTGGAATGTGATTCAAT 



< ^^^^^Qagcggccgc ccgggcaggtac t ataag tagntggtttgtatganatggttaa 
aaagg ccaaagataaaagg 1 1 ic ii i itt ttcc h ti t i gctatgaagttgctgtttatt 

TTTTT N GGCC TGTTT GATGTATGTG TGAAACAATG TTGCCAACAAT AAACAGGAATTTTA 

TTTTGCTG 

Sequence 1145 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGTGTTTGCTTAAACAAAGTGACTGTTTGGCT 

TATAAACACATTCiAATGCGCTTTATTGCCCATGGGATATGTGGTGTATATCCTTCCAAAA 

AATTAAAACGAAAATAAAGTAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGCCGNT 

CGAAAGGG 

Sequence 1146 

CCCTTAGCGTGGTGGCGGCCGAGGTACCAAGGTGAAATTTGAATGTGTGAACGCAT^ 

CTGTGGAGTTCTTTTCAAAGAGATTTCAAAGCCAkCAAGTTAGATAAGGCGAAGAAGTA 

GCCAGAGTGAGATCGAAGTAGGCCTTTCTTTTAAAAAATAATAGCTTTT ATTTT ATGTCA 

GTATCTTC TTTAGAAATCTAACCTTCCCTTTTCACGCTTTTTGAAAAGATAGCTAAAATT 

CAGTGTGTTCCTCTTATTATAAAGGATTGGGCTAATAGTTAAGCATTTCAAAACATTTCA 

GTTTCGTTAATCAGAAGCTGCAAGTGGG I IIG I II I ATAGCCAGTTTGCTTTTAAATTTG 

GCCATGTGGGCTTTAAGTTCAACGTATTTGTGTTCTCTTTAT><GTTACTCTCTCCAGAAG 

TATTACCCAAACTGTGAAGTTGTGGTTATGGGGATGGCAAACATTCATTCTATTCGGAGG 

A ^ 1 ~'^^AAGTCTfrrGCGGTTGCTGTGCACTCAGAATGCCANAT(XCGGGAAAGTAAGTC 

CTT 

Sequence 1147

AGCGGCCGCCCGGGCAGGTACATCTGTCAAAAATCATATTTATGTGAGATGTGTCAATAC 
TA^CTTGTGTr^TTP^TGCTACnTTAGAANGArK^TAAAAAATATCXTGTTrGGCTCCAA 
AAAAAAGAAAAGTCAGCCGCTCCTGCACGAGTNGGAGCTGCAACCCTTT ANAATTGATAA 

tcacaaacccctnagacccanagtaaaataaaaaaaaagatatgtnacattaggcattga 

TGG AAAAGGACTAGATCCTAGTATAAGC ATCC T AAT AAAAGGAGAGGTTNAAAGACGC TC 






TOCAGAACCAGNNTTNCAGACTTTNTATGATAANCT AAATGTGCCANTCCTCGGCCMNTG 

ACCACKCTAAGGGG 

Sequence 1148 

CCCTTcAOCGGCCGCCCGGGCACGTACTATTGAACCAACAOOATATCTTTTTTATTATTTC 
CATGAGT7AATCCTACAAACAAAATTAAATACCTCTTTT ATAAAACATCTTTTCCAGTGT 
TCTAATTGATGGAGATGCGGATCACTCATCTATAAAAAATGACTTACAGCTTCAGCTT AA 
TCAGTTGCTATAATGTGA AAACAG GAATGTGTA 1 1 1 1 1 1 1 LAACTAGGTAAAAGGTGCAT 
ATAATTTGAATTGTTAAATGTTTTATTAATGAACAAAGTAAACCTTTTAGTAATTTTTAA 
ATTAC rGSTCn AGGTGTTTGAAACAAGGTAAAAGTATACATTCCAGTTTTGCCCAAAAG 
TCACTTAAAATATCTACAAATTATTTAATCTGTGTGTGGTAACACCATTATTGCTCCAAT 
TTCTGGAAAGAGTCT ATTTTCAAAGTTTAAAAAAGAGOAAAAACAGCAAAGTGGCTAACC 
TTTCCAGTCGAAAGAAAAAGTCTCC1 1 CATOGGTTACACTTTCATATTTTTATGCAGCAT 
TAAGTTATCTACCGTTATGGGGGAACTTGGGGTTT 
Sequence 1149 

CCCTTA GCGTG GTCGCGGCCGAGGTACCATATTGTT C 11 N 1 1 ACAKNTWTT ACTGTCTCA 
GfrTATAATTTTGCAATGGCGGTTTCNCAACTNGCCTGNC^ 

TAATCTACGTAAACAAGTTAAAATAGGTAAATGNAATGTGATNAATACTTGNGGACAACC 

TGGTCATAATTTANAATCTCAAGGCTATATTAAATAATACATATTTCATTATTNGGGTAT 

TTTCCAATANAAATGTATTGGAGGAAAACCTTTCCCANAA 

AAGGNGAATNANKN 1 1 I G I L I AATTCAAAAGCTTATTTAAACGTTATGTGTAAAACACGG 
TNAAAOAACCNTMAAATAAAGAAAOATNTAANATAAAACGTTACCAAAAATAAAGTG 
Sequence 1150 

CAAAAATAATAGNGNAGAAGC TGGGCACAG TGGC TCATGCCTgVmTCGCAGCAC TTTGG 
GAGGCCAACTCAGGAGGATTX3CTTTAGGCGAGGAGTTGAANACCAGCX 
AACAAAAAAATTACCCGGGCATGGTGATGTGTGCCTGTAGTCXXAGCTACTTGACAGGCT 
GAX^TGGGAGGATCCCTTGAGCCCTGGAGTTCAAGGTTGCAGTGA^ 
TTGCACTTCCAMOCTCMATr " " 



ANAGAGGTGGAAGGGCTCAWAAGTGCITTCCACATTCGC^ 
GCTCTAAAGCTAGA6GACTTTTA 

Sequence 1151

COCTTAGOG TGGTCGCGGCCGAGGTA C T r i 1 1 1 1 1 I I H 1 1 H I IIIGGGGTI 1 1 11! 1 1 
nTTTTTm GA6ACGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAATGGTGCGGTCT 
CACCTGACTGCAACCTCC6CCTCCTGGGTTCGAGATTCTCCTGCCTCANOC7CCCAAGTA 
GCTG GGAC T ACAGGCACCCACCACCACACCTGGCTAATTTTTTTGTATTTTT AGT AAAGA 
CGGGGTTTCACT ATGTTGGCCAGGCTGGTNTCGAACTCCTGACCTCGTGATCCACXCACC 
TTGGCCTOCCAATCTTATTTGCTTTACAAGTCCTGCTTCAGGGTTACCTTCCCTGACCAC 
TOC TGOCTCCCTC CC AACATTTCCAAGGGAC TGTCATTGCCTTAAGTT ATTTTTTCTGTT 
NAGMTTTTTTTTT GGCG 1 1 1 T7NTT 1 1 1 1 11 T NAAACAGCGTATTAATCTNTCGCCAAAG 
GCTTGGAAATCANTNGCCCAAATTAAAAGCHTTGTTGMAGCCTTGAA^ 
AGCAAATTCCTWT ACCTTNAGNAAAMTf*GNGACT ACN GGGCC CATGCCCACCACGC TTG 
GGCCTTTAAAATTAATTTNTGGGTAACAAAAAAAAAACTTAAGCCCTANGMAAANCTTTG 
GTTTAAAAATNACAAGAGGGACTTNNATHTTNCATTNATACAAATGGAAAANATTAA>nT 
TCNTCIiTTANNANGANAAAGGAAAAAAAAAAAAAN 
Sequence 1152 

CCCTATtXaAGCGGOGGCCCGGGCAGGTACAAGCAAGACTTTCCTTT AATATT GAT AAAGA 

ATTGAGTATCATGTATGC ATTCC CTTTTATGATATACAATTAATTGAAGTTATTTCCCCT 

TGTATGCAATOATCCACATTTTTCTTCTGACCTTTTCCTCAAGTC 

ATG ACTGC AT I T 1 GGAGGTGGTCCCAGGAGAACAGATGTTTGCCTT ATAATGGNGTTTTT 

CCATTTTTATCTTTGATTGNGCAAGGGGGTTGGAAGTATTATTTAGTCATTATATGGATT 

CCTCTAAAAATTGTTt^A^TA^t*ATATATATTCATTTATTCAC^TTACTTATTGTTTATTT 

ATTGCCTTAGAGTATACCCAAACACNGGAGGATTCAATAATGATCAAGACAGGTCTAATT 

TCTGTCCCAAANGAGCTTAAATATGNGAATTAGAAAAGGAATTTT 

Sequence 1153 

CCCTTAGCGTGGTCGCGGCCGAGGTACTACATAGAAAGGGCTTGGAAGTCTGATTCAGGA 
AAGGAAATCAGGAAAGAACAAAGGAAAT GAAGGAAGAAT AAAAAAGAAGAGAAG TCATT G 
AAAAAG T ATGAAAAAATATGAAACaGATAACAAGAAAGTAGAGGAGATTCCAAAAAATAC 

aagccaggttttctgccctcattctatagagtcttgagaattgtagggtgtaagaaataa 

^Qf^TCAAGTCreAGAGA^^^C^nG CI I C 1 1 1 C 1 1 GTCTCACTGATCTGGAACCCAGG 
Sequence 1154 

CCCTrAGCGTGGTCGCGGCGGAGGTACTCCAACTATCACTrGTCA 1 I 1 QTCTAGGAAGGT 
AAAAT ACAGGAAGTTCCCAAC TTAAAAATGGGCTTGACGTAGCAGTCATTTGTAAGTCAC 
TTGCTTGGAATTTAGAATGCTT CTTCCCTCTGCAGAGACAGCTTCCATATGGTGATTAGT 
ATCCAGTCAGCCCACAGAAGTTATTCAGTCTGTTGCT ATAGATGAAATTATCCTTA'TTTT 
TACTTCCCCTTGGAATAGACCACCTACI OI I I U I I C I G AOTGTGGTCI 1 1 HC 1 1 1 I G TC 
CTATTCCCTCCTCAATC C I tl IIIIIHIII HUM I NCTGGGTTTC TTCATTATTCTC 
TAATTTCTTCTTGGCTCAAAATACTTCAAGTTCTATTGNGGTAGCCT AGATTTAGGGACT 
AGTTTGG 
Sequence 1155 

CCCTTAGGGTGGTCGCGCCCGAGGTACCTGCACGAACAATA1 ITXTGTACCCATGGAAGA 
GGGCCAAGGCTCAGTCAC TCC TTGGATGGCCTCCT AAATCTCCCCG TGGCAACAGG TCCA 

CTTACTTTGTAGCCTTCA7t>GGGTTTGCTGAAGATGGCGGTATATAGGCT 
GTGGTGAGG TTGAT CGGGGTTTATCGATTACAGAACAOGCTCCTCTAGAGGGATATGAAG 
CCCCGCGTCCTGCCOGGGCGGCCGCTCGAAGGGCGA 
Sequence 1156

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGCATTTTTGTATTC 
CCTGAGACTGAGTAATTTACAAAGAGTAGAGATTTAAATGGTCAAGGTTCTGCGGGCTTT 
ACAGGAAG<^TGGTGCGAGCATCTGCTCAGTTTCTGGAGAGGCCTCAGGAAGCTCTTAAT 
CATGGCAGAAGATGAAGGGGGAGCAAATTAATCACATGG TGAGAGCAGGAACAAGAGAGA 
GAAAGGAGATGTACATATACATTATGTAATTAAAAAGCGTGCATGTGTATGTATTAAAAA 
TAATGGTATATAAACAAATACAATATaTACAATAAAACACCTAAACGCANAGGCTGCTTG 
TTATCCACAATAMT AAT ACCAAT AG 
Sequence 1157



CCCTTAGCGTGGTOGCGGCCCGAGGTACAGGC I iX. I G CC 1 1 1 AAGAGCACTGTTTTGCTT 
^GGGGCAGAAAGCATGGACTTTTAAAGGGGGACTTGGCATGAATGGATTCAGAGGAGGG 
AGTGAGCAGTT GGGGGTCTG CG TGACTCGCTTTCGTGCTT AATCT ACTGGTGGTCGAGCT 
GGCTGCATCACAAGCAGAGC T AGGTTGTAT AGTGGCCTTTGTCTCAAGACACTCTCCAGG 
TGGGAGAGCCTTCCATCAGGGACATACTTT AGGTTGCAAATTGACTGTTGTC1 CTTGAGG 
CAATCTCCTTGTGGGAGAGAGTTTCTGCCCTGGAGCTTCAAAAGTAAGCACGTAGTTAGA 
TAAGCTTCCAGTGTANNTGAGTGl CTGGT UAAAGGGAAGCTAAAGGTTATGATTGCATTT 
TCTGAAAGAGCTAAGGTANGGAAATGGGGAACATAAAAAAAAAAAAAAAAANAAAGTC 
Sequence 1158

GAGAAGGCTTCATTAANGGAATCTCACTGNG^ 

ATCAGNNGGNGGATATGNGTGATAAGCTGATTTCAATATTGAAGTATNGAAATAAAATAT 
TCTTTAGACCTGAAAAAAAAAAAAAAAAAAAAAGNACCTGCCCGGGCGGCCGCNCGAAAG 
GGCGAATNCCAGCACAOJ GGCGGCCGGNACNAGNGGANCCGAGC TCGGNACCAAGCN NG 
G 

CGGAANCANGGCATAGCNGNNCCTGGGGGAAAANGGNAN 
Sequence 1159 

CCC TTTCGAGCGGCCGCCCGGGCAG GTACACCAGCC TGGCGACAAGAGCGAAAC TCCATC 
ACACACACAAAAAATTAATTAAAAATAAATAAACATTGGTCAAAAATATAAAGCTGTATC 
AACTGTAT AT AAATAA7TGAATT AAAAT ATCATGCATAAAATCTGGGTGTAATAAAAACA 
AAGAATAA I Mill 1 AAAACCCAAAGCAAGGCAAGGGGTGATGTTACCAAACTGCCATGT 
ATC^GAGATGTGATT AGAAGGAAATCCTTCAAGGGGAGC TTATTTATGGTACCTCGGCCG 
CGACCACGCTAAGGG 
Sequence 1160 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGGGATT ACAGATATGAACT ACCGTGCTCCCTG 
ATACCCT AAATATT rATCAAAATTTTTCAGTGCTATTTTGCTCATAGGATTAAAAGGGCT 
ATTTATTA! MM 1 ATAACTACAGCTGACCCTTGAACAACATAGGGGTTAAAGGTGCAGA 
TCCCCCGTGGAGTAAAAAAAAAAAAATCATAAAAAACTTTAGATTCCCAGAAAACTTGAC 
TATTAATAGCCTACTGTTGAGCGGAAGCCTTACAAACAGTTAATAGACATTTTGTATGTT 
GNATGTATTATATAATGT ACCTGCCGGGCGGCCGCTCAAAGGGCGA 
Sequence 1161

CCCTTAGCGTGGTCGCGGGCGAGGT ACTATAAAG C 1 1 1 IU I 1 C ACACACACTCTGAAGAA 
TGCTCTAAGCCCCTGAATTAAGCAGAAAGTCTTCATGGC 1 1 1 ICIU GCTTCGGCTGCTCA 
GGGTTCATCTGAAGATTCGAATGAAAAGAAATGCA I G 1 1 IPCTCCTCl ICG CTCATTAAA 
TTGCTTI I AATTtXAAAAAAAAAAAAAAAAAAAAAAGTACCAGTCTCACATTTGGCCCAA 
ACCTCAGGATTCTOOCTCTGCCTGTCTrACTTCATGGTAGCT 
AGGG 

CCCTrAGCGTGGTCGCGGCOGAGGTACCAACCCTATTTTACAGATGGGAAA 
GAGAGAGG TTAAATC ACTTACAGAAAGCCACACAATTTTGAGTGGAGAGCTG 

. ATCCAGGCAG I C1GACCCTGGAGCTT ATGTGCTTAACGATACTGCCTCTCATGTGGGCAA 
AGGAJGGCCCAGGiAGAAAGGCAGGCCCAGATTCCAAATCT GGCTTGACCGTCTAAGAGGC 
TGAGTCTTAA CCTcnCTGA GCCTII GC IGF 11 CATCTGTAAAGTGGTCCTCCTGACAGCT 

. GCCTCCTAGGGTTG IT1 I GAGGATAAAGTGAAGT AATGGAGGGCCC 1 I U GOATATGGTAC 
CTGCOCGGGOGGCCGCTCAAAGGGCNAATTC 
Sequence 1163

* ^CpCtrAGCGTGGTOGCGGCCGAGGTACCTTTTTC^ 

TGGGGTGGTGCTC TGAAACTAGGT AGAAGTCX; TCACCCCCCAACAAACCTTT ACCAGTGG 
TT TTAGCAT GCAGAAGATTCTGGCCTGAACCAGTTACTACT acagaggctgcaaaatgat 
GATTTTTTCATTCATTCTTTNGTAAATACCCGGT ATTTTTCACAGGATGAATGT ACCTGC 
COGGGC6GCCGCTW5AAAGGGCGAATTCCA 

Sequence 1164

. i iwn 1 1 rn 1 1 1 n ii i n 1 1 1 icnc rr agcagggtctcactct gtcacct aggc 
tggagtgcaggc aacag gccaagaccctgtctccaaaaagaaaaaaaggaataattct AA 

AAGACTTATATTGA 1 Mill CCCCAATTAACATTAAACGCCrrCXAGCTGCCCGGTGGGAA 

ATTGGGTTGGCATGTCACTGAAAGGCAAGTACCTOSGCCGCGAC^ 

Sequence 1165

C CCTTAG CGGCCGCCCGGGCAGCTACAAA C 1 1 1 C 1 1 C AGTTCTAATTTCTAAGATGTTTC 
AC I C I Tl AAGTAGAAATGAAAGTCATCTGACTGAAAATTATAGCAGTATCTAATTGTTTT 

tcataact agccaaattcagaaatgtcctggatatatttctggacaatgtagatgctgat 
atcc 1 1 ugatttaggttatactgacttttatctttaccaaaccatattaacatttgcatt 
ttataattggaatgagaaatttagagtaagagatctggatcatgcaggcag 

AACCAACAATACTTTTATGT ACCTCGGCCGCGACCACGCT AAGGG 
Sequence 1166

COCTTAGCGT GGTCGCGGCCGAGGTACGCGGGCAGTGGTTTTGCTCT ATACCACTGAAAA 
GCACTATAACATAATTGTTGWCCATGATACTGAAGCTTTTCCCCTC^ 
TACATTCAGAGCTCTATCAATAAGAKGAATAGATATTACAGTGAATT^ 
^^GCCAGTNGGTATCCCCAACCTAATTTATCTTGGT aaattcaccctgtttcctagtg 
CTGKTGGATAAAAGAGTGTTTACTTTTTATTGC^^ 

TTCAATTTATCAACATANCCT AGACTTCTGTAAGTGGAATGNTCATTAGTAACT CATCTT 
TTTGTTGKrATAATTGGAAAACAGAAACGAGGCTTATTGCTATTGCAGAAATMCNAMSCT 
GGCAAAAGGCCNAGT ATTTNTGGTATTCX^rrAATATAACCAGCTTTTGAAATTT ATGTG 
TTTGCATTAfrTGCCTTCTGGGTTACCXAAGTATTGACT CTGNTT AGTTT GGCACCTTTTC 
CGGMCTT AACANAAAAATNGNAATTTGGTTAATTCTCTT AAAKATTKGGTttGKAKCT AGT 
f*GA»*SGAGGTMATNNCCTAGGAAHCT^ 

C^*** 1 ^^*AAANGGGCGNNTTCCAM^AAhrTTGGCGGGCGTTACT AAGTGGGHTCN CNNCC 
NH rcGGGACCCGAGCTTGG^*CGTATT^^^TGGGGAG#4ACCCCCT CCGMCCCCCNCKTTNTTT 
TGGAATAGAAATTCCCCCCC 
Sequence 1167 

cccTT agcgtggtggcggccgaggtacttttctg t c n CT AATTTTTAAAATTATTAATG 
tc 1 1 c t atttttctaaggctgattttttct aatgtct gtatttttcc i rrm i o catc 

^^^ TAA GTAGAGTTCATTrATTTTCATTTATTCTTGTATAATAAAATTACTT AAGGT 
TAGGAATAATTAAGTTTTGCTCCCJVTGTTTTTATGTGTAACAATCTCAATGTTGTATGTC 
ATCTACTTCAAAATTTCAAGCTTGCCCtTT AAAAT ACTGTTTAAAAAACTTT ATGAAACC 
AGTATtTCTCTCAACCCTTNGTGTAATACCTGGTTTTACTTTAAATGTGGTCAAGATAAT 

ttaacctgt 

Sequence 1168

CCCTTrCGAGOGGCGCGCCCGGGCAGGTACGCAGGGATATACAAAGGTGAAAAGAAAGCT 

gaaatatttgttgatggctggaatattta r T MM igatcaaatagatgaactgcctacc 

tattggtcagaatgtggaamaatacagaatctgttgggcagttatggttgggccttctt 

cgtttctacacagaggaatttgattttaaagaacatgttattagcatgaggagaaaaagt 

ctgcttacaacttttaagaaacagtggacctcaaaatacattgttattgaag^ 

gatttgaatcataatctcggagctggattatcaaggaaaaatgacaaattttataatgaa 

GCTTTTATCAATGGTAGAAGAAGTATTTGGGA 1 1 1 I GU I LAAGGGGATTTCAAAWGAC 

TACCCCTCAA 

Sequence 1169 

ACACCAGCTGGCAAAATGATAGGGCTTrTCCTTTGAATTANTCAGCACAGGTGTGAAAGA 
CAGAATGACTAATCCATCrGATTAAACATANAOCTT^ 

ACAGATGACAACTGCTACTGTTCCAAGGC fCCT AATGATGGTTCAGTTCTCAGGGCCTCA 
AGTCTTTTTCCATTCCATCNCANAGTAMTAOCTGI 



CT A I 1 1 rgl I U GI 1 1 rOGGAACTGAGGCCATGATTAANAGGGCGGCCGGGGGTGGCTATT 
GTGGGAAGTCAT AACCCACAGAT AGATCAACCT AAGAATCCTGGCCC TTCTGCACTC TCC 
A<^TGCAGGACAAACATCTTCTCAAGCAGTCAACGTANAATG^ 
ATTACC CACATAT AGTAATTAATAGATGGTAA 

TCCCAACACTTTGGGAGGCCAGGACAGATCGCT^ 

GCCAACATOOOGAAACCATGTCTCT ACTAAAAATACAAAAATTATGGTGACGCCTGCCTG 
TANTCCCAGCTACTCGGGANGCTGAAGCAGGAGGATCGCTIGAACGCATGAAG^ 
TGCAGTGAAGCCGATATCGCACCARAAGNGCTCCAGCCTGGTO 
NTTCTT AAGAAAAAATAAAAAT AAANGTTGTTNTCTT GAAGAAAAAAAA 

IT ACAGGAGGAATGTTT GGTTGGGAGAATCACAGC 
TTTACAAGGGTGTTTATATTTGA IT1G I G1 11 ATATTTGAGGCAGGTATTGTAATATAAA 
GGAATCCATT ACCATGTCCT ATAAATGACCTCT AG<XATTTTA1GATTA7TGTTCTC TGT 
AAAACTCTTCAAGACTTCAATGAGAAGTTTGTTTATAAGAATTATCTTCT 
CTTGTGAAGAGCGTATTCTGTTTTTCT ATCAGTTCGACATGAAGTCCACATCACATGCTG 
I IC 1 1 1 1 C I AGTT ACATGATGTGCCT 
Sequence 1172 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAC£CTATTTTACAGATGGGAAAACTGAG^^ 
CAGAGAGGTTAAATCACTTACACAAAGCCACACAATTTTGAGTGGCAGAGCTGGAATGTG 
AATCCAGGCAGTCTGACCCTGCAGCTTATGTGCTTAACGATACTG^ 

AAGGATGGCCCaGGAGAA^GGCAGGGCCAGaTTCCAAATCTGGCTTGACCGTCTAAGAGG 

CTGAGNCTTAACCTCT 

Sequence 1173 

CCCTT CGAGCGGCCCGCCCGGGCAJGG T ACGAAGACAGOT cc ttcaatcccgccagc t ca 
TGTGCATCTGAGGGTGGGGCTCTGTCTTCATGCTAGAAACCAAACTGCTCTCACAGCTTC 
CTGCTAAATCACCAGGGCTAACGGATAAGCAGAGACGGACTACCCGCGTACCTCGGCCGC 
GACCACGCT AAGGG 
Sequence 1174 

CCCTT AGCGTGGTCGCGGCCGAGGT ACAGATTGCAT AATAATTTTT AGATAAATGTCAGG 

AACAGAATCACATTCTTAAAAGGOIGAATTTCTATAAACGTCTGTATATG^ 

GAGCAGCTCTGCAAAGATGTGTATAACTGCATTTGAAAANGACAGTGAAAATTTTGC^ 

ACTGTAGAT GTCC CACAGTCTGJe^TTGGAATTTAGTTCTGTGACTAAAGGAGGCTTACAG 

NTGCTCCAAI 1 1 IGG1 ICIGKGGGGTACCTGCCCX»3CAGCXX5CTCAAGGGCX^TTCCA 

G 

CCCTT AGCGTGGT CCCGGCCGAGGTACATGGTCACAACAGATGAGCAACTGATATCACTC 
ACACATGCTATTAAGAACTGTCCTGTGATAAATAACAGACAAGAAATTCAGGCATCAGAA 
AGCOGAGCCACAG6TAGAAGAGTTATGGACAGTCCAGAGCGTCCAGTTGTAAATGCCAAT 
GTCTCAG TGCCATTGATGTTCAGAGAGGAAGTGGC TGAATTCCCACAGGAAGAGTTGCCC 
GTTAAACTGTCTCAGGTGCCAGACCCTCCAGATAACATGAATCTGGOCAAGAATTTTCCA 
GCACATA I t I I i GAGCCAGCTGTGTTGTTAACACCAC 
B1178 

GTTAAACATGTGTCACT GGGCAG 
ATGTTTTTGGTAAACAGGCGGGGTAAG 

atttgccgagttccccgcgtaccaatgactggttccatgatcccctaagagaacacaact 
taggaatgtggattctaatgatagctttatactgcttaggcaaatttactt ctgagcctt 
atgtgcct7cagtggtgcaagcaaatttcctttacactttagagaggttgattaacgagt 
acctcggccgcgaccacgctaagggcgaattccagca 

cccttagcgtggtcgcggccgaggtacactgaagaattaagctgtaatgagggaacacgc 
ctgcaacttattcttt aatagttcagaaatattaacaattgggtaatttgggtgaaaggt 
ataaggagct ataaatgttat7tctgcaacttt f atgtaaatttcaagttat7taaaatg 

AAAAGTTAAAAAGTTTAAAACAT AACAGAATAGAAGAT AAC CTATTAAATAAATC TGAGT 
CCAGGCATGACACAGTGGTTCATGCCTGTAATTCCAGGGAGGGACTGGGAGGCCGAAG TG 
GGCAAATCACTTGAGGTCAGGA 
Sequence 1176 

CCCTITCGAGCGGOCCCCCGGGCAGGTACTAAATTGTTTTAG^ 
AAAAAAAGGTG A T TTIITI1I IT GGCTGCAAGTAGGCACTTATTGTAATTTTTATTCATG 
CTATGAACTCATGATTTTCCCTTTATTCTCCTTTGATCCTACTTAAATAAATTTATAGAG 
MTTGAATAATATAGAACCAAGATAAG^^ 

Sequence 1179 

^^^T^^^^fT^fi^fPP^'^'T ^. W V*. VA * r nT1TTTTTTTTTTT1 NCTTTT 

TCCTTT AANGGN NHAAANNCAATT NCTGGATTAAAAN NCCCCNGGAAAAAN GNNGGGGAC 

CNTTTITGGAAAAJIAAAArrANGGAATTTAAAAA 

W*ATTNNTmNAAAAAATACANTTTTAN^^ 

AfnTT TNAAANAGGTTTTTACNC I II I HO I > AACAACCCCNGGNAAAAAAAAAN AATTT 
Hill 

Sequence 1180

CCCTTAGCGTGGTCGCGGCCGAGGTA C I 1 1 I I I I I I H 1 1 1 I H I I t I 

urn nrmccccN 



TTTTTTTCTTTT 
CTNNTTTT 



TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 

TTTNCNTTTTAAAAAAAANTTTTTNNfiAAANKiGn T TTTTAAAAANTTTNNNNGGNNGGA 
AANTTAANANNATNANNNGGNANAATTTl lllllllll I NCCCCAAAAANTTNTTTTNGG 
GGCNTTAAMTTTAAAAAAAAANTTTNNN^ICCGG^ITTTTGG^INNNGNMNGGNGGGAAAAM 

Sequence 1181

kT AAAAAT ACAGCAGGG CAAG 



AGGACCAAGATGGAGGCAGTGATCAGGGAATCTCAATGAGGGTGAGACTGCGACAAAGAC 

TTGAAAAAGG TGGAGAAGCAAGCCTTGTGGGTATTT AGGGT AGCAGT AGTC CAGGCAAGG 

GGAACAACTAGTGCAAAGGCTCTAGGAGGCAATGTGTTTGAAGTGTTTTAAGAACAGTAA 

GGAGGCTAGTATGGTTAGAACAGAATGAGCAAAGGGGGCCAAAGTGGTAGAAGGTGGGGA 

TCAAAGAGGTAATGAGGCCTTTG 

Sequence 1182

AAGGTGAAGTCCGACANTAGGCTATCTGGAAGCTC^GGAGCAAGGACCANTCATCCAACC 
TCAAATAGNANAAAANGGNNNGNAAGCCCGACAGGGCAGCCTTCAGTCTGTGGCTGAAGG 
CCCTAGAGCCCCTGGCGAACCACTGGTGTAAATCCAAGAGTCCAAAAGCTGAAGAACTTG 

gagtcgaatgtttgagggcaggaagcacccagcacgggagaaaagatgggccggaagact 

CAGCCAGTCTAGCATTT N CACATTTCCCCCGC GT ACC TTGCCC N GGC CGGG 
Sequence 1183 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTC II I! I GTGTATTACTTTTCACTTAGC 



ataatgtcctccagcttcatccatagcagcttcatccataacttctgggtgtagccatgg 

caagggtaaactgatatggcacactggtgggcatgtcttctggagaggtgcttccaactc 

ttccctgttttagctagtcctcaatttgtctgatgtctgaaccccactgccagagttgag 

tcttgcctgctgagtcatgtccagactcctacctcagaagtatgaagcataactggtgtt 

acaaacaccatcttcagaaca 

Sequence 1184 

cccttcgagcggccgcccgggcaggtacgcgggggaagctcattctatacccgaagagca 

GTCTCAGAAAGCAAGATTACTTTTGTGT T I T 1 1 AAAAAATGATTCTTTAATGTAAMTTTT 
CTAAACATTCTGATTGGAAGTAGTGGATTC^TAAATGATTCCAAAGTCATCTGTAATTCT 
TCTGI rTTTGTTI IG I IC T G I C I I I I C I IC ATTTTGGCTTTGGGTGGGGGGAGGGGCAGG 
TGACACAAAGGAI I I 1 1 1 1 li 1 1 I I H I 1 1 1 AATTTTTGGAATCTTTTNCCAATAACCCA 
GCTAAAGATTTGCACTGAATACAACTTGTATGCCTTTTGCAT 
Sequence 1185

CCCTTCGAGCGGCGGCCCGGGCAGGTACTCCTGTA 1 1 1 G 1 1 C 1 I ATGAAATCACTATCTG 
CXTTTCTC^TATCTAGTAAGATTGGCTGGCTCAACTTrCTTCTGTCAAATTATATGGTTAT 
TTTTTATATTACCACA7GAGCATTATATTAAAAGTGTTTTTAATAGTTGAATGTATTTTG 
CCAACTACTAGTATAGACTCAAATTTGCTATTTAA7TTTTAAAATAGAATTTATTTTGTA 
AATCCTTTAAAAAATATTTGGTTAGTTTTGGATTAGAAATGATTTATGTTAGC CATG TGT 

tgaagatgaaattg 

tacatatccctatctactatgtaaagacaaaaag 
gcaaatgaaatgatgtaatacaatgaactcctcagaaaataagctctgtaaaatctcaga 

ctgcctgtttatcatatgct AGAGT AAACTTACATTCCTTTCTTGTTAGAGAAAAATGAT 
GGTAAAATCCATGCATTAATCAAAACTAAAAACATGAAAAGGCAAGCCAACTACAAGAGA 

aatacagttggcccttgaacaacacagattttgaactacatggagtcccgtgtacctcgg 
ccgcgaccacgctaagggcgaattccagcacactgncggccgt 

Sequence 1187

ctcctcttgaggtcccaagatggcgtggggttcctgggoctgtcggaaagtggcattctt 
tactaac^agaggtcaggaagcctgcacaggaactgtgtagacaaggtatgaggcx^gtt 

TTCCCAAGGAACTTTTATTGGCTCCATAAGTCAAGTTTGAGTCCTTAAAGGAAAGCAGAC 
CATTCCCATCAAAGTCCTGGTAAAACAACT AGTTTCTCTAATTGTGTCCTGTTGCAAAAG 
AAAACAGATTCTTATTGCACTTGTGCAAA 
Sequence 1188

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATATCTTACTTGATTATTTTATTTTCTATCC 
CACCAATCCACACCTTCACTGGAAAGTAAGTTCCATAGAGGCGGAGACTTTTGTCTATTT 
TGTTCAATGAACATCCCAAGGACCTAGAACAGTTTCTGACACATAAGAAGTATTCAATTA 
TGTGCTGGCTGAATGTATGAATTAATAAGTTGAGATTCGATCACTAGTTGAAGTATAAAT 
ATATATTTTTGCAAGAATAAATGCTACAGTAACTGA7TATGACAGCTAATTCTGTGTACC 

Sequence 1189 

CCC TT AGC G TGGTCG C G GC CGAGG T ACAATGGCATAGTTGAG TAG TCACCACAGGACC T A 
GCTGAAATCCTAAAATATTTATrATCCCTTTATAGGAAAAGTTTGTTAATTCCTACAATA 
GACAACGAACT ATCAGAATCTATCATACACAGCAATGGTGAACACCTATTCCAGTTGGGG 
TGTGTGTGTGTTTGTGTGTGTGTGTATGTGGTGGGTT 
Sequence 1190 

CCCTTAGCGTGGTCGCGGCCGAGGT ACACCT GGTTTCACAGAAAACAAAGCAACCTCTTA 
AACAC CAGC TC GGCAAAATGAT AG G GC TTTTC CCTTC GAATTAGTCACCAC AGGTG N GAA 

agacagaatgactaatnccatctngantaaanatagaccttnnnagaaatcaatnacnct 

TATNTTACA 
Sequence 1191 

aattcgcccrragcgtggtcgcggccgaggtacttctaccatcttttgtctactttcgtg 

acttaaactgccatctgtgatacatgaggacttacctaaaatgtctgagaactgacttac 

gcttgattaccaatgttttggagtttataaagctcaattctaacagaacatgatgatgta 

taaaaataatcttaaaaaataaaatatgatggtatagtaataaagtaaaaataaatatgg 

tacctgcccgggcggccgctcgaaaggg 
Sequence 1192

ccctttcgagcggccgcccgggcagg tacaaaacaaatctgaaatatcttatt aaacaag 
aaagt aaaaatgttatcaaaaa<rractgtcgtctcatcaaaaagattcagaagccaattt 

AAAGAGTCTCACACTGGACACAAAAATAATTTGAGCTTCAAAATAAACTGCAAGGGATTA 
AAACACATAAATTGTGTTAAAATCCACAAGTTCATAATGATACTAAAAAAAAAAAATCTT 
GTTGGI 1 1 CC I C I AGAGGCTACTAGAAAATCAGCTCATTATTTCTGATATTGGTTTAAAT 

Sequence 1193 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACC 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 II I II 1 1 1 M 1 1 
TCATNCAANAAANATAATTTTACACTT ATTCTTTGAAAGANAAATTCTATGGAATTTTNT 
TNTT CTAATT^iAATTCCAAAATAGATTCTNT>M^KX>^^ATGCCCTKATACTAG^lAACT^K5 
ATGGTNAGCGGGTAAGTAGGTAGTAGT ANAANAACANAANGGGAAATTNGGGGAGCANAA 
AAGGGANAAA 
Sequence 1194

CCCTT AGCGTTGGTCGCTGGCCGAGGTACATATACATTATNGTAATTAAAAAGCGTGCAT 
GTGTATGTATTAAAAATAATAGGTATATAAACAAATACANT ATNTACAATNNAAACACCT 
AAACGCAGAGGCTOCTGI 1 ATC 
Sequence 1195

CCCTTAGCGnTX3TCGCGGCCGAGGTACATAGTGTCGCGAA 
GATCCAGTNGGTTTAAACGGCACGTTTTT GCTT ATAAAAAAAGTG 
Sequence 1196 

CCCTTAGCGTGGTCGCGGCCGAGGT ACT AAAGGGAAGTTGCTAGGAAATANAGCAGGT AA 
TTTNTCOTTAATTATQCAAACCATNGCAACACAGTA^ 
T7CAGTGNTTTTTTGGCATGAMTGTNATGGAANAGT AAACAAAA 
Sequence 1197

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAGGA 

CTGTATACACTTGTAi " " '^"^ GAGJGGGAA * * ~ zrTnx 



CCCTT AGCGTGGTCGCGGCCGAGGTACATGGCCCGCT CCCCCGTCCATTCCANTTTCCTG 
CCCTCTACTGGCCATGACG^jTCATCACAGTGCCCTCCTCATTCCTAACTTTT AAATACAC 
TTGAGACCCGCCTGATT AATNTT GCACT AN GAAAAAjCAAAACAANAACAAACAAN NAACA 
AAAACAAGACACTCACATACAATG 1 1 1 1 1 1 AATGCTTGAAAAGTACCTGCCCGGGCGGCC 
GCTCGA 
Sequence 1199 

CCCTTA GCGTGG TCGCGGCCGAGGTACCACATTCCT GCTCAGAAACTGCTCACTTCCTTA 

AftTTGTCTTTTTTCCCCCAGCGTGAAATGTATCCATTTATAACTTGCCTATTGCCTGTTC 

TATTAGCATCCAAAAATGTGGAAGGCCTCCCAACCACCATTTCTNGCTGTGTCCTTAGGA 

TGTGCAGWAAAAA ATATAG ACCTAACAGfrrTATGTTATAGAATGGGTTTATTTACTTTGG 

GTGACTGTTTATAG I n T T AAATAAAAGACTGAACATTTTNTCGAAAAAAAAAAAAAAGA 

ANAAGAAAGTACCTGCCCGGGCGGCCCGCTCGAAAG 

Sequence 1200 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTAjCAA 

ACGCTGACTCTGGAGCCAGATTGTTTGGGTTCAAATCCTTGCTCTGTCTCTTACTGTGAC 
GATTTTAGGCAAATAACCTAACCT CGCTGTGCCTCAG TTTCATCATCTAT AAAATGGAAT 
TTATAATAGAACCTACATCATGAGTTGGTGTGAAGATTAAATATATTTATATCCCGGCTG 
GGTGCGGTGGCTCAACCCTGTAATCCCAGCACTCT AGAAGGCCAAGACAGACAGATCACC 
TGAGGTCAGGAGTTCAAGACCAG 
Sequence 1201 

CCCTTTCGAGCGGCCGCCCGGGCAGGT ACGGAAGaGTAAGTGGGGaGGGATGGGAATGGT 
TCCTTGAGACAATCTTTTACTACAGTAGATGCTTCATGGATGGGAGAGTAGGGACTGGTG 
ACTTATTTATAGCCTTCTCTTTTAAAAAAGGACCCATTTCTCTCTTGAATGGTGTGGTGA 
AAATTAj " 
TAAGGG 
Sequence 1202 
CCCTTAGCGTGGTO 



1 1 nun 1 1 1 mmmu 1 1 1 h i i i 1 1 1 1 n ii i ii i i i n 

NTTAAANNTTTNTNTNCCAAAAANTTTNNTTTGGAAATNC^AA^ 
TTN NTNCCN TTAAN GAN CNAAANTTTNAANAAAAANTTTT 
Sequence 1203

CCCTTTCGAGCGGCC GCCCGGGCAGG TAC TAG TCCATTCTCACACTGCT ATGAAGAAAT A 
CCTGAGACTGAGTAATTTATAAGGGGAAGAGGTGTAATTGACTCACAGTTCTGCAGGGCT 
GGGGAGCCCTCAGGAAACTTATAATCATGGCAGAAGGTGAAGCAAGCATGTCCTTCGCAT 
GGCAATGGCAGGGAGAAGTACCTCGGCCGCGAC CACGCTAAGGG 
Sequence 1204 

CCCTT AGCGTGGTCGCGGCCGAGGT ACTTTTTTCCT ACAAATGAGT AATTGAAGAATTTT 

GTTTAGCCAGACCATTTAATTCTCATCAATTCCATAATATTTCTAGTTAAATC^ 

CATTCTATA TTAAG TAACATTTTATTCAGATCCATATCTAAATAGCAATTTTO 

TACTAAGAAI 1 1 1 1 CCTGGT ATGTATGGTTTTGGTGTATTGGAATGTACCTGCCCGGGCG 

GCCGCTCAAGGGCN 

Sequence 1205 

CCCTT AGCGTGGTCGCGGCCGAGGTACCAGAAGCTAATCCCCACCGG<3GTTGGTTTAAAT 
AGGGAC T AAC TACTTTGGAGGACATGGAAGATACC TCAAGTTTAAATGCTT ATAAACC AA 
GGCTCAGCAATATTCT AGTTAAT AC TCT AGAGGAATGC TTGCACAGTGCCCAAGAAGGTA 
TTAAAAGAATGTTTATTCAGGTGTTATTTGTCATAG TGAAATACTGGAAGCACTGT AACG 



CXiCTTAGCGGCCGOCCGGGCAGGTACAAACAA l 1 1 1 1 1 1 IA ACTAGCAGGGCATGGTGGT 
TTGTGCCTTTAGCCCTAGCTACTTGGGAGTCTGAGGCAGGAGCACTTGCT^ 
AGTTTGAGAATACAGTAAACTGT ATCACACCAC TACACT CCAGCCTGGGT GAGAGAACAA 
AACCCTGTCTGAGAAAAAAAAAATTAAACTGAGATGCATTTCCCCCCTTTTACACTAAGA 
AACAGACCCTTCT PI GTTTCTCACTGGCCGCCAAAGGGAATGCTGTATGAGCATTTCAGG 
TGCAGATGCAGCTGC GATATCAGAAGACCCC 
Sequence 1207



AGATC^AGATTATTGTGTCTA»AAGAAAACn"GGGAATATATATACTTGACCCGCTTCAC^ 
GCTTACATTGTCTGTCTGATTCTTCCAGGCATTAATTAGAATTrGCAACTCCT 

AGGTCAGGAGTTCAAGACAAGCCTGGCCAACATGGCAAAACCGCATCTCT 
CCTCGGCCGCGACCACGCTAAGGGCGA 
Sequence 1208



TAGCCAGAACAAGAAGTTGAAjGGTAAACATAGAATATTTTATACAGGCACTCACACC 
CATTTCUjGAAAAGGATTAGGAATCCAGATGCCGTGAATTTAACTATTCGTTACAGGCTTG 
TCCTGCAATATGCTCTGGAGCAACTTGCCTGCAGAGATTTCTGTATCCaCGGACaTTTAA 
ATATCGCAAAGGCTATCTCCAGGCAAGTATGTTCCTTTGCTTGTCATCCCCGCGTACCTG 

Sequence 1209 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGGAGGTCTiXATTCAGTAGGTGGCCC 

GGGATGAAGGCCAGTGTTGNGGCTAAACCACACTCTGGAATTCTGTCAGCAAATTCCTNG 

CTGTGTGAACTTGAGCAAGCCATTCACC I II C 1 1 AAGCCA TTTTC TT GAT ATTTCACAGA 

GCCTCACCAAGT A7 TCAACGAGAACATGT AAG T GAAATGCT TCAGAAAATGCC TGGT AAA 

TAATAGATGCTTAGAAAATGGTAGAGAGAGAAAAGAGCAGTCTCTGCCCTTTAATGTACC 

TCGGCCGCGACCACGCTAAGGGCGAATTCCAG 

Sequence 1210 

CCCTT AGCGTGGTCGCGGCCGAGGTNCATTGTGAGAACTCTGGAATTATTATTTTTATTT 
NATTATTACTATATTTTATCTGACTAGAAGCCATTTATTACCAAACCAATTTATTTCTTA 
tMGTTG AAAACCGTCTGTGAGAAGCTTCTCTGCtCCTGGATGGAGATCCAGCGCTTTTTTT 
I II n IGAGGCAGAGTCTTGTTCTGTC»#CCAGGCAGGAGTGCANTGGCACGATCTNTGNT 

tactgcaacctccacctcctgggttcaagcaattctcct 
Sequence 1211

ccctt agcgtggtcgcggccgaggtactcctgccaagagggcgacaagttcaagctgagt 
AAGGGGGAAATGAAGGAACTTNCGCACAAGGGGCTGCCCAGCmGTGGGGCATTCCAGA
GAACCATGTGCTGTGAGGGCC ttccgagtccatctgtttaatcctgtcattggagacttg 
agaaaccagagoccagaagggaaaagtgattgtcccaagatcacacagcactggagaaac 
tggatgaggaggggctgaagaagctgatgggcancctggatgaga 

Sequence 1212

CCCTTCGAGCGGCCGCCCGGGCAGGT ACAT ACAGTTTACATTGTGGTAACAAAGTAGGAC 
ATGC TA TGAAGGCCCTTTCAATTCGC TTGACAAGAATGACAGAGATCTACT AGACCCAAT 
TTTTAAATAATATTGCTGGTTTTTGCTCAACATGAATTAAAATATGGTGGCTAATGTGCA 
GATTTTACATTTGGAGAACTTrAATTTTCAGTATTAATTAGAATTTG TTTAATATTACAA 
ATGCATTTAATGACACTTAAAATTGT ACCTCGGCCGCGACCACGCT AAGGG 
Sequence 1213

CCCTT AGCGTGGTCGCCGCCGAGGTACCAATAAGCATACCT AGAGTTGAGATTTTGGTTT 

CTAAATGCCATTCTCCAATTAAAAAGGAATCAAAGCACCTCAGATAAATGTTTAATTCCA 

GGGCTGGGGCAGGGAAAGTGAAAGAGAATCACAGAACATCCTGTAATGACAGAAAAAAGT 

CACAATAAATGGTGGGATTATGTCAAAAGGACATGGGATTCAACTTGAAAGATCTTCCAA 

TAGCCAAATCTGAGAAAAGTTAAGCAACAAAAAAAATAAC A AAATCTTATAATCTATAGA 

AAAAATATGAATGTATA 

Sequence 1214
CCCTTM 



1 1 I I I I I I I 1 1 1 1 1 1 I I I I Hi 1 I A NAAATNGO 
CGGCAGTTTATTAGTCACAACTGCTCACAGGGAGGGACGTCACC^ 

TCACAGGANAGTTGCATTTGGGAATANAGTGAACCANTAGGGGCTGTGG*AGGCAGGCTT 
TGCAGTAACAAGAGGAAGAGGCGArrCCTGGGTGCTCCAAATGTGACAGGC ITOTI 1GAA 
TAATTTTCCAOGCTGGAGGGAAGTGAGCCACGTTGANACCCANGGAG 
Sequence 1215 

AGO OQCCG COCGGGCAGGWACAATTAATTGTGTT C n GTGACCTGATGATTTTfNGAAAA 
TTTGCTTTTCTCn I AAGAAATTTAAGTTTTCAAGGGCCGT ATTAGTTATCTAAATATTT 
TCGGCTAATGTTGACTT AT AAAT AAAT AAAAATTT AGAAATAT ATTCATGATGACAATTT 
TGTT ACTTACACTGCCTATTCTTTA I I TCT T TT 1 I A GTTGAAA GGTGAA ATTTTG ACCTT 

AT 

Sequence 1216

CCCTTA<3CGTGGTCGCGGCCGAGGTACAMGGAGGAANTMAGAi4GTAAATNNA^ACCAGAN 

CTGTTGAACAAACTG 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACTGTGCTmAGCCTTGGT 

CTGTCTTAAAAAAAAAA AAAACAMAAAAA A M AATTRATTAAAAATTTAAAA^AATGAAA 

AAAAGCTGCATGCTTGN 1 1 111 GTTTTTAGTTATTCTACATTGTTGCCATTATTACCAAA 

TNTNGGGGAAAATKCAACTTACAGACCAATNTCAGGAGTTAAATGTTACTACGAAGG 

ATGAACTATGTGTAATGAACCTGGT AGGCATT ATTTATTGAATTNTHANCATTCCA»IATG 

TCCAGCACATTTTTAAT 

Sequence 1218

c ccttagc gtggtogcggccgaggtacaatgttaaaataatctgagttttctatgatttg 

GCTTTTCTGCC TTGAGT AACT ATNTAAGAT ATC TAGCG TGATNTTNTTTNATNTGGGCTA 
cn l n AGAACAAAACANAGGTNTTTAMAANAAACCACTTGCCCACANGGMCTTTTGA^ 
CGTTTACCTAAGTCAAGTGTAATTGAAAAACATAACCAAATGCACCANGGGGTNT ATTGT 

Sequence 1219 

OCCTTAGCGTGGTCGCGGCCGAGGTACC! rTTTTTTTTTTTTm ICGTCAAAGTCACTA 
TTTGGGCCCTAACATAATC CTGCT CANAGC GACGGAAAAAAGGCAAGCCTTTTCAAACAT 
AACTCTCTCTACAAGCCAGCTATTATGGCAAGGGAAAAAAGAAAGCATCTAGATAAATAT 
CTATCAAAATTAACTTrAANAGAAATACTCTCTTTCCT^ 

CACTANAAAATAAGTTACTATAAAAAGTGGTGGTCTGGGGGGTAAAAACAAAACAAAAAA 

AATCCTCTTTTCTACATTTTTTAGTTTT 

Sequence 1220 

CCCTT AGCGTGGTCGCGGCCGaGGTACAGAATTATCAACTGATTTGGTCA 



ATGCTGGTTGATTTCCCTC^TTGTGTAAACATTGACAGGTATGTGACAAATGGGGAAAAA 
AAATCCAAATAATAAAGTGACATATTGGTGTTCATAAAAAAAAAAAAAAAAAMAA A AAAA 
NAAGTCC rrTTTTTTTTKT mi l T 1 1 1 W 1 1 A CTTtMTAAAAAMACWGAGTTTTATTTCA 
I^TGTNTNTTnnTTGMGNCCCCACCKITTWNATG ACMACTNTHTCCT 
NTNATAACATTNCCATACATACTTAAAAC 
Sequence 1221 

tt aggaa tttcacaca t aaaacct ggagagatagcacatgc j l i rfctttcttci iggac 
tgtgagctgtacctgcccgggoggccgctaaggg 

a gttgaV Utgatatgattta^ 

tcxrrrrcgnactgggctggaahacctaaaactacmtgt aaaatgtaagtagngaccaata 
aaaaataaggwtaccttaac1 i uji i ii i icc t 

Sequence 1223 

CC CTTTOGA GCGGOGGGOTOG^ 

TTCTGGAAACI C CI I fCTCT T CCT TA CATCCAGGAA Al I rGCTGTI I A TTTTGAAAAGCA 
AATTTAAACC T ATTTAAGGGAGAGAGAGCTCTTGTAAAAATTCAT7TATTAGTTCTGGAC 
CAATGTTATTTATAAGCTATTATTTCAAATGATAAAAAATAAATGCAT^ATACATTTGAT 
GATAGAACATTTTTCTTTTT 

Sequence 1224

TGTGAAATGTGGGGAAGGTTTATAAGTAAATGAATGAGGGGTGAGGTTGTTACCATTAAT 
GHGCCTTGAAGTNATATTTGTGGATAGCTAAAAGCAA I I ITTGGTI I ATTTGGTTTATTG 
TTTGGTTTA 
Sequence 1225 

CCCTT AGCGTGGTGGCGGCCGAGGT ACATCATTTGATGTATGTTTTTGTTTTTTTAACAT 
AAAAGGATTATATCCTTTTCCGCC*GCTGTTTT^^ 

ACATATGTTGGATGGGTTTCTATAACATTTGAAATGACTGCCAAATATTTCACTGTATGA 

TCATCATTTAATATTATTATCAATTTTGTATATTTAAGTTAGAACTTTTC 

AACATCATT ATGAATGAGCTTTCTTGAACTGTATTTTAAT ATACTTCCTT AGGATAAATG 

C TT AAAGT AATAA 

Sequence 1226

COCTTTCGAGCCGGCCGCCCGGGCAGGT ACATATACACTaTGT AATT AAAAANGC GTGCA 
TGTGT ATGTATTAAAAATAATGGTTATATAAACAAATACAATATATACCAATAAAACACC 
TAAACGCAGAGGCTGCGTGATATCCACAATAGTAATACCAATAGTATTAATGATGTNTAT 
G T AAACACAAAGAAAAGCAGCGGACCGT ATT AATAGGCAAACACACAAAAGCACACAAAA 
GCAAAGCAAAAAGCCCGCCAGTAATGTT 
Sequence 1227

CCCTTTCAAGCGGCXXaAWCCGGCAGGTA^ 

AA AAAAT TACCCTrAAGTCTTTGACATNACAQCCCCTGTCACTrCTTGTCAWAGTTTGTA 
TG TGTTGHTAATNGGAATGTCTATTTCTTTAAAGAGCAGAGAACTACAGTTACAGGGGTT 
ACAGTGTGAGGGGTGACACATTGCTGGATTCT GAGCTCAGGCAAGTCTGTCTGTGTCTTT 
ATTAATAGAGGTCTATCTTTrCTTAATACTGAATGCAATGGACCATTCCAACCTAAGTTA 
TC TNGAT ATAC TGGGATTACAAT A 
Sequence 1228 

CCCTT AGC GTGGTCGCGGCCGAGGTAC 1 1 1 1 I 1 1 II I I I 1 1 1 1 1 1 1 1 I I H I 1 1 1 ANANA 

CAGAGTCTCCCTGTGTTGCCCAGGCTGGTCTCAAACTCCTACGCTTGAGGAATCTTCCCC 

CTTGGCCTCCCAAAGTGCTGGGATTACAAGCATGAGTCACCATGCCCAGCCAATAATGAT 

TTCTTGATTGAAGGAATGAATGAATTAAAAGGTTCATCTTTGGACACAAAGGCAKACAAA 

AGTTTGACAAAAGGCATTTTTGAAACTAGGACCTTTATTNTAATATTAGTCTAAACAX3NG 

GGA 

Sequence 1229 

CCC TTTCGaGCGGCCG CCCGGGCAGGCACAGAAAAAAAATC T ACACCAGGTAACAC TGGA 
GGATGCAGGGC TACATTTGCCACTGAAGAAACA TTGTTCTCTTGCATCTGAATTCCAGTG
CT1TCCAAATAGATGCGTAGATGATGAAAAATGGAGGAGCTTCTTTTA1 1 1 CI ICI ICI I 
TCCTCCTTGAATTCT AGTACTTTGTGAACTGTTGAGGTGTCCCTTCCTAAGTCaCAATTC 
ACACTGATGCATACACTATAGTGAAACACTGGCTTTAAGAAAACTGATTAACAGAAAACC 
GGCAATTGTTATTTATTTTAAA 
Sequence 1230

ccctttgagcgggcccccgggcaggtacaggttctaaaa^ 
cttagtgatattagtggatmgtgtagacaataatattagtcctaga 

Sequence 1231

CCC 1 1 1 gagccggcccgoccgggcaggt actccataatataatcttttaaatgggcaact 
tctaaatattgatcaaccattaataataatgctt ATAGGGGT AAAAGAAAATntMTTGAAG 
cactgaattcagtaacctgggtcatggtccaa mtgci gact acttcatatcttttatg 
tagaataattcctathaacatgttccctaaattcccat 

• aaatta ttcaaatgtagctatttaaccgtcagthacaatgcctagaaacct atttattca 
tctgtaatattaagaaggctgaatttgattggatcttgaaaaatcc *• o.'-. 



CTTCTTAAGAATGGCC A n G G CTT CCGGAGOC 

CCTTTTAGCCCGTTGGGGTTCCGCCGGGGCCCCGAAAGGTTACCA I I I > I WAAAAAAAGG 
GGGCGGATGGCCTTAAATAACCTTTTT^ 



Sequence 1233

CCCTTTGAGCG<SCCGCCCGGGCAGGTACTCC^ 

TAAATATTGATNCA^CCATTAATAATAATGCTTATAGGGNAAAAGAAAATTTTTGAAGCA 
CTGAATTCAGTAACCTGGGTCATGGTCCAA ITTTGCl CACTACTTCATATNTTTTATGTN 
GGATTATTCCTATAAACATGI I CGCTAAATTCCCATCANTTrGMAAAGWCAATGGATTAA 
ATTATTCAAATGTGGCTATTTAACGGCCAGKAAACANTGCCT AGAAACCT AT ' 



CCCTTAGCGTGGTCGCGGCCGAGGTACAGTTrTTGCHGA 

AGGCGGGAWAAAAAAATTn] 1 1 I I M I T ATTATMTTGGATCT AGCCT AWNTCTATTTTTC 
CACCTGCCCCAATT AGGT ATTTCCCANTTGCNACCGGCCT AATTCCANAATT AAATTTGT 
MCCTyTT ATAATTNGTTTWCTKWANTCCAATTGAAACCCCTTTT GGGGTT ATTGXNTCCH 
CHCACAC 1 1 1 1 1 1 HATT GTTTAAAWGCAW T AAAAAACCAmNTTCKTCGGMT AT AT AAA 
^TAASACGnKXrrrTTTACNTTATNGTTAATTAAAAANCCNCAA 1 ICC 1 1 11 MGTTNGMCC 
AACCCACTTGGAAAAfnTCCAAfaAAACCTCTNCCT^ 

AGGAAAGTAACCCCC1 1 A*TTGKHAAAAGG><WTGGGGGAAAWNTTTKGGGCC.1 1 1 rS GNGG 
TTMCCGNAAAAAMAAGGGGMTAAC 

CCCTTCGGCCGCCCGGGCAGGTACTCTGTAAGTCT 
A CTTC TCCCCCACAATTTrrAATCAAGCACCTCCCAGTAAGAAGTTATTT 
ATTTTAAGTTGACAACAGATGT ATCAGATGAGGAAAAAATTGAGCATGTGTGGT GTGATT 
ATATAATAGAATTGGTTTCTATAAACCATTTATAGTATTCAACTTTTATAGTATTACTTT 
TTCAGATGTATGGATATATAGACT ATT ATTTACT AACT GAGGCTCTGCGAAGTGTAGTGT 
AT 

Sequence 1236 

cccttagcgtggtccgcggccgaggtactcggatctnttatnn 
ttcgtctacaagccacacttatwcaaaattntgtggacaactc 
tgcttanattctcctawttagtccctgagggtttataccttttattctt^ 
TTAACAGAGG mCTGl cggaagcagagttaaatgcctatgttnactccatcatggtt AT 

CTGAAAGTCTGAGGNGCAATTTCAAAAACTCA 
Sequence 1237 

cccttagcgtggtcgcggcccaggt acttct gactaaact ggaatt atgagtgaggaaga 
gmgwatta ctama taaatgactggggcaangcaaaattgaggaggaaattahaaact 
tgacaamacttttaaagagoctactttgaaatnacagaagt^ 
aatggctagaaagtatggtttaactggaccctattatgcctttt 

Sequence 1238



ccctttcgagcggccgcccgggcaggtacaaagc^^ 
tacaaacccgaaactgtntacacccagactttattc^ 

cacaat ctga acagtagcagtgaaagggagtttaaggtgggggtgagggagaagggagta 
atatggttttttagtaatatagtaatttaca 

Sequence 1239

CCCTTTGGCCGCCCGGGCAGGTACGCX5GGGCGCTAT 
A 

AWAftAACCTnrmTWTWnTAGGACCCACT 

TTTCAGTGGAAAACAAGTTGGTCCGTTACTAGCT AAGATG TGTTTTGTACCTCGGCCCGC 

GACCACMCTAAGGGOiAATTTCGAGCACACTGGCGGCM 

Sequence 1240 

CCCTT AGCGTT5GTCGCGGCCGAGGTACGCGGGCTACCAAACCTGCATTAAAAATTT 
TGGGGCGACCTCGGAGCAGAACCCAACCTCCCACC^ 

A GCTC TGTAACATAGTTTGAAGTTGGGTAATGTG A TTCC I UI AGCTTTGTTAGCTCTGTT 
GTTTTCACTTAAGTATTACTTTAACTATTAGGGL t I I 1 1 1 1 1 GGTTCCATATAAATTGTA 
AA ATAAA TTTTTCCAGTIC IG 1 GAAGAATN i CATCGGTA G 1 1 IG ATAGGAATAACATTGA 
^TCTGTACCTGCCGGGGCGGCCGCTCG 

CCCTTTGAGCGGCCGCCCGGGCAGGTGGATCACTTGAGGAGT^ 
AACATGGCGAAGCCCCATCTCTACTAAAAATACAAAAATTAGCTGGG 

TGTGCCXJCGGTAATTAAnflNCCCNANCTT ACCTTTGNGGAAAAACTTGAAGGGCCAGGGA 
AGAAAATTT<^TOT>rrTTGGNAAACCCCCNCC*rr AAGGGTTGGGGAAGGGATTTGGCCAAG 
G TTGGAAG TTTCMAAAAGGAATVrGGCCAACCACAAGGMT GNCCAACC TTCNCCAAAGCC 
CCCTTGGGGGWCCCAAAA W NMAAGwnTGGAWGTAAC Cl ITC CC CAA1 TCI 1 1 INAATNAT 
A TT ACANN7 ATNT AGAT AHACNMT A T AANAGNGA^0>iMGA^(A^^TJGGNT>4ACCCCCT^^(N G 
GAGGCM0CGGNCMGNAACCCCCANCC9AMCCTTTAANAGGGGGG 
Sequence 1242

ccctttgagcggccqcccg^ 

ccgtagtagtcccagctacrtgggaagactgaggcagg 
tggaggttgcagtgatstcaaagattgcaccagtgcactccagc^ 
tccatctcaaaaaaaaaaaaaaaaaaaaaaagtcctngggccgcgaccackct aaggg 
aattccaacacactggcggnccgttactaatggatcccagctcgg . 

Sequence 1243

CCCTT AGCGTGGTCGCGGOCGAGGTACAGAATTCA G n'I C I G GGGAAAGTGAAGCNTGAA 

GGGAATCATAMGAAAAATTTGATTTTTGTGTATGGTGTAAGA^ 

ATCTTTTTGCCACAlfrGGGATTNTCCCAGGCC 

GGAAAAGGAAGKAACTTACTCNTTNTTTCCCCCGCTTTTTTGGTCGGGAA/^ ATCCTTTT 
GGGG MCAAAACCTCTTATG*nTTGGGWAAAAGAGWGC<JC C> l 1 1 IC ACCTTTTTTGWCCTT 
TTTCWAACCTC TTWC AATTGGGGGTCTTTCCACCCAATTAACC^ 
CCTTWG<»AAGKTTTKCAI<CCCTTCCCCCAATTCCTTATCKKG 

AACCWTTG GTTGC TGCWrrrTCOGTTTCnTrTTAAAAM 1 1 1 I rCCTCMCCGGGGHAACTGG 
GAAACCTGGTTTTGGCHTTCCAACOCTT*IGn1CATTTG#<C 
AAAGIiAAAAGGWCCTTnlGinTTTGTIWGGCC^^ 
Sequence 1244 

CCCTTAGCGTGGTCGCGGCCCGA»^ACAAATAAN G TCTr c 

ATGATMTCTTCCAGCTTGGGGACATTTGGGAAATTGGGATTCTTTGGGGAA^ 

ATCAGTATATTCTGGGAAAACATAKTANAGAATGAATIOWTAAATTI^ 

ATATGTTGTCCATTCTCCCTGTAACTAATGCTATCAAGATANAGTAGAAATACCACATTT 

CAAAAWC AGCTG GAGT ARACAGGT CTTCATAGGCT AGCTTGGAAACCT AATAGCT ATT AA 

T AATGAAATTTTAATTATACT CTGGATTCT AAACAATGAACACACANTGATCTTTTTGAC 

TT 

Sequence 1245 

CCC TTAGCGTGGTCGCGGCCGAGGT ACAGATGTGTCCTTTCTT ATAGTCNGTCAATGC TG 

ggaagtaacaggcagatgtgacttcacttgamcatttggamgaancaaaaaaggttgc^ 

TTGWTCGWCCTT AGGGTTTAGATGGGCAAGGAXJL I I GC I III >GCNTCCCCAATTTCTT 
AGGGTAGNTGTTNTTCTTTG^TTGCANGGGATNNGTANACCGGTACATCCTTCTTGNNG
GAACCAAGGGG*WNACKTTATGAANTGNAAAAGGGGANGTTCCTTTGTAGTAAANGGCCT 
TGGATTGGTTTTCAAANMGGNAAGNTGGGGTTCCACCA 
Sequence 1246 

CCCTTAGCGTGGTCGCGGCCGAGGATAC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 GNCTAATT ACTA 
CCTTmATTCTAATTGTGAACCATGGCCCTGAAAGCTTGATAANCAAGA 
CCAGAAGGGG*WACTAAGTGNGGTTCGGXCAAGNAA^ 
AMTCAANTGGWCTTMTTCCCTT 

Sequence 1247

AAAANCTGTAATTCTTT ATTTGAAACAANTGCNTT CAAAAGAANTNAAAACACTTCAAGG 
ACTT CT ACT AAACATAAAAGGT CNAACAAACT GTGGCAAAAANTTTT GCAATTNGTANAT 
AAGCTAANATAGGGGlTAACNAGTACCCCAGGCCAflA^ 
ACTTCCAMTCAMNNAAAAAGGG 
Sequence 1248

CCU I rCGAGCGGCOGCCCGGGCAGGTMCTAVCCCT ATGAGGCATAATTATAACAAGCTC 
CATCTGCCTACGACAAAOSGACCTAAAAATCGCTCATTGCATACT 

CAATAGGCCC IT MG GN TAGTT AACCACCCCATTTCTTCATTCCAAAACCCCNCCCTGMAA 

AGCATTMNAACTOGGGNNGCCANNTTCAATNTCTTACAATNAAATCCGCCNCCCAACCGG 

GGCCTTTTAACAATTNCCCTNCCAATATTACCTTAATTTNCTTGGGCCCT^ 

AANCNTGCAAAAACCTTAACGGNAAACCGGGC^ACCTTCCANCCCAAGGNTGCGGCCAAT 

TTCNATTAAAATTNCCCTNCNTTCCTACCAANAGGGGGA 

Sequence 1249 

CCCTTAGCGTGGTCGCGGCCGAGGTACTATATGTTGCTCTCTCAGTGGCAACAATGAAGT 
TTTTGCAATTCTAGAACTTGGA 1 1 1 1 1 1 1 1 1 1 AACAAAAGTCCCAAAACACCAAAAATGT 
AAACAAGATANMGAGATTAATATTGNAGTGGNfKSTAATTTAATTAAAGTTATATTTTGGG 
TTAA1 1 1 1 1 AACAACTGAAGTC TTATT GTTGAAACTT ATTTTT CA 
Sequence 1250 

CTNTACATGCATCCTCCAGCGGCCGCCATGTGATGGATATCTGCANAATTCCCCT 
TGGTCNGCGGCCGANGT ACTT AGGTGCCTACAA CAT AAACAGCA 
Sequence 1251 

AGGAGCCTGTTCTGT^^ 

ATACCGCCATCTTCAGCAAACCCTGATGAAGGCTACAAAGTAAGCGCAAGTACCTNGGCC 

GCGACCACGCTAAGGG 

Sequence 1252

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCT ATTATTATTTCAAATTTAAAAACTTCTTC 
TTTTrrAAGAGATAGGGTATCACTATGTTGCCCAGGCTGATCTTGAACTCTTGGCCTCAG 
ATGATCXrrCCTGGGTTCAAGTGATTCTTCTGCCT CAGCCTCCCTCTTATTTGCTTTACAA 
GTOCTGCTTC AGGG T TAOCTTCCCT G ACCACTGCTGCCTCCCTC C CAGCATTTGCCA GGG 
ACTGTCATTGCCTTAGTTTAI M TTTCTui 1 1 iGTITI II M I IGICGI I 1 1 IGl Mill 
TTTGAGACAGCGTTCTT AGTCTGTCGCCAAGGCTGNGAGTTGCAGTTGGCCGCAATC 
Sequence 1253

CCCTTAGCGTGGTCGCGGCCGAGGTA C TT nT I I I I II 1 1 n 1 II 1 1 H I 11 1 II I ACTT 
TANT AGAGATGGGGTTTTACCATGTTGGCCAGGCTGGTCTTGAACTCNTGACCTCAGGTG 
ATCCACACGCTTCAMCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCAGC 
CTAAATATTT>iTTTATAGCAATGCAAGGATGGCCTAACACACTGCCTAAATCAAAATTGC 
TATTCACTTCAAGGGTATTTCATTTACCTGACT AGC 1 1 HI I GGGTGCATNTGGAACATA 
ATGTA 

Sequence 1254

CC C I 1 1 U GAGCGGCCGCCCGGGCAGGTACAGTCTTTTATCTTGGGATAAAATGGCTAGAT 

GAACCACAATCAACTTCTCCAAACACCCACCTTrGTCTTCTACCACAATAGGGGTCAGAT 
CTATTGCTGACTTTTCCTCCACCTTCTCTAI 



TTGAGACTATACCTAAAGGAAGAACATTCTCCTCTGTTGCACACTATTATCCAATTGGAT 

agacccacatctaaatgtctgcaattacagtaatgtcagctgggcattggtggctcatgc 
ctgtaatcccanc 

Sequence 1255 

gaattcgcccttagcgtggtcggggccgaggtac 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 ic 

M II I F I I I I I I 1 1 H I 1 1 1 1 AWAATAACAAAAAATTTTTTACTNAAACATAAANATTTN 
CAGA*K3TTTCCNNACAAKKX*fTNCAAAATGGTCACAANCTTTTTTHA 
Sequence 1256 

CT OI I M 1 1 1 1 1 M I II 1 M II 1 1 



T TTTTT AGMT 

TTCCTTTTTAATGAGCTGACCCTTTAACACAAAAAAAGCAGGGGTGArc 

AGGAAGTGGAAATAAAAAAATCTCAAAGCTATTTGAGTTCTCGTCTGTCCCTANCAKTCT 

TTCTTCANCTCACTTGGCTCTCTANATCCACTGT GGTTGGCAGTHTGACCAGAATCATGG 

AATTTGCTANAACTGWGGAAGCTTNTACTtXrTGCAAGTAAGCANANATC 

ATAACTTGGTTATTTGAGCCNCGTNTTTTTGCAA^ 

AATTTACTTTCAATNGTTTTTAAAAAAA 

Sequence 1257

GGCTTAGCGTGGTCGCGGCCGAGGTA C 1 1 1 III M l I M l II t II I I II I 1 1 I IMGGGTT 

TCAAACCTCAGTITGAAAATGAGAGGAAAACAAAATAAAATGATTTAiCATAATCAAAGGA 

TTAACT GATACAGACTTTTATTCTAAATGCTCACAAGCACAGAAAGCAACAAGAAATCAG 

ATC TTGAACGAATTT ATAATGATTC TTCCAGGAAGCACCGNGGCAGCCACATAAGCCGCT 

NTTCACACCTGGCTGChrrrCTGOCAAGTTTAGTCCTCAAAGAGAAAAAGAAGGGAGGHAA 

AAGACCNAAAAAAAAAAACAAAA 

Sequence 1258

CCCTT AGCG TGG TCG CGGC CGAGGT ACC TTGC TGG TTAAT AT AACTAAGATTTTGCCTTT 
ATTGGGTTAGGTAT C I 1 1 M l I I A TTTTAGCACCTGATAGCTGTCTTTCTACTGAGTAAA 

gaattataacttttagatgtcagagaaaattagagtatttattgtcaaa 

Sequence 1259 

cc c 1 1 1 cgagcggcggcccgggcaggtacttcaagaattccaaaag n*m gactgaaat 
aagcaaactcactaatgattatgaactgaaca^ 

atacctcttccttcj^agtaagtttgccatgcctaccatatgtgtgagtggtattctggaa 



atccctgtggtg ci I 1 1 ccatcctgatctacagatattcagaactgcagggagttccttt 
tagtcctggcaatctgaacctg ai i i i i r oc 

Sequence 1260 

c^ctttcgagcggccgcc^gggg^ggtactggtgggattgttaga^ 

agtgcaccttggagtctgtggaggtctcaagaatatctcttttggacgtgaccaggataa 

caagattgccgtaaaaaactgtgatggtgtgcctgcccttgtgcgattgcttcgaaaggc 

tcgtgatatggaccttactgaagttattaccggtgagttctaggc^aaggaaaattgct 

aagtcagtgttactctctagtgatgttgagaactagagggatttccagaccttttactt^ 

ttgatgaa/m^ttgtgaactggtggctgtgggtcaaaatccatctcacag^tttgtt^ 

tggatc 

Sequence 1261 

CCCTT AGCGTGGTCGCGGCCGAGGT A C 1 1 M 1 I 1 1 II 1 1 I I I I U 1 1 1 I I I I G l I I IGCC 

tcctctgactatattttcaaatagtctgtcttcaaggtcagnaattci I MCI ICTGGCA 

TGATCV>ACTCTGCTNTTAAAGGACTCTGATGCATTCTTCAGTATGTGAACTGC 1 1 1 M IC 

AGCTCU2ANAATTTCTGCTTCATTCTTTTAAATTCAATCTCTGTTAAATGTATNTGGTAAA 

A7TCTGAATTCCTTCTCTTTGTTATCTTGAATTTCTCTGGAGTTTTCCTCACTTATTTTG 

AATTCTGTCTTGAAAGGTCACAATCNCTGTTTTCTTAAGGGATTGGGGCCCTGGGTAACT 

TTATTTTAAAA 

ATCAAGCCTGGTTCCTAGGATGCTGGAC 
TTCTAGCTTAGTGAGAATGCAGTATAC I 1 I 1 1 GAAAACTTCGTGC AGGAATCCCTCAAAT 
GCTGTAACTAGGAATGGGTCAGTGAAGTTCAAACGACTTTTCCTTGAGGGAGTATTTTAA 

tcggacaagggaactctttttcttttgggcaatggcx^aacaggactgagaagccagagag 
cttgcacctgagccatctcagccgtgagagtaacagtcctaggaaaatagatgggggctg 

GGGGTAAGGAAAT 
Sequence 1263

CCCTT AGCGTGGTCGCGGCCGAGGTACTl 11 1 11111111 1 1 1 11111111 1 1 AGGGGTT 
TTCTITGrrAGAGACAGGGTCTCACTGTATTGCGCCAGGCTGGTC^ 
AAGTGATCX7TCCTGCCTTGGGCTCATGAAGTGCTGGGATTACAGGTGTGAG 
CTGACCTATATTTAATTTTTTAAAGATTAGACTGGTGTTAGCTGTAAATAGTTTGAAATA 
CCTCTCTGATAGGTGCTAGCTTATCGTTACT CTTAGTGCTTCTTGCATTTGCAT 
Sequence 1264 

CCCTTTCGAGCGGCCGCCCGGGCAGG TACTTTGTGTTTAAGAGAAATTCCTAAACTGGAT 
ATATGTGGGAGGCTGAAAGCAC TGTG AG TTGAAGTCAAGGGGAGAGXjTCCAGGCGCAGTG 

AAGTTTGAGACCAACTTGGGCAACATAGCAAGACCTCGTCTCTACAAAA^ 
NAATANTAATNTAAATTAJ 
CAGCCACCACTGGCCGGC 
Sequence 1265 

CCCTTTCGAGCGGCCGCCCGGGCAGGT ACCTTATTGTTAAAGTGAGTCAGATAAATCTTC 
AATTCCTGGCTATTTGGGCAATTGAATCATCATGGACTGTATAATGCAATCAGATTATTT 
TGTTTCTAGAjCATCCTTGAATTACACCAAAGAACATGAAATTTA^ 
TTATTrATTTCATGCATTCATTTTATTTXXX^TTAAGGTCn'GGATGAGA C 1 1 C 1 1 1 G GGGA 
GCCTCTAAAAAAATITTTCACTC 



CTCCAGGGCTCCTTCCCAAGTGCCTANAAGGGTGCTT^ 

GCCCAGGGGGCT 

Sequence 1266

CCCTT AGCGGCCGCCCGGGCAGGT ACTCAACACT GATTTGAGAAGAAAAGTGTGATTTGC 

TTACCTGTGATTTTGAGACCTATATAGTGAAGGTTTGTTGCCAC ITTTTAGTTTCCT C AA 

ACATGCAGAAGTAATGAGGTTTGACAGAGACATGAGACTATAAGATGTCTGTCATTGCTG 

CCAACCATGGAAAAGATGTT AAGATGT CCAGCTGCCCATAAAATCAT ATTTTCAAAGTGT 

GAGACACGAAGAATATCTTTCTCTTA1TTGGAAATATGCTGAAG 

AGGATTNCAGTAAAATGGGAGNC 

Sequence 1267

CCCTTAGCGTGGTCGCGGCCGAGGT ACTTTA 1 1 1 1 1 1 1 1 1 IN I 1 1 1 I I f GGNTTCTGTAA 
ACTNTKATTTTACACTTATGGGCCACTTGCCAACTCAGGGG^ 

TTTTCT ACAAAGG TTTACTTTGG TTG TAAAAGATGT AGTTAANAGGGG T ANGAANAATTT 
NNGGAATmATTTTNtrTTGGCTTNGGTNAAAAACCT 

TTCXIXiAATTAAATATTAAANAANTTNGGN CAACCGTTTTGTT ACCNT CNCCTTTTCNAGG 
AAAAAAATTCCTT ATTTGG NACC I I N I I I C I IGGftAAATTTTTTNAKTAAAANAAAAKTG 
GGGCCAI 1 1 1 UNIT 11 1 

.TACGCGGGGGGCTTTGCAGATGTGATTAAGCAAA 
GGACCCCAGATGGGGAGATTATTTTGAATTACCTAGGT GGGACTOCACGTCATCACAAGG 
GTCA GAATCCAAAGAGATGTGAGAATGAAAAGCACAAGTGAGAGCAGTGGGATAGCCAAA 
TTTTAAGAGGGTTGTGAGCCAGAGAATATAGGCCGCCTNTAGAAGCT GCAGAAGGCCGGG 
GTGGACAGAGTCTCCCTGCGAACCTCCAGAAGCAGCACAACCCTGCC 
CTCrCGATCTCCGGGCTGTAGAAATAATACATCTGTGCTATTTTAAG 

Sequence 1269

CCCTTAGCGTGGTCGCGGCCGAGGT ACATTTAAAAGGT GATGCT AATACTTTAAAATGTT 

taaganataagatttaaaaagcatttgtaaattgtatacttgcananctccgtnctacat 
tggcattttggaacaaggnacattaattggtt 

Sequence 1270

CCCTT AGCGTGGTCGCGGCCGAGGT ACT GCAAGCAACAG TTACTGCGACGTGAGCAGCAA 
CAGAAGT AThTCTCTCCTGAAATTATT ANGCAGTACTTGNATCAACCACT CCGCCGTT ACC 
<>TACCAAAGCCGTCGCCTTGGNCACCG 
Sequence 1271 

CCCTT AGCGTGGTCGCGGCCGAGGT ACA A 1 1 1 1 1 AGTCAAGGGATTGTTTGAT A C 1 C 1 1 1 
TCTTGTCCOAjCSGAGGAGAGTTCCTTGT^ATGAO 

CAGGAT ATT AAAC TGAGTGGT G T CAAAT ATT NC CAGGATCAATTCGACAATGCCATGTGT 



ACCTGCCCGGGCGGTCCNTCNAAAAGGGCNGAATT^^ 
TANTTGGATTCCCGAGTCTTCTGMTT(X>AAA^CTTTTGGCGGTTA 
Sequence 1272 

TACACTATACACTTCAACCTTGAAATGTGGACCCAAAAAACATTCTATTTTTCAGTAATC 

NATTGAATTTNGGTGAGGGGTCCNACACCCTCAAATCCTAANTTTATCACANA 

C WTNCTTGGCT GCCAAGCGCTGGCNGATGAAC rTTGT M TI GC I GNANCTCTTNATGAKTT 

GGATNCCANAGTHTCNTGATGATCCTfnTC^^ 

GTAGNGGANTGACTTTC 

Sequence 1273 

CCCTTT CGAGCGGCCGCCCGGGCAGGTACT nilMlllllllHIIHIGMIIIHH 
1 1 !T1 I1111II1MTTIMTT f TT TTIT1 FII FITATAAAAAACNTrrNAAATTAAAAA 
ACTCAAAAAAAAAAJ4AAAAATGAGCATTTTAAAAAAWGGAAAN AWTTN NA AW NWMNNNNG 
GRAAAAAAAAAAAAAAANNGNAAAANNAAAANTNNNGNATTGNTTTTTT^^ 
AANATCKTCCCCCTGAAAAAAAAAAAG I I I II I I 1 1 1 I 
Sequence 1274 

CCCTTAGCGTGGTCGCGGCCGAGGTACTACAAACAACAGAAATTTA7TGTCT 
TGGAGGCTAGAAGTCCAGAATAAGGTATTAGTAGG J I TGGl ICTTl ICTGAGGGCTGTGA 
AGCAGAATCTGTTCCATCCCTCTCTT C TT GTCTTCATCTGTTCTATG rCTGTCTTTQTTC 
AAATTTCCCCTRATATAAGGATAGCAATCATATTGGATTAGG 

GATCTTAACATTTGCAAAGGCCCTATTTCTCACT AAGGTCGTATTTACAGGTATAAAGGG 
TGTAGACTTTAACATCTTTTTGGGGGAAGACACAGTTCAATCCGTAACAAGATGTTAAGT 
CCTTTCCTCTCCTAAA 
Sequence 1275 

ATAGGGGCCGGAAATTGGGGGCCCCTCTAAGAATGCCATGGCTTCCGAGGCCGGG 

CCAAGTGGTGGAATGGGGATATTCCTTGCCAAGAAATTTC 

Sequence 1276

GCTATAAGTTCAAATCTGTTGT ATTACCCTAAATTAAGATAAACCAACCTGAATTATAGT 
AGATTTCTCAATAGATGAGGAACT GAAAAATACTATGTAAAATATCTTCCAAAATGCTTT 
TTATA CI lll ll I ATTTGTAATTTGGTCT ATCTAAAATGTTCGTTAGCTTAACTTAATGG 
GCGTTATTGGATTCATATGACT AACGTTTCCTCAGTATTGTAATGCTTGAAATATTTGAA 
AGAAAAAATGTTGTTTTTTAGTTGAAACTGGTATATATAATTTCAGTGCTTGGCAGGTTA 
GT AT ATTTTT ATGCATTTTT 
Sequence 1277 

GT ACCAAC ACAATTG TT AATTTC C TCACAGGCT NAAGGCATTCTGGGAAGCT ATACAGGG 
GACAGGAAGCAI I 1 1 1 rGGGAGCCT AAGGGGAGCC AGTTTGGAAGAGACAGCATTCTCCT 
GGCT AGGACAGG TGGN GGN GGTGGCCGGG TTTNAGGNTCT NCAAGGGACCCTNTGC AGAT 
TATTCTGAGCAC 

Sequence 1278

CCCTT AGCGTGGTCGCGGCCGAGGT ACT AAAACTAAAACTGAGCAGTTTAAAACATTCAT 
TTAAAGGGATATCT AATGTGTTTATTATTAACATAAATAATGTTTTATGAAAAATGTAAC 
CTTAGTTTTCC>AAACAAAAATGTTT AGGGCAAGAG TAACATT ATTTTACATT ATTGCAT 
CTCAGTGAAAAATAAATGGCAACAAAATTCTTATATCTGCTTCTGCAGTTAATCTGTTCA 
TTTTGTTTTGGTTGAAGTATATGAAGGAAATCTGTCCTCACACAGTTGTGTAGTGGAAAA 
AGGGGGACT ATTGTAACAGGGCTGTGCACaTAATTGTGGATGATTTTCTTTGATACAACA 
ACAAAACTTGGTGGAT 
Sequence 1279 

ccatggaatgaaatataagtcaacaagtatgacagtttcgctttgtttattatggaagaa 
tcattaataatttgataattaaatggtcctgaatggttagccatgttctcx:gcatttaaa 

TAAATAGTATAAACATAAATGAAAATATTAAAGTAATTTCAACGTGATAGAGACCGCTTA 
TrTTTAGTTCAGGTAGAGTTCCAACCT AATGGTAATTAAGATTCCAGATCCGAAAGATGT 
CATGTGAATATTGCTCTGAAAAACCAAAATTAAAGCTTTCTTAAAG 



CCCTTAGCGTGGTCGCGGCCGAGGT AC II I I H I III II I I II U I M I I I INGAAGGCA 
ATTTAATAAGATTTGAGCATAGATATTAAAjCTTAGCATGGACAGAGAAACTTATTTNTTG

GGGGACTGGCATAAGTGAAAGAACAGAATCAGTKTGACC^ 

Sequence 1281

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTCTGACTTTCTAACAAATTACCATAAAGGA 
AGAAT A mTTCGTC I ACTATTGTTAGAACACCTT AGAACCATCAAAAATATAATTACAT 
GGCTAATAGAAAAAAAAGAGCAGTTTTAAAATATGTTTTATGTAACCTATTTTCATTG^ 
TTTC AI menS l 1 G CCGAATAGTAG 1 TGTTCTAAGTAAATACAGGTCTCAATTTCACT 
ATGAATAAAAAAAAAAAAAANGAAAAAAAAAAAAACTACCT7T3GCCGCCGACCACCCTAA 



CCCTTAGCGTGGTCGCGGCXX5AGGTACTCTTTCTTAT T 1 1 C T T AATCAATACAGCTAAAG 
G l I rGTCAATATTOTTOATCI I r I I A AAGAACTAAAATTTTGI ll lbl IOATTTCCTTTA 
1 1 1 1 III 1 1 I UGI 1 1 1 A TTTATCACCACTCTTATTTTTAGT A 1 1 IC CI I HI fCTOGTA 
GC1 1 R3GGTTTA61 1 PS1 ICI I AAGTTCCTTAGGTGTAAAGTT ACGCTGTTGAAATGAGA 
TCTTCTTATTTAATGTATGGATTTATAGCTCTAA A I 1 1 FCTC1 I A GCACTGGTTTCACTG 
CATGCTCTAAGTTTTGATA 
Sequence 1283 

COCTTAGCGTGGTOGCGGCOGAGGTA C I 11 1 MM II 1 1 I I I 1 1 1 1 1 CTTTTTA 

NNANNCTNGTTroGOCNNCCAAANKNCTNGGATTAHN>K»NNTGAACCANC^ 
TTTAAAANCNNAATNTTTTTNNGGIiMMIfrNM 
GGGAAAAACNTGGAKNTTTGG N IMIIIIII 
Sequence 1284 

GTACTTGGTCTCTACCTCAGATAAI rCTTCI I IGAGCTmGAGTAGCriCTCC 1 1 1 1 IC 
ACTTAGTTCTACATGTATTCTATGCAGTGAGGTTTC^^ 

CTGTTGACAATCTAGOIUI 1 1 1 IGA TGAAGGGTTGCCTGAAT A I ICTTTl I A CTGACAGA 
1 ICI fCATTATGl I rCTCCI 
Sequence 1285

CCCTTANNTTGGTCGCGGCCCGAGGT AC TTTTT AATCTTATTATTAAACTAACCCCT G TG 
GTGGTGTGGCTACATTCTTT GAGTTT/VSAAAACGAGATAAAGAATTGCTCATATCTTCCC 
AAATTGTGT AGTATAAAAAGAATGCTGTCCTGGTTGTTTTTTGTAGAATATGGAAGTCCC 
TGCAGTAAGT AGGCAACATGCTACCCTTCTATTCAACACAGCACTAGAACAAGGCAAGTG 
GGACC1 1 IG TCGACAGATGATTCG A I I I C I 1 AAAGTCATTGGCTCTGGAGAATCTGAGAC 
ACCTNCATCCACACCCACAGCTCANGTTAAGCTGCAAAAGT^ 

ATAGACGCAGGTAGCATC1 1 1 CTCT AATGGTACCTGCCCGGGCGGCCCGCTCGAAAGG 
Sequence 1286 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACACAG 

AGGAGTATGATCCTGTCGATACCTTGCCGTAGGTTATGTAACATGATTGGAGCGCAACCA 
G<rrGTTCTCTTGCACAGATCGAGAGTGAGGGGTATTTTGTGACATTACACAGCATCAGGA 
GCCTGGTGCCTCATCAGGTGTAAGTTCTTATAACCACTCTTGGCAAATTTATTA^ 
. GGAACACAGTCAATCTGT AACTCATAAGT AGCT CTACGTTTACTTCAATTGACAATCCCT 
AACCCA TC TG T CCC T GGCAGAAAGAAGGAAAGAT GACA T GCAT GGACAG TGAACAGAAAG 
GGATGAAAGCCAGGATTCCTGGGATGAACAGACAGTGGCAATTA^ 
CACAACCTATTACT ATGTCTAAAAACGACCAGAGCAGAGAGG CAGAAGAGAAT AAGC C TG 
AAGTCACCTTCCACTNAAAAAGCAGCCAAACTCGCTCAAAGGAGTAACTTTTAAAACCTG 
GATCTAACCTGGAAMGGGCTAAAAAWTGGCTTGGTTCTGA U I H IT 1 1 I 
Sequence 1287



CTTTTATTTCCAGTTTCTTGGACCAGAACAATAAAATACATAAGACATCGTTTCTATATG 
GTCATATACTATATAGAATAAAGAATTGTTATGTAAATTATTAAATGAGTATAGAGACCT 
TTACATAAAAACTAAGGTACl H 1 1 M 1 III H H IIG I 1 1 III I IT M I II 
Sequence 1288 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTTGTGCAGACCGCCTACCTCATCCTGTGACTT 
AGAATGCCTAACCTCCTGGGAATACAGACCAGTAGGTCTCAGCCTTATTTTACCCAGCCC 
TTGCTACATTCAAGAAGGAATCACTCTGGTTCTAATGCCTCCGACAG^
TCAGACTCTAAAGCAAAGAAGACTATCTTf^GTGACAGCAAGA^ 

ACTCGAATGGCCTTGAGGAGCTATTATCAATAAAAACAGTATAACTT ATAATTATCTGTT 
GTGTTACAATGAAGT ATATCATCACTGC 
Sequence 1289 

CCC 1 1 ICUAGCGGCCGCCCGGGCAGGTACTAAGGTTGTTAGCCCICIGCIGGAAGAGAG 
GTATTAGTCCATTTTCACACTGCTGATAAAGACATACCCGAGA^ 

AGAGGT7TAATGGACTCATAGTTCCATGTGGCTGGGGAGGCCTCACAAT<^TGGTGGAAG 
GTGAAAGGCACATCTTACATGTTGGCAGGCAAGAGAGAAATC 
AAACCCCTT ATGAAATCATCAGATCTCGTTAi 



ATCT CTGTGGCATGGATCC 
TACATCCACAACTACACATTATTTATTTATTTATTTTTTGGAAATCCCAAT^ 

TGACTATGACAGACTAACACAAAGGACAAGAAATGGC7 CI CATGGGATGTAGGTGGAAGG 
^SflOO^^^^^^^^^^^^O^^^CCAOMSGTGTOCJOCCCTC I G I ICT CI I G GGG 

Sequence 1291

CCCTTTGAGGGGCCGCCCGGGCAGGTACATAAGCTC 

TACATCCACAACTACACATTNTTTATTTATTTATmWGCAAATCCCM 

ATGGGCCTCACCTCATTGACATATNC 



CCCTTAGCGTGGTCGCGGCCGAGGTACATI 
ATTCTGAGATTTt^XX^AAGCTGTGGATTCTTCCT ACT CCTT AANAAAAAAACTTT GG TTT 
TATTTAACATCTACACCTTTTNGTCAGTTGTGTTAGCGTCT^ 
TACTCrrAAAAGATGTAATTGTTGTCATTTTrGAACAGTTAAAC^ 
AGAACCCCAATGGTT^ 

Sequence 1293 

CCCTT AGCGTGGTCGCGGCCGAGGT AC T ACCTGTTT AAGGACAT ACCAGAAAAAAAGT AT 
TGATTTTTATCCTATGCT AAACAGTGCTGTGATAACTT 1 1 GTATCAC TTGGAGAATGCT C 
CTGAAATT ATGCAACACT ACT AGATAACCCCTGGATCAAAGAGGAAATCAAAAGGGAAAT 



TGACACTAATCAGAAGAAATTAGGAAAACGAATAAGTAAAAGAATCTGAAAAGGAGAAAT 
Sequence 1294

TGGTTTTGTGTGCTNACATGTC M C M GGCATGAGAATGTTTAATTTGGAANTAGTGGGN 
C*CTCAGAGCCNTCCTACAAAGGCAGTGGCAAAGCTTC^ 

ANT AAAC TTTGCCTNGGCACGCG NCNTCCTGNAAAN TGThrrTTGTTTTGGGCCTATTTCT 

TGCTGAGMTNCXXTrTAWNGGMTTGTNCXTTCGNNTTTTTCATTTC 

TCCCCATATNGAACAAKATTGGTAATTTCAACN^^ 

C 1 1 1 1 1 1 1 GNGACTATGMCCCOCCTAANTAACMACCCTTGGGATNCAAHT^NGTNAANTT 
I l UI HCCI I MUC IN 

ATTTTnTTTTTTNGGCCNANCCNTCCAANCAAATTTTTC 
Sequence 1295

C CC TTC GAGCGGCC G CCCGGGCAGGT AC N GCGGGCTC TC TCCAT GGG TC TGTG TTC CAGA 
AAGCTATGAC TCTTT AATGCATCTCTTAG 1 1 1 I 1 1 CCTTATTTCCTTTATTCTTAGTATC 
ACAGTCCATGATATCCACTGT CCTTGGGGCGCCCAATTCATTGTGCAAAAAGCATTT AAA 
TCAAAATACCCCTATTTGTTAT INI TIT 1AAAAAGTAAAGTGGGGGATG 
Sequence 1296 

CCCTTTCGAfK;«^CG<;C05C^CANG T ACAA I GCACATGCCGAAN GACCTI A_NTN TTGGA 
TGTGATGAAAT G TTTTC T ATG CCTGGAATAAA T GC C TTNC TTTGGG N TGT AAT AT CTT AA 
ATACGTATTGCTCCTCNATCTGTGAGTTATTTAA I Mill 1 CTC TGAAG NAGC TKTGATT
TCTGGGCTTTCT AGTGTGATCATCTA
Sequence 1297 

CCCTT AGCGTGGTCGCGGCCGAGGT ACATTT AAAAGGTGATGCT AAT ACTTTAAAATGTT 
TAAGATATAGCATTTAAAAAGCATTGT AAATTG TATACTGCAGTGTCNGTCTACATGGCA 



CCCTTCGGCCGCCCGGGCAGGTACGCGGGCTTCCT ACTTCCACCAAOCCCTCTTNGCAGA 

GACTGCTCCATTCCATTAAAAGGMGAAGGTTCAACTX3GANAC 

CT 



CCCTTAGCGTGOTGGCGGCCGAGGTACTAAACGT^ 

GGCC TGGTGGCTC*ACGCCTGT AATCCCTGCACTTTGGGAGGCCGAGGCAG<3 TGGATC^ 
GAGATCAGGAGATTG^AGACCATCCCGGCTAACACAGTGAAACCCTGTC T C TACTAAAAAT 
ACAGAAAAAAMAAMAAAAAAAAAGAAAAAKGGICCI 1 IGIKTACTGCAGTTGTCWTWTAC 
ATGGCATTGGACAGGACATAATTGTAAAACATAA^
TGATAGTGAATTGGCAAACGTGACCAATTTTTT 
Sequence 1300

CCCTTCGAGCGGCCGCCCGGGCAGGTAGATACAAAAAAATCA 

GAGTAGGAAATGGGAACTGGTGTT AAAACTCTT ATAACATATGTCACTGNCTT AAGGGAC 

AGTGTTTTAAAAACGCATACCTCXSGCCGGGCGCGGTKGGCT^ 

Sequence 1301 

UCC 1 1 1 LXaAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTrAAAATG 
TNT AAGATATAGATTTAAAAAGCATTNGNAAATTGTATAC TGCAGTGTCGT CT ACATGGC 

attggacaggacataa

Sequence 1302

cccttgagcggccgcccgggcaggtagggcgcgcagcag^ 

G - 

ATGCGGCAGGCAAGGCACAGAGGAGCAAAAGTGCCGCACAGAC^^ 

CAGCAGTCCGTGAGACCTGTGTGCCAGTCACTGAXSCTGGGTCTGGTAXX^AGCTGGTGG TG 
GCGCACTGGGGCTGACTGGTC^CAGGGTAGGACATAGCTTTGCCrrTTCACG I FBI GGTGC 
ATCTCAAACTGCATC 

GTCGCAACAGTGGCAGNAGTCTT GT CCAAGGGGAC 
Sequence 1303

CCCTTAGCGTGGTCGCGGCCGAGGTACTCAAAAAACAAAACAAT CCT GTTG 
GTAGAAAAATTTGAGCAACAAAAT AAAT AAAG TAGT ATAGGATTATGACCCCAAGTAT AA 
AATAACCATCTATGAGTCCATACATATATAAATAAATGATTGAATAAATATATAAACGGA 
GAAGAAAAAAAGACTATCCATAGCAGAAGAATTCCAAATAATTTT ATAGACAGCT CCCCT 
TT AAGAAAACAGACCT ACTGAGTGTGGT C TACAATTAATGCTCGCGTACCTGCCCGGGCG 
GCCGCTCGAAAGGGCCGAATTCCAGCACACTGGCG 
Sequence 1304

CCCTT AGCGTGGTCGCGGCCGAGGT ACTGTGATTAAGCCAA 

TGCTGCATTGKAGCAGTATTGAAAGTTATGTAGGTGGATTTTTAAAAAAATATT 

TAAATTTTCTTAGCAAAAGTCAAATGAGTAACAACACACAGTTTGGAAACA 

Sequence 1305

CCCTTTCGAGCGGCCGCCCGGGCAGGTACACTGAAAACTGGAC^ 

ATTAGCTCTCTGGGAGTGAGCTACATGATGTTGTGCACTGAAAATTACCCAAA I G I TCT C 
GCC I ICIC II I CCTGGATGAGCTTCAGAAGGAGTTCATTACT ACTTATAACATGATGAAG 
ACAAATACTGCTGTCAGACCATACTGTTTCATTGAAT^ 

AAGCAGCGATATAATAATCCCAGGTCTCTTTCAACAAAGATAAA 1 C 1 1 1 C I GACATGCAG 
ACGGAAATCAAGCTGAGGCCTCCTT ATCAAATTTCCATGTGCGAACTGGGGTCAGCCAAT 
GGAGTCACATCAGCA1 1 1 ICIG I I GACTGT AAAGGTGCTGGTAAGA I HC I ICTGCTCAC 
CAGCGACTGGAACGAGCAACTCTGTCAGGGATTGNAGGATTTATCCTT AATCTTTTATGT 
GGAGCTCTGAAATTT AATTCGAGGCTTTCATGCCTAT ANAAAGGCTT CTGCCAAHTGATG 
GNGAATGATTTTAATTACCTCATTGGCA rTTTTTCI I G GGAACAAGCAGCCCTGGCCTTT
Table 1



AAGGGATTAAGTCTAAAC*ATTGGGCCTTTTT^^ 
AAA 

ACAGTGCATGGAGAAAAACCTAATGCCGGC 
GGGCTGTCACCTCATCCATGCTATCAGTCTACTTrcCCTCT^ 
AGAAAGTGGCATAAAAATGACCTGGCTGGGCACAGTGGCTCATGCCCATTATCCC 
TTTGGGAGGGCGAGGTGGGCAGATCACCTGAGGTCAGGA^ 

CAT GATGAAACCCGG TCTC T ACT AAAAA T ACAAAAATT AGCCGGGCA TGATGG TGGGCGC 
CTGTAACCCCAGCTACTCAMGAAAAGTGAGGCANGANAA'n 



CGGTIWGA A I I I CXC IIII G I A TGAA C I UG I CM I MAA I U I ICCTTTA ACCCATTCTTC 
TTTTTCAAATTGGTTTCT ATTGGG I I I I I 1 1 1 1 MCI 1 1 1 1 IG GAKGTTGGGA C I 1 1 1 1 1 
AATCTACCTTGG 

Sequence 1307

CCCTT AGCGTGGTCGCGGCCGAGGTACCC rTGTTACAAATATACCATCATCATGAGGTCT 
GAATGGGI 1 I CC I C 1 ACCCCCGACACCACCTGATATGCTAAATCCAAGTTCTGGATCCTT 
TTCAACCCTCACTCGAAT CICI IG l 1 1 1 GCCAGTTCATGGCCTTGTCTAGGAGAACAATG 
GGGCTGTGT ATATCGAGACTGGTGGGCCACTTTCAGCATCAAGTAATCAATT A G TT G TT 
TCTAGAGGGATGGCTTGCCACAGATGCCTGAGGGGGGTGATGTATTTGACTA7 AATTTGC 
CTBAGGCCTGAGAGGCTGGCOCATCTGTCCATTACTCAAAGGCATCT 
AGTATCTTAAAATGACCAATAATAATGTCTTATTTCAAATAT7TGGA 1 1 ICI ICI IGGAG 
CAT TACAA AAGCACT AGAGTTTTCACATTCTAATTAAGTCAAACAATACCATGCCACTT A 
CTATTTTTCTATAATTTTAAAACTTAAAAGAAATAAGCT^ 
>TACACTCA CI 1 1 1 1 1 AAGCTT 

Sequence 1308

GGJ«>CAGGANTCCTGACTGCGAAAACCC TGAGCT ACAAAAGCAAGATTT T AC T GAAATT 
AATTATTTCACAGACAGACTGGANATCACAGOTCACTGAAAAGTC 

GCTAAGGATCT AGGAT AAATTGT AAT AACAGCAAAGGGAAATTTTTTT AAAGAAGAGCAA 

AACTCAAAGTCAAAACATCACATACTCTTATGCCTTTGGAAAAGAAATAATAAAAATAGA 

AATTTGCCNCCATCAAAATTATAATACTATTTCTGAATTCAXX5GAAAAGACAGGNGNAAT 

TAAAGGGAATTAATTAAATATATCAAATNTCTACCCTATTATKAACATACCAAGAAAATG 

AAACAAAAAATTAATTAAAAACAAATTNTTTGGGCTCCAGCCGAAAAAGAAAATNCCTCC 

AGX^GGCACACACACCACKNCACCCCACACCACG^ 

Sequence 1309

1 I MM I II I III I I lll ll H I I MI i n il lllM lAAANAACCNKAAKCCMTT 



TTTT 

TTTTKACCNAACGGGTT ^^>e^^T>iA^^rAAI^CN>CCCCNTTT^^AAA^^^4ACK^t^iNTT PS1AAAA 

KNNTTfnTANAAAAANNATTNnu\CCCCNNTNTNAAAAAAAAAAA 

Sequence 1310 

CCCTTTCC^ 

ACTGCAGGGA AGGGCC ACAAAACTCTTCACCC CAAG GTCTAGAAT 

GAAATT ACGAATTTCTGTGTGTTGTCAGCTOT AAT AGAATCCCTC 

TTAAn nGTI rATTTATTTATACTTATGTGCCATCTTCTCATGAAAAAGAGGCAGTATG 

TTAAAAGTTTGAGTTCAGATTTTCTGATGTAXjATAAATAAGCTAAAGAAGGCAGGGTGAA 

GTGTGATATATGAGAATTTCCAGAGCAGGGTATrCGTAACTTGTAAGTATTTAGTCCAAG 

TTOCCTCTCCCAACACATTTTACACTAGAATAAGATTGAAAGGCCAGATGTGGTGGCTCA 

CGCCTGAAATCCTTTTGGGAGG 

Sequence 1312 

«KX>GTGTGATGGGATATCTGCAGAATTCGCCCTTTCGAGCGGCCGCC^^ 



AG TAAGCCAAGATTGTGCCACTGCAC TCCAGCCTGG TGACAGAGCGAGAC TCTGTCTAAA

AAAAATAAATAAATAATAGAGGTGAATGT ctgcattaggatcaagacaagaagaagacag 
acaatcactttggaattctgagactaoctccaagaatcatccacggaaggatgtcaocca 
ttt aaccagggct acggatcaaaaaggaaaaaatacagtcagtggagaagtagaagagtc 
tcctgaaaaatatccgtatttoaaaaggcagcaggagttgatagaaaacataactaaaaa 

AGTAGAAGACAC TGTT, AAATTTGAATCTGGATCCTATATAGCTTCTTCTCTGGGATCTAC 
TGAGGAGTGAAATCTAAATXSAAGATTTAGCTTAGAAAGCATOAA6ATAGTATGTTCCAAT 



CGCTAAGGG 
Sequence 1313

CCCTTAGCGTGGTCGCGGCCGAGGTAC 1 1 1 II 1 1 1 1 1 M II I I I II M IG G WTNMT TTTT 
r nTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTnTn I NAAAAAAAAAfKaGCAATTTTA 
ANAAAAAAATNNAAATTTGACN GGN NAATACCAAANGGAAAGTGNNTGANCCNCN NAAAA 
AAAANAAGGTTTTACW III I T X> < AAATTTAWKTKT TTTAWSA AAAAWAAAGTTTTAAAAN 
TTNNGAfmTTAAAACCNCTTTTNAACTG^iAAAAAT^^ 

TTAATATAANCNAAAATTTTNNTTTTTTAAANTA/^AATTANCNACCCNAAT^ 
Sequence 1314 

CGCCCQGNCAOOTACCTNCTTAGAAACCT AOACTCCANA GAACACT Q TTTGA CAACCACT 
GCAGTAGAACATAATATATCAAGATTNTAGGAGTGGG fTTCTTTTTI CATTTTTACATGT 
TWTAGAATAACATGCATAATCAAAGCTAATAATACTGTGTTTTCTTTACT CTTTTATTTG 
CCTCTAAAGACATCCACNCATAGNGG TGAACT GATTTTTAATGCGTTTTAAATAAAAGGC 
ATTGAAAAATATTAATAATTGMAGTTACTAAAAGT A1 HC I C II I GCGATTCTCTNATCT 
GTGTTTCCAGACCGGTTGGGAGGGGTGACAGATCAGAAGGCTCTGGTCAAGAGAATGAAA 
ATGAGGATGAGGAATAAT AAACTCTTTTTGGCANGCACTT AAATGTTCTGAAATTTGTAT 
AAGACATTTATTATATTTTTTTCTTT ACAGAGCTTTANTGCAATTTTAAGGTTATGGTTT 
TTTGGGA GI 1 1 1 1 IXC 1 1 > 1 1 1 1 1 1 GGGATAACCT AACATTGG G 1 1 1 1 G QA ATGATT GGG 
TNCGATGAAATTTGGGGAGATTGGTATTAAAACAANAACTAGCAAAAATGGTTTTTAAAA 
CTTTTTTGCCCGTGT ATTGAAGGAAGTGCT ANNAAAATGCNAAAAGTGCCAATATTTTTC 
CCTA 

Sequence 1315

CCCTTTGCGGCCCGCCCGGGCAGGTACATTTGGTGGAGITrGAGACCAGCCTGGGCAACA 

CAGTGAGACCCTGTCTCTAAAAGCATT AAAGCATTAATCCTCGCATTTC GATAGGGCTAT 

GTAGCTrrTAAGTAAGCAATGTTAGAATGAGTTGTAGAGTTTTATTTTTGTGAATATAGT 

GAGTGACAGATGGCAATTACATGAGGATATTTGAACGAAGGTAGCTCGGCCGCGACCACG 

CTAAGGG 

Sequence 1316 

CCC7TAGCGTGGTCGCGGCCCGAGGTACCAAAGACACTTATTATTCTAACATGCATCAAG 
TAAAGTAAAACAAGGAGAGAGGCTGCGGTGTGTGGGTAGGGGATGCAGGAGAAGCTGTGT 




Sequence 1317

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTNNCANGI I U II T 1 1 II 1 1 IT I M 1 1 1 II I 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 IACNCTGAGTCAAAAAATTKTTTTAATAGTTNCAAAAT 
I I I I n H I U H I n n I I ACAAAATCANTTTAAAANANCNGG NGATTTNMCCNTAATT 

NGAmNTTGGNTCAAAAATTNTNOTAAAAANCC 
CCCTTATGAAAANNCCCCTCNNTTAAAAAAAA 
Sequence 1318 

CCCTTAGCGGCCGCCCGGGCNGGT ACTACTTTTG I I 1 1 I I H I 1 1 1 I I I GGATCAATAAG 
TWATTTATG7TGWTCACACAATAGTTACAGAAGCATTTAAAAACACATGCNCACNTGT 
TTATTATACCATACATACAAACACACATACAACTTAATATTTACAAGCACATACAAGCAC 
AT ACAAACAT AT AAACAACAAACAACAC TAATTN AACAT ACATACAAT ACTTACAGCTT A 
CGTTT 

Sequence 1319 

CC C TT AGC G TGG TCGC G GCC GANGT AGATGAAAAC ATCAG T G T GACAG TT AAT ATT AAAT 



GTCAACTTGATTGGATTGAAGGCTGTAAAGTCTTU 1 1 IC I UGGTGTGTCAGTGAGGGCGT 

TGCTAGAGAAGACTAACATTTGAJiTCAGTGGACTGGGAGAGGAAGACCGACCCTCAATAT 

GGGTGGGCACCATCCACTCAGCTGCCAGCGAGGCTGGAACAAAACAGGAGGAAAAAGGTG 

GGATAGGTGACTTGCTGAGTCTTCCAGCTTTCATCTTTCTCCCCTGCTGGATGCCTCCTG 

CCCTTGACATCAGACGCCAGGTrCTITGGCCT^ 

TGCCGAGGGCTCTTGGGCC7TTGGCCACAGACTGAAGGCTCTACAGTGTTGGCTTCCCTA 
CTTTTGAGGGCTTTGGACTCGGACTGGGCGACT ACTAGCTTCCTTNCTCCTCANCTTGCA 
GGTGGCCTATAATGGGCCTTCACCTTGTGAACATGTGANCCAATTCTNCTT AACAAACGC 
CCCTTCATACATACATATATCCTATTAG7TCTGGCCCTCTGGAGAACCCTAATACACTCG 
ATAAAATTTCAATTAAAATTTTTAAATA 

Sequence 1320

CCCTTAGCGTGGTCGCGGCCGAGGTAC I 1 1 1 It 1 1 1 H 1 1 1 1 1 H M 1 1 1 1 1 ICi 1 1 1 n 

i i 1 1 mi 1 1 ii i i m i i him i i i iii i i ii i ii i i i i in i i i i n i i i 



CCCTTAGCGTGGTCGCGGCCGAGGTAC 1 1 111 I II M II I H 1 1 II 1 1 1 1 1 U IC HCTT 

ii n i um 1 1 1 mn i n ri ii mm u i m u i i n i H i m ii i i i ii ii i 

M II II I MM II H i l l UN I I I II I II 1 1 I I I III I I A AAAAAAAKT 



Sequence 1322

CCCTTAGCGTGGTCGCGGCCGAGGTAC^GAGCTTCTTCCTATTAAGTGCCTAAACT AT AG 
GCAAACTTTGGTGTTCCCACT AAAACACAAGAGCCTCACACAATTAGGAAAAAAAAATCA 
AAAGAAACAAGGAAACTGAGAATGGAAGTT AGTGTAAATCTCTGCATTTGGGGAGTTGT C 
ATTAACTCCAGAGGCCAGCATAGTTTCCATGGAGGCCTGAAGGGAGGGGACCTCCTGCCA 
CAAAGAGTrrCGTrCCAGACGAGTCGTAGCAGTGGGTGT AAACAGCATTGGGGAAGAAGT 
CAATGTCTGAAAAGTAATTCCTCCAGGTTTCATCATGATTCTACGGGAAGAGAAAGAGAC 



CTGGTCTCCTTAAGGGAACAGGGTTCTAi 
CGA 

Sequence 1323 

GCCTTTCGAGC GGGCGCCCGGG CAGGTAC 1 1 1 I H I I U I I I 1 1 1 1 1 I N III TT TCTTT 
TTTTTTTTTTTTTTTTTTTTTTTTTTTT^ 

TTNAAAAAAANTrrWCCAATTWGGGTTrTTTAAGGGAAAAAAAAAAA AAAAAAAW NNAA 
ATTNCCCNNNAAAhnTTTNACCCCCCCCCNTTWNAAAAAAAAAAAAN I I I H MNAAAAA 

Sequence 1324 

cccttagcgtggtcgcggccgaggtacttggtttagttatggctgttttttgcctct aac 
acttttattttaaaaagaaaattaaaataggttattgggatcaaagatatagg c iiii i g 
ttactttgaatgatttttgtaattcagaatatgcacttgttatttcagttct^ 
taattattggtag agttcatc taattacc ctat aaatccct ggagaaaggtggcccccat 
atacttt a m i ch ggttatatgt ataaaaatcagtaggcaatgtaaaaatg fttttstg 
tgaatttatgtgagttataattctaattctatgtcaatattcacctcagattaccacatg 

AAAGCTCAGTCACCAACTATGCCTCAT ACTGAAAT ACCCAC TGATTAAAT GAAGTTGAGA 

accagctcctatcgtacctgcccgggcggccgctaaggg 

Sequence 1325 

aagcaggcatggcatataancaag ci H I M 1 aaggctgagtgacttatgtggctgatag 
aggaaggataggaggaaaggaaatatagtgaaaaggaacagagaggaataataaagctgg 
caagtcacagagancataattagactatcaaaagaanatttggaagaaaggcatggacag 
gaataaagacctncttctaaagcaaggtagg GAGAGCAACTNNATGTAGATTGAANAG AA 

Sequence 1326 

ccctttcgagcggccgcccgggcaggtacgcgggatatttatttacaaaacacttcatta 

tttataaagaatttactaacagtttatcttatttataccca.tac^tctgctactttggga 

ggcccttt/^atagaaaacagcattctttttgccaaatatgaccaaattacttttattta
TAATTTTTGATTTATGTTTCAGCTAGATCTAAAAAjGCATCTGAAGGAATTTACAATGAAA

gatacctatgcaataacatttaggataatctttgacattttgg 

aaaaaagtgt atctttcaagt agatgcaaagcattataatgactgacacttgt atct aac 

tccagtcttacagataactaaggcaaaaagctaaataaacaatatgtaac^ 

TGGTAAAAGGAAGTATACTGGTCTGTTAGCAGAGACAAAC MUM I AGAATTGAAGTCT 

GAAAGAAACAAAAG 

Sequence 1327 

gccga^«3tacangccgnggaagagactcaagtaggagcgcctgcccgagctganactaga 
tgtgaacctttcaccatgaaaatgttaaaagatataaaggaaggagttaaacaatatgga 
tccaactcccctt atataa^laacattattagattcgattgctcatggaaatagacttact 
ccttatgactgggaaattttggccaaatctrtccctttcatcxdtctc^gtatctacagttt 
aaaacctggtggattgatggagtacctgcccg 

Sequence 1328 

ATCTCCACCOCCCNGGCGGCCGCCCG^GGCAGGTACCGGAAATCTGCAGATC 
TTCCTATAATGATGCCCTCCT CACGTTTGTCTGGAAACTGGTTGTGAACTTCCGAAGAGG 
CTTCCGGAAGGAAAACATAAATNCC&SACGAG 

TCCTATTACTCGGCACCCCCTGCAAGCTCTC TTGATCTGGGCCATTCTTCAGAATAAGAA 
GGAACTCTCCAAAGTC^TTTTGGGAGCAGACCAGGGGCTGCAC I I II G GCAAGCCCCTGG 
GAAGCXiAGCAAGCTTCTGAAAGACTCTGGCCAAAAGTTGA 

CTGGGGGGAGGTCCCGAGGAAGCCTGGCCT AATGAGTACCCTCGGGCCGGCTCTAAGAAA 
CTAMGTGGGAATCCCCCCGGGGCTGGCAGGAAATTTCGATNATTCA 

TTTAGTTGAAGGGGGMTAAATTGGCGCCGNCTTTGGG 
Sequence 1329 

CCCTTTCGAGCGGCCGCCCGGGGAGGTACAGAAGGTTTGGGATTCAG 
GATGTAACAATAGGTGGCTCAAK^TCCAATCTATGTGAAAAACATTCTCCCCCGGGGGGCG 
GCCATTCAG<^TGGCCGACTTAAGGCAGGAGACAGACTTATAGAGGTAAATGGAGTANAT 
TTAGTGGGCAAATCCCAAGAGGAAU I IU1 1 1CGCTGTTGAGAANCACCAAGATGGAAGGA 
ACTGTGAGCCTrCTGGTCTTTCGCCAGGAAGACGCCTTCCAC 

GAAGATGAGGATATTGTT CTTACACCT GATGGCACCAGGGAATTTCTGACATTTGAAGTC 

c cac ttaatgattcaggatctgcaggcc ttggtgtcaagtgtcaaaggt aacccggtcaa 
aaagaagaaccacgcagatttggggaatctttgtcaagtccattattaatggagggggca 

GCATTCTAAAGATGGAAGGCTTCG 
Sequence 1330 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCGTGTTTTGATAGTTGACT AACACTGACCTG 

TAATGGTC^ACACCCTCTCC>CTTACTTACACTATCTTAGGTAAATAAGACTTTTATTC 

CTAAGTGTGAATTTTCACAGGAGGAGAAATCTGGCAGATAGATCCTCACCATCATCT 

CAGTCGAACTGGACTTCCTTTTCTGAATTGACCAGTCAAAGAGAAAGGAAAAGAAAAAAA 

AT ATGACCCGGTTGAATTT AGAGTATCAAAGCATGGAGT AT AGAATAATTTTGTTTTT AA 

AAGAGGAGCTATTAAGTTGAATGGAAGGAAAAAGTTCTGGAAAATGCGTTCCATGTAAGG 

ATAGTAATCCCG 

Sequence 1331 

TATCTGCAGAATTGGCCCTTAGCGTGGNCGCGGCCCGAGGTACTGTTTGCATT AAT AAAT 

AATTCACTGGTTAAAAGTAAGAGACAGCTGAACCCCCCGCGTACCACTGTAATCATTATT 
CCCAATGTTATX^TTACATTGACAGATAACTCX^AGTTTTGCTAAGCTGAACT GATGTTAT 
GGCCATAATATGTTGTTGATTCATGGCAAANGGTGATGTGTGAGTTATGATCCTG III m 
CTCAAAATGGTGGTGGAGGCCGGGAGCTTATATGTTTATTTATGTATGAATGANGATAGC 
AAGAGATGGCATATAATCACCAGACTGATCATATTGGATTCTTTG 
Sequence 1332 

CCCTTTCGAGC GGCCGCCCGGGCAGGTACTGGAT 1 1 1 1 GCAAGCCCTCT ATTTAAAATTC 

c^cagaaattaaataaggaggcttt ggagggaggaatgccctanacaaattgtggagtgg 

GTTTGTTTTGTTTATGGAGATGGTCrrTTAAAGTCTAAATTGTCCC^^ 
C^TTGAAGAGGGGCTCAACTCAGCTGGGAGGGAGGGGATGGTTGTCAAGC 
TTAGTTGAAACCAAGTCCATTCTGGGGCCAAGAAGCTTCCATTTTrAGCAAAGAGAGAAA 
G GGGAAAAAT ATACAMAC TCGTACCTCGGNCGNNACCACGCTAAGGGGCGAATNCCAGCA 



CA 

Sequence 1333 

C CC TTTCGAGCGGC CGCCC GGGCAGGT ACTTAATTCATTC T AC TTTGTGTT AACTATCTT 
TrrATCTGTAGGTCTCATCACCCCAACCAGACTATAAATrCCTTTGTCATTATTTAAATC 
CATGGATGGAACTCCCATAGACATCAACCAATCACCAATAGACAAGCCTTAGAACATGTA 
TTACAGGAAAAATAGAGTAACACATACAACTAATACAGAG GAAGAACANTTGACATTAAA 

NGAAGGATAATNCG TNAACTTAGAATATTCATTTGCXTGCTTCAACATTCA^ATAATTAAA 

Sequence 1334 

CCCTTAGCGTGGTCGCGGCCGAGGT ACAAAGTTCAACAAAGTTTGTTCTTGATTAAAAAA 
AAAAAGAATGAATATCTAATGTATAAACAACTCCAACTTAGATTTCCAAAATCTTGCATT 
CATTCACATTTGTG C H IT II C I AGACAGCTGTCATTTACATTCCT AGGGTTGTATTTCA 
CTATGTAAAATGGX3AATTTAATCTTTATAAATGAGGGATTTATGTAAAAAAAAAAAAAAA 
AAGTACCTGCCCGGGCGGCCGC TCGAAAGGGCGAATTC CAGCACAC TGGCG 
Sequence 1335

CACTTAAATAAGAATCTACCATGTAAAAAACACAGT ATGGGACACTACAAGGTAGTATTT 
AT ATATTTTTTAAATGACTGAGCTACAGT AGCTCGGCCGCGACCACGCTAAGGG 
Sequence 1336 

CCCTTAGCGGCCGCCCGGGCAGGTACATCTATCTGACCCG^ 

TGCCCCCGTAGGATATTGGCTGGGGACACCTGACAACAGAAAGT^ 

GGATTGGGAGTTACCCCAACACCAGCAGGATGCA GGAAA AAGTA 

GCCAAAGATTACCGTATTGAACTCTACiCAAGGCATCTGGTGACT AGAAAACTCCTGGAAG 
GTGGTCATAGCAGAAATTG7TGGGAAAGTTCTCAGGATAATAAAAGAGAAATITTTATTT 
CCTTCATTGATCCACTCCTACAGGGAAAAATAAATGGCANATGAACCCATGTATGTCANA 
CTCTG^IAATAAACATCAGTGAGATCACAGTGTCAG^IGAAATTTCAGCCTGAATTAAA 
Sequence 1337 

ATAAATAAAAAGTGGTATGCCAGTAAAGTTTCAATTTACATTTCTCTTCTGAATGAAACT 
GAGCATTTTCCATTTTCCTCCTANATTCTT AGGAAGCCTTTGTATCTGCGATATAAGTT A 
CTTTCTCC rrCTTI GTC^TGTTGTTTAACTTTGCACTTTC Mill AAAACCTGCAGTAAA 
TTTTAAATCTTTTCATTCAGTGCTrCTGGTrTTCAAATCACA 

AGTCANAGGGTGTGACCACAGACTGTTCTGGTGCTTCTATGGCTTCATCT TTTCA CATTT 
GAATCTCTGACGTAG1TGGAATTTATTCTGGNCTATAAGGANCGGACTTTATTTT 
CAAAAI IIIII I NAACAAATGGTAAACTTAACTCCTAAAGGCAGATTNT 
Sequence 1338 

CCGTTAGCGTGGTCG03GCCGAGGTACTTTTGGTAAAAGATTTTAAGAAGGC^ 

ATGAATTTCTCACCTAAGTTTAGAGGGTTAAAGGATTGTGTTAAGTGAGGAAGGAAAAAA 

TCTAAAGGTTTAAACAAGTTGTGAAAGGTTTATAAAAAATTAATGTGTGCAAACATATCN 

G GCT AAAG TT AAAGAGGT ATT ATTC T GTTTTTCCAT AAATTGAACATTGGAAT AAAAG T G 

CAACAGAGTTnCCTAAATCATTGmCTGCTCTTTAACAAAA 

ATAAAAGGNTT ATAAMAATCTTACC 

Sequence 1339 

cccttrcgagcggccgcccgggcaggtactaaaaatttccacta 

aaaataaagaaatacataaaactcaaacagtaagtcaatgtgattatttgtttcatttca 

gaagatctatgggtcccactgcccgccacacgtagtctcctgggttctcaacgaagtgtg 

ACCAGCTC TTC TGAAGAGGT AGGGTGAATGGCGAC TGTG TT GTCA 
Sequence 1340 

(XXrrrAGCGTGGTCGCGGCCGAGGTACTTTTAACTATTTGTTTCTTCTACGATAATTGGT 
TTGTTGTGACTTTATCTACCTAGAGTAAATTTTGGCAATTTGCATTT I T CTCAAAATAGT 
TTTTGAATTTATTGTGTAAAATTGCTCAAAATAGTCAATTTAAACAAA7TTCCTGTTTTA 
CT ATTTCCCCC TT GTCATTT AAATTTTTG T ATTTGTGC TTC C TCC CGCG T ACCTGCCCGG 
GCGGCCGCTCGAAAGGG 
Sequence 1341
CCCTTTCOAGOGOCCGCCCGGGCAGGTA C 1 1 1 GA CT A H I M I AGGAACAAATT A C I M I

GACACACAGCACAATTGATTTAACACTTCCAATTTTGGAACTATTGGATAAATAATGATG 

GGATTTAAATAAAGCAATCCGATTCTACTATTACAGCATAGGGTCTCTTGTAGTCCTCTT 

AGTAAAAACTATTGTGACACTTCCTTCTTTCTCCA^TATTCGGCCTGGAAAGACCTAAA 

TACAATGCAGGGATTGAATCAAATTCACACA 1 1 1 T I H I C CTACGGAA A CAACAACCTTT 

CTTGCTTATATTTAACAAAAACTAGT ATAGATT 

Sequence 1342

GGTCCGTGGTGCGGGATCGAGATTGCGGGCTATGGCCGGCGAAGG 1 1 1 1 TC CTCAGTACT 

GGGATATCCCGGATGGCAGCGATTGCCACCGCAAAGOCTACAGGACCACCAGTATTGCCA 

GCGTCGCTGGCCTGACCGflCGCTGCCT ACAGAGTCACACTCAATCCTGCGGGCACCTTCC 

TTGAAGGAGTGGCT AAGGTT GGACAATACACGTTCACTGCAGCTGCTGTCKGMGCCCGTG 

TTTGGCCTCACCACCTGCATCAGCGCCCATGTCCCGCGAGAAGGCCGAOGCCCCCTGAAC 

TACTTCCTNGGTGGCTGCTCCNGANGCCTGACTCTTGGAAGAOGCACGCACAAACTACCN 

OGATTGQCGOOCGACGWCTGCGTTOTA C rn G GCATATCGGGHCTTCCTGGTCAAGAATG 

GMCNCGGMTTGGAGGGGCTGGMfiAGGG TGTTTGKAAAAACCGAATGTTTT4AGCCCTTGTG 

OCTTGCCGGGGACCTTTCAGCCCTGCAATAATGCGTCCCAGAAATAAAATWnGTGGTCT 

TGGTGTNNGAAAAAAAAAAAAAAAAA 



CGGCCOGCGTCCGAATGCAGTG^MGTGACACTGCCTGACCTTCAAGACTAGATCATCAA 
AGGTGCTACAGCTTCTGC I \ 1 G GCXrACCCTCTCTGTCGTGGGACACTCACCCTTGCACC 
C>ATGTCCACACTGTGAGAACTTCTAT GCTACCTGGAGAGGCC TTCT ATAGATATTT CAG 
TCAACAGGCCTAGTTAAAGTTrcAGCCAGCGTCAACCACCCAACATGTGGGTGAGTGAAC 
CCTCAAATGATTGCAGCTCCCAGCCTTTGAGTCTTCAGTTGGGGTCCCAGTCATTGAAAC 
AGAGTCAAGCTGCCCCCGCTGTGATTTATCTGAATTTCTGACCCACTGGGAGCATAATAA 
ATGAI IGI 1 1 IATGTTNAA 



GGGAGTCCACCCACCCGTCCGTCCAGA A I 1 1 C I AGAGTGGGTGGGCATGATTCCAGTCAA 
TGGGGGACCGCCCGTGTCTAAGCATGTGCAAAGGAGAGGAGGGAGATGAGGTCATTGTTT 
GTCATTGAGTCTTCTCTCANAATCAGCGAGCCCAGCTGTA 
CATGGCAGGGTCCn G GGGTAi 
CAGC TCTMACCCACTCTCTCCTAATCOCCTACTT AAGTAGGGGTTGCCOCACTTCAGAGG 
TTTTGGGGTTCAGGGTGCCTGNTGTTTCCCTTTNCTGTT<CCCAGGTCATTCCAAACCCTT 
CTGTTATTTATTAMGGCTGGNGGGAAGGGI H I? r CTT CTTTTTTC 1 1 IGGAACCCTGCC 
CCTIGrTTCTTTACACTTGCCCCCATTGCTTAAANCTC 

GGGCAATGGGNTTGGAAGCAAAAAGGGGCTTCCNTT AACCCCGGGCAAGGCAAAANGCAA
AAAHAACCNGGGTTTTNT ANTTTTTT AAAAAAAAACC TG TTTTNTT ANCAMAAAAAAAAA 
Sequence 1345 

TAGCANTTCAGCCCTGACCTGGGTCCGCAGCCTCCAGGGCAGGGGCTGGAGTGGGTNTCT 
CAAATTAGTGCTAATGGTGGTCANAACTGACTACNCAGACTCCGGCCCATC



CCCTTAGCGTGGTCGCGGCC GAGG TACTAOATTGGGTGTGTGTATTAAGAGAAAGACAGG 

AGTCAAAGATAGTTCCAAAACTTTTGAACAGAACACTGGATGAATACTGTT^ 

GGGGAACACTTAGAGAAAAATGCATTTGGAAAGCAGAAATAGGATCAAGACTTCCATTTT 

TGATACATTAAGCTTGGTATGTTTAATTCATAGCTATATAGAGGTATTAAATTGGCAGGA 

CAAAATCATAGCTAGAGATAAAAATTTAGAGTTCACCAGTGTAAAGATGATATTTGATGG 

CACAGGATGGACT T TCT TCTGGGA7TTGAGTATACATAGAGGAAAGATGTGAGGATTGAG 

CACCAGGGGACTTCAACATTGACAGGCTCAACAGAGGAGAATTCCCAAGAGGATGAGGTT 

AAAGGGCG 
Sequence 1347 

CCC7TAGCGTGGTCGCGGCCGAGGTACTTTTAACTATTTGTTTCTTCTACGATAATTGGT 
TTGT TGTGACTTTATCTACCTAGAGTAAATTTTGGCAATTTGCATTTtTCTCAAAATAGT 
TTTTGAATTTATTGTGTAAAATTGCTCAAAATAGTCAATTTAAACAAA I I rCCTGTTTTA 
CTATTTCCCCCTTGTCATTTAAATTTTTGTATTTGTGCTTCCTCCCGCGTACCTGCCCGG 



GCGGCGGCTCGAAAGGG 
Sequence 1348 

CCCTTAGGGTGGTCGCGGCCGAGGTACAAATTACTCTGTAATATTGCTITCTATTAAAAG 
GGTGTG GTn I M I M M G MUM 1 1 1 1 1 1 1 1 1 1 MA GCTAGTCCAGTGGT C I I IIIG AT 
GTTGGTTCAGCTTAGTGGTTCTCAAOCCTGGAACAACCCGTANACCCACC7GGGGAGCTC 
TTAAAATTATCAAGTGCCTACCCCACCTICCAAGATTCTGATTTAAATCC^ 
TTAAGGCACCCCAGGTGATTGTAATGTACCTGCCCGGGCGGCCGCTAAAGGG 
Sequence 1349

C M 1 1 M H 1 H > 1 1 M 1 1 1 1 1 IG G Gl II 1 1 1 1 1 1 



m 1 1 1 1 1 1 1 1 1 ti 1 1 1 1 ( 1 1 1 1 1 1 1 n i niiTTiTinni i ii I nrnTTTTTTm . 

KAAAAAAA*K3GHTAAAANNAAMTTTTTfTITTNCCCCCf4AANGGGA 
NAAAjgm TANNTTTTOGNAAAAAAAA^ 

MI III IHA AAAAAAAWWNHTAAAWAN NIH I 1 1 1 NGGGGGCGTTAAAAW H 1 1 1 I I M 

WIGGOMCAAAAAAAAMNNNCCCCMTITTNleSCOT 

MTTTTAMm»P»CIOITTTNAAAAAAAAWTWWTM^^ 

WtHWHWWGGAAAWhnTTTAAAAAAGGGAAAAAAAAAAAM OG ' 1 1 1 II I 1 1 I M NMMGWGGC 
CAACCCNNGGTGGNGGAAAAGNNACfJCCNCCNAGTTTTWCCCCTGGGNGGAAAAAGjnTT 



Sequence 1350

CCCTTAGCGTGGTCGCGGCCGAGGTAC TTCGTCTTCTAATTTCAAAAATATAACTTAAAA 
ATGTAAATATTCTATATGAATTTAAATATAATTCTGTAA A I G I G T G T A GGTCTCACTGTA 
ACAACTATTTGTTACTATAATAAAACTATAATATTGATGTCAGGAATCAGGAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAANGTACCTGCCCGGGCGGCCAAGGG 
Sequence 1351

CCCTTnXaAGCGGOGGCCCGGGCAGGTACAAGTATTATGTATCCATAAAAATTAAAAAAT 
CTTT AAAAATGGATATGGGGGTCAGTAGGTAAAAGAAAACA C AACC A AGAGAGCTGCAGC 
GGGGAGCACAGCTTGCTTTAAACATGAGATCCAGCTCAGTGATGATGCGGGGGAAAAGGC 
CCGGCATTGCTGGAACTCCT AAT ATTT AAAAAGATGATGGAAAC TTGAAATTTTATATTT 
AATCTTCTCATTTTTAAGTGTTGGCAATGTATTGAAGACTTTGAAGCCTCTCTGCTGGTC 
AAACAAGATGTATCTGTAGGCTGGATTTAGTCCACAGCTGGCCAGTTTGAAAACTGAATC 
CTGCTAGCCTT AATT7 AAA 1 1 1 f l| t AAATTT AATTTGC TTTGATTCCTGGCTCCTGCTC 
AAAAAAATCTTCAATGGCTCCCCCTGTGTGCAAGGNAAAAGTCC 
Sequence 1352 

CCCTTAGCGTGGTCGCGGCCGAGGTAC 1 1 I I 1 1 M l 1 1 II 1 1 H II H I H 1 1 1 1 I I ACA 
GHT ATACTCGNGGAAAGTT ATTCAAAITTCAAATTTATTT ACAGNG1TTGAAAAGCAC 
AAGAGAAGATCTTCATTTATGCAACAAGTCAATCATTTGCAGTATGTATGGAAAATAAAA 
ATCTAAGGTAAGTCAAACAT ACAAACTCTACl rWTTG CTT TCI CCATTANAAT AT ACACA 
TTGGAAATCTAAGTT CCAAACAGTTCCTNTNTACT GAAKATAGTGAAATTTAGTGCAAGC 
CCCCTAATTACCAA I ITTTTS G 



CGCTTTCGAG 

AAAGCGGGCTGGGGACAGTTCCAAGTTATAGGaG G I 1 1 1 CCAATTGGCAGTTCGTTGAAA 

gagttt atctt aagacctggaatcaatacaagggagtgtg t ctgggtt aaaataaagggg 
ttgtggagatcaaggttcttattaggcagatgaaggctccaggtagcaggcttcagagag 
aatagattgtaaatgtttcttatcagacttaaaaaggtcccagactcctagttaatttt 
tagtggatcaggaaaaagacctggacagggaagagg 

Sequence 1354

CCCTTAGCGTGGTCGCGGCCGAGGTAC 1 1 H 1 1 1 1 1 1 1 1 1 1 1 1 1 INGI 1 1 1 1 1 M 1 1 1 1 1 

i m i ii 1 1 1 1 1 1 hum i i 1 1 n .i i ii i iiiii i iii i imiiii i iiii i ihi i i i 



IATNAAANNANI 1 1 II I NANAAAAAATTCTTTANNAAAGGGGGGAAA 
NNAAAAAAAANTTTTTT 
Sequence 1355 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGAACCTGCCTGAGTATGACCTCTCCACCTTAT 
AGTTTATGAATGTCTTGTTTGT GAAAGTGACT ATAACCCAAACTTTTTTTTTTTAAAGAG 
GATTTGGAAGTTGTATGGATTTITTGTTATCTTCACTTTACTGCA^ 






ctcatcatttaaaatgacatgggtgtcggttttgtagatc 

TAATTTCAGTTAACAAAATGTAAAACATGACATTCCCTGCAGATATTGTTGTATACCAGT 
ATGGTTTCTTC IC1 I 1C1 1 1 AAATGTTTTTGGCCATCAAGTA 
Sequence 1356

ce cmc 

iiiniiiiiiiii iiinii iiiiiiiiii iimi i 

AAAAA A AAAAAAAt 1 1 1 1 irtNA AAAAAA A l IIIIWT 
AAAAMCCANGGGM I M I M I INAAAAWfrnTTMCCNGTTANGTTNTTNAAAANHANTTG 
GGOGGGGCGMCl Mil INTAAAAAANGGGHNNNHCCGNCCCGNAAAAAAAAAAN 
Sequence 1357



CCCTrTOGAGCGGCCGCCCGGGCAGGTACAAW 
GAATATCTCAATAAAAACATTTTTTAAATAAAAAGAATTCCCAAAG 

GAACATAATTCAAAATAATTTATGGATCAAAAAATAAATCATATAAAGATCTGAG^ACTA 
CAATGTAAAAATATAGAAAAAAGTCATAACAATATTAGAAAAAAATTTGAGCTGGATAAC 
AAAAATAGTACCTCGGCCGOGAOCACGCTAAGGG 
Sequence 1358 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTACATGGAAATAAGTGT^ 
TTArTGGTAGCATATAGATTTAGAGTCAGGAATBATGGTGATO 

GTCCACATGGGTGGCTGGOCAGGATAGTGACACC I MU C H I C I AATGGCTTAGTGTACC 
TGCCCGGGCGGCCGCTCGAAGGG
Sequence 1359 

CCCTTAGCGTGGTCGGGGCCGAGGT ACAAAGAAAAAOCT AAGGAACGGTATGT AT ATTAA 
TCC CrTTATTAAA AATGTAAAAAGCCAAAAGCAAGATAGACGCAGATATGTGOCAAAATA 
TGTAI 1 1 1 I M 1 1 U CTG GAACAAATCACAAGAAATGTAATAACAGTT ACAGTGAGAGGAG 
CCmT^CAICTCTTTCTAAACTATTTGAT^ 

Sequence 1360

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGATAGGCCTTCTTGTTATTATT^ 

AAGAGACTTGACGTTTTATGAGTGGGGTGGATTGT AGGTTGAGCAGAACTAATGGGAGAG 

GTGCTGOCTAGAGAAAGTTAAAA A mCTGTI AGCTTTGCATTGAGCTTTTT AATATCAT 

TTGTFCATTTCACCAGTTCAGAGGATTGGGGGTGATGGGCACAACAGAAATGATG 

TAGGCCAAATGTTACAAAATAGATAAAATTACCTGACCAGTGAAGTGTGTTCCTCAGTCG 

CCATGGAMCCTCAGATTTGAACTCCCAAAAAAAAAAAAAAAAAAAAGNM 

Sequence 1361

CCCTT AGCGTGGT CGCGGCCGAGGT ACT AT AGCTTCAGTG TGGTTTAGT AAACTT AGCCT 
AGGAGGCCAAGATGTCTCCCTAAAACTrAGTCTCTGTCCTATTTA C 1 1 1 G M 1 A TAAGAC 
TGTG AOCTA ACTTCCCATGGCCAATTCAATCGACTAGGTTATCTTT 

AGGCCTTTTCCCAGTCAATCCATGTCCAACCCTTCATCTCCAGCGT GATCACTCAACTCT 

TCAACATGCCTGCTTGCTGCAGGNTTAAAACCAGACCCACCATCCTGT 

ATCGCCCATTGATGCCCCGCANGGTAAAATAAAAACTA 



Sequence 1362
CCCTT AGCGTGGT* 



ATTN 



cgangtacatgaaaatggctgtttttccccacattant cagctctggattttgcatgt GT 

GGGGCIl I llllinil 1 1GATAGTTAI MUM 1 1 1 1 ATTTTAAAAATTTATTTNGCCAA 
CCCAGTAHAGAACAGCTGAGCATMTTCTCATGTATTTATTGGCCATTTGCAT^ 
CTTATTGGCCATGTATTTATNGGCCATTTGCGGTCTGCTCT 
GCCCATTTTTCTAGTGATAAAACACTGAAGCACATTTTTAAAAGA 



CCXTTTAGCGTGGTCGCGGCCGAGGT ACATTT AAAAGGTGATGCT AAT ACTTT AAAATGTC 
ATAAGATATAGATTNAAAAAGCATTGTAAATTGTATACT AGCAAAAGTCGTCTAHATGGC 
ATTGMACAGGACATAATGTAAAACAT 
Sequence 1364

CCCTTAGCGTGGTCNCGGCCGANGTaCTTAAACTTTTTCAGCCTACT ACTGCACACCTAG 
GCTATGTGGTATAGCTACCTTGTATATGTGG>ICTGTCACTGACTAAAACTTnX»rr ACACA 
GNGTATGACCCTACTATTCANCCrrGAGAAGATGGAAATGCTG#*C^ 

GATGAACCTGGAGGACATTAAATTAANTGAAATA><GCCAGG<iACAGAACGACAAGTAACA 
CATAATC 



TACTTTAAGTTCXAGGATACATGTGCAGAGTATGCAGGTTTGTTACAGGTATACATGTG^ 
CATGG IC G1 1 I O CI GCACCCATCAACCCATCACCTAGGTTTTAAGCCCCACATGCATTAG 
GTAI 1 1 G 1 1 C 1 A ATGCTCTCCCTCCCCTTAACAGCA G I I I I fCTATAGGWCAAAACAAAT 
TTGGGAACCAGAATWGWCTA C I lilC I 1 1 A TATAAATGATCATTACGATTTGGGAWGAGGG 



CCCTTTCGAGCGGCCGC«X»XAGGTACX>CAAC^ 

AAATA CAAAA ATACCACAACCACTCCCGGACTCCTCCATTATTrCAGTAATA C I GC*, I GC 
CCTAG MIT rCAGGATACATCATGCAAATAAG I TCT H 7 A TTTTTCAAATTATTTTATTC 
CTAAAGTATCTTTA A 1 I M I C I 1 1 1 I GG I I A TACAGCTT ATAGAATAAACAAGTCACAAG 
AATCTTCAI I IG I 1 1 C T AAAGTATATAATTTT ACAAAAG I I G I M I ACTCAATGTGAATT 
AAAATTTGCAAGGTCTAAAAAAATAAAAAAATTTTAAAAAGTAAAAAAAA
Sequence 1367

CCCTTTCGAGCGGGCGCCCGGGGAGGT ACAATATATTATGAAGCATGACCACTTTATTTT 
GAAACTTAGCAATTGTATTGCT GGGGTTT ATTGTATCTGTAGCATGTCACTGATTATTTC 
AGTTAGTTTTATAATGATTTTTAAAAAACATATCT ATTTGGAATAAGATACAGCAACAAT 
CATTGCTATTGACTTGTTCAACCCCTTAGTT ACACTGTATGATCAACATATAACAAGATA 
CAGTGGGAATGGCCCATACAGT ATATTACTGTrGTGTGATGATTGGCTTTGCAAGCAGTT 
TGATTTTGAAATGCTTTGATATTCTAATTGACATGGAACAA 
Sequence 1368

CCC1 1 A GCGGCOGCCCGGCCAGGTACATATOATGGGCGCAATGCACAATA C 1 1 1 I A TCAC 
AATCAACTTTTTCTTTGTATCCCT ATTTCAATGAGC^GTCAGTCTCAAGAGGTT actgca 

tttcmgttctaactagacatttgtacttgtgatcacactacgg 

ACCTGGGGCCATTCTAGGC IC I I rCAAGTGACTTTTCGAAATCAACCI 1 1 1 1 1 ATTTGGG 
GGGGAGGATGGGAAAAAGAGCTGAGAGTTTATGCTGAAATGGATTT AT AGAAT ATTTGGA 
AATCTATTTTAGNGTTNGTICCItMTTTT AACGGTCATTCCT 



ATCATCTTAATTC A IC I 1 1 C GAGATCTATTTCTACTACATGC TGC I C I C 1 1 I CIC IATCT 
TACATCTCCCAGAATGTTTTATTTCAACAAATTGCTAATCTGTGCCAGGCATTGTTATTA 
GCAAAATGATAAGCCCTGCATGTAGCAAAGTTCCTGCCTT CACTTGGATATGCATT AACA
AGCTCTGATTAGTCCCACTTAAAAACCA 1 1 IG I I C CCCCGTCATGCAGAACTCCATTGCC 
AAGCCACACAACACCCCAGCCAGTAGGGTAGCAGCTNCCTGGAGCAAGGGA 
Sequence 1370 

CCCTrAGCGTGGTCGCGGCCGAGGTA CM I II I II I 
1 11 ITTTT IT 



I imillATTIIIIIim TT 
1 1 1 1 1 1 1 1 • I 1 1 1 • 1 1 1 1 



II I M I M 1 1 M 1 1 1 II M 1 1 1 NTT 
TTTT^PCNNCCGCNNNAAAAAAAAGGMCNAAAAAAAAEIGGN I llllll I G CATAATNAAA 
AANNNAAAAGGGGMTTTNAAMGGANTTG 
AA M I I II I AAAAWCCKGKAAAAAAKTTT 

AGCCCGAGTGAA^^ 

TTGAACGACACAGGTTTGGAAGGAATAGTT AACTCTCCAGT ATTATTGGAACATCTGGAC 
A CC ACC AAC AAAAAATCTTAGAAAAGGGTCATTT AAGGCCTAT AAAAAGTGCCACCTTTC 
CCAGAATTAATTCAGAGAGAAAAATCTTATCTGCCTCCTGGCAGCTACAGCGC^iAAAGT 
ACCTCGGCCGGGACCACGCTAANGGGCGAATTNCCAGCACACTGGCGGC^ 
Sequence 1372 

CCCTTAGCGTGGTCGCGGC CGAGGTA C 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I I M I G GI H I M I I I I 

IM II IMIII I MM IIII II IMTW IIIIMI III MMI IIII A AAAAAWCCGGWTIIC 

GCCGCWAMAMAAGGTIgAIOtATTWITTRAAAWWTM^ 



tTACAGCT ATTCTCAATGGATAATTCTATAAAATATT
TAAAGAAGAATCAACACCAGTTCTCCACACTCTCCTCTAGAAGAAGAGGAGGATGGAATA

ccttcccccttaatttatgaggccaatattaccctgatgccaaatccagacaaacsatatt 

GTCCCCCAAAATAAAACT AACGATCATAGATAAATACCCTCTTATAAATTTAGATGCAAA 
ATCTTAAGCAAAATATATTAGCAAAATGGAATTCAACAATGGAATA AACCT ATTATACCA 
CCAAGTGGGAATrTATTTCTAGCTATTGCAAGACTAGCTTGGACCTTTTGAAAATTGATT 

Sequence 1374 

ATATCTC^AGAATTCGGCCTTTGCGGCCCGCXX^ 

GCCACCCjCACCCGGCCAGAAATTATaAATCTAACCAGGATTCCAACCTACAATACAATGA 
AATATXA rTTCTCTCn ATAGGTTTTTGG TTTTAACCAATCTATTTTAAAAGGGGCAATT 
CAACGATTATGGTTTATATGGWGGGATTrCTGTTTGAATATGATCAAATGTTCACTGGAG 
AACAACCAATAATTTGCAAAAGGCATAWTATGCCTTACATTAAATCTGGATCCTCTTCT 

aaaact agaataagcatcagttcagtcacccaacggtgggaag 

Sequence 1375 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGtX^TATGATTGGCCGGCGAATCGTGG 
TTCTCI I I roCT O Cl I G GCTGTCTGAAGATASAT JGCCATCATGAACGACACCOTAACTA 
TOCGCACTAGAAAGTTCATGACCAACCGACTACTTTCA 
Sequence 1376

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTCTTTTTTTC 1 1 II ICM 1 1 1 1 1 1 1 1 1 > I 
rTTTTm X GAGACAGGGTfiTCACTCTGTCACGCAGGCTCGAGACAGAGCAAGATCCGGT 
CAATTAAAAGAACAAATAAATAAACAAAAATGCCCAACAAGGAAGAGAACGGGAAGTCAT 
AGGCAATCTCATTCATGAACATAGATTAAAAACACCTGAAGTATATACATACCCACACCC 
CCGACATGAATACATATGAGATGTGTAAATGTGAATAC7TACATGTATGTATATGAAAGC 
AAAGCAAATCAAACAATGTAAAATAAAAATAACACATNATGACTGACTGGCATTTGTCCC 
AAGAATGCAAAGCTACTTGAGAAAATCTATTAATTCATCAATTTAATACTTT AAAAGAG 
Sequence 1377

CCCTT AGCGTGGTCGCGGCCGAGGTAC CATATAAAAACATT CCAGTGTCAACAGCACTTT 
AAATTTTCACAGTAATATATGAAAGAACAGACTTTACACTTC TTTTGGACAGAATTATCT 
TTGCTATGTTTTAAAATACTT AAGA^TAGAAACAAATTTAAGAGAGTT7TCACCTTT AA 
AATTTATTACATAAGCTATACACACAAAATGAAATCCTAGTTATAAAAGATGCATCTAGA 

TTTAAATGTTCTCAGTGACTGGCATTGCTTTATGCATTACATAAGATAGTATGTACCTGC 



CCCTT AGCGTGGTCGCGGCCGAGGTACACAGGGGCTTGACTTTTTCAACTTCGTTTCCTT 

TGTTGGAGTCAAAAAGAACCACTTGTGGTTCTAAAAGGTGTGAAGGT GATTTA AGGGCCC 

AGGTCAGCCACTGTTTGTTTACAAAATCAGGTAACTAACTGCATACACTTTTTCTCTTTC 

CATGACATCAAGACTTTGCTAAAGACATGAAGCCACGGGTGCCAGAAGCTACTGCGATGC 

CCCGGGAGTTAGCCCCCTGGTAATAGCTGTAAACTTCCAATTTCTAGCCATACGCTCAGC 

TCATCCATGCCTCANAAGTGCATCTGGAGAGAACAGGTTTCTAAGCATAAAAGATGAAAG 

AGCAGTTGGACTTTITAAAAATrCAGCAAAGTGGTTCCCTCTCTTAGGGACAGTCAAAAC 

CAAGTCACTTAGGTAGTACCTGCCCGGGCGGCCGCTAAGGGCGAAT 

Sequence 1379 

CCCTITCGAGCGGCCGCCCGGGCAGGTACGCGGGGTGAATGGAATGCCTTGCAATATGAA 

tgttaatataatgtgtaaagggagattaaaaagtttgaatgattatcctaaaaaaaaaaa 
aaaaaaaaaangtacctcggccgcgaccacgctaaggg 

Sequence 1380

CCCTTrCGAGCGGCCCGCCCGGGCAGGTACAGTAATTrrGGAAACCTCTTTGATGTC 
CTTATAGAAGACACCTGGGTTCTTATATCTGCTTCTGAATCGATCTATTGTAATGNNGTT 
ATTTTGGCTGAAGTATGTrGAAGAAAATACTACCTTACAAAGATATGTATTTTCA 
Sequence 1381

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAAGCCATTGAATAAGCCTCTTCCI T MUM 
GC TCAAAC ATTC C ACATCCTTG TG GATTCC CCT GC ATTGTTTG TTTT AT AT AAC ATTT GA 
TATTTGTTGTANCTTGTATATGAACATAATTTTCTTTAGAGGTAGTCACTGTTCTCTCCA 
GTATGACCCAGGTTTCTTGACTCTGAGTAATGCACCTTCTATAACTATCTAAATTTCTAT 
TGAAGCTTTTTGGATTATGAGT ATG C TGAC TTTTCAC GA7TGGCTGG TGC ATGTTT AGAC 






TTAAATGTCATATCCTrCATGTCTCAAAGCCAAAATAGTAACATCTCATCTCAGAACANG 
AGCTGTGAOCACATGCCAATATATGTGTCACAAAGTCTACATATGTTACATTCCTTGGAA 
GTCTCCTTAAATGTTTCACAAAATGTGAACAAAGCTTG^^TTGNTATTGGATATTTCCGA 
GATTGGGCACATTTAAGACAGTAAACGGGGAAAGGTGGNGAAAATCTATAAGAAAGATGC 

tgtatcttgagaattggaaaaatgangaatcntgacatgg tttgaaaaat cat 
Sequence 1382

cottttcgagcggcccgnccgggcaggtaccaaaattcattcaaga 
gcctgagcaacatggcaaaatcccatctctacaaaacatcaaaaaaaaaaaaat^ 
gggcatggtggtgcacacctgtaatcccagcttgtcag^ 
cttgagcccaggganggtca>iggatgc*gtgagc^ 

ctggggtgacagaatgagaccccgttc tcaaaaaaaaagaagaagtngataatcttgaat 
agccctatatctatagaaacttaamagtgctggggagatataggtattattatxxxtcaa 
ttttacnagatggtgaaaattgagggttcanaagaagtaaaagtctattgctcaaggtca 
tggtggctaagaatattggcanawk^ 

httttttactaanaaaattggttcccttggcccaaato^ 

ccaw i ii i ii 1 1 m aaagcacctatgnnttgggflgttttatcangcattaa tkttgmat t 

ggctitrggaaaawacogmgtntcwtntnggggaaagggaaaaaaaaaw 1 1 1 1 1 h i cca 

acttggcccttcggnccaanttgggaaaaa 

Sequence 1383 

CCCTT AGCGTGGTCGCGGCCCGAGGTAC I I ItSTGI I G TTGGTATCCAAAATTAGGACTCT 
GAGATTCTTGTGTATTCAGAGAATTTTTAGTAGGAAACAAGGACAAATTTGCATATGAAA 
TGAAAATAGTTATTACATGAX^AAATATGTAjGATCTrGATTTTrrAGAAACTGAATTAGTCC 
AAAACAAGTAAGAGTGGGAAAAGCAG TAAAAAGTTC TTCTTGAATATTGCTGTTGTCATC 
CAAAGTATTCTT A 1 1 IC I M I A GGTGAAAAATTTCCATTACTCTTTTGWGATATTCTCAA 
AAGAAAGTTTAGGATTTTACAGG NGTTCTGAAATACTGAATCTTAATTCAMGTATTTCAA 
TAGAGTATTATTGATTTGCTTCCTTATCACTAGATTTTTAAAXT ATTT ATTT CTAGGCTA 
TAGATCTTCCT AAAAATATAATCCAAAGTAN hTTT AAAAAGCCCGATTNT AANCCAAAGTA 
TAAAAGATCT C 1111 M GGG>A>GCCTGGTNTNTTTAACAGTTTTTCCCAANWTTGGGTT^ 
GTTTTTGGAAAACAWGAAAATATNTGGTNCmAAAAGCC^ 
GGGTTTTCTCGCCTCARAANAAACCNMTKAAAAATTTANGTTTAAATTGGGW 

Sequence 1384

CCCTTGAGCGGCCGCCCGGGCAGGTACCTCACTCATCTCATCCTTGGCTC^ 
GTTAGTATTTAGTATTTATTTTAGTAAGATATTTGTGTCTGTATGATGGTCAGAGTTGAA 
CTGATCTGGCTTGTCATTTTTCAGTAATAAAAAAAGTTACTGAATTTM 
GATGCATATCTCA7TCATTACGATTTATCAGAAACCAAAGATTTA 

ccaaaggag caacc gaggataaaacaacactccat aaaggcc tcttgg gatgtcagaaat 
ctaaaatctaaaagaaaacagacacagagcaagacaataacatxiacaagctmaagccag 

AGAAATTrAAAATTACCAACATCCTTGTTGGAGTAAGACAGTAAATATCAGCCTrGCAGC 

AAGACAGGTCTGAGCAGCTGTGGGCAAAGAGGTAAACCAGTGGGGGTGCAAGGAGACTGT 

CTGCAGCTTGGGGCAGAAATGGTGGGAANC^ACTTGNGAAAAGCTTCATGTTTTACAAAC 

CAAAAAGGTCAGGTAGCACCAAC^^rATTG^^TGGTCAAATCAATAAAAGGTTACTTTCAA 

AAAAAAAAAAAAAAA 

Sequence 1385 

TTNTiAAAAAA^Vr^ 

GN WTTTTAAAAHNTHAAAAAAAA 1 1 M M T M I 7 NTNGGNCCCCCCCCAANCATNNTAAA 

TNAAAAAAAAANGTTmTTTTNCCNNAGGAGATTTTAAAAAAGACTNl M MM HTIN 
N C AG TTTTT ATTT AAAAAAAAAAA 
Sequence 1386

CCCTTGAGCGGCCGCCCGGGCAGGTACGAAAGCAGTCATAGACAGTATGTAAACAAATGA 
G TG CAG NTGTGTTCCAAT AAAC CTTT ATTT AC AAAAACC GGC AATGAGATGGATTTGGC C 






TATGGGCCATCATTTGCAAACTCCTGATTTANAACAACCCTGCCATGAGTTCTTCCACAG 
GCTTGAAAACAGGAAGCAAAATACAAAAAGTACC TCGGCC GNGACCACGCTAAGGG 
Sequence 1387 

CCCTT AGCGTGGTCGCGGCCGAGGTAC I I M 1 1 1 1 1 1 M l M l M M M 1 1 1 1 f IT! O CT 
I II III I II MM MM I llllll l I II I I MM I I M II I I MA TTTHAW 1 1 1 M I I II 

•TAANTNN mTTTTTTI NATNTNTNGGGNCNNNNNAAAANTTTT 



TATNTAAAAAAAWNTATKNG^K^TTAf/GTNANAAAAATAAAAAN TTTCCCNCCCCAKAAA 
AAA AANC KCCAAAAAAAA A I I M 1 1 H II I I I AAAAAAAAAGGGCWNRAAAAAAMTTTNN 
CNC T TTTATTT ^ ^ 

AAAAATNPTTTANAAAAAAATTTTTWTTAGAAAAGAANAANATATATGA 
AAAAAAANGAMMTTTrrAAAAKAimTNAAANAAAATAATACTO 
GGGGNGGGAANKAAATTK^TTTTTAAAAACATA^lAT^frTCTATAAAAAAAACCCC 
Sequence 1388 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTN 1 1 1 II 1 1 M 1 1 M M 1 1 IGGTAGTAAAAA 
TATCCCAATCTCTTAAATGTATAGGTGAAAAATACTAGTTTCGAAATGATTCCT^ 
GCAACAATAAAAATACTCTTMTTCACTTGAAAGAAAAAACCCAAA 
AAAGTCATGAAGAGAATTTAAATTAAGGTTTTGGTTCCAC I I rGTCTCAACTTTAACTTT 

TGTTCTTCAAAAATTTTACAAAAACAGTnXSACTCAA 

TTTTTCTATGTTGAAACAGTATTTGAAAATTCTAACTTATACTGGCAGATAAAATGATAA 
AAAAG^CATTNTACTCTTKWNAGGATTATCAAATGCTGGTGATTCCCGCGTACCTGCCCG 
GGCGGG 
Sequence 1389 

TTCTOCCAQG^G^AG^^ 

TTCAAGCTATTCTC^TGTCTCAG<XTTCCAAGTAGCTGGGACTACAGGTGCCT 
TGCTCAGCTAAI ITTTG IATTTTTAGTAGAGATGGGGTTTCACCATGTTGGTCAGGTTGG 
CCTCGAACTCCTGATC TCAfKSTGATCCACCTGCCTWjGCTTNTCAAAAGTGCTGGGAATT 
CAGGCANTGAN CCCACO^TGCCTTKXXXG CATGTGGTCAATTTCTTGGGGGGTAAAAC C G 
GATCCGAATTTTTGCAGGTTG C I I I I I GTGACCAAACTTN I I 1 1 I NGGGGGAAA 
Sequence 1390 

g<^tatctgcagaattcgcccttcgagcggccgtc^gggcaggtactctcaaaagctagg 

GCTGCTGACTGAGCANCTACAGAGCCTGACTCTCTTTCTACAGACAAAACTAAAGGAGAA 



ganttttctactgagtt 

Sequence 1391 

CCCTT AGCGTGG NCGCGGCCGAGGTAC TTTGTTTTNGOrTGGTNGG I MM I A AATAACA 
GCTTTACAGAGAGATATNATTCATAATTNATAAGGNTTT AAC1 Mi l l ICTTTTTTTAAG 
ACAAAGNTTACCTTCTGTCACATTGAAAAATCTCCTATATTCTNGGAAGATTCTGAGC 
TACATTC^CGACCCAGGTTTGGGATTNNGCATACTATTGGANAAACTGTTTCCTGAANAT 
AAACACTTGAAGA ATTTGAOA^M^AATAA/mC TAAAACCOGA/WACATTGwAGACAAAGOC 
MCAAAAACATTTGCCT TAACA TTGC AWNVkA AAATTACTTTAA^ 

GNANAAAAAAANAAGNMM INI I I GTTTTGWNTT WGCAAAAACTTTTTGAAGGAATGGC 
ATTGAAMCTTTAWNAWGGGGCGAACCWCCNTTCCAAAGGGAAA A1 1 M l 1 1 TNCCTTTNA 
GAAGGGAATTGGANCTNAAAAWAAAAATWTNGGGTTANAAATAAAAAAAAA H I I II 1 1 M 
TTTACAAGTTNK3CNAAAAAATTAAAAI4AAAA^ 
TTTTTTNGAAAAffTNGGANAAGGTTTTAAAAAAATTCNMAAAAAA 
Sequence 1392



TACCATAACTTACTATTTTTTCTTTTTTATTTTTATATATAAT1 
GTTTGTTTT1 1 AAT AAACTGTTATCACTT Ai 
GACCACGCTAAGGG 
Sequence 1393 

CCCTTA<^GTGGTCGCGGCCGAGGTACAACTGCCCTACATTrCTGCCTAAAGGCAATTTC 
CAGACTACACANACNGAGANGAAATGCAAATAGAGCC(^NCTGTCTCTGAAAAGAGACAA
GAGAAATCTAATTTCT 

Sequence 1394 

TCTGTGTTGCCCAGGC TGGAGTGCAGTGAC TATTC^TAGXSGGCAAGCATTATGCACAACA 
GCCTCAAACT CCTGGGCT CAAG TGATCCTCCTGCC TGAGCCTCCCGAGTAGCTGGGACT A 
TAGGAGTGCACCACCACGCCAAGCTGGCATTCTCTG 1 1 MCI I ATTTCCTGATTCTACTT 
TTAGCTTTCTTAATATGCTGATATGTTTTGTTTGGTATATCATATATTAAAAAAACAGTT 
CATCTCATCCCCATCATTNTATCTTT AAGAAGCCCCCCAAACCATTTTACACATTTAGGN 
AAACAATGGGCAGGCAATAAGG MTAGNGAACATTCCATAGCC<nCTTTTGATAAACCACA 
TCCTTACCTGNTTTT ACTNGTNAAAAAAGG^ATTMTACAATrGGGTTTCTGGCNCCCTAA 
AAATTCAAAACCTTAAC M I I I I I I G GGAGGGAGTTGGGWGGATNCCAATAAAANGCCNA 
THH 1 17 T r T 7 GAAAATCfOTGAATGGAATTGACCTGOATTGAATTCCCATTTAAAGTCTT 
TTACTTTATTANGGTTTTNAANACTTTATTTTAA^ 
CNNCTTGGGGTTCTTAAANNrrTAAGAAAACNNAAAATTTTNTCC^ 
Sequence 1395



GTTTCCGGTGGAT7CCCAGGX3ACTGTCGGAGGTGTGGACT 

GNAAGATGTTCTACCATATCTCCCT AGAGCACGAAATCCTGCTGCACXXXJCGCTACTTCG 
GCCCCAACTTGCTCAACACGGTGAAGCAGAANCTTCTTCACCGAGGTGG^^ 
ACAGGGAAGTATGGCTTTTGTAATTGCTGNCACCACCATTGAGAATATTG G 
TGATCCANCCNG^SCCCGAGCCTTTGTCCTTNATCCAGT^ 
AAACTACTTTGTTGAGGCTGMTGCAGCAAAGGJ^GCAAACTAACT 

NTAAAAAGTGACACATTANT AATCCTTNAAAGGAAATTCAA 1 1 1 ICI Mill INCTGGNN 
CTTCNTTTTTGAANCATGGTTATGGGAAACCTT AAGCCTGTNTTAAANNGGAtf^ATTCTT 
TTANTT AAAANNTGNAAAANNGCCTTTTTKTACTCCTTTT AAAAAAATAGNNATTTNTTA 
AATNCAATNGAAATTGN NTN GGGGAAAAAA 

Sequence 1396 

CCCTT AGCGTGGTCGCGGCCGCGGTACTTTTTGTTTTATTTTTTA T J 1 rTTTGAGAGGTA 
TGATTCTTTCTAGAG AI MM I CTCATGGCTACTATTAGATCAGGAATGGGTGATTGGGA 
GATTATTAjGATCTAGGTTAACTTCT accacttt accct AATACATAAAACTTTTTCCTAA 
ATAAATGATGGAAGGAATKATACTTGGGTTACCTGGCATTATTTTCAGTAAGAAAAAAGC 
TTTACTAACCACTACATTTATGGAAA^^TGTAGGGGTAAGTATTTTATAGGTCATAAAAA 
AACACCATAATATTAACGAATCTCA I M MC I II I A AATGTGAATTAAAATCCT AACAGG 
CATTCTTTTATAAAAATGACCCATAGGCTAAAAAT 
Sequence 1397 

CCCmCGAGCGGCCCG<X:CGGX3CAG<3hL^ 

GT CACTAGTATGTG TAAG GCAGT CTGCTAGGTTCX^AAGG AATGTG GGGCTAAGTGAATAA 

TGCATTGTGAAACATGCCATATGGAAGGGATAAACACTGATGACAAAGTNATTGCCAACT 
TTTACTAATTTTGTCAAATTTTAAAAGAGGTACCTTTGGCCNCG 
ATTCCAGCACACTGGCCGGC 
Sequence 1398



CTrrCTGG^AGTACGGTCACTGCAGATGTAATTAAGTTGANGATCTCAAGATGAGATCAT 
CCTGGATGCAGGATGGGACCTAACGAT AATGGCTGGTGTCTTT ATAAGAGAAAGGAGAAN 
GANATTTNAGACNCANACATGCANATAGGAAAGCCKC^ 
GAGT GNTT AACCT ACAA 
Sequence 1399 

CCCGCC AGTGTGATGGGAT ATCTGCAGAATTC GCCCTT AGCGTGGTCCGGCC GAGGT ACT 
T ACATAGATCT AATTTAT ACAGTGAGT CAAGACGT AGAAT AAATGC TCCCACATAGCCT N 
TCTTTTGCTTTTGCTTCTCTCCTCTGAAGTGTGAGTNGAGT^ 

ATGGCTATTTCCTAAGTTGTAAAGTfKTrcCATTTATAAMTGCCAMTGTTGNAAGGTGGTG 
TTTCTANACCTTCCCTGATGCGATTTTA 
Sequence 1400



CCCTT AGCGTGGTCGCGGCCGAGGTAC I I I I I I I II 1 1 I I M I U I M I C M IC M I I M 






nimnui ii iMm iii ii i m i M ii m nH ii m ii n i ininn n 

Tm ITTTTTTTTTTriTTTITITTT TTITH M NAAAAAAKTrNAAAAAAAANTTTTTT 
TH^AA/SAAAffTTKAAAAWNAKNAAAAAAAANNNHNAAAAAAAAANTTGGGGGGAAAAAAJ< 
T*<lAAAAAAAAAAA»iATNNNAAAMT>itWiAGGGMAAAAMN mm I NAAAAAW4AAAA N AA - 
AAANTTNAI I III 1 GWCNWGC TTWAJitCAAWTAhlTTTTTAAAATCCAAATTTAAAAAAA A T 
TNA AAAAA AAAAA NI HUM I A AAAAAAMTNNNNAAAAKTTTTAAAAAAANCCCCCCCC 
^^^TTCCCNAANTTNAAAAACCKKTKGAAAAAAAAANAN CCNCCCNA TNAAAAA TKN 
AAAAAAAAAAAAAACCCCNffTGNNAAACAAAAAANNAAAAAAAAAAATTNN I 1 1 1 1 1 1 1 1 
TAAAA 

Sequence 1401 

CCCTT AGCGTGGTCX3CGGCCGAGGTACTCAATCAGATGTTAAATTCTTCAATGTAAATGC 
TCTGTCATGCCATCCTACCTCCTGTGTCCCCCAC^ AAAAGCACTC 
TGGGCACAG TAG TT AGACAATAAACGC T AAAAGCCT GATTTAACAACTGT ATATAAACAA 
ACT ACTTTTATGTGACTACTATACCTCTGGGCATGGTATTAACCTATCCCAACCAGAGTA 
CCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 1402 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC TTTT AAAATATATA TTTTCTAATTTTGAAAC 
*TTCAAGCTGCGCATAATG<JTTGACACCTGTAATCTTGGCTACTTG 

GAGGATGGC TTGAGGCCAGGAGTTCAAGACCAGCATGTGCAATACAGTGGGACACCTTCT 

GTATTTAAAAAAAAAAAAAAAAAAAAAAAGGAAGGAAATGTTCAAATACACAaAAAAGTT 

GAAAGAATATTATAAAGTGAATATCTGCATACTTTTCCC^ 

CATGCC ITC1 G AATTGTACCTCGGCOGCGACCACOCTAACGG 

Sequence 1403 

cccttagcotggtcg cgqccgaggta c imiiniiiiiiiiiii ilium 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ANAANGGTGG TATTNT AACATTTATTAAAATAATGCTGGGGGT 
^^^^^CMMCAAAKAACCAAANAATTAAAATGCAAGCTKTTrAAAATCCCAACT 
Sequence 1404

CC CTTTC GAOCGGCCGCCCGGGCAGGTACTTTAAI 1111 I C 1 ATTTATGAATTGCTTATT 
TGCTTTGCTCATTTCTCTAGTAAGCTGCTTTTGTTAATTTGTGAGTAATTTATTCTAGGT 
ATCAGGCXrrCTGGCATGTTTCAAATTTCCTAG I U I C T 1 1 GTCAAAGAGAAATTTTTAACT 
TCAACATAAGTAATTrGTCATCTTTGTCCTTTAL, I 11 1 1 G > GATTTTAAGGACATAATAT 
CT ATTAC TTTAAAAGTATTGAAAGCTGTATGT ATATTCTTCAACT AGCCACCTT ATTTCT 
GTTCTAGAGTTTGAAI 1 1 U l I AACTCCAAAAACAGACAATAATTTTTAAAGTCTTGATCA 
AACTCTGTT ATCTTCT GCATAGTCTATTTTTCAGCATTC^TTAAATGAATTGAGAAAAA 
GGAGGTACCTCGGCCGCGACCACGC T AAGGG 
Sequence 1405



CCCTTTCGAGCGGCCGCCCCGGCAGGTACCTGGCT ACAGTAAATGCTCAAGGCCCTTTGT 
TATTATTTCAGATGGTCAAGAATAAATGTTTTTCAAGGATCTTCTTTTT GTAGACAACTG 
TGTAGTCACAGTTT AGAGTCGTAAATTATCTGCCTGGCAAGATACTTTTTAAAATTAAAA 
TGTAAAGAACCTGAGGGGATTCACTCCOAATGTTTATGGACAAACTGAAAGGGCA 
C^^^^^TATTACCTTCTACATTTATGTGAGAAAGTGCTTTAAGACACTGTACCTCGGCCG 
CGACCACGCTAAGGG 
Sequence 1406



CCCTTAGCGTGGTCGCGGCCGAGGTACATACAATAGAGTATTAITCAGCCTTAAAAAGGA 

TGAAAAAATCCTGACATGCTAAAATATAAATGAATGTTGAX5AACATTATGCTAAGTGAAA 

TGAGCCCATCTAAAAAGGCAAATACTGTATGATTTCACTTAACTGTGATATCCAGAGTAG 

ACAAATTCATAAAAACAGAAAGTAGAATAGAGGTTTCOAGGGACTGGGAGTTAC TTGATA 

TAGAGTTTCAATTTTGCAAGATAAAAGAGTTCTGGATATTGGTTGGACAGCAATATGAAT 

ATACTTAACACTACTGAACTGCACACTTAAAGATGGTTAAGATGGTAAATTTTGTTAGGT 

GTTTCTTACCACAATTTAAAAAAAATmAATTAAAGGAATTAAAAAATT^ 

TATTCATCATTGNGGTTTNCAGTTTATATTCAACACAGCACTATTtCAGGTATAGTAATT 

AACTTACTTT 

Sequence 1407 

CCCTT AGCGTGGTCGCGGCCGAGGTACTAGAAGACCTTCCTCGCCACTCTCTCCACATGA 
GAGAG TCAGC tg ccc tttc t cc t g t gc c TC TGCAGGAA GAAC TC TCTT GCATGGCACATC 
TCAGCTCCTCATTGAGGGATAGTTTTCTTTGATAAGAAACC TG GAGTCCATTTACTCTGA 






CC TCTC TTTAAAT C TAT ATCCAGAGCCACT AGCCCAGGAAAAACTT GGG TGACCCGTAAT 

AGGC TTATGACAGTTTGTATGTGCTCAGCCAATGAGCAGAAAACCTC 
GACTTrAGCCCAOCAG fTTGTCI GGTTGACT AACCTGCTGAGAGCTAAAATTGGCACCCA 
TTGCCCCGTGCCTTCAGCCAOTCTCCTGGGGCAGAAGTATGC^ 
C^ACTGAGTGGGATGTGGGTGATGCTGACATGACTGGCTAGAGC^ 
GGNTMACTACT AI mil f 1 GGJ<CANGATCTCTTTCCC C' 1 1 H ri 1 1 1 1 1 1 1 1 1 
Sequence 1408 

CCCTrAGCGTGGTCGCGGCCGAGG TACCCTTT ATAGGAACCCT CAAA TT AAAAAAAAATG 
TCTTTTAATGGATGAGAGGGAA<XACTATAACATGAGTCC^ 

TATACAAT A I 1 1 1 1 1 1 1 1 AATTTTGGAGATAAAAGCTTTAAGAAACTTTTTGAGTTAATT 
ATACTCA TAAAA TGAGTTTCTTTAAT AAA TT AAA TTTT ATTGTGTAAAATGTATTATT AC 
ATAAAATGTGTTTTTGAATCAATGCAGTTTGGGGATGAATATAATTAAAATA 
AACTTAGAATTCAACT AATAAAAATTT AGCCACACTT, ACAAGGGGGAGGAAGT CCCTAG T 
TTAAAATGT ATAACTGAG TGGTAGATCAGTACCT GCCCGGGCGGCCGCTCGAAAGGG 
Sequence 1409

CCCTTAGCGTGGTCGCGGCCGAGGTACT ATGWnKTIWrGTTNCTATTACKKTTAATCCT 

T><CTTTf^TTGTGAGCTTGTNAATGCATGTWGAGGATffTGNAGCACTGT^ 

CTGTG 

Sequence 1410

CCCTT AGCGTGGTCGCGGCCGAGGT ACGAGCCT ATAATCTCACCT AC TCGGGAGGCTGAG 
GCAGGAGAATTGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCC^ 
CACTCCAGCCTGGGCAACAGAGCGAGACTCCATCTTAAAAAAAAAAAAAAAA^ 
AGAGAOAGAGAGAAGGAGCOGAGAAAGTGAAGTCATAAGTGTAGACCAC IPCTTC1 GAGG 
GAGAATCCACCCCACCTTCCTCCTAGCTTCTGGT GG 'rT G CT G GCA A I CT T 1GGCGTTCCC 
TAGCTTGCAGATGCAGCACTCCAATCCCt UC I l TC ATCTTC1 1 AGGGTGGTCTCCCT ATG 
TACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 1411 

CCCTT AGCGTGGTCGCGGCCGAGGT AC T I ITTT1 T TTT I III II 1 n I II 1 G I ITfTIIl 
II 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 NAACCGAGNAAGTTTTTr AATCCACTT AAAAAT ACAAHAG 
CWCAAATCCACAT TTATTTATTG A I I II ICG 1 1 AGTTTAAATCCTTGAGGGGWACTTTTT 
TTTTTI 1 1 II HIT 
Sequence 1412 

AACTTNCCCAIM N 1 UNAAW 

TTNANG f^TNA><GGCCCCATTGGGNCCTTTNCCCGGNJ' 
GCCCCCCANGHTTNG^ 

GGGCCNCCCGGGCCCCCCCCCGGGGGGGGCCCCAAGGOGG<j 1 1 1 I AANCCCGGNCCCC 

GG 

G NGGGGGG^K^GGGTTTGGGGGGAAAAAAGGGAAAAAGG 1 1 1 11G GGCC NI111 I CCTTTG 
GGAAAAAATTTCCCCCAAGGNCCCCX^TTTTTNCCCCTTTTT^ 

GCCCAAG GGGGGAATTTCCTTT AATTTTCCGGGCCTTtT*GGGGGGAAGGGCCCAA 
TTTGGGGCC TTTTTTTN KTTTTCCCCCCTTTNAAAAAGGGGGGGAAAAAAAAAAAAATTT 
AAANCTTTCCN1 TTTTT 1NGGGGGT TTNAKWN AAAAAI<GGGNNCCCCCCCWAAGGGGAAN 
GGGGAAAAAAAAAGGGAAAAAAAANI 1 1 T M 7 NAAAAAMTTTMCCCAAAAGGWCCCCCCC 
CCNCAAAAAAAAAAAAAANN TNNNTC CCCCNCNA* N NANAAAAAN NT ATGTNTC N NAN N N 

NNCCCCCC 

Sequence 1413



CCCTTTC GAGCTGGCCGCCCGGGCAGGTACT 
TTTTTTTTTTTTTTTTTT 



IU I H1II I I III 1 I 
1 1 1 M II 1 1 1 1 1 1 i l i 1 1 ! I 1 1 1 



TTGTTTTT 



I I ■ I 1 1 1 I 1 1 i 1 1 H 1 1 
TGGGGGGN^^TCCCNAAAAANTT7NNTNNG^01AANTTTNCCAAANTTT 
NAAAAAAATNC*K3NNTTNNKAACTMANNAAA^ 

TNNN^^CN^4AAAAAAATTT^^^TTTNTNNTTTTACNNCNAAAAMN^IANAAAAN I I I 

Sequence 1414 

CCCTT AGCGTGGTCGCGGCCGAGGT ACGCGGGTCAATTATCTTTATCATAAACATTTT AC 






^TGCAGCTATTTCAAAGT^^ 
Sequence 1415 

^^^^^^^^GGCCXCCCGGGCAGGCAC^ACCTTTCAJO^^JGCJ^ TTCTTTCATGACCAT 
AGTGI J 1 1 1 II I CCTATTACTCTTTCACTTACTCACAGGATTCAACX^CATCTGACTCATC 
TGTTCCTCCTCCCAGAC TC 1 1 L 1 1 GATCTTTATTTTTTTAATTTACCAGAGAAGAGCAAG 
CACGTGAGCAGTGAAT AAC TTGCAAGGATGCAGACTTTTTTATTTTGCGATGCTACTTTT 

AT IC* IAAAAGGGTTTAGAAAGAAAGAAAAGACAGCTGTTAGGTTA7TTGATTTTCAAGT , 

TTATCAAATAAAATTCAAATAGAATTGGCAAATCTTTAATGGCATATGAATACTTCT 

ACTTAGTAATTAATTTGAACAGAaATG™^ 

TATGTTTTATAAAAAGT ATCACCTAGCAXGTGTCO MCATTTAAT ♦ . 
Sequence 1416

CCCTTAGCGTGGTCGCGGCCGAGGTACACGTGTTTTCCT^ AG 

CAAATTAATCAAACCTAAGAAGGGGGTCATGGGAACACTGACTTGAAGCTGG^ 

AAGTTCTGGATGAGGCCTGGCCTTACAACTAGTGTCTG^ 

CTGAGCCo I v. 1 1. 1 i^AGCCT GTGGGATCTAATGCTATCTCCAGGT AGATAGCATGAGAATT 

GGTG<X^3AGAAATCCCCACACATTTGGTCACAGAAGTCTAC I G I G I IGATGATTGTGGTG 
TAAG *GCAGAGGAAAAGCAATTTGATTTTTCTC^^ 

TCAACTAA TGATTTACC 1 1 1 CATTGTAAGGTTATCATGCTCAAGTATTAATGTAGGAAGG 
CM 1 1 1 1 GATGCAWAGTGTGTGTGTGTGTGTGTGTGTATATATGTG P S I U I 1 1 GGAt 
GCT AACATTAAAAAGGGAAATGTATAAGGAAGAAGAAATGGNGNTCTAAACTTAA 
Sequence 1417 

cccjtagcctggtcgcggccgaggtacagatca^ 

^^^jy^^^^AACTTTACTTATT^AOAATTAGCTTT GAGACTTTGGGCAGGTCACAATTTT 
TCTCTATCTCCTATCCTGTAACT^ 

ACTTGG 1 1 f HG I Mill I T T AACAGAT AAAAAATGTGACTGGGCACAGTGGCTCATGCC 
TGT AATATCAGCATTTTGGGAGGCCAAGGTGGGAAGATCGCTTGAGGCCAGGAGTTTt^ 
AGGGGCCT GGGCAACAT AATAT GATC T CATCTCTACAAAAAAAAAAAAAAGGAAAAAAGG 
CAACATTAGTGGG^STGTGGTATTGAGCACTGTAGTCCAAGCT ACTCGGGAGACCGAGGCA 
GGAGGATTGCTTGAGCCCAGGAGTTCAAGACCAGCCTGGGGGAAAGTTTCT>^TGGGC TG 
CAAAACAGCATCT AGCCATTG TCCTC TTCAATGT ACCTGCCOGGGCGGCCGCTCGAAAGG 

Sequence 1418

CCCTTACCGTGGT03CG<XXGAGCTACT 

AGTCGTCAAC TGGT AACATGCGATT AGCTGCT AGAGGGACT GAGGACTCT AAAGAGAACA 
TAAGCAGCAAATTGCAAGAGCATCTGTAACTGCTGGGCTAAGGCAGGGGACCCAGGAGGG 
*GCA^TCCAGGAATGGGGTGGCTCCCCAGGGCCGAGATCCAGACCTCATTAAAC^ 

AGCAGGGGCCTGCTTTCTGGGGGTGCTGGTGGAAATCACTAGACAGTTACCCTGTGGGTG 
CCJ^GCAACACTTTCTGGGCGTT AT AAGGAAGATGGCC TCT AGTGTGCT AGTGGAACTCTC 
TGGAAGCTACCTGGAGGGTGATGCCAAGAGAATTTGCTGGGAAGCCATGCTCTG^ 
TGGTGG^gTCCCTAGGAAACTGCCT 

Sequence 1419 

CCCTTAGCGTGGTCGCGGCCGAGGTACACATAAGTTC^TTC^ 
GGAAAGACTAAATACATTTGTGTCTATTAATCAAAATATGAATTTAGAAGGAAATA^ 
T^T5* ^'^^^^^^'^^^^^^ TTGTTTCCA.TAATTT 
1 1 * IAAAAAGAAAAATTACAGAAATAAGACTTGGGGGGTGGGGGTTGAAAAGTGGTGAAA 

GAAC taaacaagtagaagagga tttct aaagcac TGGTC tca t gaaaaaagtttca tgtg 

TGACTGGGTCCACTGAGATTGAAAAGAAATTGTTTATACG^^ 

TGCTGTCAGGGATG ACATG ATACAGGACCAGAGTCTGTG TAAACAACAAAGTTTTCTTAA 
^ TA ^C* T *CACGCTTTTAAAAATTGCAAGAGGTTTTAAGTTT aattcaaaaatctgtt 






ATTCCAGCACACTGG 



T AACAGCCA TTTTG T Al 

c 

Sequence 1420 

CCCTT ACCGTG^TTCGCGGCCGAGGTACACCTCAGAGAGG^ 

AC T ATGC CTG T GGGCCAAATCTAGCCCAAGGTCTT GTTTTTGT AAAGTCCCTGTGAGCT A 
AGAATAGTTTTCATACTTTTT AAAGAGAGAGAGACAGT GTGTGTATGTGTGTGTGTGTAT 
AATGTGACAGAGACTTTATATGGGCCTCAAAGCTTAATTTCCT^ 
TGCT GACCCCTGATGGATGCTATAAAAATAATTTCAACTATCAATAt 
AACCCAGTGAAAAAATCGGCAAAGAACTTCACCGTA* 
Sequence 1421



CCCTT AGCGTGGTCGCGGCCGAGGTACGACGT AACTCX^GACATAGGCTTTAGACGTTCT 
CATGCCACCCTATCTTCAAAACCACAGAGAGTT^ 

CAGGGAACTTCTAAAATAAAAATCTTACCAATCTGC1 1 (jGCCCAAAACTTCACCCCATCT 
TGGAAGGGAGGGGAGACAGAATGTTCTGATCT AT ATCTGATGAGGGCGT GTGGTTGGGAC 
CTGAGCATCCTCCTGGTTGGGCTAGTGATC 3GGAGAGAGGGCTGTTACTCACGACTCCCT 
CCAACAGAATACCAGAAACAGGCAGGCAGCTCAGGTO 

AACCAGCCCTCACCAAGTT AOCCCTGTAAATCCTTGTCTCCCCA7GCACCTCTACTTTGA 
GTCAGAAATGGATTCATTGCAOGCTCAG I I G 1 1 1 0 1 A TTATGTGAATGAAC 
Sequence 1422

iTACCAAATTCTCTTATCAGTCAGGGTTCAACCAGA 



ACGAAACCCAGGGATGAGT AATCGGAGGGGAGCAGCAAGCACAGGGAAAArGATGACT GGG 
AG^CAAGAAACTTGGCGTTCAGTCCCAGCTCTGCCCT GTCATTTTCCCTCACCTGTAAAA 
CTGGATCAGAAATCrrTACAAAAACAAAAAACAAAAAACC IC I I CAGTATTTCCCTCAAAC 
AGGATCCTCCTCACATCTGTATTTATATTTAAAAAATAAAAACAGAAAAGAAAAAGAA^X 
AGCATGACATCATTAGGTGTGTGT ACCT CGGCCGCGACCACGCTAAGGG 
Sequence 1423 

ATACTGCCTGCGTGGGGGTTCACAACACTTATA^^ 

AG TAAGT GGAAGTTGTTCT GAAAT AAGCCCCTGGCAATTTTCTGCAATGAAAAGGAGCAG 
AGGTCAT F Tl CT T ATAATGCTCAGCCTCAGAGATAGAACACTGCCCGCGTACTCTGGTTC 
GGGTTCAAGTGAGAGGCTTTTCATGAAAATCTTAGGATTGAAGAGCTCTA 



TAGT AAGTTTT ACTTTT AATACCAGAAGTGAAAGAAAATT 

Sequence 1424

CCCTTTCG AGCGGCCGCCCGGGCAGCTACTT N IIIIHIIIMIlll) Mill GGGTANT 

nin iii iii i n ii M i ccMiLiiimui i MMii iii i i Mni N iii i i i n 

TKGGGWNAAAACCAT>ICTTTKTNAAWWN 1 1 W 11 1 1 NAKMCATttCGGGGAHAGGHTTHAW 

ANKNAAAAACWTTAAAWGCATTTTAWfrTTTTTHAA A CCAAATTTT^ 

C TGAAAA NAKNT TT GGG^rrTTCAAATNAATTTTTTAAANCAAAAAAA A A C I MCI NCNAA 

TNTTTANWTTTT AAAAAAAAXATTTAAAAAAAAHG>0>rrTATAAAAGNGGGWTTGAAAAA 

NNCKN T NNTTAOAAAAT NAKATTCX^TTTTTTAiZ^iNGKTT NNf^TTTTTNGG TT AAAAT A 

CNNTANCTI^TTCCTNAAAAACAAWAC^ 

AAAC I 1 1 1 1 C I HAAATTTTTT WGGKAAAAAAAAAAA 
Sequence 1425

CCCTTAGCGTGGTCGCGGCCGAGGTACT ACCJkTCTTAACAATATTAAGTCTTCT GATCCA 
TGGCCACCAAAIUICI 1 1 CCACTTATTTGGGTCTTCTTTAATTTCTTT CAACAATGTTTT 
GTAGTTTCCAGAGTAAAAGTTrTATGCTTTGTG<^TAAAGTTATTCCT ATCAAATTGTTT 
TCATGCTATTGTAAATGGGATT GCI MCMIIICIIM C 



rr AGCAG TTGGGACT ACAGGCAC 
ATAATTrTTGTATTTTTTGTAGAGACAGGGTTTCACCATGTCG 
GCTAGGAAGGTCTTGATCTCTTGGACCTCGTGATCTGCCCAGCTCGGCCTTCCAAAAGTG 
TTAGGATTACAGGGCMSTGAGCWGGTTTTCKTTTGI^MTTGGTT^ 
CCCTTTGCTGCCCAAGCCCGGGAANNTGCAAGGGGTGTGfMTCT^ 






CCTTCACCCTTTTGGG 
Sequence 1426

COCTTAGCGTGGTCGCGGCCGAGGTACGCGCTTCAGGGCCCnrTTCAACTAAGCACTCTA 

CTCTCAGTrTACTGCTAAATCCACCTCGACCCTTAAGTTTCATAACG<KrTATCCTAGTTT 

TCTQGGGTAGAAAATGTAGCCCATTTCTTGCCACCTCATGGGCTACACCTTGACCCCCGC 

GTCCTGCCCGGGCGGCCGCTCGAAAGGG 

Sequence 1427

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATATTGCTTAGAGCAGTGCTTTCAGATATGA 

ATCATrrCTAGAATtKSATTATAGAAGGATGGGAGCTTTTAGTATTTAGTAGTrTCCTTTC 

TTCTCCCTAAGTTTACAATCCATrTTAAAAAATGAATGAATTAAGTATCTCCGAAACAAA 

CTGGCAATTGCTCT GAAGACAAGTTTAGCAATTTCCGTGAAATAATTCTCTGGCTTCGGC 

CAAGGCCACTGATTGATTTCTAAGCAAAACAACAAATCCCGTCAGGATCAGGAATGATGG 

CAGAGTGGCCCTGTTGGCTTTGTAGCT AAATTGTGCTCAGGCAGAGAAGAACCACGAGCA 

ACAGAGCCCTAAACTGAAGTCCCCAATTCTGTCTACTCTACCG7GCTGCACAAAACTAGT 

ACCTCGGCCGCGACCACGCTAAGGG 

Sequence 1428

ccctttcgagcggccgcccgggcaggtacagtctt attttcagcctaaagaaatggacac 
ttctcagcataggcggacgtgattggttgtggtcgaatccttttcctaaccaggatccat 
aatatcacagacaaggtaatatagcactg7gaaggatgtgtctttcttcaaatggagcca 
tga gagatg gtgg i m il i a agttgatttgatgttggatgtaagtaagtcctgtgggaga 
gaatttttttaaataaaaaatactgtttaaaagtgtctcttctaacttgatctctacctt 

TTCGGCTCTNCACTTCTAACTGCCCCCCACCAGCTACACTTrCCGAGTTTGAA ATAATGA 
ACAATACCI 1 1 1 GC I G ACAGACCAAACCTTAATTTCTGTGGGCAAATGANGGGTTTTTTT 
CCCCCCAACAATGAAACAAA1 1 1 1CI 1 1 GAAAAAAMTCTTCTCAAAGATGGTTCTT ATTG 
NAAATAACCCTTCC 




Table 2 






ACTTAATATTTATATCTTATTTTTATTATAATTTATTTATTTAACT ATTA 
T7TTTACTATATTTACCTTATATAATATTTTCATT7CT7CATATTTATAT 
TAAACGCNCCCro^CNAATGGCTTTGCTCTGAGCTCNCTCCGGA 
CGAGGTA C ' I II 1 1 1 1 1 1 1 II 1 I II 1 1 1 1 IG GACATAC TOAGAO AATTTGO 
AATTATATGTTATOGTAGAATAAAGATDGA GOTCCA TTTTTCTATACATO 
A AAA TTT AAA T ATTT A GTTTOGG A TTTG A G ACTTCT ATT AGGCCTCTGT A 
I I IC1 1 1 C I A GTlTTrTCGCTACCATTCTTTAATOGGAGTATOCAAGCCC 
AATCACCCTCTATCCTATGTCCTAAAGCATCTTGAA 1 IGG1 JGTTCATGT 
1 1 1 1 111 ICA TCTOOA G I GIC I I 1 1 GCCAGCCTCTT AQCCTA TCTG ATCC 
CACTTAGCCTCI G AGGTTCT G TT A AGTTCTCAC CI TCT TI ATG AA I U 1C 
CCCAGCCATAATGATCTTTTTAACCTCTTTGAGCTTTTACTATTTATACT 
CTTT ACCTAAOCAACTAAATGGTTTTTGTG AAATGTGAG AAC ATATAAAT 
ATG AA TOGA TAAAATACTGTATGTACAAAAATTTTTAATA TTT ACAACTA 
ATAGCAAI II I ! 1 O TG ATGGAOCTTTTrAGGGAATTTrrATTTGGCTTTT 
AAGGGATTAGGGTTTATGCCTAATTAAATTAATTACCATGCC 
>Sequence 2

TTTTCTTAGCTCATCGCGGGCGGCCOGAAGAGCAACCOAGATGAAGGTGA 
AGATGCTGAQCCGOAATCCQGACXATTATGTCCGCGAAACCAAGTTGGAC 
TTACAGAGAGTTCCAAGAAACTATGATOCTGCTTTACATCCTTTTGAGGT 
CCCA CG AG AA TATA TAAG A GCTTT A AATGCT ACCAAACTGG AACG AGT A T 
TTGCAAAAOCATTC CJ IGC1 I C GCTGG ATGGTCACOGTCiATGG AGTCAAT 
TGCTTGGCAAAGCATOCAGAGAAGCTGGCTACTGTCClTrcTGOGGCGTG 
TG ATGGAG AGGTTA GAATTTOG AA7CTAACTCAGCGGAATTGTATCCGTA 
CCT 

>Sequence 3

TTGTCTGTTGCATCGAGCCGGGCGTNCGGAGAGGAGTCCTTTACTTAGAG 

TCAAGCTGAAGGAGCATGACAACCtDCAAAGACTGTTATGTTGTGAAATTT 

A GGCTGTGTTTTAAT AAT A CTG ATG ATG ATAGG A TG A AATAGTAATTT AT 

TGATT ACTA TATCTACTATATGTCCXTTAAGATAGCAGGGTCTTT ATA CTC 

GGAATCTCATTTGATCCTCATA G 1 1 1 1 1 A TTCGTT ATTATTATCCTCATT 

TTACAGATACAGAAACTGAGGCTTGAGAGAGGCTGTGTAATCAAGAGTTT 

GTATGCC1TTCATCTGAGGAGGTTGAGGACAATCXXAAGTTAGAAAAATA 

AATGTCTTTAGGATTATTlTTCCTTAATXnTTAGAATATrAATAAGTTAC 

TCAGATAATCTATTGGAAiyiUl - ICATGGCAOGGGGAAGAGGCTAGAGTT 

G O! 1 1 I rOOTTI ITGTTTl I G GCACAOGGTCTCACTCTOTCAOCCAOGCT 

AG AO 1 1 1 rOT GGTGTO ATCl I G GCTTACCG AAGCTTCAACCTTCTGGGGT 

TCTACCTCAGCCTTCCAAGTAGCTGGGACTACAGGGGTGCATCAACAOGC 

CCOOGTGTAOCTOGTOOGTTTAGAAATO 

>Sequence 4

TG A GCCGTATGCAT AG AGNCTGGCGTCCGAGGT ACTCA GTTTCCTT ATCT 
ATAACATGOGGATAATATrCGTAGCTACATCGTTGTTATGAGGATCAATA 
TCTGTAAAGCTCTTAGAACATGCA 1 11 1 1 C I 1 1 G I A CTAAATTGTAAGGTC 
TGGCAGGCGCGGTGGCTCA CAOCTGGTAATCOCAGCACTGTGG AAGGCTG 
AGGTXXKXKSCAGTGGGGAGCGAGGGGTTGTrACTACTOCAATGTAACT^ 
TTTCTXAGAAATTAAGGCAAAAAGTCITACTQACCATGTAAAGOAAATCC 
AACAATTATAAACAG1CIC1 GCC 1 1 1 AAGGAGCTTATAGTCTAGTTAAGA . 
AAGCAGACTTAAACATATGAAAAGTTAAACATTGGTXAGGCACAGTGGCr 
CATGCCTATAATCCCAGCACTTTG<^GGCCAAGGCAGGAGGAT^^ 
AGTTjTAGGAGTTCGAGACCAGCCTGTCCAGCATOGAGAAAGOCCATCTCT 
ACTT AAAATACTA-AACTAGTTGGGCATGGTGGCGOCTtXXTrGTGATXXXA 
GCTACTT^JTTJAGGCTGAGGCGGGAGAATCATTTGAACCCGGGGGGAAAOG 
TTATGGTGAGCTGTGACOGCOCCATTOOC 
>Sequence 5

~ ACTCTTTCATTG AAAGG A 
AATTAGGTTGAAOCTCCAGGAGGCCGTCAGAGTCTOAGGAGAGGCTGGCT
TGATGTCTAGATACGACGACAGCAAGGCTGCTTAGAGCTAACAGCGCATT 
GGCTTTCACTACGGGACTCTOCT 
>Sequence 6

CATCTGTGOCCNNATTTGAAATGCGAGCTTCACGGCGGTTjGGGGCOGCOC 
OGGCAGGTACCTATGACCATCTTACATTA 1 1 1 1 1 A TGGGTGGGGGGCATT 
GGCTGTGOAATGTOGGCAGTAACTTGCAC^GTCAGTAA 
CGGGTTCTTGGCATCCCCATTCTGGCACTCCTCCT^ 

ACGCTG O 1TTG1 G GGCGGACGGGCAGGTTGGTGCGTGGGGTGTCCGGGCA 
CTGGCTGTTiCATGCCTTCTTCCTClTCTC 

AA AGTC A CCA G TG A OCAA TTCTCOCAGTGT1 1 CTI TGGG ACTCA ATGCCT 

TGGGCTTGGCATTGGGTAAAGCCGACTCGCCAGTTTCATTCTOAC^ 

CT A T A GT AGTCCGG TG TG G A CCTCTGOCCTCCCTGCTCTGCGG A A GCTTC 

CTCA GCCTTTGCTTCTCA CT A TTT A CTA TTTGCGGGGCCTGGGGG T A CCC 

T 

> Sequence 7 

GGGCGATTTGCAGGCCTCTCCGCGGTGGCGGGGAGGTACGGATCAATTCC 
GCTGAGTTAGATTXXAAATTCTAACCTXrrCCATCACACGGOOCAGAAAGG 
ACAGTAGCCAGCT IC 1 CIGGATG C I 1 1 IXXIAAGCAATTGACTCCATCACG 
GTGACCXTCCAGCGAAGCAAGGAATGGI 1 1 1 GCAAATACICGTTCCAGTT 
TGGTA GCA TTT AAA GCTCTT AT A T A TTCTCG TGGG ACCTCAAAA GGA TG T 
AA AGCAGG ATCA TA G 1 TTCT1 GGAACTCTCTGTAAGTCCAACTTGGTTTC 
G<X>GACATAATTGTCCGGATTtXGGCT^ 
TTGCTCTTC 
>Sequence 8 

TGAAGGTGAAGATTXrrGAGOOGGAATCCGGACAATTATGTOCGCGAAACC 

AAGT roGACTTACAGAGAGTrCCAAGAAACTATGATCCTGCTTTACATCC 

TTTTG AGGTCCCACG A G A A T AT AT AA G AGCTTT AAATGCT AOCA A A CTGG 

AACGAGTATTTGCAAAACCATTCCTTXXTITCGCTTX1ATGGTC 

GG AGTCA ATTGCTTG GCA A AGCA TCC AG A G AAGCTGGCT A CTG TCCTTTC 

TGGGGCGTGTG ATGGAG A GGTT AG A A TTTGG A A TCT A ACTCAGCGGAATT 

GTATCCOTACCT 
>Sequence 9

AGGTACCACATGCACTGATAGC 
T CIC1 1 1 O TATG AACAG AGCTGTGGCAGGCCCTATGCCAGGGAG AAAGTA 
AGATTGGAAAAGAGCTTACCAAGGAGGTGGCATTTGCACTGTGCTTAAGG 
GGCAAG^AAAAACOTCTTCCAATC^GGA 

ATOGCTGOCGGTGCACTCGGTAACATCTATAATOOCAGCACTITGTGAG 

GCTGAGGGTCGGAAGATTGCTTGGGGOCAGGA O 1 1 IU AGACCACTTTGGG 

CAAATTGCAAGACCCTGTCTCTGCAAAAAAATATAAAATCTAGCTGAGTG 

TGGTGGCAGCTGTAGAOOCAGCCCCAGCTACTCGAGAG^CTGAGATGGGA 

GGATCGCTTCGGCCrAGGAGTTCGAGGCroCAGTGAGCT 

ACTGCACTOCAACCTtXXnCACAGAGCAAGACCTOT 

>Sequence 10

GGTG^GCTTACCGGGTGGOGGOCGAACATCCATXJTTTrAACTAGC 
CAAAAOCTATGTGTTACTATCAAAATAAAATTrAGAAAAACA A 1 1 1 1 C 1 1 
ATAAAA H 1 1L1UI 1 1 U I ATTTX3GACTACATAAACTOOCTTTAAAATTOA 
G AAAT ATOCCCT AAAACCA T AA GGAAAAAGCCAACAGAAAGAACAAAAAG 
ATCACAGCAATTAGGCOGITCTATTCAATTTTCOCATG^GCTAAAAATCA 
CATTCTTCACAAAGTAAATT ACGCt X 1 G I III 1 1 A TTCTT AAGCACTAGG 
GTTAGGATIGTGATCTGAGCT^ACrAAATCGGAAAAG 
ATAGAACATTTAGTTTATTTATACCTTAATGCCCGGAGAGGTAATATTTT 

actttaaaatgcataacccatotgacatgctaog rcn O C AAAAC 



>Sequence 11

CGAAAGACCCTATCAGGCX3CGGCCGCCGACAGCTACGCGOGATTGCTGGC 
CTGGTTCTOCAGGGACCTGAGATCACTGAAGCTGTGGTCGCTGCOGTGAT 

gtggaggaggcagagctcagatagaaaaggagggagtgacactcaagctg 

CAAGCAGTGACAGTGCCCAGOGC 1 C 1 G A TTJTGTC TCT C ACAG C I P3TAA0 

GTGTGAAGACAGCTTGCC 1 1 iu atgtgggactggagtaggcaaagagttg 
GTTOCATGCeX.il UJU, i 1 1 GO 1 O GACCTTGGAAAGAACCJCC T G GAC rTT 
TGI 1 1 IL IGCCAAAAGGGCAACCTGGCAATGATGriCIGATGO 1 1 tCGTC 
GTTAGGGCCATAAATGNTTGTAGGGAGGGTGGGGAGTAAGTAGGAACCCC 
GCAATCOGGGAATCGC^TCAACOCATAGGGCCCCCTTGATTTGTCTAAAC 
GACCTGAAOCXX1 IGGI 1 GCCTTCAA TITGACT AACAAATTOTAACCTTA 
TTCTCCA G IT 1 TDO OCAOGAGAAOOGGGGOOGTTGTACCAACOOCOCTT 

> Sequence 12 

AGGTACI HI I UGll I 1GTAI 1 1 1 1 AGTAGAGATGGGGTTTCACCGTGT 
TGGCCGGGCTGG I C 1 1 GAAC1 C1TG ATTTCAAGTG ATCCGTCC ACCTCAG 
C CTGCCAATG TGCTCGGATTAC AGCTTGTG AGCCACCATGGCTGGCC 1 1 1 1 
TCTTTTTTTTTTTI AAAOGAAAAAATOTT1TTAATTG ACA AATA AAAATG 
ATGTATATTTATG OIGI 1 1 1 1 I UCI 1 1 PGCATCATCAGTCTC1 1 1LIIA 
TCACTGAAACCTACAAATATTTTAAAATCTTICCATTAAAAAAAT^ 
TOATCATTCAACXJICI 1CAAATT ATT AA GAGA TACTTACTTTGTATC AAA 
AATTTTCTCGAGATGTATAATCCA I 1 1 HI I CTGGG AAG AGAGTCAGTT 

> Sequence 13 

TGGGGTTGCTTNGCATCACTTAGGGCGAATTGCGTTJCGAGGTA 

TX^TTCOT5CAGCAGGATTTAA(^GATTK^GATCTGG 

A TCTGTG TTAA TGGTA TCA G ACTT AAAG A A GG AAAGACCTGA TTTG ACTG 

CTGTTG Ol I IG GTAGTGTTCCCTGATCCGGAGCCAG I I I I G I GGOAGGOA 

GTCCCAAAGCAGGTTTGAGCTGTGGTAATGACCGAGTT^ 

CAAAACAGTAGAATGGTAOCTGCOOG 

>Sequence 14

CTTANWTTGCTGAGA C I TCI ATCOCGOTOOOOOOOGAOOTACOOTATTCT 
CTT AAACAAGAGCAAGCCCATCATGATX3CCATTTGGTCAG1"! GCTTGOGG 
G A CAAACAAG AA GGAAAACTCTG AGA CAG TXjGTCACAGGCTCCCT A G ATG 
ACCIGGTGAAGGTCTGGAAATGGCGTGATGAGAGGCTVjGACCTCCAGTG 
AGTCTGG AGGGACATCAGCTGGG AG TGG TGTCTGTGG ACATCAGCCACAC 
CCTGCCCATrTXnXiC>TXX>OClClCt lOATOCTCATATTC GICI I I UGG 
ACTTGGAAAATGGCAAACAGATAAAGTCCATAGATGCAGGACXrrcTGGAT 
GCCTGGACTTTGGCC II 111 Id I G ATTCCCAGTATCTOGOCACAGGAAC 
TCA TGTCGGG AAA GTG AACA TTTTTGGGGTGG A AAGTGGG AA AAAGGA A T 
A l ICI 1 1 G GGCACGGG AGGAAAATTCA 1 1 C 1 1 A GTATTGCATATAGTOCT 
GATGGGAAATACCTAGOCAGTGGAGCCATAGATGGAATCATCAATATTTT 
TOATATTOAACTGOAAAAC 1 1 L I GC ATACOCTGGAOGOCATGCCATQCCC 
ATTCG CnCl l G AO CinilX XXXIG<rrTCCAGM C LI I GCA llGUI AGA 
TGA TGGCT ACCATAAG A TCT A T ATGGCC 
> Sequence 15 

GAGGTACTGCICCCT O CACGATCCAGTCAGOOOCTGCOOGO C1 00 1 1A T0 
TAA CAAACAAG ICT0T0TCTOT01 OGAGTGTTOCAGGACGAGTGGAAATG 
ACT G 1 1 1 C C AAGTTCATCGCAATTCAGAAOG OO C 1 ICA GCCAGACTOGTT 
CCAGTGCCTCTCCGATGGAGAAGTATCrnnAAGGAAGCAACTTOCATAA 
AAAGGGTCAGAAAGTCIGI IGAOOGATCA CI III 1 1 III lU OATAACTOT 
CTGCGGCTGCAGGAAGAGTCAGAO G rr C TT C AOAAGAGTGTOATCATTGO 
AGTGATTGAAGGTXXJAGATGTGATGGAAGAGAGOCTGAGGTCAO 
AGACAGCCAAOOOOOCTGTOGOTOG C1 POCTTCl OGATGGTITTCAAGGA 
AATCCAACAACCCTGGAGGCTAGACT ACCCTTGCTCTCATT1AGTTMCTQC 

GGG AGGTGCTTXiAGTGTATTGAA AGAAG AGTGGGACTTATTTGAGAAGTT 
1 1 ffOOCI I ATCAAGTAACAGAGCGGOGCTGTGCCCTGACT1TAA G 1 1 101 



TACCAGCX3CAAT 



ctctccatcaaagg 



TOCTCCTTOAT 



ACOAOCOGGGAAG 



^cgtaattttaatcccattttcatxwcacactroggagctagattacaag 
aacocaacactaoaatdoocgagatoacttdcacu^oooaactcctcaagg 
tctgaaggaaggt aggaaaaggtoggacacattocagacatatttagggg 
tggaggtggttggatatggggagtttaaaggggaaggaatctggggtgat 
ctggot ggtc^ gtcagtog gtatrgg tg acttg t a^ 

AAAACAO l i lOAC TXfl GOGCTX.1 1 ICAT A I 1 1 1 AA H I! GGA GCXTTCTCG 
OCTTTCTAA1 1 1 ItnXjTATTCTCATTTTTACIYXnTCACITTTGGGGTTA 
TCAGAAOCCTCCCTTTTTTAAAATTTTOCOLAAi 1 1 1ULAAA I fTOOCl IOC 
C TTAAA TATTGTTCATTTTGGCCCTTTTGTAATATTTCTAAAATTITTCC 
ATTTTTCAATATTTTOGATGCTCTGAAATTTTTAATAAAATA ICIO I IOG 
CAAAATTATATTGTTTACCATATCAGTCAT 
ACATTCTATACCOCTTTGGCC 
>Sequence 17



TOCTCAC TCTO I GGGCAOOGGCA TTACAGCATAGOG G I DOC 1 1 1 IGTCAO 
^^ATTTATGATCGCArCACACGCAGGATTCAGAGAGCATGAATTOAAAAA 
TACATATGATTGGCrOGGCGTGGAGGCTT ATGCCTGTAAtCCCAGCACTT 
TGGGAGGCTQAGGTGGGTGGATCACCTGAGGTCGGGAGTTCGAGACCAGT 
CTOAOCAACATOOAOAAAOCC 1 1 FCTCI ACT AAAAATACAAAATTAGCCG 
GGCGTCGTGGCACATGCCTGTAATCCCAGCTACTAGGGAGGCTGAGGCAG 
GAGAA I rp C I r GAACCTCGGAOOCGGAOOTTCCAGCGAGCCGAGATTGTC 
CCACTC41ACTOCAGCCTCKjACAATAAGAGCGAAACTCCATCrcAAAAHAA 
AAAAAAAAAAAAATOGTACCTT - 
>Sequence 18

TOGC GATCGAOACCTNAOCOOGOTQOOGOOCGAGGTAO 

TTGTC1 PCI AC^ATCAACTOGGTCATTAGCACAGCTCAAAC L ' I O C 1 1 FOG . 

C^ACTCCCTCCCACAAAACTGGCTOCGGATCAGGGAACACTACCAAAOCAA 

cagcagtcaaatcaggici i rccTTcn iaagtctgataccattaacaca 
catoctcacactgogoccagatciocatct o tt a aat^^ 
tgacacctggtacctoccco ■■ * 

>Sequence 19

OCGGCCA OGTA C1 1 1 1 H 1 1 1 II 1 1 III I HAI 1 11 II 1 1 1 1 1 1 1 1 111 1 

nun iccccccggagaggaatigggaagaocaaattgctgciuaaaat 

TTCTACATTGATCCAGACAAACAAGTTAGAGCAGGCTGAAAAAGAACCCT 

TGOTG1 1 1 I IACTCTGTTCAACCAGATCAACTOGAAAAGTATAGATACCT 

TAATTAGCACTGTGCTCTCTGGGATTCTGCTCACKX^^ 

1 1 1 1 COCC I GAACACGGCTOAAAGGGGAGCTCATAATGACTGCTGTGCAQ 

CTCOCCGGGGAGGGGGCrrCCTATTTGATTTAGTGGCTGA^ 

TTACCAATTATTGGTAGCCCCATTTATACATGGTGGAAAAAAAGTACCT 
>Sequence 20



T GQGGT GTTGOCT OOTA OOOOOCrrCOGCOCAGGCAOCACAA 1 11 1 1 11 A . 
AGTTCTAAOGTAGCI 1 1 C I CAAAGAAAAGCATITCAGGGTGTCCATTAAA . 
AGAGCATCTOCOAA1 1UI I 1 1 1 O CAGGGACTOCTAATCAGTCAGG AG AAO 
TAOAATOTAAGCAAAGTCACAAAOCTOCCGTAAGAA I ITOG1 llA CCAGC 
ACACAOCTOCTCTCI 1 ATQAAGGGATGAGAAGCAGACCCCAAACOCAGTG 
CCACAGTCTCCCIUGAAACAGCAGCA03CTTOGCK3AATOC^^ 
CTATGOCATTCAAGGTCTCAGG 1 1111 IGG I lAAAAATACAACTTAGGCC : 
AACTOCAGTGGCTCATGCCIGTAATTAATTCCAACTCTGGGAG^ . 
CGGGTGGA IC ILX.JGGGGTCAGGGGTTTGAGACCAGOCTGGCCAACATCG 
JGGAAOOCCATCTCr^L^^^ 
>Sequence 21

TOGGGAACOI IGI I CGACTCCGGGTGGCGOCCOAGGTACGATTCrACTGT 
1 I i 0 1 CTI CI AGGATCAACTtX^rTCATTACCACACCTCAAACiL J GC 1 1 IG

GGACTCOCTCCCACAAAACTGGCTCCOGATCAGGGAACACTAOCAAAOCA 

ACAGCAGTCAAATCAGGTCI I 1UI I I I 1 1 AAGTCTGATACCATTAACAC 

AGATGCTCACACTGGGGCCAGATCTGCATCTCnTAAATCCTGCTGCAGGA 

ATGACX3CCTGGTACCTGCOCG 

> Sequence 22 

TTATGTACGTCGACrCACCGGGGTGGOGGCOCAGGT ACAGAGT AGAGAGA 
GTTCTGCAGGGATGAAGTGGGAGACGTTGATAGGACCAGACCaGACCaOG 

ccttgtaggccatggaaggactttggattitacacc^agtccaacaggta 
actgctggagggaattcagcaagagagtgacaggagctgattgacaattt 
gaacgoccactctggctgocatgtggcaaatagattgtaggaagaaaaga 

AG A AA AGG AAG AG AGCACTTTGGA AGCTACTAC I G TT G TCCCaGAAATAT 
GTAATGGTGGCTTGGCCAGGGTGGTGGATGWCATAA I I ATTGTG 

TG AAATTTA TTTCCTTATTAATTTTrOAAACAACTTACTAACTCTGAGTA 
TAAAATITAAAGACTGGGTTTCCAAAATATGATTCCTTATTTCATTGAAT 
GTTATAGCTCTAAl nSTTCTTTTTTTTTI ICTG ATACATTA 11 1! fCTA C 
TA TATT ACTAAATCTTAA AA I C 1 LOG 1 1 A GAGTCTGATATATAATGGGTC 
CA TTTTA A G T G l llllll H 1 1 1 A CAAATTGCOTAGTA G I ITGTTTTTn 
TACTTTTAATTAATATAA Ci I I 1 1 1 I AATTTTTT ATTTTTT 
>Sequence 23

GGGTGATGAGACTTCATCGCGGTGGCGGCCGAGGTACACAGTAGAGAGAG 

ttctgcagggatgaagtgggagaggttgataggaocagaccagaccaggc 
cttc taggoca tgg aa gg a ctttgg atttt a caccaagtgcaaca ggtaa 
ctgctggagggaattcagcaagagagtgacaggagctgattgacaatt7g 

AAGGCCCACTCTGGCTGCCATGTGGCAAATAGATTGTAGGAAGAAAAGAA 
GAAAAGGAA GAGA GCAGTTTGGAAGKTT ACT ACTGTTGTXXXAGAAATATG 
TAATGGTGGCTTGGCCCAGGTGGCTGGWTKNKTH ATATAAA 1 ITTCCTT1 
TT A CA TTGT A ACCTCG TCT A CT A TTTCTCAA CCAA A TTA T ATATTGGTCC 
TCATTT AAAATA AG AACT AGTTCCCAAA AATG AA TATATCTAAGGTCTTA 
CTTACCGGTATC AG AA CCATTTTAAOL IGI 1 I U GOCOOCTTATATTTATT ■ 
C AA TTCATCCT A TTTTGCCTG AA TA GAACAAA TTCGCTTCTGGGGGCCTT 
ATTOGTTATT TTTCT ATTTAATTGTATTCCCTCATTCAATAG^ 

CAGOGTCAG ci men i actgtttaa gi iiui i atcctcctaatattt 

TATTGACAAAAAAT 
> Sequence 24 

TGGACTATCCTCAOCOCGGOGCGGCCOAOGTACAAA 

gctctgggttagaoacatoctoactoatgagatcaccaaogc^ 

AGGAGACTCCGGTAGTGAAAGGCAATGCOCTTjTTAGCTCTAAGCAGCCTT 

gctgtcgtoctatctagacatcaagccagcctctocicagactctgacgg 
gctcctggaggttcaacctaa 1 1 1u-1 1 1 la atgaaagagtgggtttcca 
tggtacctgcccg 



iTACGOG 



TGGGGNATGTATCAG 

ggaggcacai rem i ci acgtgaac^gttttgtaaactoaact i mm 

TCAGTIXXXXKnXX^GCCATCCTC^XKTTAGCTTGOC^ 
ACtt ^r il G ACCCATGACC C I tXllCllllt^lllClUOClLllllXXX^ 
CAGCA GT GCATOTOCA OCATAOCA CCTGA GAGTCI G TGGAATCTAATTTT 
CTGT TATAC nCI 1 1 ULTTACACTCA 1 1 1 IGCIOICI I IATTATOATAOT - . 
CTAAll 1 1 1 ICIOCTCAAAOOOATAO CTOCC1 1 UC I 1 ICATOAAAACACA 
CI 1 1 1 C" I AATGGGGAATTAAAAGAAGGGCTTTGCATTTITAAGCOCCATG 
CtTTTGACAGAATTTATTAATAAATAGGGCCTTTCAAAGGGGAAACCGTrC 
CAACATGCCTACAGAATGTTTTATAACCATGAAATATTTACTGGOOTT AA 
GTOCAAAATGCTGACTATOCTCG I UOGTATOCTTTOGAOCACTCTTAATG 
TATAATTTTGCAGGTGAATGGTC 



TGGGATGTGCCTCATCGGGGGCGGOCCAGGTACGGATACAATTOCGCTGA 
GTTAGATTCCAAATTCTAACCTCTCCATCACAOGCC^
AGCCAGCI IIICIGGATOCTTTCCCXAGCAATTGACTCCATCACGGTGAC 
CATCCAGCGAAGCAAOGAATT3GTTTTGCAAATAC 1C G I 1 LXIAGTTTOGTA 
GCATTTAAAOCTCTTATATA I fCTOOl GOGACCTCAAAAOOATGTAAAGC 
AOOATCATAOTI It l I G G AACTCTCTCT AAGTOCAACTTC 
CATAATTGTCCCGATTOCGGCTCAGCATCTTCACCTTCAT 
CTTC 



PCT/US00/33312



WO 01/42447 



CTCCCTCA T A TTACTA TTCTA TCTCG TAA TTA TTGTT AATT AATTTACAA 

TATTT TATCAATTAGTAA I L I ITTC1 1 AATTTAACAAWW ANCNCANNKTT 

GTCTGTTOTCGATXXGCTTCCACGCCOCGOCGGCCGAGCT 

TTCCGC I GXGTTAGATTCCAAATTCTAACCTCTCCATCACACGCCCCAGA * 

AA GGACA GT A GCCAGC T FCTCI GG ATGCTT TGCCAAG CAATTGACTCCAT 

C ACXXrr GACCATCCAG03AAGCAAGGAATGGTTTroCAAATACTC G TTCC 

AGTTTGGTAGCATTTAAAGCTGTTATATATTCTCG I OGGACCTCAAAAGG * 

ATGT AAAGCAGGATCATA G IT! C 1TO O AACT C TCT G T AAGTCCAACTTGG 

TTTCGCGGACATAATTGTCCGGATTCCGGCTCAG 

TOGGTTGCTCTTC 

TOOACICTOOOCCn rCCOOGGOGOGOCOOAGGTACTCAGTTTOCTTATC 
TA TAA CA TGGGGATAAT A TTA GT A GCTACA TCGTTGTTA TG A GG ATCA AT - 
ATCTGTAAAGCTCTTAGAACATGCA I J 1 1 1 C I I CI ACT AAATTTTAAGGT 
GTGGCAGGCGOCKnGGCrcACACCTGGAATCCCAGCACTGTGG 

aggtgggggcagtgggg a gcgaggggttgtt a ctactccaa tgt aa ctgc 
1 1 1l icagaaattaaggcaaaaagtcttactcaocatgtaaaggaaatcc 
aacaattataaaca g fctci g<jl i 1 1 aaggagcttatagtctagttaag a 
aaccagacttaaaca tatgaaaagttaaacattggccaggcacagtgoct 

CATT^XTTaTAATCCCAGCACTTTOGGAGGCCAAOGCAGGAGGATCAO^ 

ACGTC^CGAGTTODAGACCAGCCTGAOCAGCATTJG^ 

ACTA AAAA TACAAAACTAGTTGGGCATGCTGGCGC^TGCCTGTG^ 

OCT ACTTCAGAGGCTGAGGCCGGAGAATCACTTGAACCCGGGAGGTCTAG 

cggccgaocgggcaggacgcggtgat 



TGGATTATOTTGAGCTCCCCGCGGTGGCX^^ 



. ,„. . . . ,» . v, ■ unvn. i wwvuvu-iJ * VAJUAUAIAUU I At 1 U\U III UCI I ' 

ATCT ATAACa TOGGCATAATATTAGTAGCTACA IUjI 1G I IA TGAGGATC 
A ATATC TGTAAAGCTCTTAGAACATGCA 1 1 1 llll ICl A CT aaattttaa . 
OO rC I GGCAGGCOCGGTGCOTCACAOCTGGTAATCCCAGCACTGTGGAAG 
GCTGA GGTGGGGGCAGTGGGGAGCGA GGGGTTGTT ACTA CTCCAA TGT AA 
CTGCTTTCTCAGAAATTAAGGCAAAAAGTCTTACTOACCATGTAAAGGAA 
ATCCAACAATTATAAACAG ICIC1 GUL 1 1 1 AAGGAGCTT A TAGTCTAGTT 

GCTCATCCCTAT^^ 

CJTGAGGTAAGGGTTTCAGACOCGCCTT^ 
TTTCTTAA AATCCAAACC 1GI 1GGCT 

TGGGGA 1G 1 1 GCAGCTCI G I tXCOGG><GG<XXKXXlAGGTACTCAGTTTOC 
TTATCTATAACATGGGGATAATATTAGTAGCTACATCGTTGTTATGAGGA 
TCA ATATCT OTAAAGCTCTTACAACATGC AI III ICl IIJ1ACTAAATTTT 

aagctctogcaqgcgcgg^ 

AC IOC 1 1 1 CI CAGAAATT AA GGCAAAAAGTCTT ACTGACCATGT AAAGGA 
AATOCAACAATTAT AAACA G I C T C 1 00 C 1 1 1 A AGGAGCTT ATAGTCT ACT 
TAAGAAACCAGACTTAAACATATGAAAAGTTAAACATTGGCCAGGCACAG 
TCGCTCATGOCTATAATCCCAGCACTTrOGGA 
ACCTG AGGTCACGAGTTCGAGACrAGCCTCACC^ 

1 1 1 1 1 ACTAAAAATACAAAACTAGTTGGGCATGGTGGOGCATGCCTGTGA 




TCCCAGCT ACTTGAGAGGCTGAGGCGGGAGAATCACTTGAACXTrCGGAGG 

TtXiAGCGGNCGCCCCGGCAGGACGCGTGGGATGN 

> Sequence 31

GACTGATGTCGACTCCCCGCGGTGGCGGCCGAGGT ACTCAO 1 1 ICL I I A T 
CTA TAA CATGGGGATAATATTAGTA GCTACA 1 CU 1 1G1 l ATGAGGATCAA 
TATCTGTAAAGCTCTTAOAACATGCA ITTTTC1 1 C I A CTAAATTTTAAGG 
TCTGGCAGGCGCGGTGGCTCAC^CCTGGTAATCCC^ 

TGAGGTGGGGGCAGTGGGGAC<X}AGGGGTTGTTACTACTCCAATCTAACT 

OCTTTCl CAGAAATTAAGGCAAAAAGTCTTACTGACCATGTAAAGGAAAT 

CCAACAATTATAAACA G II- 1 C I G CCTTTAAGGAGCTTATAGTCTAGTTAA 

GAAACCAGACTTAAACATATGAAAAAGTTAACATTGGGGCAGCACAGTOG 

CTCATGCCTATAATCCCAGCACTTrGGGAGGCC^AGGCAGGAAGATCAOC 

CTGAGTAAGGAGTTCGAGACGAGCCTGACCAGCATGGaOAAACOCCATTC 

TACTAAAAATACAAAACTACTTGGCAATGTOG 

TGGGATCTTjCCCCTCCGGGGGCGCCCG AGCTAOGTATGCA C I IOC I IG CC 
ATCTAAGCAGGGACAATGGCAGTTCATATCATGATGTTA C 1 1 1 G ATTCTC 
TGACCAA ACTGGCCTGTG AGCACCCTGGGUJ IllCllULK-lUl CAAAGG 
CCTTAAGACAGGTTTACCCTGTAGCCAGGTCTGGAAGA 

aaagctgogtoggagaa gtg aaaaa ogtcaggtttaca ttoct ACGCGGA 

AAAGCvATGTAACACGCGCCCACATOCTATGCCCAATCCCAAGGCAGCGAG 

gcagggaagtggctgccaaacc reran aggagagtaataaatcacttga 

GAGTAAGCCTAAGCAAACTCAAGTGGGAAGGGGAGTTiGGCrrGTAAAATAG 
TTT A A GA G ACTCTCTCAGGAA GTCAGCGT AA TTG A TGTGT AGAAAGGT AA 
CAGTCAACAGTTCTCXn"AACAAGACAGCTTCAAAG<^GCAGCTA TAGTGG 
AGCATTCCTGAGGCCTGCTGCAGATCAAAGCATGAATGTGCAGACrGGTC 
CTCTTOCCCAGC G 1 1 ICl I ICAAATCl 1 1 GCACATGTTATATTTTAGAGG 
CAAGTTCAGTTCTAGaGGAGCTGGCCTGC 
> Sequence 33

TGCCTGAT G 1 1 1 G ATCGA G 1 1 CC CCGCOOTOGCOOCCOAGCTAOGTATCC 
ACTTG CTT^OCCATCTAAGCAGTOAC^ 

TCTGTCAAAGGCCTTAAGACAGGTTTACCCTGTAGC^ 

cagagctggcttaaagctgggtgggaoaagtgaaaaaggtcaggtttaca 
1tgctacgcggaaaaggatgtaacacggggccacatcctatgcccaatcc 
caaggcaggoaggcaggg AAGTGGCTGCCAAACCTG ttgt aggagagtaa 

TAAATGACTTGAGAGTAAGCCTAAGCAAACTCAAGTGGGAAGGGGAGTGG 
GCTGTAAAATAGTTTAAGAGACTCTCTX^GGAAGTCAGCGTA^ 
GTAAAAAAGGAACAGTCAACA O ITTTOC1 ACAAGACAGTTTAAAOCAGCA 
GTITTGGGGAGCATTCCTGAGCCTGGG 



TCTTACGATGCTCATXXJG4XJGCGGNCGAGGT ACCAGTT AAAGTCTTCT AG 
CCTGT ATXXXCACTCCTTTTTGOCACTTGCAAA TTCGGT AGCCCAGTTAC 
CCAGA GGGA GGCA T AGGAGGG AAAAXXlAAGACTGAAAAGCiGCTAAT A TGA 
CTTTTGTCTCI 1 A CAATTTATCTGCATCTT A I UL I rDDC O CAOOCCOCAT 
CA TT AAA TCA TT AAACA TTCT ATCCAAA TAOGATGCC C I TCTG I GG AACT 
GCA T A TTTGOAAAOCA T AL I LAX fOl 1 1 AACTTATGCACTCCACTGGGAA 
CTTACAGTA 1 U 1G 1 II CCCACAATACTTGCAGTCA TA TCAGTTACAACCG 
CTGGGTGTGTATTGGI ICAAAAGGAOCTA£XTACAAGGTTATATCAATCC 
ATTGTCCAATTTGAGAGA II 1 1 1 1 L I GAATCCAGTTAAAATAATTTTTGO 
CTACACCTGOG<^CACTTCCCAGGACAACAAT0ACTTGTAGTCT AGTOCC 
CAAGAAAGCCAAAAAC<K30CXJGCAAO_I IQOl IGCCACCAGATOCOCAAC 
AGACAGATTCTAAGGGAGAAGAGAGTTTATCAACTAACACTCACAGG 

GGTATCTTGGT<CAKTTTAGAAGCOC t c io c o ogctogcogcooaootaco 
GATACAATICCGCTGAGTTAGATTCCAAATTCTAACCTCTCC^ 
TAGCCAGCTTCTCTGGATGCTTTGCCAAGCAA ttg
ACTCCATCACGGTGACCATCCAGGGAAGCAAGGAATGGTTTTGCAAATAC 
TCCTTCCAOTTT<XrrA<XIATTTAAACCTClTATATATTCTC G TGGGACCT 
CAAAAGOATGTAAAGCAGGATCATAGTTTCTTGGAAC rCTCTCI AAGTOC 

aacttogtttcgcggacataattgtccggattccggctcagcatcttcac 

CTTCATCTCGGTTGCTCTTC 



CTAATTACTCTATCGATTTCTTATAACTCTC^ 

CTT A TTC A TOCTTCAA TTAG A CCG 7TTACTAT ACTTTTT A TTCT ACCAAC 
CTACTTCTCATTATCTACT ATAANNTTATAATGA NM M'TI GGCCTCTTC 
GAATCCCCCTCGAOGTACA1 1 IGTGTTITATTGTGAAGGGTOCTCAACTG 
TGTGOCTGATTCAGGCT G 1 000CACTGCA ATGTATOGAG AGG AG AG AA AG 
GG ATG AA AGTG AAGGCAGGGGGGGGG ATGTTTGTTTCACGGGGTG AACTT 
CTGCCTCaGCAAGTTGATGTTOGCTTCCG AGGTATTTGGACAC 1 H UH 
CAATACATTTTTATTTAGCACTTATTCTGTGTCTGCTGCCCTGGGATACC 
AGAGTGAATAAACAGATTAAAAGGTCCCTGCCCTTTTOGAGCCTACAGTC 
T TTTTGT AG AG AAAATTGA ATTG ATAAACCATAC l II 1 1 1 1 1 1 1 1 1 1 1 O A 
ATT T TGGTGGG 1 111111 1 AAGGTTAGAACAAATGCTTAGGGTGGGAAAG 
OCCCCCACAGAAAGGOGTGAOGGGGAOTTA< X I I HJ UU UG GTCGGGCOCT 

tttcagggattaacccaggaaataaaaccttgtaggcaaaaatggcocat 
caaaaagg ccaag gaaccg ttaaaa aggccccogttttttgtccattttt 

ctctaaattaaaggttggggataccccccagggcttttaatattccccag 
ggttttcccctt 

> Sequence 37 

gg a gcgttg a acccnttttagt a gcgctctcccggg tggcggccgcccg g 

gcaggtacgcgggggcaa^tggcggcxrttagcaagctatagctgcgaga 

tttcaattactocactcgtagctattgcattcctgacgatggcctctgtg 

gcttcgtgcgattcgcgtccgagctcagacgagctcoctxkjagacooctc 

ttcacaagaagaagatgaggactatgattttgaagatcgggtcaocgact 

cgggttcatattcctcagcg aotaocgattatgatg A TCTTGAGCCTGAA 

TGGCTGGACAGTGTOCAGAAAAATGCAGAG C I Gl 1 1 i ATTTGG AATTGAG 
TGAGGATGAAGAAGAAAGCXrrCCTTXXTGAGACACX^AACTGTGAACCATG 
TCAGGTTCAGTGAAAATGAGATTATCATTGAAGATGACTACX><>JNKAN^ 
NATTTTTAAAAAAAGTACCT 






TGAGCGTACGAGCCCTCTCTGOOGGCCOOCGAGGTACTTAA G WHICH 
CAOTTACAGCTACCATGTGAAAATAA U C 1 V I GO 1 1 ATCAAGTTTACAAC 
TTTA ^^^^^^J^j^^J^^^ TTTTCTCATTT A CTT A TCACACA GTCAT 

^ATC^^CTCTGTGCAAAAAGTATTGGGTGCTTroGTAGAAGTCTATACAG 

AAGACACTGGAGACACAAAAATGAA ti l l G I C CAGGTO AGTTOATOTCAQ 

AAAAGGCTTAATAATGGAGATOAGGCCOGGCATGOTGGTTCACACCTGTA 

ATGCCACCTG TI7 QGG AGGCTGAGGCAGGTAG ATCACTTGAGACCAGGAG 

TTTQAGAOCAGCCCAGCCAACATGGAOAATCCTGTCTOCACT 

AATAAAAAATATTWTGTT0C1 GOOCG 

> Sequence 39

TGAOGTTTJATTCAGAGCCCTCACCGCCKrrGGCG^ . 

GC OGGA AAGCAAAAOOACAAOCAOGOCCTGAGCAGAOCGOOOOGAATTCA 

ACCTTTAAGTGGAlAACTTGOCTTCTGGrrTGCCAAGG 

AAACAGAT GAAA CAGCCTATTOTOCATTTCAACAGGAT1T1TCAGOAGTO 

GGGATGATCITrCAAATTATCCACAACTTAATTATrrAATATTTTGATAG 

TCAATTACCTAAGACAGGGCATCGTCACTGACCAATCAGAAGAGATGGCA 

GTAGTTGGGCGCAGTGGCAGCACTTTGGGAGGCTGAGTGGACAGATCACC 

TGGGGTCAGGAGTTCGAGACCAGCCTGGCCTACATGGTGAAACOOCATCT 

CTACTAAAAATACAAAAATGAGCCAGGCATCGGGGGCACCTGTAATCCCA 
TGGGCGTTGACTGG ATOGGCT0GO0GOGGTGGOGGCCGAGGT ACAGTTTA
GAAAACTCTGGGGCTGAGTGCTCGGGGCCGTGGGGCGCaGCGTGGCTGAT 
CACCATCATAACGGGCCTATGGGGATACA 1 1L1H I A G ACATTTTG AAGT 
AA TTAATGCTCTCGTTA GTG A TT AAGTCTGTG AAGT A GTCCTTTGCATAA 
TCAAATOCATGCTTTTCTTTG A TGCCATTGCG AC AAACAGTGTA ATTATA 
GAAGCGAGAATTCTTGATTAATGCAAGCCATTCTCOCCACCCAGGGGGGA 
TGTAGCTGCCATTATATTCATTGAOGTATTTTCCAAAAAAGGCT G TTCTG 
TAGCCAGTGTTGTTAAGATATACAGCAAAAGTCCGAGGCTCATGCATGGC 
CTGCCACG AGGGGG AAG AGCAGTTCTCGTTGTTOGTGTAGACATTOTG AT 
TGTGCACATACTITCCCGG7GAGCATGGAGGACCGTGACGGGCAGCACATG 
GGTTGTAGTCACAAAGGCATTGATCAAAGTGGCCCCCCCATGTTCCATAA 
TCTTTCTCGT7TTGTTCATGACTTGCAAGGACCCCAGCTCCACATCTTGA 
TC ATCGG TAAGC A CA AGA AT AATGTTGGGTCGG ATGTTTTT 



TGGAGTGCTAAGCNAAAfrrrCAGAAGCGCTCTACCGCGGTGGCGGCCGCC 

CGGGCAGGTACACGTGCACATTGTOCAGGTTAGTTACATATGTATACATG 

AG<XATGCTGGTGCGCTGCACCATGGCACATGCATATCTATGTAACAAAC 

TTGCATGTTCTGCACATGTATCACaGAACTTAAAGTGTAATAAAAAAAGA 

A A G AAA AACAGCATGCAA TTCA GCCC A CA CAAAA A A AA GA AGTCAAAG AC 

AGCGAGAATTCTTAAAACAGCAATAAAAAGTATAAAGTCACTCTAAAGGA 

ATCCOCG TTAGATTAACAACACATTTCTTAAG AGAAATCTAACAGGCCAG 

GAGAGAATGGGATGAC^TATTCAAAGTGTTAAAGGGGGGGAAAAAACTCC 

ACTCA AG ACTACACGCAGAA AAGCTATCTTTCAG AA ATGG AGATAAAAAC 

ATCTirCCCAGACAAAGAAAAACTAAGAGAATTTACTACCACTCACCAGC 

CTTACCAAAAAATGCOCAAGGGAGTOCTACATCTAAAGCAAAACGACAAT 

CATCACGAAAACATGCAAAAGCATAAAACTAACTTGTACCT 



TGGTCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGAGCCGGAATCCG 



CTATGATCCTGCTTTACATOCTTTTGAGGTCCCACGAGAATATATAAGAG 

CITTAAATGCTACCAAACTGGAACGAGTATTTCCAAAACCATTCCTTGCT 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCC 

GAAGCTGGCTACTGTC C 1 1 1 1 1 G GGGO G I U I O ATGGAGAGGTTAGAATTT 

GGAATCTAACTCAGCGGAATTGTATCCGTACCT 

> Sequence 43 

agatgctgagccggaatccggacaattatgtccgcgaaacxaagtiggac 
ttacagag agttccaagaaactatga 1 oc 1 gc 1 1 1aca1cci f 1 1 gaggt 

CCCACGAGAATATATAAGAGCITTAAATGCTACCAAACTGGAAOGAGTAT 
TTGCAAA ACC^TTCCrroCTTCGCTGG ATGGTCACCGTG ATGGAGTCAAT 
TGCTTGGCAAAGCATDCAGAGAAGCTOGCTACTOTCCTTICT^ 
TGATOGAGAGOTT AGAATTTGGAATCTAACTCAGGGGAATTGTATOOGTA 
CCT 



CCGGOCAOGTA U H 1 1 111 1 M l 1 11 I I II 1 1 11 il 1 11 A CTCTGGAAO . 

ATGATCACAACACTXX^CTCAAGCCTGGGCAACAGAGCAAGAGOCTGACT 
GT AAAAAAAT I"IT1"I TACATTAATTTTTAAAAGTG AGGTTTTT ACCTGAT 
GATTOTGTAGGTTTCTCCTAGCTCCAAAGTATCCGGCT^ 
AATATAACCTTCAAGG AAAGTGGAGCTGGTTTA C. ' I U 11 1 IIIU ATAATAT 

CAAGCCATTCCTGGCTGGGCGTGGTGGCTCATGCCT 
TTGGGAOGCCCCGCGTACCT 
CCGGGCAGGTACGOGGGAATTCAAOATGGATTAAAGATTTAAACGTTAGA

cctaaaaocataaaaaccctagaagaaaatctaggcaataccattgagga 

CATAGGCATGGACAAAGACTTCATGACTAAAACAOCAAAAGCAATGGCAC 

CAAAAGCXAAAATAGACAAATGGGATCTAACTAAACTAAAGAAGGTTTTG 

COCAGCAAA AOAAAC CTACCTTCAGAGTGGACCGGGCAACCTTCCCGATT 

GGGGOA AAATT TTTTGGAA ATTTG OCCCTTTTGAACAAAOGGGTTATT^ 

CCXXXiAATTTTATAAAOGACTTTTAACCAAATTTTCCCAGAGG 



GGAGCTCCCCGCGGTGGCGGCCGAGGTACTCGGGAGATCOTGCCACTGCC 
CTCCAGOCTGAGAGAAAGAAACTCTGTCTCTAAAAAAAAAAAGAAAGAAA 
GAT GTCA GTGCTATTT ATAGTA ATAC AAAAA TTTAATGTAA 1 1 1 1 1G1C A 
AAATtrrCAATOGTATATTTTIGC^GATTTTTCAAATTATATATATATGAT 
TT A T A AA TTA TTOTT A T A G ATTCCTGG AAA G TT A ATCCATCTC ACCATT A 
CATA A TACC AATCTC 1 C 1 CGGCCGGGCGCAGTGGCTCACGCXTrCTAGTCT 
CAGCACTTTGGGAGTCCGAGGCGGGTCAATCATGAGGTCCAGAGATCGAG 
ACCATCCTGGCCAACAAGGTG AAA CCCCATCTCT ACTA AAAATA 
> Sequence 47

CACACACTCTTCTATTCTGCTCGCTCTATTTCTCGTGTC^ A CCT 

A ll l rCl 1 1XI C 1 ATG 1 II I 11 1 



CACGTAGTCCTCTCCGCGGTGGCGGCCGCCCGGC^ 
CTCGATGOCAAGCAG I IUXXXXJ I AOCGTCTCACTGOCCCTCAAGACTTC 
AAO GCCACTC TCCCCATAAACATGA 

TCIGCATCTCCCAGAAGGGATGAGTCTTTGTCtXXW 
TCGJTGGGCTCCCAGCAGTGCTTACCTTCT 

III I ICCCCCCAA C1 111 11 1 1 11 1 IG AAACGGGGT U IGOl 1 IG TCCCC 
C AGGCTTGG A GTGCACTGGA CTTCKi TCCTCTCCTTG A TGG AA CCCTCTGG 
CCTCCCAGGTTTAA AGCGA 1 I H I 111 1 G CCTTAACCCTTCCAGAGTAGC 
G TGGGA ATTCCAGAATACGTGCGCAACCATTCCCCGGGTTAATTTTTTAT 
Aim I AAGAGACCGGGAATTCAACCATXXnGGGTTrAGGCTTGGTCTTG 
GAAACTCCTCACXrCTCAGGTCGAAGCCACATGAC^ 
OCTTGCCATTACAGGCGTGOAGCCACTAGOGCCTGACTTCC l 1 1 1 ICC 1 1 
TCCTGCCCCAGGCCGAACCACATC 



OC CCCT TOGGCGAAAAAAGGCCAAAAGTT G TTCCTGGGGAAAAATITTTT 

CCCTTCCACAATTCCCAAAAAATTTAAACCOGGGAAAAAAAAGAAAAAAC 

CGGGTG GGCCCCC AAGGGGOGGCCCAGACCAAAATTTTGTGGGOCGCGCC 

TCCCCCCCrTTTTAAAGGAAAAAAATCTGOCCCCCTTTTAATTAATACAC 

CXXXCCCOC)CTK300GGOGOOGGGTTTA>^ 1 1 1 1 11 1 1 1 IC A 

TATATAAAGGGG 

> Sequence 50

GGTAGTTGCATACCGTGGGCGGCCGGAAGA GCAACCGAGATGAAGGTGAA 
GATCCTG^GCCGGAATCCGGACAATTATGTCCGCGAAAGCAAGTTGGACT 
TACAGAaAGTTCCAAOAAACTATGAlU. I UCI I 1 ACATCCTTTTGAGGTC 
CCACGAGAATATATAAGAGC1TTAAATGCTACCAAACTGGAACGAGTATT 
TGCAAAAOCAI llll 1GC1 1 WKHtXj ATGGTCACCGTGATGGAGTCAATT 
GCTIGGCAAAGCATOCAGAGAAGC1TJGCTACTOTOCI 1 ICIUOOGCOTGT 
GATOGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAATTXnATOOGTAC 
CT 

> Sequence 51

TC CGCTATGA TCCTCTCCGTGGGCGGOCGAGGTAOCTT^G 

AAG IU11 1 1 AGAG I" I GO 1 OAOTTCCCCG I U OL'l lUCAGAACTCAAOGCTA 

GGAGGAGCAGCCAGTGAGGACAGACXrrCT ATGCAGAAACATGGGGAACCT 

CTGGAAATGACACACTCTOCGGGCACAGGGGGCCATTCGTOCA7CTTGAO 

GTGGACT AATCATTiGAGATTCTCG<^GGGCCGC<rrt^ 
CTAATCGGAGAAGGAGAGAQATCAACTTCCATCCACTOCAGTCTGTCGGG
GGCTGATGAGTGAGGTCK^GCAGGCATCCGCGTGGATTTG^ 
G ACTTTTT ATTGTGCTOAAAGCTG C 1 1 GI 1 Ki I G ATGATCTCATACTTTGT 
AGTTGTTCTATCTGCAGCACTGACTTCCTAAGGGATTCTTXXAACCTA 
AA TCm lUlllA TGGAAOGCTTACAA llll l lllllGUJlilllCIIO 
AAA TTCTT AA AA TTGGG AGO 1 1 1 1 (JC 1 OGAGTAOCTGCCCGGGCGGGCGC 
TCGAAAATAATCTCTCTGCTCCTA" 
TGG ATACCCCTTTTTTTCTTTCCCACIGGGCCCCTT 
> Sequence 52

TAGTTGATCCX>lATCTTTNO ANGCCWCCCCCGCGGTGGCGGCCGAGOT 
1 1 1 1 1 II 1 1 U 1 1 1 1 1 II 1 1 1 GGCATTCTO AAAATTCATGAOGC 1 0 1 G 1 1 
TTAGGTGAGGCTATTTCTTCATTCACTOAACGGGGCACCCAACAGGCrCT 
TAATATGAAGA CTTGGGCCCTTCCTG AGTTCT A G AAAA GC ATTTTT A CTA 
GTTCTTCAGT A ATTTCCCCTCCCCTTCATTC 1 CI G I ll'ICIl I 1CC1CGG 
ACTCC A A TTGG ATCTTGGGCCTCT A AGTAT AGGCAAG A TCA TG TTTCTAA 
A AAGGTTCTTAG AGGG AGGG AGTTCCTGGG AGTGTTA TGTGGGGTGGTGC 
AGAAGGTGCTAACAGGTGGG I I IUCI 1 1 AOGATGAGCAGGTGGG ATCGC 
* * ^^TC"* GGCTGGG ACCTTTCCCTCCA GTGCT AAAATG AAAGTTTT A TT 

TGGG ATT A A TTGG TTCCA T AAGG A A AA CTCT A TTTTTCT 
> Sequence 53

CACTTACTC AA TTATG TC TTG ACTA TTATAAGTTATTaCTCT ATATTCAT 
TGATCTATATAATTTTATA 1 1 1 1 n ' I' I ACACCCAACCAAGATOTrT GC I CI 
CGTTGGCGCGOCA A CGGGGGCTGCCG AAG AGCGACCG AGATGAAGGTG AA 



TACAGAOAGTTCCAGGAAACTATGA IIUOU 1 1 ACATOCTTTTG AGGTC 
CCACGAGAATATATAAGAGCTTTAAATGCTACCAAACTGGAACGAGTATT 
TGCAAAACCATTCCITGCTTCGCTGGATGGTCACCGTGATGGAGT^ 
GCTTGGCAAAC*CATCCAGAGAAGCTGG^TACTGTCCTTTCTGGG 
G A TGG AG AGG TTAG AATTTGG A ATCTAACrCAGCOGAAATGTATCCGTAC 



CT 



A CTT ATT A (XT A CA TGTT ACTTCTT A TCTTTGTTCCT AATA T AGTATA TG 
TTCCAAATATTATATCATATTTTTGATATTATTTTATTAATAATTTATTA 
ATATTA(rr>WNN>rrGGTGTGTTGACCATTTGGAC<XX 

ggccgaggtacactgggaaaatgaagaacttaactacataaaaatagagg 
gaca gtcaaaacttcacaggggggaaatcaagttaaattcagagctggat 
ttagatgatgocattctagagaagtttg c i 1 1 C I c caatgctctatgcct 
ttctgtaaaaciggcaatttg ogaa gcatc^ctggataaatttattgaat 
ctattcagtcaattccigaggctttaaaagctggga^ 
tctcatgaagaagttatgcagaaaatpggtgaac ll ' l 1 i g ui' i a aogca 

CCGTATAAACTTG A G TTCAGACTTCC1G ATTACTOCTGATTTCTACTGGG 

acagagaaaacctggaaggactttaggataaaacgtgtgaattccttagc 
attggccgaagagttaaggtcatgaatgaaaaacttaagcactgcatgga 

ACTAACAGATCTAATTiCGGA^TCACCTC^TGAGAAC^ 

TGG A GGGG AAG A TTGTCAA TCCTATT ACCA TA GAAGGAA TGGTTGAGCTG 

GGACCAG1 1 1 1 1 1 1 IGATCAGTOATACCAAGTOTACTGCAGAGATATTAA 

GTG 

> Sequence 55



TCCTCCCCTCCCTTCCTTTGTTACA TCATTTA TTTATAC 1 I I ICI IGIII 
TCI I C1CIA TTCTCATTACTCACGTTA IC1 LC 1 IC1 A I CU I l ll l I G1A C 
AGTexnTTATTTTTr>IGACrNCW>OWKrN W TT G 1 1G1 1 G ACCTAOCTCCA 

ccgaggcggwkjccgccccggcaggtactttgcaaagtggatgcagca 
TTTOGATTGAGACTCTOCGAGOOGOGGOOGGAAGAGCAAOCGAGATCAAC
GTGAAGATGCTG A GCCC<; AATCCGO ACAATT ATGTtXXXDGAAACCAAGTT 
CGACTTACAGAGAGTTCCAACAAACTATGA 1 CC I GC 1 1 IACA1CCI 1 1 IO 
AGCTCCCAOGACAATATATAAGAOCTTTAAATOCTACCAAACTGOAAOGA 

gtatttccaaaacca mccTTOcn UGCTGGATGCTCAOO GTQATOOA GT 
CA>llGCTlGGCAAAOCATCXVVGA<lAAOCTOGCTACnUIXXnilCrGCGG 
CGTOTGATOGAGAGGTT AG AATTTGX3AATCTAACICAGCGGAATTGTATC 
CCTACCT 
> Sequence 57 

I IC TIC I CCI C GGTGCATATAATA1 mum I I I MCI IA OGGTCOGTGA 
CTCTATTTA rTOTTTTTATTCTTTI I U ATCACTAATATTATTAA NNNNNN 
NNTNNAA ITCTTTG > CGCTGCACGCCOAGGCACXXIATCACTCAO 1 1 1 O 1G 
CAAAGGAOAAAOGCCCACAGOGAATCCGOGGCGGCTTCACCTOGCCATAC 
CTGATCOCCTtfl I rOTGCAAGATGTAOATTCCI I UATGAAACAGACTGOC 
AATGAGACTGXAGATACTGTATTAAAGAAAGnGGATGAACAGTACCT 



T AA TTTT A TCT A TTCA TA TT A TTGTTTTT A CTCTGCT A A TTT AT ATTTCT 
TTOTACATCATTATTTA C ' 1 1 1 1 1 1 A TCATATAATATTTATTTNN ATTTCA 
ANNA I lOl I I C I OI 1 1 CJHTTnX^GOCTCCACCGGGAGGCG<XXXXXOO 
CGCAGGTACC<XXWXn"ATTGTCATTCCX^GTGACXXlATAGAACAGGAT^ 
CACT ACTCCT ATC ACATCTGACTCCGCTTC<KMAGTTCC<XrrCTCAGGTC 
CAAAAATCCTA AGGTGGG A TCTTCGC1 TTGT GAAGCAAATTAATTACACA 
ACCAAATATTC<XACATTXnTGAGGTCTATTGACACAATGGGAACTTCAA 

ococtacttagcttagca i i i i i i m i laaaoagtg aaaactggtccac 

OTAOAGCACAATATAATTTAAGTAAAGGAAGATTAAAACATaTTTTTATC 

catttcttatggtgogaaattaacatgttttagatttgagctccccctct 
caggaaacxxtttcaacttcgtattattcactcctcagtagtatcgggta 

G AA AATGA GTGG A AA TCAGTTTCGCCACTATTTCCG AGTCTTTTGCACTG 
CAATACTTTCATCAATATTTACAATATTTCAG 1 IC I O I 1 1 ACAG ATGGGG 
ATCACATCAOOCTCAAOCAAGTTACAGAATTCITTOGGTTTTTATCrGGA 
GCTTTTAATTAAAACTAAAAG 1 1 1 1 1 1 1 1 1 1 



CAOOOCTACACACTATTTTACTOGTAATA O Hi l l A CTCA 111 1 11 I C AT 
GTTTTACTCCAC^CACAGACTCTTA 1 1'ICI II A T AT AT AT ATTTAGA TTG 
TTTTA C ' ICI I ICI I A TAGTTAATATNN ANOOGGGGATTGGCATCOOOGOG 
GGGCGGCCGAGGGACGCGGGAAAG ATCAO I IA CCTTGGCATTCAA 
AGA C I men I O ACTCCCATOGTTCTCAAAGCGTOATtXrroGTCCACCA 
CCATCAGCATGGGGGGCAACCTGTTAOCACTOCAAATTCTCAT1CCT CC C 
TAATTTTCTGAATCAGAAATTACGGAGGTGGAGCCCAGCAA1 CIO III 1A 
AOCAAACITCCACATAATTCrAATTAATTTATOClTTGAGAACOGCTCAT 
CTA OI I IUI C CCTCTCATTTTOCAOOCAAAGAATTOAATTCTAGAGAGGT 
TAATTQAC 1 1 U iUCAGTCATACAGATAGO rTCTGTTTl C I A TT ATTTATT 
TATTTATTTA 1 1 I T I ATTTTATTCACTTTACOOOOCAGGATTCATAGTTT 
TCITTCTAAT ACICCATATTTGACTTOA C 1 1 1 1 1 1 ACAAGTTGTAATTAC 
AAATAAGTCTAAGA TGGGAAAGTTGTOGAAAACTTT ATAGAGAACATGAG 

AAGCAGTAGGGAOCACAGTGAATAAAGGTAOAGATAGGTATGTTTAAAAA . 



GCAOOGCACTAGGTGGGATGCTAGCOGGATOOCGACAATATGTOOGOGAA 
ACCAAGTTGGACTT ACAGAGAGTTOCAAGAAACTATGGGOO 1 U- 1 1 1ACA 
ICC M l I O AGGTCOCAOGAOAATATATAAOAOCTTTAAATOCTACCAAAC 
TOGAAOGAGTATTTGCAAAAOCA 1 TOCTT G CTTOGCTGG ATGGTCACOGT 
OATGGAOTCAATTGC1 1 G O GAAAGCATCCAGAGAAGCTGGCTACTGTOCT 
TTCTGGG<3CGTGTGATGGAGAG<nTAGAArrTGGAATCTAA 
ATTGTATCCGTACCT
> Sequence 61

TGGA CXIAATTG TT>KX XACTCACOGCOGTGG CGG 

TCI fCOGTOCTAl 1 1 rGTAGTCTCTGTTCTGOOCTTTGGAACATCTCl IC 

g g ioi 1cc1g igggat c tci c 'i actgcattctactttatotaataatctg 

ttcaataaataatttttaaaaggagacaacaaogcogcaggtoatctgga 

ggctcctgoagoacctcagogactcaggtccagtccaagcagcoococag 

atcaggctgaaggatggattxacatgtttagaggagatxxiagaaatgcag 

aagagagatgcagcagagaaatgccacagaaagggcagctggagagaatc 

aaagcatgacaggaattcaacctgctgtcactggaagggotccagatgga 

aogcttgagaagaaaogtgtgtagcatctaggagtaaagactcgcoctgg 

ctcacagctagtaaggaaatgggaaoctcagtgctiicagcctcaaagaat 

tgactttaacccacagcctgtgtgcacttagaagcggatgcattcacaaa 

tcttccaa 

> Sequence 62

tgggtcgttg tcttntcc gcggggcggcc gcccg g 

CCAATCTGGGTAGCATGGTTAGCCTGTOCAGTAACAACGTAGGC^ 

gatgggtacct 

TT ACT AA CCA CGATTGGATTATTTACTCTATGA TTTT AA7TATTGCA TAT . 
ATTTAATA 

ACNGCGGGGG 
CCCAGCAGAJMNOICCUgCAGC^^ 

CaGCAOCCANCAGnGGGN>>CCAGOWCCACCAGGCGA CCCW <GGaCCCGG 
GOGACGGCNGAMCCAACNCMGAAGGAGNCKNAAACI nTTTTCTCI I GAG 
CG^GNGhWCNCNCCCGCGACCCCGNG 

CGGfn^NAN>^ACGOCCACGGGG<jNCACAAACAACCKNNCAAAGGAAGAA 
WNGCCACCCACCAANCKhfNAGCAANACAACANAGGAAN'CAANACAAACA 
NAACCG AAAAACG AGG AA AAAAAAAAA 
> Sequence 65

TTCTO 101 TA COOOOCCXjAGOOGGCTGAOGGACTTTA C 1 1 1 1 1 1 1 II I I I 
1 1 I I 1 1 1 1 UGAGGAGATGG ACAGTGTCAG 1 C 10C1GATAAGGGGGTGATG 
GGTAGCTAATTTAAAAGCTTCTATTATAAAATCTAG f C TC T C T GA CACTG 
CCCTG7CCACTGCAGTCAOVTCTCCCAATACTO 

GAGCGOOCATGACACTTaCTCACGTCATTCACCATNC 1 CO 1 IO IGCCTGC 
COG 

> Sequence 66 

CIO 1 1 1 OCT ACACGQ3GTGGCGGC1 GCCCGGGCAGG ACCGOGGAAATCCC 
CTAACTTCCTrOCTATCrTCCCATCC^ 

GTGT ATOTOTOTOTOTOTOTOTCTTOCTCOCACAGTOATGAACTCTA AAC 
ATAAATGAAG ATATGGAAAAAT ACATCAATTAGG ACAACA TG ACAATTTC 
ATTAGACTCCTATCAAACACTATCAOTTCACAGTTTTTATAOATACTAGT 
ATAAAATTCAGATCTTCA C rCTTTTCl OG GGATAAAGCAAGGCTTTACAA 
TTTAGCAGTCTGTAGCTAGCTTGAAACAGTAAAACAACAACAGGAGAGCC 
TTAAOTOTATTTTTGTOACCT AAAACATCAACTCAQGGTTTCCAAATTCC 
TAACAATOAATAGTO ' 

GATTOGAOCTOOOOGOOGTCXXXSGOOGA^ 
ACTGTGTtAAATTACCCACAAGTTAAATGCCC^ 

TCTTAGTATCAGG C 1 1 0 1 G ATAGAGAAAAGOCTOCTATGAATTCT ACTCA 
GTGTGXrTTAGACCAAAGOAAACCACCACAGGGATTTCACAGGC 



GGCKXXXXX>CTGACTTrjGCGCITG<XX^ 




Table 2 






CTGOATGAATOGATGGCCTCACGGCCCC<XXXIAGAGC0GCGGTOTTGGAC 
GGGCCGCTGGTGGCGTT A GCTGGCTGACTGGCTCGGGTGGGCTGCAGGGG 
GCCGATGGCGGGTOGCG^GAGTCACTCTGCCTCGAAAGCGOTAG 

ATGCCCAGN 

GGTCCCATTTCATCTTOCACCCGCATACCAGGGA 11 0 1 I G CGAAGAATCA 
0 1 rCTGTTATATTCTOCAAATCATCAAAGATAOCCTOAOGT AAATTACTT 
AGGTTATTATTGGACATATCCAGTCGATAGAGCTGCCTTAGATAAGAAAA 
ACCATTTOOOOOCACCCGATTOATG IOOI 1 ATCTTOAAGATAAAG C 1 1 CC 
TCAGGTTTGTG<XTGGAAGGTTTACTGGTGC>GCAGTCAG 
accagggacaqct c t o T CAAATTAACT AGOTTG AAGAAAAl' 1 1 rOTCACC 
TAAACCATOATTOTTCAACAGGTTTCCATCTAGAACCAGGCGTTTTAGAC 
TAGTGAGACCTTGAAGAGATGGTGA TGAAATAGTGGATA TGCGA7TATCA 
TCCAAGCCTA G I I C 1 1 C I A TAGTCC I GGCOCAACCCCAGGGAATTCTGCT 

ATGCTCCC I C 1 1 C I A TOCTAACTOCAOAGACaGAGT I G I CATCTAAATOT 
AATTCn CCA GATAGGOAATTTTICAAAGTGAATCATAAGTGATAGTCCT 
TATOTTA I ITTC1 rO CAAATOTAA CTCl 1 1 I ACATA CI 1 1 lO GGAGGTTO 
GTAGGGAATTCATTN 

GACGTACTTTGAATAAAAGOCT 
ATOTCn CCAATCAATCACACAGAGCTTOCCCTGATAC 
TCAGCCACACAGTCCACCAGACCTATATAOTTTAAGGTTrcATGTTCAAC 
AGCA C I 1 1C AAGAGCTCCCACTCCACTGACAICI 1 1 CA GAATATGCTCGA 
CACTTTCAATGTAACCAGACTTCAGG AGATTTTCA ICTCTCTCTTTI AAG 

or i reel o oggtgaaactatoctttccaaggcttcgtogaaccotttccc 

TTGTAAAAAGAOGTTTGAAGTGTA 1 ICI I l AAAGCCATCl IUCCCAGTT 
OCAOAATCATCCG C I GI I 1 CCA CCTCTCGAACAAGAAAA CC 1 0 1 1 U 1 1 1 I . 
GTCATOGTCTGCTGAAGGACTCOOGTt^CACTTOOTATCACA I ICC 1 I IG 

caagggga ttttcaaagg aactgaa gg a tcacttgca i rToon i a tcac 

TTCTCTCTGOATTGAAGATAGOAAACCA G I '11 I O l O GCACTC GT CT OlC C 

tcac c tt ogi n O GCAGCTTATGCTI octca cggttccacag AGCAAAGA 
1 1 I I IC I C CACCGATC C0CGGGTCTOG 0 CGAOGOC I C T G GGTGACAAACA 

> Sequence 71

AGGTACTTGAAGGATAAGAAATTAC TO TG 1 LAAA TT ACCCACAAGTTAAA 
TOCCCATGTTCCAG ACC 1 0 1 O OCTCTTAGTATCAGGC 1 I OIOATAOAGAA 
AAGGCTOCTATOAATTCTACTCAO rcTGcn AGACCAAAGGAAACCACCA 
CAGGGATTTCACAGGC 



aogtacatatatcatttattcaagaggcagattttaaac g i i n IUI AAA 

AAGCTAAATAACACCCAGAGTOACTCAAAAAATTTCrCA^ 
GTGAATAOTAAGTCTAQAG 1 1 M 1 lO Q OI llllllll IG 



GCX7rTTGCACCXACACCGCCGT*XX3^^ 
CACACniGCACAAAAGGGACrmiQ^TGGGTT^ 
GMOCAGCNGGACCANCKA1 1 1 1 1 ILCICCICCICIGAGHCTGCCTTTAAA 
AGCTCATAACAGTAGAOATCA O I IOICI CIO U ! 1 O CAAATCTAACATATA 
TTCATOCAATOOAGX»KG>UO aMllllCII I II I I001IIO GG>^^ 



CCCGGGCGXXXXXWCCAGAACCAGGOGGACCCOCCOXjGGOCACG^ 
AAAACCAAGOOCAACGAA 



ocagoo^mgggccccagaaggagggggaa 



CCTG T AAACAGAGG A GAAGCAG AAAGA GAG AAGGG AGG ACA G A GCTTTT A 
lm GCTCCAGQTT AAAAAG AAAAAAAAA GCACA TT A CAA CTCT A TGTCA 
GTGTCTCTCOCAGGTCCTAGAACTGGAATAGACCAACCAAGCOCAACCCT 
TCTTAAAAGT AAGACTAGGTOCl I CCTGATTATATATTCAACTCCCTGGA 
AGCATGCAAGTAAAAI I ICC"! I OATGGCATTTCTAAAGTTCAAACATATT 
CTTCCT AAAAATGCA TTTA CAAAAAA T A TT AA G A TTGTGTTTTTTTGGTT 
TGGACTTTAAAAAAAA 1 1 0 1 1 1 1 CA AAACCAT AATTOGGGCCTACCCCAA 
AATGGA I ICI CC 1C CCTACAGTOGGG ATTTCA I I I 1 1CC AG-TCCCCACCC 
G C I II 1 1 AA 1 1 1 1 I G ATGACCTGCACC I GG rrCGGGGAGCCACTTGTGGG 
COCTTAAAAACCAGCAA I CC 1 1 1 1 tO C<X^TGGCA G1G IO ! 1 A AAAAGGG 
AAAGGAACAAGCCOC1 1 1 1 GGGAAGGAAAGGGA GTT A A GCCCOGGA A GG A 
AAT I I I I OCT I O ATAAAAA AGGAT AA AGGTGG O TTTG T O CCGGGAATTTA 



GCGAGGAGGTGCGGXXiAGAGAGGAAGC^JGCCTGTGCACTGAGC^ 

AAACATTAGTGGATGGCCTTGCGTCTCAATC TGCA GTA AAGA GGAAACTA 

ATCTGAAAGOGAACXlATAGGAOGTGTGTCTrri'IATTrTTTAAAATACG 

GAGTGTGCAATTTTACTGAATCTTGAATXATGCXXAAAAGAA 

CGGTGCTGCAG ICO I GACCCAOGCTGA 

> Sequence 76 

GAAGTAACAGACACACG A TGTTGTCAAGGATGGTTTTGGGACTAG AGGCT 
T A TTGOGOGG A G AG A TCOCTGCA G AA CCCA CCA A CC A GA A CG T GG TTTOC 
CTG A GGCTGT AACTG AG AGAAAG ATTCTGGGGCTGTCTT A TGA A A A T A T A 
OACATTCTCaCATAAGCCCAGTTCATCACCA I I ICCICC! I I ACCI 1 1 IA 
GTGCAG I 1 1 I CI 1 1 I ICA CATTAGGCTGGI lO O l I CA AA CI I I I O GGAAG 
CACCGGACTOGTCAGTTTCTTTTGXXIAA^ 

CAAGGGCI r C TO C TOCTC TQ GTCI 1 1 1 O GGAG AACCCGGGG C I I I M ICA 
CGCCGCTTT AGGGAACTGGTCAGGC 1 0 1 III CAACCAGGAAG 

CAGOACGCGGGGAGACAGCAGAAGGATCACTGGGCTGGAAGCTCTAACAG 
GCATTGCCAGCCT AGCTACCTGCAGTTT GAGGCAAGGGCAGGGTCACTTA 
OCCTGCTCTCTGAA lGIClCC'lOGGACAACAGGAGOCTGCACTCACTGGC 
TGAGTTCAGACAGAACAGGGATCATtXXlACTGGAAGCTCTGGCAGGTATC 
GCTAG CC I GO 1 1 A CODGTAOTGAGAATGGAGAOCOCCACCTGCOCAOCTA 

cacaaa rem cccaggacaacagoaggctg i o t cc actgacagttcaga 

CXXIAAGTOOAACCACTGGACXXtGAAGCTCTAGCAAGTC T I OCCCACCTOG 

CTTCTA gtga gccttgaaacca GCGAAACAA T AA tcaaaga gcagttctt 
GTCAAGAAAACCACATTAATTAGGTACCCTO^CCGCTCTAAACTrATO^ 

A^CCGAOCCCGGOAAGGCAATATAAGATGTATAAA 
CCTAAATOXJAGGTGTAAGCTAAAACrTCAACA TTTAATTTGCCCGTTGCC 



cotgococaaacctgca atttaatt oaaa att^ 

COXnTTTOCTTCGCTrCAACTTGAA 

TAGGC FT GCGGGCCAACCCCCTATTCAAACTTAACTTCAAA - 



CAGOTA C I 1 1 U GC C I C I C I G GGAT AGAAGTTA TTCAGCAGGCACACAACA 
GAGGCAGTTCCAGATTTCAAC 1 OO 1 1 CATAGATGGGCCGGAGAATGAAAA 
CAGATCG-TOXAGCCACAGMCGI 1 1CATCTCCACC1 1 GO 1 CCC I CCGOCG 
AAA GTGACCGA TGI Ctrl I CXATATTGTTTACAGTAATACACTGCAGA 



TACCI lOIOAGAA 
ATAAGAACTAAGATCAGAGCATACTAGAGAAAGTAGC 



GAGATGCCGGGGGTOCCGATATACTOTGCAGAGCTAAAGGATATACnGGC 
TACGATTAOOOOCTCTC I 
TAGATAGCTCCX!GCCKJTOGCGGCCGACK}TAC^GCCXACCOCCTAGCTCrro
OAOCAGCTCAGGCAGGTGOGCAQATATOCAOAGCGACrrOOGCCTTTOCC 
AAAGGGTAACCACAAAG AAOGAGTCAOGG0 I rCTGTTOOAOOCACTOTTG 
GOATTAGGAGCCCGAGGGACCTACTTTGCAGGAAOCTAGCATAACTTTGT 
GTGAOGAGACrGCACAAGACAAAOCTCAGGCAAGTGGCrCAGTAGTTGGC 
CAGCCCAGCAGGGTOCTCrTGTATGAGTGTGCACCCAGCTGAAGAGAAGAA 
ATGGAGAGCAGCAA7TCGAGCTTCAGGACCGGCTTGCACTGTGGCTCCAG 
GTTATACCACCACTGCOCAAAGCAAAAGCTAGAGAAGCAAGTGaAGAAAT 
GCTGGAGAAAGCTGCACCCTACAGGCAA CCA GCACTTTAAAAACCA ctcc 
ACGCAAAGTAATGGAAGC AAAAAAGCC L. I CC 1 1 1 ICA OTAACCTOOGCCT 






GAC^CXLATACCTCTCTGTGTATGATCTCNCTAAGTCATATCGTGTAACGT 
GT A C A CTT A CTCA TTCA GCA T A T ATNTCA A CGTC AACTTCTG ITTCTCTC 
AGGTTATTATTrC ATAA CTACTTATATCTGTTTCACATCAGTAACATCGT 
CATATCTCT ACGTCTITAGTGATCTATTGTATTTCTAAGAGAGACrcCGG 
TGGCGGCCGAGTACGCGGGGGAGTCAGTXrrCAGTCAGGACACAGCATGGA 
CATCAGGGTCXCCGCTCAGCTXXnGGGGCTTXTCCTACT^^ 

gtgccagatctgacatccagatgacccaggctccatc c 1 1 11x 1 o tctto 
c a t a ttgg a gg a a g a cag aa gtcacccatt a acttgocccg aaca a gttc 
agaagcatttgcccagggtattatgtaattgggttttcaaccaaaaaacc 
cagggt ataaa gcccccoctaaagggtaccttgaatcttatagcttgcca 
tt^ccagttttggcaaaaggttggggcgttxxxxx^attxtaagggtttc 
aa g a tx^gcca ag atggg attoctggggg acaaggattittttaccttct 
taacccaa taca aggcaagticctgocaaactcctogaaaagaatccttt 
gocaaaacttttacctaccttggcd^ 

ATGG GAAT TCCCCCCCGGGGCTTTTGCAAGGGAAATTTCTGATTATTCAT 
AGGCCTTT AA TTCG AATAOCOCGGTCGG AACGCTTTGAGGGA GGGGGGGG 
CCCCT 
> Sequence 83

OATCAGTCGACTGOCOOCOGAO Ol ICC I 1 G TTGCAGCTCTTTATTTCTTA 
GTOCCACTCOOOCGAGGTAACACATTTCTGCTTTTTTAGCTGTTTO^ 
^GTOTAGGTTCACCTTTCTAATTTTTGATTCAATCACTTAA^^ 
CAT ACTAC AAAATATCACTATATTATG AOCATG ATTATA I HI 1 1 HC1 1 
TTTCCC TTCATC AAGGAAGTIXIATCAAAGAATTTCATCAAAGTTCAATGA 
TGACCTCTTTTT A AA A TTTTCTT A G T A TTCTATGTAA CT ATTACCG ATCT 
TTTCCCCACaCACTTCAAAAACTTTTTAATTATAATTTTTTACATAGCCC 
TTAGCACAAAT AA CCAATCC MI1HI1II C CCAATAAAAATGTGCCTTT 
CGTAACCTTTGTCCTL 1 1 ICI 1 1 1 A CCTGGAATA I IGCI 1 1 1 1 AAGGCTG 
TTGTGCA ACITAGAAAClTATTrCTTATTAT 
1 1 1 I'l l IGTOCTGGAATCO C ' I T T I G COGGAACCT 
> Sequence 84



CICIC1 1 1 1CC1C1 ICIACTAGTACATCATACTAGAGTATCTNTGTATTT 
TCACACTGATANGGTAAATCTGTAATAACATTATTCTTTATAATOATAAT 
AATCrrAATTCATGATCAATTATCTATAGATCGAATCTATACTCTT ACATC 
TOGACTCrAOGATACTTTAATATAGAGATGACICOOGOGGTGGGGGOOGA 
TGTACT ATGGCCTATATXXK1ATAGAAGGTATTTACXACGCACACAACAAA 
OGCAGTTtX^TATT TTAAC IXiCrCATCATATOGOGXrrAACATOOOOACA'r 
ATOGTOCAACCACACTTTCATTTGATTTAACAOCTTXK3TAAOCCOOGGCC 
GCTCCTAG AAACCTAATTGGATCCXXXXCGGGGCTGGCAGGAAATTOG AA 
TATTCAAAGCTTTATTTCG ATTAOOO GTOOGA CXX^I rTGTAGGGGGTGOG 
GCTOOCGGGTAAOCCCAAACTTTTTATG G 1 1 1 ULXX TTTTTAAGTGG AAG 
GGOGTTA AAATTTGCOG CCGGCTrrGGCGCTGTrAAATTCAATOGGCTAC 
AATTAG ACCTTG 1 1 1 1 lOCXDCTTGGTGTTGOAAAAAATTAOGTTTAATTT 
CCGCCTTCCAACAAAATTTCrcCACCACCAAACCAATTAAACGTAAGCCC

CCTGCGGG AGGCCAATTAAAAATCTTGTTAAAAAG A CACTTGGGTGGGTT 

GCOGC TAAAA TTGGAGGTTGAAAGCCTTAAACCTTCAACAATTTAAATTT 

OOCOO IIIII GCGGCCTCCAACTTGGCCCCOGC C HUM C CACAGTTOC 

GGGAAAAACX1 1 GG 1 rOGTGOOOOCAOOCTGOOCATTTAAATTOAAATAC 

CCGGCT 

> Sequence 85

TTC ATGTGC T C A CtXKXXrTGGCGGCGGGT A CTT ATA TTA CA TT A TGCT AA 
AATGCAAACATCTTATGCTAAATGTTATATTTGGGAACAAATTGTGTAAA 
TATACTCATGACCTCAATCGATCATTACAA7TAATGTAGGTGCCGTCGGC 
AOG AAAGCTAACTTT AGCTG AAAGCATCTGAAA (XJTGCTT ATTTTTAATG 
CGCCCTCAAAGGAAAGGGATGAGGCCAGCCATAAAGAAAGGCTTGGCCAA 
ATATAGTTCTTGTTTGTCAAGAACAACAAA7XCCATTTCACAACAGAACT 
AACCTGG<^TTXXIATTCTATCOTAGGTTCTGGCGTGCAGTGAGCGAGGC 
AAGCA 7GGCATTCAAGATTTCAI ICC 1 1 IG 1 1 CCACGGGG AGGCCCCTTT 

crm aact tctto aaagcaacatatttggcaacaacxxttca i i i in i 

TCCCCGGTGCTTTA CTGTTTAAGCCCTTGGG 
> Sequence 86

TGTGAGACTCCCCCGGTGGCGGCCGAGGTACATCCC^ 
CATCCACCGAGGCCCAACAGCATCCATOATCIGTTTGCAGGGAAGCtrrCC 
CTGCTCCC GTGA CAGCTATCTCACCAGCTGACACTTTACCATATCTGGCA 
ACAAACTQTTTOCTC I L'l ' I CI ' I GGATTTCAAATCCACCAGCTTTTACCAG 
GGCCAGGGOCAGGCCTCCCCCATGCAGAAGAT^TTCATTGCCTGCATT^ 
CCACAGCArCAACAGCATGTGTGGTG AGGTCA I C 1 1 I CC AC ACTGATAAC 
TCTATCCTAGG AGTCAGCA 1 1 1 1 i C I G AACACTTGCAG AGATTTGCTGTT 
GCCTTCCTGAACTGGAGAGACCAGGGTAGAGATACAGCCAAACTTATTCT 
GGAGGACTTC^CACAGCTGACGCTCATTATTGTTTAAAATTtTGAAGTCA 
TTGTGGT TAATGG GAAATTTGCCAACTATAG HI I L I C CA AOAGCACCAA 
TCTCTGA I 1 1 I iCATG 
> Sequence 87

G TXrrrCACTTTTA CTTTGTTGCTATA AGTTTTTA CTT ACTTTTC A TA TTA 

TTGCGTTTATAATTTGTTTTATTTjTAGTTTAACn ACTTATT 

TATATTATT OTTATA TTATAATAATCOACOCTTGACTt^CCGGGGTGGCG 

GCCGAGGTACTCTTCAAAATTGTCAAGGTCATGAAAGACAGCAAAAAGTG 

AAGAATTCTTACAAACTAGAGGAGACAAAGATTGGAGAAGAAACAATGAC 

TGGCTXKXJCACGGTGGCTGATGC^nTJTAATCCACTTTTXKJAGC 

GAAGGCCGAAGAGGACAGATCATCTTAGGTTTGGGAAGTTGGAAGACCCA 

GCCXTGTACXCAACGTGGAAGAAACCTCCCATTCCTCTACTTAAAAA 

C^G<^AATTTAGTCTTGGGTGGTGXnTGGCTCCCATTGCCCTATTTAAAT 

CCCCAGCTTACCTTTGTGAAGGGGCCTOCGTGCAGGGAGTAATTCTACTT 

TGTAACTCCGGGCGAGGGCAGAAGTGTrrGGTTGGGTGAGGCCCAAAAAT 

TT GCCGCCCA TTIGCCACTrOCAAGCTCTGGGGCAAACAAAGAAG<XiAAA 

TAti 1 1 IGTCCTCAAAATTAAAAAATAGATTTTTTATATTTAGGGGTTAC 

CCTG1 CCCCGGGGCGGGGCCGGTTTTTAAAAAACTAAGGGOTGATTCCCC 

COGGOGCTTTjAAATOGAAATTTCGATTTT 

> Sequence 88

TCGGACCGCTTrCAAGNTACAGAGG^lXSGGCCGAAAACCCCOGAOCAGGG 
ACCTTATTAAAGAAATACCAAGOCOULXj 1 1 1 lUUCCICTGOGGAAGCTTC 
NCCT^TTCCGCCTCT^ ^ 

T GOCCXK XI 1 1 ICI I CAATAAOCTTAAOOGCCTOOAACGgTATlTCTCAA 

AGrriXXGGGCKKrrAGGGGTCCGTTTCGGCriTCCC^^ 

CTGG TTCC CACCAAAACCOCCCCCCCTTTTAAACCCCCAACCGCGGT^ 

GCCCCTTTATOCCGGGAAACC 

> Sequence 89

CGGTCAGGTACCGCTCAGCCTCCTTGGTTCCATOCTCOGCATGGCG 
AGCTCTCAGATCTCAAGCTCAGCATCCTTACGCTCGGCCTCAO

AAAGTGATTCTGGATCTCCTTAAGTCCWTC<^ 

TTTCCCCATTXJTCCAGTTCACGTGTTAAATTCTCT^ 

TGTGCI I ICI 1CHCI CI 1 1 ICI 1 1 GC ATGCACCGTTTCAC I ICCICIAA 

CTCAAATGCCATTC<XKTGAAGTTCAG 

OCTTCTCCAG GIC CI OI I 1 1 I ICC GCTCAACCC CI 1 1CCI IA ATCTTCAG 

ACCTTXCCTTGGTCA A CCTG A TAA GTTTG AG 

> Sequence 90

AGGTACXXXKXiATCACAAAGCAGACAAACAGGAAAGACTGAACCATCTAT 

TTG AAAAAAG TG A CTTCATTCAA 11 GG 1 1 L A GCCACCCGTATCTGTAATC 

TCTCCATTCTGCCCTXrrroATTTTAATGCAGCTATAAAGG 

AAAAGTGCCTCCCAGTAGCAAGAACAGTCACAAGGCACTGTTATATCAAT 

TCAGTGTGA(^CAAGCCCTGATTATTTAATAGTATAACAGCAGTGAATCA 

GAG TTCTTTCA TCTG A CTTTGCTG A C^TTTCCAGCAGCTGT AT A TTTA A T 

TCACAGTTAGGGGCTGAACAAACTACAGCCATTGATCAGAATGTAAGCAG 

GCATXXTTG AGCTTCTTCTAGG AACA TATAC AGATGTGC ACA A AATTTTC 

ATTTATTCAGTN 

> Sequence 91

ck»attggagctcccoqoggto<kxxhxgaggta 
agacaaacaggaaagactgaaccatctatttgaaaaaagtgacttcattc 

AA TTGG TTCAGCCA CCCGTATCTGTA A TCTCTCC A TTCTCOCCTCTTG A T 
TTTAATGCAGCT ATAAAGGAGAGTATTTTAA AAGTGarTCCCAGTAGG AA 
G A A CAGTCA CA A GGCACTGTTA TATCAA TTCAGTGTG A CACAA GCCCTG A 
TTATTTAATAGTATAACAGCAGTGAATX1AGAGTTCTTTCATCTGACTTTG 
CTGACATTTtXlAGCAGCTGTATATTTAATTCACAGTT AGGGGCTGAACAA 
ACTACAGCCATTGATCAGAATGTAAGCAGGCATCCTTCAGCTTCTTCTAG 
GAACAAATACAG ATGTGC 
> Sequence 92

NGGGGTTAGGAGOWACGNCGCGCGNGG^GGCCTGNCCGTINCNGTCGCAG 
CCCCANG A GGNCACCAA GCANCCAN CA CCCCTACCCN G A GNNG TG A GGCA 
ANGC<XXXX^GGCAANGCCACANCAAAANCCGGTTTTC^^ 
hKjNGCACCXX!AGAAAACAAGXjNCNCAACNACNGAC><GGCCAAGAAGGGGC 
CCGCCCWNGGCCAA CNN CA CCANACAGNNNA G AGCTTT 
TTGAGCACCGGGACTATCCTCTTGACTACAAAGTACCT 



G03ATTT^AGCAACCCGCGG>tGGXX}GCCIta<CCCC 

GGAA>K>INNGCrrNNTWG>KX^GATGCTTTGNCGN HCTTT AGACACAGNG 
CCT>WNC<^G>WAAACXXKACGTTTAGAACNGGGGGGCAGACCCCGAACXi 
NC>lAGAACAGNGGACCXXCGGGCGCAGGAA>fNCGAACAAGCKAANCGANA 
CCGNCGACCNCGA 1 1 1 1G1 1111 1 liGCGCWGCWGNGNGCCCNCTCCCG A 
GCGAAAAAAGCGCGCTCNGGCGAAGG 



TGCCCGGGCAGACACACtCTCCATCAGGTCACCAAGCATCCCATCACOCAT 

ACCGGCAGTTGCA TGGCAATCKXTIGCCAGGCAATGGCACATCAAAATCCG 

GGCAG^nTjrTGAGCACTGTOCAATTOAGTCAACAAGGTCTCAACTACTG 

ACTGGCTAAGATGGGGCCTGCCCTTGGCX^CTTC^ 

GCAA7CTTTAAAGTGG<XTGAGCACXrrcGACrrATCATCTTTJACT 

TACCT 



AGGTACCTQTATGATAACATTGCAGTCAAACATA I CI 1 U 1 GACAGGACAO 
1 1 1 1 1 iGTGGGGAGGAGAATTAaACCAAGTrcGGAGATATATTTTAGGAA 
CTAAAAGGAACCTAAGATXTGGGGTAGGGiGG^TGAGCAGCTIXACAGtXT 
GCTOC1 CTG1 GAGCTGTGCGCTCCCGACTGC<1AAATGTCT 
AAAACATGAGATXiAGG<XK^GGGAAGGGG<rrACTroCAAGOCTrrC^TTA 
TAATAC 1GIGIGI AACCTTTTGCATATTTTCAGAAAAGAAACCAGTAAGG 
TGGOTTCAGTTCTGGOCTCATCCroACT^ 
AGAAATGTCGCCAAACTGCCGTCTTCCCTCCTC^

CTCCCCGCGTACCT 

> Sequence 97 

GT A TGTCGTTGA GCTCCCCGCXKj TGGCGGCCG A GGT A CCTTCCCCTG A GG 
AGCtCCCTTCAGAGGGGCGAAGAGCAGTATTJTTCAGAGGCCATCCAAGTT 
TTAGCATAACAAGGAGGGAAAGAGAATGCAGAGAAGAGGCTGGTGATAGA 
CA A GTTTCA TGTTCACAACTTGAA TTGCAG AGGTCA AG AGTTT AAA GA GT 

ttgggatggaaagaaatcgagaattgggct 

> Sequence 98

ggttmatggttgagctccxxgcggtggcx>c<x:gaggtaccagcagagat 

GGCTTCAAGATGATTTAGGACTTGCKiTCAGTAGCACTTACTGATGTAGTG 
G TTTG ATACACACTG ATTAO C I IC I I CC I llll I ATTCTCTGGCATTTCT 
CCTATATAACTAGCCACITTTAAACAATATTTGTXXKjCTC 1 1 i icttct g 
CTTGTCTGTAAATATTAGGGTTCCTGAGTCCTTACCTAGA 1 1 1 IC I icic 
■™nTACTCCTCGCClTTCtr^ 

CT AGATATTTGTGATGTCCAAACACATCTCCACGTTAGGCTTCT^ 
AGCATCAGACCCACACTTTCAACTGTCCACTAGATAGCCTCACTTXIiGATG 

cttjtccag^cctaaataacctttgcggacagattaacagggaaaaaatat 

TAATAC<jAAAAAATATTAGATTTTTATCTGATGTTAATATTTCTATGTGG 
CATGGAGGACTTCACAGANAAAAGTXjAAAACTCTAAAGCAGTTAGATTTG 

agn 

> Sequence 99 

TCTCTT ACACA CTCT A TA TGCAT AT AATTACAA TCCTG TTTAT AT AGTA T 

CT I ICT I agtatatactaacatctattagtcaaaatatatatatatagat 

TATACTA ATT ATCT AA ACA TCCNCANTA AAG AAC AGTTTCCATTCG A 

GGCG AGGGATTGAGCT CCCCOCGGTGGCGGCCGAGGTA C I riTTTTTTTl 
TFIIITII 11 I lllJI'II I AAATATGTTTTAATATGCATATCATCCAGGC 
AGCATAATGTTATATTTCAAAGACAGATTTATCCATTGAATTAl IG I H I 
T AAAAO TTGGG ATTCTCTACATAGAACATA T 1 1 ICI GAAATTTCAAGAAT 
Aim I LA GGT AAA TT AA G AATT AA 1 1 ICI ICI AAGACTATCCAATCTGTC 
T CAA TCT ATTCCA TAA TATAA TCAA TG A TAAAG A TTCACATGTATCACCA 
AATTCGAG<KIAGCTTAGTTGAAAAAATTTXiAAACAGCTTACTGAAT^ 
TTTGCTG ATTCTGGGGGGGCTTCCCCAATGGOkTGTGTG C TC C I ' IT G GAT 
GCCTGCAGGGGTGGTCA CTGCAAA.GTCGTC A TCTG TGOCA CTGGG AGTTG 
C<lAGG«XKXTGCTGGGGTTXXXTGC<rrGG<^GGATTTACA 
CCTGCTGOAAGGCTTCCATCCTGGACATCTOGA 

> Sequence 101 

CTC1 Tt CATTTACACTCTACTGTATTCTTACTATAATATACTTATATATCT 
TTTCAGTCT ATAATTTGTATCTTATAAATTTTATTATTCGTA C 1 1 1 CI AC 
TCATTATTATATATATTACATATTAoATATTTAATATTTTAGTTAGGAGCT 
CACGTGGTGG<XXjCCGAOCCCAATTCTTGAT1 ICI I IC CATOCCAAACTC 
TTTAAACTCTTOACCTCTGCAATTC^^ 

TCAC CAGC CTC1 1C ICIGCA 11LILI 1 IC CCICC1 TGI 1ATGCTAAAACT ^ 
TG IGATCGCCTCTGAAGATACTGCI CTT CACCOCTCTX1AAGGGGGTCTCC ' 

TCAGGGGAAGGTACCT 

> Sequence 102

tcgaggt accataataatccaattaacaaaatcx^ggattraaggatttc 

tataagattaaaaaaaaaatgaggtggttjtccagtxxxxiagagaaaaaag 

cagcaaacaaaact ggtga gagcaaatgacoccctgatgaaagatcttaa 

ACACCAGGCTGAAGA 1 1 1 lAGATTTTJTACCTATTAGAAATGAATATTCAC 
gaaagattctroaoaactoctccatagcaatgaactccaataaoggcaoa
ttacagaagaactaggccatgagggcctagtatccagaatgaggcagagg 

CAGCCAOGCTCQATOTGAGCAGC 
> Sequence 103

TTCGACXXXMQATGAGCTOCCCGCXXr^^ I ITCT1 

GTTTAAAOOCK^OCACTCACCAGGAAGTCTTGATAGaGOCATCTAGTAA 
I fCI l A AOTCCTACCTCATCCAA CCl I GI I I 1 U ACTCCTCCAGTGAGCAC ' 
ACCTTGCCCTCACCClCCCCTCTCTATCCCCrCAOCT^^ 
TCAATTTCTCAGTOCACATCACCTCTCAQAOCACCAAAOCAAG<KiTTATT 
TTTT CTAAA AGAC A 1 1 I GI ICCCA A1 G 1 I C C I C I G ACTAAAGTTCCCTAC 

ACCGAAAATrmTAAAACTXTTAAAACAACC^ 

ATTTTCAOCTT ACCOOCOCGGGGGGCGOCCTICOOOOCAAA AATCTCAAA 
TTOCCTTATAAATTTTCAGCGOGTCGACACACaCTTTCTAAATCGCCCCC 
OGGGTGGGGCGOTCTA mUllCI I C I C I Cl l CI I CIIGIG I O OOGGCC 

oc 

> Sequence 104

TCCAGTGGATGAGCTCOCXXXXrrCWCG^ 

TGCAT0CATr0CICAACTC1trTATX!AC1CT > ACCAAGATACTCTGAAG 
> Sequence 105

GACGATGTGAGCT ACOGCGGTGGCGGCOGCOCGGGCAOG7AC 1 1 1 C I AGG 
TATATCATGTOCOCTAATOTGCrOCT AATA TCATAAsATGTTTACTTTCCG 
AAAAGTATTTCTGAAAGGGAGCATATTTTGGAAAGTGCATAGGCTrGTAA 
TO^TACTTOTTTTCAAGTITCAACTITGCTATrcAACTAGAATAATCTro 
TGCAAAAOCTGAGCTGATTl I C I CA TCTATAAAATGG AA ACAATACTTTC 
TGTGATAATOGGTQCAAAACACA AGGTA TACTGG 1 1 IC1 1 1 G CTCT GOaT - ■ 

tcaagt1 1 ici ici i agtrtcaaaattttaaagogaaaocaaaaatcttt ■ 
c^tggcccaactttgcagaaaaggatttttcctcaaaaa agaa attaagg 
ooog g 1 1 1 1 1 f 1 atgggacccaaaaggcgttgtggocagtntagtaatt 
ttataagi 1 1 i oo gactoctctaaca oc 1 1 1 1 1 ataaagcgcooccttgg ■ 
ctggggggggtttatttttggggggoggggtaaaaaaaaaattttttt - 

> Sequence 106 

1 1 1 1 OCG 1 UAOCACCACTCGTCCOGTAGTOOGCAGCGA TCAGGGCTGGGG 
CTCTTTOCTGAGTTGTGTCAGGTGAGAGATTGTGAGAACTTGGCTroCAG 
OGTTTGGGCATCAGCTGOOCATTGAGGGGCCGTTCATlXrrcTCAAAGTCA 
ATGTGGGG I OG 1 i I GATCTGCATGTOTCATTTGTATTXACACAAOTTAAT 
T ATTC T 0C 1 1 1 I UI rOTAOTACCl P0G1 I G TGAAGCAGAAGCTACCAOOC 
OTCTAT0TGCAGCCATCTTA 1 UiC 1 C I OCATTAA OTAAGATG AGGATTCA 
CTCTTAATTTATOGOCACAATTTAG 1 1 1 CTTOCACACAAATTTAGGCCTT 
AA C1C1 1 1 1 A 1 1 ' m 1 U CTACAGTGGGGGTTTGGAGTAAT ATTCATAOGG • 
CATGOACTTTACXIAAGATGGGOTATTTAAGTTTACAGTTTACATCCCTCA 
TAGC lCTOC AGAOCOTOACnjTOCAGT AGTTGGaGCaCAGTCTGCTTTAT 
TGTGGTOCACAG 

> Sequence 107

roTTTTTOl OOTCAOCAOOCOTOOGAAATAATTGCAQAOAAAOCTTGOCA 
AOOOTOATAAOTAOG ITTOTC1 A GCAGCACTOATGOGTOGTOGAAGTTGA 
TOOlCATOAACATACACroTOATAAOCrATCIOOOCICrTOA OC HI I C I 
AGTAGTGCTATOTX^TTTTGGTACTAAGGTAGGTGAATTTTOCAAOTGTT 
CTTGGAAATAAGOAAACATCAAGAATAATOTAAAAGOCTCATATACAATA 
ATXMATAATAAAGAATAATXTroAAGOCTTCATTCAAOGTTOGGGTTTOOC 

OCAAAGGATTGTAATGATCGC1TATGGAAGTGTCAGCTGGATAAAGAGTG 



ATXKJOOCATGGOODGTTAGAATAGGGAOOCAATTAAATGGAGACCAGTCA 
AGTGGGGGGGATCAT 



10* 

TAAA T G TGOOCAOOGTOGAATOGATTCTACATCAGG I GTCTG lOCCTCOC 
TOCrGAAGOATAACOCAGAGTGCAAGGTCATCI I IGI 1GC lUAACAGGGC 
TGGAOCTGTCGCACTTAAGCACACTTAAAGGATTCTATTCTTCATTCAGG 
TCCO CCAGAGAAATTGGC1 CC 1 1 A 1 1 1 1 1L 1 1 1 AOCTAI lOCI A GACTTC 
C I 1 rrGTCTAGAOCCAGI 1 ITC CAAAOCCCA C1 1 1 I A T0CATCTCAGTTA 
TTCCCAGA GGTGACAGAA TG A GT AAA CCA T A TGGCGCAAAT AGCA TA TAT 
GAGCTAAA CCAGTTAACTCTTAACCAAGGCACATGGTCAATGCCTTACTA 
1 1 1 1 1 1 1 1 1 1 1 1 AATTCTTCTAACGGTATTTCr AGCTGTACATTCCCAAA 
GGAATGOGTGGAAOCAAATCGATTCTGGAAGGGTCAATGO ICI IC CAGCT 

AAAOCCCaGTTTTTC 



GAAAAG ATGTGG AGCTOCCACGCGTOOG AG ACACTTCTCTGACTAAOCAT 
AG ACTATGTGG AAAATGGTAGCTGGATTGC C T rT G GGTGG AGTOCTTGOC 
CTGTGGCATAGGAAACAAAGGAAAGGAGAGAGATGCCCTTTGAGATTAAT 
GAAAATGCTCTCAGCCAAATAAAATCTAAAAATAGCC I CC I rGTGATACG 
AACCCGTGGCCCCTAAGGGTCCTAAAGAGAG AGCTA GCG GAGG^^ 

tatagotttaaagaccagg rcr GTorrn GATAACTG aacttgctaatag 

CTGGCCA CTTG A GTTGC 1 ICI I H AC K- II 1 1 I GI I I OII I IA AATAAAGA 

gattcagccagtaataatgogaagagctgcaaatgacttcoccag^ 

AGTOC C1GCI IU I 1 1 1 ICC 1 ICTGCCTGGOCATGCTGATGTOCAGGCCAC 
ACTCACAGACTTACACGTCTGAGGAGATAGCCC 

TGTTTGCACGCCGTAATACACICCIC I OI I 1 1 I ACA G I GC TGCC TGCACT 
GTGACTAAGACTTTCTCGACTATCATCATGTTTAGOAGTTGATGAOATTA 
TAGTTTCATGTAAGTGTATCATTAGATGACAACTCTACATCTTTAGGCAT 
GGAAACAAACA I 1 1 1 1 IX I O GAAGAAAAAA AAGTGA ACATCCAACCTCCA 
TTTAAACAAATTTGA TTG T 1 1 C I ' I ' I GCTATT AAGAAACTCGG I GC 1 CI T I 
CTCCCA CTCTATT A T A TTGTCAA AATACATCTGG AG A CA CT AT ATAA A CT 
1 1 fTCTOCl 1 1 AAATTACCTGGTTTATATATTATCTCCT O TAGCCTGCAT 
ATAOATAAAGGTTAAACATAGAGOATTTAGGTrGTTOGTAATTTAATAAA 
TA 1C1 I C I 1 1 1 ' I A CAAATCATATAA 1 1 1 1 P0TTGT I UATTTTTTAGAOAC 



GGAATCTTTACCTTAGCTTITTGAGTAGCCCGCCTACA 



gtttgagggcgagcacgcctcocgggattggaccgacgcaoccatogtag 
gtccagatcccgtagaagggagc<k3ggtcccataogttacggcogattcc 

TGGAGCTTCTCGACTGAGCGCCGCGGTAAGCAGTGGTCT^^ 
> Sequence 112

GTAAGAGGGCGCGTGGOCGAGCGGTTTGCATCGOCAGCTCGOGCAAGGGC 
ATGAOGTTOG TC I GGGTGAAG AACGCATCGATGGCGGCACXXXjOCTGTTC 
COGCAOGT AGACCTTGOCGTCACGCAGACGCTCCA GCAATTCGOGOGA TO 
Q<^OGTOQATCAOCAOCAOCTCATOOOC1 1 CC 1 O CAAGAOOCAGTCAGGC 
A AGGT XnCGCOCACTTGC^CGCCGGTCATOCCCCGCACC t GG I CGI I GAG 
GCI I ICCAGATGCTOGAOGTreACTGTGGTGAATACOTTGATGCCGGCAG 
AQAOCAATTCCTOAATOTC1 rOOCAOOOCI 1 1 ICOTO GOOOCTOOOOOOO 

gcxji rocio TO ooocAoi roa wxA ccAoeAocAooTTooocrroooooc 

GAGCAGOOCGTCTAGGGCCATTTTCTTCAGCATCAXACOOOOGTATtTGO 
TACCAAGG CI I IC OGCTTG 

ACCCCGG 



CGAGGOCTT IGC I GAAGCTCATGTGAGGGGGCGXCTGCCCCTGACAGGTG 
TTGG ATTCCAGCTGCTG TA30CCCTGAAGGTGGGTCGTGGG AAGAAGGGGA 
GAATGAAGOCAGCCTTGGGAGAGGTAGGACGCCAGCCOGGCCCAGCTGCT
TCCAGCATCTGGATXXAGCCTCACCTGAAGCCAOCCACC I rCTOOACTGC 
AAAgrCATJOTCAACAQWAAAC^ 

OCTCOCTC OCTTCCA TOGGGO CTCTAGCTAGGGAC IUI GC 1 IGCAGACCT 
Al I ICI UGGACTGAGCCTCTGG TT AGAGGCCAGTGGTGAGGGAAGAGAGA 
CCATCAGAGAAAAGAGTGGAGCCTCGGGCTTGTTAGCAAATGGCAGAAAC 
CCGAOCCTGCAAGAGGAAAACATTG 

> Sequence 114

TGGAGATGTGGATTGAGCTCACOGCGGTGGOGGCCGAGGTACGCGGGA 
CAACTGTCAGCTAGTGAGATTACTGTGTATGGCCAATCCAGATAAATAAO 
ACG ATCAAGTCTTTATG A A A AGG AAAGAAAAATTTGGAATGCACATCTCT 
CTCCAGCTCAATTCCTCA CICCI 1 1 1 1 1 A AGATGGAGAGCTGTTAGGTTT 
GTCT ACACAGT AGGAAACA CCTGATT AAA TAA CA GCATGGAGGCAATCTT 
GACAAAGAAA1TGGCTGCATCCAATAGAATCCCAGGGCCGGTCGTGGTGG 
CTCATGOCTCTAATCCCAACA C 1 1 IG 
> Sequence 115

GGGCCCAGCTACTCGAAGAACAOOCAATOGATTGGAACGTCCTAGGACAG 
ATCO^CGOCTTTGACCCACSCICTCGG^ 

GTTGGTOGGAGGGGGAAGOCCCATGGGTCCACCAGGATGAGGTGTTTAAC- 
TCTATCAGGGTACCT - • 

> Sequence 116

GGTGATGATGAGCTCACCGCGGTGGOGGOCGGTAGOCCCGGTAGGCOGTG 
TGGACCAGGGGCTCGI COG I OGCOOGCAGCGAATTGGTGACGAOGCTGAT 
CTTCAC GI 1 GC GCCOGCGGATCrCGCGCATCACCTTXlAGCCra^GTGGCAC > 
OOGGAATCAGGTAGGGCGAGACGATGGTXACTTCGGAACGCGCGCiGGCGC 
ATCTGC1 OGAOCACGTTGTAGOGCAOGCTGTOGACATOCAGCAGCGGCAC 
GCCGCOGTAO0AOGO CO ICI I OCOQATXACGCGGTCAGGCGAATOGGCAT 
ACCCCTCGGCGG I GG I CXAGATX^GGOCOAGCTTGOCGGCCTTTGAGGTC 
TTOGACCATCGGGCTGTAGOCGA GGAG 



TTGCAAAI ICI IGGCGCGGGCAAG 
TCGGGCCGC It 1 IGAAACAGGGTGGGATC 
> Sequence 117



AGG AC I G I C 1 1 AAAAAGACAAAAA TAOCTCCTACAGTTGTT ATCATCAAC 

otca gttgciggci 1 1 1 octaaa iti u1 uti ct agctcag atctaaacca 
tttgataacattagggcaatatcatggcaatcgtggcccagtaaaaccat 

AGCAAA TG1 1 1 ICICOCTAGGACACTATCTGTI I ICA CAGGAAAATTTTT 



CTA IC I CI ig gaaaacaacagtaggoagcgtggattagaaiuici I GG GT 
GCGTCAAACAGGCAGACAATCCTGAAACATCI 1 1 rCTGGGGACGTAAGOC 
ATCAAAAATTTCTATACACITAGGAGGGCTTCTAGGAAACAGGAAACCAC 
AAAAATGOAATOGGCTTCATTCAA I II 1 11 1 1 1 A AACACATOCCTT ACAG 
GTGAGOTTCTTDAGGGOCCTOGAGAAGAACAXXlAACXXrrTTC^ 
> Sequence 118

TGTAGA TGACTCAjCXXIGGTGGCGGCCCA GGT ACGCGGGGAAjOOGAGGCAG 
CAOOGQAOGTCAGOGATAATTKXXjGATATGOAGGATCTCI I Uj GGAOOGA 
CGCCGACAGCGAAGCTGAGCGTAAAaATTrTGATTCTGG^ 
ATTCTCATCAAGAGAATGCTG<XrrTrTGGCAGT AATGCCTCTOGAAGTCAA 

gtttggagatgacagtgaggaogagggagcttcacatcatagtgct 

ataatcactctgaaagatcagacaatagatcagaagcttctgag 

gaccatgaggacaatgacooctcaagatgttagatcagcacactoggatc 

AGAAOOCOCT AA TGATGA TGAAGAOGAAGGT A TTaGATOGGATGOAOGGA 
GOCATCATTCAGAACGGAAGGTTCTGAAAAGCACATTCAGATGATGAAAA

gggggcagagaaataaaa 

> Sequence 119 

TAAAGCGACCGCCGTXH3CGGCCGAGGTACCTGAACACCAGGCTCTTTACG 
GTCCCTGGOCA GTG AAA GGGTCT AA TATAAAACACACCGAGGCTGAAATA 
AA^TC^OG u^ i^o ^ ' 1 ^ 1 CAAGCTCAATGACTACCTGCAGATAGA 

gctgtgccattgtgta i u ic ig caga 1 1 i cc ccagggttgggatgggttc 
atccta caac ggacaagatgaagtggacattaagagcagagcagcataca 
acotaactttgctacaatttcatooatcctcagaaaatgccatacctoaaa 
gaggaaocttattitggcatogggaaaatggcagtcagctggcat^ 

TGAAAA TC 1 GG I GGACAGGTCAGCGGTGGCAGTGTACCTGCCCGT 
> Sequence 120

AGACTCACCGCGGTGGCXKXXGAGGTACCGAGCT ACCAGGCTGTGGAATG 
AGACCGGGAG C I 1 1 I I CGTGCT AAG A TGCCCTT A COG AAA CA TCGCTGTC 
GTTTCAAGAGCTATGOGCATTGTTTCACA 

> Sequence 121 

TCATTAGATGaGCTCACCCOGGTGGCGCCCGAGGTACAAGTTTA I G 1 1 1 1 
OCTTGCOGTAAGGCTTTAACAGTTCCCACCTTTCXG 
AT TGCTCAC CTACCACTATGACTAGATATC ATTOCA' I G I GCTTTTCA CTA 
GA TTCTTTGTCTCTTG TGT ATGG AAAGTG AG ACTTT AA GTAA TAGTT A CT 
GCTGAGAGAAATAGAAGAOGTGACAAOGTTTGCITItXCATTCAGTAGTC 
AGCGGTTOAATGGAATTA 1 C I ICG 1 1 1 1 1 G GACTG ACAGA 1 1 IGI 11 1 AC 
AATT CAGCTATTCCCAAGOCTTACTATTCAAAGCAGAAOCCT fCTGTC TI 
CTTTCTGTAGTTGCTCTCTCTCCCTATAITCT G IT G TA 1 1 1 1 1 1 ICA AAT 
AACTTATTACTATCTCAAGTAAAAl TGT1 TTA IG I 1 1 I Gni 1 1A TCTAC . 
CCTCTT AA TCaGGGCAGGGA T A TG TCTC TTGT A T A TTTT ACTTTTCCCAA 
ATCATAAA G I 1 1 1 IG GGAAT C T G CT GG TTATTA 
> Sequence 122

A GG T ACA CA CTGGA TCTCCTT ACTCA TTTTT AA CXTCTGACTGGG ACAOCA 
GAGAXlATGCTGCATCTTCTATTAGGTGTrrCA I C I I G CAGAATGGCTGTG 
CTC CTGAA ATA 1 1 1 CC I G I GAAGAAAATTGTTACAATCCCATTACATCAC 
TGGCTTTTA TTA TTAAATTGAATGTTGGCTGG AAA CAA TTTT AAC^ 
A TTG TG ACAAACAAAACT A TA TGGAAAA GGTCCCTGOCCG 

GTTGATGCTCACCGGGTGGCGGOCCOCCGGGC^ 

CTGCAAACCAGTAAOCTGCT ATGGOCAATTGTGAAGAGATOGGAGTCTCC 
CCGTATTGCCCaGGCC GG TCTCAAACTCCTOGGCTCAAGCAATCTTOOCG 
CCCCACTTOCCGAAGCCCTAGGATTACGGGAGTGAGCCaOCXjCACCCAGC 
CAGAAAAACGTTTCAAATATTGGAAAACCTTAC 1 1 11 1 ICA ATGAGCATT 
TTTCCATCAAGGGGTAACAGGGACATTAGGC 1 1 1 1 1 1 IC ICI IA GACTCC 
AAACAGTAAGGTCAGAATTTATCAAGACATTACATAGGAGTAAGGGCACA 
GCCAGGGGTGGTGC^XIMGGAAGGACATTTriXIAGCACTAATTAACAGGT^ 
TTATGATTCACTAGGTTGGOCCAACTAC 1 G 1 1 C 1 CA GCTAATTOOCAGGC 
CAGCCTnCAGGAGGCCAAATGACACTKTCCAGTOCAA G I GCI IG I A CTA 
ATCACCTACTGAOCA 

> Sequence 124

TT^AGAGATOAGCTCACCCCGGTGGOGOOCOtt 

CATTCAACAAATATTTATGCATCAGCTAXIATGOCAGGATCTGTAATAGAT 
TCTGGGTGTCCACTAGTGATTACTGCA 

tcttgagagc^gagacagcaaocaaataaacaattacaaaaaagtatgtaa 

GAAOGGOCCACTTAGAGTGGTt^AAGGCTTCTTGAAGCT 
TGAGaCCTGAAGAAGGATGCAAAAGGOCCAGCATGTAAGGAACAGAGAAT 
AAACATCCCAO AAAT AOAAAATAACACACAAAAACCTAAACTCATTAAAG
AACA TCA TCATCTTTCAA GAACTAACCCTTC A G A TCA CA GTACTTTC A TT 
ATAGAGGAAAGGGGTOAGTGCAATGaAACGTTAAAAATAGCCAGATCACG 
TAGAGCTCTCTAGCCTTTCGTAOAAAAGG 
> Sequence 126

TTATG ATGATTGAGCTCCCCGCGGCCGGAAGAGCAA CCGAG ATG AAGGTG 
AAGATGCTXLAGCCGGAATCCOGACAATTATGTCCGCGAAACCAAGTTXiGA 
CTTACAGAOAGTTCCAAGAA ACTATG A I ULIGC 11 1 ACATCCTTTTG AGG 
TCCCA CG AG A ATA T A T AA G AGCTTT AAATGCT A CCA AACTGG A A CG AGT A 
TTTGCAAAACCATTOCTTGCI I COC 1 OGATGGTCACOGTG ATGG AGTCAA 
TTGCTTGGCANAGCATCCAGAGAAGCTCGCTACTGTCCrTTCTGNNGCGT 
GTG A TGG ANN A GGTTANAATTTTGGAATCTACTTCAGTGGG A ATTGTATT 
CCGACXXTCGGCCCX7riTTAGACCTAC<XXrc^ 

A A TTCG A TT A T AA GCTTAATGG A TCCCCGCCCACTTT A A GGGGGGGGCCC 
CCCCCCAA I 1 1 1 1 M I J I CTTTTAGGG A AGAAA A A ACCCCOCGCGGGAAA 
A A AGGG A AA AA 1 1 1 1 1 1 1 1 CGGGGG AAA A A TTTTCCCTCCA A AA TTTCCA 
AAAAAAAAAAGGGG 

> Sequence 127 

ATATGGCTCACAGCTCGGGGGGGGGGTACTGAAAGTGAGGTGAAAAAACA 

AGAAA GCTGAGAGAAATCAACATGTTCCCAAGTGCTGTATCTGAACAATA 

AATCTG AGACATACCTCTAAGGC I I I I CCAGAG ACAA GAAAGCTCTCAAC 

CTGTAAAGAATTXXrnXiGACATGACTGAGAGCAATGAGAACTCCAGGCAG 

AAGGTTAGCAGATATAGTGTAGAGCATACACAGATATACTATAGTTCATA 

ACACrGGTGGCrTAGCrCTAAATCACAAAATAGCACTGGAATTATACTAG 

TGATCATAG^CATAGTCCAAGAAGAAAAAATTTTGATCTTCTTCTTAAA 

CTTTGTGGAGCCAGTGGTGAAATGAGTCACACAAAGATGCAACAATGNAT 

GAACCCAGCCCTCTTTAGACTAACATATTCTTGCOCATCACCAOCAATAT 

TACAATAAAAATT^ 

CAATITGAATCAATAAGACATOGAAAGGGAGCAAAATGTAACTCATGCTA 
AAG AA AAAAAAAG AGTGAG AAAG AG ACAACAAAAGCAG ATCC AGAAATGT 
TAA AACTTGTGC ATT A TAAGGG AGGGA GCTTTA AAT A CAA T A TTNT AA TT 
TAGAACATCTAGTGAAAATGTGATGAGATTTATCXACTTAATGGAATTTGA 
ACAGAGAOGTAAAATGCTATATTTACAATNCATATTTTATATAAAAAGAG 
TTGTT AAAAT AAAA TTGTAAAAAACAATGTTTCAAAAATAAO ATTA TGTN 
GATGGCTTACAGTTGAATAAT 
> Sequence 128

GTGAAACAATGCTCATAGCTCTTGAAACGACAGOGATGTTTCCGTAACGG 

CATCTTAGCACGAAAAAGCTCCACGCrrCTCATTCCAC^ 

GGTACCT 

> Sequence 129 

GAGACTX^CCGGGTOGCGGOOOOOCOGCAGGTACAGTCAAGGCCGAAAAC 
CACTGAGCTTTTOCCTCTGCCTGGCACATATCCACTGCOCTOCCTT^ 
C AGCTGAT GAACTCTTCATATOO L 1 CC 1 II 1 G GGTGTCAGTOGAAATOTC 
ACTIXrrrrCTAGAAGCTrCTCTGGCICTCOCAGC^^ 
GCTATGAG CTTCCA TAACAOCOCTAG 1 1 11 CC 1 C ACATTGCCCTCATAGT 
ATATGGAA TTTGTTCATTCAATTOCCTOGCTTCCAACAGATGCCA GCTCC 
AAGAAGOCAGQAQC 1 OC 1 1 C 1 UGGTATTGCTTOCCATCAAGGCCCTCACA 
OCCAAOCTAATYKXTOGGOCAGAGTAGGTGCTTAATAAAAAATTGTTTGA 
GGCCGGGCGTGGTGGCTCACCKXrrATAATCXX^UK^ 

GCAGGTTGG ATCACG AGATCAGG AG ATTGA G A CCA TCCTGGTTAACA CAG 

TGAACCCCOGTCTCTACTAAAA 

> Sequence 130

GAGACTACT 



TACCTATCTGCAG 
AACCH5TCATTAOCAGTI ! ■ rcCAAACAAGOGACTTTTAGCAAATTAAOCG

ttaa tttt aatgag a ttca aaa gtt aa t a gccattctt aacgtttt a taa 
ttagaagctgttatataattagagctggacacccacatggagaaactaat 

TTGACTGTGCTtjCATTTCACTrCACTTrGGTAACAGGAAGCA C 1 1 1 1 1 A G 
TCTGTAGACXXrTTGGGAGTTGTAGGGAGTTAAAGCTGATCATTATATACT 



AOCTGG AGTG AGTTATTA 1 1 ItXJIUGTAATACAATTCTCTGCCAGCCAGT 
TGCTGCATCAAAAC AGTTXTGATACACAC ACCTAA AGTCACCACTTOCTC 
ATTCTGGTCCCCAATAACCCTA TAAGCCTCTCTCCTTGTAGGTG ACCTCT 
GCCCTGTCAAGGGTTGGCTCACCOCAAGATTCCATAAATAAGTTG 
> Sequence 132

ATACXJACTCCCCCGGTGGCC^CCCtAAACCGTX^ 
TrGCCGGACGCAACCTTGAAGATGTTCTGGG<XJG<XAGCAC^ 
CTTGCCG A CG A TG ACA TTG TTGGCCTTCaGCCCGTCA ATA TCGCCCTTG A 
TGTCG ATG 7TCTGGCTCTCCTC A TCA TGGCTGA GCGC AA TGGCGGCGTTC 



GCTGGACCCGAGGGTCAGGTTGCCCTGAAGCAGCCO CT T C T IGTCCTGGG 
TGACAT^ACOGCGCAGCCGCGTGGCGCCGGCAATGAACTGGATATTGCTC 
AGGC<^AGTTCCGTGGCAAGATCGGCCCG 
AGACCGGATACCTTGCtXmXOCGCGTCCIT 
GA^GAAGTCCGtTGAAGGAGAACGCGCCTTCCTCAGCTTGCCCCGGAAA 
GTTTCKXATCCGGAACCCGGCATTGAG 
> Sequence 133

GAT ATCGA GCTCC<XjCGGGXjGTGGCGGCCGAGGTACXiATAATTCATXjCCA 
AIM I'l l 1 GGGA ATACTTGTTTCTGATATAATAGGTTACAAAGCAAAATT 
GAGATGATTTTTAAAATGCCATGCAGTTATTTTTTCTGAATAACATAAAT 
TTTAAACAGAGACCTGAAAAAAACXDCCAAAAGTATTAACCTTTAAATACA 
T A AACTCAAT A G A A A TAA TTT AACTGCCTTCTCTTCA CAA G AGGCA ATCA 
GAAGGCAGGACTATAGTTTTCTGTG I I 11 1 1 1 1 C CACAGGAG AGATAATT 
ACATTTCTAGAGACCCATAGAAACAATTCXLATAGTTTTAATTTCATCTCT 
CTATXnCTAAGGGTGTGTCCAGGTATCTAACAGCAATTATCTTACATTOC 

tgaatcaacaacaatgatatcactgaagaaatacagggagacccaagctt 
ccttggattgg<xccxcaaaaattogtgt aaa ca tttt aaagg^ 

TAACTCTAAAG AAAGGGAA 1 ITTCTTTl 1GAAAAATT 
> Sequence 134 

TAGAGATTGAGCTCXXXXXKiTGGCGGCCGCCCAAGTGITGGGATTACAGG 
CATGAGCCACCACGACCGGCCXTGGGATTCTATraGATGCAGCC^ATTTC 
TTTGTCA A G A TTGGCTCCA TGCTCTT ATTT AA TCAGGTGTTTCCTA CTG T 
GTAGACAAACCTAACAGCTCTCCATCTTAAAAAAGGAGTGAGGAATTGAG 
CTGG A CA GAGATGTGCA TTCCAAA 11 U I CI 1 1 OCCTTTCATAAAG ACTT 
GATCGTCTTATTTATCTGGATTGGCCATACACAGTAATCTCACTAGCTGA 
TACCT 

GGAGAGAOGATGAGCTCCCCGCGGTGGCGGCCCAGGTACCTCTC 
GGCCCTOCATTCAGGG 1 C I ICC I GGAAAAOCCOCTGGAGGAAGCGCTCCT 
GTTGCAGTOGGAGTGAACAOOC G 1 L 1 1 U 1 1 1 AACCACCAGCAGGGGGATT 
CCTTTCTGGAGAGTCCATGTAGTCATCA 1CI CI 1 1 UACCTCTGCATTTTC 
CCCCAGAAAGGCC^GCATGTTACTTTjTCATCTT^ 
CTCCXCCAGATOTAAAATCACTTTCTAAACAACTA 
> Sequence 136

GATOTTG AGCTCCCGOGGTGGOGGCCGAGGTACTT AAAA GTATATCAGGG 
CAGTTTCATGCCAGGGAGCCAGGG AA GGCACCCAAGGAA GTG ATGG AAG A 
GTAGAAGTTCACCAGGTGCAGCTTJAOQAAAGGGCTCAGCAAA 1 1 ICIC1U 
TAAGAGGATGCAGACCCCGGGTOCTGCCOG 
> Sequence 137 

TGTTTGTGXiATTGACACCGGCGGCGGCCGAGGTACTAAATTTAGCAACT^ 
TATTCATGAGGAACACCAGTCCAATGGTGGTOCTCrTCTCCTTCATC
ACATO GATO AAC TCTCA TT r i TG ' lir i C CAATGGAGATGGAGAGATTTTCT 
G AGG AGTTT CTTGC 7TTGA CATTCAGTG AAAATG A GAAAAATGCTGCTTA 
CTATGCT ITAGCAATAG TCCATCGAGCGGCrCCTrATCTCCCAGACTTCr 
TGGACTACI I IGC1 I T IAA TTTCCCCAACACTCCAGTGAAAATGGAAATT 
CTGGGCAGGAAAGATTTlXjAACCAACCCCCTTTrAAAATTTTAAACTAGG 
GG AAA CGGGAAT rTTGGOGGGGGOCCCAOCCOOGGO 1 0 1 II 1 1 I GGGOGA 
A AAAA1 1 Mill I GGACAAAAAAAAAAAAAAATGGTCGTTTTTTCCCCCC 
CCCI rT TTTTTTTTl 1 AAAAAAAACCCCTTTTTTAAAAA 1 1 1 1 1 M l 1 1 1 
' ' ' l^C0CC gXXXX3CGCCT^TTAA ^ 

> Sequence 138

ggtgagttgagctcacccnggtgckxkk;cgaggtactcc^aggctgaga 
caggacaattgcttcaaoctaggaggtagaggttgcagtaagccaagatc 

CTGCTACTACACTCCACKXrrGGCrrGACAGAGTAAGACnCCATCTCAAAAA 
AAAAAGAAAAAAATTGACrTTGGAACCTCAGATTACATATCAGTTTGCAT 
ACA TGCTAAACAG AGAAA TX7TCCTCAAAATTCA GTT ACT AAAA ATTA CTG 
ATATC TOCATG ATTAGAAOCACACTGTGGTT G TG TGT G TAGTCAAAGG AG 
GAGAA1 1 1 1 1 AATGCTATATAAGCATAACTGATAACTGCTATT AGAAA TA 
AATATTCX>CAAATTTGGAAAGTTATTAGAGGAAGAA H 1 1 1 1 1 I OC MO 
TAATTTCCAGGTGTTTATATTAGTTOGGCCATAGTGAAAATTACATGGAG 
GAA AGAA AATAGG AAAA TAA GTCACAGAAAAAGAAAATCAAAACAAA TAG 
GAACTTTGGGGAACAAGTGAGGTAA 1 1 rCTGCTCl 

> Sequence 139 

AGOCCAATTCTTGA I I IC'l I "1 CCATCOCAAACTCTTTAAACTCTTGACCT 
CTOCAATTCAAGTTGTGAACATGAAAL 11 G I CI ATCACCAGCCTCTTCTC 
TGCA1 fCHI 1 1 IXC rCCTTGCT ATGCT AAAACTTGC A TGGCCTCTG AAG 
A TACTCCTCTlCACCOCTCroAAGGGGGCTCCTCAAGGGAAGXn"AOCT 

> Sequence 140 

GAAAGTAGOGATTGAGCICACCGCGGTQGCGGCCGCTGTGAAACAATGCT 
CATAGCTCTIXJAAACXJACAOCGATCTTTOCGTAACGGCATCTTAGCACGA 
AAAAGCTCCACGGTCTCATTCCACAGCCTGGTA GCTCGGTACCT 
> Sequence 141

TTTTGTGATAGAGCTCCCGCGGTGGCGGCCGAGCCCAATTC^ 
TTCCA TC CCAAACTC TTT AAACTCTTGAOCTCTOCAATICAAGTTGTG AA 
CATGAAACTTGTCT ATCACCAGCCCCTTCTCTGCATTCTCTTTCCCCCCT 
TGTrATGCTAAAACTTGGATGGCCTCTGAAGATACTGCrCTIT^CCCCTC 
TGAAGGGGGCTOCTCAGGGGAAGGTACCT 
> Sequence 142

ATTT GATTTAAAAGAATTGGGCCCXXXXXXXXfGGAGGA 
ATTAAAAAAAACCG 
TAAGGG 

TATAAAGGGGGTGGGGCCAAAAAAAAAAAAT 
1 1 1 1CCTAAAAOCCCCCCCTT 
> Sequence 143

CCTTITTOUil 1 1 1 1 L 1 CTAAAAAGACCCTTGGGCTCGGGGGATTGCKjTG 
AAATAGGGG<UTOCCCCOGAAAAAAA I 1 1 1 1 L AAAAAAGCCCCCA . 



GrGTGGttnTOAGCTCCCOG^ 

TTTGGAAGTGACAGGTATTAAATAACX3GCATATGAAAGCTTAAAAGTCAT 
CAAATACAATC ACTGG GTACTITOGATrAOOCAAACCAGGCACTTTOCTA 
A ACTOO OCACI HI I lACTTCTCCG G TCTOCTT lCn 'n ATTOOOCCGCG 
TAGCTGCCCG 
> Sequence 145

GAA«JATCGGATTGAGCTCCACC<XXXrra^ 
CAGGCTG TGGAATG AGACCXnXKjAGCTITTTOGTGCTAA G ATGGCGTTAC
GGAAACATCGCTGTCGTTTCAAGAGCTATGAGCAIIGITICACA 
> Sequence 146

TGGACGACGGAATTGAQCTCOCCCCGGTGGC^X»C<X» I IC1 G C I I AGCCAG 
TTTATTCTTTA k 1 1 1 1 1 IACTGGAGTCATTGCCAGTGATGGAAAOGGTGT 
TT GCTTCTCT TTCA GTCAAG A TCTGCACAAA GTA T AGCATTAGGTGGT AT 
TTATTGTTTA T A TTA TGA GTTCT ACA TTCA TCTTTCCAGCACTCTG AAG T 
TATCAGCAAGTTCTCAGTCAGTTCAAGGCATTCGATTCTC 
TTTTAATTCATTGI I n 1 1 G ACCCCTTTGAGAGTTTTAATAG AGAGGAGTC 
TGGAAGGCAGAGATCTCCA CCA CXn"AACCGTG AGAAA TTTGGAACTAAGG 
ACTT GCACT GGTCCCCAAGTTAACAGTGGATATACTICCIGCATTTTtrre 
TGG I C M I C 1 1 GCATTGGGCAAAATGAATGAAOGGGACCAGAAGGCCCTC 
ACCCCTTCTCGCATTTCCAAGTCGACAGGACTGGGACCCGW 
AATAACCCGAAAAACGG 
> Sequence 147 

TGAGG ATGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCAAGGTG 
GOCATT TTTTTAAAA A ACCCATGG AAA TAAATGCT ACTTCTTGTTAGTGT 
TGTTTG AAA A TAAA CAA AG AAAA TGCAA A CAAAACAAA A ACCATGG TCCA 
TTCAAGCTCAAGAGTATTTAACXIAATGCTCTXjITGCCTCTTA 
GTAGCTATTTCCCCATCTACAAATACATGACAATTAACTAAG<XXLAATTC 
TTT A AAA CT ATCTGG AA TTA GGTCA AAATT A TCTAA 1 1 1 11! IC1G ATTT 
A A TT ATGG ATT ACG T A A TCCAA TA GTTGGCAACA TT A T A A AACCCT AA CT 
TTACCTCA T I GTM GGCTATACCAGGTCTCATGACTCTGGACATAACCAC 
CATCCTTNCTCCCAAC^Ct>ICGCOT 

TGATAACCATGAGGCCCGCAGCnTCTGNCrcAAAOCITTTCTGGCCTAAC 
TTCCGCTGCI ICI 1 1 C I CACTCGGCGTTTAAACTGGT 
> Sequence 148

GGAGGACTCACGGGTGGCGGCCGAGGTACCTATGTGCXiCXJGTGGTAGAAA 
AGCACCTGGGTCGGGTGCAGACTGCXjGAGCGGGCXXTAGCGTGTGCGCAG 
AAAG AGG AGGCGCTGGACTTATCCTACCTTAAGTTGAAGCAOACCAGCAA 
TTGTTGTGACCTACAATCTCCACACXCATCTTTACTCTGAGCCAAGGAAG 
TGTCTGTTCTTCTGCTGAGTriCAGGGGCXrrTCAG 

GAAGATGGCCAAAGACAACTOAAC 1 0 1 ICO H OC M OCAGGGCCTGCTGA 
TTCT ICCAA ATOTGATTATTGGTTGATGOGGCATTGCXXriTjACnjCCGAG 
TGCATCTTCATTGT ATNTGAOCAACACA GGCTCT A COCA CTGCTTTG AA G 
CCACCGACAACQATQACATCTATGQOGl I PC 1 1 OG ATCGGATAATTGGTO 
GGCATCTGGCTtTTCTGCTGC<XXX}TCTAGG 
C^^^GGA^TTCTCTGGCGAATTCATCTGAGGTTAT 

TGOGTCTTGGATTGAGCrOCCCGOGGTGGOGGCCGAGGTACCITCCOCT^ 
AGGAG COOCCTTCAGAGGGGTGAAGAGCAOTATCTTCAGAGGCCATCCAA 
QUI 1 A OCATA ACAAGGAGGGAAAGAGAATXjCAGAGAAGAGGCTGGTGAT 

agacaagtitcatgttcacaacttgaattgcagaggtcaagagtttaaag 
agttt«kmtggaaagaaatcaagaattgggct 

> Sequence 150

gctcttoaaaogacagcgatgtttccgtaacggca tcttagcacgaaaaa 
gcr0cacggtctcatt0cacao0ctggta<knoggtacct 



-'^eqacne* »i • 
TGAGCTAGTCACTOC<XXX3GGTCOCG Garo 

1 1 1 1 1 1 1 II II 1 1 1GI1 1 1 IOI III 111 CTGT CCCCTCTGAGCCATGGAA 
GATACTGGAGTTAACAAAAATTTTATAAACTAAAGAAAGCAACTTTATAA 
TCTAAAAGAAAGCAAC 1 1 1 CUC I CC 1 GTCTITTGAATTCTTATTOCTGAA 
AGAATGGATAATGAATCAGGAGATGAGCAAAAACGTATCTTTTACAAAGC 
TCTAG 1 L 1 1 LXJAAAAGCCTCT AAACTCAAACGAAACXTrTTTTAAAGTAGT 
TTTGTAAAAGCTCAAGGT A TGCCA TTTCCA GAAAGTTGCAGATGAGCaCC 
ATWXiCATTAOOCAAATTC TOTC ACACAlTCACCAATCAAATTCAOCAAT
TCGACAATGAIC I C M G GCATATGAAAGAATTAAAAGAGGOCTAGGOCTT 
GGGCAAGGGATCTAATCGNGAGGGGA I b I IGCI I ICCGAGGCTTDULI IC 



agaaaacattggacaaagaagaaaggcgacagaaggct agagac aggcac 
cacaaattgcttcccgact1tgcttcaccacacaaaogctttatcoaaac 
tocaatgca tcttg a ttctcctgag aa tgat a ttcct a tog agatcacca 

CGGCAGAACCACAGGTTTXXGAGGCAGTATATGACTGTGTTATTTGTGGA 
CAGACTGGCOOC fOCTCI GAAGATCGACCTACTGGATTAGTTCTACCTGC 
COO 

> Sequence 153

CATGGCTCXXXXXOTGGCCGCCGACCTACACCTGCAAC^^ 
CCTGTTGCCTCCTGCATI I ro O CCTCTCl I C 1 AT AAAOCAAGAGTAAAGA 
TOGAGCTCCTCCT G CCTCCATCACGAAAGCACATATCAICl O TCOCTTTG 

AATTTG^AATGCAGATAA I* iCn^ ATTTCTTGAGA 

AGCATGACACTACCTAAACACTGAAGATGGGCTTATATTAGTAAGATTTQ 
CACAAAATTAAGTATACCTATGCAAACTATTACTTTGGTTTTTAGGAGTT 
TOGTCAQATGAAGAAGTAATGGGATCACATATATATGTAAGAAGACAACC 
ATCATTA I 1 1 1 1 G I AAGTGTTTTATTAAAACCAACTGGTTAACTTGTCAA . 
ACACAAATAG AAGTCGTATTATTAAGCTOC 

> Sequence 154

1 1 1 rOCGl luAGCTtXACCGCGGTGGCGTOCGGCfXXXXXXi 1 1 fCTGCG 
GCI I rCAGOGCGCGTI rCAGGTCGTCAATGAGGTCGTOOGCATCI fOGAG 
AGCGATGGACAGGCGGATOGTGOOCTGGCTGATOCCTGCGOOOGCCAGCG 
CI rCGTCOCTCATOOGGAAATGCGTGGTGCTGGCCGGGTGGATCACCAGG 
CTGCCGCAATCGCCCACG -.„■--., 



TATAGCGGACTCACGGGGTGGCGGCCGOCXXKjCAGGTTTAAAAAGAACAT 

otataaaggcttagcaaaccctttttaatg1 rctoaaotcaotcl i io i a 
agtgaaatcgctgoagactagaaagtatgaaatggcagtctaoctgggca 
aoctacaaaaaatttagcttoaaaagacttca g t c 1 oc oc 1 o ooc tgi ig 
atctcatggagtggggaatgggaattgaaccagaactggaaaattattta 
cgaaa gi 1 1gha acta cici 1 igi 1 ga tctcatgoagtggggaatqgoa . 
a ttg aacca g aa ctgg aaaatt a tttggg aaagttt att aa ctactcttt 
ctoctgaotaaatttaaatotg1 pci ggacattqttoaggtctagaatto . 
tctatacaatoooctgtacct • " : i 

TOOCJ I CC^TAC ^^ . 
TGAAGT ATTA TCATXnTGTAGCATTTCAAAAGGAACGTOGAAGTCATOCA 
AAOOATOGOAACCACAATOT IC1 rOTTOTTOCl TOGO! 1 ICI 1 AATQATT , 
TC TOAATCATCATTATTAATT AT GOAA TTC fCTGGlCGAAAAQTCACATT , 

room rcTOCTCAGi i ictcacatcttttttci IQCAGCTCI 1 ICTCAG . 

C ICI ICI 1CC TTO CCI 1 1 1 I TTA CTOTC CI I iOL ' l lUl C ITACTI CAOOTv.. . 
GGTTCTAT I I "I QAOC1 I I AAAAG T I QAAGOG I U 1 1 CAACATCAOOC I U I T. >— 
CAAAATAATTAATOTOTTAO ITTCT01 TOOCl ITOTI I A AACOCATTOAG • 
CI 1 1 1 AAGTTGGATAAGTTOO G I HI! IU CACCTA 1 1 ICIGO OOCCAATG 
T 



> Sequence 157



GTACAGGGTCAOCCGGGGCGGCCGAOAAATGTCGOCAACTGC Oj IC I IC C 
CTeCTOCqCO^rGOG^AAA^^QaCAOUiAATI 

ccci agaatcccocgagtdooctctocccocot acct ' 
AGACCAGGO000C
AGGGCCCAGCTACTCGAAGAACAGOCAATGGATTGGAACGTOCTAGGaCA 
GATGCCAOGGCTTTGAOCCAGGCTGGGGGTGCACGGATCtCACTXKKKSCT 
AGTTGGTTX«ATGGGAAAGCCCCATGGGTTXACX^GGATO 
CTCTA TCAGGGTACCTTGC 



gaggta ch n i urn 

I rCTTAGT On I rCTGATGTCI 1 1 ICTA ACAA ATC 1 1 IGCCTG 
CGCAAAAGTCTCAAAAACATTCTCACGTTTCTAGATTriTAGCTTTAGCT 
1 1 I GIGI 1 1 O GGACTATGATCCATATTTAGTGAATTTA 1 1 1 1 IG GGOOOO 
CAGAGTCCA TO! rOCOCAAACTOO tXnGOAAOCACCACACCCAGCTAATT 
TTTG TG AATTGCGGGTACCAGCACACCGGCGOCGTCCTGG ACTGCGCCTT 
CTACG A TCCAACGCATGOCTGGAGTGGAGGACTAG ATCATCAATTGAAAA 
TGCATGATTTGAACACTCATCAAGAAAA I CI PCI I C GGACCCATG ATGCC 

CCTATCAG a ro rg i I gaatactgtccagaagtgaatatgatggtcactog 

AAGTTGGGATCAGACAGTTAAACTGTGGOATCCCAGAAC rOCTTGT AATG 
CTOOOAOC ITCTC1 CAGCCTOAAAAOGTATATACCCTCTCA O I C I CTGGA 
GACCGGCTCATTXnXXMiAACAGCAAGCCCGATAGTGTrOGTGTGGGACTT 
AOGGAACATGTGTTAOGTGCAACAGCGCAOCGAGN 
>Secj»cnce S4t 

GGTA CTTXntTTTATGCITGTGCCTGTXnG AAATTCTACAGTGCTGAAAAT 

CTCATGCACTCTAGCTATGAATCC*GGTCTACTTG^ 

ATCTAAI IO I 1 1 rCTCAATCl I TGI A AAOCAGTTTTAAG AGTCACCAGAA 

ATCTGTAGTTTAAGGCACCAGATACA I ITC1 I G OCTGAGCCTTOTAOOaC 

CAATATGCTCGAOCAATTOGGTAAAATACACCATAAATTATGAC 1 GC" 1 1 1 

ATCTGAATGGATOGGACACTTGCTAOGATGGCGGGAATrATTACCAGGAG 

TTTAGGAGCCAGACATGG G 1 I CT G TA I I I 1 1 C ATACATTGGTGATCAATT 

CAAATCTCTTTCCI ITGCAGOCCAGG1 1 rGGTCAGTCTGGCCAGGAGTGC 

AGATTATGACAAAAAACAAAGCTAAAAGACCTGAGCCATTAAGCTTACAG 

TCTCAATAGCACCGAGTTAAACAAOCTATrTAAATGGAAGACTATTGATT 



ggtcggccgaggtacaaaagttctgaaataacactataggcttaaggaat 

AAGGGAOCA GAAT AGCCTOGAGOCA OGTATTTCTGGCTTT A TACATTCCT 
TAGGAAAAAAAAACTrTATAGATGTATTTAAGTAGAATTAAGGTITACAC 
AAATGA 1 1 1 1 1 rGAOAGAGAGAGTCOCTAGGAOCT AAACATTCGTTCTAC 
GGAGATAGGGTCAACACGCAGATATTTATTTAGCAGCATGGTCTGCAGAA 



TTGGGAGGCTGAGGTAGAAAGATTAClTOAGCCCAGGAGTtyTQAGACCAG 
OCTGGACAAAATAACAAGACATCATCTCTOCAAAAAATAAAAAAATTAGC 
GAGGT 

> Sequence 850

ggtaccacctaacaaattggaggaaatcaaaagacgaatcaacaacattt 

TGGAGAAAAAATlTATrCTACTTXTAGAATTTCATTACTACANAGTGCTT 
ACQ I I C 1 1 GO 1 1 1 GGTAGATX3AAQTGAAATCAAAATTOOATATTTOQAAC 
ATTAAATATGGQAGCAGAGAATCrCTGGAATTATTGCTGOAAGACTGGCA 
TAAATTTATTGAAOAAAAAGAATTOCTAOCTCOACTTOATAC I ICI 1 1 IC ,^ 
AAAAATGTGGAGAAATTTATAAaAATTTGGCTGGAGAATGTt^GAATATT 
AATAAACAOTATATOATOGTGAAATCTGA I G 1 1 1 G I A TGTATAGAAAAAA 
TATATATAATGTGAAGTtXACTCTACAAAAAGTGCTGGCATCTTOOGCTA 
CTTATOTOGAAAA O C I IOOCI IACTAAAGGCI rOCTl 1U AOOAGACAATA 
GAGGAAGAAATTAAAGAGGT 
> Sequence 851

ACCTATATTCTATGCAAAATTTATAAAATAATOCTTGAACATGAAAACTC 
ATCTTAAAATTACACGAATTAAGTAAGCATOCAATACAGACACrTGCAGG 
.TGTAQAAOTGA

OOCTCAAACTCTCTCI 1 AOGAAAATTTTCGCTTCCCACTGCCCATCCATT .* 

TAGCAG I CC TCCCATGCCAACAGATTTGGGG ICC 1 1 ATCTAAG I U \ 1 ICI 
GCAOCC GGIC 1 ICCCI IC CTGA CI ICXXAjI ATTGOCICGT I A AAATOATT , 
AGCTGOCAATaCaGGTA IG I rrG GACTOCTATTOCTGOTOACTTTAATCT 
TCTAACT OTGn 1 1 C 1 GAAAGGAAATATTCCCTAAAAGC 1 1 IGGTOTC AC . 
TTAAAAAAAAACAACT ATATATGATTGAAAG AAATTTGAGATA1 1 1 1 IG I 
TTCAACAAAAACCACTGAGTrrATGTCTAAGAAGAAAATTCAATAAGCAT 
TTATCAA G I OCI I A GGATATGCTCCAATGTATGTACCTCGGGCGCGACCA 

cgctaaggg 

ggtactagcagatgatggcacagtgacagctgggagggatgggatgtgct 
tgcttca i g 1 u-xjc i oocctctqcctocctcaacctacacag toctgtct • 

GGTGACGTGCCAAA G I CXr i TUCIG C C fl G CAGAGAGGCCICTCITCGlCG 
AACA TGGCCC TCAG GAAAGA CAGC CTGAA TGCCACTACOO CAGG C I i g i g 
GAAGGi ICTGCATCAGTGTDGCA1 IGI IGCGATAGCOCTCAGI IGA1GCI • 

IGI I ICTCGTOT 1 UGGAGGCAGGAACTACTTTAGGAGGGTGG AGGGGTGA 
OAATOAOAGAGGACTTOCCCTGAGCCACCCAGC I O 1 O 0 I CA CCTOATGOC 
CCGOATOGCTACATAAATOCTOGGAGA I COG 1 IGlOLl CATAACCAGAGT . 
GAGCTGGGCTCCAGACCAGGCCTATGGGAAGATCCTGrCTGTGGGAAGCC 
TTTGGCCACGTGTTTGCTGAAAGGTGTGGGGAAA GGCAA GGTCAACT ACG 

I I IC I 1 I I I IGI LA AACTCOGAGACCCTTG ACC I I IG CC IGI IAOCAGTG , 
O AAAOGGGCCATAOCCAGAACOCTTTTAATATCACOCTGGC 1 ICCI G C 1 1 



tCAATA ATCGCAGG GA 
.CCCACOCG I IGG 



CCCTT AGCGTGG TCGGGCCGA GGTACGCACA T ACA T ACACACTAACGCTC 
AGCaTAAACTTTCCaTTACACTTaGACAATGACTTOTOGAGGAAAAACAA 
GGATAAACAAGAGTCTCAAGAACTTAAOAAAAACATCAGAGTIGATTATT 
TAGCA C 1 1 rCTCAGGATTCTAAGGCAATAAGOCTAATTCAAAACbTOAAA 
I IGI IC'IC'I ATTTCCCATTAGTCATTAAATGAGATAAATGACAAGCTATT 
GC IG C1 rCTOCAl ICIG I 1 1 1 CA AAGAACATTACAAAAATAAACCAGTGT 
CI I C I CI AACAGTTCT AAAAACAGTTTGAT - 
> Sequence 854

GGTACCAGAAGCAAGGCaGTTT AGGGACAAAGGGCA TGAGCTTAGAGTCA 
OATTTCCTAGGTTCAGATCCAAGCATCACTACTTA llllllll AAGAACT 
TCGGCATCTGTAAACCAGGGATAATA 7 Cll CTl CAAAGGGCTTGT GTGAA 
GA TTCAACAAGGTAATACA TATAAACXTTCACAGATCAGTAGACCAGCCAA 

gagtraaaggcctccggt/roatcattcgagaggcggcaacgcattacaaa 
gtogtgoataagggaccoogttogagagotctttaaaoctgtttaacagg 

ACACTOGG 

> Sequence 855

OGTAOCTGGGACTACOCACCACCATGCCOOGCTCA'1 1 1 1 IGIATTTTTaO 
T AGAGACAGGGTTTCACCA TGTTGGCCAGGCTAGTTJTCAAACTCCTCACC 
TCAAGTGATOCACC I GC C 1 1 U GCCTTOCAAAOTOCTGGOATTATAOOTAT 
GAOCCAOCGCACOD^GCCTTCAA l 1 1 1 1 U I J A ATTCTGATAGAGCAOCA » 
TCTAC TACAT GCTTAATATTATCCATAAACAGACATtnCTGAGCACAOAA 
GATCATGTTAATGAAAGATTATTGAAAGGT • - 
> Sequence 856

ACAGAAAAAAGCA TAATGAATACAACAACT AGCATCAAACTCAGTGTAT a 

taagaatggctaagtgaocattagtcatgtgaaaagcttaacaactatta 

AGCTCTT A TTTTCTT ACT AAAAAACAATTTT AAG I ICI 1 1C AAGGCTATA 

ottaogctttacataagaggcoctattaoccactaattcttaaaatttct 

ACCTACTTAAAATTTCTTrAGACATTTCCAAAGGTTAGTAAAGGAAGACA 
TAAGATATOCTTACTTAAA I C C 1 IU CI GG I I CC ATOOCTOGCCATACATG

> Sequence 857

CXCTTOAGCGGCCGCCaKXX^GGTACCATGAA^ 

TTAAAATAAA IG1 1 IGI 1 1 1 1 1 ICI A GCCCTCTAGGTCAATGAATCCCTG 
ACTCCAGTGACAGACCATAATTATCCAAATCICTCATTTA TOAATA TXXjA 

atata aatatg ct aaatt ga ttatg tcatgaa taoacttctttttt gcat 

AACAATGTTTGGAGT I rCTCAGCI nCTOCTAOCCI ICTTTI JCk liiCl T 
aaatgtagcctggaggattcctatcta TTCCATATAACTAAAAGTAAACG 

TTTATTTAGGAAAGGGACTCAGGAG 
>Sequence 858

GGTACAAATGTOA G 1 1 C 1 1 C I CCAGACCATCAATATAGATTOGATTTATA 
CACrOAICGCIGlGICI CI CC I I CO I A ATAACCTTACTXCATGTTGCAAC 

aaacatggacttgttacaacatcccagagtgaaatctgaai G I GGTCAAG 
a aagttcagaaacaa TAAGA GTGA tgcaatgcat ACCACAACTCAGGCCC 
AGTGCAAAAGTCAGGCCCCAGCCC 1 1 CCCATATAAGGGA C 1 1 GO I CA TTT 
GAAAAATCAAAACCCAAAAGGAACAACrATAGGGACCTGTAATCAATTAG 
AATATTCT 
>Sequence 859

ACTtXKTrGGACTTGAGG I GG 1 1 1 AAGTTCGC^GCTACATCGAAGGACTTC 
TGAAAAGCTCAAGTGACAGTT ACACCTTTGCACTCTCCACA TTCAGCTGG 
O CT 1 1 1 IXXJ I CAAAA CATGG A T A A TCTTCAAACCT OCC TGAACA GGTGG A 
AATG CG IC I 1 ICCIC 1 A AGCCAAGTTCTCAGTCCACATTAGTCCATACTT 
GGCTA CAGAA TTGACG 1 1 1 U 1 OGCCA CAATXXT ACTA G AAA TGA CCTTTG 
GGTAATATCCTTA ICI IG I IG ATCTAGTTAGGGTCAACTAAAACGAAATA 

> Sequence 860 

GGTACrrTATGCAGAAGGAAAGCAATTGCAGATOGAAAAAGCTGAGATGC 

TATAAGGAATTACGGATTTTATAAAGAGATCACCATX7TGOGTGAATTJTAA 

ATATAGATGAACAATCAAGCATAAACAAAATTTTAATATCTTACAGGCTA 

AAA TATTT AGAAATC AAAGA CAACAATA GCA TA T AAGTTAAGA AA GGGGG 

TAAAAAGAATX^AGAGCATTCTAAOGTOCTTATATTACCTCGAAGGAGAG 

TAAAGATAATCACTATCTICAGGCTGATAAATTAACAATGTATGCTGCCA 

TTTT 

>Sequence 861

COCTTTGCGGCCGOOOGGGCAGGT ACCAGCACAGCAATTGCrrG I ATGTTT 
GTTTTTAATTATCGGTrTTCACTTXMAGGGGCCAGl TC rCTATATTTCAA 
TCTATTTTCTATATCAGAAATGAGCACGCATTTTAAAAAATCGCTTTCAT 
TGATGGAGAGGTAAAAGTGAAATGG C I I IGI IGI A TTTATATTATAAAAG 
GCCA TTTCCCAAA TCT AGAA TTT ATT ACT AAAAATCAAGTTTGCA TTG AG 
OGGAGGAOTATGA 1 1 1G C IC AAGCTTA C I 1 1 1 1 1 1 ATAGGTGGGGTTTTT 
ATAT TI I CAATGTGATT ACTCACC 

GGTACACATTJCCATGCTGGGTCATACCTGAGTG 

GGAAGGAATAACG I IGI 1 GAAAAACATCCTCTACAGACAATATGAACAAT 
GOCTT AGTCA TCTATTGATTATQACAA TA T ACTCTTGAACAAA 1 IGI 1 1 1 



ACTACACCTCACCACCTGOG IGTCICTCAGACGTTACCAAGAGACAGAGT 
-.-AAACCCATG C ' l 1 1 ' CICCrA TCCAAACCAGTCrCTCCT G TTOCCT O CTTTO 
' TCCAAACCCAGTTTJCAGGAATTTATaTClTAAAGT AAACCATCGTATGAT 

AATTTCOCCTGAAAA 1 G I UUJ I ATTAAAAAAAAATAGCATATGATGGGAG 

gcagacataaa ca n ctg gtca atttattcgtcttattattt atttcagt 

TAATAAACTGCCCTTTCGCTATGCT ICACI I POCACGTGT I IAGGCAGT 



ACATGCICTAAAATGTAAGGATTCATTTATGAGAGAGTGAACATACTGCT 
TGTAGCTAAAACATTACAGGAQACCTTAAAAAGGGGTATAA TT GG 1 OCC I 
ATGTOAj^TGAACCTGACATA 1 1 1 1 1 ATAAATTATTTGTGCATGACTATC 
TTTTGTTG A TAGCACT AGG AA OA CTTCT AA CGTTT AAA T ACTTTATTTGC
CCTCAATTACTATTTAAAACTCCTATAATTTTAAGTAATTNTACAGCTGA 
CAAAG ATAAATA 1 1 i 1 1 1 IC1 I 1 1 AOI 1 1 1 1 1 1 A ATGTCTTGOAGOTAAA 
GTGG AAA TGGCCTG TTTTO ACACA TAATTTCTAOAA CTTGGAGTTAATTT 
0ATCAGTTACA TIT0G O 1 1 1 1 I I tTAOATTACAGI [CI 1 OGGGTAGATAA 

ATATCCACACaCATGGAGGACGGTACATA 
>Sequence 865

gg t a ca tgttactgggtattaaa tgcg ttcat act aggct a ttaaatcag 
caaggtrxccatcccagaaaaatgtgcag ntottcaatoogaaaoatoc 
agagacagtttcagttaatatactaagtgctaagattgggatgtgcacaa 
oaagctcgagctaaaaattctcgaaaactgaacgtgaagtcaccxactagg 

CAAGCTGCCTGTAATTGAG C I I GC I I CT ATATGACCAATCAACCTTTGCT 
TGTTGAAGGATTAGTTATCTAG ] 1 1CC1CCI I II CMIII IG GAATTTGG 
TCTTTT AA GG TCTTGAT A A7CTTTCT AG TTT A G A GCA TGTG AA CAGAA CA 
GAAGGAAAATCAGGACTCAGTTTACTTAATTTAAGCAACOkTTGCTTGCT 
GCAGATTAGGGGAGGTTAAAGTT G CTGGGCTCCACTCTTTTATTAGCATG 

oatgcttaaagaacttcaoggtttggaggttagattgaacagcctgtttt 

TGGACCTXXXXXXKKDGGGCCCTTCAAAGGGGCAAATA 
CGGCG ATACTAATGGA TCCAGGCTTGCTACCAG A 
> Sequence 866

CATTTCCCCTTATA IUI I U } | 1 1 1 1 1 A GGTACTATGGTA TGCCCTAACTA 
AAAAAT AG ATAT A OGA TA GTG A T ACTTTG A TG AGGACTA TG AAAAGGG A C 
AGTACGOCTTAGTGGAAAAAGTTTTAAGTTITCTACTGTTATTGAATAAA 
ATTA CATATAGTCTGATTCTTATTACTTGAAATTAGGAGGAGAAAGAATT 

ttttg aggtaaa tttgaaa ag aca t aaa a tag acta ooctg acaaaaa tc 
ttc acag att a aaaata ct aa tatt tgca ttgtcat gt a t atta c aaaca 

attaaatataacttcttt atgcagcct agacttattttgtatctattccc 

tgaccttgatgtcatagataagaaagccatactctaagaaaactaagtat 

ctgctcgggtggattgtttgaaagggcgaaattccagcac^ 

agttcatagttcgatcccaactatxmsaaocaaatcitggcgaaatcatgg 

ataatatcatcaattttcgtgtaaaaattgtaatatcgatratcaatatt 

CACAAGAAATAATGAGTCAOGGAATCATATAAGTGATAATGTCTGGCTAT 
GCT7TAAGAACTAGGOCAAACTCATATATTAATATGGGACAGATGAATAT 
AAGACCTATTTTCTAATATCATGATA TATA TTACTTT ACT ACAATTATTT 
ATATATGTAATTAGACAAC1CTTCGTCTGTGAGAGAGTTTGTTTCTOGTA 
TATCTGG AGTA CTATTCA CAATTTCA CG A TATTCATA TGCA 
> Sequence 867

COGCGGGCATGCAGOCAGGCTAGACCGGCTCAGOCCCACTTCAAGACAAA 
ATCTCAGCAOCCATTACTCACCATAGATATTTATGCAGTGAGCTGCATCA 
TGACCAGCTATCATCTTAOCTCATA GI 1 1 1 1 i 1 C TCTGGTAGAG A T AATT 
AACTTATTATGCrrO ATCAGTTAACTCTTGCTT AGAAATTTAAAAAATAT 
TTTTAAGTGACAAA I 1 CI 1 1 li 1 AGAAATTTTTGAAAATAO AAATATTTOA 
AGTAGAAACTTAAAATCACCCACAA 1 1 C 1 UC H T I G 1 1 AACATTTGAATA 
T O T I O III 1 UL A TGATATATAACAAAA 1 1 IGICIO GCTATTGCATATGTC 
G1C4J1 1 ICC1 1CI I AATATTGCATTTTGAGCATTTAA CCAGAACACTAAA 
TATTCIC^IAGAJ^TATGG^^ 

TCAGGGTATOAA CACIT AAGOCTCTTGACCACATACrGOCATACTGOCAT 
ACTGGCATACTGCI 1 1 1 AAAAAATAATTAAOCFGAOTQOQATGQCfCAOO 
CCTGTAATCCCAGCACTCTGGG A GGCGAAGTCAGGTGGGTCATTTGAGGC 
CCCGAGTTTOAGAACAGCXrrGGTGGACCrGGGTGAAA 
CClTTCAGCGGTCl-rri UGCAGCTACTTCCITCTTTTTTGCTAATTTTGC
GGGATGTTGTATACTCTXTTACCATGGGGATGAAGACACAAGAATTATGAT 
AGTTCATTGAAAAAGGTTGAGAATTCAGAACTTGTtlAGTTTCCAOCAATA 
ATGGCAAAGATACAATATGACAAAGTTCAG I i oc h aaatoaatctagoa 
ATGAAGAATCTAGAAATTATAATGGAG AGOTOATTAGG AGTTT AAAATGG 
TrTATTGATTGGAGATXXTTTATCTGGATTATATAGGGAACACTTT^ 
TAGGAGAAOCAC^TATGATCTAGGAAAACGGCTTTTAAATCTACCTCtKjA 
CG A G ACCACGCT AT AGG 
>Sequence 869

TGTACATT AAA TTAA GCA TACT AAAGAAAAAAGGAATG 1 1 1 111 IAGCAA 

tttaagaacttgcitaaaaagaaaaaaagatcaagcactccctctagtga 

CAAAAATTACKXIACAAGATOAAATTCAGTTAAAATTCCAAACACTCTGGA 

GATGGAAAGCCTTGTATTTTAGATGAAAGGATTTATGGCTGGAATTAAAA 

GAAATTAAAAGGCAGAAAAGTGGGTGAATGGAAAACATTTACTTTTTGTT 

T TTA A GTGTTAATAGCCA C 1 1 1 1 I G TCCAGTCTGTA ltlCC I I I CA TTAG 

TCTTTATATATATATATACACACACACACACACGTATGTTATATATACAT 

ATAATG GTITATCT ATTATATATGCTATATATACACTTATATGTTATATA 

TATGGGTTTTTTTCAGGAGCATTATATCATGGGAATGAGTTCAAAAGTAC 

CCGGCCCGGCCG TCG TTCG AAAAGGCCAA TTTCCA CA CACTGGCGGGCGG 

TACTAGGTGATCCGACCTOGCACCCAACTrGGGGGAA^ 

T IGTI'I CCTGGGGGAAATGGTTTCCGTTTACAATTTCCCACACACTATAC 

A A GGGGG A GCCCIT AAACTCTCCCCAATTTT AAAA TTTTGGCCXjTTTTTC 
CCGGCCTCTTAAAAAT 
AAAGAGTTG 
>Sequence 870



PCT/US00/33312



CCCTTGGCCCCCCGGGCAGGTACTAATATTCTrCAACAGAATGCAATAAA 

ATACGAGCTACATAAATCCAAACTTGGTTCAAAOGTAGCTATG tt tt r t t 

AAA AAA GTT A TT AT A A CA G A CAAAGCAN ATGCA A ACIT A TCCTTCCAAAC 

CXriGATAATTGCTAATACCAAATAACTGCTATCTAATAAATATACAAATC 

AAGAGAATACCTTGCTAGCTAA ATT AAAAAAAAAAA AAAA AAA CTATCCA 

TACTTAACAACCAAGTCCAACTfnGTAACCAAAGTGTTCCTrAGCTOCCG 

CGTACC 

> Sequence 871

CCCTTAGCOTGGTCGCGGCCTATGTACAAGGG C T 1 X1 ' 1 1 1 GGTGATAGTTT 
CTACTCTCTTTAAATACTGTTCTGTTA T f IT » GAAATCTGATCAAGAATT 
G ACACAAT AAATCTCn 1 GA T ATTTAT A CTTATGCCTACTTTT AACCTTT 
TAGG AAAA CTTTATGAATTCGAATATTCTAAAATCCTG AAA TAATTTGGA 
ATATTCTAAAATTCTGAAGAGAATATGAACGGATTGTTGGAATOGAACTT 
TTACCCGATTCCCTCAGACTAGAGTGTIX^TACGACATITTGCC^AGAAG 
TTCCTA TAG AGGCAATATCACTTTTAGGATGGATGGGTCT AAAA C<^TCA 
TATTTAO 1 1 1 1 1 G O U ATTX^TGGTTGCACTCACTTTAGAGGATGTGTTC 
CTATTAGGTTCCTGCTACTATNTGTCTCTCCTAAATAACAGTATGGAATT 
ATAGAAAGAAAGGTTGGGAGAATAG ILCTOIUATTCI IC I GO ICAACATA 
AAOCC1 rGT T CA TCCAG CCACTCACTATTTTG NTCI I ill IIIUULI IGA 
AGCCAAGATG AC 1 1 1 1 1 CATTACTTtXiATGTTTTTATGGTCTATACCTCT 
CTCTTGCTtXATATTATTTTGCAGTGGTGCGCAGATTATATlTGATT^ 
TTAAAAATGAACCTGGGTTTTTAACCATTACCCTOGAAAATTCAAGAACT 
TTCG«XAJCII IG ICiXOCOCGXXXXXXraX^ 
GAAAAAAAAAAGGGGGGGGGCOC^ZCCAAAAAAAAT 
>Sequence 872

A CAGTTCTGTGTTTTTCAATTGATACATACTACTrATCTAA G AAAAATG A 

CTAAAAATAGAGGG<X^CACAGGCAAC^GCCATTAGGTTATGCACAGAGA 

AGGAAAAACTTCAGAGGTTGTGCTXiOCATCriXTrCGAACAAACAAGAATC 

TACAGGAACAGAAACATGATGGAAGAACAAGGGTTAG1TACTGCAAOGAA 

AAAACATGGCAGGAAAAAAAACCATTTIGAAGCCAAGCTTTTGATTTAAC 
CATGAATG AAAACAAA TGGGAAAACAACAACAACAAAAAACAAAACAAAA
CAAAAAA CAA G AA TG ACCAAA TACAGA AATT ATT A A TCTTTTACACA TCT 
TGTACC 
>Sequence 873

OCC1 TAGC<nTJGTCGCGTITX3AGGTACTIXnTAAAATTCAC^TTCCTGGA 

CCCACCCT A GACCT ACTGGATXXAAATCTCTGCAG ACATGGCCTGG ACAT 

CT TCATT ATAACAAGCTTCCACATAG A TTATTTTGTCAGTXKKXATGTCT 

TGCi i iKj^i i CTGTGG AAACTACTCTCCATCTTCTGG AGTGG AATGTCOC 

CCATTGCTATCCACATGGTOCTCGOC 1 UCCTGATACTGTAGTCTCAOATG 

GCACCTCCTGAACTGGGCCGAGCTCAATCACTTTCCCAG 

CTCOCTOG A OCrcAGTGGGCCCATGGTOGGCAAA GG AACCCAGGTTGGGC 

CACAAAA CCCTATGCA TTTAT AACTAG ATGGGGGCTG AATT ACAACACAC 

AAGCACTTAAGGGACT7TCIGAATATCTGGACTGATAGGATGGCGAGCAC 

AC<>AGAGTCCAGATTGAACTTACTCTTAGTAACAGA7TG7GACTCGGAG 

AGACCCTGGGTCXKjGATGGTTCTGAGTAATGGCAATACTCTTATTTGATA 

TAAAGAGGCACCTGGAAACTCXTTTACAAAACATGTCTCTTTGTAAGACAG 

GTGAT ATG A G A CT AATTCT ATTA CTGGGCCTCTCCA AA CATTTCAAAAA G 

AAACAAGGGTCAAACTTOCGATAOCTCCCTTTCATATGTGACCGGTAATA 

gggcttataaggagggcatgccatttactgactattogacgtcitaacgg 
tatacaaattacatctacgcct 

>Sequence 874

■ 1 oCGTXKJTXXKnTTTCGAGCTACTGAGGATGACTAGATGACAAAT 
AATAAGAAAAAATGGCATTGACTTTCTATAGAACTTAATAATCAGATTTT 
TAAAGAGGTTAGTCTATTCTCTTATTTGAGAGATATGGAAACTATCTAGG 
CCTAAAGACTCTAAATCTG<XTGGAATCAGATAGTTGGCAGCAAAATCAG 

aaatagaaagcagttactcaacaaccaacagtttaatttaagaaacattt 

GACAAGCATCTCCTGTGGATAAGAOCCTATGCAAGATGTCATGAATATAA 

ATATGCACAGTAOT 

>Sequence 875

cctrrr agcgtgctcgcg tccg agctactttaaaaat aacagactctc att 
taagaatactcagactagagccttcagtgagttgtctgagggaaaggagt 
gaagtcaggacttagatagaaagattacaaagaaactcaaagtaagcaga 
ggaaaaagataccaaaatgacagcttcagaataagcagtaagggaataaa 
GAAAACAAAOTTOTG1 GTG 1 G I gcatgtattacatgataaatccatggaa 
aaagaactcgcaatttactaaaggaataattcatggtcataocaatt^ 
gig 1 ccaaaactaacttgattagtatcagaaggaaagtcaatctttaaac 
agtccttcccacatctgctacttccata^ 

TTAAGAGT AGAGAAGC<^CAGGGCXXACTGTCAAAACAAACACKjC^ 
CTGGGTTCCAAGTTrCATATAA 1 1 1 1 CI 1 G AGCCTG AAAGTCGTGAAAAC 
JOCTTCTTCTA^ 

GCGGGCGCTTAAAGGGGAATATCAT GCTACCTGCCCGG 

CCCTTAGCG TGGTCGCTGTCG A CGTACTTGCTAAAATTt^G ATTCCTOGA 
CCCACXXTAGACCTACTGC^TCCAAATCTCrGCAG^ 
CTTCATTATAACAAGCTTOCACATAGATTA Ti I IGICAGTGGCCATGTCT 
TGCTTTGCI 1 CICTGGAAACTACTCTCCA I C I ! CI GGAGTGG AATGTCOC 
CCATTOCTATCCACATGGTCC rCXjMJLlULCIGATACTCTAGTlCTCAGATG 
GC^CCTCCTXftAACTGGGCOQAGCTCAATCAClTroOCAG^ 
CTCGCTGGAGCTCAGOGGTCCCATGGTGGGCAA^ 

AACAAATCOCTATGCATTTAGAACT AGATDGGGCTGCATTACAACACACA 
AGCACTCAAGGACTC1CTGT AATATCTGGACTCATAGGAAGGTGATCACA 
GCAAGAGGGCAGATGAAGCAGACTT AGAG AAACA GATGAG ACACAG A GAG 
AOCCI GG1 rClGGTI 1 G 1 1 C I GAAACATGGCCAATCTOCTATTTAGATTT 
AGAGAGCT ACCTGAAACATTCTTACAAAAAAAA I I ICCI I IITOATATOA 
CGCTT AA TTTG AGGCCT AATTTCTAAT ACTGTGCAA TCTCAAAGCTA TTC 
AAGGAAAATAAAAGGQSCAAAAATGTCTAATACTGCCATTGGATTGGTGC
AAGGATTAAGGGCTTTCAGGGGAGGAAGGGCCTTTACCTGGAAAAGTTTG 
OCCTOGAAGGCIGTGACAATTA lTIO ' l 'T Cl CCCTTCT 
>Sequence 877

GGTACI 1 1 1 I 1 1 1 1 li AA I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 A ATAGAGATGGGGTCT 
TACTATGTTTCCCAGGCTCGTXrrCGAACTCXrrGAGC^ 
CA CCTTAA CCTarTGACTAOCTGGGACTACAGGTOCAGACC^ 
TTACTTCTATTCTTACTTC ACAAAGG AGAGG AAAAAAA AAGG AAGTTTAG 
A G AAA TTAA G T A G TAACTTGTCCAAGTTTA COCA CAACCACTAAGTGGT A 
AAGCTGGCCTTTGAACTTCAGCAATGTGCTTAAATCT 
TACACTATGGAGGACCTTTAGO 1 1 1 IC1 1 AAATTCAGAAG UIll I 1 1 IC C 
ATGT 

>Sequence 878

CCCTTTG A G CGGCCGCCCG GGCA GG TAC A TGTTTGTAAAA TTCCTTAA A A 
T A TTTA TG CTC A AACCA A CA TTTCC A TTTT ATCTA TCTT AA ATA TA TCTT 
CCT ll III I I ACGCCTAATTTCTTAAACTCCCAGA G 1 1 1 II I IL I OTA AG 
ATCTAGTCATCTXjTAGCACTTCTX^CAAATTAAGGTCTCITATGCCCCAA 
ACAGTAACGAAAGAGGTCTCTTAGTTGGACAATAAGCAGTGAAAGATATT 
TCrTATGGGACAAGAAATTAACATTATTAGTCAAATGTTGATGOCGGTAG 
GCTGAGAAATGATTCTCACTTAAAAGCCCCTGGGTrTTAAAGC ICICIT A 
G AA AAACA TT A GTT AG A TG AA AA A NAN AAAAAAAAAAA AAN A MGGTACC 
>Sequence 879

GCTACA A GG AG CTAGATCA TGAAGG AAGGTCAGGGCAGGGTTCACAGG AT 
GAGGGCACTTTGCCATTl 1 1 1 1 G IGATATTOGTCAACAAATGACACAGGT 
TA TTTA CAA TCTTGACCTTTTGGAAAAGATACAGCAOGTAATAGCCT ACA 
GGAAAGAGGAGCTAGAAAACAAGTXKX^CACTAGAAACACTTTGATAGCT 
AAGATC CTGTCTATCC 1 1 »G >GGNTATTCTGTGCAGTTGTCTGCCTGGCT 
TXTTTGG AAAA GTCCAA TCT AAA GGTGCTTGATTGCGCCCAAGGATGTC^ 
CA TTCATTCA CTGGG A A CTTACAAGCCCTCTTGCTTCAA TCAA CTCCTCA 
AJCAGTnTCCAAACICTTATTCTTACT 



TTAAAGGGCCA 
>Sequence 880

GCT ACATACAATAG ACTATTATTC^C<XrrTAAAAAGGATG AAAAAATCCT 
GACATGCTAAAATATAAATGAATGTTOAGAACATTATGCTAAGTGAAATG 
AGCCCATCT^^AAAGG^AAATAC TC 

ACTGGGAGTTACTTGATATAGAGTITtlAATTrTGTAAGATAAAAAAGTTC 
TGGATATTGGTTGCACAGCAATATGAATATACITAACACTACTGAACTC - 
A CACT TAAAGATG GTTAA GATGGTAAAT IT ' IO ' I I A G G T G TT' I C T1 A CCAC 
ATTTTACAAAAAAATTTTAATTAAAGGAATTACAAAATCTACAAAATACT 
ATTCATCAI 1G1GI 1 1 CCAGTTTATATTCAACACAGCACTATTTCAGGTA 
TACTAATTAACTTACTATX1ATTTCAAAAGATGTCTATAGCTT ACT AAATA 
TCCAACTCTTATTCATACATTTTGTGATTATCT 

CAATGGAATGGAGTTCTCACTACTTCACXrTGCCA GCCTTCAAAAAAAGCC 
TCTATTTCACACTACCTATTAAATGGCTACC I GCCCCGOCCOCCGTTCAA 
AGOG
>Sequence 881

ACCACTGCACTCCCACCTTJCCTGACAGATC^ 

AATTTAAAAAAATAAAAAATTT AAGAA TATTTCTATGOCCTTTA OCAGGC 

CAGCTTAA TCAOA C 1 1 CI CI aggcctaggacaggcttaagatcagttaat 
^/A^CACrTC TGA IGI I1CTIGAGCATIGAAAAGTTTTATTCTTTCTG 

CTAI I I GO tjGTTATCTGAAGTTTAATTOCATTC A I IGIGI I I A T ACT 
A TTTATCCCTGT A C^TGTTGGAATTACCAGTCACTTACATrcAT ATTTT AG 
1 1 1 1 1 RiOCI rATCTOCTCAAAGTGTGGGGGACTITQAATGGGTQTGTAA

taaaaaag cttcg i tct agaaagtaata g tt i ci c ca tgtctt aaa t att 
ttaaatgacactgaca 1 0 1 1 i i laaaaatcgggattgttggctgggcacc 
gtggcoca ogoctctaatocta gcactttt5c<iaggtggaggcgggccgaa 
cacaaggtcaagagaacgagaaccatcttgccacacgggtgaaaccctat 
ci i ioc 1 1 0 1 oaaggaaoaagatgatacatoatgaagggtcocttqgcog 
ggaccaccctaaggggogattcccgaocatgccccgcctttcaag<^^ 

CCACCCTCCG 
> Sequence 882

ACl'1 1 1 1 CI I oaatatttocagggcacaagatattcttatacagaaacct 

CA GAATG<LAA AATAGCTAAGACATAAOCAGTOTTTCACAOAACCATCCAT 
CA OTC1 Mil I A OOATOTAOCA C I C 1 1 CIA TOTATCaCTTAACCAATCAT 
TATTCTTACOOCA ICI 1 1 1 1 CCCCAGCCCGTGCTAGAATTTAAAATTTAC 
CATT ACTAAGACACCGTGATAGTAACCATAGAATTTTCOGATOTCTTTTT 
1 1 ICC! I GCCCTAAAOCTTCACAGTrcTGOCAGGTOATTCAAATGTTAAO 
ATCOCATAATCTCGCC rCTCTOCI CAAGCGAACACTAACACTTTAAAAAG 
TQOO AATOAAAAATCWiAACTXnTQAATrAOACACAOTATTroOOOOOCA 
TCTTCAATTTCAGAAAGAACAAGTGGACATATCAACCOCATTGOCCOCTT 
CTXTTACTCATACTCAACAATGATCTAOCTTOGCCOCGGAACA 
>Sequence 883

cccttagcggccgcccoogcaggtactcaaaaatttaaatagccatctaa 
aaacatrtcagotaaaaaatctctcccctccatttca^ 

TTrrCTCACTAAAACACATTTTATTTAATAGTGAGGTOAAATTACATTAG 
CC C1CI I CACATTTATTTOATTCAAA CC 1 1 1 1 1 1 AAAAAACTTAGATTCT 
TTTAAAAAAATAAATTAAAQAAAAATCACATCATTCATCACATAGCCAGC 
TACATKrrGTAGTrTGATCATTCACTTTAACCCTTTTATCACTGTTGATAT 
G AACATTGAGTACC 



GGTACTTTCATACATGTAAAGTGCAAGGCACCTTGCTAGAGAGCATAGGA 
GCTA TACT AAG A TA T AGAGTCCTGCCACAAAT A CACACAAAA T AACATGA 
ATAGAAAGTGTCCTAAAAGTCATGCCAAATAAAACAGAGCATATAACTGG 
GCAGAG<XlATGGAGAGTCACATGCTGGAGGAG<jTXjAGC<rrTO^ 
TTATGGG ATATGAACTTGAGATGTTGAAGT AGAACTGAG AGATTTCTGGA 
AAACT AGATGTATGAACAGAAGCAGGAGaAATAOGAGAAGGTTTGGAAAA 
CAGCAAGCAGCTCAG1 1 ICI I PGGTGGTCCAGGAGAAGAAGCTCAAACAA 
CAGTCAGTGATAACA CTAAAAAAATCAAAAATT1TAAAAGTCTGGAATCA 
*^GCATAAAGAAOOOGTATGCAGGATTTTTATCTCGCAGOOCTOTCTOOC 
^^Q^^G ACAGA GATCCAGAATCACTTTOCAGAATGGTTTAGGGTCACCT 
TOCAGA 1 ICI I ICI I AOCAAOOCTTGACCACACCATTTTAAGATTTOCAA 
TGGACCITJACOGGGOGGGOOATOGAAAGaOGA 
> Sequence 885

GGTAC AATAAAC AAGACAGTGCC 1 PC I roTOACCAOGOOCTOOGOCICTT 
CATAO CTCI 1 1 roOCTGOCTTl 1G1C1 IC AOACTTO AT C TOCI I CI IA CA 
CATTCAll 1 1 1 iCAC^CrrTTOCTATCTTAGAAGCAAGGATCATTTTTAAT 
TOPI I JU1 1 IACTTCAAAGTCCCACTCATCAGAGGCAQOGTTTCGCTTAT 
ATTTOOCTCAACTAC 1 1 ICICI IOCI lUGTTTAOTAACACTAATGTTTAC 
TAACATTAAAATCAAACCAOTTTTGCAGCTAGCATCTATTGACAAATATA 
• ATTATTTATTTCAAACTOTATATTOCAAATrrAAACATA 
TTGAACATTCTAACATAATAGCTTATOATAAAGOAAAATATAACATCTGG 
1 1 nOGATCroAAGCACAACCACTGCTAOATATTTTG^ 
TTOCAATTC>AGOT0AATCTCCGAGGGTTJTGa 

CAA AAACCTO TCX1AATTTOOOATTGGTAGAAATAAAOOOOATGAOCATTA 
OCTTCTTn I ATO00CAAATTGGATTTT ATGOCrAOCTAATXKJCTTTCTT 
GGATATGATGCTTGGCAATAGCClbCI I U 1 AATCTATTTTGOATAGAAAA 
GGGAACTTTAATATTCAGATTAGGGGGCTTGATTTTGAC I III 1 1 1 AACA 
CCGAATGCGAT 
>Sequence 886

GGT ACA T ATGGCTCGGCAAAGGGGGACTGGATT AAT AAATTCTGGTAAT A 
TAGTAAGGACAAAATAAATGTAAAAAAGATAGAAGTAAATGTGAGAACAT 
CAACATGAACGCGTGCTCC1 rTCAGTAGAAAGTAATTTTTCTCCTTTGTC 
ACTCAAATAGCWXKAGAarrtMCATC^CCCTOCC 1C I O CI I UCATGCTC 
TAAAA CI 1 1 C C I OO GOCTCAGATTTGGATCCTAATATGATTTTOCACTTA 
CTOGATAACAGCTOOCl OOAOAAGGGCTCATTCTTGGATGGACAACAGAA 



CAGAAAGCTTGGT AAAGACTGTCCTGOCCAATCTGATTACAGTCAGTTGG 
> Sequence 887

ACCGATGAAAGTTTAAATCrAATCAACAGTATTATGCACTGGTTGAAGAA 
AACCAGGATTAAGACGGAGGATAGTCAGCATCGAATCTAAGAAAGGAAAA 
GTCOGGTAACTATATGTGTTCATTAGATrCTAAGCTCTTAAGGGAGAAAG 
ACOCTG AGTCTAATG A A TATAA A CTTTAAATTTAAAG AA AA ACATNGTCT 
GTTATAGAAAAG rGGTCTTI CAGG 1 1 1 1 P I A A AGATG AACTATTTCATCT 
1 1 1 0 1 A GTTOAATGCTCATGOGGATTAGCTAOCTOCA 1 1 IUU 1 IAATOO 
AAA CCI I 1 1 1 1 AACCACAACCCATTTAO 1 rOCTTGACTCATATCAAOAAA 
A GGTX>CXXTTGGTTOGG AAAA GTGGAAAATTCCTAATTA TGGAAAA TGAT 
TAT0CC1 > 1 ATG ATAATTAAATATTTTA 1 0 1 1 ICATGCTTCA IC ICI IA C 
ACTTATAAAGAGTAATTTTACCTCTATTTATAATTTATTCCTAATGCGTA 
TTCTT A G A TTTTT AT AATCA TTTT A TTTCT A G A ACTT A TTT ATGT AA TG A 
TCTAGATAGTACTA 1 1 1 1 C I P ACCTC ATATTCAA I fCTGI 1 ATGAATTTC 
TTATAGGTCATTAGTTAATTAGTTG AATCA 1 IPCI ICI ICI 1 1 1 1CIA TT 
TATATAAATOGGTCOTaTI GTTC I' ACTTATANAG I TGI l OA ACGCATCCC 
TTTTA nrrCTTCn IG CTATCTCTATGTAACTAA ICI 1 1 I AT ATT ACO ATT 
TAACGTATAATAATA Till I ACCCTGATTATCCATTCTCATGTTCGTACT 
TTCAGTA T1CTT T ATCATATTATTGaATATTTG 
>Sequence 888

CCCTTTAGCt7nrnXKXjG<XX} AGGTACX^TTAAOCGT C 1 1 1 I AAAAAATT 
ATTATTAGTTTCA O I GC I GTI - l CTI G AOGOAGCACCGGTGGTGCAOGTCN 
AT O I 1P1C1 fCTl AATTACAAGTCTGCAGTTGOCAGCTCTA ii 1 1 I C I I AA 
AAGCGGACATAGTATCTATGACTTCTGACrAOCACATTCATGCTGAGATT 
TOCTGCTCCACTTTATATACTTTGACATAAAATATGTTTTACGGTAGCAA 
AAI G1H I I IAATCA CH M l ull I I ATTAOATAA LI11 1A A H 1 1C I U C 
CTCGAGTTTATTTTA IP! 1 ICAICI ICI I I I A TAAAAATTGTCAATTCTC 

ci i rcTA A cn 1 1 1 ri Acri r t a aattaaatatat atacga ticttcigc 

TAATCGC TGC 1 T C 1 C M ATCA TTCT AAT ATTAAAl 1 1 1 1 1 I A TCTAAATT 
CCTACGTACTTAC ICI I I CI IC AI CI 1 1 1 1 1 ATTTATTAAACTAATACAA 
TOGA TATA 1 1 1 FCOTCOI I IA TGG CI 1 1 IA TTTAI CICI 1 1 1 1 ATCAATT 
AATATAATTTCATA 1 1 I CC 1 1 IA1 CI IC1CA ICI 1 1 ICICG GCTTATTTT 
CICI 1 1 A TCTATAATAA TGTATTAA I nTGTA TAA ATCl I FCTGI 1 A TGT 
ATCACTT A 11 C 11 1 CCT1 CAT CTI CO t^TCTTAAAATATTCATTTAGA T 
TATATTTAA CI 1 1 ICC1 1 C AAATATGGCACTTACTCCTTCT C I ' l l ACTCT 
TTTACTAATAOCATTTAAAAAATATAATCATAGTTCATGTTTATCTAAGT 
CCrOOCTATTT ATCI 1 1 IA CTATATTAATGCTOTAAATTTATAOGTATOT 
TGATT 



GGT ACT AAACAGGCCAGATAT A 1 1 C ' IC ' IC ATTAACTTATTGOCTAOCAGA 

gaagaccaacatttttaaaagtttatacatatagttaatttctattatoa 
ttatatgatacaaatggaaagtgctatgaaaatctggaacaaaagagaat 

AATCTGTC1 GA ACaGTCAAAGAAGACTTCTOGGAOATOACATCTOAGCTA 

aaggttgaacaaggaattggaaaacagctgck^tgtgcaaaagactt^ 
oactcaaggagttag<xtttaaaaaaatcaagaaagttct^ 
agcagagtttcaaatagtgoctcacaggccaogitaaaoacctgaggoct 
ttattctaggagaatagggagctgctcaaggaatttaagctttiagagtga 

Page 37
(of 261 pages in Table 2)




Table 2 

caagatcagatttgcaatgcctttcaaqaggtact^ 

octoaccct ttoc aattataoccattctaact 

tatatcctgtcrttaatgagtgagaaaagaaagaooaaaataaataagtt 

CCTroO C r O CCOCTDGT IO OAAAOGO 



PCT/US00/33312



AC1 1 occ M OCAAAATTATATTACAAGAAGAAGCACACTTGTTATAGAAG 
TCCTOAATTCTATGGAACCTAAATCTCTCAAGTTACOC 1 P rCTTTCAGGG 
T TXGTCTC CCCACCTCCCAGACCTCA 

ATCTCTI I AAGGCTAGGCAAGTATTGCGCTGATGAOCCAGGGACTGCCCA 
CCAA TTXiOCAOOCOCATTGOOTQAT AAA TGTOCAA GGACCTCTA GGCTGA 
CGACACATTTTTCATCATTAATCCAGTCTATTGTAA<XAGC • 
ATTGATTCGGACTAGGGGGCATCA I CI O CI G I 1 A AOAGGGTGATGACTCG 
CTA AAAA TGAGGGCCTGAAACTAATCAAATATATTTAGAGCCTTCCCTGG 
CAACTTGCTrGGGAGAGCAGCAGTAGACAGCTAATAGGGGAGCCCCAGACA 
GGTAGCGCGGAGCTCACCATGCTTTGGATGGGAATGTGAGAAATCCATTT 
TGG AAGCCTOGTGTGG AA TTCCA OCT A TT A T ACA TTC TA GTACCTTCGCC 
GCGACCACGCTTAGGGGC
>Sequence 891

ACCACTTCATG<XnAAGCATGTGCGGGATGGAACCGG 1 C I ICC IP GGCTT 
ACA ICI I IOCI I raOCTCI I CTTTCGCTGTGATCAGTCTTGGGGTAGGCC 
TCA AAGGC TGAATCTTCAATATAAATACAACAGTGAATGAACAACAAATG 
GTTATTTTAAAGATCTATC7TC<^TGGCTATTTAATTTCACT AAACCCCA . 
GGTTOCTCACCTGTTGACTGGAACAAACAATA G 1 O CC Tl CTTCATCOGGG 
CATGGTGAGGGTTTTAACCCCGCATnnTJCACAAAGACCGCTTAAATTAT 
ACT AGATGCTCAGCAAATCTGAGCrATTATTTTrATCACGAClTjTCAGAG 
GTCAGATCAGGCTTCGGGGTCAGACACACCroGGTrCAAA 
G CCACTTA CTOTTOOAGCCGGGGCAAGTCAOTTA IICII CCC I GA OGGTC 
API 1 1 1 L 1 LATtXXTAAAATTCCAACTAATAATACTT^TCTTTCAATGAT 
GCCGGGAGGTCTTAAAAATAATATAAGTTCAGAATCATAAAACAGGCTGG 
CACAAATTGGATCGCAGCCAATO I CC } lOCACXCCIUIOIllCCTCCCTT 
AA TTTGTGTTGAGGAATAAGGCCAAATCTGTACCTCGGOCGCG ACCCCGC 
T 

>Sequence 892

ACTACAGAACAGGAACAATCTGOCA 1 0 1 0 1 0 1 1 1 A CAACTTCAGAAAGCC 
CrOGAATGACAGT TOCCAGG GCAOl I CI IC I O AATTTGCAGOTCAGAATT 
AGTGG ATGAT GAA II I 1 1 1 ICA CACATGGTCAAC IC IP I OCCACCTGCTA 

CAAGA i c 1 1 ggaacaggtatatttatttatttaatgatgatcaatcattc 
ttccaacatcaggqaacatcagc<iaaatcaoctagtatatgc 1 I 1 1 1 1 I P 
AGOATi n cagctccaaatcctgaaagcattcatgaaactacataaatta 

CM 1 1 U 1 1 AAGCAAATCATCATAACTTAAATOCACTCATATCAATCTGGAA 

GGAI 1 1 OC IPO i pggcactaacactcaccacatgtttcagtgtggccaag 

TrrACCATCCATCACGGAI 1 1 IPIOCI I UP I OAATTGTAGGGAGTGAAAG 
AGAGAAGGATCTTTPOCCCA OI lOIC I 1 1 1 1 lA CCTATATCTOAAATTCT 
CACTTACTrCAAOAACAAAACATTTAGACATTTAA 1 1 1 CC II 1 1 UG GGTTN 
TAAGTGAT-ACATGTTTAAAATTGTaTATTTAGAAAAAA I ICI I II I A TT A 

tatataatttataaatcagtgoagagacaatttatactgagaaaatattt . 

AAT TGGAAPi I IP I IPiCTTICTCACACACACGGACAAOCCCAACTTTTT 
Al IOCI ICI IPAACTTTGCAAAAATGGTTAAACIUUCI JULACATTCATTT 

TGAAGGGAGGAG

act agcattaaaaaagtcctacaaattattagagagaaaatacaggttgc 
acgcaaagcataaagaatgagaatggcatagacatcttaacagtgccaga 
gaaa ctaaaaagtaottctgagtaaaaatgaactattta 

CGTT AATTAGGT A TAAAGGT A GAGTT AAGACATTTA T AGACA T ACAAGA T 

AT TAAGATTA CTGAXnTJAATTGATATTCAACAGGGOTGCAAATO 

AAG ICI 1 1 1 CAACAAATAO 1 OP I POGACAAATOGATAGCCACATGCAAAA 
g aacatatat ataagagctaaaaccataatgcttttaoaagaaaatatag
ggtttatcttcattjaccttgaatttgacaaaggatt^^ 
caaaagcacatgcaacaaaaoaaaaattggagtgatatgattaatatggt 
ggaacaqgaagtcttcagcttgcatccitjcgcc rici ixj acacaaacaac 

AATCTGGCAGCCATCCATGGACAAAAGTGOCTCrGTGGGAGCTCTAGGAT 

CCAGGTAAGAAGGTATCAAACCCTCCTAAAGCCX^CACGGAGGAGAGGT 

ACCTCCCaXGCGACACGCTAGGGGC 

GGTACAGGTCACACAGCACA TCA GTGCCTA CATXjTTjAGCTCAG ACCTGCG 
TCTOCTGCTOTCTG ICII CCCAATATCCATGACCTTGACTOATGCAGGTG 
TOCAOOOATAOGTDCATDODOG ICCT O CI ogagcccagagcacggaagcc 
tggccctccgaggagacagaagggagtgtcggacaccatgacgagagctt 
ggcagaataaataac iicii i aaacaatttt a cggc a tg aa g aaa tctog 

ACCAOTTTATTAAATCGGATTTCTGC^CAAACCTT^ 

atct 

>Sequence 893

tctgctgctgicioil i ictxaatatccatgaccttgactgatgcaggtg 
tccagggatacgtccatccccgtcctgctogac^ 
tggccctccoaggagacagaagggagtgtcggacaccatgacgagagctt 
ggcagaataaataac i ici 1 1 aaacaatttt acggcatgaag aaa tctgg 

ACCAGTTT ATTAAATGGGA TTTCTGCCACAAACCTTGG AAGAATCACATC 
ATCT 

>Sequence 894

cccttagcgtggtcgcggcgaggtaccttgagctgcctcagca c ic i 1 1 1 

gcc^ttcgtgctagaaacagccaaagccagacaaccaaattacagatgct 

taaatgttaatg<xagacaccaag<?ctcogtgaacttccctg tt g aacat 

ctoaocccgactacttgaggacatgaaacctaactgtgcagctaattaca 

octtccaagggcaatgacatcgggtccn'atgattrtattcaggaaag 

taag<x>atcc<xx7rcactgtgaacatcatttgaagggaagt aacttctt 

agctttatttjcacaaatggtctatc 

ogta ccggtctagtgtatagaatggtrtgtatcaaactagatctacatta 
ctttactagaa atatagg gcaataataaaatttccaaagccaaactgaac 

GATAATATATA 1 1 ICI 1 1 AGAAAGTCTCAGAAAACCGATTOCTGAATGaC 

aaaaoggagagataacttacaactaggtgatatctoaagttaaattttct 
tggttatctatttcaaaaattcacaactattctgcactaaaatgttt^ 

TGGGTCAGGCACAGTXjGCTCATGCCTGTAATCC^ 

GAGGCAAGAGGA 

>Sequence 895

COCTTT^GOGGGCCGCCCGGNGCAGGNW 

GGTTTTTAAAACT n CT>IGGACCATTG^CTTTGAAACCCCGCCAAAGAAG 
GGCTGMGGGC1 1 OG I GGATTGTAGCOCCAACCTTAAAAATTGKFIT0TCAA 
AAAAAAATTaCOO OTTAC O 1 CCC 1 1 rO CAAOOTOGAAAAAGOOOGACTTT 
llllllllllll ICCX- ACAAAAAGAA CCCI 1 1 1 1 1 1 H 1 1 1 A AAGGGGGG 
GAAAAAAGAAGTATAAAGGAAAATTTTGGGGGGATI ICI rOO QCCOOOOG 
C 



PCT/US00/33312



ACTGACAC^TGCCTGGCTAACCATTiTCCAATGTrc 

. ogacaoat aoaa ogctctoct oc acc c r n lootci ico oototoo g cnt 
gta^gaaatccgccaca caagaaagcactgacatttggagcctca tcagg 

TTCAGA G TTGAAAGT GAAATA AAGG ATAATAA IClllOlCIIAIII I C I I 
TGTTTTA A 1 0 1 1 1 UJCAACTTACGTTAGGACAATGTCAACAAAGACAGAT 
GTCCCT AATAGTAATTGCAGGACATG IP 1 1 1 1 CTCATTCCTATCA 

A TTOGAGGGGGCCA T ATCCAGG 
acctgtgatctgtataggcagaccagactggtagogaagaaaagcagaga
i atcaagtgggggacatctotttoocctggcccl lti atlggcxttogaa i i 
ttctggtaggaggaaggcacaaaaagtagactocgattacaggcgtgtgc 

CACCGCOCCCGGCCTAAA O rGTOTTTl ATAATAAAACCTCAATCTGAAAC 
A TTTT AA T AAAACCTTTAO ATG ACT AGA TTT A TGTTT A TTTTOGA7TT AT 
GTTTATATGAATAAAAAAAGAAAAAAGAOGAGG 
>Sequence 901

GGTACCTATG AGATCCATTTG AAAACTTACC I ICTtTATATCTl PCTTCI 
GTTGCAA TTTCTTCCA TTACCTGG AA T AGCTGCTTTGGACGGCAAACCAA 
GCAATGCC CI 1 1 C ACAGCTGTCGC ATGAATGCGGAAAGAAGTCTTGGTAA 
GGAAGCAATTCAGAGAACATCGGAGCATCTCATGGCAGCAGTCACAATTT 



CTGTCTGTAGGCCTTT AATG A 1 0 1 1 1 IA TTGAA I 1 1 1 G OT 
>Sequence 902

GO T A CTTCT A T AC AA GGCAAAATG AACTCT AAGT A AAAAA G AAAATCACA 
CTTCTA A ACACAAATTAACXATTTCAGT ATTTAATTGCTCCTAAAAGGTG 
TATTCTACTTCATTAAATGTAAGAGAAAAGGTTAGCTACATTACXjCAGTT 
TAAGAAACAGGATAAACTNTAGGATATAAACAGTCTGATTACATTTTCAC 
ACITTCAAOCATCTTATTTATACTCTACATrAGATAATCTTTAAATTCCA 
TCATAAGGTTrCCCATGTTAACTOCATATAAAATTTTGTAATCCTGOOCA 
CCCCA TG TCAACTCA GTGT AT A CM 
>Sequence 903

GGTACTGGGTGACAGGAGAGAGCTCATCTOACOOGAGTCroGGTGGTCTC 
AGGCATGGTATAAAGAACTAGGCCAACGAACTGCACTAGACATAGAAACT 
A GCTG AA T A A ACTCATCCACTCCG ATTTC A TTTCAGGT ATCTCA TG AG AA 

CCA TC A G7GCCAA TG AACAT AAA TG ACTGCCTC AG AGTT AT ATTAACAAA 
ATAATTAATTCAGACGAATTAAGGAATTAAACCAGCTATGGGAAATATAC 

actctatacttagatgcacattt 

> Sequence 904 

ACTT AAA TAAAATAAAATTAAAACAAATCA TTTT AGAGATAAAGAGTGAA 

GTTACTGAAAAAGGTCACTACGACTCTGTTTATGAAGAAAGGTTAGTATT 

T AAA TCATG AAAAAAAGTAAGAATACTT AATTATTCAAGTAACTTAAAAT 

TGTAATTCAGAATGG Cl Til ATGT A TCT AAAACAATCTGGGCTGCTATAA 

AAATTT^GTCAACTTTTAAACTTCCAAACACXAAA 

TAAGAATATCCGACCTACCGTGCAGGACCAGAGGGCTCATCTCT 

>Sequence 905

A CTTAAAT AAAATAAAATTA AAACAAA TCATTTT AG AG AT AAAGAGTG AA 
GTT ACTGAAAAAGGTGACTAGG ACTCTG TTTATG AAGAAA GGTTAGT ATT 
TAAA TCATGAAAAAAAAOTAAG AATA CTTT ATT ATTCAAG TAACTT AAAA 
TTGTAATTCAAATTSG C 1 1 1 I ATCGTATCTAAAACAATCTGGGCTOCTAT 
AAAAATTCAGTCAACTTCTAAACTTCCAAACACAAAATAGTTATACTCAG 
TCTAAGAATATCCGACXnACCGTGCAGGACCAGAGGGCTCATCTCTTTJCC 
G AGCTTA TT ACAGTTTTG 
>Sequence 906

GGTACCTTTGCTTTAAATGCATACT AAGCTGTGAATG ACTGATATCAGAG 
AC! 1 ICI I GGAAGTAGGTTTCATAGGATGGAGG ACAAATG AAACTTTATG 
GG<XlAAGAAAGAAGGGTCAOTrOGOTXXITOCATTCAAATAAG TOG ITOCA 
AAAGCAAACTAGGTCAAGTTrTTAACTGGCTAGTGAAAATGA 

AGAACAGGATTTATTTAGGGCATGCAATOTGGAGGGGCCrAATGCGAACA 
TGACAGTGTT 

> Sequence 907 

GGTACAAATTOCATTGTCAATTTATATTTGTTTCCCCACTAA^ 
AAOCTTGCTT01 I'M GTTTAAGTATOOCTOGGGCTCATCACAGGGGCTGT 
TGAAGTTCI 1 1 1 GAAATGAATTGAAOAATGTGAATAATAGTTCTAGTTCT 
TCGGGATA ATGGAAAGCT A^ATAAGGTTTATGCTAGAfiGCTTTTTACTGCTG
GGA C rCTCI ICI rGTTTI I GGTTrTTAGOAAAAAAGCTAG AAAATOCAAC 
TTCAGCT AG AGTAACAGTAOT AA CTQACTTG AAAGTATGTCAAAA CAN AA 
ACTGTTAAG 

> Sequence 908

GGTACCTATGAG ATGCATTTG AAAACTTA CL ' 1 IG1 1 IA TAT OI I I CI IC I 
GTTGCAATTTCTTCCATTACCTGOAATAGC I O C 1 1 I UGACGOCAAACCAA 
GCAATGCC C I 1 1 C ACAGCTOTGGG ATG AATGGGC AAAG AAG 1 C I I GG TAA 
GGAAGCAATTCAGAGAACATGGAAGCATCTCATGGCAGCAGTCACAATTT 
TGI G 1 I GCOTAATATTTCAGGAACTTGCAACCCIOATAACTTOTOCCTGC 
CTGTCTG I AGGCCTTTAATGATGTTTrATTGAA ITTTGGT1 

> Sequence 909 

A CCCTCTTCTCA A TTTTGCT A TG AACTT A AAA CTGCTCTT AAAAA A AT A T 

TTTTTTT A AAAAA GG A GGG AGTT ATT A TCA GA G A TCCCATA G ACCTTA AA 

GGATAATGAAAGAATGCTATOGATAACTTCATOCTAAAAACTNCAACAAC 

TT A G AAG T A TG AAATG A ATG AA CTTCTCCA A AAAAAT ACA AGTT A CCA AA 

ATTGACATGAATAATAACAGAAAATCTGAATAACGCTCTAACTATTAAAG 

AACGTGAATTTGTt^AAAGCTT(X:CCAAAATAAAATTCCAGGACCAGATG 

GT 

> Sequence 910 

ACTCAATGGGGTAGGO POTCI I O OO ATCTG ACT O 1 1 ICI I AGaCCTTCAA 
TG CI rCTTO G Cn IC C IC ACTGCTAGTTATAATTCA GI 1 1 1 C I LAGGTCT 
AAGTCATTCATCACTC 1 1 1 rOTCTGCI J 1 1 CAGCTTCCAAAAATTCATTG 
CTATTATCTCCTCTCCTG 1 1 1 1 OC CTATGOTOT Q I ITCTGTCTTTTTCT1 
TAAAAAAATTCCTTTGTGGTGGTTTTAGGGGAG I I I 1 I U GGAATATATAT 
TTAATGTACCTCTGGCGAGACCGCGCTTTAGOCGATATCCrGCACACTG 
> Sequence 911

GGT A CAA CCT AGCC AGCTGCACA GCA GCTCTCCAAG AAAAAGGTGTA TAT 
TAG AC AG A TTCA A TT ATTCA TCTTGTG A TT A TG A GTAGT AA CCAA A TTGT 
CTATGTAA 1 M I C " I ' I ATGGTCAACTACCCAA AGCAAGGCCTCACCTTAGG 
CTACCAGCTTGACTCTrAAGTGGACAGAAAGAGCCAAAGGCTAAAAGGTT 
TGTGAGAAACCTCATGAGCACTGAGT<GTTCTAGTTCCAGATGAAAACCGG 



>Sequence 912

GGTGG ATTTGAATAAGATGAAAGATAAATTATG A 111 lOI IC AAOTGTT A 
AA AT AAAA CT AAG ACA CTT AA GG ACC A CAAAAATTT AG ACCAAAGT ATCT 
TGTAAATTCTACCTGGTG AAAGTTTGATATAGCACACATATGA C 1 1 1 1 CT 
ATATTA 1 1 1 IC1G 1 1 1 I GAGTTTAGTAGTAAGCAGATGGTTTGTA H I IC 
TTTAGTTGCAACTAAGTGATCAGTTTCATGA II IC1CI 1 ACTATG AAACA 
TTTTTTTTT1 1 1 1 C 1 1 A ACAGTTATCTTA 
>Sequence 913

ACCACAAAGTTATTGCCTACATCCAGGTCAAGAAGATCTTCTACTG 
TTCITCTAAGAGCrTTTACATATAGGTCAATGATCAATCTAAAA 

gttgttxxatcattaactctagctttagacttjGT'atactaattg g i i pen 

ATA CGAA CTGGGTT AAAGGCA T A GG A CA CA TGCA GGCTG T GTTCA TTTC A 
CAGCAOGC^XrrGTAATTAGGCAATAATTACTTACCATCATACCTAGTGA 
GGGAATATGGGAGAAACAAAA(^OGOC^TACAGCTTCACTATTATTCX-TA 
CT 

>Sequence 914

aaaaccttagcgnggkco(X>gccxi;agggactngaggaccaagccacagag 
caagcgctaaaaaaaaag^n^aactagaacc^>jaccaclgnkncacxx:acc 
ccaatttcat aaaa tgt atcagtaaaaaaaaacaatta tctaaagttttt 

TAAAGTAAAGAAAAATTATTTATCACATAGGTAACTTG<iTGTCAACTAGG 
TAACTGATCTATTTTAATTTAGGAAGTTACj PGTCTTC C 1 1CCTCAATTTT 
CTCTCTACAAGGA



CAGAT 

G AAAGCAA GCCAG AG AATCTG A 
>Sequence 915

GGTACCAG AAATOOTAAATATATO AOTAAATATAACACAC T TTT IT C TTT 
TAAAATTTTATTTAAAAGGTAACACTTTGCAGCAAAATAATTAACAATGT 
ATTGTGGGTT ATA TAGTAGTAAGATGTTTGA CAT AAATTACA TAAA TAAT 
TGGAGCAGGGAAATAGAAGTGTGTTGTTGAAATGGTTTGATATTATATAT 
G A A G TGGT A T A TT ATTA TTTC AA GG TA GCCTTG ATAAG TT A A A GGTT AC A 
TATTGTAAACCCTACAATAATCATTACAAAATAAAGAGATATAACAGTAA 
GG 

> Sequence 916

ggtacttcatagaggtccagaccccttgcgtctggca itcctttgotcta 

taattcagtaaactctgct aaaaa ggaaacgagactagcttgctgtggcc 

ccttaag<x;acxx:agggtagcttgtgatggttcagat™ 

agag ci 1 1 1 c cag agccagatgttg agg agtttatcctatttgtcccctt 

cccttta a a c aaacaa aagtgccggctgg a cgca gtggctca tgctggt a 

atcccagcattctcagaggctgaggcaggcggatcacxrrgaggtcagg^ 

> Sequence 917 

ACTGCCTGGCATCCA PCTTCI CGATGGTCTGTTA > C 1 1 GTGGGAATG ACA 
TTCGTTAAOl PO TT I rCTOTGTCCATCCCACCCAAATAAAGAATGT 1 1CA 
TCAGCAAAGTCAATTGCCCTATAGTCATCAGACTCTAGAAATAAATTATC 
AA CG A TGA CTGCAGTGGGTG AGGCTG TTTGTTTA TCACA TCACTTG AGAA 
CAGAQTAAAGTG AGTTTCATA 1 1 1 1 CC I O AGTCTTG AATICTCATTTTAG 
A CATCTGTTCA GAAGCTTTCTAA GOCATGG AG T A TTCT AAATGA GCA 

> Sequence 918

GGTACrACAATTATAAAGTTACCAATAACTTTACATTAAGAAAATCATTT 
TCTTCCCCTTG AAAA CAAAG TA TG TCCTCAC1 ' I'l XXXTI GCT CTT TT A TTC 
ATGGCAGTATGAAATGTGTCCCTGATTCCCTCCGACCrc 
TGAAACAGTGGCCGTGGGAAGAAATACCAGATGGTATGCATATGGCTTTG 

ggaacagctttcagcagttxjTCacttgt ct'T r n r i aatgcatttca aaa 

TAGCAAAAAATAATGAGATAATTCCCTCAAATAAATGG 

>Sequence 919

GGTACAACAATTTATCCA I IC C 1 1 1 AGCAATAGTTGGACACTTAGAATGT 
AAAACTGTTT^AACAAATTGGTATATTGGAGTTTGGGTAGAAAGAAGGGC 
GTTGGAAGAGG A GGAAAAG AGGGTG A G ATG ATACATTAAT ATAAATTACT 
GAAACOTGGTOTTCACATTTAGAATT T T T I " M l I AAGTTTJCA POTT! A GO 
ATTTTAGTGCTT^GGAGGAAAGAAGCCCAGTGTTCCCCTTCCAGACCATC 
GCTGCCA TTTCCCTGT AA TATATCGTGTGTAG AGGAACCT AATOCCTGCA 
G 



GGT ACTGCTATTTCTAGTTCAAAATCACAGA1 ITICAGATTGAAAAAATT 
TCAATCCA CTT ATTTTTCAAA TGA G A TAACTGGGACAAAGAGA AATTCCA 
TGA CTTGCCCA A G ATT ACCT ACAG TTTAACTGTCA GCGGGGCTT AAAACC 
ACAATCCACATCTCC TGACTC CCAA ICC 1 1 1 CACTT AAAACAAACAAGCA 

GTATXIACTATTTTACAAGGAAAAAATTAAATCATTTTAACAGATTXXJC 

; GGTACTCACATGTAAACTTCTACTriXXXXrrTCAG ATT 
ATCCCAAAGCTATACACTCTCAGGGAA ICXX 1 U 1 G GATTTCACTOATGAC 
CACTTGACX^CTATCATAAAGATCAAGGCCAC^^^ 
ACA 1 1 IGIUIL^ICATCTCOLCI 1CACCCAGAGACTCCCCAGGOCIGCIG 
GGCCACACTTTGG 1 1 IG 1 1 1 GACTCGAACATAGTTTGAAAGGGATGGAAA 
TTTCCAAAA GGTGTT AATAGA CACATAAAGATTTTT AAAT ATTAAAAAAA 
AGAAAAAGAAAGAA 
> Sequence 922 

GGTACATACAGTATGCACTCO L ITCTCT0TSTTTTTT01 CTGAOTTOATG 



ATTTGGAGCTCAAAGAGCT agcggagggaaaagctgaagccattcaaaca 
cataatgagaattggagatgtaaaagaaggctgagttctagcwgttgcaa 
caacttaggagataacagaaccaattcggaatgagcaggaattgtaggaa 

TGCAGGCG A GG ACTAG AAG AATCAGCT ACATGCTGTTT A CTGGCAAA GCA 
GGAGAAATGTGACTGAGGACAOTATGCCACTGAAAACTGATGAAAGAGGA 
GGGAGACAGGAGGA 

> Sequence 923 

GGTACTGTTG TCTCA TGCTCTCTTTCTGTTAAT A GCA CCTC AATTCTA CT 
CTGOOOOACATTCCTC CTCTCTl 1 1 1 GGTCTGGAATCTCX3CCTGGCTTCA 
GGGACAGCTCAACATGGGCCTrGGACAGTCAAATTCCATCCC^ 
GACTCAC<XiAQACCATCCACTOACTCaTCCTC 

gctccclttctgcggckrrgctgagtgatgggacgacag 
gngctctccaag<xxkjTgcc<iaggaccag<xtgcctg 

CA 

> Sequence 924 

acttgccttgcaaaattatattacaagaagaagcacacttgttatagaag 

TGCTGAATroTATG^AACCTAAATCTGTCAAGTTACCTGTCTTTCAGGTC 
CXjTCTCCCCACCTCCCAGACCTCAT 
CTCI 1 1 AAGGCTAGGCAAGTATTGCGCTGATi 
A A TTGGCA GGCCCA TTGGGTG A T AAATGTCCA A GGA CCTCT AGGCTGA CO 
ACACATTTTTCATCATTAATCCAGCCTATTGTAACCAGGGCCACrCACAT 
TGATT 

> Sequence 925 

GGTACCTACTGTGTTGAGCCCTC I ICC ATCTCCTOTAGTTTCGTCAGATC 
CTGGAAGTGTCCCTGACG^AGAAGTTTTACAAAATGAACTTCGAACTGAA 
GTATCCCGATTGAAACGOAGATCTAAAGATCTGAATTGCCTTTATC^ 
AAAAAG ACTTGTGAAATCTG AAAGTTCAGAG PCTCl ICI 1 1C1CAOACAA 
CTGGTAATAGTAATCACTATCATCATCATGTGACATCCAGAAAGCCACAA 
ACAGAGCGGTCCTTACCAGTOA C 1 1 0 1 CCATTC<}TTCCAATTCCTAGCT 
> Sequence 926

GG TA CCCAAACAC AA G ATTGCTAA T A G ACTGCT AA T A ATA G A ACTT AATA 
AATG AAAT AATTTATTTCATTTATTGTTGCTTXjGAATACAGAAAGTGCTT 
AGTAAATATTGAATGAATCAACAAAGTAOCTCCCAATATAGAGAAATCAC 
TTCTGAAAAGGATAAAACCAAGTTGATCCTATTCAATCGAAGGCATCTTT 
TGGGGCTGTTACAGTTA 1 1 ICC 1 1 1 ATTTGAAGAAGG AATATGATATACC 
T A CTTTGTTCCAAGTCACTGCTTATAA TCTGCTAATGGTACCT 

> Sequence 927 

GGT ACCTGTGAAO ACAGCTACACCro O 1 1 1 C CTC C CT C ATGCCTTG ATCC 
CCAGAACTGCTACCTTCACACC^GCTGGACCACTCCCAAGCTGTGAATCTC 
ATCTCAACAACCTCAGCCAGAGTGTCAA 1 1 ICTGTGAGAGAACAAAGATT 
TGGGGCACTTTCAAAATTAATGAAAGGTTTACAAATGACCTTTTT1AA 
ATCTTCTGCTATATACTCCAAATATGCAAATC<aAATTGA^ 
AAAAAGCATATGAAAGAATTCAAGG l I I 1 G AGTCGGTTCAGGTCACCCAA 
TTTCGAAT 
> Sequence 928

CACAAGAATGGGAGGATGCCAGTTCCAATGCTTTGTAAAG 
- CCACA TTGCAAAA CAAACAAAAAAAAACGA GAACGTTCCCGA G TO rGCCT 
CCAAAACATAAAGGAGAAAATCATACAGAAAAACCTCATGTAAGOGTTGG 
AACTTGACCAACCAGCTATCCAAATACAGAGGXXiAATCCICOCl IAGCTA 
GGG<^TGGCCTGAGAGAAGCXXX"1 1XJCI OCT TTCAGAGCCTACAAGTAGT 
CCOCAO 
> Sequence 929

GGTACTT AAGCAAT AAATCTGAGCAATTATCA GGTTA TTTT A TTGCA TTT 
CTAATGAG I fCITCTAAAAAAAOTCAATCAATTATCACIGCTATATATCT 
rCTOTOTGTAAGGAGTOCI I GAGAGTCTTTAA TTGT AACATTTATT AAAT 
AAGAATAAGAGGACATTITTAAAGGAATTAAAGGAACATTAA ttocttca 

TAAATGTATA GIOI I I AAGCTCTGCTTTAAAAGG Il I 1 HC A IOIGC I C T 

TOOGTAACCACTTAGGGCTGAATTCATAGTATAAATATCAATAAATGTTG 

CAATCACAAT 

> Sequence 930

GOTACCCXXXntKXlAAAG<MAGOATGACTCACrTACTCTXiAAA 

ecAGGAAGGAccTtnTxx^mrrrtxiAGccTT^n^ i icrocTOTCicic 

ACTOTttXtXXACCTCTACCATGATGTCCT^ 

OOOATAOTOOCTTGCOOCCTrcCTCTCCCTTTTCrOOCCACACTTGCTTC 
CTTCTAACA 1 1 1 IX r r O O C TC ATCTGACATGGAAGGCGCAATOOTTAAOCC 
AGAAGGG A GGGCAGAA AACAATCGOCCCACA TOCTGCCTCTGOCTCTGAC 
AAGCTGACT 
> Sequence 931

ACCCOGGATTTAGAOACAGGCTCTGOCT C1 1 1 fOCCCAGGCTGGACTOCA 

gtggaacaatcatggctcactocagcctcaccctcctgggctcaagagat 

CCTCCCACCTCAOICl CCC1 AATAOOTAOAACTACAOGTOCACACCACCA 
COOCTOCCTAATTT AAAAA TTTTTTTT A TAG A G A CAA GGTCTCACTA TCT 
TCCCCACACTGGTAAACTATTITTAAATTCGAGACATGAATAATOATGCA 
AATCA rOCTTTCI ATGGGTCTGA ITCI P1 1 C I UCTAOCTTATTCAAGOAC 
TAAA 

> Sequence 932

GGTACU I 1 11 n I I n I II 1 1 1 1 Ml I I I 1 1 1 fO O AT I II 1 A GTAGACA 
OGGGTTTTCGOOGTGTTAGTCAGGATGGTCTOCAPCTCCTGAGCTOCTGA 
TCATOCG UG1 1 G QCCTCCCAAGTGCTGGAATTACAGGCATGAGOCACCGT 
ATCTCGOCAGAG AAATTTTTTAATATAAA 11 1 I ' J ' I CAGTTACCACTTAAA 
OOGAAATATGATTAAAAAAACTAAATAAAGAAGAGCTTTAGTAAAACCAT 
GCXXTCTTGCT AATCTA TTAAGAGTCAAATCTGAAC 
> Sequence 933

ACAOTATO 1 1 1 Cl- ACTTA TOGA GAGATAATTACGTAOT AAA CATAGAAAC 
ACAOGAACTGAAAGGACACACACCAGTATXAGAAXTAAGTCAOCCATGCKJ 
GAGGGACAGAAGGAAATAGGATGGAAAGGGaTTaAGGGACTTCAACTCTA 

tttctoatgttttag rxcm aaaacaaaaatctaaatgacatttgaaat 

ATCAAAATGTCAACAATACTT AAAGCTGAG 
TTOGTCTCTGCAN 

AOOCAGTATATGAGCAATTCCTCAGCAG I G 1 1 IG GATATAOOGAOTOGAT 
AGCTATTATTAATTGCAGATTATTTTGGAAGGAAAAACACACAGAGAATT 
ATGTATCTTTCAGTGTAAATGTTAGTTCT AAAAA CAATCATATTATTTAC 
AAAOCTOCAGTTATAGAACAGAATTCTGA TTTCTOOCTCACOOOCAOGGT 
TAATACnrTAAAACATTTOCTACGTTTCATCTGATAGTCTTATTAAAAAT . 
AOCTOTTATTTTT AATAOCT ATACTAAAACATAAAAATOTTTAiQOOCAOO 
CGTO 

> Sequence 935

GGTACCTAATTCATAAGATAAGGATTAAATGAATTAAAATATATAAATOC 
CTTAGATAACAATGCTAGQCATATGTTAAGCACTATGTTAGTATCATCAA 
ATO T TO T IO f 1 A CTOTTATGGAATTTATCACAAATATGTAATTATATGTT 
TOOTACTGATTATT CAT CAOOOCTACTG GACTCT 

TGTCrATTTGGTTTACCACTGT ATOCrcAACAACTCCTOOTTGTOOCTAT 
TOTAOGTGTTAGGTATTAAGTOCATQATAGTCAATACATAAAOGTTA 

> Sequence 936

CCTACTA CAO ATT AAGT ATTAATATGCTCTGAGTGCAGATAG AGAACAG A 
AACAGGCTCTTTGATTTCAXX1ATOGTCAATTX7ICT : 

ggaggttgtgggaacatatagacagtgaccaaacitttaatgaatacaog 

AAGA TTI 1 C * I G GAAAAO ATOACATOTAGCAGACAGCTGACAGACGAGTTT 

accaggttcagaacttaagtgataa taa Tern TI ATCATAAAATTTTAA 

GTGTGGTAGAGAATAAAAGTTTTaAATTAAATXrrTGAATGAAATGTGTTA 
TG 

> Sequence 937

ACACTAAAA>TAGAATATAAGCCAGTCAAATCAAATtXnTGGC1t^CrrGA 
AGAAATAACAOTCTOTOGOCAACTOO I IOH III l AGGTCACCTCACGGO 
ACAGATOGTCCXTAAGOTOCAAAAGAATOAAl rCCTOCI OATATATOACT 
CATAAGTITCTCTAAOGGGOCACTOAOCATTTCAATTDOCAAGGAACATA 
AATTAOCTTTTAGCCIGTGTATTTACACACAAATATGCAACCTGCAAACT 
TCTTCTGAGGACAGATGTCAACT ACI 1 1 1 IC A 1 1 1 1 1 1 1 1 1 1 I ACAGTCA 
AAG 

> Sequence 938

ggtaocaagtatacttcaocagatatctatagaacattocactcagcaac 
agcagaatccagcagaatatata i i c n C I GA ACTCTATGTGGAACATTC 

TCCOGGATAGACCATATGTTAACrrCATAAAACGAGTTrCAATAAATTTAA 

aaggactgatatcataccaagtatgctctctgaccagaatggaatgaaat 
tagaaatcaataacacaagaaaatttgggaaattcacaaatattjtagaaa 
ttaaaaaacacactccttaaacaaccagtgggtcagaaa^ 

GGGN 

> Sequence 939

COCTTAOCAiSOOOCOOOCWCXlAOOOOCICr iC I ro CATA CICI 1 1 1AATT 
GGATATCOCAGTGTGTCTCAGTAATTTOCAGTG<XrrGTAAAACTTTGAGA 
AA l 1 1 rGTAGCl 1 1 1 AGAAAOCACATA OCTG T A TTGCCTGA TTGCTT A TT 
AAGTGATCIXrTTAGAGXnTTOCAAAGTTAlXiAGTTlXjAGTTTACAAGTGC 
AG 1 I I I 1 1 1 (XA TGAAAATTTCAGTGGTGACAAATTATAOAATTTATCAT 
TCAATTCAGTCTTAACTAGAAATAATTGCATATAATAAAACAGGTTCTTG 
ACTGTTCTTTTT 
> Sequence 940

ACTGGCACTTOCATTTrGTAAGTGAAGOOCAGAG^AGCAAAGAAATGTGC 
OCTAOGTCACATAGCT AGTOGGTGGCAGAGC I it IGATTGGCAGOTTOGTC 
GAATGOCTCCA AAGCOCTOGAC L I IUJC ACTATACTTCACGCATCTCTAG 
AGAAGAGACAGAAGTAGGCAGGATGAAGGTCTTCAGGTTTAAGAAGAACT 
ATG AAAAA OCA AAAGA 1 1 1 HON I IDG I GO! II 111 I A CTATAAAGGAA 
AACTTTAAATAATAGCAAGAGTGCTATAGGTAAGATATCAGAA 



GGTAOCTOGTtXinTGAACTTATTTCGGGACAGAATTGAGACOGAAAAATT 
TG ATATCAAAGG AAGTA TCAAAACCCTTCATGTGGTTAAGAGCATGGAT A 
GTGAAACTAAC C 1 C 1 OATQTATGGTGAG AGA GCAAAA GAG AAAGGATTGC 
AAAGAAACTGGAATGTAGAGGATGAACATATTGGTAATAATAATACTGGT 
GGAATTGTTATTCAOGAAAAAATAGCAATTA TT O CT G TT CA TATCTCAAA 
TCATTGT ATOI IGTI I A TTTAAAGGGAGACATGGTAG AAGATATCAAATA 
TAAAAAT 

GGTACATCAAAATGG CIG 1 1 1 1 rOOOCACATTAGTCAGCTCTGOA II HO 
CATGTGTGGGG CI 1 1 I I II I I O ATAGTTA 1 1 lOI 1 1 1 1 IATTTTAAAAAT 
TTATTTTGCCAACOCAGTAG AGAACAGCTGAGCA I CI fCl CA TGT ATTTA 
TT GOCCATCTGC AI I HC TO C TO CT I A TTGGOCA TGTA TTTATTG GCCATT 
TGOOCTCIGCIGraAAATGTCITAAATrrnTGOOCATI I I1CIAGTGAT 
AAAACACTQAAGCACAI 1 1 1 1 AAAGACTTCTOATGATTTTTATTGTC 
> Sequence 943

ACTTCAGOAOATACATTCroCTAO 1 1 IGGOOTOGT O IO f IV I A TAAATOT 
CAATTTAATOCAGTOGGCTTATG A 1 1 1 IIA OI IIIX1 ATTCTTACTOATT 
AATGTGTATATACTA G 1 1 C 1 0 T I A CTAAGGAGGGATGTTAAATTAATCCC 
TAGCTGTAATTGTGCATTA G 1 1 rOTCTCI mC AOC TOl ICIA OCTTCAT 
AAATTTTTGGAOCTGTTAOGTOCATATACCTTTAGOATTA 1 1 1 HilVl I V 
TTGGTOAACTAGAOCTTI 1 ATCATTAGGAAACT 



GGTACAAAAATCAACT n CC I 1 1 1 TA CTATCTGGAAATAGGAAAATGTTC 
cattca ctatggtg acaaaactgtaaaatagg aa tatatttctg aggaaa 
gtataggtatitacaaatagataaactatattcttagatgagaaf actta 
atacccactttacaaaattaaaaatgaattacagctttttaaaaatagat 
taagctgggtgtgatgacatggcacctatagt/cacaoctacrrcaoaa 
TO AGGCA gg a gaagcacctgagcccagg a GTTTGA ggctctagtc agct a 
TO 

> Sequence 945

AOCTGCAAGTCC A AAGAGGAOCAGG AGG ATO00CX3CCA AAAGA AGGGTAA 
TCGATGGGACACCAAAGTTATCAGTCAAGTAAGGCAGAAATGCTTGAATC 
AATAAATGT A T A T AGA TA G AAAGTAG A O ACCTTGATAAAGTCAAACTOCT 
TGCCTTTACAA G TG I G I G Tr CA GCAGCCATGCAAGGGAGATGCCCATCTG 
GCAGTGGCCCAGGGCAAGGTGTCAGAGCCCT AGTGGCAGGGAGATGGGAT 
CCAGATATGAGGGAGGGTGACATGGTGCTAACTGGGCATCTACATAGGGC 
AGGG 

>Sequence 946 

ACTGCA TATTTAATGAA TTA TTTTA TAAA TTOCTOTTOTGAAGCA TTTGT 
OAATOACCTGCC 1 UC I AGCTTITAATOCTATTGCOCACGCTG A C 1 1 1 1 A T 
TGCAA CIG I'I T T ATGATACAG TTTTGCA TTOTATC IG IT TAIT I 1 1 1 A AA 
GAAGCA I I I U. rO OGAO GI rTCTTTTTCTGGI I ATG AAAATAATATATOC 
TTA TGGGGAAAAA TTGGAAAA T AGAAACCAGTA TCT A G AAGAAAAA TCAC 
TCATAATTCCAGCACCCTOTTAATA l 1 1 IG HI 1 1 III I ACAG I I H IAA 
TA 

> Sequence 947

GGTACCAGTAGATGAGAACT ACTTA TTTAGAGTGGCAGA OCA TGCTATAG 
AAACAAAA T A TG ACT AA TTCTAA CTGT AGTT A TGTT A T A TT AGCATAGTG ■ 
A OA TA GTAA CA TT AA T AG AA TTCCTT AGGTGGAA TTTCTTTAATGCCTTC 
A GTTTCAA TTTT A AAAAA A AA AAA GTGTTT A TG T A C A A GA GGG AGTG AAG 
OTTTCTTAGAGGTAAAOAGGOTGAGATTTGATGGTAT. rriTTTA GTTAGG 
ATGA G A T A GT AG AGGT AG AGGTTAT AGGG AA TGT A GGTTGT AGTTTTTT A 
TTTN 

GCGCCI 1 1 CAGCGGCCGCCOGGCAGGTACTCATAT1TAATGAATATTTTA 
TAAA TT OCTG TIG I GAAGCATITGTGAATGAOCTCOCTCCTAGCTTTCAA 

TGCTATTCCCCAGGCTQAC 1 1 1 1 a ttccaac tctttta tgat aca o IHI 
GCAi rGTATGTOTTTACTTT TT AAAGAAGCA11 1 OC1 GGGAGG ITTCIT1 
TTCTGGT1 ATGAAAATAATATATCOITATGGGGAAAAATTGGAAAATAGA 
AACAAGTATTJTAGAAGAAAAATCACTCATAATTCCAGCACCCTCTTAATA 

cinoic r mcnA CAGTT 

> Sequence 949

accaagaactaaattgtgataogataggtgactt atgagtagcacagaat 
gtaatagogocatctcraoctagi ii iooii accaca u rctgtcaaogt 
agctoggagaq«ggtgtctacttattcagcacatc^ 

AACTGAGCAGGCAGCCAATGAAAACCGTGAG C ITfCTTl ACATTAACTTT 
CTGAAAGTCAI I III 11-1 lA TTCCA ll I lOlOCCI 1 1 II 1 1 A AAAGCTGC 
AGCTTCATGGAATTTAATCCTGGTATTTAAAACACTT 
> Sequence 950

ACTTGGTAGGTTGA rCTCI 1 1 IA TTCTCATOGTTTAATTACCATCTATTC 
ACTOATTACTOCCAAAACTOTATCTATAGTCCAAGA l 1 G 1 1 rCTAAAAQG 
TCTGCAOCCACATATGCAAATAAATAOCAGATA ICTCTC1 I GO n AT ATT 
GCACATATKTCAAACTCAATA><GTTCAAAACTGAATTCA I C 1 1 00C OOCT 
AAATGTA U 1 1 MC I ICUXCIll 1 1 1 O ATAAAAGOGATTACCAAAAACC 
CCAOCDGOCAGOTTAAAAAOCTOG1 1 1 O GAAAAATTT ATTOI 1 1 1 1 1 1 AC 
CCTTTTTTAAAAGG 
> Sequence 951

ggtactcttaggaaagagtaatgggxtitgaggatggt^ 

cctaali rctctcagat 1 1 i i i fc agaat a1 1 1 igoatggt pctctcact 
1 1 IO 1 1 A TTAAOCATTTOQGAAO AAG ATTCTOCAOCCTACTCAGGTOAGC 
CAATCTCATGGCATTGAACAGAGAAGATA 1 0 1 1 I rCACGTCTCTAAOCAG 
IOI I 1 1 IC ATAGTGTAAOTCAGGC C I I ICILX.1 1 1 O ATCTAAGTGGAACC 

aagaggttagatactoo l ' i rrrcn i agitata taatgggcttcatgt aa 

CTA 

> Sequence 952

GGTACACTCTGT AC<»TCTACA GOT AAAAA GCTATTACGTTOCAAACA TTA 
TAACGTAATGTAAGGTCTGC A TTACATGCCT AAAAA TCCAA TG ATTCTTG 

gaaccatcaaatctgttaagactgaaaagaataccaatgtttaaatatat 
ctataaaatgcaggtcaaggggcrraagaaaatttxaacactagaaaac3ca 
ACAAACTTAGG I IG I III a acatacatacacaaatacaggagggaogttt 
ATOGGTCACATCTCOGAAACA 1 1 1 II roC CAAAAAOCTOAATTTTT 
> Sequence 953

CKTTACC^CCAATAATTATGCCCACAATTTTATCCTAAATAAG actgattc 

OCTG rrocrm i ctacagaaca i o i i h i o pdgocaaagagaataagaa 

AA CATG A CCCCTCCATCCA GAA CCAAACT AAA CTCAGGAGTGATT AG AA T 
CACCTGTGGGCATTTTCtXXXAAACCACX^ 

AGCOCTCTT AAAGAAOCTACAGC l lll IIOCATTOCCTATCTOAAAOCAA 
TAAA 

> Sequence 954

GGTACCAGATGTTCTAAAATTTACTATAATTAATAGGAATTAATTAATGA 
ATGOCAAOGGOCAGAGCCAGACTTOCTATGATAGTI IOCI I GC I A TAAOGT 
GCTATTTAWN O 1 1 1 1 1 1 A CATTTACTOCATAGTAAG l ' IGI IO II I O AGAA 
AAAAAATGCCA OI 1 1 OO I O CGTAGTAGATACGCAGAGGCTGAGAAAGGAA 
CAGATTACCCATTACOCAATGGTTACAGAATCTATAATOCTTOOCTTTAA 
ACTGGT 1GA I I IOI 1 1 1 1 1 1A CA 
> Sequence 955

GGT ACCTTTAAGOCAGATTTJATGGTAIXjAAGGGAGCAGCATAGCAGCTOC 
ATTGACCCACATGCGGGXJL 1 UH I I O OOCITCATCAGCOCTTTOGAGTCT 
CAG ATDCCTCA CCTGTT aaagga ga g taa T ACTACOCACTTACC 1 1 1 1 IO 
GO HO I IGIOAAACACACATAAGACAGTATTAGGAGAAGTAAGGTCTGAG 
GGCTCGG C I I IC GAOOCAOCGGC OCCTA OGTAGAGOO CnnTGA ATTQQA 
TGACAGTGAACTTTGCAGCATTTCCTAACCTCAGA^ 

GGTA LI 1LTOC1 1 rATTCAOTCT AOGTAAOAAATGTAATCGATGTGTGCA 
GGTGACATAATTTCAGGGGATAAGGTAAAAATTAGATGAAGOOCAAGCAA 
ATATTCTTAAAAAGAAAAACTTAGGA 1 1 1 1 1 1 1 1 1 ACAAAAOTTAACTTA 
AAATGCATTATCTAGAATAATCTTATAAATCAAOGTATAGAGAanTAGT 
GAATAOl ICCCI I lATTAGGATGTTGAAGGAATATGGTTTCAATATTCAA 
CAAAT G TC C T G ATGOCTATAAA 1 1 1 1 rCTACAAACAAOAOTATOTT 
> Sequence 957

00C1 I A GOGGCCCCCCGGCCAGGTACTIGAGCAGATACATITrTGCrAGTT 
TGGGG I G G I GTGTTCTATAAATGTCAATTTAATOCAJGTCGGCTTATGATT 
TTCA G TTCT A T A ITCTT A CTCATT AATGTGTATATACTA G T TCTGT I A CT 
AAGCAGGGATGTTAAATTAATCCCTAGCTOTAATTGTGCATTA O 1 1 IOII 
llll I I IA G IIOI I C T A GCTCCATAAA III I IO GA OCIOI I A CGTGCATA 
TA COI 1 1 AGGATTA 1 1 1 l Olll 111 roGTGAACTAGAOCl 1 1 IA TCATTA 
GGAAACTGTCCATATAACCACT 
> Sequence 958

CXXT I rGAGOGGGOGOCCCGGCAGGTACTOCATAATATAATCITTTAAAT 

GGGCAACTTC TAAA TATTGATACAACCATTAATAATAATGCTTATAOOOT 

AAAAGAAAATTTTTGAAGCACTGAATTCAGTAACCTOGGTCATGGT^ 

1 1 1 1 Ol 1 GACTACTTCATATCTTTTATGTAGATTATTCCTATAAACATGT 

TGCCTAAATTCCACATCAGTlTGTAAAGTCAATGGATrAAATTATTCAAA 

TGTACCTATTTAACGGTCAGTAACAATGCCTAGAAACCTATT 
TTTAAGTCCAGGCTGGACTCAAACTCCTGAAGATTGCTCAAGCAA ICTTC 
CXIACCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGATGTC^ 
GGTTCCaGCTCTTAAAAOAGTTGTCAC 1 U roGTGGGCGAGOTCOOTCACA 

tacacatataattataaogtaaaaaatcacaactactacaacaaaogtgc 
aaacatttatoaoaaaaccaaagaagggan 

> Sequence 960

GGTACTCCAGOCrGGGTGACACAGTGAGAATATGTCTCAAAAAAAATTAT 
CAGCAOAACATAATATAGACCCCAAGGCTAAAGGGAACCATTA TCATCT C 
TACGOCTQ AA AGCCTAGGAG AGGGTGCTGTATGG AGAGGAC 1 OC I ICIGA 
CAGAGOGATATAGCCAACCTTGGTGOOCTAATAOAOAGOAAACTAGGCAA 
TAGCTTCACCTTCC 1 1 11 V I AATCTTCTGCTAGTATCCCTATTAATTTAG 
CCT AATTAG AAGCTGGAAGGTAGGAGAGCCTCCATGGGCAAAAAGCTGTG 
TAG AG AACATGG ATCCTG AGGOGGTAAATGGCAGATAATCTAGCACAG AT 
TGG 

> Sequence 961

CKJTACTCCAGCCTGGGTOACAGAOTC 

taggcctg aaagcctaggagag<kntjctgtatgg agaggactgc i ictga 

cagagggatatagccaaccttggtggoctaatagagaggaaagtaoggaa 

tagcttcaccttccttctctaatcttctgctagtatooctattaatttag 

cctaattagaagctggaaggtaggagagcctccatgogcaaaaagctotg 

tagagaacatggatcctgaggggctaaatggcagataatctagcaca 



GGTACTTGAGAATATCATTGTAAATTrGATCAGCAGCTACAACATTTCAA 
TG ATGGATA I 1 H I J 1 1 IC AGATOCAT. IUJ I 1 1 G ATTGA ATTT AAAGT CA 
AGCTTGTGCTTCTGGATG G ' I " I GC 1 1 1 G TCAGTGAACACTTGG ATTT GGAA 
AATACAGCACCTOG O I IOGI 1 1 I GAG AOAAAATGGTTTCAACTTTATAAT 
TAC AGTTTTAACCA CCACAACAA CAAAATT A GG ATGG T A GTG AAATGGA A 
CTAAATCAAATGCAAGGTTTTAGTTTAATAGAACAATGTCATCCTTTAAT 
AATCTTTAAAGAAGAACAACTTAATAAOCAATAACA AAATTGAA ATAGGT 
CAACTT 
> Sequence 963 

GGTA CTTG AG AAT ATGATTGTAAA TTTG ATCAGCAGCTACAACATTTCAA 
TGATGCATA 1 1 Tl I T I T T CA GATGCA VI OC TT TG ATTG AATTTAAAGTCA 
AGCTTGTGCTTCTGGATGGTTGCTTTGTCAGTG^C^ 
AATACAGCACCTGGGTTCKjlTTTGAGAGAAAATGGTTrCAACTTTATAAT 
TACAGTTTTAAiXACCACAACAACAAAATTAGGATGGTAGTG AAATO GAA 
CTAAATCAAATGCAAGGTTTTAGTITAATAGAACAATCTCATXXTTTAAT 
AA TCTTTAAAG AAGAACAA CTAAATAACCAATAACAAAATTG AAAT A G 
> Sequence 964

aca ctgcat aaa gccagagtt aaaacttcactgcca g<xrrctgaacag aa 
ggctgttctatdcacacratcacaagac3ctggrrggagttgag<jcaactgc 
tgaattaccatacagggaagaatgaattx1aagaaaattcccatgcaagat 
agoctcttaaaaaataaatttacacaagaaaatcagcactgtaaaggtaa 
ttgataagcccaatagaagogaaaoctatacaaagaaatagaaataacta 

AGCAATCTG AAATGGA CTTTAAATAATG ATGT 
> Sequence 965

ACACTCCATAAAGCCAGAGTTAAAACrrCACroCCAGCCTCTGAACAG 

ggctgttctatccacactatcacaaaccitjgtxk^gttgag 
gaattaccatacagggaagaatoaattcaagaaaattcccatgcaagata 
ggctctt aaaaaataaatttacacaag aaaatcagcactgtaaaggtaat 
tg at aagcccaat agaa ggg aaacct at a caaa g aaatag aaataactaa 
gcaatctcaaatggactttaaataatgatgtttacaattctctaagagga 
aaa gg agcattagca tca gtg aaacaaaagt a gggct ata gaaaaaacaa 



TACTTATOAAAAAACCAATTOGAAATITTTAG ATGGAAAAGCGTG AAATA 

AAAAATTrjAACACATCGTtrTAAAGAAT*J»AC^^ 

AATTAGTTAATTTTACGAAGAAACAATAAATtnX^CAGAATGTNAAAGAG 

ATAAGATATTTAAAATAAATCAGAGTAAAGAGATATTAAACTATATACAT 

TTOAGTATATAAAATCCATATGGTGATATOOATACATATATATA OCAGAA 

GGAAGGACAOAAGAGATACAATATTTOGACAGAACATGGCrAATTTTTCA 

GAATTATTAAAAGACTTGAGCCCTTCAAACAGGTCCAG<iAGT^ 

CCGG AACACGCTT AGGGGCGATTCCAGCACACGGCGGGCCGTA 



ACGCGGGTCAAAAGGATGAAAA 1 U 1 1 1 fCTG I LAG AATGAAATTCAAOAA 

AACTTAAAGjGAAATAAAAACTATTTAOCACOCAGTCAGGTAAAAATOGCA 

ATCTCTGGTGTOCAGTCAGTTACCAGOCATGGAAAGAGACAGAAAAACAT 

GAGCCATCATGAGGAGAACAATTAGCAGAAACCAAACCAGAACTGACATA 

CATACCAGAATTGGGACACAAAAGGATATTAAAACAATAACAACrrGCGTT 

CC ATATGTTCA AAAAGTTAGAA ACATG A A AG AT 

> Sequence 967 

ACGCGGGTCAAAAGGATGAAAA I G 1 1 1 I C 1 GTCAGAATGAAATTCAAGAA 
AACTTAAAGGAAATAAAAACTATTTAGCACCCAGTGAGGTAAAAATCGCA 
A TGTCTG<jTCTCC A GTCAGTTACCAGGCATGGAAAGAGACAG AAAAACAT 
CACMXATCATGAGGAGAACAATTAGCAGAAACCAAACX^GAACTGACATA 
CATAOCAGAATIGGCACACAAAAGGATATTAAAACAATAACAACTGCOTT 
CCA T A TGTTCAAAA A G TT AGAA ACA TGAAAG A T A CAAAAAT AA AATCAA A 
CTTCrAAAGATGAGAAACTGTAGTGTTTGAGGTGAAAAATATCKrrAAATG 
GCATTA 

acgoggooggt^ 

gccagtgtoccaaaggaaggggtttocatggtaacctcaatggatacagt 
cagctgacgtctggcaccgcctgtgctggtgtg^ 
cctcggccctooctcaatoctttcaactatatttattag ncTcrn AAT 

GG AAA GT ATA TAATCCCTTAATGTCAG ACCTTGA GTGCCACTCAGCTTT A 

TrAATrrATTTAGGTAATAAATrTACCTTCXn'AATTAATTCTCAGTAGTC 

CTGGGAGCTOTATTATTTTAAACIATCTTGCACAATGTC 

A^GCGGGCGGTCTGTGCCCCATCACCATTTCTAAAGCACCC^ 
GGCAGTTjTCCCAAAGGAAGCGGTTTCCATGGTAACCrcAATG 
CAGCTGACXrTCTGGCACCGCCTGTGCTOGTC 

CCTCGG<XCTOOCTCAATOCTTTCAACTATATTTATTAG rTCTCTTl AAT 
GG AAA CTATATAATCCCTT AA TGTCAG ACCTTGA G TGG CACTCAACTTTA 
TTAATTTATTTAGGTAATAAATTTACCTTCCTAATTAATTCTCAGTAGTC 
CTGGGAGCTGT ATTATTITAAACATCTTC<lACAATGTTTATAGTTCrcCG 
TGTT 

>Scqucnce 970 

GGTACX!lAAGATTATGATAGCCTCTTAAAACAAATTXX»AGGTTATAACXnT 
TTTCTATTCTCTGCAA(^GTGG ATATAGCiATTGGAGTTA 1 1 1 1 U IC TTA 
AG IT IH GG<TTGAAAACTAC<CCAGTGAAGTCATTGTGGOTTTC<}ATT^ 
TCTTTGTAOGAATGG I ICCl T A ATrTACTAATATAGCTTTTTOCAAAATA 
TGTTAATGAGTAATTATTCCAGGGG 1 1 1 1 1CIATTATCC1 1UUC1 IGIUG 
ACAAA1 1 11 1 1G1CIGG111 1 1 JG I 1 ACTTATAAAAGATATTGATTOCAT 
GCCTAATAAAGTG T ICTAAATTAATTTTATTTGGX1 ATATCTA ATTCTTTA 
TTTITCCAAATATACGAATTCCTATGTXTATATTTATTT^ 
ACCAAGTGAATACTTTTAAAATGGTTCTTTAAAG 
> Sequence 971

OCTACCAAOATTATCATACCCTCTTAAAACAAATTTO 
TTTCTATTCTCTGCAACAGTGG ATATAGG ATTGGAGTTA I ll 111 ICTT A 
A G 1 1 11 1 GGTAGAAACTAGCCAGTG AAGTCATGTXKKrrTTGGA I 1 1 III I 
TGTAGGAAGGTT(XrrAATTACTAATTAC<nTTTCAAAATAGTTATGAGAA 
TATTCAGG 1 1 1 1 C I AT 
AAATTTGTTCATCTATAATTTTAATATTTTTGGTATAA 1 I i I I ITCAAAA 
TAATCTTCTATTTATTTACAAGACAGGATCTTAATGTTTAATGACAGGAT 
CTAT 

> Sequence 972 

OGTACTCCAGCCTGGGKKJACAGATroAGACCXrrG^ 

TGATTATGAGWmGANGAAGGAAAAGAAAAGAAAAGAAAAAACAAGAAAT 

TAG CTCA TG AAT A GCCAGCCTT A T A TT A T AATTA TGTG A CA CTTTGG ATA 

TTTCAAAGCACATTCACAAAG^GTATGTCACTTAAATACCTCAAAATTTC 

CClTiTTATACATGCAGATCATTCCCCATTC^ 

TGTGT 

> Sequence 973 

GGTACTCCAGCCTGCGTGACAGaGTGAGACCCTGTCTCAAAAAAAAAAAA 
A AG AAAAAGAAAAAGAAAAA AG AAAAG AAAAG AAA A G A A A A A AC A AG AAA 
TTAGCTCATGATACM3AGCTTATATTATAATTATGTGACACTTTGGATATT 
TC A AA GCACA TTCACA AAGTG T ATG TCA CTT AAA TA CCTC A AAATTTCCC 
TGTTATACATGCAG^TCATTCCCCATTCAGCCCTGGTATGCACTGAACTG 
TOT 

> Sequence 974 

ACAAAGCTAGAAGCAGCCTG<rrOCAGATGG<jrATACAAACCCTAAACTGT 
CT ACACCCAGACTTTATTCTTCTACAACCAAATTCCTCAAACACACAATT 
CTGGAACAGTAGCCAGTGAAAAGGGGAGTTTTAAGGGTGG<XKjTGGACKJG 

aag aaggg atttaatatttaatggtttatattagctctgtg atggattta 
tgaattttgttcgtatgt aatcaa tgtctgtgaatattct 
aatcttattgtatgtatataatctaatgttccgtattcgctattttgata 

TTAATAAATC^TATAmAATTAATGGATAAATTCAAACATTCATCCATAGCT 
TCTG TCT A T A CA GT AACAGTATTTTCTA TATA GTT AT A TCTCTAGTCATG 
CTTTTTTCTTCTTATGAATCTTTTAATCGC 

> Sequence 975 

GGTACGCGGGCTACCAAACXrrGCATTAAAAATTTCGGTGC<iGGCGACACA 
ATGATCTTATCTCTAACCTCCGAGCAGTACX:ATG<rr 
AGCTTrrCTACATACnTTGGAAGTTGGGTAATGTGATTCCTtrTAG 
TAG C T CTUn Gill I CACTTAGT ATT A CTTT AACTA TT AGGGC I ' I CI I II 
TTGGTTCCA T AT AAATTGT AAAA T AAAATTTTTTX1AGTTCTGTG AT AAAA 
TCTCAATCGGTAGTITGATATGG^ATAACCATTGAAATCTGTTAOCITTC 
CCCCTGGCGOTCCGCTTCAAAAGGGCCX^ 
GGTCCGTTTACTATATTGGA 1 1 rci I A 

> Sequence 976 

ACCTCTCATn , GTCACTTTTCAACACTTCCT^ 
TGGTCCTC^HTJGGTGATCCAGACCACACTCTC^ 

CAAGCTCOOCTA CTGTCI 1 1 IC ATTTATGTI^GGCAGGGGAAGAACCTC 
AAAGGG<riXTrrcCATXXXlAGTt7TCACTTCCCAAAGA 

caggatgtggggacag<iaactitgggg<^axgcggctr^ 
caccaggagggcct aaattctagaaagoagactcctttatttjggtgaaat 
gtttggcaactgc<;aaaagatttgcctc«^ttc 

ggtactttaaaaagtaaacaaa1ttaactgaagcatg 

GATTCTTTGTAGA IT f T CTGGAAAGTCTTG TT rOTnOTATTAAACATTA 

ACTCTGCTGTATGCTCTAAATACACTGCTAAGATCA^ 

ACAATAATACCAATTCATATGGATJCTTCAAATTAOTCTTA 

TGATATGGTATTATCCAGCCAACTCACTTTCAGACTG^CAAAATAT^ 

ACTTTAACCA GGTGATTCTTGCA I Id 1 1 O GTTTAAAACCTCAAGTTTAA 

AAA T A TCTTTAT ATTT ACA TTTAATTGTCATTAATCA 

> Sequence 978

A CG A CTTCA CAACACCAACCA CAGGTCTCAA GGTCAAAAAATGAGCTAGG 
AGTAAAGTATCTGCTTXAGAATCTaCCOCCATOOCAGAAAGAGCAAOOCA 



ACrGTGTXXrrcAGTGGCTCTTAGAGTTTAAGACTCT^ 
ATTTAGAAAGOGTGTGGACCAAGGCATTTTTGOTTAAIGI reTCTAAAOC 
ACkXrrGACTGCCAGGATTTCAAGTCAGTGATAA^ 
TTTTTTTTCCCCCGCGTAOCTCC^^ 
TTTCCAGCAACACTTjGCGGGCXXXJTTACTAGG 



ACCTGGCAOCAGAGTAGGCACTAATATGTGTTGAATGAGTAGXnXJAAATA 

AACAAAAACCTAATGC^XJATGGAATTTTATGGAAATAAGTAAACTTCATT 

ATTGCTGAAAATACCGCACiATAAATAGAGGGAC<XlAGTOTAATAGAGTGG 

AAAGAC<^AGTAGACCAGGAGTCAGACAGTCGAGGATTJTT^TTCTAAATTT 

GAAG<nXiAATAGCCATGTXXKrrTTAGACAGGACTCrGAACCACCTTGTTT 

TCTTATCTGTAAAAGGGGGAAGTCATAATAGCTACTCCT5CCTAACTCAT 

AGGTTGTTGAGAAAATGAAGTGATTCA 



ACATTACCTTTTATGTATCCTGGAATAAGAACTTGTGTCTACATGCATGT 



GGGGAGTGATGGCX^CTAGATGACTG<jGGACAGG<XiCTGGTGAGTGAGCG 
CAATTATCTATTTAAACAATCAGAAATGCTTJOCTAAATTACAAGTTTCTA 
GTTAAATGCAGTAAOAAATTCCCCACAAGCTCTGCAAAATAAGTTCTGTC 
AATCAAATCTTACATGATOCATTAACTCAGCTATTTTAAAATACTACCAT 
GAATTC ATCTTT AAA G TGTG ACTTTGT A A AGCAG A T AATCCTCCTGTT 
> Sequence 981

GGTACAGTATTGTTOACTGGCTAACAGAGCACCAATTAATAAGCCAAAGA 
AA TCGCTCTTT AA CAATG AACA TTTCTGCCATCAACTO A CAG ATCCCA GG 

aataaatgttttccagtgaggagacttctctc<:ttttcagaac^ 
gctgcccctxxxxaccccatagaag<kictatccctccaggtcaggttagc 
atcatcaoctagagccaacaagtcaagoagotgatg xtittocc tttgaca 
tctcta cccag acca g actccactgag aa g actctccc i 1 1 1 i tatcact 



TATAC 



GGTACTTAGATCAGATGGATTCAAACATCACAGCCCCATTTCATCTGGCC 

G<*TTAAGGTCCTCATGGAATGAAAAACACTTIlX!kK3CACTCTCCT 

AGAGAG AATGG G 1 1 III 1 1 A ATTGCCAGA T TG rCT G AACACAGCCTCAGC 

TACTTCTAGGAATAAGACGAAGCAGTG^G<iAACnTG<XlAGTTGAGTGATT 

CTTGGOGAAAAAAATTAGCATTCAGTGCCAGCTCTCTAAAGTGTtXi^^ 

TOGATTCTGGTAGAAGOCAGTAAAGAAAC G ITT T CTCl G GAGTGGAAGCT 

AGTAAGATTTATTC 

> Sequence 983

GGTACAGTGACATTTCAAGACATGGCXXAATGCACAAGCAACT^ 
GCTGTAATTCACGAGATTCCTCAGOG P0C TCTA AOCTCC1 IGAQOGCAOA 
AACTTA 1CI I IGlA TTACAGCTAGCCTTCAATCAGTAOGTGTTGAGCrGA 
ITTTCTTT1 ICCl 1 1 1 1 AAACTCAGAAGTTAAGTIOCAGCTTCAGTCGCT 
ATGCCGAGATGGTCTGATTCTGAAGGACAAGAGAATTCAGTGGCATAAGC 
CCTGTTJCTrGGCATGTAGTAAGTTCrCACT 
ACTGAC 
> Sequence 984

ACrrrTAGTAAAGATGGGGTTTTGCCATGTTC<JCTAGGC" I GG I CTOGAAC 
T0CTX3AOCTCAGGTGATO CA00CA CTTOGCOCT00CA AA 
ACAGGTGTGAGCCACCGOGCroGGOOG^GGACACTAl'ITIIJ'lUCI'riUG 
AAGAAATGAATCCTA O 1 1 1 IGOI 1 CAG AAACTGTCAACAGCA I IOIGCCT 
CTTCTATGACTACTAAATTTCAAGCAAAGAGAGCT^ 
GCAG GGOTATrCCCCCKCT TCAC^CAATQC IU rOOC 1 1 ATCAOGGCAGAC 

> Sequence 985

GGTACTTACTTAA I HI 1 1 I II 1 1 1 1 1 1 1 1 AGTAGAGATGAGGTTTCACC 
pCTAJSoomjii 



A T O T f GGOCAGGCTGCTCTCGAAC 1 CCT O AOCrCAGCTGATCCAOCTOOC 

TCACOCTCOCAAAGTGTTtXXMTTACAGGACTGAGOCAOOGCAOOCAGCC 

TGTGTGTG 1 1 1 1 1 II ACTT AAAAA 1 1 I 1 TAAATTTAAATTTAAATGTTTA 

ATTtlACAAATAATTTTATATATCCGGTATAATGTCAT<jriTTQATGTATA 

CAT tO I rO TA TAOOTTOTAATTOTATACA T T OT OI 1 G 1 ATACATGGATCT 

ATACATTGAAATTATTGTATCCAGAAAATTAACATATCCATX>OCTCAC 

> Sequence 986

OGTACATCGAATACATAATTTTGAAATCGAGTCAGGO C 1 1 ICL I A ATGAT . 
CCA TTTTGT AA TTCAOCTAACAGCTGAGG AAAGGTCCA GAG AA GGAAGAA 
CTCAAGGTTAGTAGACAAACTTGATATrGAGTTGCACTGGCTOOCTTCTC 
1 1 1 1 I GO 1 CCtX 1 AAAOAOTATTTATCATCTTAG ATTCAGCTTAAGTTGT 
GGACAAATATCAAGGGGAAAAOTATTTACAGTTAACGTTGGAATCACACG 
CTTTl C GWGGGTTGTGC C 1 1 1 1 1 ACOCTTCAA C 1 1 Hj GTGGTTCTAAAGA 
CCGACOATTATTAC TT 0C1 1 1 CACTAAGOAGGOOAAGTTCATGATGGAGC 
AACT 

> Sequence 987

GGTACCTGGCCT AGAAAATA 1 1 1 1 1 1 1 1 II 1 G AAATCOAC T C T C ACTGTG 
TCGCCCAGGCTGGAGTGCAA TTGGCGCAAATCTTCGTCTCAAAAAAAAAA 
AAAACAAAACAAAAATAAACTTTACTCAAATATCA C ITTCTG1 IAAATGT 
TCTTAAI ICCnLAATXAILCUJClCl 1CTAACIC HJACAGCACTTTCTT ' 

OOCTC1CTO0CAC1TCO0CAATCACAGAGTTCCTAAAGAAOCAGQACTAT 
GTTCTACTA OI C 1 1 1 G T A OOCACTOCACTCOOAATGGTGtCTAGTACCTG 
COCGGGOGGOCG , 
> Sequence 988

A CTCCTGTTTCT A CAAATTT A TCTT A T AA TAA TTTGTC AA A TGTTG AG TG 

cacagatttattcattgcagca : i iu 1 m i catatcaaaagatggg aaa 

CA TTGTGCAAACAATGCCCATCAGTAGTGGATTOATTAAATAAATTAGGT 
A TA TCCAA T AA TTG AA T A TT A TCCAAGT AT ATAAAAAA TAA G AATCATG A 
A TA TGG AAA G A TTTCG AAAAT A T ATTGCT AAGA TTAAAAAA AAGGAAGGG 
GCA G AAG AAAAT AAGTTGGGTAAAAAAAACCCCAGAAATGTTT ACT AA TA ' 
ATT AT A TTT AAAAA CTCA TA GGA T AAA CAAG A AGGT AA TG A AATAATT AA 
T 



OOCTTAT 



1 1 1 1 1 1 1 1 1 1 1 1 

tttggta g agacaggotctcacac1 ttgn occagggctgg tctcgaatt 
cccgggactcaagcaatoctoccgtgttagoctcocaaatttjctagggt^ 
ataggtgtgagocacoctgcccagccta tgi 1 1 a tttcagatgttcaaaa 
caacaaacaaaaataacacactagaAaaaatgatcaoaoaatacotgtta 
aatgagaaatagttcagggctitrataaatttctgaoctrcacocttocc 
cttag t cc t tt tl ci ccataaactctaattacaaa 1 1 1 1 ici 1 ccacagc 
aaaaaagaggaataca hl1ici oaa 

> Sequence 990 

OGTAOCraiQATTOTCTG TCTTGAGACTATTACAGAGCTCCAAAAATTAA 
AATAAAAATAATAATTTTACAGAAATACATATTTGCATTGGAATATTTAA 
OAAAGTTO AGTTTGG ATGCCA CAAGATTATTGGAGTAATAGGAAGCTGTG 
CACAGTXXjCTCACACCTOTAATOCTAGCA C 1 1 1 GGGAGGTTGAGGCAGTG 
AGGCATTAGGA 1 1 G 1 I OGAOCCTTGGA G 1 1 1 1 UAGACCTOCCTGGOCAAA 
CACAOOGAOAATDCT PT CT TC 1 fCATTAAOTAAAATTTATAAATOOAATT 
TAOCTQG1 IGOTQGC1 [OTTTOACAAT 1 1 1 1 1 1GGTAATGCTTTGGTGTA 
CCTTO TA 

> Sequence 991



AATAAAATTAAAAAAACCAAAAACAAAGATTAACAGAAAACAAAACAACA 
AAAAAACTOOC^GCATATACATTGAGTCATTTGCA GGTTTGGGAGGGGGG 
OAAATGC1 1 1 1 1 10 1 ATTAGGAOAAAGGGAAGTTTTCATTTTAAATQTCT 
ata ttacttaaagtt7gcaataaatatttattactttcaacagtg aaaaa 

aattacaagagagaatatgaaataggacagtaatacagactatagtaatg 

ctagagaacataatgaaaaacaaagaaaatatcggggggaaaatagctta 

AATOCTAATOCAAA 

tttatacgtatatatgtgattcatgaattatcctattagctgtgttagta 
gtttatgtgtagttattaatgaaattgaggttgagatt^^ 

AGAGTTTATTTGAGGTT AGAAGTTATCATTTAGGATCTACXjCGTAAGAOG 
TGTTTTGCGACCCG 

CCCrTTCAGCOOCO^^ I IL I C I TO G 

C 1 1 1 IX I O TATOTAAACCTTACTAATACTTTTCAGTCACCAACC' i i c i o r 
GTCATTTCtnTrTAAGCAAAAAGGAGAAGTAAAGTGGAATrroGGTTTCA 
AGAGCCATGCTTTTGGCTTTTTCACAAAGAOAG 1 1UCICI IAATAAOGCO 
CCTGGGTGT ACTTTrCCAAACACCTTTATTTTCT'ACTTGACTGTCtrTXjGA 
TATCTTGGOCTTTGAAAG ITGGTT1 AATTTTAGTAG AGGAAGAGGTGTTG 
GACTTTGGAGTAG7An"AAATCTTTACCCTrrrOGOCCGTTGGAACCAOCT 
GOCTTATGGGGCXXtAATATTTCCCAGACCACAACCTGGGTCGGGACTCGT 
TT A A CTT AGTTGTGTA 1 IU LXJI G1 GGCATTOOGGTTACGOCAAAGCTTTT 
GGTCCGT 

GGTACCAA G TTGn C I CAAACTTTCA TGI I TGI OTATACAAAT CAGCT OA 
GGCCTTCACTAAACT ACAGATTOCATGGCCTGGCCCTCAG AGATTTTGAC 
TCAACAGGTtnGAGTTtKXIACrrAGAAATATGCATTG<TAATAGGCACCCT 

gacaattccgatgtaggtggtccttagaacatattttgagaaatatattc 
tgtaotctoocagataaagaattctt/ 

COGCTCGAAAG 
> Sequence 995



GGTACCATCA ICTCTT1 C CC I C I GGTTATAAA TCTTT AATGAAAACGG AT 

TTAAAAAGTCACATTATGATtKTIXXLAAGCTCTGACCTC^ 

GAAGCAAAAGACATG<XATAAAGATGATATTTCCCACAGGAACGATATTA 

G A A TT A TG TGA TOCAATCTCATCCAA GGTCATGGT A TCAA ACCA G ACACA 

GCTAANAATOTATCATAATAGCAAGCATACAGTAGCAAGGATGGGCCTCA 

ATAAACATTTAAAGTCGAAAAA ITCTTCTC1 A ACTCATATCAAGTACCTG 

CCCGGGOGGOC 

> Sequence 996

ACCAAAATAGATAAGOATC CTOl 1 1 1 1 1 U AAATGAACCCCAG TTGCOCC T 
TAGGCATTOTCAG7TGGCTCATTTCAAGCCAGTTOTAATATGG 1 1 1 1 1 1A 
TTCTCTAAATTTCGGGACCTGATGCTAAGGAATGTGAATATACAGTTAG 
1 TCC 1 G COAACCC1 CTGT 1 O G 1 T C AAAAAOO C f GG T G GAGGGAAATTTAT 
GACACTAAA TGCTTATATTAGAAAAGAGGAAAATTOOCCG A(XlACXJGTGO 
CTCATGCCTGTAATCCCAGCA I T IT GGGAGGOOGAGCCAGGTGGATC 



GGTACTTGGCAACAATAGCTACAAAGGATAGGATACrrCAATTGCAAGTAG 

ACTTTTAAAATAAATTtlACTTACTTCTATTCCCAACrCA 

T ATTGGTGAT AGTGAAAA GACCAGACAGA TGACATT ACTTCCAAATTTTA 

OCAATCTAA 1 1G 1 1 1 1 1 ACTCACACCTGTAQATGTCACrTTAAAAATGTG 

AATATTAA I I ICI 1CA AAACTACTCCAATTTAAOTAATOAGTTAOAGCTT 

TGOCAAOCATTAAAOC' ICICTOTI U0CAACTCTAACAATATGTGGTAATO 

TCTIXXCTGACTTCATTTrATCTTTACACAAA 

G GI 1 1 IC IA CA I 1 1 1 1 1 ro GATATTTAOCTO CI I li I A ATTTAGTTTTATA 

TGTCr ro TATTACAAAACATATTATATTCAAGAATTTTTAACACTTAGAGT 

AGAAGTGAAATTACAGGTTGAAGATTATTAAATTAGCCATTCAGAAACCT 

TCCAAAGTGTCCA T AAAAGGA T ATATTTTA TCTG AATGOTCTAT ATACTA 

TAGAGACAAATATTTAGGAAACAGG<XCATGGAGTTTGGGCACATTTTTA 

AATTCCAAAAAGGGGAAGTCAAAAAGAACTGCTGAGTAACCCACAGAA^ 
TCTAOG<rrGATTT(XXTGAATOCTGCAAATATTCAACATCT 
GTTTAACAAI 1 1 1 J CCA OCT 



TTAACTCG AAAAT 



GGTACXi IG 1 1 1 1 AC 1 1 GO I GCTG I AGGTAATOCTAATTCATOATAAATTT - 
TGAGAACCACTCTAGGGT AGT A 1 0 1 1 1 1A-AACAGTTT AGGTCATGAGCAA , 
CCITGAG>AATACACTTTTAATt>TOACItA^ . 
OOTOTGA CTTAG AC G T r C CA TGAAACAATGCTTA TCTTACA G 1GIU1 1 1 1 
CTGCTCTGGTA TTTTT ACTT A T A TTCT A TT AAA TA G ATATGTGTGT AT A A 
A CTT A TTG ATA T AAAAATGTGGTCA TCATOCACT AAAG TG ATTTTACAA G 
OCACTAATGGG 
> Sequence 999

GOT AC i 1 1 1 1 1 1 1 H II 1 1 1 1 1 1 1 1 1 1 1 IUI II i H 1 1 1 IM 1 1 1 1 1 1 1 
ll l lllllll l l ill l liXA CTTGG GI 1CICC1 1 1 1 1 1 A TTATTCGGCAA 
AATOATAAAAACCTAAAGCCTCTTTATATAGGGTTTTCA TGGCTAG A GTT 
GT A T AAA A CTGCA TTTTGTCAGTTTG AA T AAGCCCA TTTG A A TG AGTCA A 

attrrttaaaagcxrrcgagatccaacaaagctggaaaaaagtaggggtgg . 
gggttaanATOCttcatttgagatgttggocttcagt^ 

AOCAGAACAAATGGGN , . , 

> Sequence 1000

ACTAACTGAATATTTATTTAAAAAAGCATTAATTTATCTATCTATATAAC 
TAAATCTATCAAATATTCTTTAAAACACGAACCAAAGTTAATCTGAAACT 
CTTCCIGTGAAAAAACrrT^TGTATTATATXXXrrrcAACACAGA^ 
ATTA1 1 roCTX nO OCATTATACTATOOOCI I r GT CATATO CI IM I I fOC 
CATAGAGCA 1 1 1 1 1 IC OCATAGAA CI I IG1A TTCCTCCA C1 ICIA CCACC 
TTTCTTI CAAGAACTCTTATTTACCA1 TTCTI GGACTAAATTAGGAAA 
> Sequence 1001

GGTACOC AGAAT ATOGTATATCTCTrCATTTATTTAG C 1L I 1 1 1 1 AAATT 
IGI 1 1 rOGTAATATTCTGTOATTTTTI tl TTY1 1 1 1 1 rGGTATGGAGGTC 
TTACA ICTTTTPl A A AATT TATTC CTAATA C I I I O CA TTTTGACATTATC 

ATATCAGTAA II I 1 1 I A TATTTTA A I L I lUlA TCCTATGA CI I I GC TAAA 

TJCATAT ATTAAATAGTroCTOCATAGATTCCTTAAOATCOCAGACACAG 

CTGTTTG 

ACTACTTGGCATTAATTAGATTGTGATCATAAGTCAAAATGTCAT^ 
ATAAAGTGGTCATXAGAOCATGCAGACTATTACrAATATTOGTTATGTTT 
TAGTTTATTGCAGTGA AAATA CAAA ATTTAAA AGTTATTTTTAGAOAATTA 
TCATAOOOOCCAAAAAOTGTCATTGG rOCTOCAGGAC ICTC1 AGTCCOCA 
TCCAAGAAAGACTG 1 OA TAA I IGI LAAGGGGTTAGTATGGTCTGAGCATG 
CTTGATO G TGCIC IOIC ATTCTCGTATTAAACAACCTOCCAAA IGIC I IO 
ATTACATCTTJCTAAAAAAGTGAGGGGAAGACrTCT AGGACAAATGCAAAAT 
AAAATAACAXIATTTAGCTATACTrTTAGTATTlTTTATTATTGAGATTT^ 
ATA TTTAAGTGAOCGATTCAAG AATCTTTTA TA TAAAAAATGAA TATATG 
CAT AACT AACTGTOA T AAGAA CTGTGGA TGGA TAAG AACACTTTTTTGAT 
OTA 

> Sequence 1003

CCCrrAGCGTG G T C i aX XXGAGGTACAI CtGinC I U AAAGCA l tine 
ACTOAAOCAA I 1 1 rCTATAOCT 111 IC1 ICTA1 TCI 1 1 ICCTTA O C1 1 1 1 
GTTTATATCGTroCTATATTTTTCAAGOCTCATACCA 

A TGAT AAAACTTCATCAAAGCAT ACITCGGCAAA TTTCAA TTA TCAAGT A 
AAATTGTAAAGAAAAATTTTTTACTAGTTTGGAAATAGATCTAC^ 
GA I I I ILI llUCIiULUXXlULl I IOH ICIIGILIHCH.IU JU. il I 
OCTAAAAAC1TAATOGCTATCATTATCTTCAOCAAATTA G I O II I O I ATA 
GCCATAAAAATGTCaC 



GGTACTOCTGAACTT AAAAGTTGAACAACAAAAAAAGAAGGAAAATGOGT 
TAA T A OCTT A TTTJTAA TTA TTA 11111 f G OAAOACTATTTTTTATATTCA 
GAAGAAGTGTCAGAGTCAGCAGAAAGGGATTATTTCTCCATTTACCTACA 
ACAATXXjTTTTAAATGACTGGATAGATAGAAAT C1CTT1 CAACTTAACTO 
CTTAGCACATTGCA 1 1 1 1 ICTC1CT1 IC AACTTA OTI I ICCA AAGOATTA 
CTGACTrTTTACCTAATTTGCTAAOGGATGTCAGGCCT^ 
TCTCCTCAAATAAAGATACAACATGCTTTTACT G T G T A GGAG . 
> Sequence 1005

GGT A CTTCGGTA TTA CAGCGCCA CCC A CTGGCTA GAA GTCCTCA TAGCAC 
ATATCAGATGTAG<XATAAAATAGATGAATTCTTGAAATAAGGAATATAA 
CA CTG ACTA TTCTGA TTCAGTA GAACA T AAAAAATGTCTAACAAAACAGG 
AACCTAGACACATTTATATTATTTTCTACAAGTAAACAGAATATCTATTA 
GATATCTTCACAAGGGTT TTATCAAI 1 1 1GAAATCCAAGTGGATAATCCC 
CAAATGCTGTAAGGACTTAGATTTTATAGCCAAAACAATTAACACATAAA 
ATGCTATTACATATTTGG 
> Sequence 1006

A CAT A G TTCTOCrrOCA TT G GTOOCATTACAATCCTGTT^AAATOCTGAA 
CT AAAAA TGAAT ACCA TAGTGAAGAAATTACTTGTGCATGTGAAAGAGGC 
TGGTCCAACnXTTAATTGCAACAGGGATTTGAl ILI 1CI ACTAGTAGTT 
AGG AAAGGTTGCA TTAA T ATTT^GT AGTTAAAATGTXiCGATTCTAAATrT 
TTTXTTAATTTCCCATGAGAGAATAAA ITTTI I CAAAAATATTOCCAGTAG 
GTGAATGGCTTTAATaCATOGTATCTGTGAAGATGXJCAAATAAAATCACT 
> Sequence 1007

GCGO^CTTAGCGTCGTCGAGGGOGAGGTACATCTAAAAGGTGA 
ACrTTAAAATCTTTAAGATATAOATTTAA 

GTOCAAGTTGTT ACACTTACATATGATAGTTGAATGGCAAACXriTjAOCAA 
All ITTTOTA CTCAAGTGTAAAATAOOCAAAAGACTATTACAGGT GGT CT 
AACTGGGATTTTA 1 0 1 1 C I A ATATGGACAAA I ILI 1 lUA TTAC TTGCA AT 
TOCAOCAACAATTAACTGGGOGG I OCT ITTTI I OCT I rATGOCTTTTTGG 
TTGTG iX I 1 1 1 1 O O GGTTGGTCTGCCCT AATTAAAT ACCO I ILCCOCTOO 

>Sequence 1008

CTCAGCATCAOCTGTCT C ACA fCTGTC I G GGATCCTCAAACCTGA CC 1 1 1 
GT AAATTTOCACTAACTGAAOATTOT AGAGGAAAAAAAAAAACATCTTAT 
CGAA TTOCTGCTC1 1 ATAGCTGATTTTAGCTATTAGGAAAACATCCCAAG 
TTGAG C I I 1 1CI AI ICCI A OAATTTCAGA I 1 1CI I ICC 1 1 1 1 1 AAAAA TT 
TTAI C I UCI 1 1 1 A TAGTAGTAAAAATA 1 1 1 ICC 1 11 1 1 1 1 1 IO GAATGGA 
GGTCTTAAGCTCA GTGTCAAAAAT AAAA TCA TTTT A 
>Sequence 1009

A CC I ICI IGO CTACAG CG I I lA GCT CCG l I TGI 1 1 1 OCATAAAGATCTGT 
TTTCTGACTTCGCATGAGGGGTAGT ATOTTCAOCTTATTCTCACT ATGTA 
AATTACTTAGTAAATAATAGGAAGAGATGTTCAAATACAAAC 1 1 ILI GCC 
ACCAGA CCI 1 LACTCTATTGCAGTCA 1 1 1 ILlC CCACTCTCCaJULlLlL 
TOOCACI I OC 1 CT GA GGATTAU.' 1 IC CCC 1CIC I CA GCA 1 ITXTCTOTCA 
GTGG C ' l 1 1 1 1 1 1 1 1CCI I rGGCATOCAAACATGCTXAAGTCTG ICI I ATA 
AAAA^TAAAAAAAAAATTTATTTGTAOCTOGGOCXjGOAOCACCCTAAOGa 
>Sequence 1010

OOCrrAOO OTOGT O CCI I LC GAGGTA CIC1 1 1 ICA OATG AAAGT GTTOGG 
TCAGCTGGAACCTGTGAGTATGTGGT 1 1 1 1 UA ICTG TGACTAAACTG TCA . 
COCATTTOOCA C H 1 CTCTGCTTXXTTCAAATATCAACATTTTACCAGGTT 
rCTCTGT TGriGCCAAAOCTGTCAl ITTTATTTCG rOTO G CI ICI IO GGA 
aacttccatggcgcatttgatgggaatcaaacagtgaaaacaag^ 
tgcaacagaggtggca tcaggaacaaa tgogtovtaagaacttaccttgg 

TaGAAAGCAGAGGTTGGATGATAGAATGGGGAATGGA rr OC T OCl O G OGT 
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I cacao aaaTGaaTTAaI GCOaG aCACAAOGG i ACCAC TTAA TATTCCTG 

C7CTCCCTAGCATGC<7TCAGGCTCACTGC^^ 

C - 



CQC1 1 1 1COIOO ICCCTGCCGAGGTACTCAGACACTGGATCXnAAGAAAA 
TCAGAGTTATAGCTAGTOOCACTTATCAAGGGAATGCAQAGG rrrcTGTA 
TTCTGAGCA TGITCCTCTAATAGGATaGATAGGC O ATGTOOCA GCAACAA 
CTCCCaATTCGTAATGTCTTAAaACAAAAACaAGTTTTAT7TXXX>TTTA 
TGCCATGTTTCCAGCACAOTITCTCAGaGGGCTGTOCTCCATOCATTTAC 

tcaaggtctgggaatgatcattjgctacactatcttgcaggcaccatattt 
ggaaocrottoccactctgatggcagcagaaaacaaaagaa aoocaa aga 

TCATCTATGAGCTATTCACTOCtCCAOOCCAATAOTG<nTCACTTTTTAC 
TCACCAGAACTAGTCTTOCAGCTOCAOCAAAACTNCACGGAAGTTCAGGA 



GCGCTCGAAAGGGC 
>Sequence 1012

CCC IT1 CGAGCGGCCOTCCTGGCAGGTAO 
GTAACAGTGAGGGCATGATTAGOCATCTTTC^^GCTGATGTCTTGTGOA 
CAOCTCCCTTCTTAOCACrCTAACAGGOCOGTGTCAGCAGCICCOCTTCC 
TCCTOACAAOCTOOOAOCACAOOOGACAGCACAATCTGAAACTCTTACAG 
AT ACC AA CAGCAA CAAAAA TG AA AC CAGTTATGGTGGGCAAGCATT AATC 
TAAAATTTTTTTTAAAGGA 

ACGOGCOOGGTCTCACCATGTTGOOCAGGOOGGTCTCaAAT^ 
CAAGTCATOCTOOOCOGTCAOCCTOCCCAAAGTG CCAGG AT TATAa GCAG 
G AGCCACCGCGCCCAGCCT AT mUl 1 1CI IAAATITI1 TTGTI I I1CAG 
TCACX^CXATTTCACCATGCATAAATCACXACGGTTAA 
TTTGC C 1 TCTTn OCTOTGCACTTACGTTTTTATGTAGCCAAGATCACAC 
GTTGCATTTTGCTG CI I 1 <JC 1 1 AACACCGTCTAAOTCATCAGCACTCTAT 
TGTG A TG A TTT ATCTT A AAAAT A TTCCAAGCGATCATTTTT AG TAACTG T 
GTAATATTATATCATAAAGTTAAAACATAATTTGTCATTCAATTGTTGAA 
ATTTTTAGGTTACOTATA I T I TC T C TTATAAATATGTA AATAT GTTTAT A 
AAAAGTTATATACAG I 1 1 1 1 1 ATAAATCTTTGTGCATACTTTATACTGTT 
TCCrrAGCATAGAGACTOTOGAATAGGA 1 1 1 C TT. G AAAAAAGGTAAAAGT 
GTGAGTATGCATATATACTOOTACATATATGTTATTATTATAAAGGTAAT 
AA1 W Tl ' I ' ITnTlG GAGATAGAATCTAACTGCACCTCAAOCTGT 

TAACCTTGGGGTTAATAAATATTTGTGAAATGTTTGGTGAT^ 

CCTATAGCC 

>Sequence 1014

acttattcagacaagagttctgactctcatgcttgaggataagattatac 

ATTTCAGTATTACATTXjAAGATATTTCATTTTTAACCAGACTAACTTAGT 

ATATTGTTATTTTTAATOTOAOCAAACAAATATTTTCATAGAAGCTAATG 

CTOA OIll 1 1 IO ATAA 1 1 1 OO 0 0 1 ATCTTAOTCAATCOCAAAAAATTTAT 

TTTCTACTATTTACATATTATCCTAGTGGATATTACATTACTTACTGAAG 

OCTT TOOT TCI ATGTTTCATCTACTCAOACTrAATTCAGGAAGAGCTTCA 

TCCAGA rGTTTTOTTTATTTOTl IC r CGA TTACATOTATOAOATTTCAOA 

ATTrATGAGATCATAGGTCAAGTGAAAGGTCACAGTTGAGAGGTCAAGTA 

AGAAG<7rAAAATTTGTXiAAAOCAAAGAAATOACAGOACAGTGCCAAATtSA 

AAGGTCAAAAGTCAAGTGACAGACTCAGTACC 

> Sequence 1015

ACGCGGGAGAACCAGTGACA^CTGTCAAATTATTCT 
TTTC A l l i nU AA inillCtllUJIllU AGACAGGOnOIXjCIGITGCT 
CAGGATXXrrCTCGAACTCCT^AOCTCAAGCAATTTG^OGGAGCrCAAGTC 
TCAGCCTCOCAAAGTGCTGGGATTACATaAGCCATCGCAC [CTOCTOTTI 
CTGAATrTTTTAAACAAAT AAATATCAAGCAA TCAG A TGCCA AAN ATT A C 
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aaagaxaatcagtatcaaaaatttogagtttoagoocaggcaoggtggct 

CAGGXXTATAATCCCACK^CTTUiAGAAOCTGAGOOGGGCAGA i lAiawG 
GTCAGXIAAATOGAGACCATCCTGGCTAGCACGGTGAAACCCOGTCTCrAC 

> Sequence 1016

GGTACTATTATAATAAGTTAACATATTTOCOCTATATGCGOAAA ATGCTO 
ACTATA I I I H I PG t I G CI 1 1 Q GAACACTAT CrCC T CACA ACAOTOCTTG 
TCTACAG AAATGGG A AAGGG A AGG ACAC A I I 1 1 GO 1 1 IL IGCAACATCGC 
AACATTOGTAAAACCAGAAATGATCTGTXjACAAGAAACTAAAGAACTGCA 
CGAAATTCACTTCCATTCACCCIGGI IA AAG C 1 ICC I 1 G AATCAGAGATA 
AG AAACAA CA TG AAAAATCT ATTCCTTTTAG AAA A CAAGTCTTT AA COCA 
G AGGTTGGTTTA TTTrGAAAAGGAATTAGACTCTGGGCCCACATACCOCT 
CGTTCA AA ATAT AATGC ItjTGGTn CAACTCCTGCTAAATQTTOCTGTGA 
CTTTTAAGCAGAGAACTTCrAAAAC<3AAGTAACCTAGGGAGGGGCTGATA 
TAACTCAGACATCAATAATTCATTTTATTGGAAATAGGAGTAGTAGTATG 
A A A TGCTA GCAG ACTGTITCATTTGCAGGC AGGCATTTTCTAATTTAAGC 
CT AG AACAATGCAGTCAGCTTT ATTTGGCAAGCTAATATG AATGC AGGCA 
AAGCTGAATCGAAGGAATGGTTTTTATC^TCTCCTCTAATCACGCTATrc 
TTTAGCATCAGTTATTrAAGTCTGACTTACXXACCTT^ 
AAAGTGACGAGAAAAGGATTTTTATAACTTTGTACCTGCCCCG 
GTCGAAAGT 

> Sequence 1017 

GGTACAATTCAACTATCATTCT GG 1' I XjCGGTGGAAGATGGAGACT GGCT A 

TAAGCrrAGAAATATGGTTrGGGGTCTTGGATATAGTCATOGGTTG^ 

AAGGACTGGTGACAAAGTTTGGACTTTA«rrTGCAGACAGTGGGGAGCCA 

TTGAAGATTTTTTTGAGCAGGAGTGCAG^AATCAAAGCAAATTAAATTTA 

A AAAAATTT AAATT AA GGCT AGCA GGA TTCA GTTTTC A AA CTGGCCA GCT 

GTGGACTAAATCCAGCCTACAGATACATCTTGTTTGACCAGCAGAGAGGC 

TTCAAAGTCTTCA ATACATTGCCAACACTTAAAAATG AG aag attaaata 

TAAAATTTCAAGTlTCCATCATCTTTTTAA^TATTAGGAGTrCCAGCAAT 

GCCGGGCCTTTTXXCCCX}CATGATCACTGAC^ 

CTG A CCCCCA t A TGCATTNNTAAA G ATTTTTT AATTTTT ATGG A TA CAT a 
ATACITC^ACCTC<XCGGK3XXXXXrrcGAA^ 
CCGGCGTACTAGTGGT 
>Sequence 1018

cccttgagc<xxxgcccgg<x*ggtacxx:cg^ 
ttacttgggatgcatcagtqaacaaaacaaaogta rctgtcci i atgaaa 
tttatatcatagcagaggaagactggaaatgaataaataaataaagaatg 

GA G1 1 1 GTTGG AAGGTAATA 

TGGACKTTTrGGAGTGCTAAAGTGAAGGTGTGAGAAC 
CAG 

> Sequence 1019 

ACTTAGTTACTCCTTGCCCATAOACCTGTriTGA(XrrAGAAAAATTTCTTA 

TACGCAACAGATATICATGAAATATATATTAAAATAAAGCTTGAAGGGTG 

AATTAAATAAATATTTACTTC3GAAGCTACAGTGGGTJGAATTAAACAAATA 

TTTACTIXXjAAGCTACTTTATAGCCACTGGGCIGCAT 

TTCTTGCCCTITJGGAGTTTTACAACTGCTrAACAC 1 1 IX3TC I ATGCTAGA 

ATACATAAAAAAAAAAAAAAAAAAAAAAGTACCITJGGCCGC^ 

TAGGG 

> Sequence 1020

GGTACCTAATGCTTTCAGCOCAGGAGCAGAAAGAGAAGTGGGCI CI II GC 
TTTG A G AGTCTCTG AAAATT ATTCAATACCCTGGQ ACAAATTAAATO AGG 
TAGA 1CCT 1C1 1 lG AATTTOTTAATAAAGCATG Cl 101 1 TT G T C TCCA TA 
AAACAGGCTTTG ACCA TTAAGGTTTATATTTT AAATGGGTAAA TTTTATT 
G^AATACACTAATT/rTAAGAAAAGAATTAACTCATGGCTTAAAAGCAAAA 
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ACCAGACCITGGATTTCACCCATAACTTTAAGGCTGGTCATTITAACCCT 
GATTTGACA CACTCTT ATTATGGTGTC I 1 1 1 CTDCTTATTTGGCTAAATA 
TTTCTGACCATCATAOCAA IC1 1 1 1 C I AT AAAGGAAGCAOGCAAOAGAOC 
TAGAOTGAAAATGTTAAAAACAAAACAAAAAAGACAGCATACTGGCTACC 
A G 1111 IC1 rAATTAAQATGATCTOn 1 1 L GCAATTGCCTAAATTAGAAT 
AAAATGTTATTTAACTCAAGGATATTTCTTX^CIXjAAAGAAAACTTACTT 
CTACATGTAAACCTGCCATATACTTTriCAATTAAGCAATOQATXAAAAO 
TTCTOAAAATGA 
>Sequence 1021

ACTTACAGTCTTAAGATATCCATACACOCCCACATOCGTCCTTTGTCCGA 
GAAOATTACTOAAAATTTAATTCCATTTATGTCATTOGATTTGTAAAAAA 
C CXJC 11 C I OGATTCAAAGATGAAGGCCTCACTTACTTTATTTITGTCATT 
TTCACAGACCCCTT ATG TA AATGCCTCAA G AGT AA G A A TCTTGCTCA A GT 
GATITrrGTATCTCCAATTjGCTA ACAAGG AGCCTG ACATA GAGTAGCTGC 
TTGGT AAA TATGTGTTtLA TTCA TTCAACAAATACGCCCAAGGTAACCTTG 
GCCGGGACCACGCT AAGGG 
>Sequence 1022

GGTACCGTGTGGG<XACTAATACATAAGCATCTGTGTrGGCTGGGOGTAG 
GTGTAGG<KKnTJCTTGGGGAGAOATTTAAACAAACXX." 1TICCICI ACTTG 
CAACA TCTCTTAAAAGCTrTGTCATCATGTT ACTTCCTA 1 1 1 CI I IAGAGT 
TCATTTGTTTAAG ACCG AAACGTGCTTCATCTTGTTCXX," 1 III JCTGCAT 
rCTTTGI AAACTTAATATTCTAATTAGCCCCAACACGGAAAAGAATGTAA 
CACAACTOTCTTAGTrGTGCCATAGAGTTAGAATCTATCTATTAACAT^GT 
TTT AGT AATAA CAAG A G AAATAATAAAAACACACCTATT A TGAGACGCTG 
CCCATGCCAATAAATTTGAAACATTACCAGGAATATAAAGGAAGGAAGGA 

aacaggtgoagacactgacaacaacaacaaaaaogtaggaaaaataaata 

GCAGAAAGGAATGGAGGTTAGAAGGAAGAAAGAGAANAAGGAGGAGGGGG 
GCTCAAGGAACTGTGGAAAACAAGTGTCATAGCrrrACCTCATATTTAATC 
' TTGTCACCTTCATAATC C I O O C1 1 ACTAATOCAO AGAGAGAAAAATTTCC 
TTCTGCTTAAAAAGTTOTGAATGACTTGGAATGGCGTTACCaAATATTTC 
CGCCTAATAAAATTAATTGG AAAGCATTC AATTTJTAC ATA GTAAGG ATA 

AGATTGTTAACCCAGAGTTT^AAACAACTTOOAACATGAAAAOCCTTT 
> Sequence 1029 

ACATATATTTCAAACAACATTrrCTAAATTAATTAATOT^ 

ATTA rO TT3TTC 1 ItX X^ C lTCT A TATTCTCTArrTGGGGAAATAATCCCA 



TCCCATTGCACA TAAATT AA CTTCTAATCCTA OCT A CTTATCTTTG AA TC 
CA CTCI 1 C 1 A T/ITGCAGTGGCAATACTTAGGGCTTCCTTA C 1 1 1 1 1 ACCA 
GGA CT ATT ACT AG AGCTTCCT AAATGC f FT C I A TCTGT A QGCTT A C 1 CI I 
CTGCATTCT A 1 fTTCI 1 CA AAAACACCAGACOTA A TT. C ITCC CAAACTOC 
ATATTCTGATXATTGTCAACTTCCATACTTAAAATCrcTATAATAGCTCC 
CCATAOTACCICCGGTCGGGACCACXKTrAAAGGCGAAT 



GGTACCCACAATGG AAAGATGA'l C I I CC 1 GCATTGTGAAGGTTGTTCTCA 

TCAACCAAGCCTGCAATGACTAGACATTCTAAAGAGAAGAGTCATGGCAA 

TGGAAAGAGGACACATCCGCTTGCCAGGTCACTTCTATCAGTTGATGACA 

TGCCATATGTTATGGCTAGGTCAGCTrrCCACAAGTATC<^CA7GCAAAA 

TAGAACrrG OGAAAAAAA TCTTTOATriXX^ 

TCACTCrrTTTGAACATATrTGGTGA TGCCAAATTAAATAT AGGATA TTAT 
GTAGGT AAGTATTCTGTACAGGCATITATAG^KTTGAATATACAAGATGTA 
GCCTGAGTCCGA 
GTCTGAACATATACAAGTGATl 
TAGCTCCTGAATAACATTTATTGACATGTGAOCTGGTTOGTTOAATAAAA 
TGAATITGCCTTAATAAGGCGTTCATTN 



>Sequence 1023

GGTACTTGT 1 1 C 1 U QXnTCGGACCACTCTDXX^CTAGACAGCTGTATG 
GCCOGCTTXCTCACTCTCCTCAGGT^ 

ATTGTCTCAAACATTATATAGAACACACAOOCACCCATOCACOCACAOCO 

TCGTTCTTCATCCGCCTGG1 1 OJC TGCACTATTOCAGG ACCTACAGCAGT 

GCCT AGAACACAG AACATCCATTAGCAACAT1 TGTI I AATGAATTT ATAG 

TXKXTTAAACCTGCACAACTXrTXLACTTTG<XrrTOCTA TTAQ AAAATOCAAO 

GCCAGCCC^GGTGGCTCACACCTGTAATCCC^ 

GTGGGCGGATCACTTGAGGTCAGGAGTTCAAGACAAGCCTO^ 

GCGAAACCTATTCTTTACTAAAAATACAAAAATAACCTAGGGCTGATGGC 

ATGTGCCTATAATCCCA GCT ACT AATA GAGGGTG AGGCA GGA G AATCCCT 

TCAATOCCGOGTGGCAGAGGTTGGCAGGAGCCACAACAC^CTACTOCACT 

CCAGCCTGGGCACACAGCAAGA C TT C T G G 

>Sequence 1026

GGT A CTG AGGCT AA TGGTCTTAOTTGG GA T AAGG A G AGTGGGGAA GGGG C 
AGGGGG AG ATGATG AAATTCATTTATOCTCTGTGATCCT ATGGAAG AACA 
A TT AA GA TCA TGTTTCCT ACTTG A TTTTAGTTGCT AGTCA TTTCTTAA TC 
TAAGCACCCCCTATAATTTACCTATCTCATCATOCAAAATCACCATOOGT 
AATAATGTCGGGGCGGGGGAAGTCTAT ACAA GAAT ATT AAGGCCCTG TGC 
GTGAGCATOTCTATAGTTAAAGACTTAATGAGAAAGCATCAAATTGTGOT 
GCAAACAGCTGA AAGTAGAAGTAA ATCACAACGT AATAAGATOCAACTTT 
GGAGGAGCTCAAAGCAACAGATACOrrTTTATCCAAAAAGGAGTAAAAGA 
AAAAAATCGTCAACGOGA G TTC C TTCAGATAATCAAC TGATGA TTTCATT 
TG A A ACCA T AA TTA ACTAGCGTTCTTTG T A A A A T AAC 1 1 1 1 1 1 CCA TTT A 
TACTTTTTAATGTTT A TTACATTA C 1 1 H CT C I ATATATTTOCACATAAG 
A TGTCTT AATGTGTAAGTGGTA TAT 

QGTACTAATTCTTTTCCTC1 1 1 CCTAG ACCG ATTCTAG TTTGTTGCCTTC 
CXT1TTCCTCGGAAACCCCAAGTTTGTGGATGCTGCAGACACTCTCT 
CCCTGCATGCTGGGTGCCTGGCCAGCTGCCAGGGCATAAAGACAGAOACG 
ATCTGGCClTrGTCCTTAAGAATGAGGTTTOAAAGCCCCA Ai I 1C 1 1 U C AT 
GTTAGGTGAT ITCTTGCAGCIC 1 1 GGTA TCTGCA G AA TT AGTGTGAATG C 
TT AAAAAA T ATT AA CA GCTTT A TATCATG AAAGTTTNAACATGT ACCTGC 
CCGGGCGGACGCTAGAAAGGG 
>Sequence 1021

GGTACTATGGGTCTAGTGTTACTATTACAGTTAATCCGl CCT1 "1 G I OTGA 
GCTGTTAAATGCAGTG AGG ATTOGAGCACTGTCCACTGAA ICICTO IGCA 
ACAACTTAATGGTGTOGCA GGGGTAT OOGGTGTCTGGCT^ 

A TGGGTCGCGGCGG A TTG ATTCG<XIAAG ATGCTGCTACT AATTTCAGGAG 

AGAATACTTCCGTTGCTCKrrAGAAGATCACXXATGGTAGTT^ 

TGAGAACAGAGGGTGCACACAATGTTTTAGTGTTGCCATGAACXjG II 1 IGT 

ATAGTACCTGC^^GGTTXiCCCCTTXlAAGGGCAAATTCAATAACGCTGGC 

GGOrrCTATTATTOAGTATTAGGTTATCTACCAATCGTGG^ 

GGTCATGCT CT GTT G GCTAGGAGAACATTGGATATAGTTAOGAGrTGGfroG 

CAAGAT1TAGGCGTGGAGCATAGAGAGTAAAGAGTGGAGGGCTGACTGAG 

A GGGAAAA TTTT A GTG A TTTGGOTT AGGCTOT 

>Sequence 1029

ACTTAAACATTTNAGACTCACTGTGTAGCL I | C1 lli GAATOGGCAATTOG 
CTTAATGC 1 G 1 CCGTGAAAAAATAGOCTTTAACA ICTOl 1 IU ATTOAGAT 

ttgtgatacatagaagttgggaggaagatgtggaaagccctaagagagct 

ACTTGCCAA OCCCACCA TCAGGTCTGCCTCAGTGTTCCTACTCAGCACAG 
ACGAGGCCCAGTCTCATATTAGATAOTCI 1 lOA ATGCAACATAAACAOAC 
CACAGGGACTGGTATGTAGCAAATGGTCAATATATAATOTACATAGGAAT 
o AAATOGAAA TGATAGTGAGTTC<1ATTCAGAGTGTAGAAGACTTTATTTT 
AGAACATAGTGGGGGCCTTAAGTTGGTTACTTCCAACOGAGAAGTT^ 




TCTGAGGGTAAATGAAlTATTGAACTGCCTATAGGTTGCACAGCATACCr 
AT AGAAA TOGTOGCACCAG ATTT A TTTTGCGTTC AGAGCGCATGTGGCT A 
TTTAOOOTOTACCTOC1 1 1 ItXXXAAAGGGTGOCTAGCXjTGAGCTT ACCT 
TCTOOTATTCATGGAGGAAT 



_ IT A CTTTCAOCTCT A TGT AAACTCT A 
CTTA CI I I GO III I IIIA GGC II I 1 G A HII I ICA CAATTAAAOTAOTCT 
TTOAGOCTCAOOC P0 CTTT0 C1 CA TAOCTATGCTATGCOCTGCACACTCA 
CCCCAGTATAAOCraAGGCAAACATCGACrCAl I IUI 1 1 1CTAACTTTCA 

ccgattattctocatcattocctoatctocao r oTcn gaaaagcaattt 

TTCTOAOTAGTT ACTTOA I IUI 1COG lb I AOOTTCAOOOAG OCAAATCTO 
OGGTCATPTTA n CI Ol CT lOA CTCGGAGOAGAAAOCCACTGTlTTTTlT 
TACTGCCATAGT ATCCTCTTACCAOCTCTAOAACTACGTCGATGCftSTCT 
AATGACAA TTGG AGATACItXXjAGGGGCTGGACXiAGCAGGGTAGGAGTTG 
GACCT ACCCCaGTGOAATTTCTACGAGGGATGGAAATCGAAGQGGAO AGG 
CAGACTCGGAGGGTAGCATCGGG C C I rT C T G GGATTGCAGOGAGATTGCT . 
GTGGCCAGGGAOATTAGTOCTAAAGGAATTTTXXKMAACATGAATGGCTT. 
>Sequence 1031

GGTACCAI roTTTTCl ICAAAATCACAATTTAAATACTTOa rCATTTTAG 
AAATAATOGAGCCAOGTTTTCACCATTAAGOTGAGTCATTGTTCAGATAC 
ATTTOGCACT G TOC A TAGGTTTATGG C I I UC AA CMOH I A AOAOCATTC 
OCAGAGTGAGAGCIGATTTGOCATXXrrTATGAAGCTTTCAGGATATAAAC . 
T ATAAGAA TO A CAAACTACAOCAO7T0 AAAA TGTGTCTTCAGAT ACTCAC 
TTGCAACTDCCA TTTA TGTCTCTA CGGA TTG AG AA ATGA GGA TOGA GGGA 

CCAAA tctggcttggtcagt aac agtgt aggt a a cat a t aaata tt aa tg 

TTOGITGCAGTTAGTGTGaTACCrGCCGOGGCGGCCGCTCAAGGG 
>Sequence 1032

OGTACA OT G GT PTOATOOCAOCTCACTOCAGOCTCAAOCTCCOGOGOOCA 
AGCAATOCTOOCAOCTCAGOCTCOOCAGTAGCTtnGTITOCAAAGAAATTT 
ATrTATAAAACAGGTGTTGGGCTGGACTTGAOOOQ TOGGOCACAGTTTGT 
CAACTGOCATTCTOTAAGCTTAACATGTGTTAATTACTGCAATCTGAATA 
ACAATGCTATO ATATAGACA C 10101*1 CC1 T I T AATAGACAAAGG AAOOC 
AGGCACAGAAGGATTGACTAATATGACCAAAGTCACACTGCCAGTGAGTA 
GCAAGCCIGAGCTCTGAAGCATGACAGTTCACATCTTCGACGAGAGCAGC 
TrCTCAATOCTCTrTOGAOGQACCAGAGOOCAGGCAGTAGCAACGGCTAT . 
GAGGTGOTCAQACATGACCAGCAGATAAGCCC I G GGCAATTOGOCAGAGC 
TGGAGGGAGTGG AG AA CT AGCCA TNTGTG ACTTTG TGAACATCCCTGGOG 
AOTCTOOAATTACCCAAOAO ' 
>Sequence 1033

GOTACTAOATTGG G t o i g t o t a tt aagagaaagacagg agtcaaagatag 
ttdcaaaacttttgaacagaacactogatgaatactgtttactoagatoo 
ggaacacitagaoaaaaatgcaittggaaagcagaaatacgatcaagact 
tocai 1 1 1 1 oa tacattaagcttggta ffgl 1 1 aattcatagctatataga ■ 

OGTATTAAATTGGCAGGACA A A ATCATAGCTAGAGATAAAAATTTAOAOT 

tcaocaotgtaaaoatoatat1toatoocacaooatoga ci itcttctoo ' 
oatttoagtatacataga - -, - 



ogctataotgagtogcacaogog ioouatcaogogtgggcaggcattagt ' 
tatcooccaotttattxmgtgtgagcataccacaotactgattactotqa 

AGCTGAOTOOCATTTrATATGTT TATTO A TGTTTAAGA 1 1 1 IUOI II AA • 

caa att o tt c attttct1 rocoooti n i 1cik 1a toagtaa7 . tctttot . 
atattctogatcttgatcattatogattataaaagctck^agtatc^ ■> 
gaaaaaacoaa 1" i ' i ' i ci oocog accoaacaa ttta to actgacaaaaag a 
acggggggxxxjoctrtocactctttaaaaaaaaacaooooooocaoctcc 
ccctaagoctaaaaaaaaaag aaaacgcgggtgotggtt accaccttctc 
tctggcacttattogtooccactaaaaaaocatttgccataacttattca 




AAAAAAGU Gll I 1 1 1 IG IL I OX J IG TATACCAAOCCCTCGCAGCOCOCX^C 
TCAAGAG 1 1 1 1 1 1 1 1 1 A TTCGGTGGGGGOCOCCOCOOGTTCXX OOAACAA 
AAAATTAATIT^TTATT GC IT C n'I t^ AACAAOCACATTAATrACTTTTOO 
TTQAOTATGTCTGOOCAGOOGGOCAGTOG1 1 1 ICC I ICTTACAOCAGAGO 
oo pi 1 11 1 IGCTS1 f T O OCACACAAGAATCT GG TT G AATCTCOGTDOAAO 
AAAAT AATGTTGTTAAOTAAAAGCGGCCGGATCOG 1 GG 1 H. I UCOCCTTA 
TAAGGGGOGOOCTOGATOOATaOCCTTTATAAAACGOGGCCGOCGOOOCT 

ggttggcaggcaattaaaaattatcggtctctataggttgqgtogttgac 
caaaoooccattact 

>Sequence 1039

accatttaactgagtgaaaoctttacaattgaggggttactcattagcag 

GACCTGGG 1 1 ITCTTT1 1 AATCTCATTAACCCL » 1G I I A CCCATTTGATA 
ACAAAGACTTCAAGGAAOAA 1 1 1 GC 1 la aaaatctctoogagacagtaat 
AG CnCI I G GGCCTGACTG ATAAA C I 1 1 1 I GCCTCCAGCAATGGAAATOT 
GXWU^AAATTOCAOATGCTAAATGATCTGGCTTGGAC^ 
TAGTGGAGCCTTTCGATTGAGGCACAGCCCAGGACTGCICCAAGGG 

gca caacag at a 
tgtatatgggagtcgacccacgc 

gcacatatoctggacactcagtaocgcatccatgagggca ICTG1 GOC 1 1 

COCCTCTGTGGGGTTCTACAAGAGCAAGCTGAAGaCGTGGCAGGGGCTGA 
OGAGOOOOCOCA G ro T C CTG GGOCACOCTGGCAAGOAAQCTTJTCCTOTCA 
TCTT 

1ATGTGGT 
GACCGTAGAOCOCCAOGACA 
atcagcaaggcc 
ATCACCAAGAG 

CCAA GGGAGCGAGTGGOGCT A TG TGCTGGTG AGCACCG TOCGCACCTGTG 
CCAAGAGCCACCTGGACCAACGGCXXACCAAGAGCTOGCTC^ 

ACCAAGTGAACGTGGCTGTCACGCGGGC 
CTTCTGOGCTGOTTGG 
O 0C1 T I G GCGTAACCCTOCT G ACTTCI G GGAGGCTCAACA AAA LCC 'l 1 GI 
GCCTCCCCGCCA OGTOCGCOTCTTOA ggaagccaacta TGC CTI TC TGAA 
GAGOCC1TTTCAOCTOCAGGTCOCAGACTGG AGGGAAGATCAGGGOOOOC 
C 

> Sequence 1037

AOCATTTAAC1GAGTGAAAGCTTTACAATTGAGOOGTTACTCATTAOCAO 

QACCTGc u T munm aatctcattaaccoci ig u acccatttgata 

ACAAAGACTTCAAGGAAOAA TTTGCTCAAAAATCTCTGGGAGACAGTAAT 
ACCTTCTTCGGCCTGACTOATAAA C 1 1 1 1 1 OCC J IXA GCAATGGAAATOT 
GGGAAAATTCCAGATGCTAAAtGATCTGC<rrTGGAC^ 

tagtggagcctttogattgaggcacaocccaggactgcttgcaagggaaa 

AGCACAACAG ATACCATAAGGAGGTCTG A 1 1 1CIG AAAGGAATACCTTGG 

cccocgaacaccctaaggg 

>Sequence 1038

OC C1 rTOGACCGOOOGTTCGCCCAGGTACTTTCACTAI mTTAOCAACA 
AATTACTn IGACACACAGCACAATTTiATTTAACACTTCCAATTTTGGAA 
CrATTGGATAAATAATGATGGOATTTAAATAAAOCAATOCGATTCTACTA 
TTACAOCATAOO OT C T Cn OTAOTOCI C I 1 A GTAAAAACTATTGTOACAC 

rice i n - n i c i c ca aatattcogcctggaaagacctaaatacaatgcao 

GGATTOAATCAAATTCACACA 1 1 TIT! I ICC I A COO AAACAACAACCTTT 
CTTCCTTATATTTAACAAAAACTAGTA 

GGTACTT AGATCAGATGG A TTG AAACATG ACAGCCCCATTTCA TCTOOCC 
GGTTAAGGTCCTt^TCGAATGAAAAACACmCGGG 

rTAATTOCCA OA TTGTCTGAACA CAOCCTCAGC 
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tacttctaooaataagacgaagcagtoaggaagttgccagttgagtgatt 

CTTGCJGGAAAAAAATTAGCATICAGTGCCAGCTCTCTAA^ - 
TGGATTCTOGTAGAAGCCAGTAAAG AAACG TTTTCTCTGGA GTGGAAGCT 
AGTAAGATTT A T T CT G 1 GO I G ATGAAGCCATCTGAAACCTTACAAOCAGT 
OTOOTTOT A TCAOCATATOOOAGCTCACTOOCtCAGGA C I TTOG * . - - " 

AC7CTTATCAACTGTTTTATAGATGAGAAAACATT 

AAA TOT AGA TTATTTTA CTACAGA CTG AGAAACG AA TT AAACTA GGAGCC 
TAAGATA CI I 1CIO GAATTGAAATGATACATTATATATACCTATAAAGAT 
AATTGGCTATAGCTTCCTAAACTACAAATTCTCATA 

octatatcaattaoaaactgotattaaaattqagtattataaoacaatag j 

AATGTC v ' *" ■ 



tcaagtgtgcactgtttaoacatcatgatctaggcaaacagaattcctc 
cctgaaatatotcactagttagaaacattagaagctttx^gotaaataaa 
tataaaaaaocagtcaaccgt attctta i i fci icotcaoagaatcatgt ' 
green i rocm i a a c t iu-tuc i o gai icig gatggga g i totto aacat 

ATTAATCTCATTA 1 1 MLlUl A QAGOACAOGTTGTCCtXXX'l ICC1CATT 



GGTACC CTGC 1 1 1 oa ttatttccoaatccagtgootagagaaogtaaagg 

CAAOOGCTCACTOGATA T T T l l A AATTGTAGGGA lG I CCff IT G CTCTOGO 
TCAATTTTAGGATCAAATATAAAAGCACCTATAGCTCAGAGTATC^ '. 
ACATAAAACTTCTGAGATAC3CAGAAATTTTCCAAAACATGGTATAAACAG 
TATGAAACACTGGGTAGATAAAAO C 111 CT C T A AATC TTAAA OTGCTCAA 
ATATCATGACCTCA 1 1 1 1 1 1 AO'THT AGAAATCAGATATTTTTCTATTCC ' 

atatcttaaactttn r • • ■ 

>Sequence 1043

GGTACCC GI I lurU CATGGCTATTCCAAATACCCOCATCTTTATTTAAAA 
7GTATATATAATCAGTTACATAA.AAAGAOGTATGCTTAAATTCTCATGAC * 

tctatgg rro oAoe ictctgot tggagca ogca atagaaat g tctgt a at 

tcatttaaaaaaaaaagtga ci 1 1 IX. I A CCTTTAOATAGTOAOGACAATC .' 
TGTTAAjC TCT T f GTOTT U ATAAAAOCAAACATTTCAGGGCACGGTGAAAG 

aaatctctacxatgtataaggttatatatataocagaagcagtggagtta 
ggaccaaattaagatttgac 



cgtacataatotaattgttacatataa i igi iui ataccataacttacta 

I II I 1 ID mTTAl 1 1 1 IA TATATAAI 1 1 1 1 1 H IGGl 1 IUI 1 IGI 1 ig 
TTTTTT AAT AAACTGTT A TCACTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAGTC * • • 

>Sequence 1043

A C! 1 1 HJCm O G TT G I G AAI CI I OG AQGTTOCOCT C TCAOACTOOTGAGA 
tOO CACW TACCTCTtiCTAjQCTAAAOCAAOOAGAACAOACAOAOCCATAO 
ATACT1 1 rOCl I ACT AAAA I CI 1 Id 1 1 UAOGGTAGGGACTGGAGTATGG 
AACCrrTTCAOAGGAATGAGAGGGG CI I G I U ACGAAAGGGTAGAGGAGGO 
AATACCTOOCTGCAAAATCITACACAACrAACTAATGTCATAAGGCCCAiG 
OATOAQAAAOTaOCACTTAAC I C T n CATCCTCATCACACATAAAOCATT 
CCH 



AXlAOCACTTTCAAAiTrAGTGGAATATAAATCTTTCCA TTT AACAGCAACA 
TrCAAATATTTCCCA 1 1 L IGC 1 1 A TT A I ICCTCTCTGAAGGTGATACATA 
GAAATATaGGAGCAAACACAGCAATGCAGGCGCTCTAT 
TCACATAGATCTTAi 

TATTA4 



tgaggtagaaatogtaaactaocatgaoctcgaatcataatttaatatca 

TTCTAA 
>Sequence 1047

AZ1ATTATTGGTAGTATCTCAGAATCCTOCTTAGCTTTTGAGATAA 

gtcat^gatattttgggtaatatggccataog^atcatgcaagattga 
goocagtatttgcl 1 1 1 1 1 caatatttac1 it g i aaoaaoctgacactct 
aggtcctcaccacaccaaaacctocaacataaacttxaattttooocaac 
ttlatagaccaaaaaagctaaacaaaacaaaaaggaaaaaacccttttatat 

ACAATCACCCroCTrGTCTACATTTAA I HOC 1 1 C ATTCAKAATAAOCAG 
TCACTGC 



ACAACACTTTAAAAAG7GAATTATAAGCTA TCTGAA TATCTCAATAAAAA 

catttittaaataaaaacaattcccaaaggcctggaaattcaggaacata 
attcaaaataatttatogatcagaaaataaa'rcatataaag a tctgagaa 

CTACAATGTAAAAATATAGAAAAAAGTCATAACACTATTAGATAAAAATC 

TGAGCTGGATAACAAAGATAGTACC 

>Sequence 1049

AOCTATAAACAAAGGCATCATAAATAOATATAAACCCAGAAGAAAAGGGA 
TCTAAAGTAGACAGAGAAGaTAGGCTGaCTCTCCAGTTGCAGATTTTCAT 
TATCAOC TCATC ACAOCACCCAAACT C rCTGG I GATTTGCTATCCACATC 

AAAATGGCAGO TG1GI t OACAAGAACTGTCTTAGGTACC 






ACCTCTCATCTCCAAATCAACTAGACTCTTATGTTAAGA^ 

AAA AA A TCCA AACCCCCAAT AG AAAA ATCCCCAACAACAA CATA T AOCCT 

TAAACACAAGAATTGTATTATTCAATGAAAGCAATACAAGTAAACACAAC 

AGTTACCTTGGCTA ttttttcaatot acc 

>Sequence 1051

ACCCATCTCTTCCATTCTGGGAATCTGGC^^ 

GCITGTAGAATOAATGATOGAGTAGAATAAATAAG^AAGGAATATATCAT 
TAAATGCACA GGTTAAATAAATAAAAATCTATTAATAAA GAGCCT AAAG a 

aagaaagatgacatttcagcacatattoggtgaaataag rrum agtcc 
agcacttctcaa 1 1 1 1 1 agtgo a t a tg tg aa ttocctattaaaatgcaaa 
ttttaaattagttaatctgggttcqacctga o 1 ct g c g 1 1 1 c caacaacc 
tcccaggtoatgtcn 

>Sequence 1052

■ acgcgggtatagctatatactcatatttttatttttatgtaaaatttcca 
aaatgcttaatatggcagt ataataattataactagatttacttcaaaac 
atagacataaagaagattacatccctgtagaagttt^ttgaattaggaat 

CACATGCTATTTATTTTAGCAGATAT X.1 1L1 1 AATTAAATGTTTGACOCA 
TGTGAAGTCA TTTAACAaATCTOTTACGCA TTATTCACATATGCAAAA T A 
ATCTATATCATCTGAATACCATTTOCATCITTAAAATTACATATrCCT 
>Sequence 1053

ACAATCAAAAAAAGACAAAAAAJOAAATCGTCTTAAAAGCCACAGTAAACA 
TAAACCICATATCAACTATAAAAOCACACACACTTrGCICTTCATCCOOA 
CAATGCCCAAAATTATACrGAGGTATTTXJGGTGGOCTGATAOCTTCAAAC 
AGGGAGAGAGGCaOCATOTTCaGG^OOTGTATT O CTC G ATTTAGGTGGTG 

actgaa n r r 1 1 1 wn aagacagqgtgtca ct ct ot cacccaggctoga 

ATOCAGTGAOGTOA I C 1 UGGCTCA<7TCCA,GCATCAAOCTCCTGGGCrTt^AA 
GCGATCCTOCCAOCTCAOC 

ACAATGAAAATTACAAAATACTGTTGAGAGAAATTAAAGAAGAjCAAATAA 
ATGAAAAGAGACGCAACA I G 11 1 1 COCTTOTAAAACTCAOT AOOATTAAC 
TATaGCTTTAAAGCAATCAAAATCAGACT 

OGTTTTCTCTG 

TATGAAATTTCAAATGOCAAAGAATi 
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AAGA 7TCAGC ATTTGCAATA ACTC A cttcaaaactcact ac aac aa cg ac- 

CCCAGACTCOOCAOGGT 

>Sequence 1055

COTACXXACCACCTTCA 1 G 1 1 ICCTCT A GCCAACTATAAAOTTATTAACA 

caagaaocctctcttattcatcacagtatcaoccacagggoctoagacao 

TGCTTACACAGAAATGGCCCrrOATAAAATATGGGCTOAATGAATOAACA 
TATGAATTTG ACACTTTO A a AACTAAATTAAAA GTT A 1 1 iCl ACT AGCAT 
TTTT A AC A C AAGAA CT A TTCAG A TT A CTTA T A T ATTA G T A GT A A AATGTT 
TGCTTT ATTCATTTGATTTXXLAAACTTATAATGAACTCAGTGAAACI lul 
CCACCTTTTTCTACATGTN 

>Sequence 1036

ACATTAACTCACTOACTT ACTCTOGGTTOCTATTGTATTAA AATTCTGTA 
TAGACATTACGTAGCCTCAGAGTTGAATTTGGACTGCCCTT AAAATAAAA 
AATTCTTAAATCTTTACTGTGCTCrrCrrAT^AATTTTTATC 

AOTTGG aaatgattactttgcaagtcatagtttactttgaagttaataag 

AGTGATTACACTAAAGGAAAAATGCCATATA TGGCAI IP I fC TTAACAOC 
TTATG AAATTTGGAAAACG ATATTTTAG AAAGCT " I rCTCTTGT IUGCTGG 
AATGAAGTGGAGACGCTGCTO 
>Sequence 1037

acagcttgttcaooata mci iciai i i i i ui i ig agi ici ioi i cat 

ATTCTAGTT A ATTTCT AGTAGTTCTTAATGTA 1 1 1 t AACCAATAG Ac I i 1 
rCTCl rOCTTCTGCI 1 A TG TATTCCTCQT AAATOC 1 1 1 1 T O T G AC TTGTC 
T A A G T A T AAACAACTTT A CT ATTA QCTGTAAAA TTTTCA TTTTT AG TATG 
TCATCA ATCTm ) 1 1 G TGTTTA GT ATG A TT AAA TG TTTTTCACTTGO AA 
AGATATGAATAGTCTACTTCATTGAl II I I I I lAAAGTCATTTCATTTTT 
TATTTTTGTAGCTACAAAATCATAAACN 
>Sequence 1058

ACTAT ACCAGAGTTAAATTOCCTG I G 1 1 C 1 1 1 ICl OCCATTAACTGGCTT 
TGGOTTOGGAAATTCAOATAATOCACTTTTCCAACTT TAAA 

AC^GTAATGCTTTGGAAAGTGGAATTAAACATITCAGAATAATAGCTOTT 

AGGCCGGGCTCAATGGCTCACGCCTGTAGGGAGGCTGAGGCAGGTGGATC 

ACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATOTTAA AAGCCTA 

TCTCTATTAAAAATACAAAAATGAGGCATGGTTGGCAGGTGCCCCTTGTC 

CCAGCTACTTAOGAGGCTGAGOCAGGAGAATTGC1TGAACCAGGGAOGTG 

GAGGTTCCAGTAAGCTGAGATTGCGCCAGTGCACTCTAACTTGGGCAACA 

AGAGTGAGATTCTGTCTCAAAAAATA^TAATAATTAAAAAATAATAGTTG 

TTAGATTGAACATAGAAAACACOT TTI TG I AGATAAAAATTTGC CAAGT G 

TCAGCCAACTITGACAATTTriTAAATCACCAA CTTOT GCCAT TTACC^ 

TTTAAATGAAGCA TTTAATGG ACACAAAAACCCTTT AAGCA<i 1 T IT ' I ICG 

GAAGAAAAOCCAGfnjTTCGCTGGACCGOATOCCCACCTATTAGTACGCOC 

CGAGAGAATT 

>Sequence 1039

CXXTTAGCOTOGTCGCOOOOCGAGGTACTTTAACAAATTAAAAACAAA 

TTAATTTAAAATATTTTAGAAATTTTACTTAATACATTTATTTAATGAAG 

O CT0CTTI 1 A AGAA CTTTAAATOCTCAOGTAAACAOCAOCACCTDCAAAG 

TATTAATATCAACTITTTCAACAAAATGCCTGCTATGTATAAGCTACTQA 

AAGAAGACAAAAATTAATAAAAIUIGI CGC1 OC ICTTAGATATCTATAAT 

CTAO0AAAATOAACACATTCITITC>GACACTAAACTOCATAAGAACAOG 

CATCAGATCrAlCTTATTTAOCACCACATCCTGAGAATGGAGCACAGTGC 

CTGACACATAATAGATOCTCATAATAGATGflntZAGGGTTTATAGTCA GTG 

AATAAGTAAAOAAATOAOTGAGCAAATATCTCTTAAAAAOAACAG ACTTT 

TAAAOTTAACAAGCAGTGATCTGTTATTCAGT AGCAAATAAGATIG'I i ' I C 

CTAATGTCATAATTCAATTKlXXX'l G C I IX X TACTATGACTAGATGTTGG 

TTGGTXIATAGTTTATATGATTCAGTTATTrGGTTXMTTGATTTAAAACAG 

TGAAATAATCrACAAAACCTOCAGTTOTNCTGCTAT/IClCTAGATXKJAAA 
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ACCCTTATAAATrTGGTCA AAAAGGTGGOC^ OGOCOOCTTAC 
CATTATATATOTOATTITTAAAATTOGCAATCATG ITTT 



ACAOTTACCAAAACCCATCCAACTAAAAATTTAAGCTTTTTGCATTTTAG 

TGGATGCAAATTGTGTCTTAGTAAaAAGAACATACAAAAACTAAGAAAOA 

TAATGTTGA^GAAAATAACAAAGCTTAAGGACTTAAACTATTACCATX>A 

GACATGTATAACTACAOTAATTTTAAAAA CIbl 1 1 ICl IO CATAAGTATA 

GAGAAATGTACC 

>Sequence 1061

ggtactracgctttatgatcttgaatattttcagtxjtttaaggaa 1 ci ct 
tc ct tci 1 1 g atctccactgcatgn aagaac i ctgttgcagg totta aca 
ago a agtttgaaatag a aagccag aaoctgcccccca aag a tctgacagt 
agtagaaggagatccattattaagaaggtataatggcaacanaagaataa 
tcaca aattal cr g t g t g t o taatatgtgttgtgtogtgtggg tcaagg a 
gatg agg aa agtggttagggaag acttt a tgg agga agtgggctgtcaac 
agggattgaatgtgacaagaaaagtttxxjgggaattcatggtaatgactg 

ACTT AGGCATTTTAAAACAAGGTATGCTTGTG AAAGGCA A ACCCCCATCO 

TTOCAACAACAGAAOCTAATTGCTTTXHTTCCCTCCACT^ 

CrnTTTAGTTTTGGTTCClTCCTGOTCTTCGOTOCA 

GTGGAGGAGGGAAAGGAAACCCTGGAAACCCTTTATAGGGTCTAAACTAT 

GCTTGGAAAAACAGGGGTACTTGTGCOOCCATTT ATTTGGC AATGGAAAT 

AACTGGCCACACTGGATrTCCTTAAAGGGGGC IOC I II 1 1 1 GGAAAAAAT 

TOTGCAAAACOGCCGGGTGAAACAGGGGGATTTTTAAAACAGGGGGIHrTA 

TTTCTTCCCCT 

>Sequence 1062 

GGTACTTTAACAAATTAAAAACAAATTTTAA TTTAAA A TATTTT AGAA AT 
TTT A CTT AAT ACATTTA TTT AA TG AAGGGCTGCTTTT AA GA AAA CTTT AA 
ATCCTCACGTANAOCACCACCAOCTGCAAAOTATTAATATTCAAC IT 1 1 F 
CAACAAAATGCCTGCTATGTATAAGCTACTGAAAGAAGACAAAAATTAAT 
A AA ATTTTQTCCCTCCTCTT AG ATATCT A T AATCT AGGAAAATGAA CACA T 
TCTTTTCAGACACTAAACTCCATAAGAACAGGCATCAGATCTATCTTATT 
T A CCA CC ACATCCTG AG AATGG AGCACA GTGCCTG A CA CAT AATAGATGC 
TCATAATAGATGCTCACGCTTTATAGTC ACTG AATAAGTAAAGAAATGAO 
TOAGCAAA I 1 IC I C TT A AAACAACAGACTTTTAAAAGTAACAAOCAGTGA 
TTGGGTTATCAATAGCAAATTAAGATTGTITrcTAATGGCATAATTCAAT 
TTTCCXXKKTITCCTACT ATGACTAGATGTTOGG TGGTG AATGGTTTATAT 
GATTCAATTAI T tG G TTGGGTGGATTAAAACCGGGA AATTTC T ACCAA AC 
CTGCAGTTGGGCCTG C 'I I ICC I I IA AAGGGAGA GGCCT TTTTAATTTCGG 
GCAA AAC^jGTGGGGGTTTTTAGGCCCCTrTCCCA TTTTT ATATTGGGATTT 

>Sequence 1063

ACACAAATTCTAGGT AATCT AAATTATTTAAATGTCTAGAA I 1 1 111 TTC 
TTTT ATG AA CCAGATCACAGTTTCTG ACATCGCCT AA CCATQTAAAAOQO 
CTGAAG<XnTCAGTCTTOOTCTGTTATCTCTTCOCATTAC^ 
AGTAAACAATTGCAGOOCOGCAGGATAAOCTCATATTCGAATTO1TAGAA 
AACAOOCAOTO 1 1 ' r T C ACAGATOCOCTACATTAACCAOtCTTCATAQAGA 
O GTGGTCTCTI C ATTATGTATGGTTATGGTATOATCOTTAAACCATCAAT 
ATATACTOGTTAACAATTTTTAGTAGOTAGTTATTCCTTTCO 
ATAATATOOCAO 1 1 1 lOfalCG 1 AACOOGTAATTACC 1 1 1 1 1 ICG TAATTC 
GATTTTTAGGTAAAATGGTTACCCCATTAATTTAAGAGATAAAAATAAAG 
TXnGTACTTTTGGAGTCTTTTAAG<nTGTCTA 1 1 IUI IG 

CCAA CTGG AACCA GAAAGGTGCT AAAAOCATAAATCGTTGGCAATTAAAC 
CCGGACTTTTTAAOGGAATGGAGAGATTCTTTCGACCACCCAGT^ 
AAAAAGACCCTAATAAGGATCOCCOATTTAG ATOOGG3QGTX ? GGTyT GTAC 
CAGGAGG ATTATTCAAACCTTATAOAGAGGGr I T1 CI CTTI ATTCTGGCG 
CTITATAAAATATATTTGGATATTTTGGCACATATAGGGCTCAGGAACAA 
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CCCTAGAGGTTTGTGGTAAAATACAAAAAAATTGAGGCTTTGGAGXXKXi 
TCTTCATAAACCCTGGGGCCCTCCAAAAGAGAAAAATAAATAAAAAT 



ACrrACTACAAGCAGCAAAAGGAAGCTCTAGAACAAGGAATTAAACACAG 
T .UH 1 G 1 1 1 IJCAATTXjCAGAAG AGGCCATG AGCA CCAT A TG T GTG TCA GO 
CTTATCATCTGAACOVAAGAAACKXXAATCCTTCACCI I ICT 1 ATGACTC 
TT AT AGGCTCCAAT ATTTCA CTTGGCCA TAAACAACTT AA TATCTCACAC 
CTAGTAGTATTCAGTGACACAGAAACGGAAAGAGAAAGGATGAAGAACAG 
A GGAAAG A QAAATAATTTCCCAAG A TA CA AATTTAA TATTCTTTCCAAA G 
CATAAOAOCAATTAAAAAATA 1 1 ICIClUI I OTAGTTGTAGG ATGC ATTT 
TTCTACATTATTCKTCAGACATCCTGGANATAATATCAAACTTTGTAAAG 
AACACA AAATA TTTTTATTTTTTA ATTAAAAAACA ATCCTTCTAAAGGGG 
TTTAAGAGCTTTGAACACAGGOCTTAGTAAAAGTOCCTTGGCCGCAACCA 
CCTT A AGGGCG AA TTCACG A C A 
>Sequence 1063

GXTTACATTGAAACAATATAGTAGTCTTCCCCTTTACAAAGCTGAATTAAA 

GTAAAAGTCTGTGTTGGNGAATAATAGGGGAATGTNGGATTGGTAGCTGT 

TTAATANAAGATTTAOGATACATTATAAAATTGCTrAAOGGCCAGGCGCT 

GTGGCTTTACGCCT ATAATCCCAAGCACTTTC<XlAAGGCTG AGGTCGGGT 

GGATCANCCTGAGATCAGGAGTTCGAAGACCACCCTGTrcAACAIGGTGA 

AAGCCC^TCTGTACCTGCCCGGCGGCCGCTCGAAAGGG 

>Sequence 1066

CCXZTTAOCGTGGTCGOGGGCCAGGTACCXIACATGATXX^CAAAGAGGAGGG 

ATOACAACAAAAGGAGTTTTAAAGTGCATClTGAAATAGCACACAATrrT 

TACAAOTTiAATTTCTCAAAAGTATATTAACACAGC^ 
CAAATGTACCTGCCCGGTCGGTCGCTCGAAAGGG 
>Sequence 1067

occtt agcgcccgcccgggcaggtaccctccgtg acttttcagggtctcc 

tggttgaatgaatttgcagaaggattaaaatgtg ig 1 1cttatttgtgcc 

tttotatttctcccattaagtagtgtgttxjgaggcttattaga 

g agaagggtaataacataaacacataccgta ggcagccctg acattaaac 

acatkaggtaggagcctgccataaagcagcgtatgtaaagaactaaaagg 

ootgttoi i ' i ccat tt i c atgtotccaagc cn uc ' i ' i ' o catactctogag 

atgacaagaacacaaagtttgctoagcttcacaccaactaattgactaaa 

TCCAGAAAGTTTGGAACATGCGAGAACATN riTrOCTTTTCTI ACAAGGG 
TTATCTAATTAATGJlTGOACATTITAAAAATCCACAAGTGTXKiAT^ 
TCTGAA ^ 1 1 I C l G AT1TAAACAG AATCATGOTGAAAAAGGGAOCTTATTT 
TCAAGAAAATCTTGCATTATAAAACCCTAAAAGTTACCTTC^ 
CAO^UX3«XnTAAAQQGCAQAATATTTCCAAA^ 
GTTAACTAATTGGAATTCCCAAACTCTGGGGTACXXX^ 
GTGAATTCAATGAGGCATTAAGGCXXXrrTrCXXnTGTGTrc 
TATTOOGCGCTOOCCAAA TTTTI COCACAAT AACATTTCTCAA GCCCGGT 
AAAC<XTT TTAA AG AGTGT AAAAAGCCCTGGG GTGTCCC TATTTG G 
ACCTTAAC1 I'lCCAC'll I ATOGCGGGTGTGGGCT CC" 1 1 nGTAGGGGTTTA 
ATATTAGAAAAAATTTGGGXXSCCCTAATCTAGGGGATCOCCT 
>Sequence 1068

GGTACT ATATTAGTGTAGCAATTTTCCAAAAGOCATTX^TCTTAGAGGGC 
TAAATGATTTTACCTTATCAATTCtX^ 

CTAAGCAOGG AA GTCAGGCCAAG AAA T A TCTCCCTGCAAG A G AA GGCATC 

gcacatgtatctctccatgctatttaaaattgcattctoca ac^t aoaaa 
ggataggocatcctocagaagocaggttgcaggaaaac i gc 1 1 icl 1 iggc 

CTTTACACA K I CC I IT I GGAGAGA TCCTGGTG AAAGCAGCAACTACCATC 
rTAITGTtZANCAGGTGGAGGGAGGAAGGAOGGCATC 



GCAG A C A TCATTCT A TT ATCTCAACCTTGC 1 1 ICTOGOAT OCAAA OCCAA 
GAACTTC^TTGTIXXATGCCCrrAGAGCTCTAA 1 1 tOCTGA 

AATGAAAGCTTGAAAGGGC 1 1 1 1 IIjCI 1 iugtgaaaccoottcgtggccc 
GGGCAAATTCTGOTGGTTTCGOG 1C I OTCAGTGGGTCCTAATAACTGTTA 
TAAGTGTG G rTCTTGGCAACATTTGTAAAATATTTTCTATTOCTCACACA 
CCTTTTCTG1 1 1 AGACATTTATTTTTAACACAGACAAATOCTTAAGTGTT 
CCCGCCOCAGGG i iti lAACTT 
> Sequence 1069 

GGTAGCCTGCTTTGATTATTTCCGAATCCAGTGGGTAGAGAAGGTAAAGG 

CAACCGCTCACTGGATATTTTTAAATTGTAGGGAIGICCI I IGCICIGGG 

TCAATTTTAOGATCAAATATAAAAGCACCTATAGCTCAGAGTATCTTCTA 

ACATAAAACTTCTGAGATACCAGAAATTTTCCAAAACATGGTATAAACAG 

TATGAAACACTGGGTAGATAAAAGC 1 1 ICTCTAAATC TTAAA GTGCTCAA 

ATATCATGACCTGATTTTTrAGTTTTAGAAATCAGATATTTTTCT 

ATA TCTT A AACTTTC A TGTTA AA TTCTAGTTCTG ACA A TG TA GGG TTCTA 

IT ' 1 TTTTCA GG TG A TTG TTGGG AGCGT A T A G A A GCA TAT AT AAATA TGG A 

ATA TXTTG1T1 CTT TTTT CCCCTTCTGAAAGAAAGTCAAGCCTCTAATCAA 

ATAGATTGATGCTTCAGAAACTTAAC^GAATATTATCTGCAATTTG 

AAATGCAT NT I rCIT GGGGAAGTTTCCA TGGTCAAAA T TATT AGTCATTO 

CAAAACAG AAAAGTITGACAACTGGAAATGCAGACNCI'ni OC I I GATTN 

TGTAAAGACAGGA 

>Sequence 1070 

CCCTTTG AGOGGCCGCCCGGGCAGGrrAC^TrATATTAATG AAATTTATCT 
AGTCCTTX3CAAACTTGTXXXTATTGATTTTCATTAGTGTAAACTAAAGAG 
AGAAACTTCACACTGACATTTATAATTGTAAGAACTAAGAACCAACCATC 
AG CT 1 1 TC r A TGCCAATCCATGCOCTTCAGG AAGT7XTTG AGGCCT TGAG 
GTTGCTAGTTTAGTAAATIOCTTACTGGGACATTAAAGCACCTACATTTT 
GG AAAO A GGG AGAATTAAG fTTTTTOTTG TTG AATTTATTATCACT AAGT 
AGTTTAAAG C1C I CTI AGATCCCAA AAGAGGAAAAATTCCAGGTCCATTA 
ATCAAAAGGCTGAAACTTAAACTTTAGhn"AAAGTTATTTTGATTAAATAA 
CAAACTTOOO On ICCI 1 CCACAGCCGTTTATCCGAAACTATC 
>Sequence 1071 

CCCTTAGCGTGGCCGCGGCOGAGGTAOCAAAACTGAAAAAAGATTGTGTA 

TCCAAACATTATTTCACATAAAATGTATrrTGATAAAGTAAATIXXXXAA 

CCATGGTCCTCAGAGGTTCTAACAGTCCATGTAAGTTGAAGAAA AAGAG T 

TATX^AATCAATACCrrCACTATCAATCATTTATTTAATCATTATTTAGTTT 

TCACATATCTATGAATTCAOTAGAAGAACCAGCACTCATAAAGGTGGCCA 

TTCCTATACCTGCCATCGATTACATTATTTTAC7/TAAATAAAGCTTATAT 

TA CA TCTGACAA CATTCCTTGTTAA-AAAA TAAATTCCT AAACAGGGCAAT 

ATTCCCATCTTrrAGAAATATGCCAAAAAAAAATAT TTTTA AACTCATTT 

GGAAACATTCCAGGAACTTATTCCAGA^TCTATTrATrTT^^ 

ATTTGTTCAAAATAATCCTTTGGCTTGGTT^ 

ATTTTCAAACCAGACTGGTTrAATTAAAAATTAAGGCCCCAAAACCCCTA 

ATTTATTrCACAGGGGGCCGGTAAGCCAATT 

>Sequence 1072

GOT A C 1 1 1 1 1 H 1 1 II II Ml 1 1 1 1 1U AOACGOAGTTTCAC1CI I GI IGC 
CCAGGCTGGAGTGCAATGGCGCAATCTC^GCTCAOCACAAGCTCTGCCTC 
CCGCGTTCAAGAGATTCTXXXlGCCTCAGXXnCTTGAG^ 
AGGXlATGTGCCACCATOCCTG<JTTAATTTTOTA TTTTT AOTA 
GTTTCTCCA1 G 1 1 GG rCCGGCT IjO 1 C 1 LXIAACTCCCX^CTTCAGGTGATC 
CTCCTGCCTTGGCCTCCAAAAGTGTCAGGATTAC AGGCCT GAGCC^ 
GCCCTGCTTAAGTTTTAATAAGA I CTCT1 OGCAACTTTTTACOACTOOCA 

acttaggtctcacaaacacagaaaag c rroTcn i aagtata rrorcrn 
gaaaagttaatacactctctaaatc<rttxattraaaatgatt^ 
aaatgcatgcactgagagaaaagatatttgaatgatataca wccacato t 
taaatta>ctgtgattgtttcta>gtattggcactatggtcaat 1 1 ici7 
1 1 ICI lOIl IATGC'1 1 1 rCTOAAGTTnCAACOOOCATAATAAAOATGTA 
TO CI I CI 
>Sequence 1073

GGTACCTATTGTATCAGAAAAATOCTAATTAATTrn lOCACATAAAGGG 
CATTTTAAA C 1 rCOl 1 1 IAI ICI 1 IOIO ATAAATATGCATCATOAATOGT 
AATOTTAAACAOAATTCAAAAGTTATCAGTTTCGCTAGCCA ga CACAGTA 

GGAAGTTCACGTTTAGCCTGGGCA GCATAGTGAGACAl IGIC I 1 1 IA TAA 

Tr Iti II AGGAATTTCTCTATTTAAAGGACI I 1 I AGGOCTAGAGTGGTGGC 
TTA0KTTCTA ATO0C AGCACTTTGOGAGGOCAATTCCAGGAGAATCACT 
TQAOOCCAGOAG1 rOQOOACCAACCrOOGCAAAOTAOGGAGAOOCTGTCl 

CXrrAGOCACTGGGAAGCTGGGGT<XK^GAAATACriTGGG<XCAGGAATTT 

QAOOTTOTAGTGAGCTATG ATCCCCOTACACATTATAGACC C ' I O I CIC I A 

AAAAAT TAAAAATAA AC C 1 1 1 1 1 1 AAAGGACTTTA AAGTTGGA1 rTTTTT 

CTTOTTAAGTT ATTATCAi 1"JI 1 ATGTCCTGCJ I iGACCTGCCCGGOCGG 

CCTTAAG 

>Sequence 1074

OOTACIOQGTCACTCrTO 

CT AGGAGCXXX^GGCTTCTTTXXTTOACCTGCXXX} TCTCCT ACACCCTCTGG 

TtXTGCTCCACACTCKJTCTAATAACTGGTGTTCCACA I'lCCICT AACGTG 

CACAACACACTCCTOOCOO OCTCCTT I TC AjOCTOCTCTOC ATTO C TC TTA 

TAAOOCTC1 rCCCCAAATDOCl P 0 OOCATOOC1 1 t il 1 1 OC I CA TCTCAAG 

GTAGAAACAAACTGTCGCTCAATCAGCTAGAGCCCTCCCACTATGCTCOC 

GCGTACCTOCXXX3GGCGGCCGGTCAAAOGG 

>Sequence 1075

ACTTTITCAAAGAGGATAAACTTAAAGAAAATCACTAGATACACATCAAAT 
AAAAAATT ACACAOCACA CA GAGGGG AATAA OGTTTA cattacaaaga tt 

tttcaocagaaatcagagaagtgaaaagacagctaaatggcatcattgag 

otgctcaaogaagcaaocatctactoggaattatatatocacctaaaata 

ttxhttaggaatgaaagtaaaataaatacattctcaaagaaaaacaaaoa 

gaatgtatccxx^gcagactgatctgctagaaaagctaaggtcaacatta 

ggctgaaaooaaatgctgcatcttcaggaatgaagaaagagcaatagaaa 

caataaatatataogaaaacacaaaatactaoa rttttocl ciaagttct 

ATAAAGTAOC 



ACTTCACTGATTT ATGGCAAGTCAGCCAA TXXA TCA GTXXTTCAAAGCTOC 
TTGTATTGTCAGGAATGTCTAACATTATTTGTCACTC^ 
CTOCCAACTAOTAOCA 1 1 1 UI fTTOTO I CTGATAG ATTCTTCATGCAOAA 
AGAATAAGTAAAATGAGATGGGACACAAATCTGAGTATAGCATTGTCATT 
ACTTTTTGCTXMIACAGATTACTTTXIAAGAAATATTCT^ 
ACAGAATCCACAAATTXX>OATTTAAOAAATAGOTXrTATATAAAGCTTAT 
TTAATAT TTOOTAT ATTTTTTAO TTACTC ATTG Oj I G ' l OC 1 1 TATAATOC ' 
AAAAGCA illll 1GCGAATC1 1 G 1 1 1 lOCTACTTAAAATGAAGAAAATCT 
TAACA T ACAGTGGTGAATAGGAACAOCACACAAOCCT A TA T ATTGATTAA 
AGTAGTTTATTAGCTAAGCTTACAGTFXJAAGTAGCTTOOGAAAAAAAAAT 
A TTAAGAAAA CCATTAGAGAAAGGCT ATTTACT ATTTCTTAAGGGOGAAA 
AOGTDCTATTATGAATCATAOOTOTTCTATTTATAAAAOGTATOTCC1 It; 
AGAACCTOGAGAACGQCTTT ACAAAAT A CCTTGGAAATTATTCCAGGGG A 
ACAAATTGACTCAAAAAACAAGAGCTGGGTn"AAAAOCCTOGAAAAAGGCC 
TTATAGOCAAAT 



GGTACAG/ 



TACAOAGTAAOCATOACTTACTAG<nKriTATGATGAAGGTOTATGTGT 
GTGTATATOTGTGCATOCATOTNATAAGTGTGTtjCATTTOCACACAT^ 
AGTTTTAAGC I GCTCC 1 U ItJATTTATTGATGGTCAAAGO 1 1 Id 1 1 lO OC 

TATTGCTGGACTTrTTAAGATTGTCTTGTAAnGICTrri TGI r CT T GTT G 

AAAATTAA<KXjTGTATATTAAAGGTAGTrTTTAOCCAGATCTTATATGT0 

TGATAGCTCAOOTCTOTAATCAOAAAOCT ACTOTTTAATOOOCAOOCAA'T 

TOOCA TTAGCTTCCT AGAGGCTGATTTAAT AAACTA 1 C I ICI 1 lAAAACT 

CATTTAAAATTACAGACATXnTTGCATACAATGGATTAATCACXriTrTCA 

CAfTA A fXT 'ACAAAAC i TCT'GCTOCA t- 'l TTC1 m "jt l A<KK*CTAA f 'A T TC-A 

TTTCATATGCATTGAATATTATTGGTGAACTTGCATTAATrACATCCTGC 

ATATATGGACATACAATOTCATCTOCAGAATTTAAGATTTTTTATTGTTA 

ATTTTTATAGGGACTGGGAAATTGAGAATTTAAATTAGCATGClTtlATTT 

ATAATAATATTTCTAGTGG1 1 LA TTAACOOCTAAAATGTGATTAGATCAG 

CATrAAATTOGGAAGAAAATTTTTCCTAAAATOGGOCTCGCOCGGCCCGC 

GTTTCAAGGGCAAA 

>Sequence 159

TGGCTATTG AGACCTCACCGCGGTGGCGGCCGOCCOGOCA GGT ACA CAGG 
ACCAATGCTGOOCATOCACATCGAATTTACAAACATTCTACAGCGCAAAA 
GGCTOCAGA C T I I GATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAAC 
ATGCT^TGGAGAOGGGGGAGCTGGAGAATACTTACATCATTTACACCCC 
OGACCATGGTTACCATATTCGGCAGTlTGGACrrGGTCAAGGGGAAATCCA 

toocatatga cn 1xi atattogtgt gcx. 1 ' i ' i i ii ' i a ttcgtggtocaact 
gtagaaccaggatcaatagtcocacagatcgttctcaacattgacttggc 

CCOCACGATCCTGGATATTOCTGGGCTCGACACAC C I C C I GATOTCGACG 
GCAAOTC TC TOCTCAAACn C rG GAOCCAO AAAAOCCAGOTAACAGGTTT 

cg aacaaacaagaaggccaaaatttggcgtoa t acattcct agtgggaag 

AGGCNANATTCTACGTAAGAAGGAAGGATTXAGCAAGAATATOCAACAGT 



TOGATGATCNATTOGTAGGCCTCATCGCGOTOOCOOOCGOOOOGGCAGOT 

ACACAGGACCAATGCTGCOCATOCACATCGAATTTACAAACATTCTACAG 

CGCAAA^AGGCTCCAGACrTTCATGTCAGTGGATGATTCTGTGGAGAGGCT 

GTAT AACATGCTCGTGG AG A CGGGGG AGCTGGAG AA TACTTACATCA TTT 

ACAOOGOCGACCATGGTTAOCATATTGGOCAGTTTOGACTOGTCAAGGGG 

AAATXXATOCX^TATOACTrTGATATTCGT O I MX I lllll I A TTOGTGG 

TCCAAGTGTAGAACCAGGATCAATAGTOOCACAGATOGTTCTCAACATTG 

ACI I GGCXXXXJAOGATOCIGGATATTGCTGGGCTCGACACAOCTOCT GA T 

OTQQAOOOCAAOTC1 G I OC ICAAACTTCTGGAOOCAGAAAAGCCAGGTAA 

CAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGOGTGATACATTCCTAG 

TGGAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGGCAGAATATC 

CAACAAGTCAATCACrrOCOCAAATTOAACGGGTCAAGAACTATOCCAGC 

A GCCAGGGTCCTCGGOCOCCT AGAACTA GTGG A 

>Sequence 161

G A T AACGTTGAAOCTCATCCG AGGCCOGOCGA GGT ACCA TOCTATT AATA 
CTAACTTCTOCI ICI ACATACTGTAG ACL 1 1 PCTCGATGATAGAAATCAA 
TGCAGOGGGTGGGAOQAGOGCAOCATTTATATTGXMCTGACTGATATGGC 
TTTCTATACCAAAGOTAAATXJCTtlAATGAGAAAATOCTOACTCITGCAAG 
TATCTATATACCAA GAAGTTGACCTCATCACTTXnTATACTCATCTrTAT 



CATTJGATOTOGOQATGTGCAACTQAATA TCC^ 
CXXWGAATCXXOGOGTCnXXXOCXXMJGTI^^ 

GTCTATTCTCACT AAAATA TTCTTTATAA7TTAAACTTTATACGAATTTA 
ATAGTTATTCACTATTATTTATTTTTTATATATTATTACACAATTCTATT 
TTTTTT AAA TCAATACTT AACAC ITTICn 1 AAT ATTTTATT ACAAT ATA 
CCAATAGATTA TAACA TTTTTACTTATTACAT C T I ICI AC 
QOXSOGCOOAOOTAOCrOOOCTOCTOOCATA Ul IC I 1 1 C ACOOOTTCATAT . 
TTGOOCAAGTGATTTGACTGTTGGATA n C TT GC I O GATTC C 1 C CTT C T I 
AjOGTAGAAATTTGCCTCI 1 1 CCACTAGGAATGTATCACGOCAAA TTTTGG 

can ici roi i roi t cuaaagctgttacctgqc i 1 1 ictoootocaoaaot 

TTGAGGACAOACTTGO0GTOCACATCAGGAGGTGTGTCGAGCOCAGCAAT 
ATCCAGGATOGTG<MKXXXiAAGTCAATGTrGAG AA CG ATCTGTGGGACTA 
TTGATCCTGGTTCl ACACTTGGACCACGAATAAAAAAAGGCACACOAATA 
TCAAAGTCATATGGCATGGATTTCCCCTTG A CCAGTCCAAACTGCCCAAT 

. atootaac^atggtcoocootgtaaatgatgtaaggattctw^gcttcc 

CCGICIUL A CGAG OL l IOI I 1 A CAGGCTTTCCACAGAATTAT 
>Sequence 163

TTATTATCGATCCGCACCACGOrrra 

TGCA III C1CT AATGACTAACGATGTTAAACATATTTTTATGTACTTGTT 
TCATGTACT70TTGATATGTCTATTCAA 1* 1 UL 'l 7 T C AOCATTTTTATGGA 
GCTCTTTTTTTATTATTGAGTTG^ 

CAGOOCTTTGTTATATACATGCrTTGCAATOTACATT G TCI I AAAATCTG 

tooc tto cctot i c aattcatta gtg g i g rtrrch iaagcag r n ita at 

TTTGATCAAGTGTAACTTATTCA TTTTTT ATT ATGGTT A I IPC 1 1 IA TCT 
TTCAGGTCDCAAAI 1 1 1GCC1 '1 CICAGAAATCACAAACATTATCCTATGT 
M ICC I I CAAAAATTATA TGGTTTTA TGTA 7TTTCAATCTCAAAA TATTC 
TCTAAI lllll I OC I t>A TTT ATTTACT AAAGAAATTTGAGGGATTTGCTA 
TAA TGTTAGGGA TTTTTCTAG atgccact 



TCGATOACTCACCGCOOTOGCGGCOGCXXro 

TAGTGTCTCAA 1 1 1 CC ] UC 1 C I ATAAAACAG AG ATAATAGTATTTAGCCC 

aoagggttgtggtgaagtgtgaatxiatttctccatgta^ 

caggctgck>catggtggtgggcacctgtaatcccagt^ 

gagacaggagaatcgcttgaacccgggagacggaggttgcagtgagccga 

gataotgccactgcactccagcctgagtgacaagagtgagagtcc^ 

aaaaaaaaaaaaaaaaaaaaaaagtacct 



CATO COCTGTATATAAAA I CMCGICHGIIG I a tatatatatttaaaaa 
TGTCCATGACGTTT AACAG ATAAA T>^>O^T^^^K>*CNG>WCCTNNTTK^(N>I 
NNAAACTGGWGGWGJIGATTGTATACGACTATAT^ 

AAX3C ATTCTCHAlK3*B¥XCCAiaX3TGXTA Al I C IClClATAATCGGCOG 
CCCCGGCAGGT ACAGACTTAGTACCTTTGCTTTTATATATTGTGTTTrTT 
GCATAGATATCAATAGTTTCACTAATTCCATTCATGC^ AAACATT 
CTTAAAACl 1 TGI 1 1 1 ATGGGATTATCAGAGTAACAAAATAATGTAGTCC 
CTTTATGOACTATAAGTAAC _ 

GAOATGTATCAAAGAATTTGTCGACATATG^^ 
TTGaTGT 



CGATATGG<jAGTCGAaXACGX3GTCCGCTGCC^ 
AAACAJ^AGTGGCCATGGTCKCAGGGATAGACTTTCTC^^ 
7TCACTCACCAAGGCAGAOCTGGCTACAGOCACTGCTGAG 
TCCAGCAOCAtrrGCCCAACACTGAGOCCTTGATATGCATCA 
TGATCACACAGCTACA TGGTGGCAGATTGATTAT ATTGG ACTTCTTCCAT 
CATGCAAAGGGCAGAAOi 1 1UJLTGGAATGOACACTCCAOATATG 

AGTTTDCCT atoctacacgcaatgcttctgctaagactaocatctgtgga 

TTCACGGAATGCCTTA TCCACCCTCATGGTATTOCAtTACAGCATTGCCTC 
TGACCAAGGCAXrrCACTTTACAGCTAGTtTrCkACAOT^ 
GGAATTCACTGATCCCACCA TGTTXXCCACCATCCIXIAAGCAACTGGATT 
CUTACAATGGTGGAATGGCC 1 1 1 1 1 UAGTCACAATAACAATGCCAACTAA 



GTOATAATACTCTOCCOOGCTTOOOCAAATTT1 r T CA GAAAGCCATTCTT 
OCTCTGAATCAGCATTXlAATATATGGCATrrGGTATTCC^ 
ACAAGTCCAGGAAATAATGGGGTGGAATTOGAATGGATTACTTAACATTA 
CCCCT AA TGATOCA TAG AAAATTTGGCTACTG TTCCCACACATTCATTCT 
GGTGGTCTAAAGGTTAQATCCCAAOGAGAAAOTTCCACAGAA 
>Seqaeacc I Oil 

GCrrACACGATGTGGCreACATTTGGCTGGAGTCTGCTAAGAT^ 1 U I ICI I 
ATGCTGGATGGACGCAGACCTGTAACACCTCTG 1 1 1 1 1CA I CI IC ICCAC 
CATA I 1 1 1 ICA TCAGCOGCCTCA I IOI 1 1 1 ICCI I ICI UG ATTTTATATG 
QCACOC TO ATC1 IO CXTATGTA TCACC TCG AG OC1 1 I C I I I ICA TACATC 
I fOCTCAACCTACAGCTt^TGATCTTCCAGGTCCl I CACCTG T ACTGGGG 
TTATTACATXrTTGAAGATGCTCAACAGATGTATATTCATGAAG^ 
AGGATGTGAAGAGTGATGACTAGGATTATGATAA AGAA GATGAAAAGGGA 

TCGGOCTTGAG AGGCACCTCA 1 1 1 CCA ATOOGCAGCATTOOCCTTAACTG 

gaagcctacaooaa ctc c ttg gcaoca g r t gc tt aaagt aacttgccogg 
ccggccgattgaaaggggga 

>Sequence 165

1 CnCCATACI I cgt aacictatacatttaccattgttatcatctactat 

AATTATCCATCrTATACTTCCGAA ClCO 1 1 1 AATAOTATTTATCTAATTA 
TTATATAATTTCTATTTATAAATTACTTTCNACTrG 

TTTTTATCCGCCTG ACGAACGCGCAGG>4ACCGGCA TCAGCATT AGTAATC 
AACCTGTTAATCCAAGGlCn I AGAAAAACTTCAAATTATTCCTGCAAGC 

CAA TTTTGTCCACGTGTTGA ga tcattgct acaatg aaaaag a a gggtg a 

GAAGAGATGTCTGAATCCAGAATC<MAGGCCGTCAAGAATTTACTGAAA^ 
CAGTTAGCAAGGAAAGGTCTAAAAGA rC ICC I T AAAACCAGAGGGGAGCA 
AAATCGATGCAGTG C 1T CCAAGGATGGACCACACAGAGGCTGCCTCTCCC 
ATCA C I 1 ODC I A CATOGAGTATATOTCAAOCCATAA T I G I I C TT AO I TTC 

cagttacccctaaaggtgaccaatgatggtcacccaatcagctgctacta 

CTTCCTOI AGAAGCTTAAATGTCATAATTCTTAGC I II I LA GG AATAA CT 

rr accctggcactattaatgaaaoctctaccggggtgccta i l» ic i iaao 

G O ' I OG I I I G GACCT G CTTCAAATA I 1 1 ICC I ICA CC1TTCCCATCTTCCA 
GGGCTCCTTGGGCGGTCTGAACTAGTGGGATCC^ 
ATATCAACTTATATO ItXXGCOCCClCAGOGGGGGCT 



TTCTATTA TTCG TTGATOGACTA M OCT 1 C n COG I ^ATTGATTGAACA 
GTATTCATT A C I fCTA TTA Cl ICI 1 1 rTATACATCCATTATOGTCTGTTl 
AOGATGTTTATCTATTATTATIGTTC 

CI roCTCA G CCl 1 U CCAGOCOCCTCTOATGAGCTCTCTAATCAGCAGGAC 
CAAGGTGTGAAGTGGGAATGAACATGGATCCATCCCA 
AAAGGTGGACAGCCrU 1 1 CG ICTLTCATGTCAGCCT AGGGCTGGGAACAG 
TTTGTGAGGACTT ATCTGTTGTACCT 
>Sequence 167

COGCCCCGAAGTACCTNTt*CCCTAATATTt^TGGCAA I I ICI AOGTTATT 

CTCAA CTCCl 1 1 icatgttacttatatgacatctacatcatcactttata 
gtacataatatntkmikaatgtatgtgctgctagccggc^ 
ocaggtaogcgggatggcaogtgcagcccaagtaggtct acaagaogcta 

CI ICCCCl A TCATAjGA^tOAGCTTATCACClTTCATGATCAC GO CCTOGGA 
ATCAII I1CC1 IA 1 CI I^ I 1 U,I A OI CC1U 1 A TC OXII1 I UC I A ACACT 



CATtZAAATT^TTGGCCACCAATGATACTGAACCTACGAGTACOCT 



CI I G ICCI I ICACTTCACACA 1 1 1 1 ' I CCAACTTCT ATCTTAATATCACAT 
TCTCTATA I I IIC 1 1 1 1 1 I A ATATAAATATAATATAGTCTATCATATTCT 

ATTAATKhfNKKKTCTTAACJ P DT OC rGTAGCCGCOCGCCGCAOGCTGGCAT 

TCCATCTTCAIXIAOACOCnxn'AGOOCTOOCGCTTTTOCTAGOACAOTTC 

GCGGAAGAAGTGOCTr^COCCriCCAQA^^ 

AG AAGCCCAG AGAGAGGTAGGTTTTAGGAGGCCTGCAGGT ACCT 



CCGTCTKXXCATTGANANTCTGNCTTAOC^ 

OGTACTTCCACTATTATTOAATCTATTC TOTATT ATAATTCTATATTTGA 
TTCCCTATCTOOOCTCAACnXATrATACATTTTCAT<XXrTGAGCCACTO 

ltrni i c actcta tttca gtcooctccaca i nrc togcacatactaag 

CATfOXATGAGTNATXTIXiATXnMATAAATGTAHTTTCOCTAAATTCAGO 
TTCACyTATNOCTTAATCTGHAAAATACTAAAATCOGAAATGCTCATAAAA 
TTCAAAG C Mill I G AGOACCTGACCTOCTGCCTCAAAGGA AATGCTCAT 
TNGGAGCA TTTTG<WCCTTCAGAATITrCAAGATrAI>O^KXKlATATTCATA 
COCGT AAGAAATAAGTGCTCAATATTTCCCAAAATHT WNCA AA AA AOTCT 
TTGAAATO0CCAAAACAACTTTTCTGGTO30CAAGGTATTTTTTGGAAAT 
AAGGGGATTAOCTCANACW'ICTTGTAOOCTNNAAAATACOCATGCANWn' 
ACTWmCGATTAGCK^CCCATGTGAAAGGGGTAICI 11 LILI 1 ANN AAA 
TTG ANACCCrCATTGGGKN 1 1 ICOi ILL 1CAAGCCAAAACTTGACCCTGO 
GGCCCCACTTTCAACA TGNNN GCTTTTAATTCCGTOCOCTNGGATOTTAA 
ATCCOCATG C1 fCCT CTTTTl I T , AOCACATAAATTTCAATGCCOCCATCA 
AOATTGAATATIX>CATTTCCACC^TAACACTGGCCATTCAA GCTCXXT T 
CAACAAGCOCACTX^TAANGGTTTT CCTCC TCTXrrOCATOCA A 1 IT 11GO 
TTOCTTATGAAAATTTCTAOCTTTGGCTTTCCX3CCAGGAAACCTTTAAGT 
AGGTTTCTCGGTCAOGTCCCGCAACACCACCGCAAOO<XX5GGTOCrOCGC 
GTAAOCTTOGGOCOGTTCTAGaOCTAGTGGGATOOCOOGGOOCTOGAGGA 
AATTCOAATTCAAGGCTTATCGATTCCO 
>Sequence 170

TG1GTCCATGCGTCACCOGGTGGCGGOCGAGGTACTTAGCTOTGTTITIA 
TTCAAAGTCTACATTTTATGTAGTGGTTAATGTTTGCTCTTCATTAGGAT 
GOTTTCACAOTTACCATACAAATGTAGAAGCAACAGGTCCAAAAAGTAGG 
GCATGA11 1 1 TJ" I XJCATGT AATCCA GOG AG A AAACAAGCC ATG ACCATTOT 
TGGTTGGGAGACTGAAGGTtWTTGAAGGTTCACCATCATOCTCACCAACT 
TTTGGGCCATAATTCAOOCAACOCTrroGTGGAGOCTGAAAAAAATCTtKJ 
GCAGAATCTAOGA C 1 ICmA I 1 1 IGU IA AAOOGGTAACACAGAOTOOC 
CTTATCAAGGAGTTGQAGATIXrrGCAAGGAAGAGAAGGAGTGAAG GAGA G 
ATX^AGAGAGAGAAACAATGAOGAACATTTCATTTGACCCAACATXXTTT 
AGGAOCATAAATGTTO ACACTAAGTTATC CC 1*1 1 1 U 1 UCTAAAATOO ACA 
GTATTGGCAAAATGATACCACAACTTCTTATTCTXTrGGCTCT ATATTOCT 
TTOGAAACACTTAAACATCANATCGAGTrAAATACATATTTOAAATtrTAG 
GTTAGGAAAT A TTGGTG AGGAGGCCTTA 
>Sequence 171



AGTTOGTAGTAGGAAATOGCGTOGTGGCATTGAGOGGCATtXXrrCCT 
CCTCCAGGAAAAGCTOGOOGAAGACGAOGTrCTGOGGA GAOAGAGGC ro 
AAOCAOTCTOOG AAOTGTAGTCCAOTTGGCTTAGCAGTAG IHUjII GGG 
GGGGAGOOGAGGTTOOGGCAAGGGGCTAGGOOGGCITGAAAAGA^ 

CAACTCAGGAGATTAAAAAAAAATCTOCACAAGAAOAAOCAACTCAOCAO 
GOXTGGCATTAAAACATTTOCCAGAATAAACAGATATOCATTCCATTAA 
AGGT AATTTTCAAATATTTAAGTTACACCAAGA 1 1 IUX IC CAATATGTG 
OCTTTCTCAAAOCAATGCAACTAATTCATTGCTAATACTGGOGCATGAAT 
TTTrGGCAAATGTTTATGGTTTTAC 1 I ICI J^ ^-^ 7 ^^^^^^]^ 

CAGAAOTGOCCACTGTGAAATCGCAAAAGGTAT 
>Sequence 172
gaogatgcattcaocgggcggcggc^gggtacagattt aaggtt gatoga 

CTCAOGGTAAGGATAGCTACAGCrGTGTGGGGCTGAAGGTCTGTXXK^CT 
GAGCTACTPGGCAAOOAGOOCTX TOI 1 1 I'CaTTOTGACACACTQAGTTAA 
TAAAGCACTTACrTQAGG<lAGCCAGAGOCCAAACICTAAATGTGCTGTAGA 
AAAAGOGCCAAGTCATTCACTOCACCACTCCTTX^G(XAGAGGT agaaag 
GAlTTACTCTTCAGCCATCTG<rrAGAG 

ACAAAGGGAGGGAGAGGTATCATXXJTGA7TAATAAATTCAAACAAAGCTG 
AATGATAAGACCOCAOGATGOAATACAGTCTGAGAAAGGOCTGGGCAAAG 

ggaggcagagggactgaaggaagcaggtcaaggaagatacacoc 
ag^atg^(xccttacgcx3tgg<xx^ 

AACACTGCCATTCACAAGTCAAGGAACCCAGCKKXAGCTGGAA 
GCACACATGCTGTGGAGCACACATCCTGTGGAGATTGCAGTGTGTXrTGAG 
GTTTGTGT A GT A GTGG AAG A TTTT A GG TATGT A G A GC A AG TTG A AATGG A 
TTG AG A CT GC A TGGGGGCA T A A ATG A G AAA TTGCCTGT AGCATCTAGTCT 
ACTTCAAGGAAGTOGAGACATAAGGAGAGACAAAAACAGGTTTGTGOCAT 
AAAGTA TTTTTTCAA AGACACCAAG ATGTGOGTAAATGA AA ATTATTAGT 
TCACTTOCCTOCTCGCATGAAAC I ' I IGCGTTAAGAAGGGT OGCTGCAA TT 
CCAAGGTTTGGTAAAGGGCAATTTTGGCTAAAGGACTGGC1 1 1 1 1 iGAAA 
TGCCTTATG 
>Sequence 174

GTTTG A TTGCGGTGGCCGAGCGGCCGCCCGGGCAGGTA CCA CT A GGGTGT 

TGTTAAAGGACrTGATAACCAGCTTGAAGAGGTTOCTACTGACCAGAAAT 

GGAATGAAATTTAAGCATCAATAAGGGTAATAACTGCAAGAGACTGACAT 

CCACTATGGTTT AAATCCATGAGGTCACAATG ATACTT AAll 1 1 iCATTA 

TTCTGAAAACCAGTAAATAAAGGCTAAGATTCAACAAGCATTTATCCAGC 

CTTTCCTCAATOAAATATATCTTAAGAGAACOGAATAGTTAACATAGAGA 

CATGGCCGGGCAAGGTGGCTCTCGCCTGTAATCCCAACACTTT^ 

CGAGGTOGGAAGATTGCTTGAGCOCAAGAGTTCTAGACCAGCCTOOACAA 

CATGXnGAAAODCTGTGCCTACAAAAAAA AAAAA AACAAAAA AAAAOT OC 

CACTTCCCGTTTTTACTGTAOGGGGK3ATAACTTTTAGGAAT^ 1 1 1 I 

G AA T A TT A TTTCTTG AAT AAAGCA TGTGTT AATGGTT A A AAAN ACAAAAG 

ATCAAATAATAGAAATAATAAGGTCOCTCGGCOGCTTAAAAATAAGGGGA 

TCOCCGGCTGG AGGAAA TTCATTCAAGTTAATG AT ACXXTTT ACCCTTA GG 

qOOOGGCCOGTACCAACT 11 1 1 1 IUL I 1 1 AATGGG 

>Sequence 175

AATCAAGCXKZATTATTtXrrATTACTGTACGTAATACATOGACGTCTGCT^ 
CTCAJ4ATTTTTACTTTATTATATATGTACACTCACTCTATC TATATATAC 
TATTATTGTATCTATGAGGCTAThrrATATATrrA>n«WNAAO ITTTCTGTO 

cgcgacoggocaggtaccaaaaccttjGGGattaagctaagaagt^ 

gagagactcrgtggacgtaaagaaggoaatgaacacagagaaactttcao 

ccagattcctgagtgtcacctgaacaagaaaagtc*^ 

CATGCAAATGCAGCGTOTGTGGGAAAGTCl llOCOGTCATTCATTOCTG 

G ACA GGG ACATGAGA GCTCA TGCTtXJ ACACAAA.CG ATtTTGA GTG TGGTGG 

GGAATTXiAGAGAGACGOOOCGGAAACAGAAACAAGATGGGAAAGCCTTCA 

TTTCOCCCAGTAGTOOTQCAOOGOGCACAGT^ACACCAA 

OCTTA TG AATGGAAGGGGTCCGGGAAAGOu 1 1 lAAl lClCCCAATTTATT 

TCAAATCG^TCAAAGAAACTCACAjCTGOA 

GAAAAAGGTGaGAGCCTTT ACAGTTTTCA G 1 1 ICI II I OAAAACATOGAA 
AAATGCATACTrTGGGAAAAAACGCTATXJAATGTAAATACIXjrGGAAAACC 
TAATCGGTrATTXXAGGTTATTTTAAATTCATGTTAGAAATAACACTGGG 

gaaaaacctaccaaaggtaaccatgggggaa aggcttta rrm cogaggg 

TACCTTTOGGCACATTGAAATAAAACTTAAOCGGCTGGT 
>Sequence 176



attgtgggaagtctt 
tcctitctcgttcc»cjgc<^ 
gctcksagtccatcaactctaggctxx^ctogttatgaaa^ 
gcct 

>Sequence 177

CCCCCCCG TT ACCCGACXiCCGTCCKXi ATTGO AACTtXXX 

ogaggtact i i i i i i 1 1 1 i i i i i i i i atgaatt atttat men ictca 

GAAAAGGATGCGCtnT^CACTTAGCAAGGarGGGCAGGATGTG^ 

TCTCCCCACAGACGGGGT O G1TCT A GA 

>Sequence 178

TGGGGCGTTGAGAC 1 1 O U POOC C 1 UKXKXXX^XGGGCAGGTACCAAAC 

CATTTTCACTAGTTX^OGATAOOAATATTCATCAGATTGTCT^^ 

GTGAATCACAAAAATTCCACCTP G T G T A GGTGTCGGACTGGACAG 

GACAGGGOCGTGGGAAGAACAOAAAOCACTTrTCCIX. 1 1 ICltX lO AAAT 

ATCAGAAGTTAAAAATCTACTCTGAGTTATATGTGCATCAATTTTAGACA 

TATTCCTGATTTTATTATGAAAATGAAGTGCTAAAGACAAAGGATATTTC 

CATTCCTCTGGAC^GGCAGOCACAGACCAGCACTGCrTG 

TACACATOTGTGCTTTOTACCT 

>Sequence 179

TGGTCGTTGTTGCXjGGCTXXXXjAGGTACTCACAGT^ 

TCTGCGTGCACOGCTCTCCAI ICI ICI I G OCTTTACACGTTOOCAOO 

TCAAGAGCTTCACCCATAATTAAGACCTTCTGAGC^TC 

ACACACC I OC1 CTGAACCATOCTTGGOCTTCATGGGGTTGGCATTGAGGA 

TCCCTAOGACAGTCCOCTGCTCOXi If I llXAGAG<XiCTTTGTGAACTTCT 

CCAAA TAAGAACAAGGACACACATTGTGTCAGGTCA CO AAGATCATTCAG 

TTTCCAT A TGCTG A A GG TTTTTCCACT A TTCACA CTCTGTGGCG T A ACCT 

TCTTCAATA T AA OCCCAAA 

>Sequence 180

TGANAGATTIGCOGNOGCOGOCGAAAACTQATCAOACTOTC^ 

GGAAAAG ATGGCCAGAGAGAAGCTGG AAGAAATAG ATTGGGTGACATTT G 

GGXnTATATTOAAGAAGGTTACGOCACAGAGTGTGAATAGTGGAAAAACC 

TTCAGCATATCClAAACTGAATGATXrrrCXiTCACCTGACA 

C1 1 G M CT f A TTTGGAGAAOTTCACAAAGCGCTCTGGAAGACGGAGCAGG 

GGACTGTCGTA GCGATCCTCAATGCCAACCCCLATGAAGCCXLAAGGATGGT 

TCAGAOOA G G I U I O I H ATCTATCGATCATOCnt^GAAGGTCTTAATTAT 

AGCCCTGCACOCAGACrGTQAATTTGCGTGACTGTGAGTA 
>Sequence 181

TGGATATGTOCATOGGGGGOGGOOGAGGTACTCACAGTCACGC 1CCTCTG 

AACCATCCTTGCKXrrTCATGGGGTTGGCATTt^ 

CXXTGCTCCGTCTTCXIAGAGCCCTTTCTGA^ 

GGACACACA TT G T G TCAGGTCAOGAAGATCATTCAGTTTX3CAT ATGCTGA 
AGCTTTTTCCACTATTCACACTCTGTCGCGTAACC^ 
CTJAAATOTCACC)CAATCTATTTCTTCC^OClTC 
CTTG ATCTGAGACAg I C I GATCAG 1 1 1 1 

TGGATACTGCAATCGGGGGCGOCCGAGGTACATGGATACXi ITCTCTTCTG 
GOGOCGGTCTCCA GTCC-rn CTCATGAGGOAGCACACT^^ 
TGCAGTGGCCTCAGGG A TATGGAATTAACATCCACCTGGTG TGATGAATA 
AXCCCAGACTCTCAGCXACOCAGGAAAAAAAACAAAAACTGGCTTJGGGAT 
CTGGAGTAAAGGATCCTCACATCCACGTCAACCAGGAAACTCTG 



>Sequence 183



iTACGCGGGGAGCGGA 



TGGATATCGAGACGTCTATt 
AAGGG AG ACTGTGGGGAA CT A < 
TGAAGG^TGTATGAAAAAGATTAGCAGTGTGAATCTTGAXIAAACTTATAA 

atoacttctcacagatagaaaagaaAuATGgtagaaaocaatggaaagaac 

AAT AT ACTGGA TATTCA GTTGGAAAAAA GT AATTGCCTA TT AAAAGTAAT 
GCAAGCAAAGG AGGTCTCCATTAAAG AAGA ATGTGCTACTCTTCATAATA 

TAATAAAAGGGCTACAACAGACCATTGAATATCAACACiAATTTOAAAGGT 

GAAAATGAACAACTAAAAATAAGTGCTGATCTTATAAAAGAGAAGTTAAA 

OTCTCATCAACAGGAATATAAGAATAATATTGCCAAACTTCT AAGTG AAA 

TGAAAATCAAAGAGGAGGGATATAAGAAAGAAATAAGCAAACTTTATCAO 

GACATGCAGAGAAAAGTTGAATTAAATGAAGAAAAGCACAAAGAACTAAT 

A G A G AAAA AGG AGATGGAAA TTCAKAGTT AAATGCAAAGCTCAG AAGTCA 

AAAAAAAAAAAAAAATOAAATAATCAAGCTACAACTAGAANT TGATGCC A 

AACTAGCAAG AGTTCA GACTA AATCAAAATCTATC AGG ATTTACTTGTTT 



TGGATG ATGCTCATCGCGGGGCGGCXX3 AGGT A CATGG A TACGTTCTCTTC 
TGGGGGCGGTCTCCAGTCL'TI I CI C ATGAGGGAGCACAC'HJj i*^ 1 i*JC I C 
A TTGCA GTGGCCTCAGGGAT A TGG AA TT A AG A TCCA CCTGGTG TG ATG AA 
TAAACCCAGACTCTCAGCAACGCAGGAAAAAAAACAAAAACTGGCTGGCG 
ATCTGGAGTAAAGGATCCTCACATCCACGTGAACCAGGAAACTCTG 



GCNNNA TG ATT AKTCCTTACCGGCCGCCCGGCAGGT A CGCGGGGG TGTCC 
GGCG ATXXjGC ACOOGCATTTCTTCGTTTATA GCTGTCTCTTTGCATTCTG 
ATTGCGAACACTGGGATCATTTTCATCATtKXXJAC^ 
TGTATCCCTTTXX>TGACC(XIACCTG7TJTCTA 

ACCAGCG A AGC ACT A A GT AA T A TOG A TG ATT ATG A CAAAA CX i i^i i GGA 
GTCTOCATTA OI TO QTC1 1 1 G CAATATCGTTCAGCAAG AATG GOGTGGTG 
CAATTt X rT TXj OCAGGTroTCCTTjGTGACAGACGGCl U 1 CI lOGCATTGGT 
AGA GGG TCA CTGG AACATTOCTTA CCCACTCA AACTT AACG AAGTG AG AG 
CAACCGGTTTCCACT ACCT T IICCII 1 CCCATCTAACTT ATATAOCAGGC 
GCGGGCOGGAATTGGAGGGAOCACC<3CGCCCCTXrrTCTr^^ 
AAATCTATTATATATTCACACATTTGTAGGOGGCCATATTATAAl lotGG 
CXXXXXCTGTCTCAAAAAAAACTCCXrTfCGGC^ 

CCCCGGAGGGGGAATTAAAATAATCTAACCCCCCCCCC*CCGOGGGGCCCC 
CCCCC Cl ' l llllll 1 1 AAGAOAGGACACOGCCC 
>Sequtoc. IU 

TGGCKXGATGGAAGCGCIT^CCGOGGTGGCGGCCGAGGTAC^ 
OGCAAATTT^CAOTCTOCOTOCACGGCTtnTJCATTCTTXTTCT 
ACAGGTTCCCAGCTCAAGAGCTTCACCCATAATTAAGACC TTCT GAGGAT 
GATCGATAGATAAACACAOC I CC I L I G AACCATXXnTGGGCTTCATGGGG 
TTGGCATTOAGGATCCCTACGACAOTCCCCTG<^^ 
TTTGTGAACTTXrnXAAATAAGAACAAGOACACACATTGTGTXl^GGTCAC 
GAAGATCATTCAGTTTCCATATGCTCAAGGTTTTTCCACTATTCAC^ 
TGTGGCGTAAC C'l ICI 1 IA ATATAACCCCAAATGTX1ACCCAATCTATTTC 
TTCCAGC TTCTCTCTGGCC A T CTT1 1 CC TTGATXTTG AG ACAGTCTG ATCA 



>S " ,U<,,C * TTGC C ICAi^, >Gu rGoOoG*>AXXXGGGCAGGTACCAGAGAT 

TCCAGAGAGTGGTCTTTGG AATTTCCCAAC1 CC 1 1 [UCTTCAGTGCCCTG 

ATCTCTGAACTAACAAACCAGAAAGAAGTaGCAGCATCGACT 

CAGCACAAAAGCATACTCATGGAATATTTCCCGTAAATACTC<^ 

CTACACAGACTTAGTTKiCXlATXX^GAATAAAAATGAAATTGATTAOCTCA 

ATAAOGTCCTACOCTACTACAOCTOCTACTACTGGATT GGGAro CGAAA^ 

AACAATAAGACATGGACATXXXTTGGGAACCAAAAAGGCTCTCACCAAO 

GGCTGAGAACTGGGCtTJATAATGAAOCTAACAACAAAAGGAACAAOG^ 

ACITJCGTOGAGATATACATCAAGAGTCXXTTCAGCCOCTG^ 

GATGAG<^CTGCrraAAGAAAAAGCACOCATTGTGTT 

ccaggatatotcctgcagcaaacaagoaoag I GCC 1 ogagaccatcggga 

ACTACACCTGCTCCrGTrACXXrrGGATTX^ 

GTTJAGAC^GTGTGGAGAAOTGAGCJTTOCTAACACCT^^ 

AGCCAACCTCTTWIAAACTTCTyJCTTT 




CAC1TGACGGTACCTTGGGCGWTCTAAGACTAAGT 



GGAGCATCTOCAKWnTHTTITCAAKAMGOGACTOCACOOCOGTOOOGGC 
OOQOOOCOCACGTACTI 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 KJTAACTACAGGTGT 
CAOATOCATCACAAAAOCAQAAOTCCCCTTTCAGC rCTTCTCTO roocAT 
TO C TT O T C AATTTCATCCTCCCTACAOCAACACCATAATACTOCAAACAO 
OCATOATCTCAWCTCGAAGTGW1CTCTGTGATTCACAGACAGCGACAOCT 
OCT AOTCAACAG C TOT CC TOCTCAQAAOAATATCAGAACTCAACTCGCTG 
TGGCTCCAAGGGGCTCAA I CXXT 1 C ATTTG AGGGG AGGGA TON AAA TATT 
CTCIXX>TOAAGACMGCKACCXXJATXXJGAAGTGATACTACGTATXn 
GATCCTCACTT AOCTCT AAATGTAAGTTAGACCAGGACAGCCAGAATCAC 
CGAA OG IU l l^ l T AAGGTXXtriClUTAACAAGGCCCTAOAA GOOOCAqA 
AA TO TNGGTGACA OCO A GACACNATTTCTTAAACTCTTACAHCTTGTTCT 
AAATGAGTAAG AAAGGTGACAN I rrCTTTTGGAAAATOOOOCTOOOCAGC 

c ci nren icoc aagaaactcagttattcaai 1 1 1 icc t gg t g coctaa 

CATACACTAXnTOOCTTAAGATAAAACACTACCTACTTGCAACAAAATCA 
TMAGAAGTGCGAG AGCCA TT AOCAAGATGGGTTACCA T AAGAATTAAAAA 
AATATTATTGCAAAAAAAATAAAGGTrcTA\AAAGTTAA\AAAATGGCATTA 
AOATOGTAACTCTrAOCTAATTOOCTAAAAATXJGCTTGTTATTAAOOGAA • 

ooggcttggtacaaaacaccg room i a atctacccqgaaa c rrroorc 

TTAA C I 1 CU .1 1 UC I LCCTG ACAATCTTAAAT AOCT 
>S*xfxxxx in 

OOGtMXAGGTAOGOGGTWAAGGAAAGCAGCTGCAAACTTOOCATCTOCAG 
I G H "I G I irOTCl C OOCroOGCCeATCAjCTOOCAOGATTAOOOCTGOATO . 
AATTCCTCAGTOGAAATATCAACAAGACTCAGCOCAOCTGCACCCAGGTG 
ATTAAAAAGCTTTATTGCTCACACAAAGI 



CiGGAGATCAATCCCCIGICClCCrOCTCI riOCICCOTGAOAAAOATCCA 
OCTACQACCTCTC C T C CT C AGACCAACCAGOOCAAGGAACATCTCAOCAA 
TTTTTAATCAAGAAT A TTCIGTGAAAAAGACTAACATATCAGAOAAATTA 
TT AGTGCACA TT A TTA GAA GA O AOCTTCA GA TGA AAAT AAAGATCAAGAA 
AAGA CIC I I OC I I IU AOAAOACACAAAOAAATCACATCATCTTATTGGGA 
TTACTGGCTACCCATATGCAGAAGATI 



TATAAAAACCCCTGCAGGACAATCTATOCAATACCATTCTOGACATAT^ 
AAAAQCAAAOO ATTTCTG I G CAAAACACCAAAAGTTATTTOAACCAAAGC 
CAAAAATTGACTOOTOGGATCTAATTAAACOTOAGAACTTCri O ACAGCC 
AAAGGAAATTGCGGCCGAGTAAATAGACGATCTTAATAATGGGAGAAAAT 

attjgcaaactatgctatcttcaagggct^ 
ran rocAAATTOoc 

>Sequence 190

TXlAATTJATCTGATCGCOGGGCGGCCGCCCGGG<^GGTtKX^TOGC0GTCC 

CATTGCTCACACWOACTGGGAAGGCaATGOCTOGCOGGAOC^ 

AGAG ACTCOGGATG AC1 CCT G CTC A GATTCAGG O C 1 1 GC I CAGGAAAGOG 

QAAAAOT J 1 GO 1 C OAOOAGTGATAGCOGGACTOGTItjACATTOGGG AAAC 

TTTOCAATGOOCCGAAGACTTAACTOOOQATQAGGTTGTGGAACTAGAAA 

ATCAAGCTCTAOCCTGATXiCTACAaAOGAOOACATCAOCTCACACATGGA 

AAGOOAOGAOTTGAATOGTOCATACAAGGOCATOOOOGTTGOOCAOGAOC 

TGAACGOGOCTTCTOATTX?OOACAGOOCTGGOAAOOACAOTTATOAAACG 

ATrATATAAOOGOAAAGCCAATQATOAOAGCAATOAOCATTOOGATGTGA 
TIX^TAGTXACGAACITTCCAAAOTCAOCCGTGAAT^^ 



ATAACAOCTTOATTTTCTATTTN 



TOOGAAOTGATCTAATCOC i CI A< 



AOGOOOGGGCAGGTAC 
TtXXTIGGAAAGTXXlAGCTGAGAAAGCGATCCTGOCt- IC1GC lULIOCCAG 
GCTTACOCICCTG TA A GTC1 fCTGCl TAGTGTTCAGAATTGGOGGATOCT 
GGGACTGGGCAAGGACTTCTAGGCAACACCCXATAOCCTGCI CATGCCTG 
TTGGGTTOCCTATGGATCA TTCOCTOCroOGCTCACTCAOCGGCTTCGTA 
TAAO OIOJI I 1 1 1 U AOGTTTATTA 1 1 11X1 I O I CXA TATACITQATGCTC 
TTCATTGGCTTGTtriGGGACCTGOCTT AGGTTCTOCGAGOCATAAAAGGG 
CCGGaCaGCCCCCGAGTTGGGGOAACTCTGAAOCT^ 
CCnGCICATCTT AAAAA1CCI ICAGtii 1 1 I AGaX. iGIOCOOCCAA GACA 
AGGATTTTTOCAGAATCTTCTACTTCAGTAGTTACTGGTATGA 
CGGOyiCTT CTCOCTGATOOOCAAGT<20CAATTACACOAACTOCAAOCGG 
I ITCCfTCTHOOGOGTAOCT 
>Sequence 192

GAATGATGaAGCCCTCT ACCGG GTCGCXiGCCG<!X^GGGCAGGTACTTTTT 
AATTTTCTACTTGAAACG 111 1 C 1 OCX I C \ IJL-ACTGAGGQGOCAAGGOOCT 

GGAaatgtaaagggccaatci nun acagaggggttcattocagtgaag 
ggccggttctcouu^gacaaacaggtctcacagatagi iuoxxcocgta 
car 

>Sequence 193

ACTGTACAGATCTAGTACTTTATCATACTTAATACGTGT^ 
CAACGATTATCTGTATACAATTCTATAATTTATATAGAATATCTTATAAT 
GGTTTGT ATA A TT ACGTTTT ATT AAAA TT ACANNT A>fNNATGGOGOGTTO 
AATTAGATO<X CCTATCGGG>^^ 

TAGCCTCCCGC 
>Sequence 194

CGOOCA TCTTC TOTCT ATAO TTAAA TCA TCA TCTCTGAG ATCA CT ATTAA 
TTGTCACCGTATTCGAA1 MU I I CAGATGATGATTGAACAATAGCTTATG 
TOATATCATGTA CGTCT61 rocn IC ' l C A ANCOrrTOGCCKAGATOATTT^^ 

ICG^TCIOCGCTCAAi i FCCAAGAGOCAGC1 1 iuAAGCCAAGTGOOCCCG 



TOCTCAOCIGOCI I <XAAAGGAGG AACAGATOGGCAA GTGCTCGACGOGT 

GGOCGAAAATGCTXiCCGAAGAAAGAAATAAAAACCCTOAAACATGACGAG 

AGTGTTCTAAAGTGTGGAAATGOC 

TACTTTGAGCTCATAAGC 
T^GGTATAAAATATCAAACATTTTGACTOTTTAA^CAACTC 
TTOGAAAATTAGAAAACATTATACAGGTGACTTAATTAATATCTAC^^ 
ATTATACACAACACATCATGCTGAAGATTTAGATn^ATTTOAAAACAClT 
AGTCTAATTTATATTAGTGCAOAAAAATCA CATTC AATAAACCACAATTG 
TAGAAO AG ACAGAT AAG I O I G T 1 1G I CA CATTTTCACACAAATATAATTT 
GATATTTAATTAAGGGATGATGAA 
>Sequence 197

TTCTATCOTATGTATATATCTATACAT0TCTTATCTATGT0TCTAT 
TA I 1 IG1 1 1 1 IG CATCTATATTA I 1 1 1 1 1 A ATGOGTGTATATA7CTATWT 
ATTTTGGTGTATOOGTTyroGhKnXKlCOOCOQATOTAtX,' I OCCTCACAOT 
GCAGOOCQOTATGCCGCXZAAACGCTTCCQCAAAGCTCAGTCT^ 
GGAGCOOCTCACT AACTCCA TG ATGA TGCA 
>Sequence 198

CTTGCTCAGOC I nCX>OOCCCCTCTGATOAOCTCTrTAATCAGCAGOAC 
GAAGGTGTGXATGTGGGAATGAACATGGATCCATCCCATTGGATGGAGAA 
GAAAGGTGG ACAGCCTG TT CO TC I C rCATGTCAGCCTAGGGCTOQGAACA 
GTTTTjTGAGGACTTATCTGTTGTACCT 
GTACTTOCTCAOCC I 'n O CAGGGDOCTOGATGAOCTCTCTAATCAOC 
GACCAAGOTOTGAAOTXXXIAATGAACATGGATDCATOCCATTOGATGG 
AA GAAAGGT GGACAOC CT O rroOTC TTJTCATGTCAGCCTAGGGCroGG 
CAG1I rOTGAOGACTTATCITJl IGTAOCT • 
>Sequence 200

AAAGATOOOCAOAOAOAAOCTOGAAO AAATA GATTOOGTCACATTTOO 

\GCATA TGGAAACTGAA TGA TCTTOGTGAOCTGA CACAA TGTGTGTCC 
1 1 Id I AH 

c 



>Sequence 201



0 TO G TT G I T C I A CTAAGTATATTAO G TGTICTT A ATCTAGTATTATAC 
yrnCTAATATACTCTCAATCTTA r i f lGI IA TATTATAATA I HUH 
rATATTATTATTACATATXXlAATANAT04ATT>n"ATGGTAGTTCTCGG 
3 GCGGCCGAGGTAI 



AC I U I Q 1 1 1 ATCTA 1 1 ( CATGTATCTGTAA 1 1 C I A TTTA TCTATCTAT 
VA ILT 1 1 I M A 11 II I TaTT CTATTTTATCATATA TIG 1 Tl T ATA TAT 

GAGAGGGTTTCAGAGGATCCT IG I GAAACACTAGTTAAAAGATGACGA 

jGggagaaotgcgaggaaagaaggaaattagtctgactggctttctgt 
iocaocattgattcaatggagactggcoooaogaaatxxsaagactagg 

ICGAGATCGGATGGGTCGGGCAAGGGATGGAAAGGAAAAGGCAGACAA 

vatgogttccatttataacaagtaatatatatcaaagacttaaaggag . 
raaagaccaatcagaataatttggcaactttaattcttagoaagatxa 
gttcoctocaaaoctaatttgatgttttattact a 
atggtacctgcccg 



n I CI U 1 1 ICAA 1 1 1 1 CI CA t aatggatctattta ttgt actgttta t 

TTTCTATTTAT I '1 T TCI AAATrATTTATTATTTTTATATATATTAATT 
ATKTNCCKCTTKTTO 
GOTACGCGGGGAAG 
5 c i g n o ottooooocc otoo oocrocta 

aaagaagacgaaaaagtcgctggagtcga tcaactct aggctccaact 
ttatgaaaagtcggaagtacct 



GATGTAGTTGATGCXXrrCACCGCGGTCGCG^ 

TCTCAaATCAAGGAAAAGA TGGCCAGAGAGAA GCTGGAA GAAATAjGAT 
GGTCACATTITJGGGTTATATTGAAGAAGGTTACOCCACAGAGTGTtlAA 

gtcgaaaaaccttcagcatatggaaactoaatga Ten CG 1 g acctga 

CAA IUIOIOIU ^I IGI I CIIA TTTGGACAAGTTCACAAAGUUCILIGG 
GAOGGAGCAGGGGAL 1 U J CG I AGOOATCCTCAATOOCAACCQCATGAA 
OCAAOGATOGTTCAGAOOAGGTGTOT1 1 ATCTATOGA TCATOCTCAGA 



ATGTOX1 1 1 IGAAGCCTCTACCGGGTGGOGGCCQaAAACTGATCAGAC 
TCTCA<GATCAAGGAAAAGA TGGCCAGAGAGAAGCTTXiAAGAAA TAGAT 
GGTGACATTTGGGOTTATATTQAAGAAGOTTAGGCCACaGAOTGTGA 
gtggaaaaaocttcagca ta toqaaacigaa TGATCTTCGTGAjCCTGA 
jCAA I L» 1 Li I U I IX \ I G i 1 L 1 1 ATTTGGaGAACTTCaCaAAGCCCTCTGO 

XXaOGGA gcaggggactgtogt aggqa toctcaa tgocaaccccatgaa 



gcccaaggatggttcagaggag o I G 1G 1 1 1 a tctatooatcatcctcaga 
aggtcttaattatgggtg aagctctttmcctgggaacctgtaaagccaao 
aagaagaatcgaoaoccgtocacgcagactgtoaatttgcotgac^ 
ctacct 

rrAcrcACAGTC 



CT ACGACAGTCOCCTGCT C CGTCTTOCAGAGCGCIIT G IGAACTTCT CCA 
AA T AA GAACAAGGACAGACA TTG TOTCA GGTCACGAAGA TtlATTCAGTTT 
CCATATGCTGAAG O I 1 1 1 lUU ^CTATTXACACTrTOTCOCGTAA CCTTCT 
TCAATATAACCOCAAATOTCAOCCAATCrATI ICl 'IOCAGCTTCTCTCTG 
GCCA i e n T T CC T r GA TCTGAGACA G T C T G ATCA G 1 TTT 



>Sequence 207

TGGATGATOAATTCAG 
G l I C f 1 CC I AGAAAGTG G I TCTTCCTTA ATCTGT 
rCTTCTTCl 



got a catc 
tctttttaccccttt 
ttttcttct 



TTCACAGA" 

IPC rCTTCn C A ACTG AATAGOGTAAGTGTAAAOGCACAACA AATTAA CA 
CTGTATCAGATCTCA 1 1 CCT I CC AAAAACGTTTGAGT CCTA G I 1 1 1 11 1 C 
TGTCATTCTCATCAACTACOCAA PGTTTOTnTOTI I ATTTT AT AATTGG 
GAAGGTTCTCCAAGGOCTACCACTAA<rrTTAACGAATGA TAT AGATAGAG 
CTCAGAGCAA 1 C 1 1 C 1 C aCGaTCATGAAGTCATGTATAAAAATCAOOATT 
AAAACAAAGGTCATCTGATCTCCAATCATTATTGGGAAGAAAGTCAATTA 

tattagaaatwhtaagagcttgcactctgaagtcagacgc^ 
aatctac ctg ct o caaccctgaaaaattgtatttac oc t tgttro 

CCTATCT 



*7I 

3laT«hfc23 



ATGOI ICI roCTAGAAAGTGCI fCI ICCT 1AA IG I UI rTCTTTI l A CC CC 
II I 1C 1I C I ICnCnCA CAOAI G lT ICTICMCG lCr G CCACn 1TTCT 
TXTTTCCTCTTCTTCAA CTGAATA GGGTAAGTGT AAAGGCACAACAAA TTA 
ACACTOTATCXGATCTT^TTCCTTCCAAAA^ 

T 1'CTGT CA TTCTCATCAACTACCCAA rQTTTGTTTTOI TTATTTTATAAT 
TGGGAAGGTTCTCCAAGGOCTACCACTAA CTTT AACXi AATGA TAT AGA TA 
GAGCTCAGAGCAA TCTTCTCACGA TCA TG AAGTCATGT AT AAAAA TCAGG 
ATTAAAACAAAGGTCATCTtiATCTCCAATCATTATTOOGAAGAAAGTCAA 
TTATATTAGAAATGGTTAAGACCITOCACTCTGAAGTCAGACGGCCITjO _ 
TTTAATCTAX3CTGCTGCAACOCTGAAAAATTGTATTTACXA-' 1 IOG1G AAO 
CTTCCT ATTrTATAyAAACTTAAGAATGTCTTATCTTACTGGACT 
ATTTAAAAAGAT 

CAT ACT ATAT AAT A TTACGAT A TAA TCATTAT A TtXlATCTTCT AAACTTA 
ACTATGTATATAATTATAAAAATAATTAATACTACGATGAGTATATCTTA 
TGA TCAACT A CCAAA TTCPGT A TQA T ACGT ATCTCCA OCGCXXSCGGCXiGA 

cxxaggtacaccagataggcacatctocaaacacaaagaaggtgcgctgct 

G Cl I CI I IC TAT CTGCCCCTA C ACCAGG C I L X.I I IOLI IC ACGT AAGaTG 

aacaattcctcactgcctataaactgtagtcacatgtgggat 
gaacatgagaatcax^aacaatctgggcaaatggctatggcaaga^ 
acaccacaacaogacaoatgccaactctt^ttcatc^^ i i i ro oc 

ATATG GG I GCC 'I IC 'I G I OIC T 1 C T 7 T O C A CCTAI ICCI ICA GICTCAACA 
ATCTCTTTGACCCTGACCGGGCG 

GGtLlTCTCA ITT CGC rC A CO G C GCTCGOGGQ^GAOGTaCTCaCAGTCACG 
C ' lC CT CIU AACCA l CCn GG GCTTCATGGG Gr r GG CATTGAXK^TCCCT A 
A G AGOO CGGTOTOAACT 1 CT CCA AAT 
ATTGTGTCaOGTCACGAAGATCATTCAGTTTOCA 
TATGCTOAAGCTTTTTO^CTATTCACACICTGTCOCCTAACCTTCTTCA 

atataacoccaaatotcaoccaatctatttcttocaoc rrcicici oocc 

ATCTTTTTCTTGATCTCAGACAGTCTXjATCAGTTTT 
>Sequence 211

TCGGCTATOA I G I C G CI C A CCCKXXnXXiCOQCCGAGOTACTCACAOTCAC 
GCTOCTCroAAOCAT0CTlXXjOCTTCAT0GO<nT0OCATTX ^GGAT0 0CT 
ACGACAOTCCOL1ULIUJU1CI I UUA Q AOCOCGGTGTG AACTTCTCCAAA 
TAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCAT TCAOl 1 1 cc 
ATATGCTGAAGGTTTTTOCACTATTCACACTCTGTGGCGTAACJL 1 ICI iC 
AATATAAOCOCAAATO 
>Sequence 212



CAGTCTACATCTAGT N fCICTTl ICA TMATCTTGTTATAGATGTATAACT 
ATCA TCCTTCTCTTCACATATACCTTATTCCTOTATTATGG ATAT ACATA 
TATCAATTTACATTAGTTAOAATTTTATGTCTATAAACAAOCAA GACGAT 
GATTTC«AGCeCTTCACCCCC<jNGGCGGCXX^^ 

AATTTTTTTTTTTCTGTAGAGACOAGGTCTTTXn'ATGCTGTTCAOGCTCA 
ACTTCATGGGTTTAlTGGGGATGGCTAATGGATGACAT TOGOG GTGGTCC 
TTGATACXXAGATAAOOCCTCAGTGTGAAGCAGCICTTATI njICCTTGTC 
TTGAG ATT G CTCTGGAATGGAAATTAGG C l TIT rT G AAGOTGTGACCCTT 
TTTGTTCATTTCTTCAGCAG TTA CTTTTTAATTTTTAAATGTTTGA ca ca 
CAGTO^TXIATAAATGATCATTCAOCAATCACCGATTACTCTCCTTOCTC 
TGTTAAGTGTGACACTGTCOCTTTGAGAATXTIXkXXlACAGCrATXjTA'TOC 
CATAACCACACAOOCCAAAAAAAAAAAATTTATGTCTGGTTCCAGGAGTT 
ACCTTTTATGAGAAGTOCATTTCTGAAGAACCTGGATGTTCAGAGAACTT 

TTGGAATOOOCACACTTTCGGAGGCTTAGGTGGGCAAATAAACTGGGGTC 

AGGAGT 

>Sequence 213

TCTCCCTCG T A CTCG ATCA TCAGA GT AT ACAT ATGAG TGT ACTCT A NTAC 

TACTACGATCTCTATACTAAAGTTATCCTATTCACTTTAGTGCCATCTGG 

TTCT ATATG AAA CTCT A A T ATAATCAT AGCGTGTT A T AT AT AT A CTAT AT 

ACATTACCATGGOGGTAGATTCGAAGCCCTATCCGOGGAOGCGGCCGTTT 

GAGAAOCCACKXXrrCACCCACCCG<KKrrCTXrr^^ 

GCTGACTTGGAGAAAAGCACAAACACGACCAGTOOOGCGGCGTAOCT 

>Sequence 214

TOGCGATOTTTGATCGAGCTCACCOCGGTGGCGGOCGAOGTACATGCCTA 

CAOATAGTCCCAGCTACTCGGGAGGCTOAGGCAGGAGAATCGCTTGAACC 

CAAGAGGOGTAAGTTOCAGTGAGOOGAGATtlATGGCACTGCACTCCAGCC 

TOGGTOACAGAGAGAGACTCCATAAGAAAAAAAGAAAAAAAAGCGGGGCA 

AAAAGAAACAGATOAAACCAATGTCAATAATTTATTTTAACACAATATAC 

CTAACATATTTTTATTTCAATATCTAACCAGTATAAAAATlTACTTGTTr 

TGOCCTCTAGAGATAGTAAGCTCCTTAAOTAAACAGAAGTAATAOCTGAT 

TAATTAGAATTCCCAAOOCTCATCAAGTrGTGTGCTTATATAGAAGAAAOC 

CAGTAAATGT 1 TGI I UA TTGAAAO ATATTAATAC I C 1 1 GO' I GGATGAGA 

GTGAGOAAAA AOGT ATTAGTATTOGCTTTTCACAACOOCCTGGACCrGOC 

>Sequence 215

TTTTAATGTGCATCTOGCAGGGOGGNGGCGOGCGAGGTACTrroOAGTGC 
OC TOOl I 1 C TA AGAATTOOOOTTOA CTC1 1 ICI I IO GC1 ICKjC IO GCAC 
GGTAACCAGACTCCCTACAACTGCACTC 1 1 IXjTCI ITGTCATGGAAGCCG 

ogagogtagaggttccgggtgctctggcggacttgagcagotcactgggt 

OCTTTACACTTGTOAATTOGAAGCTTOCCAGATGTATCCTCAATGCATTO 

oc Acnc roooooo o r TOTTC AC ^ 

GCTGGCOkTATTCTCTACTCX^AAGAGGATXjCAACCAAGGG^ 
CTTTAOCACTO AAGATTATG ACATOTTTGAAO H-iIOI I 1 IG AGAGCAAGG 



GAACAGGGOGGATACCTGACCAACTOGTOATOCTAGACATGAAGCATGQA 

GTOOACKXXMAAAATTACGA^GAGATTGCAA AAGT rGAGAAGC tCAAA CC 

ATTAGAGGTAGAGCTGCGAOGCCTAGAAGACCTTTCAOAATCTATTGTTA 

ATG A TCTIOOCT A CATGAAGAAGAQACAAGAGOACATG 

>Sequence 216

gggtcttgatagatogagctccacogogtqgcooooga ggtact ttgoao 

TOOOCTGGTTTCTAAGAATTOCOGTTCAL'IC 1 1 ICI 1 1GOCI IC IUCIGO 
CACGGTA ACCAGACTCCCTACA ACTGCACIL"! I IOICI I TG TCATCG A AG 
CCOOGAGCGTAOAOOTTCOOC Cr I UC I CT G CCGGACTGTG AOCAGO TCACT 
GG G 'I CC1 1 I ACACTTCTGAATTCGAAGCTTGCCAGATGTATCCTCAATGC 

attoccacttctgcooo oot roi IC ACAOGCTGTCTGGTACGAGATCTCC 

GAOCAGTCTGGGGGOGCTGGCGGCCTGCGCAGCCACCTCAAGATCACAGA 
TTCTGCTOGCCATATTCTCTACTGCAAAGAOGATGCAACCAAGGGGAAAT 
TTGCCTTT AOCA CTG A AG ATT ATGACATG TTTG AA GTG TGTTTTG A GAGC 
A AGGGAACAGGGCGGATACCTG ACCACTCGTG ATCCTAG ACATGAACATG 
G AGTGG AGGCG A A AAATTACG AA 
>Sequence 217

TGGT^^ACCG^X3GACCTCAC0GCGGNGGCGGCCGAGGTACTATCAAACAA 
CATGATACAATTTAAATGTGTCATAGCAACTACTAGTXXjTCACCTGAAAT 
CCATmCUJLliXI I CACAGTAAGAGTnTAGCTGAATOAOTOOCCACT 
CATAGAGAGATTGCATTTCTOGC! ' I OCX* 1 ' I G CAGCC AT AGGTAGOCATGO 
GACAAAGTTCTAACCCAGOGGGGGTCCAAICII 1 TGGCTTOCCTGGGACA 
CACTGGAAGAAGAAGAATTGTCTTGGGCCACACATAAAATACACTDOGAT 
CAAGGATAGCTGATGAGCAAAAAAAAAAAAAAAAAAAAAAGTACCTGCOC 
O 

>Sequence 218

GCGGNATATGTGCGCTXXXXJCGGTGGGGGCCGAGGTACCATCCTGTTCCA 

CAGAGCCATTGCCTATTDCTAAATTOAATCCGACTGOGCGTOCCOCTCCT 

CGGAACACAACAGTAGACCTTAATAGTGGAAACATCGATGTGOCTCCCAA 

CATGACAAGCTGGGCCAGCTTTCATAATGGTGTGGCTGCTGGCCTGAAGA 

TAGC I C C I OCC 1 CCCAGATCGACTCAGCTTGGATTGTTTACAATAAGCOC 

AAGCATGCTGAGTTGGOCAATGAGTATGCTGGC I I IC 1LATOGCTCTGGG 

TrTCAATGGCCACCTTACCAAGCTGGCGACTCrcAATATCC^ 

GCTGCAAAACTAGGCACCATGGATATGTCTATTACTCOOCTTCTTAOCAT 
TCACATTCCTGCTCICTTACOOCCAACGTOCACAGAGCTCGA FGT1 CC I C 
ACAATGTCCAAGTGGCTGCAGTGaTTOGCATTGGOCTTGCATATCAAGGG 
ACAGCTCACAGaCATACTGCAGAAGTCCTGTTTGCTGAGAA 
>Sequence 219

CACTACTCATCTCATATAACTCOATTT OATCA TTTATACTAAATACTTCT 
CATTTTTTTT ATT ATTTTACT A CX1AAA.TCTTT ATTTCTT ATA TAAAATAT 



GOCGNTATTGGTGGTQ AA GACCCGT AOCAACAGTGGGCA IOICI ICIOOC 
>Sequence 220

GATATGTTGAAOOnTTAOAGAOGCTTTCCGOOGTOGCGGCCGAGGTACC 
ATX^TATCATOTATCCTGCTTOGACATlTTGGGAAGGGOGAOCTGCroTT 
TGGCCAATTTA TCCTA CAGCTOTGGACGGTGGGACCTCTTCAG AG AAGA 
TCIGGTAAGGTCAGCAGCACAGTOOOCATOGAAAAAGAAAAACTCTACAG 
CATATTTCOGAGGATCAAGGACAAGTCCAGAACXiAGATtXnCTGATICTT 
Crt3TXrrCCK3AAAAACCC AAAACTTG TTGATGCAGAATACACCAAAAACCA 
GGCCTGGAAATCTATG AAAO ATAOCTTAGGAAAOCCAGCTOCTAAOOATO 
TOC AICI I OIG GATCACTGCAAATACAAGTAICIGI I I AATTTTCGAGGC 
GTAOCTGCAAGTTTCCGGTTTAAACACCTCI TCCTGTGTGGCTCACI IG I 
T1TCCATCTTOGTGATGAGTGGCTAGAATTCTTCTATCCACAGCTGAAGC 
CATGGGTTCACTATATCCCAATCAAAACAGATCTCTOCAATGTCCAAGAC 
CTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGA

AAGGTGAAGCCAGTTTATTATGT 

>Sequence 221

CATOGA1 C I C TCT N TGTCCATCACTA TTTTG TAAT ATCO ATATTA TAATO 
TCGATAAGTA ICIH 1 1 1 (J ICTATGTATTTTATACTGTC TATCGA TCTATC 
TGTTATTAT>frAATA\ACKAKAHCAOA>OTXnTGACCATTrrCTGAGGOIC 
GTCXKXXXKKXIAGGTACAGCAACAAGAATCAGATOCTtrrTTAQAGATCCT 
CCATTTCATT ACTCT AACATTCTTCAA IG 1 GG I ICCAGCCACGCATAGTC 
ATATAGATACT ACATATTCAAAGATAACTTACTGAAGCTTGTTCACAG AA 
CTOCT G ATAG<ntnTCTTCCCCTACCCCGCACTrrrGOAAG 



TATTAOCOCAAATGCTCTrCAGGATTTAAATAACAATTTTTAAAAAGACA 
CTTAACACCACAAAATGG AA 1 1 1 G C 1 GGCATGACOCG AACAATAOGGTTA 
CTOCAGATGCTGTATTCAAACTGTATGGGTOCGTroAAAAA ATAG ATA TA 
AOCA rTTTTCl C ATAGACAOCATCTACTTTATCACCAATTCCTOGGAAGT 
CTTCTTCT ATTAGTCTOGGATAGTCTTTATCCATAATATGGCTAGTATCA 
TCATATCTCCAGACC I GG I i I CC 1 UAGAACAGGA GAGTCTT GCCTGTATC 
CTGAAAGTGAACAGCTOCACTTATCTTCTTAACI'1 CTITI GGAAG ACCCA 
GTTCAGATA III 1 1 1 1 GG GATAACCTTOCAAAATGTCATAAGCATT 
>Sequence 222



TCATCACTCACATTCAGT ATOCTCTCATTGTT ACTCT AATTACAATCGTT 
CTAATATt^CACHtXrTATTTCATAATATGTTATAACATGTTGACTT ATGT 
TCTAGGAGATATCACTT ATATTAATGCACTTAGTGGGGTTGATTCGAGTC 
ACACTCCGCGOAGGCGOCOGAGCTAOGCOGGGAGTGTAACTATGGCCGGC 
CTGCOGAACGAAAGTGAACAGGAGCCCCTCTTAGGCGACACACCrrOGAAG 
CIAGAGAATGGGACAT1TTAGAGACTGAAGAGCATTATAAGAOCCGATGGA 
GATCTATTAGGATITTATATCTTACTATGTTTCICAOCAGTXnAGGGTTT 
TCTGTAGTCATGATGTrcCATATGGCCATATClOCAAAAGATTGATCCGAC 
AGCTGA T ACAAGTTTTTTGGGCTXK3GTT ATTGCTTCA T AT AGTCTTGGCC 
AAA TGGT A GCTTCA CCTATATTTGGTTTATOOTCT AATT A T AGA CCAA G A 
AAAGAOCCTCTTATT G TCTOC A TCITOATTTCOOTCGCAGCCAACICCCT 
CTATGCATATCTTCACATC^CAGCTTCTCAT AATAA ATACTACATGCrGG 
TTGCTCGTGGATTGTTOCOAATTOCAGCAU-m 1 ICAGAACT1GTTI ACA 
TTOCTTGGAGAAAAAGTGTGACCIGGGATGTGATTAAACTGCAGATAAAC 
ATGGTroCACACCOGGTTACTTAGCGCCTTC 
>Sequence 223

TGAGGTTGATTCGACTCCGNGTGGCGGCCGGAGTG^ 
TT0TOATCTOCAATGA I ICI I CCCI 1 CGAGGTCAOCCCATTATCTTTAAT 
CCTGAC Hil l IG I U GAGAAACTCCOACATOAGAAACCTGAGATrTTCAC 
TGAGTTCGTTGGTCAGCAATATCACAAGGCTCATCGATTTACCTGGAACTG 
AGTTGGCTCAGCTCATGGGGGAAGTGGaCCTTAACT^ 
GGCCCAGCATCAGGATTCITOC G< riCTCrC A TGTCTCT C AAGO^ 



TATACCAACTGATTOAGTATCTACACAAAAACITOCGAGTAGAGQCTTTG 
TTTAGAGTACCT 

tgoaatottcoa cct ci I1X UAGGCCXXXKXGCCCGCGCAGCTACTOCCT 

GTAAAGGGGAATTTCCATCCCGTCTACAG<1GATGACCTOA^ 

AOAGACCXIAGT G TCCICAGTATATCAGGAAAAAGOG TGCAG ACGTCrOGT 

TCAAAGAGTTGGATATCAACACTCATGGTGCAGTTAACTTCCAG<^GTTC 

CTCATTCTGGTGATAAAGATOGGCGTtMjCAGCCCACAAAAAAAGOCATGA 

AGAAAGCCACAAAGAGTAOCTGAGTrACICOGCCCAGAGGCTGGGCCCCT 

OOACATGTACAGACTCrCATTTTATOATGTATCCTACTGCATC 

AGGAGTGAAATACX^AGGACGCCATACTAGGAAACCCAGGTCTATTTGTT 
ATCAGAGTAAOGaTCAAOCCaGATAGXX' TO I IATGTAATTTCTCCGATAA 
AAGATTTTGAAAGCAGGTGCTGTGGGCATCTGTAltXXJGAATCG 
TAGAATTATTTTCATTTOTAAATATT7XK3TATCAGGCCAAGCAAGGGAAA
GAAGCTTTACTGTATTACCATCTTT 

TACTCACAGTC 



GGGOGATGAT 

ACGCAAATTCACAGTCTGCGTGCACGGCTCTCCAT 
TACAGOTTXXX^GCTCAAGAGCTTCACCCATAATTAAGACCTTCnCAGGA 
TGATXXJATAGATAAACACACCTXXTrCTGAACCATCCTTGGGCT 
GTTGGCATTGAGGATCCXrTACGACAGTCCCCTC<rTCCX3TC^^ 
CTTTGTGAACTTCTC^AAATAAGAACAAGGACACACATTGTGTCAGGTCA 
CG AAG ATCA TTX1AGTTTCCAT ATGCTG AAGGTTTTTCCACTATTCA CACT 
CTGTGGCGTAACCTTCTTCAATATAACCCCAAA 
>Sequence 226

TTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATGGATA 
GCCGCTTGCAGGAGATCCXjGGAG<XKM1AGAAGTTACG<^CACAGCTCCTC 
GCGC AGCAGTTGGG AGCTG A A AGTGCCG ACAGC A TTGGTGCCCTrGTTA A A 
TAGC AAAGATGAGCAGAGAGAAATTGCTG AAA CAAGAGAAACTTGCAGGG 

cttccta tg at a cctctgctcca aattjc aaaacgt aa g t a tctgga tg aa 

ggagaoacagatgaggacaaaatggaagaatataaggatgaactagaaat 

gcaacaggatgaagcttatcatcaattcattgtataaaaataaaoagatt 

ttcctgagagaactcatttcaaatgcttctcatgctttagataagataag 

gctaatatcactgactgatgaaaatg 

>Sequence227 

TGG I TGm C CNNTAtWATTTCAAGX^XnX>CCCCXKrrGGCG 

ggcagotacgcaaagtgattcagagaaccctgxxjg^tcacag^cgctgta 
gcaa acctgcaactc1tgagg a acacttaag acgccaccattcag aacac 

AAAAAGCTACAGAAGGTCCAGGCTACTGAAAAGCATCAAGACCAAGCTGT 

TACTAGCTCTGCGCATCACAGAGGGXKXM^TXMTOTTCCACATGGGAAAT 

TGTTAAAACAGAAATCAGAGGAG^CATCGGTGTCAATACCCTrCCTACAA 

ACTGCATTATTAAG AAGTTCAOGG AGTCTTGOGCACAG ACCAAGCCAGG A 

GATGGATAAAATG 

>Sequence 228

GCATAGGAAAGACTTGGCTGTTGGGAGGGGCGTGTCTTACACCTTAGGAA 
GAATCCI I A GCTGTACI I ICCIUICICICCIUOAGCICCCiC C I ACCCCC 
TAGCTGAGTAGGCCAGGTTTIGGTGCAAAATCTCCCACATTGGCAAAGTT 
CCTGCATATCCTGCGCAGTATGTGCCTTGAATAAAAATCCTGAAGATTAG 
ATGGTICAGGCTGCATCAT0OCAAAGCAAAG AGCAOC 1 C 1 1 IGAAGCTCA 
CCTCOCOGGGOGCCOGAGOTA C 1 1 1 1 H 1 1 1 1 1 1 1 U 1 1 1 1 1 rCAOTATG 
TAGCT7TAAAACAGTT ACAT ATAACATGGAACAGTATIG ACA TGAAAAGAG 
AGAGGTTTATAGAGGGAGAATTKjAATTC<K}ACAGCCCCTGCTTACCGAGG 
TTCKXCCTCCAGTCCTTGAI ICM 1 1 G<> ATCCCAACTTCCTGTITOGCTG 



TCTGGCCCATCTTTTAATTTTGA I II 1 1 IC H AGCAACCCOGTTAAAGGT 

TTTGTGGGAG 

>Sequence 229

TG A TG ATTtiA GACXnXTTCCGOGGGGCXXKXXi AG^jTACT A CA GGATGA TGG 
CT 1 rCTCTTXXTCTC<}GTCACAGG<^GG<HX^TOGAGTTGGGXSAGAGAAT 
GTCTAAACCTCTGG)GGGTATGAACGGGTAGATGAAATTATITGGGTt^ 

ITAOCTGOO00GGOGG 

TACTTNKTT 

TA IIIII lllllll I l lll 
C^GTTTTTACTTGTAAAAAGTATTATAGAAGAGTTTrATTGGAATGTTAT 
TTTATTAAGOCATTITCATGGGTTA l llllll l I AAAGTTTAAQAAGTTT 
TTACAACAGGCTXKX3GGGGGGGGGTTACACXrTTGGCAATC^ 
GAGGCCCGGCCGGGCAAAATACCTGAOGTGjGGAGGTTAAGAACOGGCCTC 
COCAAATGGGGAAA CC1 rTOTTTTTTCTI 1 A AATTCCCAATTAAATTOCA 
GAATTCGATTTAAGCTTAT
>Sequence 230

AOCAACTCTCCCTCCAOC ATClXnxrr TCA TTTTOOOQ OCATTroAACAGC 
AGTTGAAATCTGGAACTG CC 1 1 IO I IOC G IUC CIGC IGG ATAACTTCTAT 
CCAGAAGGGCCAAAGACCCTT 
>Sequence 231



teen roTOTCTroGOTCTcn roTGTC i ic i rAtci 1 1 rooi roci 1 1 rc 

TGTC1 rOC TC O TCI I r C TACl lll lll ICI AI! rOOTCTCACACTACAAA 
AMNCTTTATGCTTTTATCAACTCCCCGCGGTGGCGGCOGAGGTACGACGT 
TTCCATCAOL I TGT CT CT 1 1 LA TTCCCTC ATOTTAOGAOCAATaTGACCA 
TCTICI ti I A TTCTO GAAACTGACAACACGOGG C1T T TA TCTTCAOCT 

TGCA CTCaGTCOGA GCCCTCACCGCGGTGGCGGCCGCCCGGGCAGGT A CT 
TTA I 1 1 I I M l 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 IC I I 1 1 A AA AAAAAAAAAACATAT 
TTTAATA TATTCAGA TCCACAAA TATGA A ATAAA ACTAAGTAG AGCTGGT 
ATTCATTTACACATAATTATXnTATACDGTTTGGAATAAGAATTTGGGGC 
A CGTT AGCA AA OCA AAAGGCTCA A A A AGACGTCG A G ATA TTTAGTTCTTG 
TCTCCCTCT ACAAA TGTO AA GCAl 1 1 1 1 1 1 A TCCGGCATTOCTAGGOGAO 
TrOCTATTTTCAAATTTOCAAATCAl 1 rCTGGTOCTAAGCAATClCAAAA 
AAAACATTTACTAAAAAOCAGAGGAAAAAAATCTTATAACTTTGGGAGGC 
TGAGGCAGATCGATT^CTCAGATCAGGAGTTTCAGACCAGCTTCGOCAAC 
ATCATOAAA CC OC C I l l l lll 1 ' I A AAA 1 1 OC AAAAO O 1 1 ICC I fGOl lli l 
GGTGGCAGGGGOCTGGAGTCCCAGCTrrnXAAAGGCITAGGGAGGAGAA' 
TTACTTC AACC 1 1 1 G AGGCGGGGGTTOCAATGAGTTTAAA TC I OOOCTAT 
TGACTCCAACCTGOOAACAAGOOGAOA CI I TO 1 1 1 1 C AAAAATAATTTAA 
AAATTAAAACTTGTT 

>Sequence 233

ACACTTACAACTTCTACA 1 1 POOGTC ICI CO I I C 

ici ioi rem ixxjTcqttotai i i ic i rcxn ro cTCA i roi roi roocA 

TNAAATNANNNCANTAGCU 1111 CXXXTTCCCCGNGOT^GOCOOCCGCOOGG 
GCAGGACGCGGGGGOCAG IIL ICI I OGGGGACTAACTGCAAOGGAGAGAC 
TCAAGATGATTCC C 1 1 1 1 1 AOCCATG 1 1 1 1 C I C I A CTATTGCTGCTTATT 
GTTAACCCTATAAACGCCAACAATCATTATGACAAGATC1TGGCTCATAG " 
TtXn "ATCAGGGGTCGGGACCAAGGOOCAAATO1CTtjTGCOCfTCAACAGA 
T7TTGGGCACCAAAAAGAAATACTTCAGGACTTGTAAGAACTTXCT 
AAGTCCATCTYnXXMCAGAAAACXSACTOTOTTATATGAATGTTGCOCTGG " 
TTATATGAGAATGGAAGGAATGAAAGGCTGCOCAGCAGTTTTGGOCA1TG 
ACCATGTTTATGGCACTCTGGGCATCGGGGGAGCCAOCACAAOGCAAOGC 
TATTOXMCGCCTCAAAACTGAGGGAGGAGATCGAGGGAAAGOGAATCCT 
TACTTA CI 1 1 UGAOCGAGTATQ AGGCTTGGO 

TTCTO O r OTCl CI CQTACATATAKTOCATCriTAf AAA IT C ICTCTOTTA 
TOCTAOCCTCrrrCAAGTrcATCTATTATAAGTTGATOGTATTATTGTCTA • ' 
TATAGGATATTTTTTACATATTACTATCTCNC>0<CTCACAGCrAG7TGGA 
NOCATTTAgAOTOCTCTTCOCG<3UfcGOCOOO^^ - ' • * 

ago i mom i uuc rem 1 UACGCTTTATATATACOTAGACACACATAC 

acatgtatatatacacacacacAttttacatatatatatoaaactotata ■ 

AT0T01 IOUCI IC ACTOTCTOO CIUCI II IA CTCAACATTOTOAAATTAA ■ 
T rCC I G I T ATCCTATATGGGATTA AAATn UI I lOOCIAUl 1 1 1 IOCCII , 

ATGTCTGGAGAGATTTTTTATTTTCATAATTT^ ' 
TGTGCTACAGAACTT AGTAGGTAGAGQACAGGGTT AGTGCTGAAOGTTCC 
ACAGTACCT . 
>Sequence 235

IC I ULA 1 1 1 IC I I O I I A TTCT CAATACA 1 1X 3 0 HOI ATGTGT OGAGT TT 
TAIOI IOI I ATTGATOGACTOTOOGTOATC 



GG I I I CI 1 1 ICTATGTTAAOGGOCAOWCAWNK1 1 ICI I IOI ICG AGTGA 
00GO0GH0O00O000AGOA C1 II 1 1 1 H 1 1 1 1 1 1 II II 1 1 1 1 1 U 1 1 1 AT 
AATAA I 1 1 IOICA1 1 1 1 1 CTACAGACAAGCTCTCCCATCTT CCOCA GGCT 
GGTCTCAAACTOCTAGGCTCAACTGATOCTCCTAOCTOCACCT 1 ICOCIC 
CCAATTATOOCCAATTGAGAGATGAAAATTCTtMCAAGCTCTCAAACGTT 
AACTGACTTGCCCATAAATGACAGTTCCAAAGTTATAAGGCTAGAAC 
>Sequence 236



CCACTCTTAGC1 CO I I G CATATCCGAOGATA< 
TNCKCKTAI 1 1 IOIO GGACGCCTGGCCGAGCGGO 



JGGCAGGT 



acctaogocacagacagocagagggaaagcgaoocagacagcagcogctc 

CTCG ACAGGCOCACOCTGCAGCTCAGGCAOCAAGAA AACaGOCGATACTG 
GCAGOCATTGCAGCTCCAAACTYjCAOAGGCAAGGOCAATTTTAACTITTC 
AA TTTACAGTCGATTT1X5AAGAGCTTCTACATATOGGTTATGTAAATTCA 
TATATCTATTTTTGGAATCAGTTCTTATAAACAGCTCGATTC^GTTTTAG 
CT A AA TTT ATAGTCTA GGT A GT A TGTT ACA TTTG A ACTTTTGTCTT AA G A 
AAA GTTGACTGTTCA G A T ATTTTTCTA CTGT AAAG AAA T A T ACTTTTCT A 
TTAAAGATCTGTAOCT 
>Sequence 237

0CA G1 1 1 I UIO ATCTOCAATOAI I CI I OOC 1 1 COAGGTCAGCCCATTATC 
TTTAATCCTGACTT IT I T G T G GAGAAACTOCG ACATGAG AAACCTGAGAT 
TTTCACTGAGTTtXTTGGTCAGCAATATCACAAGGCTCATOGATTTAOCTO 
G AA CTGA GTTGGCTC A GCTG A TGGGGGAA GTGG ACCTT AA GTTGCCTGGC 
GGGGCTOOCCCAGCATCAGGA I ICI I OJGGTCTCTCA TGTCTCTCAA GCG 
AAAGGAAAAAGGAGTGATATTTGGGTCCOCACTGACGGAGGAAGGCATTG 
COCAGATATAOCAACTGATTCAGTATCTACACAAAAACTTGCGAGTAGAG 
GCTTTGTTTAGACTACCT 
>Sequence 238

COCTATGATCAOCTCAOCGOGOTtKjCOOOOGAOQTACGOGOOOA 1 1 G I O I 
GCAA AATCAG AG AGGGGTGCAAGATCCTGATTTrTCAGGAGTTCAAGOGA 
CAATGGCAGGGCAATAOGGGAGTATGAGCTTCAACCCCAGCACACCAGGG 
GCCAOTT ATO GGCCTGG AAGGCAA GAGCCCAG AAA TTCCC AATTG A GAA T 
TGTGTTAGTGGGTAAAACCGGAGCAGGAAAAAGTGCAACAGGAAACAGCA 
TCCTTGGOCGGAAAO 101 1 1 CATTCTGGCACTGCAGCAAAATCCATTAGC 
AAGAAGTCTGAGAAACGCAGCAGCTCATCGAAGGAAACAGAACTTGTCGT 
AGTTGaCACACC^G<X^TTTTOGACACAGaOGTGOOCAATGCTGAAACGT 
CCAAGGAOATTATTCGCTOCA ll C 1 1 C I C^ CCTCCCCAGOOCCTTATOCT 
CTGGCI I IGGTGGTTCCACTOOOO0G<nTCACTOAOGAAGAOCACCAAOC 
CCCAGAGAAGATCTTGAAATCTtTGAGAGAGGACTTGAAGTTTCT 
>Sequence 239

C1CI IOI IC I ICIC C OCA I 1 1 1 O ACTOCTAAAOCAOC I C I C 1 GCATAACT 
TCCA HOCI ICI I IA TCATCCTAAI ICI 1 CC 1A C I CI I CI O CIC1 I A TTC 

TTTCCcccho^CAjrrKKxrrroTT^ 

ACCAGTTAAOTOAACAGCTCOTCTAG OI CI O CI 1 1 IOI A ACACOCAAATA 
CAATTAGCACI rCTCTOCTGGTAl roOCI C OGOO GTCI I A ATTATCTAGA 
GOOCAOGAGGCAAAGCCT AGCAOGTAACAAAOTA 1C 1 OC1 ITGTAACTGC 

. TTCACACTATTAATACAC 10 1 C1 IO C I O AAGAGTCTQACCTGOGCAOAAC 



GG<^Gl^AAGATGAGGCTAGTGCTtXXrroAGTC O CAGTTAG<j I C 1 0 1 1 AC 
TGTTCTG n UCAACT ATAAATCCAGGATGACTGTT ACTCAGATTCAGTGC 
TATGTAGAAAATAGAATGCACAGGCAAAAACATAATTTGGGGATGACTGG 
CAGCA CC 1 Hi l l ICCCI 1 I C I I AAGAGGCTAACTG 



TCATTraTOAAAT TTTATT CATATTA 1 1 1 1 1 C ATAAACTOC ATAGT TCT 
1 1 CI ATCTCT ACTAGTI 1 Tl ATATTATCTAT T FCAAC 1 ICI 1 ATITICTI 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 OO TATOACTATOAAGGCTA O I GO ICI 1 1 1 1 A T- 
TAGCTATCAAGTTCATTTAACAGACAAAAAATTCAGTTCAATOGGOGCAT 
TAAAA TAGGAA G AATT AACAA T AGTTCA TTAATCAATCTTTCAGCTCTTC 
CTATTTTATCACAATAACIT Tl O CT A TAATTGAGAGATCCATGAGGAAGT 
CTTGAAAAGAAOGTA I O I I ICI I IC AATTOCATAAAACATTCAGCCAAAA 
TAATAAAAGAGGCGCTATTA C 1 1 1 0 1 II I OG GTGAATOATATGCAOOCTA 
GGCTTTGCTGTAGTACCT 
>Sequence 241

GOOOTOGOOOCOGOIOIO C IUI O C'ICAG CIOCC 'I IC CAAGGAOGAACAGA 
TOGGCAAGTGCTOGACGGGTGGCCGAAAATGCTGGCGAAGAAAGAAATAA 
AAA CCCTGAAA C A TG ACGA G AGTGTTGTAAA GTGTGG AA ATGCCTTCTTA 
AA GTTTA TAAAAGTA AAATCAAA TTACA 1 1 1 1 1 1 1 1 CCAAAAAAA AAAAA 
AAAAAGTACCT 

>Sequence 242

GATCAGA L PGTC1 CA GATCAAGGAAAAGATGGOCAGAGAGAAGCTCGAAG 

AAATAGATTGGGTGACATTTGGGGTTATATTGAAGAAGGTTACGOCACAG 

AGTGTGAATAGTGGAAAAACCTItlAGCATATGGAAACTGAATGATCTTOG 

TGAOCIXjACACAATCTGTGTOCI IOI ICI I ATTTGGAG AAG7TCACAAAG 

CGCTCTGGAAGACGGAGCAGGGGAC * 

>Sequence 243

TTACAGGCACGAGOCAGTGCGCOCAG CIOC C ICIOl I IC I 1 1 1 ATTAGCT 
CTTC1 O GAC roT G CCO C I CC 1 1 O OGCAGATGCTGTATTATGGGGATAAGC 
CACACAC Mil PGAACTGGOCCGGTCAGGGGGGACATAAGCA 1 1 ICCIO I 
GCXAO0CCATCAATCOOCACCTATTCTGAGTGTAGGCT0CTOO CC I GC T7 
GAGTAATGGOCACAGATCTTGGCTCGGCACTOCTAAGCrGCATGTTGAAT . 
TCCTGGGACAACAAGACTGGC 1 1 0 1 OO 1 1 CCA TTCTCCAGATOCTTOGGT 
TGGC TT C I G GC7TGCACTAGGAG ATtTTGAAATG C I C I C AGGOCACCAGGAA 
AGTACTXXMAGTAAAGTCTGACTCrAAAGAAGATGAAAATCTAGTAATTA . 
ATGAAGTAATAAATTCTCCCAAAGGGAAAAAAOGCAAGGTAGAACATCAG 

acagc rrOTGCn gtagttcicaatgcacgcaaggatctgaanagtgttc 

TCAOAAOACT ACTAGAAGAOAOGAAAOGAAOPCT07COCTGTAACTTTTG 
AGTGAAAAGACAAAAATGGCIC'I icotggtogcaaaaaag 



O IC I 1 1 1 IA GTAGGGATAA GG 1 1 ICA OCATGTTGGOCAGCT O GTCTTT A A 
CICCnSAOCICGAOOOATDCAOCCAOCTO GO^ 

CAATACAjGTACCACCACAGTTGCTAICTCTI UAACATCTTTCATTAAAAC 
ATCACCGTCTAGTTTGAGAATACTTTTAAGCC TOC I OGUC 1LA.I I TO GOO 
CA I 1CIII1 IICICI I I IC A GCACGCA IC I UCIIIIC CACTTACTOCCT 
AAGCTTTTACOCATGTTrTAOCTTGAGGGOOGAAGTTAACITCACSCtXXxA 
GTCAACGACAGGGGTGGGCTCCACTTTA TO^ 

CATCT00GGACTOGAGCTCAAGTATAGO0CAOOGAGTGTCAAOAAAOGAA . 
ATTCTTCAGGGTGGOGGAATCAAGOOCAAGTCOCATGTITACTGAOOGGG 



jgocaggtacaa 



TTTCTTGAGTGAGTTCA TGKTTtXXrrAGGAOiA 
CrnXACIGI I lC7ACAGTCCTOGCCAGCAA U 171 G aAOTTAAGOCTTCA 
AAAftXTtXAGCACACACATOOOOAAOGTATTOOOCAGGA TCI IOT0 0OT 
CICOI IOI AGTAGCAGT ACCCAA 1 0 1 1 lOIOOCIO CTATOAAGAGTTCAA 
AGGGGTOGTOCTGCTTT ATGTTCAGTGTTCCATTCTTTATTTTCrrCTGC 
ACCTtnTX^ IICllllCIIlC CGTGAOtSCTAAAODOCAA CII II I I II 
TATAOCACOCCAAACACTTGAAGGGOGGAOOCITrACAAAGTCGC77ITTt5 
GAATAACCGCOGGAACGAAAA ll l llKtO CCCCOtX XJICIIII I ICI 1 1 



CAAC OCCCCAA TTTCCCACAAAAGAGGGAGA TTTT I IGO OGCTAAACTTA 
CTOCATTTTTTAATGGGAAAATI 
GGGGCAGGGGAAAAAA iTTTTTl IOG COCCCaO 
>Sequence 246

O OT C IOGI rACACCTOOTOGACI IGCO I 1 C I O I A TGTTACTT ATTATTAT 

ATTTGTTACTATGTTTATAACTATCTATCTTTCAGTOCATCATAATAATT 

TOCTTTAOCA TGTXSTA TAGTAOTTTA OGTAAT CTTTro 

GCNAhfTTCGGo i iui ATGTCAGCCTKTCnsOGGTGGOGGTOOTGOGGATC 

AG OGTA GCTGAGCTTTrGGCCTTTTTX^AGGTGCTGCAGOCATAGCTAO^ 

GCGTTCGCTACGAGGATTGA GCGTCTDCACCCA ICI rCTGCCOGGGACCA 

TCTACATAATGAATCCCAGTATGAAGCAGCAACAAGAAGAAATCAAAGAG 

AATATAAAGAATA O 1 1 C I O I CCCAAQAAOAACTCTGAACATOATTCAOCC 

TTCT<X^TCTOGAICICI 101 I OGAAGAGAAAATGAGC TGI OCGCAGGCT 

TGTCCAAAAGGAAACATCGGAATXjACCACTTAACATCT 

>Sequence 247

GCTCT AAGCT AT AA CGT ACT AA TA TTTG A TCT A TTCA T A T ACA TT A TCAA 
TCACTAA TACACACATCAATA TACTT ACG TAT AA T ACACT ATCTTAGTTC 

CGGO0GAGGTACTOO0CAGCAAATATCCTTGGTGGG XJI IOC I lO ACTAGA 
TCAGCTGCTATAGTAGOCAATCCTC T T AGACTTGG ACCA I IOI I I OILIO 
AAGAACGGCC^TCTlTrCGCTT^XX3CTGAGCACT^ 

TCAGTCCCAGGGA C 1 1 C I CCA OTAGCGACAACTCTGCGGCOGOCGOCATC 
TTC 

>Sequence 248

TtKCGC CTATATCCAK CTTtXOGCGGTGGCGGCCOAGCT 

l l llllllHIIIIII I CIllll l llllllllllUlimil ACAGAGA 

CG AGGAA TTTAA T T A GC^ TTGT AA CAAA TGGTT AA TT AT ACT AAG AAAAA 

TAATGAACATCACATAATGAAGTTA CIUL I I IC CAGATCTATAAACAGGC 
Tt^TGTAACTAACTGATACTCAGTAAAAGGGTCCATAATCCAAATTTATA 
TAACAAATGGGGCTTt5CTATAAAATCTtnTACATITTAATACTTACTCrT 
AATAAATCATCTATTCTTCC CI CC1 I CI 1CI CI A AGGCAGAATTCTTACT 
Ol I I ICI A GGGCAGATA I 1 1 1 1 ICI A I lOlO AGGTCGGACTOG OlCIGIC 
TOGOCTGGATGGAGA ICIOI 1 1 1 rG GGAGCTGCAGGAATGCTCTCTGTTG 
CCAGAT00COTAAATGAGGGA C 1 0 1 1 1 1 UCTOA GCTGAACAAAAGTGAAG 
CAGG 

>Sequence 249

GATCAGACTG f CTCAG ATXAAGGAAAAGATGGCCAGAGAGAAGCTGGAAG 

aaatagatrgggttjacatttggggtratattcaagaaggttacgccaogg 
agtotgaatagtogaaaaaccttcagcatatcgaaactcaat^ 

TOACCTGACACAA IOIUlOlCCIIOIICIlA TTr0<lAOAAGTrCVU>AAC 
CGCTCrGGAAGAGGGAGCAGGGGACTtnCCTAGGGATCCTCAATGCCAAC 
COCATGAAGCCCAAGGATGGTTCAGAGGAG G I G 1 0 1 1 1 ATCTATCGATCA 
TCCTCAGAAXXfTCTTAATTATGGGTGAA GC 1 C 1 1 OA CCIOOOAAOCTGTA 
AA GCCAA GAAGAAGAATGGAGAGOCXTrGCACOCAGACTGTGAA 1 1 I QCOI 
GACTGTGAGTAOCT 



■ ■ CCTimxnATGCTTATOGCGOOCGGOOGqAGTCATra ~ 
GTGATCrGCAATGA T 1CTI CCC T TOGAGGTCAGOCCATTATCtTTAATCC 
GGACI 1 1 1 1 nCTGCLAGAAACTCCGACATGAGAAACCTGAOATTTTCACTG 
AG I IOOIOOICAGCAATATCACAAGGCTCATCGATITACCTGGAACTGAG 
TTCGCTTlAGCTGATGGGGGAAGTOOACCTTAAO 1 1 OOC1 GGOOOQGCTOG 
OCCAGCATCAGGA IICIICtXjUICIC I C A IO IC I CI CA AGOGAJUWlAAA 
AAGGAGTGATATTTGGGTO00CACTGACGGAGGAAGGCATTGOOCAGATA 
t ACCAACTGATTGAGTATCTA CACAAAAACTTGCGAGTAGA GGGTTTGTT 
Table 2

TAGAGTACCT 

TTAXCTCCCACATTGATTTCTCAATAAACA 1 MICH I C GATCAACAATT 
ATTCTAGTATAATATATAI 1 1 1 1 IOC I ICCG1 IGI 1 ATATTTATCACACA 
CAAAAAAATAAATGQG T CTTO ! CI OOATAACCTKTOOCOGONOCOOOOOO 
AOTACCACCACAAAOCXXXXX^GOCTCCTAAACTGCTX^ 
^^TACC^iGCAATtXGGOCTCOCTGACCGA 

GGGG>MCGTTGCTTCATCGCNGGGCGGCCGAGGT ACATTTTACTACGCAC 
OCTT ACOCAI ICI I 1 1 ICTC AOCICTOTG rCT OT CTOTQOGTCCACATQC 
ACACACACAAATOGGTOAAACAATTCTCACCATACCAAOAOOCAOOOCOC 

cctgcco a ga a tttgca tttct aaca agttcccagg tg a tgctg a cactg 
ctggctcatggaaccactgctgtagtattttcc^aattatcctgattcta 

AG^CCACCTATGACCTGTGCTGT^^ 

ACATAAA TICI I I f AGGATTCAAACA IGI I I G I O ATATTACTCAGTATTT 
A CA TCTTG CTTTTA CTGCAGCA TG A TGG AAAAATTAACCACA GCT ATATC 
ATAACAAAAAGAACATGAOTTACXIATTTTGACAAAGTTCAGATATATTTA 
AATTAGOCTATTTAA1 C I 1 1 1' > TTGGCTOGTCTTC AAATGG AGTCTCACT 
CTOTC I C I CAG GCTOC A GTAOG TGC IQfalll AATTGTCCAAGGCGGGTCT 
GGACCA GACAA C1 T nGTA>GGGCTOGGCCGTGTCTTTGGTGGTTOGAGT 
CGCTCTCCTTTGGCCC 1 1 1 1 1 1 GO I OGCCGG AATCGTGGCTGGCTGATTC 

aacagttcaaaaggaaatttogtggttagaacggcc 

>Sequence 253

i »H- nCGCXKXXX rroiCI 1 1 IGCCI 1 1 OUAATTTTATTTTCTCTGCTT 

ACAOCI 1 1 1 1 1 1 I ACA TAAT ACATAATTTTATTTTTTCGAATAATTTTTC 

TACCCACAAAAAAAATTrroANNAGGTTGCTrGTAGCGOrTCTOGNGGNG 

GO OGCOG CCCGGGCAGGTA CI 1 I I I I 1 U 1 1 1 U i 1 1 I Id A COGGTAHC 

CTATTTCAGATTTATTAAAAAACACATAGGTAACGAGtCAGAGCTTTOGC 

TAGOAATOAGTTGGAAAAGAACTGAAGGCATAATTOCACAGGACATTCAC 

AGTTG7GTGCTAGAGACAGAGAGGAGCAGGAAAGTGTTTTAGAAGCATTT 

GCGGCGGACAATGGAAGG<XCGGCTTCATCGAA7TCCTGT7TGCTGATCC 

ACATXntKrrGGAAGGTGGACAGAGAGGCX^GGATGGAGCCACCGATCCAG 

ACAGAGTATTTOCGCTCCGGAGGGGCAATGATXTTGATCTTT^TGGTGCT 

gggtgctagggccggga icicc1 ici gcattcoggcogcaatoccagogt 
accto 

>Sequence 254

TGTATATAGATAGAGCTtlACCGCGGTGGOXIACGAGGTACTCATGGTTGC 
TGTAAATCTGGOCGCOGTTCTGCAGGGTTATGCTTAGCCAGGCTOCTATG 
A^ATXnGGCrATTCTCTCTTGTGGATGGTCAGTGOOCGCGTACCIXjCCOO 

>Sequence 255

GJ>^AATDGTTGAGCTCOOOGCGGTCGOGG<XGAGGTACGOGGGGATrGT 
GTGCAAAATCAGAGGGGGGTGCAAGATCCTGATTTTTCAGGAGTTCAAGC 
GACAATGG^GCCCAATACGGCAOTATGAGCXTCAACCC^ 

A ™I2n A * 3TOWAAAA ^^ 

CATOCTTGGOOOGAAAG 1G IllCATTCTGGCACrGCAGCAAAATCCATTA 

GTAGTTt^CACACCAGGCATTITCGACACAAGAG^ 
CGTCCAAGGAGATTA 1 1 LXiCIUCA I ICI 1 1 1 UAOCTXXCAGGGCCTCATO 
CTCTGTTCTGGGTGGTTCCACGGGGCCGTTACACTCAGG<^ 
GOCACGANNAGATCTGAAAATGTTTGGG ~ 
>Sequence 256

GCCCCAGATTCAATXTrGTGGTGACGGTOGGATACGATXiAGGGACTACACC 
GCA CACCA CCACTTCTG tttaa tgttttg aa TCT AAACGTTG A GGTGGGG 
CTNCA CCATGTTGCCCA GACTGGTTTTG AACTCCTG AGCTTAAGCAATCC 
AC CTGCCrCGGCCTCCCAAAGTCTTGGGATC^

JCCGGCCTCATGI ICI 1 1 1 1 CATTAAAGAGAG AAATCAACTATTCAGGAC 
CGGCCOCCACCIIICC IC AGGAGTCA II 1CIG M C CGCACAGCCCTGCTG 
AACTGGGTGCTTTATATAGGGAAAGTGGGCCTCA 1 1 1 1 1 IG GTCCCIOIC 
CTGA AGCCTT AGGGGCAAAAAAAACXTrCCAAAATTGAAAAGGGTTTTTTT 
TTTTAAATCGGO AGOGGGGCCOC LC I C I I lOIGIC OOOOATTTCOOOOAA 
AAAAAAAAAAAAAAAAAAAAAAGCCCCCCCCCCGCGCGCCCTTAAAAAAA 
AGAAACCCCCCCCGCGGGGGGGATTTTTTATA 1 I 1 1 1 HI I I ACCCCC 
> Sequence 257 

GGAGATCATTGAGCTCXXXGCGGTGXKXXKXX5AGGTACTCTGACTTGCAG 
GG^^CAAGACCGGCCTTCCGAGCGTCGTTGCCIGATGGGAGTAGAAGCC 
ACAGAGAGTCTrcCTCTTGGAGGTACAGTCAATTCTGAGCTTTOGGCGTC 
AT AGACTAAA CC^GAAAACAGAACATTGGGAAGTCTTCGGAATATTCTC 
7 A JH™HI7^^ ATOACTAA <*<^^ 

A °9?IS^raTTATCAAAAAAATCAACAGTXX^^ 

GGGGTTAAAGGGGGGCCCCAAACATTGGAAACCCCTTCtXAAAGAAAAAT 
TCTCCAAAATTTTCTAAAAGCKKKXiCHXiTT^ 

AAGAGAAAAA WTCTCCTTAATATATT G T G IG I I ICICGOCCCCAAAAAAG 
A T A S!S52£P^ C '"'** T ^'"' 1 '* * A ^^^^^^ AAAACAGGGGGGCCCCGGGGGG 
GGGCTGTCAAAAAAAOCCCCTGTACACCAAAAATTTTTATCTDCTCCTGG 
TCGGG AAAACCGGGGGGGCTGATATATA AT 
>Sequence 258

ttagtcg 1 1 1 1 g agococggtogcogtooggtacacgggccacgtgaccg 
^^^P^^cattgcggcgcccagttcggtccacctgcttgtocgcagaggt 

TCTCATA G AATTTTCTCTTCA CCACTCAATCATA TCTA CTT A CACAA GCA 
GTCAAGCAGTCAACAAAGAAGAAA T 1 1C< 1 1 l l 1 cggagaca aagagata 
TXTCACACAGT ATAGTrTTGCCGGCTGCAG 1 1 ICI ICAGCTCATCCGCTT 

cc^aagcacataaagaagccagactatgtgacg^caggcattgtacctoc 

>Sequence 259 

A TCTTATAT TCGTCTAATAGCTACATTGAGTaSAATCGTATTATGlTCGT 
ATCTli 1 1 iA TTTATG TATTTATTACATGTATCTATCGTATCTGATTACXj 
ATACGATTACGTTTTATCCTA ICIC 1 1 C NTAATGGTGTATGCCCACCGCG 
GTOGCGCCCGGCGGGAGGC 
>Sequence 260

GCTCG TT ATGTCGTTA CTA TCTGTGTCTGCA TCGT ATCGCA TTCTCA TCT 

ATTATTATTCTATTCTCTTGTTATCTG 

>Sequence 261 

TCT ATA T ATC TATCG TTCTA T ATATTAATTATTTA TTCTTTGTACTTG TT 
TATCOAATGACrTTAATATTTCTATCTCTTTAATCTATACATCro 
CTTTATATATAGGTAGCGCGTG 
> Sequence 262 

TTACTOCACACTCTACTCA 1 1 1 ICI 1 CATTTCATTTCTGTAC 1 CO i 1 1 1A 
AT AGTATTTACTTA TTG 1 TCTA TXTTT ATGTT ATCATCATTATATCATATA 
A^TAJTCTCTTTOATT^^ 

CCGGGGCG<KDCGAGGTACCCGATAGAAC^^ 

CGACAJC<iAAAAGATCTGGCACCACrCTT^ , 
CCCCTG^GAGCATCCCACCCTGCTCACXXM 

G^CAA^CGGGAGAAAATGACrcAAATTATXnTTGAGACTTTCAATCTOOC 

ACKXATGTATGTGGCTATCCAGGCGGTOCTGTCTCICT 

GCACAACTGjK^TCGTGCTXWACTC^^ 

CCCATCTATTAGGGCTATtSCXrTTGCCCCATCCCA 

GGCTGGCCGAGATCTCACTGACTACCTC^ 

GCTATTOCTTOGTTACTACTGCTGAGCCTGAGATTGGTCGGGACATC 
GAAAAACTG luti ATGTAACTCTGGACTItGAAAATGAGATGGCCCCTGC 
CGGA TCCTCATACTCOCTTO AGAAG AGTTACTAGTTC



GCTAGGCGGAAAGAGGTGTTACTCAGATTTCTTGAACITG 
GGTGAC^CGCCAGCCAAGGAGAAGGGATGGTCAGGGACCTGCCCG 



S2 A iT^TTS332«ErJX^^^ D ^ AA TCNC ACGTTTAATAATGCGTNCCCGGC 
CTNNAAAGCCTTCGC 



CCCGGCTACCGCGGGGGTGG AAACC ICI I CAGCANNNGCTMGGTTCANNG 
AGCTATNANACAAhW^ACCGGGACCCCAGCTTTTCAGAACTGCAGGGTAA 
CAGCCATCATGAGTCA<XSTCACCAAGAATTCCCTGGAGAAAATC<^^ 
CAGCTCA^TGC^TTTCACCTCGAA 

TTTTTATCGAGTGGATAGAGNGCXJCAACCAG<JTTCAATTTTTAAAC^ 

AGTTCAAAGCTGGCCATGTACCT 

> Sequence 266 

CCGATlXK5A<K^CCCCOCG<nX5G<X^^ 

TOGOCAAOA1CC1 IGI 1 GG AGTCCTCCTCTTTTGCTCCCCCACTCAGAGG 

TCAJTCTCACTGCXXrrTACCTTCT^ 

CCTGCCCGGGCGGNCGNTCNGAGCCGC^ 

CTTTTTCATTTGGGCATCACAAAGACGAGTCTTCTGAT^ 

ATATG1 1 i ATATGAAAGTCAGAAGTTTAGOGAAAATTCGGCCTAAACAGT 

AA T A ^ TCAAAA raAATGCAAATCAAAGTI^ 

JGGGCACGGGGGCTCACGCCTCTAATCCAGCACTTTGGGAGGC 

OCCGGATCACGAGGTC^ 

££S££I^ A ^ A ^ A ^ TACAAA ^^ 

^^^^^^^G^^T^CCACKH'ACTCAAGAiXXrrTGAGCAAGAAAATOOCGT 

? A j*S*?^<'£ A ^ 

ACCTTGGGCGACTGACAAGGCCTTTGCGCAAAAAAAAAAAAT 
>Sequence 267



AGGTACTTTACCTCATTTtXTAOCAATCATTTTAAG A GAATTTGGTTGTA 
TTTCAAAGAACAAAACAACACAAl I ICI G ICXJI O CTGTTTATTTTAGOGT 

JAACCJCT^TCAC^CGCaXACAAAGGACAGTAGC^ 
TGCrTTGCC^GCAATTGACTCC^ 

' A ^ A ^^^^y^^^C^TCAAAAGGATCTAAAGCAGGATCATAGTTTC^ 
SA^TCTCTGTA^^^ 

CCOGCTCAGCATGTTCACCTTCA TCTCGGTTGCTCTTC 



^J A 5 A ^ A I A IS AAAGTO ^ cin ^ 

CTGAACTA1 1 1 ICIACTTACTrJGACTGAATAAGCCACTCAGATCGTGAC 
TGCAAGTXXJ AAGACXTTTCTCCK^CTGOGACCACT AAAACTGTAACTCCAA 
T'^ti^I^ ^^^^^^ZII^^^^^^ CSTATTCTTA TAA^TXQ<XTXJTX3C1ACT'CA C3CA 
AGCCATGGJC1 1 JIC1GAACCCAGAAGOTGTCAATGACAAAATATAATAC 
TAGAATGATAACTGTGATGGCAGGCATX^ACAGACCTTTCA^ 

ATAAATTAAACACTGT AAAG AGTTATAAC 1 1 G C 1 1 GA TAGTATTGAATTT 
CTCT GAGAAAT TAC 1 1 C 1 1 ICI I GCACCTTATAACTTGACATTCTCAfi AT 
TTAATl 1 1 1 1GCI IAAGOCNCN0CGC00GG 
>Sequence 269



AGGTAOGCGGGATAGTGGAGGCACTGAAAGACCAGCAGACGCATAAGGT^ 
CGGGAAG A GGTTGTTACCGTGGGCAACTCTGTCAAOGAAGGCT 
acctcgagcggccgcccgggcaggtacaoatgcacaggagoccatagggt
ttagocaaaggggagcacaaaagttgaagatgaggcgctgccaccaatgc 
tgggacttcagg<x^gggg<^ggagctgaogaagccacaagggac^^ 

1 1 « t iwCAGTTGCTGAACCAGTAGCAACCAGGTCXrrG AGAAAGCCCTCTC 

ttotcgaagaataacagccxggaggaaaagcttttcattctocaa 

GGGCAGAAA GI ICI ICII »OAATCCCOCGTACCT 
>Sequence 270

NGCGATAGGAGCACTOCGCGGNGGCGGCTGCA GAG ACGCTTTCGGC 
>Sequence 271

GCGCTAGNGOIACCCGCGCNGGCGGCTGGCAGTTTMTCGACGACAGCOGG 
G AGGCGNNAGCG AA GG AAGAGACCTTCNGAGNCNG AATAAACTCNAGCGC 
CCCCCACGNACCN 



TTtKSAGCTXXIACCGCKNGGTGGCGGCCGAAGTCCCACAGTTAGCTGCAGC 

A * A ACGCAGGCTGCCTCAGGC AAAGGAGCCTGCGTTGATTA AC1TGTGTG 

TCAATGTCCCACOCGTCCCAGGTAACATTTTGCCXXCTGAGGTCCGGGGT 

>^TTTAATGGCTGCTGGACAAAACCrCCAAAGTTCTGAAAGATCAGAAAT 

GATAOCTAOCTGGAGTOCAGCTCTACGGCACTTGGCXfTAAAGCCGCTTCC 

CTCAAGAGTAACTACAATCTTCCCATCCACAAGATGATTAATACAGATCT 

TAGCAGAA TCTTG AAAAGCCCAGAGATCCAAAGAGCCCTTCGAGCACCAC 

GCAAGA AGATC CATCGCAGAGTCCTAAAGAAGAAaXACTGAAAAACTTG 

AGAATCATGTTGAAGCTAAACCCATATXKIAAAGACCATGCGCCGGAACAC 

CATTCTTCGCCAGGCCAGGAATCACAAGCTCCGGGTOGATAAGG<!^GCTG 

CTGCAGCACCXKXTACTACAAGCCAAATCAGATGAAAAGGCGGCXKiTTGCA 

GGCAAGAAGCCTGTGGTAGGTATAAAGGCAAGAAGGCTGCTTGTGGTCTT 

aacaagcaaaagaagcctctggtgggaaaaaaggcagcagctaccaaaaa 

ACCAGCCCCTGAAAAAAG<XTOCAGAGAAAAACCTACTAC 



GCGGATTAGGAGCTACTACCCCGCGM4GGCGGCTTANANGACCTCT ACNG 

GCTTCGAGCCCGCGNCCCAC^O*GGGCGAANGANTTTTNG^ 

G<XfAGGCACCTCCCTCKXCG 



GCOATTGGAGCTCCCCGCGGtCGCGGOCOAGGTACCGCOTCGATGCTATG 
CGCTCAGTTCrAGTCAGAATAATCTTGCTC^ 

CCACCAAGGCAGA ATTC AAGCCCTCATCTGCCAAAACTACCACCAAAGAC 
TTACAAACGGGAGCTTTCGCACCCCCCATTOTACCCGGGGGAGGAGCCT^ 

agcaagagggcggcgacggtggtggtgactcagcggagcccggtgacagg 

atgttggtgttggtattaggagatctgcacatcccacac^ 

tttgccagctaaattcanaaaactcctcgt^^ 

ttctctgcacaggaaacctttgcaccaa^ 

actctggctggtgatgttaatattgtgagag 

>Sequence 275

accttattcccattccttggtacacataa 

caggctccgcggaaccgactacaacgtcat^ 

ttttatccccccaata^gtggaaaactcac^ 

aaaatgcccacccaaaacccccatgaacccttaaaaaaaaggccccccca 
git ticca 

>Sequence 276

AGGTACG 1 1 u i ATTCCTGCTOCTATTAGGT CCn CTCACOQCACOOOCCC 
TCGGi^TTAO G<XTC TOCA OI I C I C<1 GGGGACCTTCTAGCCTCGCCC 
CACGOGOGTCCATCrrTAiX^ 

ACTATCCCTCCACTCACrcCCCC TGGTTCT ACCCCGGCTCCAAGAGCCTC 

TCCCGGWtNCCACTAATTTATTCCCAAATTCT 

NCCCTCCTCCGCGTACCCTGCCTCGO 

>Sequence 277

a< ^^ a OGCGG7IGGAGCGGGCCCTACCGTGTGCGCAGAAAGTCGAGGCGCT 
TOOL I IC AGCI I G l lXKiAAATaX GAAGAT GGCCAAAGACAACTC^GCTO
TTCQOracnOCAGGGOCTGCTSAI 1 1 1 lOOAAATGTCATTAl'IOGTTGT 
TGCGGC AT TCOOCTACTOOGQAGGTGCA FTC1 IC I I ICTATCI IC AOCAA 
CACACGOCTCTAOOCAC 1 I OC 1 1 GAAGCCACCGACAACCATGACATCTAT 
. OCCCCroOCroCATOCOCATATTTGGOGOCATCTOCC TCTTC TOO CTCTC 

tcttctacgcattgcaagcxatcatcaacttcagcaggaaaaa nci ICI 

CGCGTATTTCATTTTGAACTTTATAGT ATTATGCCCTTTGAAAGTGGCAT 
TTTTGTATTACAOOATTCACCCCCACCCAOAC I J n I icaactccaactt 
1 1 1 1 C 1 GA AACAAATCCTAGAAAAGCGAOCTGOCCCGGGOGGCTOGGTTC 
TACAAATAAGCGGAATOCOCCTCGCCTGGAGGAATTTCAATTTCAAGCCT 
T7TTAATCOOOGCTCAOCTTCTGGGOGGTGGCCCGCG<rrcCCACAATTTT 
I 1 UI I C CICI 1 1 A AAGCACGGCGTAAATTGOCOOOGC 1 1 roOOCGAAAAC 
ATGACTTATACGG 1 PCI I VC I CTOTCGGTAAAATTCTGATTTCACI FT I A 
AATCTTTCCQACATGACATATTCAAGOGACTXXXXXiO^ 

> Sequence 278 

GCGTTTGGACarTCCeCGaX5TGGOC^ 

ATAAACOCCTGCACCTC1 1 'I CATTOATCCTTGCAGTTCATOCA TCAOCAA 
CTOCATACAGTCAAAGA CI 1 rOCTCTGG T ICTCTA AT A1 1 1 I CTUO TAGT 
CAGGI \ 1 I UTAT TA>GAACTTC ATTC TGAGAAGACCCAAGATATGTCATA 
OG 1 1 CCACTTTQAOCTCAGT AATTTTGGCCTCAOTTOATOC1 CTOQACAA 
TA TCTCT1 1 A GOC flX I OCT CCTACTGAGCCAACAGCTOATOOCAAGTCT 
GA CGTTCT AAAGAAAACTTTGTT A TGT ATTCCTTCA TCTCAGCCACAGAT 
CXTTTCX^AAGAAAuAATCTGATGC 1 1 1 1 CCATTTGAATCTTCAAAACATTT 
TTCTAGACTTCCATCACn IC CAGTOOglCTOCAMA ATgl I ICA A TTC1 I 
CAGAAAGAGAAGATGCTTTGGCTCTAuAAACTTTCAAGACTGAAGCCCTTA 
GTGTOCCTTAGOAAAGGTTCAAGTTTCTOAATAGAGAACTXJQAAACTCGG 
AAGCAGGAGACACOCAAGCCG1 1IGC1 1 rCTO CTAAATOGACACTGATAO 
ACOOO CTCAGCrCTC TOATO 

ATACT 

>Sequence 279

GCCTTAGGAGCACX^CGCGGTGGCCTCCaAG43TACTACTCn«CACTGTTC 

men rci a ataaaactttooctgtcg aaoctatactao i c i ictotaa 

ATicnci I ACTACCCTATGACCOGTQAGOCAAOCAC1TTCCGATGCCAG 
GGTTCTOACAOCTCACCroOCATAATATAAA G nOTTTTI 
CTTOCACTTGGAAAGACTi 

AAATTTTGAACCAAGGGAAATCATTTAAGAAGTGTCTXjGTATTTTTCAAA 

TTTCIXnCAGTTCT TACAT ITGTCATAAGTAAATGTrrAGGAATAAAGGA 

TGG^GACATGCTTATTTTATTTAACTOXXCAAAATTNAAAAAAAAAAAA 

AAAAAAAAAAAAAAOTAOCTGGOOGGGOGGOOCSCTCGA 

>Sequence 280

TOCCKJTOACTCCCGCCKTrCXXXKKXXKlAGTG^ 

G ATCTOCA ATOATTCTICOCITCXMOGTCAOOCCATTATCTTTAATCCTG 
ACTTTTTTGTGGAGAAACroOGACATGAGAAAOCTCAGATTTTCACTQAG 
IT0GTGO1 CAGCAAT ATCACAAjGGCTCA TCGATTTACCTGGAACTGAGTT 
GGCTCACCTOATGGGGGAAGTGGACCTTAAG I T GC CTGGOGGOGCTOGOC 
4^AGCATCAOOATTCTTD000TCTCTCATO 1 CICTCAAOOGAAAGGAAAAA 
GGAOTQATACTIOGGTOOOCACTGAOGGAOGAAGOCATTOOOCAGATATA . 
OCAACTCATTOAOTATCTACACAAAAACTTQOGAGTAGAGGOl I 1GI I i X T 
GAGTAOCT 

TATGTGGTCAOOGGGGTGGOGAGGTAGGOGGGGGGAGACATGTGGAGTCC 
CAOCAGAOOCCAAOC r G T G I C I C I 1 C ATCTOOCT G GGAAGGGTGOOOOOG 
AAGTGAAAGAaATGOCCTGGTGtjAAAGXXrTGGGAGAATGAATAAACAGAC 
TAGGTTGAATCCATACAATGGAATGGTAGCAGACAATAAAAAGAAAATGA 



TTTATACC 



e92 



^CTATTG^TGOCOCXTTACTGCACAGCAGAAGCTCTGAATOSTCTTOCTQA 
ATGAAAGAAGTCAGAGATGAAAAGATGGGOCAGGAGTCCA G I 1 ICTO GAA 
. GGOCAAGAATOGAAGTAGCAAGCTGCAGOOGTTTTOCAGACAAGCATGAT 
GTGGGGATGCAGAAGAATTCAGXjACTGGAGGG43CAAACTCCGATGTGACT 
GAGXjOCOCACTGCCAAATGGOGG4^TGCTCAGATAGCACOCAAGAATTTC 
GGGAAAAAACT GG TGCTCACAGCTGOCCAGTTAAGC 

ATTATATTTCTACTGCTCAGTATAACt3TACTGAACGACAGGTGTACCACG 
TCTGCATCTCn TI CGTXjGGCTAATCGTCTCGACGCGTAGGGAAOGTATA 

cgagactatagttttctttctt atctacttct atttctacact atatata 

tttatcontcttgcggatcgactcacxxkxkict 

a octa tcttcxactcctcagca gtgctcgta gt acg a ctcg atgt atgtc 

aggcacgagacagaccc nn i ola cttgtca tgttgta ttgccacttoog 

cgogagoa tattctgataggatgog t ctctc ic fc agatcaacacggtag 

gcaacg ti ltt i gcgctggtacctt i iccacxttttccctl ttcccattct 

GGCATTAACACOGG1 1 CCACCCAACCCTGGCACTTA A GGGCTTGTG A GAC 
TT CAAC OCCAACLI r O C AGGOCTOOOCATTGGGG T OC TO Cl IGOCACCTT 
CATTTGGGTrCGTGGGATAOCAGAGTTGGAACAAGGXXJGCOCAGGAATCA 
AAGC CTOI I ULI 1 1 1 C AACGOOCACTCAATTGGGCTCAAGGGGAATGTCT 
G I CCC I U^ GTAAOOOOGTTTXCCAAAOOCCAACAAOOAAAAAAATCTTC 
CAAGCCTTTGAAGCTGG aagtggccacttgta tgcct aaggcttogaaaa 

AGCCACAT AAA AGGGG A GGGGGCCT A GGAACCA CCGCAAAAAAGGTTTTG 

GATGGCCAAGAAAAAGAGGGAAAGC<XXjCTYX^GTGGAATATAACCCTCT 

GGGO GCAA-ITCT W TI I IUJ A AI 1 1 1 IClt^ TITGGCCTTGGCCCATTAAA 

TTTCCA GGGGCG AAGG ATTTA ACCTCTGGGT A AAA GGGTGTGG NGNKNGG 

GGGCCAAGNAACCAACCTTTATTGGACACCCTGGTGGAAAAGAGAAGCCC 

TCTATTAAGAAAAA | 1 1 UUCXAAAAATTGGOGAAN 

> Sequence 283

A GTTGTGACAOG ATT A T A TTG AATGTTGTCTTCAACG A T A TA ATTTACTT 
CATCAATATTCTAATAATrACATGCTAATATGATATTTATATAATAAATA 
TAGCTAATGAATAACGTACTTGTCTA 1 1 I fCTCl AG AG AGCTATCGGGAG 
GCGGTCGAOTACAGCATTGGAAATGGA FCTOTCTTI O GTAAAOATCAOOC 
TATAAnCl I U I G CTGTrOOATATCACOOOCATGATGG OT OTOCT OO AOO 
GTGTOCTAATGOAACrDCAAGACTGTGTOCI lO OOCTOCTOAAAGAATGC 
AT^OG^^AOCAAATAAAGAAGAOGTTGOOCTTCAAAAG^CCTGGATGTGGC 
CAI PCI I ti 1 GGGCTTOCATGCCAAGAAGGGAAGGCATGGAGAGAAAAGAT 
TTACTGAAAGGAAATGTGAAAATCTTCAAAAT00CAGGGTG4^TGGCTTA 
GATAAATACGCCAAGAAOTCAGTTAAOGTTA 11 G II G I GGGTTAATCCAG 
CCCATAOCAACTGCCTGACTGCTTCCAAGTCAGXrrrCATOCATOOOCAAG 

gagaactttag i roci roACTrom ioo atcacaacooagctaaagctca 

AATTGCIC1 1 AAACTTGGTGTGA CTGCTAATG AAGTA 
> Sequence 284

TC ACAT CTCA I IC 1 1 G IUATTATCTACATTCTTTACACTTOGTATCATCA 
CTCTTTACATATATTACCGAATGTGATATCAATGTACT ACA TAOTTCCTT 
CATATATATATAATTTTTCATAATTTAGAGTGACTCCOGTOGCGGCOGOC 
OGOCCAGGTAOGCGGGGOCTCTAAGCTGCAGCAAGAGAAA C IGT O TC (OA 
GGGGAAGAGGOLIG1 1 ro OCTOTD GGGTCTCTA OT TCI IGCACGCTCTTT 
AAOAOTCTOCACroOAGGAAJCTOCTOCCATTAOCAGO ClOCTTTCl IO OC 
AAAOOOAOOOOGAAACATACATTrATTCATOOCAOTCTO 1 1 OCATOCAGQ 
CTTTATGGCITOCTAOCTTOCAACAAAATAATTGCAGCAACTOCTTAGTG 
COGATTCOGCOOOCAGAGAGAOCTGGAGCCACAGAGCTTITTTGCTITGC 
ATTGTAGXIAGAGGGACTAAGTGCTAGAGACTATGTOOG C 1 1 ICCIG AOCT 
A CCGAGAGCtXXXXTTG AACTG<fc\ATCAACTGCTTCAGAAG ATGT ACCCTA 
AGGCAAACAGGG1 ICC'l I GGOOGGTTAAAACTAGGGGATOOOOOGGCTTG 
CAOOAATTCTATATCAACTTATCG 
> Sequence 285
c o i o 1 1 oogggtogcoooogaggtactaogtdocaaa rem CA ACCCAT

TTTACCCTATGTTTTCAAGGGTATTATAGAAGGGGAGAGGTATOCTGTAO 
TGATGTOCACGT ATtTTTtXlAGTTATGOGTOGAGTTCTACTACAAAACACT 
A OI 1 1 1 1 1 IC1 rCACTTACT AAATGAOATGCCOCATAAATTTAATCAGGA 
GATGG^CCAGCTTTTGGGAAATATGATTGAAATGTGGGTTGAT^^ 
ACAACATTAGOCAGCCTGAAAGAAGAAAACITrCAGCTTTGGCTTTGCTC 
I CTCTTC 1 O OCATCTGATAATAGTGTTATCCAAGATAAATTCT G TGGGAT 

tataaacattttaagtagaaggcctgcatgatgtcatgagggaaagatcc 

TGAAACA GGAACTTA TAAA G A CTGTATGTTG A TXjGGTCATCTTGAGG aac 

caaaagtaacagaacatoaagaacxacccacagaacaagataagagg 

> Sequence 286

GTOCTACAOCACTGGATTACTATGAATTATACTTTAATOCTAGATTTTTC 
T G I 1 1 lOATTCTCAATAGATCATGTCTCTCAGTTGATTTGAAATATCAAT 
A T A T A TGT A TTT ACT AT A TO TTG T A TATA TN A TNT ANTAG A GAGA CGCGG , . 
GTTJMjGGGCCGAGGTaGCCGATAGAACATGGCATCATCAGCAACTGGGAOG 
ACATOGAAAAOATCTGGCACCACICI 1 1 C I A CAATG AGtrTTOGTGTTGCC 

CCACCOGGGAGGAAAATGAACTTCAAATTAA I G 1 1 1 IG AAGAACTTTCA A 
ATGTOO OCAOCOCATOGTATGGTOGCCTA TTJD CAG GOOGGTTGOCT GTCC 
TGCTCTAATGGCCTCTGGACCGCACCAAACTGGCCATCTGTGCTTGGGAC 
CTCTTGGAAGAATGGGTOGTCACCCCAACAAATGG 1CCXXCATTCTATTG 
AAGGGGGCTAI rO T DCTl IC C T DOOCATOOCOCATTCAATGGOG G1 ICTG 
GGGATOCTGGGGCTGGGCXX»AAGAATCTTCAACT<KjA^ 
QAAAAOATTCC1 1 GA CICTAAGCGTGGGOCTATTTC OC 1 1 1 O GGATAACT 
AACC | GC I GGAAGOGGTGAAGAATTGGTOOOGGGAACATTCAAAGGGAGA 
AAAOCTGGGGTTTAATGGTAAGCTTC1XXXIACTTTTOAAAAAAG 
> Sequence 287

gatgtoaoctooooocootggoggocgaaaactcatcagac tc tct CA GA 
tcaaggaaaagatggccagagagaagctggaagaaatagattgggtgaca 
ttt gogo ttatattoaagaaggttacgocacaoagtgtgaatagtgqaaa 
aa octtcagcat a tgg aaa ctgaa tga ici icg i gacctgacacaatgtg 

IGICCI1GIICI1 ATTTGGAGAAGTTCACATAGCGCirTGGAAGACGGAT 
CAOGGGACTGTXXn'ATGGATOCrcAATGOCAACGCCATGAAGGOCAAGGA 
TGGTTCAOAOGAGGTGTGTl I A TCTATOGATCATOCTCAGAAGGTCTTAA 
TT A TGGGTGAA GCTCTTGADCTGGG AACCTGT AA> GCCAA G AA G AAG A A T 
GGAGAGOCGTCCAOGCAGACTTTITjAATTTGCGTGACTGTGAGTACCT 



GTGATGAO 



-ax 



7X9 



1TACTC3CAGGGCACTGGOGG 



gagatgct a tgaggtggoggocgatgaccgtca t t g i c atggacagactg 
gctcagtgaagacatttactttgatqggaocagatagaatoogataattt 
ttctc ataaoctgagaggagttatoocaogaa o 1 1 1 ps a a1 nttctttt 

tgtaagtgttoctttattgaaatctataaggagcagatatatgatctact 
ooacictocatoggci og actgtactigooooggkatttit1aaaaatggo 

GOAOCATTAAAAGCaTAAAAGOCATTTCGGOOCTCGGGGACAATOATTTA 
TAC r T TDCAOGA TTTAOCATClCrAGOCCAOOCTTA 

' CTACTATACAGOGGGGTGGGCAGOCTAAAG4jATGGTATTATTTGApOGAA 
ATTGGG43GGCTGCATO0CATATTOATCTTCAATCTATTOG 



ATCTATACAATACATATTA TAAA TAAATGGTGTATA TAT ATTGTTATTAT 
AAGATATTATAA 11111111 GATAATCTAATTGATAGAGTTATCAAAAAT 
ATATATCTTAATTTATTTAATCTATACTATTATATTAAAGATAC I CCGGG 
TGGOGGCOGOCOGGGCAGGTAiXrrCGGGGClTOOGTAGGAAOCCICATCrc 
CCT AACT AGCTGCTTACACAAAGAA C I <JL I PG ACAACTTQAAOCI IPC CA 
GGGAACTAAAAAOOCATTOAAAAGAAGOCTCTCAATC CCI nun 1GCA
CTA rCTCTI U GCCCTOACCTTTAOACGGGATATGATAAACCTA rCTOGTC 

tcttaatogaaaatcaaccaatogaaaaaogocattcooaaaa rTcrm 

AATACAAAAOCTATAATTTOOO CI 1 1 1 C 1 A TTTAAAAOCAGGGAAOGAAA 
TATGTCAAAAAATCCCCC 1 1 1 1 1 1 A TTACTOOOOCTCTACAATOCAAAAT 
OGATGGGGGAAGA TCTCI I rA AAODGI IC T C AAAAAAAAGTAGGGTGATC 
AAAATAAGAAAACCTGTCATTCTAAAAGGC fCl I IGOICUXA AOCAAAT 
GTCTTTAAAAT GATOGA GTAACCTCTCC C 1 1 I UI I AGATTTATACTATTT 
TCAGA AAGAT ATTTTTTtnTOGAAAAAAOOCOTCTTAAOOCAODCACAAA 
TTXKJGGTTTTATATATTGGGGAAATAAAOCAAAAATGGGCTTGGAOCTAT 
ATTATAAATTCCGGA 1 1 1 P0C1 1 1 ICTA AAGGGGAAAAAAGOOOOOCTPC 
CA TG AGGGTGGCACACXXACAA T A TTTA TATA CA TCCTTG A GA GGGGGA A 
I 1 1 I 1 1 1 1 1 1 I ATTATTTA 1 1 1 1 IGA GGA 
TAATAATi 
TGTTTAAATCATCTATATA TTATAAAT 



(Ml 



TGAGACT SACFDGGGG*PSGOGGOCGOOGGGGCAGGTA C 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 rGGGGGAGTTAAATAAAATAAGCATGTCTGCATOCTTTATTCC 
TAAACATTTACTTATGACAAATGTAACAACTGACAGAAATTTGAAAAATA 
CCAGAC^CTTCTTAAATGA I 1 TtXX-T IG GGTCAAAATTTACCC t-T ltnTG 

1 1 1 rcrci roa 1 1 ic aggtaattaa ctci men nTAcn iua acta 

TGCAGTGCAAGA TTCC TCTG TAG ICI I HXAAGTGGAAGGGTATAAAAAA 
AA ACACTTTATA TTA TGCCAGGTGAGGTGTCA G AACCCTGGCATCGG AAA 
GTGG1 1 GG CTCACGGGTCATA G430T AOTAAOAAGAATTTTACAGAAOACA 
GTCTAGGTTXXjAAAAAGAAAGTTITATTTGAAAGAAAGAACCGTGCCAAA 
ATAGTTCTCATICCAGAATGGGAGAAAGGGCTTCTCATAGATCATTCG 
> Sequence 293 

TTGATGCTIXXX>GTGGCGGCCGAGGTAC » » I I I 1 1 1 1 1 1 1 1 I I 1 1 1 I H I 
TTTTTGCTTGTTTTT A TC 1 1 1 1 GGCt_ 1 1 1 1 GGTGA CTTGGTGCTCCTTGG 
AGTCACTOGAGTTCTACTTTGAATCOCACTCTCACATCAATCGACTGOCT 
TAATTCC I G G I CCAGCTCCCXX^CCCTGACTC TC I COCXjCTC 
> Sequence 293

GAAGTG*jCTCQXXrrOGCGGCCG<XXXXXjCA GGACGCGGGG ACA TTCC AG 
TGGGGATTAAGAGAAGGAAGGCTOO C n GC T CG AGCTOT CTOG TCTTCT C 
CAAOTOA OAOTOG CAGGCAATAOAACTA CTfTGCTT r iG GAGGAAAAOCA 
GGAATTCATTTTTAGCAAACACAAGAAAAGCA G 1 11 1 1 1 1 I C AGGTGCTG 
AGGGCCAGOCAGCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCA 
CGTTCn CAAGGCAGAATAATTGCAGAAAATCTTCAAAGGAOGCTATCTG 
CAGA I o 11 C 1 GAATaCCTCTGAGAATAGAGATTGATTATT CAAOCAG GAT 

TTGCAACATTOGACCAAATACAATXjAAGTAI ICI Hi CTCTGCTCT OGn i 
TGG C T G ro C TGG GCACAGAATTOCTOOGAAOO CTCrG I lOG ACTGTCAAA 
TCCCCGAGGTTCAGAGGACGGAT 
> Sequence 294

l^imC^A CGT<SA45 S?f^ 

GCTOCAGC CATOCroGGCTAG C TTGGCAATAGATGAATG OCA IP 

ATGTOCAOCATAOCAOC1GAGA G1CIUIU SAATCTAA I 1 1 FCTG1 1 A T AC '■ 
I ICI I roCTTACACTCAl 1 1 1UL1G IC ITTATTATOAT AGTCTAAC 1 1 1 1 
TCTOCTCAAAGGTATAGCTGOCTTOC1 1 ICATQAAAACACAC1 1 IU.IAT 
TCTG ATTTATCAGAGGCC1T TOCA TATCTCAGOCACTATGCTATGACAGA 
TTTTATAA TTAATA AGTCCATTTCAAAGTGAAAAOGTTACAAACATCCTT 
AWCAGA I UI1II fATAACATGAAATATTCTGC IGOGTTAAGAACAAAATG 
CTGaCTTACI IGIG 
TGAGATOACTACOGOGTCOCGGOCCO^kOAGCAAOOGAOATOAAOCnXjAA
GAit-^JiGA GCC GGAATCCGGACAA i rATGTCCOCCAAACCAAOl FOGACj 
TACAGAGAGTTCCAACAAACTATGATCL 1 G C I II ACA ILL1 1 1 1 G AOOTC 
CCACGAGAA TATATAAG A GCTTT AAA TCCT A CCAAACTOG AA CCAGT ATT 
TXJCAAAACCATTOCTTQCri CCC1 C3GATGGTCAO0GTG ATGGAOTCAATT 
GCTTGGCAAAGCATCCAGAGAAGCTGGCTAC1 G 1 CC 1 1 ICt O OGOCOTGT 
OATQGTAGAGOTrAOAATTTGOAATCTAACTCAGOGGAATTGTATOOGAC 
TCT 

> Sequence 296

TtnX»ACTGGACCtXKrrGTG<HXMXXGCXXGGGCAGGTA 
TTGTGAGTAGACTATCCAAAGAAAAAGTGGGCCAOCATATCTGGAAACTA 
CAGTCT ATGCTTTG AAGCGCA A AAGGG AATA AACATTTAAAG ACTCCCOC 
GGGG ACCTGG AGO ATCG ACTTttCCATGGTGGCCGGAGCAGCAGCTT ACA 
ATGAATAATCAG AG ACTGGTGCTCTTGG AGAAA A CTATAGTTGGCAAATT 
CCCATTAAOCACAATCACTTCAAAATTTTAAAAAATAATGAGCGTCAGCT 

gtctgaagtcctccagaataagtrtcgctgtatctctac^ 
cagttcaggaagkxiaacagcaaatctctgccagtgttaacaaaaatgcto 
actoctatgaaagaattatoagtgtggaaagatgaactcaccacacacgc 
tagatoaggctgtggagaatccotccatoaaaaaat 

GGCTTAOCTTGGCACTCCTAGAAT 



TGCG A TCTC GCGCGGGTGGCGGCCGOCCGGGCA GGTA CGCGGGGGG A GGG 
CTCCtjAAGTCTGGTTTTGGGCGGG A A TTG A A A CCGCCGCTGAA GCCAA CA 
A **^7TTCAGAACTCTAAATACCAAGCCTrcAAAGGGACCATGGTOCGGC 

A J|^^ATTrCCCCGGOCTGAG^ 

GGGGGTTTTACG<XCCCOGGGGAAATAAAATAGGGGTAA CI lllllim 
GCGGGAAAAAATTTATTCCCCCCAAAAAAAAAAACGCCC 1 1 I I 1 1 C CCGG 
GGGG AAAA AAA T AOGA TCCCCCGG GGGGGGGGGG AA I 1 I II I I t ATT ATT 
Mill 1 1 J'ICCCCTTCCCCTCCGGGGGCGGGGGGGC 
> Sequence 298

ACCACACACTTCCATCTCATTATATCATCTGATTGTAATC^ 
TATTACTTATTTATAAAT AGTA TCGATATA C III III AAATGAA TTATXjT 
TTTATAGTAATTTAAG I U J 1 1 1 ATTTACATTCTTAAGCGTTOACTCACGG 
GTX3GCGGCCGAGGTACTCCCCAGCAAATATTC 1 1 ICI 1G GCTTCCTTOAC 

TAGATGAGCTGCT atagtagtcaatcctgttagacttggaccattgtttg 

TCTGAAGAACTGGAATCTGTCCCT«XXXrrGAGCAC 
> Sequence 299

U 1 1 <^TCCATATCTCTATAATTACTA I I ICC 1 CACATTTAACGATTTCT 
^^^ ^^"^^ ^"^^^^ATACrCATATTTATATCrATCTATCTTACTTTTAT 
CTAATTT T A TATAT TATATCGTTAGCTXXXIGGTGGCGGCCGAGGTACTTC 
TJGTCTTCCAG 1 1 1 1 CXIACTTCAAAinTCTATCTTCTCCAAA 1 1M 1 1 C AT 
CCTACCACTCCCAATTAA 1 C 1 1 1 CCA 1 1 1 ICO 1 C 1 OCGTTTAGTAAATGC 
GTTAACTAGGCTTTAAATCACGCAATTCT^ 

Tc i 1 1 lA ATXACCTTCGO TTTAA ICICI 1 1 1 1 A AAAOATCOICI 1CAAAT 
TATTTT AATCACCTAGAACTTTTAAACTAAACITrAAOCTOTTTAAOTCA 
CCTTCATTTTAATCTAAAAGCATTCCC^^ 

CTCTOTAGTOLI I IC1CICAAI 1 1 ICI 1 1 I AAATACATTTTTTACTCCAT 
GAAGAAGCTTt^TCTCAACCTCCGTC^ 

TCCI ICC ILATGCTA C I CI 1 1 1 1 A ATCTTCATA I I I II IC 1 1 A AAATCTT 
AAGCTATTA 



GOGATG TCCXKKjGGCGGCAGGTACTTAAGGTTGACTGGTAATCAGGGTAA 
CTTCTGATACTT ATCACACAAGATGGTGOCTCACK1ATTTAAATAAATGGA 
GGTAGGGGAGGGOGTGGTGGTA aCA TACTTTTA A A OC AfJCfS A TTGC A C*G
CAAA CCACAATOCAAGT ATTTCTGACTCCCA A GATTGCCGTTTCCTAAAG 
AGCAATTCTTCTGCAGGCAACAGCAAACCTA CC 1 1 I CC I ( G CTAACTGCT 
TTCAGTAAATTCTTGAT<KjCCTTXXj ATTCTCGATTCAGACA1C I C I ICIC 

ACCCI ICI 1 1 1 1 cattctagcaatgatctcaacaogtgg acaaaattggc 

TTGCAGGAATAATTTCAAC. 1 1 1 1 ICI A AAAA CCTTGG ATTAACAGG TGGA 

GCTAATGGTClTGACATGGAAC<XXXXCTTAAATTrcGCATTTT 
> Sequence 301 

GCGACTGTCCTCACCGCGGTGGOGGCCGGAGTGATGCCAT^^ 
GTGATCTGCAATGATTCTTCCCTTra 

TGACTTTTTTGTGG AGA A ACTCCGACATG AG A AACCTCAG ATTTTCACTG 
AGTTGGTGGTCAGCAAT A TCACAAGGCTCATCGATTTACCTGG AACTGAG 
TTGGCTCAGCrGATGGGOG AAGTGGA CCTTAAGTTGCCTGGCGCGGCTGG 
<XXrAGCATCAGGATTCTTCCGGTCTCTCATCTCTCTCAAGCGAAAGGAAA 
AAGGAGTGATATTTGGGTCCCCACTGACGGAGGAAGGCATTGCCCAGATA 
TACCAACTOATTGAGTATCTACACAAAAACriTXJGAGTAGAGGGTTTCTT 
TAGAGTACCT 
> Sequence 302

GGGATTtKjAGCACCACGGNGGCGGTTGGGGACCACGGAACTGCATGNTCA 
GGACCCACAGGAGCGACCCTGAAAGGACCATTATtCGCACAGAGCTGCAA 
ACAACTATACATGATATAATTTTAGAATGTGTGTACXnGCCCC 
> Sequence 303

GCXKlArrTGGAGCNACTCChKJCGGNNGGCGGCTCGGW 

CCCXANCANGGCGGACCCNNAGAGAAAGGCCCTGNANNGACTAOfTTGAA 

TACNGNNOCCCGAACACAAGGAGANOGA 

> Sequence 304 

TCOCCCGAG C I I I CIC1 I GTCCATCTTCTCCCGCTGCTCAAATTTCAGTT 

GCGGGCGCrGTCACCrcAGGACCCCTCCCCCCGCGTACGCTGGATAGCCT 

CCAGGCCAGAAAGAGAGAGTAGCGCGAGCACAOCTAAGGCCACGGAGCGA 

OACATCTCGGCCCX3AATGCTGTCAGCTTCAGGAATCreCGCGTACCTGCC 

00 

> Sequence 305 

TTCCCGGCAGGTACTCAGGTTTTATCTCTGC^CTC 

GTAAAGAGC AAAG GCTCATGTTTGCCAA O IC Hi I CC I niGIA ACAAAAA 
ACCCAGCAGCTTTATCAAGCAGAATTCCACCTGTATTTCTTAACTTOCCA 
GAGCTG A GTCTCATGGCCA CXXTTTAGCAGGAGTTGGGG AGGTATTTTTAA 
CAAGGCACATTATCATCTCCCCCACCCAAAGTGGAGCTATTGCTAATGAA 
AAAGA TACAATGAGATGTTTATCAAATTATCTGTAGCTATTAATGTCAGG 
TTTTTG AAA TTTACTGACCTGG AAGAATACTCATAATGCAATGTCAAGTG 
AGAAGCAGGACAAAGAACATTTGCAATACAGTrGTATTTATAAAATTTTO 
TTTACACACAA 

> Sequence 306 

GCGATTGGAGCTC«X»CCWTGGCGGCTOGAGTA 
AAAGCTt^GCCCATGTGAGGTGCCTCCTGCCAATCACAGACT 
CIGGTCCTXKSAGOTTCAAAGAATTGCAGGAGGGTAGAAAAGCACCTXKKn* 
CGGGTCCAGACTGCGGA GCGGGCXCTACCG TGTGCGCAGAAAGAGOAGGC 
GCTTCKXHTCAGCrrGTOGaAAATCCCGAAOA 

CTGTTCG rTGClTXXAGGGCCTGCTGATTTTTXXLAAATtnX^TTATTOGT 
TGTTCCGGCATTGOCCTGACrOCGGAGTGCA IL11CIIIO lATCTGACCA 
ACACAGCCTCTACCCACTGGCTTGAAGCCACCGACACGATGACATCTATG 
GGGCTGCCTGGATCGCATATTTGTGGGCATCTGCCTC 
>Sequence 307 

TO AGCCCGGOGCCANA TCA CX^TT ATTXXXXTTTAGTCACCTCAGAGGCT 
TGTTAATGCI 1 ICI 1 ICI AATTAGGCTATATCTGGTATCTGTATAATATC 
TTCAGl ICI ICI 1 1 ACCAGGGGTCTTACTCTOTTCTGAAACATGGCACCT 
CAGGCGGCTCOGQCAGCGCTGGACACAGGAAACTCCT
^^^^^^CCTCTACCOCCTCTTCGGTTAACTCCGC I »GI I I C1 CI A CAAA 
ATGGCGCCGGAGGTCCCCOGCGTACCT 



GCGGTTTCGGAGCNAACCNCGCGCGNGGCGGCCTGGNNGACCANTACKGG 
AAACCAAACGAACGGCNGGCNGCACGANGCNGGOnTANOfttttGCCGCCT 
TCANGCNGAGCAGCCCGAAANC>WGGAACCG<KXNChfNGKNGNGTrCCNN 
GNNGAAG AACGGGCNN ANCCCCAG AG AG AGCCA AA GNN ACCCCGGCCCGC 
NCNAAGAA CAAAG CGGANW3CCCCCGGGCCGGCAGGAACNGCGANAACACA 

ggcccaanci 1 1 icc1 i i i i 1 1 1 1 1 gtxjttgggggggcgcgccgnaccoc 
cag<>iaaaaagaaccacaanaagccgaggggnngaag<kk;ag<^gcncw 

GGCGNAAANGATTGGNCAANAGCNNGCChWCCNGGNGANGAAANW^IGCNA 

CNCCGCGNC^CAANNNCCACACNAACANNACGCAGCCGGGAGCANNAAAG 

NGNAGAAGOCCGGGGGCGGGCCCAAGGAGGGGAGCNNAACNCACANNNAA 

NNNNGCGNG 

> Sequence 309

GOTrTTGGAGCACACXrGCCGNGGCSTTCGAGTACTTACCAAC^TNCNNAN 

ATGGNhWACCTTCTAAAAATGTTACACAGAAGAATAAAGG<^AC^AACCG 

^^'N'^TNAT^AGNGCCAQAAACCTTTTACAAGATGGTAAAAAAAAAA 

ACAGAAAAAAGAAAAAAAAACAAAACCAAAAACAAAAAAACTTTACAACC 

ACAGCTAATGCAATITI I ICX^TTCTTCCCATTTT T f I C CAA ACCTATTC 

GGNGCAAAGCCjCATTTTTTTCCATGCATCTAAATGATAGATACAGGCTAT 

GAAAT ICI I I ATTCTATTTGTAGCAGCTTATGCAGGTGCAGCXAAACACA 

AAGCTTCAGGACAAATTGTACXrnXXCGGOCGCCCGCTCT 

> Sequence 310 

GGCGTTANGNGNCNACT>ICGCGG>0>IGGCGACTCGANGNCNGCATCTAAGC 
ACGCNCACCG^KXMCAAGAGCAGGNGGCCCTAGN^f^rcACNGT>^^TATtJCT 
GCNCCGCGANGCAWAGGCNC>^ACAACC^CNACATtK^GAAGACKXX>C 
GCCCGGCCCCCGGGAAAAAGAGNGCGA 



GCXJCTTTGGAGCNACANCGNCGGNGGCGCGCTXJhWCGNCGGTACTCNGAG 
G A AA A GCN CGCACCA GG KG G A CGCGG A CCG TT A GCAG NGG TTT A GGCACC 
CCAACAAGCCGCGCKKK^AAAGGNCCNCGNATTT 



gcgctcggagctacaccocggtggcgctgccgcgccagactcttggagaa 

ACTA TAGCAGGAAACAATGCCTATTTTXZACAGG AAA CAGAACACATACCC 

agaaaaatgccctggcaatcatcaaatcacagttttccaacatcaataaa 

gtotttaactcctcatttcaaagatcgtgttcctgc 

aatt^tagagaaacitcagtctggaatggaggttatggatxagatt^ 

gatgttlagaatatctgacataa tgga tgtatatg aaa tg aaa ctatccac 

att agctt ccaaagaaagcaggctacaag a ic i i 1 1 o gaaacaaaaactc 

tagcccxixxacaggctgatagacrgattgctcagcatxx^^ 

actcaagc 



AGCXiATTGGAGCTCCCCGCGGTGGCGGCTTCXX^ 
TTAGATTGAGTTATXJTTGCTAGGAGATCTTTATTCATCAGCTX^ 
AGCATATGGGGCTTACTTGG<XXXXXTATCAATTTG<MTCAAAA 
AATTOTAGACC 1 G I C I ICI 1 1 1 ATGAAAAAGCXATXTTCATAGTCTTTAAA 
TTTATCTT TCT AAACAAGA CACAA GTTTACA CATTA CCCAGCACA GTAAC 
CCCTC1 1 OGTATTGTTTACCTAAAAGGAAGAAGTGTAGGAAAAACTGATA 
TAAGTAG AGAGTTTATTTGGGCGAAGCATG AGGGTT ACAACCCAACTCTA 
TGGAGACAAG1 1 G I CC I GAACAATACACATTCTTATTAGCAACAGTTATA 
AGTA GGTTTTCAAA G AAAAAGAA G A 
> Sequence 314

ggcgattggagctccaccgcggtogncggtcgaggtacgcggggggtcct 

GGAGGTTCAAAGAATTGCAGGAGGGTTTAAAGCACXrrX3GGTC 
ACTGCGGAGCGGGCCCTACXXTGTCCGCAGAAAGAGGAGGCGCTCAGGAA
TGCATGAATTCATTAATTAAATGTCG AGAGCTCTAGATGG C 1 1 1 ICICA A 
GGTGCTTT^GTGCAGAAGCCCAAGTCATTGACCCACACAC^ 
TGTTCCTTCCAGAAAATtXTrCAGGGAGTGCCTT 

ATTTTTGG AAATGTGATTA T1 O G I IGI I GCGGCATTGCCCTGACTGCGGA 

GTGCATCT 

> Sequence 315

GCGATTGGAGCTACTXXKjGGTGGCGGOCTCOCGGGCAGGACCCT^ 
TAGATTGAGTTATX3TTGCTA0GAaATCTTTATTUAGTX^OCTC 
AGCATATXXKXKTTTACTTOGCCCCCXrrATCAATTTGCGTCA^ 
AATTGT AG ACCTG T C FIG I I II ATGAAAAAGCAATGTGATAGTCTTTAAA 
TTTATCTTTCTA AACAA G ACACAAGTTTACAC ATTACCCTTTTAGTAACC 
CCTCTTGGTATTXj 1 1 1 ACCTAAAAGGAAGAAGTGTAGGAAAAACTGATAT 
A AGT AG A G AGTTTA TTTGGGCC AAGCATC AGGGTTACAACCCAACTGTAT 
GGAGACAAGTTGTCCTGAACAATACACATTCTTATTAGCAACAGTTATAA 
GTAGGTTTTCA AA G A AAAAG AAGAGGCAGTTCCTAAG 

> Sequence 316

COKKX^GGTACAGAGACCTCCTTACTTAI 

CGCGTAAGGAGTGGTAACAACXH^AGAGTAACGCGGGAATOAAGAATCTTA 

GGCGGGTGCACCCAGTTTOCACCATGATTAAGGGTXTrTTACGGAATAAAG 

GATGATTJTCTTCCTTAGTGTTCCTTGCATTT^^ 

AGACCTTtJTGAAGGTCACTCTGACTTCTGAGGAAGAGGCCCG 

AG AGTGCAG A TACACTTTGGGGG ATCCAAA AGGA GCTGCAATTTTAA AGC 

CTTCTGATG 

>Sequence317 

GCGTCAGGAGCACACCCCCGTOGCGTTCGCOOGGGCAGGTACTCTGCAGA 

CCAGAAAAATCCCCTGGCAATCATCAAATCACAO I 1 1 I CC AACATCAATA 

AAGTGTTTAACTCCTrATTTGAAAGATCGTCTTTXTGGATTC 

AG AATTAATAG A G AAACTTCAGTCTGGAATGGTTTTTAAGGATCAGATTT 

GTG A TGTG AG AAT ATCTtiACATAATGGATGTATATXj AAATCAAACTATCC 

ACATTAGCTTCXIAAAGAAAGCAGGCTACAAGATCTTTTGGAAACAAAAAC 

TCTAGCCCTTGCA C^GGCTGATAGACTC ATTOCTCAGCAt^GCTGTCAAA 

GAACTCAAGCTGAAACAGA 

> Sequence 318

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTT"ATTGATGTTC 
AGATX1AGAAATXTTCCTCAGACTGAAAGTTXK1ACTGACAGTGGAGCAGAAA 
ATGAAGGTAGTTGTCACAGTGATCAGATGAGCAAOGATTTCTXXAATGAT 
GATCGTGTTG ATG AAGG AA TC T G I CI 1 GAAACCAATAGTGOAACTGAAAA 
GATCTCAAAATtjroGACTTGAAAAGAATTCCn G ATCTATGAA C T I T I U T 
CTGTTATCGTTCATTCTGGGAGCXjCTGCTGGTTJGT^ 

ATAAAGTCATTCAGTGATGAGCAGTGGT ACGGGTGGGAATAGCACTACAC 

TGTTCATXrTAGCCTTIjrAGAATAAGTXXCAGTtjAACTGATA 

ATCTTCACTGTTATATA 

> Sequence 319 

AGGTA C1 1 1 1 1 1 1 1 1 1 1 1 1 1 H I II 1 1 1 ICAATOTTCAGI HCC1 1 1A AT * 
G AOCOOC A I C I CCC I GAAGG GCAGG TGCAOGCA OCTAG OTCATOGCAAGA" 
GA1G1 1CACTTGAAGATCTTGCCCTGATTGAA<KjCTTTO 
GAAG GCCCCCTCCCAGGAAAa GTAC CAGACATCAGCTXjC C ICI ICI I C AT 
TTTCAG<XLAAAGAAAGGGCACGTTCAAATGAGGTCAG 

ATrGTTAAAAAGCTGGCCGTT 



GCTGATCAG AATAGAAGTGG TTGC II CCAATGCATGOGAT 
OCPCTA<KXXiCAAC0O0CGGWGOCGOCT

<^OCCGCKCACGCCN»«XUGAGCCACANCATCTG7XK^ 
CCCAGOGAGAGAGACCNGNACGAOC3CCNGCAGCNACCN 
> Sequence 321

CGGGCTTGAAGOOUTNCCCGCNGOCGGCTC^ 

ACAAGCGGAGNNAACCGAAGAGGGC£XntlAAAGlMCGCCTTAJICCC<iACC 
CACCAGWWCCNGNONCCACCGO*GCC G I 1 1 I C CC NG AGGGGCCACNNCC 
CGCAAAGGCNGCAGNGCAGCG^X^CAANCCCCCCNCAO^^ 
NANNCHGCKOCAGGWGACCAGCA U.I ITTCTTTm 1 ACCT AGAAGNNG 
OCAAGOCACOCGNCAACAAAGCANACAAACCGAAACGGGCGGGGGGAAGG 
A NCCCAG A TXj NVGANGCCAGG AAANGGGA NG AAG A CCAAACGNGCCA NGN 
WCAGAACNAGAGAAGACCCCNGGGAAAGAAGAACCGAAGANANNANACA 
GANACCAGANAAAGCCCAANNACAAAGAAAGCANA 
> Sequence 322 

GCGTTAGGAGCTACACCGCGG>WGOCGXTTTGGGGACAATACTTaCAAAG 
> Sequence 323

OOCAinPOOACCTCDOOOOO CTGC CGGOOQAQCTAOCTICAATACrT AAAA 
ATAG TCTTOCACA AAAATACTTTATTTCTGATCTATACAAATTTTCAGAA . 
GCTTAI I I IC 1 1 1 ATCATTCCTAAACTGATGACTTACCATGGGATGGGGT 
CCAGTCCCATCACCTTOGOOTA C 1 1 1 1 HI 1 1 1 1 1 1 1 1 1 1 1 II I 1 IUGAA 
AGCTCTGCCA TAAACTTCTAGCCTGTCCCAATGGTCACCTCCCACACTCG 
CACt^GCTTGTCCCTGTAGCCAGCAAACAGAGTCTGGCCATCAGCAGACC 
AGGCCAOGGAGGTGCACTGGOGTGGriC 1 U CCTTGCTGCTGGTACCTGCC 
CC 

TGATGTCGAGCTtXXXXXJCTGGCGGCCGCtXXXMXAGGT ACTTTTTTTTT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 AGOGGG AGTT AAA T AAAA T AA GCATGTCTCCATCCT 
TT A TTCCTAAACATTTA CTT A TG ACAAA TGTAACAACTC A CA GAAATT7G 
AAAAATaCCA G ACACTTCTTAAATG A 1 1 1UJL-1 1 OGTTCAAAA TTTACCC .'. 
CTTCTTOm rCTCTTGCI 1 1 ICA GGTAATTAA CTC1 fCTCl 1 1 1 » AGTT 
TGAACTATOCAGTCCAAGATTCCTC 1 OTAGTCTTTCCAAOGGGAAGCGTT 



ttgaaaagcggtgggttta<xggttatataggtggttataaaaaaatttt 
caaaaaacaattttatggtttttaaaaaaaaaag tgooci ug o 1 cc 1 1 1 1 
taaaaata atggggtccocxxxxxxkkttogggg aaa ttttt a ttaa taac 

TTTTTTTTAACCCCCCrrcCCC^ 

1 1 1 igttici rn igaggggggggagataa 

TTOGAGTOGQATrGAGCTCCOCCOGCTCGCGCOCGAGGTAOCATCAAGrr 
AAAAC^GAAGATGCTTCTGGTAGAGAGCATTTAATCACTCTCAAGTTGA - 
AO OCAAAOTA TCCTGCAOAATCACCAGATTATT I lOlGG ATT ncC lGTT 

ocatt n m gcctcctogacacctcaggtaaa ttci xx.tcagaoctoctt . 

AATAAGCATrrATAGTCAOTTTTTG<XlAG<lAATAGAATCACTAAAG<3CAT 
TCT C<X3ATinT ATGGATQAAATCCATQAGAAGACCTOGOTAC T I OC C COO . 
G JCGI I Ui MIA TATTTATCTTTCTGGTACTT A CTCTTTTTATCCATTTT 
ATTOCATOCTATATTATTATCTATTTATTACTT AATCCATTCATTCCT f T 
TTAOOOC CTCCTAATTTCTCAGTATCCroCATA 1 1 OG 1 1 1 ICCTCTATTT . 
I MCI 1 1GI | f AICI lUClCI CIU I CC l CIA OOCTATACA CICICI 1 IA C 
ATCT TACTT TATAA CATCTTTCTA 1 1 CC I 1 1 ICI I ATATCTGTATQACTT 
CTTCAATCATTCTCTC 
>Scqaence 336 

TATCATGTGAGCT OOOGTGGTGGCGGCCGCCXXjOGCAGGA C 1 1 111 1 in 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 AOOGGGAOTTAAATAAAATAAGCATGTCTOCATOCT 
TTATTCCTAAACATTTACTTATG ACAAA TOT AACAACTTOACAG AAA TTTG 
AAAAATACCAGACACTTCTT AAATGA 1 1 1 CCC1 IXXjI tC AAAATTTACCC
CTTCTTon i icici roci 1 1 I CAGGTAATTAAC I C 1 1C1CI 1 1 I 1 AGTT 
tgaactatgcagtgcaaga rrccTCTtn agtctttccaagtggaagggta 
taaaaaaaaacactttatattatgccaggtgaggtgtcagaaccctggca 

TCGGAAAGTGGTTGCCTCACOOGTCATTAGQGTAGTAAGAAGAATTICTA 
GAAGACAGTATT G GI r CTA AAAACAAA CTroCTTGCT O Gl 

GCTGCCACWAATATTTTGATACKXXAAGTTTGGCCCTTTTrA^ 
ATOCOCOCCGCGGGGGGAA 1 1 HO 1 1 1 AAAAGTTTTTTGATACCGGCCOC 
OCTTAGGG0OO0OQGO00CGOO00CA C1 rrTTTTTTCTCTTTTI IU GCOO 
TAATATTCTXCTTTGGCCACATAOGOGAAAAAATG 1 ICCTTOOI OGTGTA 
ci rorsi 1 AAATTTCAATTCCCCTCAOCATTOCACACAA C 1 1 1C1 IC CCG 
GGAGCATTAAGGGGTAAGCCCOGGGGTCCCTAATGAGTTTAACTA 

CCOOCTOOOOCTCTACTCTCACAGACACTOCrOG O 1 1 IO GAA nTTOTTG 

I rCTCTGTCICI 1 1 GATTTOCTGGAAGACGACACCATGACAATTTCAAAG 
AAAATAGAA CAAAATO AAGGAAAAAOAGGC HCTOT CTl AGCACATTCCTG 
TCAC CAGCC TGCTGTCItnXXXXnX K SCtjrCCTGGOCCGGO CT 1 G GCACAT 
CTTOGTTT1 rGTGGTTG 1 1 C<XrrOGACAGC<^CTCTGCAGGGC 1 OC I IC 
TCTACGCAICtXll lO CCTGC^IO(XIGlUCCAGGXX«irG1CAAGGCCrr 
TTGGGTCAGAGTGGGCACtXXI 1 1CTO CAAGGCTCCCTGCAACAOCTGOC 
CTGTCCCTGGTGGGGCTGACAGC 1 1 ICI 1C I I A CCCTGCCAGOCTGGOCA 
AGCCOCAGAGGTGACCTATGAG^3CAGAAGAGGGCTTClTGGGGCCGTGGC 
TACTCACTAG<rrTGGATCGGCOCCTGTTXK^GCCCAGATOCTT G GTAOCT 
TCACTGGGTG 

> Sequence 329

AAACTATACTC CTAG TACTATTCATTTCACTATTATTGTOTAATTATATT 
AATTCAGTTAACTTTACTCTCTAAATACTrCTATAAATACTATClTCTAT 
TCA T A TTCATAA TTTT A TTCT ATTTATTA TAAT AA T ATTT A TT A T A TAAA 
rriTTCGTTCTOGTI C<XXXXKIAGATACTTTACAGGATGGCATTTAATAC 
AGATATTTCGTATTTCCCCCACl GO 1 T I T A TTTGTACAGCA TCATT A A A 
CACTAAGCT CAGT TAAG^AGCCATCAGCAACACTGAAGAGATCAGTAGTA 
AGAATTCCATTTTOGCTCATCAGTCAAGACACCACAAATTGAAACTCAGA 
ACTATATTTCTAAGCCIT3CATTTTCACTGATOCATAA T 1 1 IC T T A TTAAT 
ATTAAGAGACAO 1 1 1 1 1 CI ATGCCATCTCCAAAACTOCATGACATCACTA 
GTCT TACI 111 GC 1 1 AATT1TATGAGAAGGTATTCTTCATTTTTAATTGC 
TTTTGGGATTACTCCACA I C 1 HIGH 1 AATTTCTTGACTAATCAGATTT 
TTAATAGAGTCAAGTTAAATTGTGGOTCATAAAAAGCATTG<IATTGACAT 
ATGGTITGOCAGCCTAAGGGTTTACAGGX^TTOTCCAAACATriTTTOAG 
AACTATATTTATAAGCAGGCATGGATTTCTG 
> Sequence 330 

O ATGA TOACTCACCCCOGTGGCGOCCOAGGTACGCOOOOATAOTTCACTC 
ACTTTCAAAGOCAGCTGAAGGAAAGAGGAAGTGCTAGAGAGAGOCCCCTT 
CAG rOTQC 1 1 C I UACTTTTACGGACTTGGCTTGTrAGAAGGCTGAAAGAT 
CGAOCX> GCOOC OCOG<K^GGTA CIIIIllllllllllllllllUX 3CriTC 

I I roCTOCl I ICI IATOATCAOOCAC AI I IC I rCOA CC TOCl rCTOCTTC 
ATOCTCAGAATCTGAGAATTCTTCATCACAAOCTATCOOC IT01 C 1 GA TO 
CTCOA ATAOAAAI IC I C H U 1CI OGA IC1 IC I CCATCTTCATCTCCACTG 
TCTTCATOAACAGCA 1 C 1 1 U 11JGAATAGCCTOCATCTOGACAOCAGGTGC 
ATGAGGTAACATGCOCAAATTTTCAAACAAACGCTGTTTTA I C 1 1 1 1CCA 
TATATTTGGAGTGTTCTGO 1 1 TO 1 CATGTTTGAAGGACTAATATCCAGTT 
TOAAOTCTGGT 



> Sequence 331

TCroA TGTOAGC TCAOCOOOTC<»CGttOOGGGTA 

GG AOOCT I rGTTCOATTOTATAACACAGAATCACAAAOTTTCAGAAAOAA . 
GTGC1 1 CAAAGAATGGATGGCTCACTGGAATGOCXj I C 1 1 1G ACCTGGCCT 
GGC TTCC TOGTOAACTTAAACTItnTACAGCAGOAGGTGATCAAACAGCC
AAA TTTTGGGACGT AAAA GCTGGTGAGCTGATTGGAACATGCAAAGGTCA 
TCAATGCAGOCTCAAGTCAG 1 1 OCC 1 1 1 1 C I AAOTTTGAG AAAGCTGTAT 

tctgtacctggccg 

tgatggagctagcgcggtggcggccgcocgggcaggtaccatctgacttg 
ocaatotaaoacacacacottaotgtggggcacaaacgtggaatattagg 

AGAGAGCTGG TTCC AGCACCAA ATCCAGAGTCACTCGGGGAAGGAGGT AT 
GGTGGCAA CaCTTTATGCTT AA TATTCAA TTCTGCTCCAGTAGAACATGG 
TACCT 



^ATATGATCACCGGGTGGCGGCCGCTCGOGCAGGTACGCGGGGACTCTG 
AACGTGCTAAAATOGGAAGG ACGCG O rGTTTTGCI O ATCTGTTAAATTCT 
TAGTGAA GI 1 I CCl 1 G A TTTCCAGTGGCTGCTG I rGTTTGAGTTTGGTTl 
GGAGCAAAACTG AGGTAGTCCTAACATTTCTGGG ACTGAATCCA GGCNNG 
AAA AAAAAA AAAA AAAAAAA AAAA GGTACCT 
> Sequence 334

GATOTQATCTCCCCQCGGTGGCGGCCGAGTTTGATTTCTrc 
GCGATGGAGCOCCGGGGTGCCTGGTTATTGTCCGC M t C I CTCTCAGATG 

cttggci rorm I CA AOAGAA OC 1 1 1 1 1 oga tattcattoctocatoga 

TTGGATCCAGTCCTTGTTCAGAAAA I I G I 1 1 C A A GGCACTT AAGGCTGCC 
TGAAAGCCTTGAATCCTTGCTAAATATTCCA Ol lOl I 1 1G AAGGTTGTAC 
CT 



TGCTAACAACTTTCAAATrtnTCTACTTA CICCClCI I C i 1CA GCTTCAC 
ATCTGGGAAAACTGATAGGGAAGCCTaGGTAGGCCTACC 1 1 roCTGCCAO 

agkxsaagctcaatccatgcaagccceagataatatatgagaacctcocca 

A CCTTACC CTACACCCCrCACCTCCCAATa^G 1UCCT 
GCI li C 1 CAAACCATGTTTGGACCTGCTT GG AAGCTOCCTCTCCTCTCCC 
TAGAAAGCTT CATTATG TGAGTG ATACA 1 C 1 1 1 1 CA TAT C rTCTTGGTGG 
IGIGIGIUII^TATCATCAGCCIGAACATCTGAAGCAAATCTTGGGGGGG 
GGTACCT 

agatgccculTm*<^^ 

caggaggttcagccagaggaaccgacttttaagggatcacagagctcaca 
ccaaagaccaooggaacagtcagaagcctggci iocicctcaooctocca 
ogaacctgcctcaaaacacaogtctccacgaocaggagacaggt g ctgi g 
ctctggacagctgggccccagggaccagccatticgttiacaacagagctgt 

ATCCCTC I GTCAGCAAGAATGGATGTGCCCAGGCCCTGCACAAAGGGCCC 



AGG I C I GCC CTGGGGCTATGAAGAGACCAAGACGCTCCTGGCTATTCTTA 
GTAGTTCTCAATTrTATGGGAAACTCCAGACCTGTCAOCAGAACAGOCAG 
AACTACAGGOCCATGGCGGAAGCACTCTGGAGAAGG G I ITTOG 

TACCOGGGATAATCAAGGT 
ATGOCAGT - 



GGAGOCATTOTCCAGCC 1 1 OC1X L UGACK3AGCAGTGTCTGGC 1 1 ICTOCC 

ATCATTAGCAGTGACTOCAC^QGG 
CACGAGGCCCAGGCTCC^GAAAGTGGCCTCATGOCA 
AGGOCTGGATTGCCGCTTTCAATTACTGGCATCCAGAAATTTjTOT 
GGWSGCGACCATGGCCTTCTaAGGGOCTGGOAC^ 
*33» 
GGiAGACGCTCGATTCGGCGGCCGATGACGTGACCTCTCTGGGAAGAAGTT
ATTCTGCAGGCACACATTAGACCCAG2<MTGACAACAACNGCACATCAAA 
AGGCOOOOGQAAGATOACAGACOOTGCOOGCO 

GAGAACTAGAGGAOCCCCCCGGCTCAAG<^ATGCGAAATCACCOCAAGCG 
AAACCGGCAACXXKCAGGGGC<XKXXXKX^CCCAGG<X^^ 
AAGAGGGGGAAACGCACtXTAGGGGCGAAACACGGGCAAAGGACGGCTCC 
CCG^^CGAAAA^A^GGG AACCCGCA CACAAA AA CCACAACAACATA CCGG 

TCAAAAGGGAAAACCOCGGGCCGTCGCCCA 

GGCCAAAGACCGG 

> Sequence 339

ttaatxktn^ag at7tagatttatttga aaacactttagtct aatttatat 
tagtgcagaaaaatcacattcaataaaccacaattgtagaagagacagat 

AAGTGTGl I IGTCACATTTTCACACAAATATAATTTGATATTTAATTAAG 
GG a TGATGAA tcaca TTCCA TGT A AA TAATGA ttt A TTCTCTCAGTAAT a 
GAAGGA I ICICI III GGCTATTGAGGGGCTTTTOGGGTITATTTTCAATA 
CAGTGOCXXKrrrTAAAAATATAAGGGAA 1 1 1 1 1 1 1 1 1 1 1 1 AAGAACCTTT 
TOCXrTTrCCAATTTTTGGCCAATTTCCOGGAAAA^ 1 1 1 1CU C 

GGCGATA AACOCCOC COCAGG TGGAAAAAAACOCCCCTTTTGACAAAAAA 
TTTTTT GGGG GCCXrrGTTrrTATTTTGAAAAA l 1 1 1 1 11 1 C \ CAGG AAAA 
AACCCCCTTTA 



TTTTTTTTOTT1 1 A AAAAAAG 
Mlllll IA AAAAAAAAA 1 1 1 1 1 1 1 1 I I I 1 1 1 1 AOAGGOOGO 
> Sequence 340

ccgggcaggtacgcggwggagcgggccctacogtgtgcgcagaaagagga 

GGCGC 1 1 UCJC 1 1 CAGCTTCTGGGAAATOOOGAAGATGCCCAAAGACAACT 
CAACTGTTCGnGCI I CCAGGGCCTGCTG ATTTTTGGAAATGTGATTATT 
GG TTG riOGOGCATTGCCTGATGGGAGTGTA ICI 1 1 1 1 rOTA TCTOAOCA 
A CACA GCCTCTT A CCAC 1 1 G II GGNGCCAAGGACCACGATGACATCTAAT 
GGGCT GCCTGGAAOGGCATAA TTGTTGGCAA TTGG C 1 1 1 1 C I GGCTGGCT 
GGTCT AAGCAA TGGAA GCCTT ATTG AGGTCCACAA GGAAA I 1 1 ICIGG GC 
GATTTCATTOGGAGGTTATAGGTrAAGCCTTTTCAAGAGGCATTTTGGAT 
TACAAGAAGCAACACAAACGAGAC 



CTCTTTCT1 

TAACACAATACTATAGAAGGGACCCTTOGGCGCTCTAAAAACATAAAGGG 
ATCCC<XXX»GCTGTTCGGAATATTTAAATTAAAGCXrrrATT^ 
CGO0GAACCTCCAAAGGGGGGGGGCCC<^GGACCCCACA1 1 1 1 rGTTGCC 
T1 1 F I G ITGA GGGGGGTAATAO C CC XX ' ICI T TCGGGAAAAATAAGGGAAA 
AATPGCTl ICTGTCTOOAAAAAATTT ATCTCI 1 1 rC CAATATTCACAAAA 
TTATAAACCCGCG<KIATTAAAAAATAAAAAACCX«CCOGGGOCCAATAAGG 
GGGCGAACT TAACA CTATATTTOGGGNGCCCCTi 
OGGAAAAAATTTTOOOG C OOC ICCl I ATAAAAACA* 

AGGAQA G I 1 1GIC I A TATAOGGG CI III ICI ICI 1 1 ICI AA TATAAGAAG 
GC 

> Sequence 341

AO00AaX»0 OTOGlOC OO00000 q00C AjCGTA^ 
- OiAATAXrTGCCAGTTTTCCAACAAATTTTCTAAA . 
ACTCTTGC CTTAA AAC H 1 1 CI CACX>£ACCCACCTTCXXACATOCATGA 
TATCC AAGG T ajACAGA OCTOGATTAGAATCACTCTAAOCT^ 
GOGTATTGTA 1 1 1 1 CT COATAAGAAAGGGCTCCCTCTAGAACACAGTAAO 
TGT ATTTGCOCAGT AGTQACATTGOCTACATATAXXXLAAGTGTTATAGT A 
TAC CAACTTAGTATATTTTTCAAGGA GAGCTAAACCA CC 1 1 1 rPTAATGT 
ICOGI I rCTCACTGTTATCl ICCl 1 1 CC I ATAATTAATTTATTTTAATCT 
ACAAATTGACATAGGGCT AAAAGCTTCAATATTTTACAAAAT attaatta 
ATGTAATrOTTOCCAATTATTAGAAACTI 1 1 1 ICCA TTTTTCAAAATOTT 



pcrrusoo^ujn 
TCOCAACTTCACACAAGTGTCn'AAAAATAGGOCTCTGCATTTTCAAAAGC

ACA TA CATCAA T AA TTT A TT A GCT ATTCCA GGCAA GCT AAG TACCT 
> Sequence 343

TTIX^TCACrcACOG<XriXJGCGCCOGAGGTACAG<rriTAGTCTCAATOCA 
CTGTCA TGAA ATTTAAACTTTCATTATAATA C I O I 1 1 1 AAGAACTTACAG 
CA TCTGCTITAC AAATGGTCTTAOCTACATGTOGACACAOCATCTTrAGC 

cagi i i ici 1 1 1 ugaagttcatctgatgtcatctggaaactgagtaocac 
atttgcctgc icici i ggtggcctcacaagca aggcaa aaccatta tggc 
aatctagggttccagaataaocataaacattaagtgtcactccttggaaa 
atgacagatgtatccaagtttagttccctcagagcaatgaaattccaatc 

AAATGAACTATCACTTCTCCA C1 1 ICCI I G TOCTATTTTTAATAAGACAA 

agaacatcaocatattaagttgaagtaoctgcccg 

> Sequence 343 

CCG GGCAG CTAC ATCAG A GATGCTCACACATTCTTTG AG TAGTTTAA AA A 
CTCA TTTT A A CCA CTTTTT A fTt I T I GTA TTCAAACC AATCACTGGCA AT 
AGCTCTAAGTAGGTCATCAACTCTCCrCCATGT CI I LI I I CTAATTCTGC 
CACAGACTCACTTCTIXXXjTAAATTAATGGAAGGAAATGAGTGTCTGAGT 
TCTTA GAA TCTCAAAA GGCATGAGG ATAAA GCTTTOCTGG AG A T AA TATA 
AGTGGTGGCAGGAAGATTTCOGAGCCAGATCATACTCTTTTCCTCTTAGA 
G AAACTCTGTGO AA GC TC TOOL I ATACTGTGGGAAATAAATTCTAOACGC 
TGGCTTCTTTCTOTAGTAAACATOTGGGCCCTTTAAAATGTTGAAOCAAA 
ATGTGCTTCAAATATAGTTTAGTTATAAAACATTTATCGGGGaGTATGTA 
TGTG OCAACTACAGAGGCTTCAGAGATGAAGAAACaOTTCTTACCCTAGT 

O I IOC I iagaatctagtagtagtaagtaataattactaacatatgcattt 

ACTATATAGGCAATACTAGGGTAAATATTTTACATAGATTACCTTATTTA 

GTAGCTCTTAGCTGCTAAAAAAAAAAAAAAGATTAAGATGTCCAGTCTAG 

AGTCTC^AATTOT^lT^GTAAA 

TACTGAGTGTGGTCTTTACTGGGTAATACCTGAAGAAAGAGAGGTTGGCC 

CCCACAGCAGAGGAAGTACAACAGGCAGGCCT 

> Sequence 344

aogta ch i n i Min i mum hio tgggagttaaataaaataag 

catgtctxxattctttattoctaaacatttac^ 

ctcacagaaatttgaaaaataccagacacttcttaaatgatttoccttgg 

TTCAAAATTT ACCC C 1 IC I IOI 1 1 ICICI IOC1 1 1 1 CAGGTA ATTAACTC 

ttctci 1 1 1 1 agtttgaactatgcagtgcaagattoctctotagtctttc 
caagtggaagggtataaaaaaaaaacactttatattatgcgaggtgaggt 
gtcagaacoctggcatcggaaagtggttggctcacgggtcatagggtagt 
aagaagaatttacagaagacagtataggttcgaaaa 

AGOTACACTCOOGCGGGGGCAGAAAAGCTOCAAGGAACAGAACCAGCAAT 
GCAGAAGCTCCTOG^GGGCCACCATCATOCT^^ 

CAGI CI C17ATGCTGTGGC4CIUCIt^AAGGATGTCTCAAGOGCTCOGGTG 
GTOC TCTCC rOCl C I ATOOOCTGCTOTGGCAAATOCTCTAAAAACACOGT 
T1TC CACAG CAGAGAGCAAAGTCCGCTTGTTATTCCACCCGATACCTGAG 
CTCAGTTT0OCAQCTAGT0ATC I IPC ICCC I 

GTTG<X3CTCAOOCTniACACATG<XK3ACCCTOCCC^ 
> Sequence 346

OOOTACAAOAGATAGAAAGACCA G 1 CC 1 1 GC I OA AAGACAAGTCTGAATG 
CTOCACTTTTTCAATTCTCTCIXXIATTCTTCAGTAAGTCAACTTCA^ 
CGGATGGATOAAAOOCAGACACATAGCAATTCAGGAAATTroACTTTCCA 
TTCTCTXjCTCG ATGACXrroA CT 

CTTGAIGICIACAATA TCAC C1 1 ICI I ATAG ATTCGCATATATGTGGCCA 
AAGGAACAACTCCA I O 1 1 1 IC I AAAAGOCCTAOAGAACATATATCGGCTG 
CC 1C ICCICI 1 1 OC11 1 1 0 1 G 1 1 UU I cattttggogaattactggaag AT 
C ^GGGC AGGTACCAOGCCCAGCTAATTTTTTrATGTTTGTAGTAGAGACG
AG 1 1 1 CACCATGTIGGTCAGOATXKJTCTCAAACTOCTO AOCTCAGCTGAT 
CTCCCTGCT TCGGO CTpCCAAAGTCCTX^GATTAGAGGCATGAGCCACCA 
TACCTGGCTLI 1 1 IOCI I CATOCATOCCTTAA 1 1 ICI I lOCTGGAOCATT 
TTAAAGC AAA TA TCAG A CA T A CCCTTTCACGCCTCACACTTCAA CATGCG 
GCTTGTTG AAA TTCGTGCTCCA CTCCA GC A A CTGCTTTC A ATCGG AG TTC 
CATC CI CCG CCGCA GTATG CCCTAACGCAGCGTrATCTTt>OAGCTACTA 
CCCAOI I I CCGAAACTTTTtXSAOGG AGCGCTTTGOCACCACCTTGAACCG 
GGAACGGGTGCGTAAACCAAACTTGGAACtjCCAGCOCCCOOGCGTACCTT 
GGOCCCTTT 
> Sequence 348

AAC G A TG A CTCACCGCGCTGGCGGCCGCCCGGGCAGGTACTTO ACTGCTA 
CAACTTTCAAATTCTTCTACTTACTOO C IC I I CI 1 CA GCTTC ACATCTGG 
GAAAACTCATAGGGAAGCCTAGGTAGGCCTACCTTTGGTGCCAGAGGGAA 
GCTCAA TCCATGCAA GCCCCA G A T AA TATA TG AG AA CCTCCCC A ACCTTA 

ccctacacccctcacctc^^ 

gcttcattatgtgagtgatacatci iticatatcttcttggtgtgtgtgt 
gtggtatcatcagcctcaacatctgaagcaaatgttgggtggggggtacc 
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G AGTCGACTCACXCOCKSTGGCCKKXXjGAAGoawao AOUTO C 1 0 1 OC I O I 

gtatga agaggca gtgaagactctgccaacagaggco^tgtggaagtgtt 

A CA TG ACCTTTTGCTTGG AAAGA TTT A CTA A GAA GTCAAAT AGTGGGTTC 

cttagagggaagaggttggaaaaaaccatgactgtattcaggaaggcaca 
tgaactgaagcttctgtcagaatgccaatacaagcagttoagtgtttcgt 
tgctgtgttataactrcctgagggaagctctggaagtcccagtagcrgga 
actgaattgtttagagactctgggacaatgtggcagctgaagctgcaggt 
gctg atog agtca aagagccctg acatagccatg c t 1 1 hu aagaagcct 

TTGTGCACCTGAAACCCCAGGTrrGTCTGCCATrGTGGATrrCCTGGGCA 
GAOTGG ACTGGAA OGTGCCAAAAGCCAAGAAOACACTOAOOCAGTCTTTA 
AGAAAGCTCTTTT ACTGTATAAGTGC 
> Sequence 350

AAGA CTTT TAGTTCGGCTCTCCCAGTG T'l 1 1 1 1 1 1 1 OGTCG ATTTGGGCA 
CAGAGTTTCCTGGTTC^CGTGGATGTGAGGATCCTTTACTCCAGATCGCC 
AGCCAGI 1 1 1 IG II I I I TT TCCTGCGTTGCTOAGAGTCTOGGTTTATTCA 
TCACACCAG^TGGATCTTAATTOCATATCCCTGAGGCCACTGCAATGAGG 
CAG A C^IAGTGTGCTCOCTCATGAG AAAGG ACTGG AG A CCGCCCCCAGAAG 
AGAACGTATCCATGTACCT 
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GTAGATCGTTGaCTCACCGCGGTOOOGGOCGCCCGTTjCTGGTC 
TTOCCCG1 1UI 1 ICTGGATOTGAATGOATTACAATGT ATI 1 1 ri TAOGGA 
AATCCTATTATTATCAATCTXiACrOCACGGGGGAGTCCATGGTGATGATG 
ATGAGGAOOAGGATG ATGATGATG A G ACACCTCTAAACTTGGAACAAGTT 
TAAGACTTTATOAGACAAQAAAAAAAATCACCAACAAGAA I IOI | IOAGG 
AAA AATCAXA A. CTATCCT G T G ll C Ar i 1 1 III I I ' ll A TAAACAATAAG AA 
AAAGM O TIOGA I IHIIII | AATGA II ICI 1 1 1 1 1 GGGGGAGGGAATTT 
TGTTGCA GTTTTATGGTGG AAAATGCAAAAACCAGA GCCAGGTGCA TAAT 
CTTGTAATCTGTGGATATX3CCTGGAGCAGGACTGATX3TCT 
> Sequence 352

^^^^^^GACTCCACCGCCGTGGCtjGCCGOCCGGGCAGGTTGGTAACA 
ACXJGAGAGTOOCGGGAAGCAGTGGTAACAACGCAGAGTCCCGGGAAGCAG 
TGGTAACAACGCAGAGTCCOGGGAAGCAGTGGTAACAACGCAGAGTOCAG 
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AGGCAAATAGAA 



GGAACCAGTGGTAACAACGCAGAGTAC 
TGAGAACCATATTATTjTACCT 
> Sequence 353

GTAGAOAATOAOCTtJCCCGCGOTOGCGGCCGAGOTACACOCAGCTTrGTC 
TCCTPOa XCAAAT C T CCI 1 1 I CC I I ACTTTGGGCATTAACTGCTGTTGA 
GG1 C I CACAGCCTCATCGTCATTATOCCTGAATCGCATAAATCAACAGGC 
TGTATGAGCA TTGTGTG AG ATTCT A CATG AGGG A G A GCATTTCA A A CCCA 
TGACAGATGAGAGAAGTTAGTACACTCTCACTGAACTGGGGATGTTTGAC 
TT A AAATG ATGG A CAA TA AG A T AGTG A GCAG T AAGTGTGCTCT A GGCTA G 
GCTACGAOAOGCCATGAGCTCCTCA 1 1 1 H IC I C IG I ICIGAGCTCTCTG 

TACI I I C I GCATCATTGGTTCTCC^^GCTCACTrcCATAATGTXCTOCTA 
GGCTGCATTGG AATTG7CTGTTGTCTAG ACCCATGGCCA ACACTGTCA TT 
GCCTOTGAGGGAGACCAAGCTTACCACGAAAGGCTTTTGCG 
> Sequence 354

G A TGGGTTG A GCTCACCGCGG TGG CGG CCGCCCGGGCAGG TA C I l l l l ll 
TT 1 1 1 1 1 1 1 1 1 1 1 1 ' I GCCTTTAG AAGGTTA AAATGOCAATATA AAGCTAA 
AACAGTAATCATCAGAGACAGCTGTAATAAGGCTTTGCTACIU 1 1 1 1 1 A C 
TATA TAAATC1 1 I ACGTG TT A ATGG AAA G AA AATT AA TTCA TTCTG TT AC 
TCCAi i 1 1 1 1 ICTCTCCATATTGTATGCCTGAAGTGAGCTGATGAOGGGC 
AGAAAGATCATACAGTTAGGAATGAAGACATCAGAATGTTCCACTAAACA 
GATATTTAACTAGATACTATTATACTACTAAGAATAGCAAGAATGTCTCT 
CAATTCTGGGAATTTCTCCTAGCTCACACAAATGAAACGC^ 
GAA7TX.I I IL I AATAAATGCTTCCAGGATAGTATCATAAACAAAGTCAAA 
ATTAAGAAAA ATT^CXriGCATGGCATOCTGGTCATTCTOCATCAGCTCAC 
CTT IH ICriATCAGAATCCACAA CIOC I I I I I 1 UG I'l' n CACAACAGTG 
> Sequence 355 

GCTOAGATGAGCTCACXXKJGCGCGGCCGAACCGXXATCTTCCAGAATTOG 
CCAAAATGACGAACACAAA GGG AAGG AGGAG AGGCACCCG ATATATGTTC 
TCTAGGCCTTTTAOAAAACATOGA G I I GO ICC 1 1 IOGCCACATATATGCG 
AATCTATAAGAAAGGTGATAATGTAGACATCAAGGGAATGGGTaCTCCAA 
AGATTCAGGTTTACTCACGCCATCCAGCAGAGAATCGAAAGTCAAATTTC 
CTGAATTGCTATGTCTCTGGGTTTCATtX^TCCGAC^ 

ACTGAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGACTTGTCTT 
TCAGCAAGGACIOGICI I ICI ATCTCTTGTACCT 
> Sequence 356

GATGATTGAGCTCCCGCGGTGGCGK>CCGAGGTACCTGACTC 
TCTGCGTCGCAGCAGCGAGAGAACAAATCACTCCATATCCGATGAGAGGA 
AGGGTGGCACAGAGATGGTGTCTACAATTAGAGACATTTCTQACTCCACC 
TTAGCCT AAGCAAACTTTATGTACTG AGTAAGATTTGAAGO I IP ICI 1 1 1 
AATGGTGGGGGOlOl 1 1 1 I ICCI 1 1 1 1 AAACTACAGTGCTTGCACAAGAG 
AGGGA GGG ACTCAGAAAAGGTTAGOGCAGGTGAGGGAG ACA GTA GA TGGC 

CT GGGAT GACTTGAGTOCATCATACT attgcttggcaggtgtgctccccc 

ATOI 1 1 OATTCAAATrCCATGAGTGACCTA CCTI I CUCCAGG AA TGGGAC 
TGAGAGGGTAO ICI I CtJAGCAACTTAGTCTGCACAGGGKrTtXCCGTTCAG 
GCTGCCTTTtjGTGGTTGTGCTTTTGTAAO 1 1 I CI I I l lCl OCACTTCOAC 
TTACCTTTCAATCAGAAAGCAAOCCCAGCAGGTGAATOAOGOATG ICTO1 



GAJ]GCAGTTGAGCTCACC<XXX^^ 

ACTTTjGCAATGTAAG^CACACAOGTTAGTGTGGGGCACAAACCTGGAATA 

ttaggagagagctggttccagcaccaaatccagag^ 
ggtatggtggcaacactttatgcttaatattcaaii C'lOO CCAGTAGAA 
catcgtaocaocattcttccaagttcaaaaattatctt^^ 
ttccccatrcct ctaa ta tgtca ccaa ttctgctg a tacattctttgtaa 
tc1c1 ccatctattttaatctgttattcacctgagct aca caaaca ttca 
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^mu^o u^!£aT^^ 

ATACACAGGGATAAAAAAAGTAGAGGCACTTATTGCATACCTGTAOCT 
> Sequence 358

TGTACCATGAATCGAGCTTJCACCXXXX/TGGCGGCCGAGGT AC i 1 1 ICI A G 
^PT ^^rc^^CTCCATACTCAGCTCAAAACA C T Gn ' I C AGCCCCCTC 
TCTCCTGACCTCAGCC I I C 1CCAGGTGTA ICI C I I UATGATCTTGGAGAC 
CA GCAGCCA CAGCTGCTGCT A CTCCTGCAGG A G ACTGTCAGGCTGTGGTG 
GGGGG CAGGGGTGTTGGAGGAGAAGTTGAAAATCCGTC 1 0 1 1 C I CTGTCC 
CTC1GC ICC ICCATCTTAGCTTCTGGAGGAGTTAAGGCAOCAAGGGCAOC 
AAGTCAGGTTTGGCA GTTTTTGCTGCCCTTTGC3CCAAC<KTTCAACAAAA 
CCAAGCTGGTCCCCTTGCI'IGGTTGGGTCCCAACCCAGGGGGGATTTGGG 
GTGGG TGGATAA G A A CCC ACCA CTTG T7TTTTCCCCCA CTTTTTTT ATT A 
GGGGAGGOl III GGGTTTGG TTGGGTTTTGGGGGGGAG AAAAAAAAAATC 
CCACCTC llllll I AAACTGGAAGGCCGGGGGGTCCAATTTAATTTTATT 
TGGA CCTCrrCTTTTCGGGGT AAAA CAT 
> Sequence 359

ATATGAGCTACCGGGTGGCGGCCGCCCGGGCAGGT ACTGGTGTTGTGATC 
GCAACGTGTCGATCCCC IC I ICIC A TCACIGCIOCICC A ACTQGATTrAT 
TACTCCGGG A ATGGTAGAGAATAAAGATTIGTAGGAAAGGTOCTGAACTG 
CCAAGGAAGGCATTTCTTGTGCCCTGTCIGOAACtXjTGTATCCTTACTAC 
ATCACTGAACGACACCAAGCACCCCATGCACTTCTGGCTCCAACCTTGG 
CCCTGAAGAAAGACACTGAAAATTXKJ AATGCAAGCT ACTTCCGTAGGGGG 
GATT TCT I I I A TAATGGTAAGGCXXrTTTAAAAAAGGGCTTAACAACAAAA 
AAAA 1 1 1 I 1 1 CCCCCGGG GGAGG I U 1 1 1 1 AGGGGGAAAAAGG GI T I T I CC 
CCCGGGG AAACCCCCCCCC I III! I L I GGGAGGG AAAATTTTTTTGGGTC 
CTGGAAGTTTT CAAAAAATAAA ACCCCCC CI I I lOl 111 1 1 AAAAACAAC 

CCCCCC1 IG! I GTGGTTTAAAAG 
> Sequence 360

TT A GG A CTACGGGGCGGCGGGTCCTTJIGCAAAACTAAACACGCCCG AGG A 

aatttgck:cagttatccaattgatgaactagtagatagagocaaacaatc 

TTTTCAAGAGGG I G 1 1 1 0 1 G AO ATATGGTTG ACCAGTGAAGACACGGGGG 
CTTATGGCAG agatattgggaccaatctgcccacac I CC I G I GG AAACTG 
gttgaagcgattcctgagggagcaatgctgaggcttggcatcacaaa 
gcoctatattttagagcatctggaggaaatcgcagaaatcctraatcacc 

CCAG AGTCTA (XKnTTTCTGCA CATACCA GTCCAGTCTGCCTCCGA CAGC 

GTACCTGCCCG 

>Sequence 361 

GTCGACGTOCATTCAGCTCACCGOGGTrOGCGGCCCAGGT ACTTAAAACCA 
AATAAAAAGTGACATTTGAA 1 1 ICI 1 1 1 AAAAGGATTTCCGAGCTCACAG 
TCAGCTTCCGAGOCATTCTtXXXKXiTACCAGCACAAACCG^ 
CTAAACTGCTCA TTT ACTGGGCGTCTACOCGGG AATCCGGGGTCCCTGAC 
CGA 

GTCGAGATCCATTGAGCTCACOGOGGTGGCGGOCCUGGTAC^ 
GCCTCACACTCTATAAATGTA 1 G 1 G 1 CC I GAATTTCAG AGCTT AATAATG 
AATTATGGAACTTGATAATGATTGGATCAGGCAGACAACArxr^^ 
CCTAATATCAGAAAAGAGACAAGTAGACATTATGTGC 1 ICCIUAOGTGAG 
GCAGTAGTAAGGAAACAACATCACACATGTAGCAGTCTTOGGAAAAAAAA 
TGTAACCTGT ATCTCGTAATGAGGAAA CAATCAGTAAAAAAGTCTAGATT

gtgggacattccacaaac 1 1 ccc i gaactctttaataatgtcagtgtcat 

GAAAGACACACCACACACACACACTGCACATCATACACAAACACC^^ 
ACCACCCACCACTCAGACUCACUCAAAAG<XK^CTCTAATCAATTAAAG 

gaaacaaaagagaatcacaactacatataacgtataattcttgattggat 

CCTGGA TTTAAAAA T AAA CAGCT A TAAAGGATA TTTT 
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c363 

GOGATGAGACTrrCAGCTOOCCGOGCTCGOGGOCGACCTACTTAAAACCAA 
ATAAAAAGTGACATTTGAA I MCI I J I AAAAGGATTTCCGAGCTCACAGT 
CAGCTTGCGAGCCATTCTCCCGCGTAOCAGCAC 

taaactgctcatttactgogcgtctaooooggaat 

OA 

GTTCCGTG 

gcagtcgtaacaacgcagagtcccgggaac^gtggtaacaacgcagagt 

ooogggaagcagtggtaacaacgcagagtoccgggaagcagtggtaacaa 

cgcagaggcitpcagcacagcccagggtgcocgggactgaaaactccttc 

aocaococcctccacagoatatagaaoacttagatcactacgagatgaaa 

gcagagoccattagtgggaaaaagttggaggatgaaggaattgaaaaaaa 

aaaaaaaaaaaaaggttcctgcccg 

gaitatgtgagtgattgagctccaccgcggtggcggccgaggtaccaagc 
actgggtaaggcacttttgtgoagcattagacagtaaccctcaaogagct 
a gagaac cgg atgggagacatgagcggtaa ttaa ctcaci ioi ro oQ CAO 

AGTT ICIAI I IOI 1 1 JCj 1 1 1 ICI 1 1 1 ICIGIOACTTAI 1 1 IUL1AI IMC 

1 1 I IX 'I u catgtaattttcactatggcocaactaatataaacagctggaa 

ATTACAAGG AAAAA AAATTCTTOCTCTA ATA A CTTTOCAAATTTCTGGAA 
TATTTATTTGTAATAGCAGTTATCAGTTATGCTTATATAGCATTAAAAAT 
TCTCCTOCTTTGACTACACA CACAACCACAGTGTGGTTCTAATCATGG AG 
ATATCAOTAATTTTTAGTAACTGAATTT TGAGG ACA 1 1 1 CI C I OTTTAGC 

TTTTTTTTGAGATGGAGTCTTACTCTGTCAC 
> Sequence 366



TGTGACGTGAGTTGAGCTCCCCGOGGTGGCOOCOGAGGTACTTTGCATCC
TTCAACCCAATCAAGCTGAGACTCAGTATTAACCATCACAAGGOGTGAGG 
ACaGATAGCTGCATCOGCAAAATAGAGAAOCAAGAAATAGTCCCACAOCA 

aagtcaggatcaaatgattcctggacaagccaccaagtca^ 
gagaaagaagoctttocaccagttggtgctggaagttctggatatgcacc 

TGOATAAGTGAAOCCOCCTOCGTCACCACACACAAACGTTAATTTGAGAT 
OGATTGCAAACATAAAAOCTAAAAOCATTAACACrrCTTOAAOGTAACAT 
AOAATATTTTOTAATOTTATOATAGQCAAAAG ITJT C T1 AGGACACACAAA 
AAAATTAACGATAAAAGAAGAAAATGGCTGGGTGCAGTGGCTCACACCTT 
TAACACCAGCATGTTGGGAGGCTGATGCAGGAGCGTCCCTGAGCTCAAGA 
GTTCAGOCCAGACTCGCAACATAT 



GTATGATAAGAATCGACTCGACCGGGGTGGCGGCCGAGGTACATTQAGAT 
TCAAGAGAAAAGTCACAGCAGGTCTGAGCTCCTOCAGCAGGOCTTATGTA 
ATGCTAAGATTTTTGGGGAAGATCAAGTTGAACTCATGAACIGGCTGAAT 
GAAGTGCATGACAACTGAGCAAGCTCTCAGTCCAGGATTACAGCACTGAG 
GGGCTATGGAAGCAGCAGTCTGAACnTCGGGTTCTGCAAGAGGACATCTT 
ACTCAGGAAACAAAATGT AGATCAGGCTTTACTAAATGGTITAGAACT AC 
TTAAACAAACX1ACA OGTGATX1AAOTTTTAATAATTCAAGA TAAATTGGAA 
GOCATTAAA GCAAGGTACTGGCAGATAOGAATTG AGCAT ACCACAAAAAA 
OTTCTCA I 1 1 ItnGTOCTOOCATCg^ATTCTCCTCACTAACCAAAGGCTA 
OGAATTATCTGTGAATCTAGGACCACTOGATTTGCAOTCTTt^ 
mCTGGGGAGAG 1 1 1 CI AGGAATGAAAT 

GCAGOCAGCTTACAAGACACAAATATGCTCTTATAGGCTGGGGAAATAAG 
AAAATATGAATGAAGCAAOCCAGGTCTTGAGCCAAAGAATTAGCTGGGGT 
OOGTTGAGTTCAAATCTGAAAA 1 1 fCTOTCTl I CA AOGTCAGCATCOCOC 



> Sequence 369

TGTTGTGATCGATCGACTCACOGCGGTGGOGGCCGCCCGGGCTGGTACGC 
GGGGGTTTCOGGTTTGGGTGTGGCCGCATGGCGTGCTGGGGTGCAGGTGG 
CCGAAGGGGGCGTTACTGTTGGGACTGGCATCCGCATOOGGCAGATGTAG 

atggaaccaaagccagaagttaogggtcacoc I rocTCi acagccaaaca 

TGCAGGACTCTAGTAACCCGCGAAATGATGGGATAGCGTTGCAAATCCTT 
AA AAGA GTCTTAACGGAGAAGGAAAAATGTTACATTGTCAAAGTOOCAAA 
GOCTTTCAGOCTGAAGCCAGGAACAATTGTTCAAAG 1 1 1C I I ro OAACAT 
CA AGGA AGGAAATCCAGATTTTACTTTAAGTGCAATGGGGAGTCATTAAG 
GATTTTGTGTAGA TACAGCAAAAAG ACAACAATCTTCAAGCCACAATGGC 
CCTCACCAGAACCCAGCCATGT G GT CAGCCTGATCTCGGACTTCACAGCC 
AGCAGAACTGTGAGAATTAAATCT 
> Sequence 370

CAGCCATTTTATGATAAGGCCACGGTTGGGCCGGTTTAAAACAAGGGGGT 
CCCCCGGCGTGGGG AAGA TTTTATTAAGCCTTTTTTGTACCCCCCG(XTC 

AGGTTTTTTGTGTGGAAAA 



g gacgo ggagttgagctococgcggtggcggccgooogggcaggtacgat 
t a ttttaaacaagcctacgtccctgactaagcgagtggaaggtgtgagtg 
gcactacaaattcacaaaagaactgtagcctcagataatcaaaggagaga 
aggtcagatgcaatcactgatgcatgctagtaattctcaaaccttogttt 
tgagaaacgattggattttcagatagatttgcagtaagagaataacaagt 

CTTTA I 1 1 1 1 1 ICA TCCCAA CI I CI I IC I I O CACA 1 J 1 I ILI I C TAGCTA 
TATTTAA TATCTGTTCTCOOCACACACTTGCTAATCTaCATTTCACAATC 
TTTTTCCACTTCACTT7GTCTGCANAGAAATCTACCTGGACAGAATAGCA 
fCTTTTTTTI I LC CCCTGACCCTTGGCAI 1 I C CI CI ICCI CCAA CI IC I O 
CCTG A TCCTaGG A TGGAC IC TC 1 CATCCCTCATTCTCTATCATTAGCTCT 
CAGGCTGG 
> Sequence 372 

TGGACGATGATTGAGCTCACCGAGCGCGGTGGCGGCCGCOCGGGCAGGTA 
eXHXXXaXAlGICICTIl i lCAGCT CIC l I I C AGAAGACCTGGTGGGGCAAG 
1XCCTGGGCATCATGTTGACCGAGCTGGAGAAAGCCTTGAACTCTATCAT 
COACGTCTACCACAAGTACAAOAGATAGAAAG ACCA OT CC1 TGCTGAAAG 
ACAAGTXTGAATGCTOCACTTTTTCAATTCTCTCTCCATTCTTCAGTAAO 
TCAACTTCAATGTOGGATGGATGAAACCCAGACaCATAGCAATTCAGGAA 
ATTTGAC Ml OCA TTCTCTGCTGG ATG ACGTGAGTAAACCTGAAT C TTI G 
GAGTACCT 
> Sequence 373

GGAAAGCCTGGAGAAAGAGGATGAAATGACGGATGAAGCAGTTGGAGACT 
CTGCTOAOAAGCCTCCI ICI ACTTTTGCCTCACCTC AGACIG CTCCAG A A 

CAAGAAAAG AOCTTTOATCAGGAGAAGAC1TCTC G I CI CA TTTCTGGGGA 
CACATTCAGGATTTCrCCAAAGCAGGTGAAGOTAOCTGCCCG 
> Sequence 374

TGAGATGGICACGGGTGGOGGCCGAGGTAOGCGCCAGTCACTAGCAGGTC
CITOTGAATCTOCTCAOGGAGGCACTTGCGAGAGTTAATGGGCAGATGGA 



GGTGCTCCCCGCGTACTOCTTATACTTATTAAACACAAAATTAATTGTAA 
AATAGCCIX^GGCAGGTCCTTXAGGAGGTATOCAGAAGAAGGCATTXjTGA 
TCATAGGAGCTGATGGCTCCGCCTGGGTTACroOCCCTGTAGACrrOCAG 
TtKKJACAGGATATOGAGGTGGAAC^CAGTGACATGGATCATCO^ 
TTGTAGGTCTAGGCTAACGGGG G T OT T TGT OI LT1 A GCTTTT AACAAAAA 
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AGGTTAAAAAGTTAAAAAAAAAATAATAAAAANTAAATTKTAGGTACCTG 
GOXXGGCGGCCGCTCTAAACTTGGGGAATCCCCGG 

GATGCCCOCGGOTGGCGGCCGAGGTACCTCAGCTGTTGATCKnX^ 
TAGGAATCATTTTACTGGAAA TGTrCT CA GGAATGAAACTCAAACATACA 

gtcagatctcaggaatggaaggcaaacagttctgctattattgatcacat
atttgocagtaaagcagtggtgaatg<xx>caattccagcctatcacctaa 

G AGACCTTATCAAAAGCATGCTTCA TGA TGATCCAAGCAGAAG AATTCCT 

gctgaaatggcattgtgcagcccattctttagca rrccm iocccctca 
tattgaagatctgg fcatocttdocactocaotoctaagactgctqaatc 
tgctggatoatoattatcttgagaatgaa gaggaat atgaagattgttgt 
agaag atctaaaag aggga gtgtc a aaa atatgg accagcggta tttpcta 
cttggtdcaaaggaaaatcctcgcagaggaacag rctttg n u agtatgc 
aaaggctggggattcaaagttgogcagaa

> Sequence 376

CA CATCF rATAATTATTTA TTTTCA CTACTTATTATTCTAATTTATACAC 
AATCTTTCTTATTTATTATTCTTI I C I ATTTATTTA C ' I m i l ATACTAC 
rTTTTTTCAATTTTOAGATOOAATCCOOGCGGCGCT^^
ctocccagotcacaogtctooaaaaagcgootg ot gca at gcrocatooo
gatgaggggagcacgcagtggagccagctcggtgtgggagaggt acctct 

AAGGTG M L I ICC 1 ACCTAGCCTA O ITTTTTTC1 A CCAACCTAGTTCACC 
TAO l 1 IU. IG CLI A A(XICGrrAOATATCACrCTTCGL IU CI ICAACAAT 
ACT AAA GCAACACTCCTG A TA TTAACCTA CT ACTCA GTTTTG TGTGGCAA 
AACAGAGATCACATCCCA 1 1 ICTC1 I rGTOTCTCTGGCTOTTAGCACAAA 
G T7T AGCACTT AA TTCA TOCTCTACA A TGTT A G7TGA A T A GGTG AGTG AC 
AGAATITGTTATTCTTAAACCATT A C IOI I IOI I 
TTAAACTAGCTCATTGACGTTAi 
GGTAAGAT 
> Sequence 377 
TGTATGCGTGAG 
OCTCTCTATGAA GAGOC AGTGAAGACTCTGCCAACAGAOGC^ 
GTGTTACATCACCI rTTGCTTGGAAAGATTTACTAAGAAGTCAAATAGTG 
GO I rOCI rAGAGGGAAQAGGTTOOAAAGAACCATGACTGT ATTCAGGAAG 
GCACATGAACTGAAGC TT C T G TCAGAA TGCCAA TACAAGCAGTTGAGTCT 
H O GI ITiCTGTGT I A TAA CI I UC 1 OAGGG AAGCTCTGGAAGTGGCAGT AG 
CTGGAACTCAATTGTTTAGAGACrTTGGTAOC^ 

AAGGTG TTGATCCG A^^ 1 1 1 HIP 

TTTTCTGGKjCAGaGTGGAGTGAAGGGTCCCAAAAGCOCAGAAGACACTGT 
> Sequence 706

GGT AOGAGTAAATTTTCA TTA CCTTT AA TTAGGCAA 1 0 1 1 IL I IA GATAA 
CCATAAAACTCCAAAAGCAATTTTTAAAAATGA TAAAT aggacttcatca 

aaaagt aaacocttcaaaagata ctactgagaaagtcacagaataggaga 
aaaatctgatcagactttatgtctagagtaatga a i fcttgi ia aogaat 
aagcaaccooci 7 tt aaaaa tgggcaaaagatttgaataaacatttcact 
acagacaataaacaaatggocttaagcacaagagatgctcaacatcagta 
attattagggaaatgccaatcaaaactacaaccagataccctatatccac 
tagtatggctataataaaaaagagtaacaaaggttgaggaggatatggag 

AAAC TOGAGOOC IUO I CA GOroTOOlOGATCACACCItyTAATTOCAACAC 

tttggc^ckttgaggcagccagactacttcactga^ 
act aa octgggcaa ca ccxxx»aa>xcccatttct acaaaaaa ttcaaaaa 
TTAATCACGCI iooiuoiooiocxxxxxtataatccaacttcttaggagg 
CT aagatgggagga itqgi ioa acccaggcaggtggagggtggaotgaac 

CAAGAAAAAACCGGTGGACC1TTAOCCGGGTGAOCGAGTGGGAO0CTACT 
TCAAACAAAACCCAACTACTGOOOOOCTATAAAACTOG OOGTI IC I I A AA 




CATAATTTACOC1 1 OCT 

GGTAOCCATATOCAAGGCTTATTOCAA C 1 1 1 I I A O I C 1 1 OCCCCTCCTACT 
TACACAGTCCAGAA7XACTTGGTGAGCATTCCAGTAGGACGGTGGCATTT 
TAGGATTCAGAATATTAACCT ATAAACCTGTCATTTGATTCTTGATTATT 
AATGTCTGGATCGC C I G I GGTAGGGGTGTAATCOCAOGAAGGCATTAAAT 
ATA7TTGAATTAATOTATATTTTGAOAATAAAAGGCTATTTCTAGAAAAT 
ATTACACA C II U I C 1 1 A TGTT AAA T AAAAA TTTGCTATTT A TTGAAT A TC 
CCTTACCCACC C I ICI I CCC AATGAAGATCTTATGCATACC1TCACTGGA 
AGGTTTAAGATGTGACAATCTTAATAG A l C T TTP1 GA GACCAGOCATTTC 
I CIOI 1 1 ATA 1 1 I lO GAACCGCCAGAGC^GGGCCATGCCA CCI 1 1CICA 
TTGTACXrrGCCCCHXKXK KX OC I CAAAGGG 
> Sequence 708

ACATCCT/TTTGCATGCtCAAGAGCCCA I ICI 1 1 ICATCATTOGGAAGCAA 

cagcggcagtcccctgcccaagttatcccactagctgattgctatatcat 
tgctggagtgalctatcac<x2accagacttgggalcagttataaacrcta 
GAGTGGTAAO IO ICI I cacattctttaagcactaaagaaaacttttaa TT 

AGCTA CCI I OCI ICCA GTAATCAAACTAGAGCICCICIUCCI fGTGTAAG 
TtGCTATAAAGTATTGaCTATTAGA A I O I C I Itt AA Cl I IOOI 1 ACTGTGA 
OCCAAGTOGO I GC I CAAAGTATATTTCATAGTCTCAATTATATAGTAATT 
TAG G I I C IGAAAAATAGOI I C IUT CU IOC ATATGTAATAI 1 1 IOIO AGT 
ATTTACTTTGGAA^GTTTXXiTXXiACCTAATGATAAATTTAGAGTTTATTr 
TOCI 1 1 1 ACAAGCTTACTGCATTGCATOGTATTCAOTCAGC1 X riGATGA 
AGCTATCTCATACIGGTCGATATCA IUC 1 1 ICA AAGGGTATTGCTGGCAC 
TTCAAA G A TCATG AA GAGCAA GGT AAGT AG AACATCCA T ACCCTCCT AAA 
CACTTllGGACCTCTGAAAATGAGCI IOI 1 1 1 1 IAGGAAAATGGCTGGGG 
A C I I IC I AAGGGGTTCA CI 1 1 i IC ATGGATGATG C 1 1 I OI IG AACTGAAA 
TCATGGAATAGAAGTGGAATAATACTTTACATAGGACAT
> Sequence 709

GGTACAAGGATGGTCCATACCACTCTTTA C 1 I 1 I C TAG AAAGTTGTTAGA 
CTAATTTTTCAACAAAAATTCTTTA Tl G TCTTGGTAACAAAAG AAGCATA 
CTAAAAATTCTCAATAAGGCACA G T G TCTCT A GAAGCTTG AGCATTCAAC 
ATAAA C1 1 C T A ATTAACACGAA C I IOIOCIC I I ATTTCAGCCATT G CTGT 



ATTACAAACAT 



TOCATCaGCTGCAGAAAA II m c i a< 

on ha t 

> Sequence 710

ACGCCGCCTAATCCCAGTTATGaGGGCTCTGCCCATG^ 

ccagaggccttaccatctaataccaatacattgggtttagaatttcagca 

TGAGAATTTGGGCGAGACAGTCaCvACTGTAGCGATGATTCTGGAG^ 
ATCATTTAAGAGACACTTTAAAAATGATCAGAAAGGAGAGGATTiAAGGCTA 

gaactaagacittagccttgaacatggaaaggaagtgatgactogcagata

1C1ULAGTACC 
> Sequence 711



ACTT 



1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 1 H 1 1 M GATAGOCATATACCAAATAA 
A IO I ICIG ITJA CTAGG G G^ATGCC A CAAT GGCTA TTGAOA C ACTAA AAA 
CT CTGCTT CAGGCI ICC AICCI CI I AATTTTAGAATATCTCTGA 1 1 1UCT 

A Al lT TCIOA TTOAC AICI II lU CTAQATCA ICOIOl I 1 1 JA CTTTATGT

TATTGACTOATCCTTTAGAATGA 1 1 1 ICI II I tXH fCTO OOAAAAAAAAT
GCATTCTAAATCAGATTCACTAATA C I 1 tO ATTCACTTCCAAGGATT 
> Sequence 712

GGT ACTTACAAA-AA ttttt aaca TT AGGaGGTAA TTATAAGTAGATTCTG 
TOATTAGGACTTCATTCATGTA IC I 1 1 IOCI aCATAAA CC I I IO I IA GAT 
TAAATGGLAAGACAOCTXXTTAGGrrGATACTTTTTATAAAACATATGAGTAA 
GTCATATA ICI I IO I I A AA 1 1 1CIOIAIOI ICI 1 1 1 1 IOI ATAAAGATGG 

agagaaaogatggagtgatact aaggacoct aataacatctctgtt 
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TTAATTACTAA GTG ATA G AACTA7TCA TA TGCCAT7AAAG ATTTGCCAAT 

TCTATTTG 

> Sequence 713

ACTGACACAAGGACTCCAGGOCACACATA 1 1 1 11 1 1 G AAAGCCCTTTICC 
AAU^I^^l,^, .I.?!^ 

catgctgaccttgaaagacagaaattttcagatttcaaactcaaoggaoc 

o cagg taattctttgcctcaagaoctgggttgcttcattcatattttctt 

atttccccagoctataagagcatatttgttnuttgtaaggtgcctggc 

> Sequence 714

^CTTAGOGGCOGOCCGGOCAGGTACATATGCACTATn"AGAATATGACA 
TT AA TCA A CCACTA G A A TT AAAA TCA GG TTA TA A A TC CTCA A A ATCACCA 
CAGTATAAATTTAAATGAAAAACCCAGACCACAGAACAAAAACAGAAATA 
CCAAAAA ATAA TCAC AA AATATT AAA A ACAGTATATA A ACACAGTGACAG 
AATTAGG ACTA A AC ATATCTGTA A A ACAATA A A TGTAA GGGTA ATCTC AC 
CA ATTATG A A AA AGACCTTCAGATCATATTTTAAAACA AATTTAAAAACT 
CAAC 

> Sequence 715 

GGTACGTGTXXrrGGATATGCAGGCTTXjTTACA TAG AATTGGTGT AAAA TT 

TG A AAACCA TG AA AAAA TA A A A CAAT AAA GG A TCT A G A TGCT AAT AATGT 

GGTTAGTTAACATGTTGACCATTTCAAAGCAAAATAAGTCTTTOATCTTT 

TATACTATTCATAGCAAGA 

> Sequence 716

A ^^GT^TGTGATCTTXXiCTCATTGCAACCTCCACCTtXTGGATrcAAGC 
0 ATTCTCCTGCXTTC AGCCTC CCAAGTAGCTTjOG ACTAC AGGCACCTGCCA 
CCATGCCCCGTGAAT 1"! 1 1 1 Gl ATTTTTAGTAGAGACAGGGTTTCACCGTG 
TTGGCCAGGCTGGTCTTGAACTCCTCACCTCAAGTC 

GC CTCCT A A AG TGTTGGG A TTA TGGGCGTG AGCCA CCA TGCCCA CCTCCT 
GGGTCATTCTTCTGGATATTACCAGGCATTTTTATGCTGATCTAAGTGAA 
AACCTGCLATATTT7TTTTCTCCAAAGTT ATTTCTTAGTTCTACCTATGAC 
ATGAGGGTGATCTTTATAAl lilt ITTTGTI I ' I CACTG AAG AAATAAAAC 
ATTGCTTAAGGGAGAGTTGGGGGAGTGCATAAGGATxrrGCAGTTGGGACT 
GCAT TTTTCGG GI I IOH I I A CCT ACAGCCTGG 1 1C IO 1 CCACCTTTCTG 
AGGAI nTGTl 1 CCCCCTTTGTTGGTCXCX7ATGAGCATTTCIT ATGGGAA 
TATTTGTG AAAG AAAAAAACAC C 1 1 1 I I I J AAACACCOCAGTTCATGTTA 
TTAACAAGCAGAATTCACCTTAACCGCTGTACCTTXKjTXXKXIAACACACT 
TAGGGC 



> Sequence 717

GG T A CT AA TCTAAA TGCTA G ACAGTTCAA GTG TA GCTTTGGA G ACTTACA 
OATAGCCAGCTAGAGAACTACCAATGATGATATXCATCACGAGGAGTTTO 
OTXXXXAGCCTCCAAGATCGTCCTCAATGAT^^ 

CACXX7rGTtn"AGTCCCCTCTCTCAGGGGATTAGGGTIGGTCTGTATGATC 
ACCACATGGCTGCAGTAATGGTATGTX^CTTCTCAACTT AGGTTATAAAA 
GACTATGACTCTCATCTTGGGTGTOCACTCTCTG ICICIUT U ATCTTACA 
CTCTAGTGKiAAGCTOOCATATTGTGAACCTCATGGAAGGCCCACAGGG^ 
AAAAACIGAAGCA7CTAATCAACAGTTAGCAA0AAACTCAC<)CCTGCCAA 
CAACCATGTOAGTGACCCCGGAAAGAATTTITCAGTCCCAGTCAAACACT 
GAQATAACGGCAA OCTCAGCTGACA GCTTA (XTTGCAACCTGATAAAGACA 
CCCTTGGCCCGAACCATAGGAACCATTTCTACCC^ 

GGACCrTGTrAGATAATAAATATTTGTTTAAGCATGGTTAATTTGTGGCA 

ATCT GerATATAACCAATAAATAATACATGGCGGATAGAAAl 1 111! I I C 

CTTTOGACCAACCGCAAAGTAACC C 1 1 HI 1 1U.1 1 1 A CAGCCAATTTCC 

TTTGGCTAAATACTGTACAAAAGAAGTrCCCGAAATATGAAGGATGGGGG 

CAGGTTTTOC 

> Sequence 718

COCTTAGCXJTGXrrCGGTTTTXXOT^ 
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GTtnCTGCATGAGCGACCCACGAACCAGCTGGATCT G CTCATCCGGGCCG 
GGAAGCATCAGTTCACAGCAGTAATGCACACTGTGCCAGGAGAATCGCTT 
GAACACGACAGGCGGAGGTTGCAGTCTGACGaGATTGCACCATIXXIACTC 

cagtctgggcgacaagagggaaactccatctgaaaaaaaggagaaattct 

TTTA I 1 1 ICI A C1 ICIU IC AOAI I IXjTCTTATOCAI 1 1 ICCAACTATGT 
A TGCA TCACAGCT A I1 U fTTTCI O AGTTATAGCTACAGTTTTCCTACTG 
TTGTCTTCATGCCATTTCATTTCACATGGT 

> Sequence 719 

ACTTNNNTTTTA 1 1 ( 1 1 1 1 1 1 1 1 1 1 1 NGGAGACAGGGTCTCGCTCTATCA 

CCTAGACTGGAGTGCCTGGTGCAATCTCGGCTCACT^ 

CCA GGCTCAA G TCTC AA TCCTOCCGCCTG A GTA GCTGG AACCACACGTGC 

GCACCACTAAACCCAGCTGTTTAATACACCAITTITAACCCAAAACATTA 

AG A AA AAT A T AGG AAC A GTA AGTAG ATTACATTTTGTAAACAG ACAAGCT 

TACAAGTTTTCTCAAATATG AAAGTCATACTAAACTGGGAGACTGTT AAC 

TTCTTG A TGGGG TT A A TCTCTA A TA TG A A GCC AC AGTC A TA GCT A A CT A C 

AAATTACATATACAATGCCAAAAATATTCAAAAATAACA 1 I I rnG CACC 

TTA ATG ATTACA A A TGCTA ACCAGCATAAAGACACTGG A AAG TTTC A G A A 

TCTCCICATCACATACTTTCAAATATCTTCCCTrTACTrTCAATGA 

GT ACGCGGG A TTCT A TGGTAATGA TGA CTTGCCA A TGTTCCAGCi 1GO I 1 1 

CTTAGCTA AAACTAGAG AATGCCCCTAACTTAG ATOOTTTTTTG AAGGCT 

ATTACAATATGGTATTTGGTTTGAACCCCC^ 

CI I'lGllTl ACACCCCCTTITCCGGGGGCCCCCCCCCCGGGAAAAAAAOC 
CCCACAACCGCCCGCC 

> Sequence 720 

GGTACTTGAAGAACATGGTAAAAATATGTTCACAATAATATTTTATCTTA 

gaaatgtattcagtaaaaaa rcTcm ANTTCAACTATCCTCTTGATTCA 

GGGGAAAAAAGGATTAGCATCGGAGATAACAGAATAGGAAGTTTAGGAGA 
TAATGAGACTTCTGTTTTAGTAAAGTAA ATAAGCTTT a ATAGTTTTTTGG 
TC A TGT A TTCAGTTTA CCA GCCTTGA A G A T A TTTG TAGG A A ATTTTAAAA 
GTTTCTCTATTTCATCCCCCATGATAAA 

ATTGAACTT IC 1 1 CA CAGCA CACTG AAAAATATCTTCTATAGCATTAATC 

AGATCACAGAATCCATATITAAACAAAAATrTGACTAATTTAATTTTTAT 

TTATTTA'I TTTTTTTC1 G AGACCCAGTCTGGCTCTGTCGCCCANGCCTGA 

GTGCAATGG04GGATCTCAGCICATTGCAAOICTT4CGCCTCCT 

GCCA ITU 1 1 CCCXjCCTTGCtTTTCT AAAGTGCTTGG ATTGCAAGCCTTTTG 

CAACCTGCCTGGCCCCA GAAAAACTGG TTTTTG AA 7GTTOGO 1 1G1 1 IG G 

GGOl I III 1 1 1 1 1 CCCTAAAAGCTrAAAATTTCCCTTTGC I rTTTTTTCA 

AAAAAAAAAAAAAATT ACCC 1 1 1 1 I I I 1 1 1 ACCCCTCC Cri 1TTT ITT TA 

AAGGGGAAAAAATTrCCCCCAAAAAAAAATAAAGGGGTTTTA I ITGTTO T 

GGAAAG 

> Sequence 721 

ACCCTIGAGCGGCCGCOCGGGCATGTACXjCGGGGTTAACTATGTTTTC^ 
TAACAQAAAGTTCTGT 1 1 1 1 G I G ATCCTTTTAAAAATAAAGCTTCAOGGA 
AGGATtjAGAATAGTATTTTlCAACTITAAATrrCTCATTACCAGAAGAOC 
A TGTGGTAA TTCTCTGTA T ACAGTT AGAACAGCACGG AAA CTTG AA GGCC 
TAAAAA ATTAGCTG ACCTTGTTAAAAATGTTGGCXrrGAGCAGTATATTAT 
TACCT A1CI 1 1 1 1 1 1 A TTGTGTGT G T 0 T G TGT G T G 7 G T rT'lAAACTAATT 
GGCTGAAATATCTGCCTOTrTCCCTCTTTACA 1 1 1 1 111 1G1 1 ICITIC C 
TTATTTATCITIG 1 CCATCTTGAGATCTACTGTAAAGTGAATTTTTTAAT 
GAAAACAAGTCCAAGTTTTACTCTCAGTGGGTTTGGGACATCAGATGTAA 
TTGAGAGGCCAACAGGGTAAGTCTTCATGTCAG I G 1 1 rOTTGAOOAACGA 
GOCTATG ATGTCAG 1 1 1 1 1 CCCAAAGGQAACAAGGACAGAAGGGATTTGT 
TCATTTTTACATCTCGGTTCTGTAATACCACCTTTGACrrc 
CAGAA TTTG AAG TCT AAACCGAACGTAA GCACTTGGGGGTATCGAATTTT 
AATACCTACCACAGTTAGGACAA 1 1 1 1 1 1 1 1 LA AAGGGCCATTATTTTTT 
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TGGGGCAACCCTGGGGGGGGGGGGGGC Cn 11 1 IT GGAAAACTTTGGGGG 
ATATATTL 1 1 1 1 1 1 1 1 CCCCCCCCCTTTTATAAAAAAAAA 
> Sequence 722

CCCTTAGCGTTTlTaXXKXXXSAGGTACATGAAC^ 

ATGCT TCCCAGTTTCG CAGATGTGAGCAAACTATGTATAGGAATrCXAA^ 
GGTAACI 1 1 1 ICCI 1 1 LA TTACTTT A CA G AAATACTGTCAA GTCCAATA G 
AGAOCACAGACTTGGGAOGCGGATTCGGTGCKjTTTGAATCTCTGCTtn^ 
C A CTTTT A TTA A TCATGTG A GTTG AGT A TGTG ACTT A A TCTCTTTTAGCT 
CAATTTCOCCATCrGTAAAATAGGAATAATAAAAATACTCAClTCAGAGA 
GGTTTGTGAGCLATCAATTAGACAGTCATGTTAAGTCTGTAAATTGTTTCT 
GT AA TGGGCA A G A T A GCAAA TA TTTT AG A TTTTGTGG ACCATGCA GTCTT 
TATCATAACTGCTT AACTGCC A TTA TAGTG AG AAAGCA GCCA CAGACAAT 
ATGTAAA7GAAAAAGTGTGTCTCTGTTCCAATAAAACTTTATTTTCAAAA 
ACCAGCTGGCTTGTCACATCTGGCCTATGGGCCATAGTTTGCCCATCT^ 

TTAAAAAATGTTAACATCTTTACTGTTATTAATATTACrACTGOKTCTAT 
TACA GTAGC AA TTG AGTAATGAATACA TG AATGTTATA ATGGTAAA TTAC 
TAACCl I [ I AAAAATATTAAGCATTGGCATATTTTAATACTTTAAATCTT 
TTAOGAAQATAGTTACCCTGCAT 
> Sequence 723

GGTACTTA C I 11 G l I UC'I CITTT1CTAAGTTTTAAAGATCOATGCCAATC 
TCA GG CTTCTTTTCGTGTGTG TA TGTGCGT ATGTCCA T AA A TTCTCTTCT 
AATT A CA GTGTA AGCCA CATCCCACA A GTTTTG ATA GTCACA G A A CTGT A 
TCGTCACACT ATTTTTTAA TTTCA GTAA G TTCTTCACTG A TCCCTG TGT A 
AT ^AGAAATGTTTCATAATTTCCCTACATTGGAGGGGAAGATAGTTTTG 
I II 1 1 ATT ATTAATTTCTAC<rroTATrG AGCTtrTTGTCAG AG AATATGGT 
TTAT rrrAGTC GCTTOAAATTTAAGATTJ ro CTTAATGGCAAAATGGATGG 
TCA GGTTTTTG T AATGGTTGCCA GTAAGCTTGCG AAA CAT A TG T A CCTGC 
CCGGGCGGCCGATTGAAAGGGCTATTTCCCA 
> Sequence 724

CCCTTTCGAGCGGCCCrrTCrGGCAGGTAC^^ 

CTCX3 AATG ACTCGCGTTTCCTTXTITrCA TCACTACACCTCCCACCGCTCT 
CCATCACCTGC I C I GCTCTTATAAGGATCCAGAGAAATGGAATAATCTTA 
TTGCTGATCTATCTAAACAAGTTGAAGAATCGTCTCAAAGAAAATACAGT 
GTGTCTAAACTGG AAAA GTCCTOTAA TAG 1 ITG1 I LATG AGCATTTGCAC 
AGTGGAGTTACTGTTCATCATGGGGGTACC 
> Sequence 725

GGTACTAACTATTCCTAAATATTAAACACTGGTCAACTAAAATGCACAAA 

TT CATG AATTGG A TTTGCACTCAAACAAAAAAAAATACCATA GGCAGTAT 

CATTT CTACCTTTGTAAGAGGCAGGAATATTCATTAGACTCTATGCITGA 

CTTTTCATAT OTATTTTAACACTGTAGTAGGCTATCGGGTXn'AGTTTAAG 

CTTCAI 1 1 C 1 AAACTACTCAACAGCTCAGAACTCACAAAGATCACAAGAA 

ATCAACTATTAACCTCTIGCCTCAAGACACAAATGAAATAT^ 

TACAAAGCAAATTAGATTOCAAGATITTCCAAAGOCATACTCCTGCAGTT 

CACTTGCinTt^AACTTAAAATCATAATAGTAATATACACATATTTACAT 

TATAACCCATTACACATTATTITCAACTCAATGCAAGTCAAACAAAG<nT 

TCACA AAATAA CCTTACTATCTGCAATACACTGGTATTTTCTATTCrACT 

CAGAATTTTTTAAATACCTATCATGAACCATTAAATTGTCTTACCACTAA 

TGCLAGTGACXATACCCAGATTGAAAAACItX^TTAAGAA 

ACCCATAATGGTTATTTGGCATTACTTAGGCAAAAATA 1 1 IL1 UAJ HVf- 

ATAAATTC1 1 ACCTTTTTAAQCAAAACC 1 1 1 1 1 IA AACCAATTAAAATTT 

TAATGAAGGOCCATTTCMCCGGTHAATATTTATTAGGGGTAAAAAAACCC 

AAAATTGOCCTAAAAAAOCTTCAACACATTTXIATAATGX3AAGAATGTGGC 

GAAAT AAA TCTAAA 

A CTCA CTTAAATA AA TAATTGGT AA GA TG A TTTT A TCTG ACAA TTAAAAA 


AAGGTATATGTCAAAAACCTT AAAAAAAATCTA TTTCA TTACATGTTGAA 
A TG TTCTG TGCTT AATCCAATA CA TCATTTAAATTCTTTTCACA TTIGGA 
CAACAGAAAAACTGAAATCTATGGATTCCAAGCTGCAAAGTATTTTATCT 
AAAThKXIAAATCAAAAAACATCTATAACATCTTGTTGGGGATACAAAGTT 
CTCCTCGCTCATTCTCATGCTACAGAAAGCCCCAC I1 I CIGU C I O I AAA 



A C A TTCT A TTGTT A TCTCTA TTTTTTGG A TGA AAA A AC A GCA CCA CA AAG 

aagttcagtaactggcctaaggccaciac^gcttgtcitcctgaagactgg 
acxxtaaacxcaggcagtcatagaacattjctggtcgctattgggccgcttg 
ctctatgggggacggtgctccaggaacacagcaatgcggtrragoattcc 
aggacctggggcagctgctgc i i c men agttctcgacagaccactga 

GTGCAGTTTTTtrrAAATCTTTTCCCCACTTrGATATGTGGTCCATAAAAC 
TGCTTCC A C A CGT A T AA C CCA CTGTG A A G TTT A A AA TGA TTTCA TG TTTG 
GGC A A ATTCCTACTG AA TGTTA AGCTAG A TAGG A A AC AAGTTCTG ACTA A 
C A CA A ATGA AGGTCTG AATG AAG AAGTCTTACTTTTATAAAGG A ATTTTC 
CCCTCCTCACCAAATCCAAGTTTAAATGTrGATATCTCTGTTCCAAAAGG 
ATGATAAATAAATXKJGTXXXTTGGTTAGTAGTGGGTGTATGGGTGTGGGT 
AATAAGGTATTCAATGTACATTTAATACTCCTTCTCATTCTATACTGOAT 
CTATCTTGGAAATGATGCATTTTTCATCTTTAACATCACTTTJCTAA 
ATATTTTTGTCTCTTA A CTATTATA ATATTCTTGCGTTATATAAA TTATA 



ACTATTCCGGTtrriCTGCTCTTACAACATGTAATTA IT T CTCI A CTGCTC 
GCTACACGACTG AACTTAACCA ATATATCACTCTCTAGAAACTTCCAGCT 
TATCA 
> Sequence 727

GGTALI I INI 1 1 1 M 1 1 1 I II 1 1 II 1 1 1 I I OGTAGAG ACGGGACCTCACT 
GTGTTGCCXAC^CTGGTCACAAACTTTTGGGCTCAAGCAATACTOCTGCC 

ttggcctcccaaactgctgggattacagggataagccactgtatagagta 

TGAAAAGTATTTAAAAGAATCTTCCAAAGGAGGACAGCAGAAATGAAAAT 
AAAGT AA GTTCAAACTAGAATCCTITG ACACA ACTGGTTTTATTCCCAA TG 
CCTCTT AA AAA G AA TCXiTTCCATGGGTGGCA GG A GGGGTG TTTTCATGG T 
GTGATGCACCGTGACTTGTTATTCAAGATGTAGTXXAGTCTTXX1ATCTAT 
CACGTTTTATAGCTTTCGAAAAAAAAAAAAAACCGAAACCACAACGACAA 
CAAAAA TTATT CCAATTAA TGGCATTCACAGCAACCTCGATGGG ACTGGA 
G ACTA TATTCTAAGGG AAGTAACTTAAGAATGG AAAA CCAAACATA TCTT 
TAGTTXXjAGCACTTATGAAGATTCCAAAGGCCTAAAAATTG 



AAAGATAAAAAACATCCTAACTTGGGTTCACGGTACCCTGCCCGGGCCGC 
CCGTTCTAAAGGGGAAATTTCAGACAACTTTC<KXX^ 
ACCCCAC<rTTGCTACCACAGCTGTGG TGTAA TCATGAGCATAACCTGTTT 
CTCTGGTAGAAATATTAATCCGTCTACAATTCTCCACAA^ 
CGAAGCTTAAAAGGTAACCCCTGGGGGCOCTAAGAGAGAGCCAATCOG 

CCCTTTCG TTCTGCCGCCCGGGCAGGT A CTTA TCA GGATOAAATCAGAAT 
CACAGTTGGOCTTTTOCCATAAGGGAAGGGTATTTW 

TATOCCtOCnm IUIX)CTGGCTCAGTAACTGCAGGCATTAG G TT C C IC ' I 
ACACIACGATATATTACAGGGAAATGGCAGCGA 

CTGGCC1 ICI 1 1 UC J IX I GAGCACT AAAA TCCT AAA CATGCAACTTAAAA 
AAAAA TTCT AAA TGTGAA CACCA CCTTTCAA TAA TTT A T ATTAA TGT ATC 
ATCCCACC LI I III IHX ' 1 ICTCTTTCAAOGCCC 1 IVl I llT ACCCAAACT 
CCAATATACCAA TTTG TTTGAACAGTTTTACA TTCT AAGTGTCCAACTAT 
TTKrTAAAGGAATGGATAAATTGTTGTACCTCGGCCGCGACCACC^ 
G 
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ACTCACTTAAATAAATAATTSCTAACATCATTT TATCT CACAATTAAAAA 

aagctatatctcaaaaaccttaaaaaaaatctat ttcat tacatcttgaa 

A I GI IC TO TOCl I A ATOCAATACATCATTTAAAI ICI 1 1 ICAC ATTTG OA 
CAACAGAAAAACTOAAATCTATGGATTOCAAOCTGCAAACTATTTTATCT 

aaattocaaAtcaaaaaacatctataacatcttottogooatacaaactt 
ctcctggctoattctcatgctacaqaaaocoogao 1 1 rctcttctgtaaa 
ttgggacaaotcccogcgtaoc 

> Sequence 731 

A CT IT lOU AAGAATACATCr iOG ri CA ATCTtXrrCGTA TTCrrAA TnT 
TlO'ATAATATTOCTTtn'AATCTTTAGAGTTATGGTITCATTTTTTQACr 

attaaatttcaaattgttgacatcagcagttgac rcncrci GTAC ATCA 
taattttttaattaaoaag acactctcaag tgttc aactata attctaga 
gtaaattctaagtggaggatatcgtaaai ici 1 1 1 tigtc 1tggtat7ca 
catctaaatgttaacatatgtgaataattc agto cacgattg tcac aggt 

TCTATGTCTTTAC C I CC1 I ICAAAATAC I I fC I 1 1 AACAA ATACTTTG AC 
AAATTTATTAACATTTATAAGACAAGACTrACCAAGTTCTGTTOGTTTAT 
G ATTCTTTAAA TGTTTTCCAA TACTT A GAT A CA TCAAAATTAT AGGA CTT 

ctcaattccatcctattottacagaataataaattaatcagaataggaag 

ACCCTTAAAAGA I CI I I CI C ATGAGTTCAGATTTCCCAGATAATAATTAC 

AGAAATTTCATTTGTACCTTGGCCGCGACXACGCTA 

> Sequence 732

CGTA C1 1 lllll I ICI I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 G AG ATGG AGTCTCGCTOT 
GTTGCCCAGGCC<jGAGTGCAGTGGCACAATCTCG<nCACTGCAAACTCGG 
CCTCCTGGGTTCATGCCATTCTTJCCTtAGCCTCCC^ 
CAGGTGCCCGCCACCAAGCCCAGCTAA 1 1 1 1 1 1 ICI 1 1 1 1 1 I IGTATTTT 
TAGTAGATACGGGGTTTCACCATGTTAGCCAGGATGO ICI IGATCTOCTG 
ACCICO IG A I CI OC CTGCCTCGGCCTCCCAAAGTGCTGGC^ 
TGAGCCACCACAOCCAGCCTA U CCI 1 1 A C 1 1 IC I I AAACTTTCTT t C AC 
TTTACTCT ATGG ACTCACCCTGAA I ICI I ICX-IGCICAAGATOCAAGAAC 
O CTTTTl r GAOGTCTOGATOOOGAOCO CTTTOC I O T AACACCACTGTATC 
OC C 1 1 O GCAGACATATGAATCTGCACOCCCGCTTGGTCTCCAATATCCAG 
GG A TGAACAAGGGA GG AAA CCAGGGAAAA TGCTT A CTG A GGCA TCTTTTA 
TGAGCAGTCAOCATGCTAGGC I C 1 1 1 ACT AACA I roCTTTI IG CACTGTT 
CACAACAAGTCCTGGATATCTTCAATTAGAAATGTGAAAACTGAATrCXX} 
GATGAAAAGCOCCCACTOCI 1 1 1 O ACTGOOGTGGCTTATATOGGG C 1 1 1 1 
GAOCAAOA TGG ACTGAATGCCA I C I rCTGfl CAGaGGGACTTAGACATTTG 
AGGGAAGTT 
> Sequence 733

GGT A CAAAACT ATGTGAG AAOGTATA CT ACTTCTCGGCCACAA CTACTAT 
TTTTAGATATTCATAAAATAACCTCTGA rTOTGTTTI C ACATTGAOCCAT 
TCAG 1 1 C I G I Ct-AATCTrATAATTCTGATTAAATGTTCTCGGCXTCAAAA 
CTAATTTTTAAAAGGCCACTAACTCCAAATCTAGiGAACAAAACACTCTGT 
AAG ACTACTGTAACTTGT ATAAAATTAACTTCAAAAATTCACTCACTCCA 
ATAAAACTATGATT^AT^GTAGCTCATAAGAGGGTGAAITTTCAATATTTA 
C TCTATG AAAA AGOCTA AGCAATTCAATAAAAACTTXiATAACrGCACGTr 
TAG 1 1 PQCAGGA1C1 ICI 

TOrTTOBTCTOGAOOOOCITTCACOTACl I rCTCTGAATTTCATTAOCTA 



TGCTACATTAAAAAAAATCCATTTGAATACACAGTGAACTAAAACACCAG 
AGTGGCTAAAAA O I HX I 1 C ATCCATATTTACTTAGCAGAGAGCTCTTGA 
GAAAGACCCAACCAATAAACOOCAAOCAAAOCAAATOCAOCTAC1 PCTC1 
AC<TGAGAGOGTGGAATOACTCCAAAATA I FBI I PCAAGCTCAAAAAGCC 
TAAAACAACTCCACATAAAAHAACXAAAATCTATCTAATTGGACATTTAC 
CTTTrrGGAAATAAAAGG^CCCAGTGGGAAAAAAAAAAAAAAAAAAAAAAA 
GTAOC 
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A Cmil l llll l lllllllll l lGI CACAOACACAGGCTGGGAA IIICC 
CAAATCTTACAAGI ICTOGTCOO CTl ICCCI IA ACAA CICI I IC GGAGTA 
T CTCC GIOCI I 1 CACACTTT ATTGTAAGCGAGGAGAGCAGCCAQGCTGCA 
CCTTTAACAT-rrCATTCACAC^ATCTCAGCT 

TCCTCTAAGCCTCAGCAGAAAGGCCTTCAACATOCAC' 1 1 ITOCACAACAT 
TCTGTCTATGATACCTGCA I rCTCTGAQATGCTAGAAOCl I 1CIC I CC AO 
C TCTCC CC1 1 1 CC I C I C » GAGCCmACCCCAGTTJCCXATTGATGTCCGT 
AIT I ITACCAACAAG C IC I 1 CA CCGCTATCGAOOCTT TCT CCAOCAGOTC 
CCroAAAACGTCTGCAGCATGTACGOGGGGAAG C rCTGTTI GO I GC I 1 1 U 
GAT/CCATTTCCATCGGGCCTTACAGCCCGTCGGTACACTCCAGCAGGCAA 
G A ATGGTGAAACACTAACGAG A G ACAG ATTGGTTTTTAAGAAACCCTTGG 
ACGCCITOCAGOGATAAAOCTGGAGTTAOTTQA C 1 1 1 ICA CCCCOGGGGG 
TGGGGCCTCGG A A AA AGAACAAGGOCTTTTTTCA TTTCCCTTCTTGAAA A 
G ATTTCCAACGGG A 1 1 1 1 C I C I GAATAAATGTGGATCACTGCCCGGATCT 
TGCTTCAAAGGGGAAAAA 
> Sequence 736. 

ACTTGTCTGCTTCAATAAAA 1 1 KjTCTTI OATITCACTGGTGO AAOOGTG 
CTTOATCCAG CI 1 1 1 PC I IC I C CATGAOOAGGA CI CI G 1 1 1 1 I CAGI I IC 
CGCTTTTAI riCCICTGAGGGGAAAAAAAAGAAGCATACATTATAAAACT 
GGACAGCAGAAAGACTGAGTAATrrCTTAAGTTCrATAAACTCATTTGGA 
ACTTCTACAAAAAGTTGGAAAG AATGCAAA TTTAATA AAAATT AG ATGCT 
AAAA TTGTTTCATCT AAA TTTTTT AATTCACA CA AA T AACATAAAA CT A T 
ATG AATAGGTAOC 
>Sequence 737

GGTACTTNN I 1 1 1 1 1 1 1 H I 1 1 1 1 1 1 1 1 1 IGU I I O AAAACC CI I I A TTC 
GGTTTCTCAGTAACAGTGATGCATTAAGAAA I ICI IGICI GC TAAACTTC 
ATAGCAAACCGATCGCAGTOCTCACCTCATTGTGTGGTAGCCCAGCAGCA 
GAG AAGATAGG AA 1 1 1 IC I CK XXXCTAGCAATACTGTTCATCCCATCGAT 
GGCCGAAATGCCAOTCTGAATCA 1 1 I CC ICI G GOTAO ATTOCACATTGAG 
GG TTG A TTGGCTGACCT A A TGT ATTTCCAAAAA GG AAAA TTTCA ACAAGT 
TGCCXK^TTATTCATGAATCAAATTAGATATCATATCAAAT^ 

AAAAAATCAGTTATTTTTCAAATATGAAACTrGAAATAA l ICI I ICCTI I 
AC I C 1 1 1 1 GGAGACTCACAAAACATTGGGTAATAQAATTCAAGTTCCCTA 
AGTGAAC^TAAAGATATAGCAAATATGAAAGAAAGCCTAATTICAAATTC 
A TGGTGTT AOCA T A T A CA TTTCA GAAAT ATTCCAGAT ATTTT ACACG ATC 
TT AAGA T ATTAA TA CCT AAAATTTTACG A T AA TTTCT AAGAAAATCTTAT 
TTAAGTATAAAATAATTTATTACCTATGGGACXTrGTGGCCTATTAACT^ 
AAGGGAATCACAAAAAACACTTTTTATTTGGCAAAGGACCTTGCC^ 
GGCCGTTAAAAAGGGCG 
>Sequence 738

GGTACT A TCTGCTC ItlAATTAAAATTTAGAACAAAAATCAGCTOOCGTGC 

cactacacatogacataatcaactgctaaattatga i rrom ici icca 

GTTA C TT H tXLAATT ATTTTT ACA T AT ACAAATA 1 1 1 I CTIG GTAGAAGA 
ACAAAAOTOOCACTATTCATTGTGTAG 1 1 1 1 1 1 im a acttatattttac 
OCTAAGCA nTTCTOGTTOTCl I AAATTATTAATTQAAAATTATTCATOG . 
CTAAATAATGCCTAOOCTOCCATOA O ICI 1 1 ICT0C1 1 CI ATAAAOOGTG 
TCAGCATTCTTTTATATATA I C 1 1 ILA GCACA lCTGCA AIX jA 1 1 ICI I 1G 
GAATAAATTTCTAAAGTTOGCTGGATCGAAAAGATTCACX«GATrrTTAGT 
GTTCTTTCAATTTGGCAAA GTA TTTTTCAGAAACAAGCOCA TTTCAGTTC 
TGAATAAACAAA I ICI 1 1 1 1 1 ATGTTGCATTTAAAATCTACC I CC I IGl A 
GCATATGCAGGGAAAATGAATTA 1 1 1 GG I CAACATOCTTTCAAAT ACTTQ 
A AOAAT GTCTA I 1 1 1 IC I 1 1 ATGACT A7TCTGTGTTCTGGACTATAOCAT 
TATTITCCCATGATTTACATTXiGAAGGTCGTGATTC^AGC^ 
AATTGCTTCI UUUAGGTTTTT AATAATAGATGAAGTGGTTAGCTTCTAAA
TAAAOGATATTG TAGOTGGAATGTATAATATGGCCTAAOCCCOACAACTT 
CCCTTOGTTTOT 
>Sequence 739

CCCTTAGCGGCCCCCOGOGCAGGTACACAOTl ICC I IC I rCGAAACAATC 
CACAAGTAGGCTAGCAATGGTCACCCCT ACATACrrCCGCACACATCTTT 
AAGAACAGGACACt^TTACCACACXXAAGAAAACX^G^TTTAATGAA^ 
TATTCAAGAGTATCATCCAACATACTtAAATATCCACAGCTGTTCCGAAA 
GTATC CTTCAAT TCIGGATCCATTOATGOTTt^ 

TT ACATCTI 1 1 1 AGTTGTTATOCTTCAGAGT AAAACTGGOCTOOOOCTCT 

I ICI I ICI 1 1 ACAATATTGA C I CC 1 1 1 G AOG AACCCGGGCTGGATGTGGA 

GCATTCTCCATTCATCTGATIGTTTCCATGTCACCAGATTCG 

A TTTCTGGCAAGAAOOCTTCACAG A TG ACCATGTATTGGTTATTAGGTAA 

CAATAGATTACTCAAGTAGAGAACTGGG A AATTGTC C 1 1 1 G I UC ATTACA 

ATAAATTTTTTTTGAAATCTAOAA TTPCTTATG ATTCATTGA I MC I I I I 

c m i ici 1 1 1 icc 1 1 n m 1 1 1 1 *i gaaaacagtttca c i roonc ooc 

CG GCTTX jAGTGCCATGGCACAATCTCGGTTAACTC^ AGCCTTT ACCCTCT 
C GGTT CAA AAGA I ICICCI IG l rCAACCTOCTGAATAGCTGGAATATAGO 
GCCTGGCACCT7GCCOGCTGA1 1 1 nn ATTTT ACT AAAATAGGGTTTAC 

caatgtgcccagctgggttgaacttttgac 

>Sequence 740

acatt glctgcattttoagaitncctattaic'l i rctggtgttcatttc 
tgtttaattatactgtgatctacaagcaacactgt attatttccattctt 

TTAAATTTGTTAAGGTGTG fill ATGCTCAGAATGTGGAGTGGACTATTT 
TGGTXjAGTGTTDCATATGGACTTAGAAGAAT G T O 'ri I rCTOCTGI IG l IA 
AATGAAGTAGTXrTATXrrATXrrC^TTA TTG TTTGATXM TTG ATGG TG TTG 
AAATCAGTTATGTCCTCACTGAI 1 1 ICI GCCTOCItSGATATGTOCATTTC 
CAATAAAGGTCTGnAATGTCTATCTATAATAGTGGATTTATCTA TTTCT 
CXCTGCAGTTCTATCAGG I I 1 1 GCC'l CATGTAGTTTG ATGTTCTGTTAAA 
TCCATACACATTAAC<jACTGTTAAGTATTCTTGGGGAATTGACCC 
>Sequence 741

ACTTCAGGTTAGAGATOACTrcAATATATCTCGCAGACCTCCCAAGGTGA 
GCATCACACA GCA CTTA TCATAA TCACGAAGCA GCTCCACAGAGGCTAAG 
ATCAAAACAAAAATCTCAGGAAATTITATGTrrATAAAAATGATACTTGCA 
AAAAAATGAATGGAACX^TCTOCATTGCTTATT^ 

CAATAAGATTTTAAATTAGTCAATAGTATTGGATGCCTCTATATCTGCAT 
^^ GAAT AGGCTCATAAACAAGGTroCTCAAAGAACTTXXXATCAAOCACT 
TGGTTTCATCTCTGGACACCACACTGTTATL 
JAA^G^CTCC|WG^TAC<jTGCACCAAAGGAAAAGAAT 

GTGTCACTG TCCCT AACCCAAAGCAAGCT AGOGTGCAGAAOCCAAGOGCA 

Gt2ACAATAATCCCAGCCATCCOGGAACATGGGTTAGCTTGAC^ 

A TTACAACGA TCOCAGCC 1 1 IGl 1 1 AAAGGTOOCAAAATTAGTTTCAAAO 

CAATG ICI AACCTTCCOCAOCTTTAACAGGAAAGAACATTTTGAATAATT 

^OCAAAAGAAGTOCATCGACCITAGAACTOXGCAAAAAAGCTTTATCCTC 

TAAACT 

>Sequence 742

G GTACAGGI 1 1UCCI roCCTXAACOCTCATOCTQGOTlQATOACACTOTT 
ACTTTCCTTCTTOTATAAAOAGOCOUiCTrT^ 

C TACTTTTAA CAAAAAGGAAAATCAGA G T GCfl I JOOPG OAAAATCAGAGT 
GCI 1 1 ICI roCATCTGCTATTTTICAA GTGTCTl I A ACTCAAAAAAATCA 
ATATCOCAAAGTGGCATGTrroGGGGTATC T GG n C TGAATTtXTTTCAGG 
AAAGATAGAAAGGAAAAGCAAAAT AATAGGTTTAAAACT AAAAA TA TCCA 
GGTGCOGTGGCIT^CGOCTAT AA TTXX1AGCACCTTGGGAGACTGAGGTGG 
GCAGATCATQAGCICAGGAGTTCCAGACCAGCXrT^ 

CCC I G I V I CT ACTAAAAA T ACAAAAATTAGOCAG ACATGGTGGCCGGCAC 



CTGT AA TGCCA GCTACTCAAAAAGGCTGAGGCA GG AA AA TGGGTTG A A CC 
CCAGAAGCAGAGGGTGCAATOAACCGAAAACATCG<^TTGACTTCAGCCT 
TGGCAACAGAACCCGA C1CIOI I ICA AAAAAAGGAAAAAAAAGGAAAAAA 
AAG 1 00CTQOO0GGOGGC0GT 
>Sequence 743

ATCAATCAGGCAGGGCTCAGCCCaCACCCGGCTC 
AGAGAAAGCAGAATGGTGTTTAGAGACCATCGCA 
COGACCTCATCtnGAAAG^arTGTAGGAAATTG<XX^ 
TGTGACAATCtCAGTCAGCCACaGTGACGGAGTGGAAGATCGGATCACCAC 
ACAGATCCAAGAGAGCGCTCAGCACACCTGAGAAACAAGAACGCAAGACA 

gcctcatggaggtggaaccgtgctaogcagttatggcttcactactgaat 

GCGA IC I I UCANAAGT 

>Sequence 744

GGTACGCOGGTGI l llllll I GGGTAA 1 1 I ICI I GAGTTAGAAATGTAGT 
TAG AA CTGTGACTAACGGCATTGCCTGG AATG TGCTACAAACACG ATTAG 
ATATTCATTTA IC I I C CIC GI ATTAGAC I GCI I GTATAC AGACTCAGTGT 
TTAGACATTC A VII CI C IT CCTTGTATAAGACTOCTTGTATAAGACTCGG 
TGTTCATTTA I C 1 1 1 1 1 AAATTAAACCACAACA AATA T ATGAGTTTTTAA 
CC ATTG CAATGTGGAATAAATAAATATATCTCAAGTAOCATTAGCCTTCT 
AGTTTTAAATAATAA 
>Sequence 745

GGTAC CI 1 1 1 1 1 1 1 III 1 1 1 II 1 1 1 1 1 CG I C AAAGTCACTATTTCGGOOC 
TAACATAATNCCTGCTCA GAGCGACGG AAAAAAGGCAAGCCTTTTCAAAC 
ATAACTCTCTC 1 ACAAGCCAGCT ATTATGGCAAGGGAAAAAAGAAAGCAT 
CTAG ATAAATATCT ATCA AAA TTAACTTT AAG AGAAATAC 1 C I C 1 1 1 CCT 
T AAAAGCCCTT A TTTTTT AAGA CACT A G AAAA T AA G TT ACT A TAAAAAGT 
GG TGGTCTGGGGGCT AAAAACAAAACAAAA AAA A TCC I C I n ICTACATT 
1 1 1 IAGT1 1 IC"IG 

>Sequence 746

GOT A C 1 1 I 1 1 1 1 I 1 1 1 11 1 1 1 1 1 1 1 1 A GTTAAAATGCTTTACCTCAATGG 
TTG AG ATA 1 1 1 IGAATGGAITITTCAAGGGGGGGAA^TGCTTATTATAAT 
AAT AAA OCA A A A TACTT AA CA G A AAA TTGTCAGCT ATTCTG A CAAAAA T A 
AACA 1 1 1 1 GAGAGACTTTATTTCI 1 T I G rOOGT I I C I G I GGTATCACTCA 
TTCTCGTT AAGTAAGT AAAGCTTTTTAT ATTTA GGTAA GAA CTG ATTTTA 
TTTTTT AAA TTATATTTTATATTTATTAGCACAGAAGAATAATGAGAGCC 
ACATTTTAGTTCAACTT 
>Sequence 747

A C ICI I I IG l 1 1 AGGTA r nC OCTOCTGCTGT G TC C AGGA I PGCTGTG T O 
GTGGTGATCAGTGCTGGGAGGTGAAAAATTAAAATAAGCCATTTAC^ 
CAGCATXXX^^TTAAATATTTGATGTAACTGTGATCTTTG^ 
ATAT A TTCATTTT CAAGCAGAGGAG 1 1 CCCXATTTTAAATAGA GGCA TTG 
TCTGAIGI GI 1 1 ATGCTTAACTGCATCTOGCTTGOGICrTTClGrrnOC 
1 1 I CI I IGC TGAATTAGAAGGGGTTACTCTGAAGAGTCCAGGTCTTACAG 
TGTGG l I l 
>Sequence 748

iiiii urn iicatt caaga aagataattttacacttai ici i igaaaga 
. AAAATTCTATGGAA 1 1 1 ICI1CI 1 C I aattaaattccaaaatacattctc 

TCAA COCT A TCJCCCTCA TACT ACT AACTTGA TGGTTAGCGGGTAA GT A GG 
TAGTAGTAAAAGAOCAAAAGCG GAAA 1 1 1 OO^eAGCAAAAAAGGGAGAAA 
AAGAAAAAAAGGGAOOCTTCTAGT 1 1 OCI AATAGAAAAGCTAGAG AATTC 
CA TTCCTG AAAA TTAAAGATATTT 
>Sequence 749

aocactcactacattacaaaatagtctctaacataaaat^tgocttaataa 
ctatactattatagaatctgataaaccttacattattaaattgattataa 

AAICI ICI IGGAAAAACITTOGTATCTATOTCAjCAACCTTTTTTAAAAA 
TAATATTTTA^GGGCCTGT^ACATTCCATTCrATTAi^GCACAGCACWk
TAACT AATGG ATATTCAACTGCA TACA GAATATAG A A TCAAAAAAA CAA T 
TTATTA TGTATTTGTAG AAAATCATTACCAG ACT AAGCAAAAAA 
> Sequence 750 

CGTACATTTCATTCTGGCATaTTCAACTATGATTITAGACAAGATGTGTG 
TGTGTGTGTGTGTGTGTGTAGACAAAATAAAATTCAGAAAGAGAAAATCT 
ATTCTACAATC AAATTCAATCTCTTACTTAGCTATTTrGA A ATTGTOTCC 
CA AT AOC ACATTAACAGAGCCA A A A TG AAA TTTAAAATTA TCGTTATACT 
ATTA TTCACACTAGGTAGGGTCAGG rTTTTl fCTCI U AATTAAATOGCTC 
CTTT A CGCT A OCT ACTT AGG AA CCA CTTCCCA TA CCCTCAA GCT AG A GT A 
ATA 

> Sequence 751

GGTACATTTG A TTGTGGCATATTC A ACTA TOATTTTAG A CAAG ATGTGTG 
TGTGTGTGTGTGTGTGTGTAGACAAAATAAAATTCAGAAAGAGAAAATCT 
ATTCTACAATG A A ATTC AATCTCTT ACTTA GCTA TTTTG A AATTGTGTCC 
CAATACCACATTAACAGAGCCAAAATGAAATTTAAAATTATGGTTATACT 
ATTATTC^CACTAGGTAGGCTCAGGTTTTTTTGTCTGAATTAAATGGCTC 
CTTTACGCTAGCT ACTT AGG AACCACTrCCCATACOCTCAAGCTAGAGTA 
ATAGATACCTGACCC 
>Sequence 752

GGTAL I II I 1 1 1 1 IN 1 1 1 1 1 1 I I I H I I GGG AGCCATGOCAATCTTTTT 

acact tgattttagccaaaaggccaagaagcaatgaaagccatgataatc 
1 1 1 I iatgcaatgttatcaggtaaaaaaatggctaaagtatattagcatt 

TACCCG A GTGGT A TTCTTTTA TA G A A CTCA GCT ACT A AAA CC A GGG A GA G 
T A CTTGG TGT A TTTCTG A A A CA CTCTGCG A A GTTG TOG ATA GCTTCTGGT 

ggtaaggatggtattgaacacgtttao 
ctgcttcatacaagg 

>Sequence 753



GGTAL J I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | I GGTATTATATAAAATAATAA 

TGCAT CTTACA GGGGAAGTCATAAATCCAATGAAATAAAGTATTTACCTG 

ACATATTTTTCCCATCl ICI 1ATTTCAACCATTTGACTGGTTGTCCAGCC 

CCAAATTGTTGGACTTTTTTAAACAATTCACACrOACT 

CTTTAAATAGTTGAGTTCCATOCXrTTTAAAATCATTTAAAAACATGATTT 

TT AAAT TrATCTXXATTACXrrrATTTTGTGTTTA C » 1 1 MI ACTTTTATT 

TATTTCCTC 

> Sequence 754 

GGTAL 1 1 1 1 1 1 1 I I rn 1 1 I I I I GGTGGGGAGCTGTATTTATTTCCCAGG 

GCTCTX^AAACAAATATCCATAAATTGGGTXKIATTAGAACAACAAAAA^ 

TATTCTCTCTAOAOAAGAACGJ I I ICI IGCCATTCCCTGGCTGCTGGTCA 

TTGCTGGCAGTCCI 1UICCI I CCCIG ACT AG TAGCTACATCATTCTCATT 

TClGCc i ^ i <j i<_ i ivATATGGCT ii 1 CATV 1CACTGTGTGC T TGTC I C 1 K» 

GTXTTCAAGTGG<XTTTTTATAAGGACACTGCrrCATTtM 

ACCCCAATCCN 

>Sequence 755

OQTACATOTTOO AAOGO 1 TTI1 AAAATG TTTTG AAACTGTGCACA GGCCA 
AACCCAACTTTCAGGACATGGGTITrcAACTTCTCGATGG^ 
TGATAGTAGGGTATAAAAGTATOCrrGAGAAGTTGAAAGCAGTGTG TGAAT 
GGG G1UI ICI 1 1 ICIUCCCACAA I (JC 1 1 IUUCATCTGCTGACAGTAGACT . 
TAGCAOCTCACAGATGCTTGGGCCTGGAAAX6AACKXATGAAAATGAAGC 
CCTCAGCC I ICI I OGAGATCAGAGCCATGGTCCTCACCCACAGCACATGG 
GTT 

>Sequence 756

GGT ACACAAAA TATTAAAT AGG ATATTTATTTCTAA GCCAAA TTTCAG AA 
AACAATTTACAAAL 1 J 1 1 1 1 1 AAAGTA TAAA CATAGTGT A TGCTTA CTAT 
AAAAGGAAAAGTATAAAACATTACTCAAGT ATATATAGAAAATGAGTOOO 
CTGCTGATCOCCCTCTATATTATCTATTOCTOTGTGACAGT ATTACCACA 
AA TACAGT AGCTG AA ACA A C A CA 1 ' I ' IBIITIC I CA CA GTTTCTGTCT^TG
AGGAGTTCAAGCATAGCTTGCTCCTCTGCAAGCTTACAATCCAAGGGTTG 
G 

> Sequence 757 

GGTAL 1 ICI I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 AAATGAGT AGG AAG AG ATGGTA 
TCACAAACACAAAGCACAGGTTA L ' I G l C I 1 IAAAAATTTGC G T ICI I L I A 
TnrrCCAATGGAAGTGGGAACAAAGAGAAAACCXXrTGTGTXrrOCTAGCAC 
A ATATGGGCATTTCTGTGG ATTTAA TAA ATGGGCA TTTGG ATTGTTGGG A 
AAATGTGATCAATCAGCAGGCTATAGAAACACAGTTTGATACGATGGTGA 
AAACTTGTCTACAATGATOTTTTTTt^GAAATtnTGXSTOTOATTAGAACA 
ACTCAGCAATGATGA7GACAAAATATTTACATAATGTTATAGATGTGGCT 
TGCTAATCGAAATACCTATCTGAGGCTGTTTAGGAATACACAAA 

> Sequence 758

GG TAC t r on TI AA A ACAATGTTGG A A ATP AGG A A A A TG agcaat atcaa 

cattttatcctgagggacagggagtagaaaacaagccagaggctgctagt 
tacatagttx^gtcttagggatgaagggatttatgtctctcctccctcag 
gtacgcgggg actacactggtg tctg acttttttcctag ag atttctccc 

TG AA AAA T A CA A GGGCTG TTGGTG AGAGCAGACTTG AGGTG A TAATAGTT 

GGCCTCTGGTCTACAAAGATTTCATAAC1CCTTGGAAAGCTTCT 

> Sequence 759 

ACTCCGATTGCCTCTCCCATGCTTCTCTGCTTTCCAAAGAAAAAACTGAC 

CTTGTATAGATCCTGTCAGCTGATTGCAGTGCTCTT AACTTCTCCATTGT 

G A GTTGTTCA GTCTG A GG AGTT A GG T A T AAACCC A GAG TGGT A TTCTCTT 

TTCIGTTGTGTTTGGTTrrGCTTACATATTCAGGAGCrGCTCrr^ 

CAG AACATCCGTATATATG I I 1 1 1 1 HIGH I C I A G ATTTA AA AATATTC 

CA G A AGCCTGG CCTCA AG A T A G A T AAT A TTTT ACTTTT A 

>Sequence 760

GGTACTTI I Mill ITTT mTTl II 1 1 1 1 AAAAAAATATCCTTAATTAG 
GT AA AAATTCTCCTTT AAATTA ACTG AA AGTTTTTA TG AAAAA AGG ATGT 
TGAA TGG A TTTG AATGCTCTTTTTGCATCGGTGGAT ATA TTTTTTT AA AT 
TTTTC AA GCGGGT A A TTGGGTTA TTT AATGGGGGGTTTTTTTT A AA GTTT 
AAGGGA 
>Sequence 761

GGT A CAG A T ATAAAAA G GCTA CT ATTCCAA G AACAAAATCCTGGAAA CAA 

ATCTCTATCAAGAAAGCAAAGATAATCTAAACAGCAGCATATTCATAGGA 

TGACAAACTATTCAACCATTATAAAGAAAACCGAATCAAAAGCACIX3GCT 

TATTAGACAAGAGTTTCCCAAACTATCATGCTAAAACAGTAACAGCGAGC 

TTCCAAATTAATGTTGC C 1 I 1 I 1 1 1 1 1 1 1 I 1 1 1 CCAAACTGAAAGOAGGG 

TGGGGAAAACAAACGCATCATATGTAAAGCACTGAGTCCAGCCTG 

> Sequence 762 

GCGCCCTTCtjGCCGCCCGGGCAGGTACGCGGGTATGGTT^ 
Mil IAAGGAAAAAAATTATCATGGTTCTAATCTTACATGTTAACATTTC 
CI 1UJ I A TGTAGGG ATCAG ACTTGTTATAACATAATTCCACTTTATAATT 
CAATGAAGAAGAAAGTTTTGTCTGATTCTGAGGTATGTAATATTTCATTA 
TTATT ACCA TATTG ATATTCTCTATA TAAAAAAATTTACATATTGTAGTT 
TTCAGGTAAAAGCTG TTGT G AACATTAT* 
TAAA AAAAA A AAA CAA CTG A 
>Sequence 763

GGTAC GOCTAAGGG^OAGCTGXSOAACTCATCAAAOAGACAAAAAdATCXrr 

TTTTTGCTCTG AAGGCA TCGCTrGTGGTG^SCnTjGTGCGCAAGTAAATATA 

OTTTOGXjGCiCCCCCGATrOCCATOG ACTT1TJGC' 1 1 1 ICTGGI G GGAACAA 

ATGGCCATCAG<nXX5ACCCAACCTTGCACACATCCCAAAGACCTG<^ 

CATCTTGGTATCAAGGGAGGTTAAAAATAAAAGTGKTITGA^ 

GGA 1 G 1 G 11 T AGGCCAACCTTGGTTACAAGACOCCTGGAATA 1 IG1 UI 1 1 

TAAA AGGGGOOTAGXjTTGOGAATCCAAAAACCCTTOGGGGACAAAATAAG 

TTTCATTCCGTAACTTGTTG AG AAATTTCAAATTTTA I IGOl ICCCCCAA 
OTA TTOAATT AAAAA AAAAA CCCAAAAATTTGGGGGAAGAAAAAAAA GTT
TTCCTTGGGGGGTTGGATTGTTTGGGGCAAT 
CC 



CtXrTTACCC<KXXXXXXXXK*GCTAC^^ 

TCAAAGCAGTGGA OTA GGCAAAAAAAA GAACCTCTT CA TT AAGG ATT AAA 
ATGTATAGC^X^GCACGTGTAACTTCGACTTTAAAAAATTCTtiAATCCCA 
TATTGTACWTATGGTTTCAATTGGTCGTCGGCAGGGGGTAAGTGAATCCT 
TGCG CAC<nCAACCATAGOCTTCTAA CI HO I 1 1 I AACTTTTTT AAGCCT 
TTTTGATCtlAAAAAAATCTTTAACnTTTTATAAGGG AGCCAAG 11 1 1 TTC 
AAACTTCCCTTAAAA TGG TTTG AAA TTATT AATTAGGTCCCAGGTTAAAA 
ATITCCACCCAAGGCCTTCCACCAGOGG AAATACCC AGGG A ACCTTTTTG 
AAAGTGGGAAAAAAAAATTTGGAAA 1 1 U I C IT GGAATTAACCTTAAAAA 
ACCAATTTCAAAAAGGAAATTTCAAAAATT 
> Sequence 765 

GCTA CAG AAGCA ATGT 1 1 II I GAAAGTTTTCTATCTGAGG ATTGTTG AAT 
CCACAGATGCAGAACTCATGG^AACAGTGCCCACTGTATGTCACAATTTC 
AGAAAATCAGTATTTCATACAATCAOCTAATAGCCTAATTTGTTGAGCAC 
AGAAAAATACACTGAACCAATFCTGATTA1TGCAGAGAAATGATTGGCAG 



ATACTCTCXXnCAGGAATTTTTGTTCCCTTGATCrrTTTTC 
CTTATTTATTGXMXXXXTCTCXTAATAGGTGGGTT^^ 

GGTACAGAAGCAA I U I 1 1 I T I O AAAGTTTTCTATCTGTGGTI T G I TGAAT 

CCACAGATGCAGAACTCATGGAAACAGTGCCCACTGrTATGTCACAATTTC 

AGAAAATCAGTATTTCATACAATCAGCTAATAGCCTAA 1 1 1UI IG AGCAC 

AGAAAAATACACTGAACXIAATTCTXjATTATTGCAGAGAAATXjATTGGCAG 

GATATTGGG AA ATAGAATG A AGGGCGG AAAG A A TTTCACATGGATTCAGT 

ATACTCTCCGTCAGGAA 1 1 1 IG1 1 LtXTTTJ ATCTTTTTG TG TTTA TGCCT 

TATTTATTG^KXXCrCTCATAGTTGGGGTTTTC^ 

TTCCTGTCCT 

>Sequence 767

GGTACAATCAAAGGAGTCTAATGGAACXAAGTACK^ATGTTCCCGAAAAC 
A A ACAAACAAAAAA CCCCAAACATTTTGCTG 1 1 ICI 1 IC OCCTCTGTATT 
TGCTAACTTTATCATGACTTTATTCTTAAAGCCTATCACTGXiTCrG 
TATTAATAGATTAGTTXJAAATTTTCACCrGGCXrrATTAGCACXTrATAAA 
GAAATAGATTAAGAGTAGGAAATATATAGATGAAGATGTACTGTATAGAA 
GTTCTGTAAAATCAGTATGAAAGTTCAATCTTOCT GT T Xn i G CTCAGTGA 
TTTTAAAGAAATTGAOTAGTTCCTATGTGA I I I IIDI 
ACTGC 
>Sequence 768



TTCTTTTCTAA 



ACATATACATTATGTAATCAAAAAGCGTGCATGGGGATCAAAAAAAATTT 
TTTTGTTTA TA TGCGGAT ACAA TA TA TACAAT AAAACACCTAAACCGCA G 
AGGC 1 1 PC 1 1 G 1 1 ATCCACAATAGTTAATACOCAATAGTAATT AAA TOGA 
TGTGG^ATTKnTAG^CACCAGTACAAAAAAGCAAGCGGGACGTT ATTTAA 



AAAAGCACOOCOCAAOTAAA I IUI 1 1Q 1 1GGGATTOCCAOTTATTTCAAA 

OOG I C 1 1 AAAGGGOGOGGAAATTTTCCCAAGCG 
>Sequence 769

AC TT A 1 1 1 1 1 1 IACTAAG Q1 1 1 I G1 1 1 tOQAOACI IUI I 1U AAATAAAGT 
GATCCTCATTCAGGATTTAGAAACAAAAGTTATA(nTJCACATGCrAC<}GA 
TTAGGAAGGCTAATGTGAACTGATCAAAAGTATGAATTATGGAATX^ 
TAGAATAATCAACTTTTAGGTAATTTCATACTGCTATAATTTCAAGCT^ a 
GAGAAAAGTTGTAAGAATGGCATAAGGAACTCCTATATATCCTTTATCTA 



G ATTCACTAAATGTTCA T I T T GTG CCATTTCTGTTA T I C 1 1 T G TCTCA TC 
CTACKXC*GTCAGCCTAACACCACCAGG<SATAAACC^ 

> Sequence 770

ACXTTCTCATITGTCACrrrTCAACACr^^ 

TGGTCCTGCTGGGGACCAACACACTCTGCAACICI I ICC I C 1 G AGCCAGG 

CTCCCCTACIUICI 1 1 1 LATTTATGTCAAGGCAGGGG AAO ACCTCAAAGG 

GCTCTTGCATCCCAGTCTCACTTCCCAGAGAGGCACGAGGCCCTCCAGGA 

TGTGGGGACAGGAACTTTGGGGCAAGCCGGGGCTGTCCAGAAGATCACCA 

GGAGCGCTAAATAGTAGAAAGGAGAGTCTTATTGGTGATATCTTTGCAAA 

CTGGGAAAAGATACiCCTCCAGTGTGG^GCXAAGATGCTCCTTCTTCA 

AGGGCAAGGGCAGCTTGGATTTTGTGCCTTACAGOGTCGGTATTATATAA 

TAG AGTCATGCATATTCAGTAGGTTTXjGGGGAAAA GCTA TATATATTTAT 

GAGGGGAGCCAACTACATGGGCAATGGATAAACATACATGTAACACATCC 

C A TGTTC A CTT A NGG GC A GG A TTTT AGCA TT A A A ATG A GGTGG A A TTTGG 

CTCTTTACATCAAAAAGTGAGCTATCAGACACAAAGGCGGTTTGTGCACA 

AGCTCTCCAAAGG ACTNG A GGGCTACAACTGCTC ATTTTGA A A G AAANTC 

TGTAAGACCAGCCTTGTCAACX:AGA7^AC«AGGCATCTGACAATTOCC^ 

A T A ACTGT A CCTCGGCGGG A CACGCT A 

>Sequence 771

GG T A CAAA TAAAGT A TTCCAA GGGTGCA G AATNG AA AA GGAA GGCAAACA 
ACTTGTTGACATrrGGGAAATTGG^ATATCCTrTGGGGAAATGTAGTAAT 
CAGTATATTCTGGG AAAAACA TTATAGAAG A ATGAATA A ATAAAATTCCA 
TTG AA TTTGG AA TATTJTTG TCCATTCTTCCCTG TAA CTA A TGCT A TCAA G 
ATAAAGTTAGAA ATACCA CATTTCAG AAACAGCTGG AAGTAGACAGGGTC 
TTCATAGGGCTAGCTTGGGAAACCTAAATAGCT ATTAAATAAATGAAATT 
TTTAAGTTATTACTTCTGGGAATTTJTAAACAAATGAAACACACCAGTGAA 
ICI 1 HI 1 I GACCTTGGCTGC 

> Sequence 772 

GO TACCACCAATA A TGATGCCCAC ATTTGTATCCTA AA AAAAA GTGATTT 
CTTGTTCCTTTGCCTACAAG AACATGTTTTCTGTTCCGCAAAGG AG AATA 
AGAAAAACAATGACCCCCTTCCATTCCCATAACCCAAAACTAAAACTTCC 
AGGGAGTTGAATTAGAAATCCACCCTGTGGGGCA I I 1 1 [ I ICCCCCCAAA 
ACCCACCCCATTACTCTTGTAAGAATTCTGGATTAAGGCGGCTTCTTTAA 
AGAAAGCCCTACCAGG C 1 I CI I I C CCCCAATTACCCCTT ATTCTGGAAAA 
AC<X1AAGGGAAACCCCACTTGCTTITTGGGTCCCAGGGAAAAACCAGGGC 
CATTT ACAAA A CCA TTCCAGG AATG TTGG ATTTT ATTAA A AATGGGGCGC 
C ACCAAA TTTCTT AA A A A AA GG AAA A A ACCOCAA AAAA AA ATT AAT AAA 
> Sequence 773 

GGTACTATCATOCCCCAAGGCCTTTTACAGTCTGAAATATCAAAATTGAA 
A GCAAAAA TAGG ATG A CCAAA GG AACT ACT ATTTT ACC I T CT TT TCAGGA 
ACTTCTACAAATAGTTAGAATACTAAAATTTCCCTTTATGGG^ATCTTCA 
AGGGGGGGAATATAAATTGTGCCCATCTTTGGAAAGGGGGCATACAGATG 
T A T ATGG A TGTACCA AGGGCTCGGGCA TTTTTTTTCA GAGA TOGA TGGGG 
TTTCATT AA CTTG AAA CAAGGT AAGGCCA GTCTTJTTCCCTTT AAAAACCA 
TAGC TCGTO TTTAOGCAACOOCAGGCCAC<XAATOGAACT 
GGCCTTTTTAAAAAA CAAAAA TTTTTTCTT ATGGGAA CCTTT AACCGCCC 
TTTCTT ATGGGGGCCCCTGAATTT A TGTTA TAAATTGGCTTT AATTG AA G 
>Sequence 774

AC^TATACATTATGTAATT AAAAA GCCTOCATGTGTA TGT ATT AAAAATA
A TGGTATATAAACAAATAC^eWArTA TACAAT AAAACAOCT AAA OGTCAGA 
GGGCTGCA TGTTA TTCCACAA T AGGTAAT AA CCAAA TAGTATTT AA TGAA 

ttgtxjtaatgtatgacaacaagacaaaaaagcagcgggacxjgtaattaat 

TAGGGCAGAACACA CAAAAA GGCACCAACAAAAAGCCAAAAGCAT AAAAA 
GGCATCGGCCAAGTAAATGGTTTGTGGGAATCGCAGTAATTCAAGGATT^ 
TGGTCCATTATTAGAACATT AAAA TTCCCA GGT AGGACCACTTGCTAATT 
AAGAI 1 1 1 1 1 1 GGGTATTTTTTAAACCTTGG AGG ACCAAAAAATTTGGGG 
>Sequence 775

CCTA C I 1 1 1 1 1 1 1 1 II I I I 1 1 1 U 1 C AGACGGGTCATOCTCCAATCATTA 
ACTACTTCTAATCTTCACTXXn'ACACAGAAGTTTCCAATATTTTAGCAAC 
AGATGGCT1TG CI 1 1 I A CCTTATAGATGAGGCCAAAGCACCAGGTAGGTG 

GAGGCAGTGGGG AGCAACACGC I U 11 1 1 AACOAOOOCCTXXKTrcCAOOCG 

TGCTTCAC<TGAAAATGGCATTCAGCCCCAAGTGAGGACAGGGCAGGGGT 

TTTCACAATCC C I I rCTAACAOGAAGI PCI I CCA GCCTGA TA TG ATTGCT 

ATCTAC 

>Sequence 776

GATOGOAOOOOCAAOAACNAGAGC1 ITCT1 IaOaOOCI G I l OI AOTAATC 
CA GGT A A A GGCTTTT AA TCA TGTCCTG A ACAA TC A TCAGCAA TGGCAATG 
G AGATGACAG AACAG AATTTAAG A A GG AA T A A A AAA GGCTTGCTG ACTAC 
TTGG A TGTGGGTGA TGCT ATCCTTTG A CACAAA GG A TTT AA GATG A AG AC 
CATI I 1 T ' 1 GCGGGTAAGT AA A AGGTTTGGA 1 1 I II 1 CATCTTACAGCTTT 
i 1 1 1 GT ACT ATT 
> Sequence 777 

ggtactgcaagccaaatgcaatgaacaaaccaaggttattgataatttta 

catx^cagctcaaggrtactcaagaaaagctcttggatcttc 

cggaaagccagtttccgaaggtaaatcggaaatgaaagaaactttagaca 

gccagcttgaggcagcttgagaaacagaattaaacattttagagaatrga 

aaaaagaatxxxttgaaagtaagcaagggcttagtagccatttaaccaaga 

AOAGGCTTCCAAGGGOGAGAAGAAGCTTAAACM3CTITACTAAACCTTrTA 

aggaaaaaattttcaagtgaaagtccagttcaa^gtgaaaagtagaact 

TTTGGGAAAAAAAGAACCTTTt^AAGAATTTTTGGAAAAGA^ 

GGTACTGGTTATCAGGATAATACTAGCTTCACAGAAGAAGCTGGGAAGTA 
TTCCCTCCTCTTCTATTITTTTGGAGGACT ATGTGAAG AACTGGTATTAA 
T AAAAACTCCTT A TT A A GG AAA TTTTTT AA CATA CCAAAAAA TA G T AA G A 
ATAGTATCATO AGTTCCT GTC T GTATTCCCGCCTAACTTCAATAATTATC 
A^TAGTO CACCATTCTTATTTTAC TTATA CTTOOCCTCCCCAACACCTTA 
c ict m GGCGGGGGCTGAAATTATTTTAAAGTAAATXXXIAAACATATCA 
TTCAOCTrTAAATACTTCAATOTATATCTCTAACAGATAAAGACTTTTrT 
TA 

ggtact acq aa gctgcag atca tt acxxhg at atg aatgacttgcttg aaa 
gaacaatgact ctgg cacagcca l i oc 1 1 1 1 c aoccaooaaaocagtttt 
tcacagaatggchttg atttat actttgcacacca ttg a g agaat aaaaa 
gaaaatctaaaagttagtcttagagcatacaaacattctatatactattt 
catcaactttatgtoataatgatatataatttatatatactgaaattatt 
ttcagatccacttac rcTOcn AAACCG AAAGTCAATGATAAAO AGCAAT 

GAATTATCTAATGTATCTTTATAATTAAGAAATCAAG 



ACAG ACAGTOTG ATGGATGATTJC ITJCTOG I IGTAAATTTCATOGTOTGTO 
TCTAA TTTTT 11 1 CC TGTTGAATGGGTAAAAACAAAACAAAA Cn T T TTl 

agaagatgaa 1 1 1 rocro i ca potttti o t q gaaatoaooo at ocott oa 

GCTTCACTATOCACCTTGGAAGTTTGAGTTTGAAGOCATO^ 
GCCOCAI rOOCI 1OA CGCC3EBCAAC0GOC1TOGAATCTOCAAOGTTCOC 
OCTTTOTAAOAOOOATTOCTTTAObDC 1 1 OCJ I AAGAGAAOOCATAACCGC 
1 1 1 rCTO GAAAAACCTAA Cn ITJTC1 I IC AAAAAAGAAOOOOCTCTGGAG 
ATTTAAA CCO I 1 1 ICA AA CIOll 1 1 1CAATTAAGA 



1 1 1 1 1 1 1 1 1 1 1 111 11 II lO OOOCATOA OlCI 1 1 1A ATAGA 
AAAACACACGTOCAACAGTATCAACACACATTTrTTXKXAATCCTGACAG 
CGCTGAACTTCAGTTCTTCACCTTGGGGGGTGGCCT^ 
CTATCAAATTGGACCCTCAACT ATGCATTTTTCTGTGTGCAAGTTATATC

tcaattacaaacaa^caaaaacacaaaaccctatggttaacocaaaacct 
aaactatcaccaagaaatatcaattggggttatggcatgaccatcctcoc 

CAAGAAAATAAAATGCTTGACAGATTCTOAGCGGGACAAATTTCACTGAT 

CATATCOCAT 

> Sequence 782

ACAAATAAATGAGTTTGCAGTGAATTGGGCCTTCAAATTACCTCAAGTGA 
CAGATAGTAAGAAAAGCTTCTTGaGCAGGTCGAGGTCACTGAATCCCCTA 
CTATGCACrrTATCAAGLATTTTACTTACTTT^ 

TTTAAAAAATGACT ACACTGT AACAAGGGAAGGQATCTGGGT m I HO I 



I IC1IG1 I'M I 1 1 1 A AGTAGTTCAAATTCTGAAACTGTGATTT 
AAAAATTTTTTACAGTCAAGCA TTCTGATTTTGAACAT a ACTCCCTTCOC 
ITTCTC TCTAACAAAGaiXnxnPCTCT TATCT CT 1 AAATTTTGTTACATCT 
CCCTCAT 
>Sequence 783

GGTACTCTTCACTGTCTTTGCCATGAAACrrrATAACATGGCT 
TGTTGAATCTCGTGCCXrrGTCACXXrrGTGCTCAC 

ATCAGCATGTGAGGOGCAGAGGGAGGGCAAGCTOO O C HOI OATATTTGA 
GCrrATT^OCTOACTCAAOICICI l lCC CMC I C I CC I IA TTCTCATGCTA 
CCTC I CCCAACCA I ICTCT1 AAC 1 1 CCC lO OOCAGGATGCCTGCCATATT 
ACATGGAGAGGAGGCA CTl [CT AAATGGCTTGA CTl I G GTG A AGTCTCAA 
CTCAAGAAGCTCTG AAATTAATCCACCCAACAGAG AACA TT ACCTTCCAT 
GC 

>Sequence 784

ACTACTCGATTGTCAACGTCAAGGAGTCGCAOGTCGCC IU OI1L1 AGG A A 
TAA TCGGGGAAGTATGTAGGAGTTGAAGATTAGTCCGCCGTATTCGGTGT 
ACCCCTGGGAGGTGCCAGTCATTGAATAGATAAGG 

GAAGTCTTAAATATATTGTACC 
>Sequence 785

GGTACAAG AGG A TA TGTG TGCA TT A CA TGCAA CCA CTA CA CCA TTT AAT A 
TCTGGGo PCI G AGTATCCGTGGG 1 1 1 1 OGGTATOCGTGGGGGTCCTGGAA 
CCAA TTTC 1 O C I O GATA CTGAGGGATGACTOGATTAC TGTG TG 1 1 I O I OI 
GCTTUI IlTTAAGCTTCAAAAaATTATCTGATCTAGGAGTTGTTAGATTT 
TATTATTGGTCTTAAAAG ATAAGCTT AGATGTTjTTACTTTTTTGG AGTTT 
TAGTTTACAGTGATTCATGAATCOGGCAGCTTT^OACCACAGGAGACATG 
AAGCAGGTAGAAGTTTAAGAAAGCTTGACAAGCAAAATATTTGATTTGGT 
TAGAG 
>Sequence 786

GGTACTAAAACTAAAACTCAGCAGTTTAAAACATTCATTTAAAGGGATAT 
CTA AIOI Ol 1 1 ATTATTAACATAAA TAA TGTTTTATG AAAAA TGTAA OCT 
TAGTTTTCCAAAACAAAAATCTTTAGGGCAAGAGTAACATTATTTTACAT 
TATTGCATCTCAGTXIAAAAATAAATGGCAACAAAAITCTTATATCTGCTT 
CTGCAGTTAATTCTGTTCAI 1 HOI 1 1 IOOI I OAAATATATGAAGGAAAT 

TGTGCACATTA TTOGGGATGA 1 1 1 It'll TGA TACAACAAC 
>Sequence 787

OOCTTTOAOOOOCCOCOCXIQOCAOOTAOOOQOOA1 rOCT O ffl IAAGCAGG 
CA I IOC ' 11 lO^ CCTGGAOCAGCTATTTTAAGCCATCTCAGATTCTGTCTA 
AAGOG OI 1 1 1 1 lO GOAAGA CO 1 1 1 IC1 1 1 ATOGCCCCTGAGAAG A TCTAC 
CCCAGGGAGAATTJCTELAAGACATTCTTGGCCT^ 1 1 1 ACTTTATTTAG 
CI 1 1 r D CTOOCTC ATTTCATA I IC1 1IA TACACCC I 1 1 ICCI 1 1 1 1 TO GO 
AGAQATTGTTTATTGCCAATGAA 1 1 1 1 1 1 1 OGGTA TTTTT ATGTT AACAA 
AOGAATTAATTTACCTAAATTTCTA rTTTOCTTTI A TOTTTTTAATTCCT 
AAGTTAAAGAGAAAAATGGTITCAGGGGTCAAAGCTCATACCAAAATTAA 
CCT AAAGGCTG AA GGGTTAGG AG AA 
GGTACCTGCAGGCCTTXTACACCTACCTCTCTCTCGGCTTCT
COCGATOATOTGGCTCTGGAAGGCGGGAOCCAC 1 1 fCTTC C GTG AACTGG 
CCQAGGAGTAATCGCCAGGGCTACOAAGCG 1 1 1 rOCTQAAOATOCATAAA 
CCACTTCGTGTGt^iACOOCG CI l llll 1 1 CCAGGAACATTCAAGGATAGC 
CjOWCTOGATA^TGAACT 

A ACGOG AOOCCOOCAATCTl ICITGTGGAOCTTC 
ATT I IOC 1 1 A 
> Sequence 789

ACTTTAA ITICm ATAA 1 1 I O I 1 CAGCTATTTAAAA AGATAATCCACAA 

TTJTCCTAOXXX^TTAGAGCACAGGAAAAAAAAATTCAAAAATA^ 

AAACATGGCTCATATATCTACAGAAGTCACAAAAATACTATAGGGCACAT 

A T ACCCA GGCCTCA GCGGTGGG A AG A A AACA T A CA A CCA CCGGGC AAA AT 

GTTTGAACACTGAAGACCCCAATTTTTTAGGGCCAT>nT^ 

GAAGGTAACTGGGAAAGTCCTGCATAGAAATAGATTAAATN 

> Sequence 790

CCCTTAGOG TOOTCTCI OOOGAGGTACTCAAGTCGCCCTTATGGAGCCCT 
TGATTCAOGCT TCAATA GTGTGOACAGTGGTOATAAGAGATGGTAG 

tgaagtaagtoti r m a tgttccg tgtgtt at aacacctga ttaag aga 

AAACAGAATGATGAAAATGAAAAGCGTCTTAACTGGATTCA G T r 1C I C AC 
TACATAAAATACAGAAAAGTCAAGCTC<^GGCAAGATTCCCACCCTCT^ 
A GCAGAATTGGCA I ICIOCOI C CI i a ocgg ci I ICIOICA CGTGGATTTC 
CGCC1 G I TTCCT CA I TCOCTCATCG AAATAGTTTCATATCATAGAAAGGC 
AAA CAGGAG CrGAGCCAGTTGAAACTGAAOCCTACAATCTOAGCTGGGGG 
GTAATCTCQAGCAGAGGTGCTAGATTJGTCAGAAAACAAGTANGACTITCG 

gctgatgggtagaaacaaggaccttaataaagactattcatgt g ctcaag 
aagaataaci itxlggctaattctfgtcgngtctcgti i i i aaattatt 

GGATATA ICTTGTCTOCTCTI AAAATTACTGTGTTCACAGAAGTCTACAA 

aaaaaaaaaaaaaaaaaagtacctgcccgggcggcxxxrrtt a a agggcga 
tttccacac acttjggg gccgtacttatooatocagcttcgtaccc^ 
ggggtaatai lulctaactcttgctgtgggaaatlgi 1 i ctxctccaatt 
ococccacatt 

>Sequence 791

GGT ACT AA I IL1 1 1 ICCICTI I CCTAGACCGATTCTAG 1 1 1Q| IQC CTTC 
" CCIII IX, I COGAAAGOCCAAGTTTGGGATGCTGCAGACAC TC iO I occcc 
CtrTTXATGCTGGO IOCC I OOOCAOCTGOCAGGGCATAAAG ACAGAGACGA 
TGTGGCCTT T U ICCTTAAGAATGACGTTTGAAAGCCTCA G T T C TT CCATG 
TTAGGTGAI I I CI I OCAGCTCTTGGTATCTGCAGAATTAGTGTGAATGCT 
TAAAAAATATTAACAGCTTTATATCATCAAAGTTTTAACAGT 
>Sequence 792

GGTAC1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lOAAGCTGAAGGOCAC 
APT AO CT AGCT AAAOOOCACACCACTOAACACT AAAACTTAACCTTTACT 
GGCTACI ITOT AGA TAACATTCACAGCTCACCA TGAA TGCAGCTOCAGTC 
A ACTAA CAGATATGAAGTTACCACTGTATTACATGGTTATATTAGGGACT 
PC 1 i C 1 AC CTACTGGAGGCTTJGGGAGGAA TGTAA CAGCACAAGCCATAAT 
GAAGTTTATATACAGGCTTAATATAAAAGAAAACCCTAGAATGAACTCAA
CACAATTATOT 



ACCATGCAGGGATAGCTGAGTCTTCATCCI CCI CAGCCCCTATCTGTTCA 
gT GCACnSAA CACCAG CIOllCIC l lCX I C ICl OX 

TGGTCTGTTGCAGAGAGAAGAGGAI IQCCIOI lUXTCTTTAAGGCAACC 
HXJU rn 1 KiCTTTCTGGAAOCACTCTOTAATGC 

ACGAACTTAAATITATGATCAATATCTTTOATAATGAG^ 
ATTTTACTTTCAATTTTATTTTAATTrGAAAGAGCATATC
T A TTTTTAACAT A T A GCCA T ACTGTTT A TTT AAATTTG TAA TAA TAG AAA 
TAGAGTAATTCTA CTGTTGGATTTTAA TTTTT AA TCATA TTAAAGTTT A A 
CTGGATTTTATTTTAGGACTAAAATATTTAGGACTAAATAAAATTTTATT 
AATTAATTTAGGACTTTTGGGAAAAGATA7TTCAGAAGTTCAGTGCATAT 
CAAAAAAGCG AACAA CAGAGGCTTCATCTTTTGAAAACTTC^ 
AAGTGT
> Sequence 795 

ACCCT AGGTGA I C I I TG G CI ICCT CA A O 1 1 I I I O CACCACTCAGA ATCAT 
TTCATATACCACCTTTGGCAAACATGCCAGACGTGGAGTAGACT&AAGGA 
AGCTCTCXX^AGCTCTAAATTGATTAATTTATTAGTTCCTAGAAGAAAGA 
GATTACATGTTTA ICIIIIIUI I ACAGAAGAAACTTTG AATAGC AGTTGA 
AAATTTTjGCAGGCTCGACCACCTAACTTGACAGTGTATTA r I g I g TCTGT 
TTTG AA GG AAT A A A ATGG AA TT A TTT A T AAA GTTTTCATTTGT A TT A G A G 
AG 

>Sequence 796



GGCTGCAAGGACA 1 1 I OC 1 OCCATCCAA 1 1 1 0 1 OCXAGCCTGTTTTATCA 

ATCTTJAACCTATATTATTTITAAGACCTCACGGCATC^ 

TATTATTAGTTGGAATTTTAGGGATGAGAAAACTGACCCTC^CWGAGAAT 

AACTCACTTGCCCCGGCTCCAACAGTAAGTCGCCCTGCTGGGATTrc 

CCAGO rp TGTC I G AOCCCGAAGCCTGATCTGACCTCTGACAOTDGTOATA 

AAAATAAT 



C CC1 I G GCCGCCCGGGGAGGTACCGAAAAATGATTTTOTTATATATATTT 

aocacaataaaaaagttttaaatttattataggtgacactgtttgctcac 

TGTAGGTCAGGTAI l TT TIGXilTn ilTI TC I C TTT A TTTTATTTTT G AC 
CAATGGATTCACYjTCACCAGGTGA TTTTTT AAACAGCTTTATTGAGA TAT 
ATATCACXrrGCCATAAAATTCACCCATTTAAAGCACACAGTTAAATGTTT 
TTTAGTATAGAGTTCTGCACC I C I I ATGACAATAAATGTTAGAATATTTT 
CATCACTCAAAAAGAAACCAGTATOCATTAGCAAT 



ACAA 1 1 1 1 1 ATGTTTACAGCTGTAACCCCTGAGTTATCAAGAGATGGAAC 
ATTAGATATOATTTATTCCTATTTAAGATAATAGGACA TT GCTT G ATTAC 
ATTTTCAGAAGATATTTATCCAAAGAAA 1 1 1 I 1 1 1 1 1 1 1 a atctaaagga 
AAGGTTTTGA TTCTT A TGAGAAAAGAATG A G A TTTCTTT AACTGG AAAAT 
TGATTTAIOILX-IACAGTCCAl io i oia gtga io i i o gatcaatcaggta 
TCGCTAGGGTGTCTGTAGAACTATCTATATA TTGCTI 1 1 lA A O l I CI I AT 
A 

ACCA TGTA GCTCTAC 1 1 1 1 OCA TATACAGA G I IOI I rOCTAGCl HC I O C 
TAATCTAACTGGA 1 1 CCTC1' "f LXJCCA F 1 ICCI CATTTACT AGA TT ATAA T 
GCACATCACA TAA TAAAAGCTT AAAAA TGGG C 1 TIC aCAGTTA C T O ' I 1 1 1 
CI I U T AAATAATTGTOAGAGAG CI I TTG CATCATTTATTATCTAATCAT 
GATTCAAGTOACT AOGCTGTAGCACCCAAGAAOC I i OCL 1 1 A AAACACTT 
TATTTTACCCAAT AATACTACTTTGCCTTCTT ACTTAAAAATGTCCCGTG 
CTTAAOCCT X 1 IGCTCTTTATT1 rGATTTAAGCACn GACCC 

GGTACTCrTCT^VrTTTTAACAAGG C 1 00C I C AAGATATT AATXTTOACAAAC 
TTACATAGCCAOCTTJTAAGaTATCTTTCAAATGOGCAAGTAACCTAACAG 
ATTTGTOCATGTCAGCCAGTAATTTCAACATACATTATAAATATGGCCAA 
1 1 1 roCCAAATTCTAAATTJAATGGAGATAAAATOCTATATAATAAATATG 
TTAGAGCA CCl I ICI lO AGAAACTTCTAAAAGCAAAAAATAAAAGACATA 
ATTATACTCACACCAOCAGTAAAA OC I C ITj GTCA CCTG I 1 1 I OGOI IOIO 
TGGATG G T A ,\ TC CTCC AA CTTGOCA CA G A 7 A 7 A 7TCA G7 AGCT7CCCAGG

AATACAAATCTC^Ttn^ ATTAA CTCAATGTOCCAAGCTATCTCAOATTTOA 

A GOCT AAA TA CTT AAA TTTTT AC7TT AO AA TGAG T ACCCTGCCGGGGCCC 

CTTCGAAAGGCGAATTTCCACAAACTGGCCGCCGGTACTAOOGGATOCAA 

OCTCGGACCAAAOCTCGGCCAATAAGGCCATAACTGGTTCCTGGGGAAAA 

TCGCTTCCCTTA CAATTCACAACACATTCCAAOCGGAGCCTAAAGGTA AA 

CCCGGGGTGCCAAAG 

>Sequence 801

OOTACTG ATTA TTCTCCTGCTTAGCK3AOAAGCGOAAGAAGGCOCTTGGAA 
CTOTCA GTTTrG CA TTCCAACTTGCTAATTCAACATAGATCCTAATrOCT 
T A A ATGCTTG T AA TT A G AA A TTCTCGTG AA CTGTA TTCGTTTTTG TCAA G 
CAAT C OGGGAACTTCAGCAACTOGGCCACTCCTTJGCTAGGGTGAA 
GTTTATTTAA | llo i I I | I ATGACATTCTTCATCTTGGAAATGGGGTTTT 
C A A A T A TTGCTTTCCC A GGCA TCATTA CTTA TTTGCTGGTrrTTATTTCA 
AGATTGGGACTAGCTCAAGGTGOCAGGGAAGCGGTTTGTGGTGCTTTATA 
TTAAAGTCGTAATATCCAAAAAAATTGTCTGATTGTATGGGGTATCTTGG 
ATGTGGT ACCTGGCCGGGCGGTCCG7TC AAAAGGG 
>Sequence 802

CCCTTTG ^GCGGCX»CCCGGGCA GGT a cg a t AGGCA tgcaattaaagaag a 

CCTCCCTCAAACaI I I PCTG1 GTGACCTGAGGCAAGTCCTTTTATAGCTA 

T A AACTA GG G A C AA TA TTTGCTG TCA Hill 1 V I ACAAATGTCACAAAG A 

ACAAA 

> Sequence 803 

ACGCGGGGGGTTCAGCTGTCTC7TACTTTT AACCAGTGAAA7TGACCTGC 
CCGTG AAG AGGCGGGCATG ACA CAGC A A G ACG AG AAGACCCTATGGAGCT 
TT A A TTT A TT AA 
>Sequence 804

GGTAOCI I GC ACAGTGCCTTTTA AATTCATITTGCTGGaCAGTTGGCAGG 
CTCITTCACTTGAGAGGCTATATCTTAACGATTTaGAATGGAGAGTTTGG 
CTCAAGCTCCCTGTGTG7tXrrXrrGTG<rrTTCTATACTTTT ATTCTTGGTA 
TTCCA GAG TCTGG A GGCrTCTCTTTTTA AA AA TTGCT A GGCTCCTGCCAA 
ATGTTATAATTTGGGGATGTGAGTTCACTAAGAAATCAAC^ 

cagattaataggagaaatgacatogaaatttattagcatgcagggggaaa 

AAATTGATTACCAAATATCCCAGTAGGGTAGAGATGCTTATATACCCCAC 

CTXrrTAAGAGAGAGGGAAGTGGATCATTTTAGGGGAATAGTAAATACTTT 

NTA TGGG AACTCACTCGGCTTG AAGAATA T A A CAAA A GCCTGGG A CAAA G 

TCTGTTGGGCCCACAGAACAGACAGTGGTTTATGACAAAAGTCTTGTGAG 

ATOTTATOACAGACTTTCAG C 1 fTCTTCTI i o tatatgattcagttaatg 

AA AACTA GGG AAGGGACTAGAGGT AAATGG 1 1 1 1 1 1 IC TTTGATGGGGCC 

CAA^CITAAACCGGATAAGAGGACCTTAGAGAACAAAACCTTATrCTGGG 

CTTTGG<SAGAAACAGAGGATCCAAGACAAAAGACGAAAGTTGGATTCAGA 

GAGACCCTGOGCIT>CriCAATTCAACA7GTCAAAGGGO 

TGGGATTTTAAT 

>Sequence 805

C OGGGC AGGTACTATTACTAGGTTCATT G 1 TTOCAGAGOOGTOAAACGGG 
GCTITGGAGAGGTTAAATAACrrocOCAGGGTCACACAGCTATTAAGTCG 
TAAAGCTGGGATTTACATGAGOCX1AGACAAAGAACCCAAGAAGCTAAGCT 
ATJCICTITJTAATACCTCCA^ 

AGGTroAGGAGATAAAGGGGAGAGAGGOCTGCAGTGCTAACAGGAOGAGC 
TGOGATTCATCCTGGCTITJTTCTGATAGGT^ 
CATGACGTtlACCTACTCAAAATGCGGCTCAGA 
TGTCGAGTYJGGCGCAGCTACAGTCrrcCTtKX^ 

GTCC OCACGTCOC ACTTCAGTGAGGK^TTCATGTGOiOCCAACACACTTT 
CTAGCTTTATTTGCXntKJ AGGGG AAGATTCTOCAOAACCTTGTT AAG ATG 
CACAGTGTGGTCCTCGGACTGGC^GTGTGGCCrcGGCAGTCOCTOGGAGC 
TTXHTAGGAATCGAGAATCTXIAAGCTCCTCCCTACTOAATCTAAAG 
>Sequence 806

GGTACACATATATACACACATATATAGATATATACACCCACATATATATT 

TGCTGACATTTTAATGTGAAGTTTTAGTCTOOGATATAAAATGGAATGTA 

TGACATCCTCAAATGTCTOAATACrrGTIXlACTOCT ATCTTTTACATTTAA 

TTTTCCAAAGCAAAA CA TTTCA GTTG A GG A TTTT A TT AG AAAAT AAA TAA 

TCATTTAGOCATATCTAGAAACCAGAATAAACAATGOCATAAAGCCTATA 

GGAAAATGCAGGTCAGATTCATAAATATTCA PGTG1 1 I ACTTTCAGTACA 

GGG AGGAATTTG AAGTAG A T AG A A A CCG ACCTGG A TT A CTCCGGTCTG AA 

CTCAGATCACGTAGGACTTTAATCGTTGAACAAACGAACCTTTAATAGCG 

GCTGCACCATCGGGATGTCCroATOCAACATCGAGGTCGTAAACCCTATT 

GTTG ATATGG ACTCTAAATAGG ATTGCGCTGTrATCCCTAGAGTAACTTG 

TTCCCTTGGTCAAGTTATTGGATCCCGCGTACCTGGCtXKjGCC^CCGGTT 

AAAGGG 

> Sequence 807 

AATTCCCATGATGTCAGACCACTGGAGTTTtXAGGGGCAACACCCCATAA 
CCGTCCCGCTGCAG AAGAGCA TCAG ACGTTC A G T AAG AATG CAAAGGG T A 
TCTCAGTGGGAACCGCGCACCAGGAGAGCTCCCAAACCAACACATGGCTA 
GGGCT CTCTAGGCCCTTTCAGGCTAGATCTTGACGAGAGAAGAGTAAAGA 

tcttrctgaggttggtgcaactgaagaaacgaaagtttcggcctctgctg 
tcag atctatcaa agcaaagaactgtgaacttgtcoc x. 1 1 1 ig i i t i c i i 
tgacttaaaacaaaagaaaatcactgoaacaaagtcttaaagtaataaca 

GAAATGTCAGAAAAGTTGAACATCTTATGGGCACATGCGGTGAGTTACGC 

taacttatagcatccactgagattagccgcatagcattcttoocatctta 

GAGCTAAAAGGACCTACTGTOCGOCAGCTGCATTGCAGTACC 
> Sequence 808

G GTACT ATCtXCTACCTATAAGGCATTTATAATGTGCTGGGCATTGTGAC 

ACM 1 I C ATATA TTATCTCATG AAATCCTCACW AATAATTCTG AAGGGTA 

GCTGGT ATTTTTATCTCCACTTTACAATTCTCAGGCTTAC^GAAGTTAAT 

TCAGTGGCCCAGGGTCACACAGT1TACAAGTGOCACATTGGTGAATATAA 

AGTAGCAACTTCTAAGTTTCACTCTCCCACTTCCCTAGTT 

GGCATG AA TGTCTGGG AAATAGCATGCATCAG ATTfTTCCACCTCTTT AAA 

ACTCTTCAGTTCATATAAT>rrAGGOTGTGACTATTCATAGATACCTTTGA 

GCTAATCTTCTGGGAOCCAATGTAACXXjCAATGCAC>.CTGCAAAACAATG 

CACGCTTTCTCTGT AAATTAAAAATGCCAACXX}AGCTTGGGAA-AAGCOCA 

TCTTTTGATATGAACCAATAGGGCAGTTTAGTTTTAGAAATAAAGAAAGT 

CCACI G1 1U.IOCI 11 111 11 1 1 1 ACACACAATAGGTAACTCTGCTCTAT 

c i tci acaaagagtcocagtca o un c i atgcctacoctcttaaaagtt 

TCA TTACACAAGCCAAAA CAAA TTGCTtXIAAAAAAGGATAATGAATCCTA 
TTAATGAAAAGTGGTA1 rilClUTAATCATTNTTAATAAAAGGAATGGGG 
GATXIAAATCGCATTAAAGGTCATTTTTGAAACAGAATrAAAAATAAAATT 
GCAAA T A TTGTAAAAAAAATTG ACAG A TCACAGCCCCCTG TTGT AA GGCT 
ATTCCCATTA AAG AATG 



PCT/US00/33312



ac i innn iuci n rnriTi 1 1 n tggaaoaatattocataoctat 

TAGAAAAGTCTTTTAACAATTAAAATTGAAAATGACTGACAAACTTACAC 

TATTTGATTTAAATAAATAAATAAATGGTTIACATGATAACAATCTCCTGA 

TTCATATGGTITATTTAACCAGGTTCTCAAAOCATTGGATGTC 

AATTTTACAATGCAGAGGTAAGTGTTGAGTGTTTAATXK^ 

AAACA7^\AGATCGTATTT0ACTAAAAATCTCITATATACATTTCTAATA 

CTGAAGCAAATOG<XlAAC GTGAC TOTAAATTATrTGAAAAAATCACAAAT 

TTCAGTTAAAATTGAATAATTTTATTATAGGTCTCATAATCTTTTTCAGC 

TTACATGGA ATCA A TG I G 1CI I G ATTTTTA1 1 C I CG 1 1 AATTTTATAAGG 

CCTTCATCIUCI 1 1 CGGTAAATGATTGCCCTCTCATTCCATTTAATGGTG 

GTTCTTACACTAGCAATCTGTCGAATTTTACATGTGGTTCG^^ 

AAAAATTGGAATTAGTAGATCTAACGCTTGCAAAAAAATTAAATATCACA 

TGGAAAAATACTGACAGNTGAACTTTACACATTAAATITTrrCCAGGTAG 
TAGG TTGGCAGCGAGAATAGGTGCTGAGTTTGGTGAATCGTTTTAAAAGC
TCTTGGGAA AACAAATTTGGCAAAGGGGAAGTACTCATTATTGAAGTTCT 
Ik 1 1 1 1 l 1 1 A CCTTAAAAAAA GGATAAA TG AACTTGCCAAA TAAAAAA AA 






COCTTAGCGGCCGCCCGGGCAGGTACTCCA I J ICI 1 1 1 1 ATTCATATTAT 
TTCACCAAATAATATTCCACTTJTGTAGATCTATCACATTTCGTTrAGX^ 
TTTATCAGCTGGTGGACAATTTGG CIO I 1 1 C CA TTTTTTGGCTGTTATG A 
ATAATGCTGCTATGAGTCATAGAAACCAI 1CCICHACTCAAGAAACAGG 
TTCTCCAGAAACTAAGCTAAACTrGTTTGAAATGTAAATTCTCAGGTATT 
CTCA GTATAOACCTATAGATTCACTTAGCTGGTGGGGTCCACCCAACTrC 
TTTTAACAAGTCCTOCAGTGGATTCTGATGCAATGCTAACATTTCTGAAC 
ACTGTC AAAATCA AAA TGG AGTCACTTGTGTTTA AAA ATCCTGAC AAATA 
AAGCCAGGG ACAGCTA TG A AG AG A GGGTTCTCA TCCA TCA ATGCCTGATT 
A ACANAAACT A TCCCAAA TG ACTCTGCAN AAACC ACAATCCTGC ACA AAG 
GTCA TC ACAACCTT ACACAAAAAATA TCTTCACAAGG ACATCTGTCC AGC 
AATTGCCTGTCCAATCTCAGACTGGTCACACTTG 
> Sequence 811

GGT ACAATCATTAAAACTATGTTGTAATAL r GTTTGTCTI 1GTATCCATT 
CTGGCGTGTCTOCATACACTTCACT AATATTTGATATACCTGTTTTATAC 

caatata a tgc tgctgct gtacgtag aagctctagtcaccatatcctcta 

tttgttcaattattttttcatcttctggcacactackjatctataacaatc 

acaatatcttcaaaoccattattattc 

> Sequence 812

GG T A CCT A AG A GTTATTAA TACT ATTTC AGT A AAAAAAAA A A TTT A A TAA 
ACCCTGTGTG ATCCCATTGTAACAC AAAGGCTG ATG r 1 1 IC1G TTGTGAA 
A TA CAAA TGC AA GGAAA AAA TCATTTCTTTGTTTCA AA GG A TCCA TTTCT 
TCCA T AAA G A A TAA TTTGTA TTT A TTTTT A A GGGTTTATTTT AACTT A T A 
CATCAGCCTATATAAAA TACATT TCAAAATCATCTGTGCTCTTTAAATr^ 
CCAAAAGCAAATCTTAA 1 1 1 1 1 1 1 1 1 CCCTCTAACAGATAACAAGTTTTA 
CTCCTATGCTGA 1 1 1 1 1 C 1 GGTCCCACTCAAGTTATTTTGGAAGCCGAAT 
TAAGCAGAGGAGATGGGGATGTCGATTGGGAACACCCCCGAGCTGTTTAC 



TCCTACTOC*XnTT TCTAT AACCCCGGAA^ 
CATATTACAAGAGAT " " 



^-^-TTGAGCGGCCGCCXXXiGCAGGTACATtTTGC 

CCTACATTACT CCAGAAT ACAAAG C I ICI 1 IC TGCCTTTCTCATCCACAT 
AATGGAAGACAC 1ICT JGGGTGAAATACTCCACAGTTATTTCAGTTCTCA 
CTTJGTGAOTCTG^ATATAAGCTCTATGAGACCAGGGACCITGTCAGTCT^ 
ATTCACAAT ATCCC CAGCCTCrAGAACAAGGCTTJGCACATAGTAGATGCA 
CAAAAGGTG IT 1 OCIX» AATG AA TGG ATGACTGAGTCTGTGTGGGGTAATG 
ATAGGGCTAAGGATGGGACTCTAAACTCAGG 1 1 ICJC IC I UI GGGTTTCAC 
AG ^TTACTGGTCTTAAG AGGAGAG TTTCCT AAACTTGCCTTATGATAAAA 
ACX^CX^TCAGCATTTGGTAAAAATTACCCATTXXrTOT 
AGTG AGCTGAA OTGG AGCTGATGAA rCTGTl I 'I ' i ' l GI OATACTGCTOCTG 
CTGCGGH 1 1 lAACACATGCTTCAGGTGG I ICI A AGCTTAGGAAACCTTG 
CCCAAGGATACCATCCTGTCTCTT GG GAAACTGTCTCr A T 



CCCTT A GCGTGGTCGCAGCCGA CGTAC 1 HI 1 1 1 IU 1 1 1 1 1 1 1 1 1 UI 1 
TATA AAACA TTATTCATATTITATCTTATTTTAATTCA 

CTAA 1 1 1 1 1 ATCAAAAACCAACAACCAAACAAAAAAAATATTACAACAAA 
CAC^GAAACGAATCAAACCAAAAACCAAAAATACTTICTGGAATTC^ 
GA T ACA TTATATATACCTATCAAG ACAACAAACTACTAAjCTACCT AAACT 
^CAAATTATC^TAAAAATGACTCCTGTOCTATATCAATAAAAAACroCTA 
TTAAAATTGAGTATTATAACACAATACAATGTCT ACAGCTTTT 
ACA A GT A TT A TG T A TCCA TAAA A A TTAAAAA A TCTTT AAAAA TGCAT A TG
GGGGTCA GTACG TAAAAGAAAAGAGAACCAAGAGAGCrroCAGCGGGGAGC 
ACAGCTTGCTTTAAACATGAGATCCAGCTCAGTGATCATGCGC^XM 
GGC CCGG CATTGCTGGAACTCCTAATATTTAAAAAGATGATGGA 

ACTTTG A AGCCTCTCTGCTXSGTCAAA CAA G ATGTATCTGTAGGCTGG ATT 

TAGTCCACAGC 

> Sequence 816

GGTACAACTGTAATAGCTATTGGTCTTCAAGTGGGTTTAGATTTGGTGAC 
ATCAGTTTG ATA TTXTTCTTAAAGG AAA TAAA TATTCAAGAACTGATTATG 
TTCT A A CATGA TTA TA TTCAT^GOTGTTAC ATAGGCCTGA ATTTTTTCACA 
GAAAGAI I I I I GGAACAGGACTGTCAAGTGAGGCTTTTTAAAAAATTATT 
TT AT A AGC AG A G A AC AC A GCCTGATA ACTT AG TC A AGG A TATACTGTCTG 

tctcactactttggacttatatggcttcagattaagtcatccaagaaaca 

TACATA 
> Sequence 817

ggtac atgt aatagacactatgctacagcaaaag ch I ICI I A TTCTCTT 

TAAAA TTTTOCTGGG TGCATA AAA CTATG T>KX»T A ACTCTTTCCCAATTT 
TTAACI I I IACATTACAAGTCATTTTCAGAGTAAAAAGTCATTTAACAAA 
GGCAGATAGAAAGGCCTCAAATCCCTGAGGACCAAAAATCCCAACACATT 
TTCAAAA GGG A G A AA A TTTCTTT AA ACTTCA TGGG AAA A GT A TTTTT AA C 
ATAATAGAGAGGCTTTATGCAGT 
> Sequence 818

ggtaci nun rr n m ini it tttttai i rrrTTTm rrrrrri i 

TTTAA CAC I'M CA ATTTTG GAACA I rTCTTTTI I I I ITG AOGGAACAAAA 
TTAAAi 1 1 I I lAATTCTAA I 1 1 1 1 I I 1 1 I I 1 1 O G ACACATGTA I ICCI 1 1 
TAGTGGAAACAAAGGAAAAAATAACTTTTTrCTCCAAATACTCGGCCTGG 
AA AA ACCAAA A T ACAA TGC AGGG A TGG AA TCAA ATT AA CAAA 1 I I 1 1 I 1 1 
CCTACG GAA ACA AGAGCCTTTTTTGGGTATTTTTACCAACACCT AGGAAA 
AATTCCCTTTTTATACAAAAGTCATAGGGA 1 1 1 1 1 1 1 C 1 1 AAAAAAAA AA 
ACAAGGTTCTTGGGCT AAAATAAATAGGTATTACTAACATAATTCGGGAA 
CACGCXXAATGCCCAGATAATAAA(X^AACCCCK3CCCO^^ 
ATAAAAACAACCCTCACGCCCGGGGAAAGGGGATATCGGCTT1GACCCCT 
TCTCCCCTTACACGAGGA^TAATTTTCCGGCGAAAAACGGGT^ 
AAAATTTCA ACAAAAATACA AGGCGCGG AACTAT A A AGT AAAACCCGGTG 
GGGCTAAG A GGGGGGCAAACCCCATGGCAAAGGGCCCCCAAGGGOCGAAA 
ATCTCAAGGGCCACGGTIGTGGCTATTCCAAAAACACCCX3CCCCAACAGG 
AATAAAAATTTCCACTTAAGGAGG 
> Sequence 819

GGTA CAACT GTAATAGCTATTGGTCTTCAAGTGGGTTT AGATTTGGTG AC 
ATCAu I t I oATATTCTCTTAAAGGAAATAAATATTCAAGAACTXSATTATG 
TTCTAACATGATTATATTCATGGTGTTACATAGGCCTCAATTTT 1 IC ACA 
GAAAGA 1 1 1 « 1 GGAA.CAGGACTXTTGAAGTXIAGG<rrTTTTAAAAAATTATT 
TTATAAGCAGAGAACACAGCCTGATAACTTAGTCAAGGATATACTGTCT 
TCTCACT ACTTTGGACTTATATGGCTTCAGATTAAGTCATCCAAGAAACA 
^C*^ A ^TTCTAAATGGTATATATTGGGAATATATGCCCCITTAAAAGA 
ATCAGGTCAGAAATXjCAATAACAATTAGACTAGACTGTrGCOOGTG 1 1 AG 
3AGAATGTOTGGTCATCCTAG 
> Sequence 820

GGT ACT AGAATTAGTTC CAACTACTC<n"GGTGATAAACTCACCATCTACC 
TTCACI IIjI 1 1 IC1CTT AA TTCTCCAA GAAGT AATCA GGTGAATAAA G AA 
TCA TCATCA G A TAA T A TTCTCCAA G A TTCTTTAAG AAA TT AATTTTT ATC 
TACTCTT AAATGATTGCACAATTATAGGATAGAAATTACTATCTTGTGCT 
CTAATTCAAA T TGCTCT TAATGATCCTAGAGAG AAATGAATTACTAG AGA 
TAAAAGATAAATTTTGCTGTGGTTTGCGA rcrTTGTTTCTI ICCTTAAAA 
CTTAACAC
> Sequence 821



ctgccacagcggatttcaaaaagacaaagcaatgcaagccacgtgttcaa 
aatgccctaagtggctattcaggtaatatataaaagtaagaccaggctaa 
ttaotatacaatgoogtaaaccagacaocacaaago oc i rcn i aaaato 

AGCCTACCACTCCrroGCCTCACTt nOAATT TAOAOOCCATCTTCroATA 
TrTCACOACAAAGTAAAAATCTACATTTTTATCTAAAA I CTTT1 T AATTT 
TTAAACACTCAOCTCATTTT 
> Sequence 822

CCCTTCAGCGGCCOCOOOOGCAGGTACAGAOCATCrTAAGGTTGGAAGGA 
CTCTTAG AG ACCATAGTCCAGCCTCCCACTTG A TACTGAAA CACGTTTGT 
G AA TTCATGGCCG ATGTCTAACTTCCCTCACCaCCTTTCCG ATA TCG ACA 
GTTCTCATGCOCAGAA GCAA AACCTT C TTTATTG TGOCTGTOCT O CCTTG 
ACTCTCATGCATATAATCAGCATCTTTCCCaCTAAGTGAAGGGCOCAGAC 
TCGaGCACAGGAGCACAGCACCOCTTAAACTCACGAGGGGCTGCATTCAC 
AC^TCAGC^GGGAGATTACACTTGTGTCATTTG 
> Sequence 823

CXXTITAGCGGCXXKXXXX3GCAOGTACCAAGACTTTAOAOGGCAAAGAACA 
GAOCaTTCTTGAGAAAGGGGACTTGAAGGTGAAGAGATAAAGCCTGGTGC 
TTOCAGG AGCGTGGG TC TOCTAO C 1 IT GTCTTC CTOOGAAOAATCTTGGA 
CTCAOGCCTGGCCAGCTOGATGCCTGG GI ICC I IA GGCI rOCTOCAGGCA 
ATGTAGTTGCC ICTTTCTC 1 CCCCGCGTACATAGTAAGTGTATG ATAG AT 
GTTTCATTTGTAAATTACAAATATAAATTATCACCOCCATrTCCATTTAT 
TTTCTTGATATATCAAAATOTGTTGA 
> Sequence 824

GOT AOOOOCATTATAGTAGGGAG ACTGAATCTTCAAAGTTACaGGGTGAA 
TCAATCATAATGATCTTTGCAGCTTTCTGGAGTTAAAAAGCATCAAAATT 
GGG AGA T A TTAG A TG ATG A CA TCT A A GTA TTAAAA T AAGG AGA T ATT AAA 
TGATGACTCCTAGAAATOAACCTGAATAAGGACTACXXiCAATGTGTGTGG 

totoooaaaooaca o ncrm aatggctcgctgacccagoctcaatttt 

CTTGCAGCTTCGCCG A CACG AGGTGACCATCTGCAA TT A CG AAGCATCTG 

CCAACOCAGCAGACCATAG 

> Sequence 825

GGTACCTCTCATGGCI 1 1 TTGGTTTXAGCAGTG AGGGCATTGGTG AGATC 
A GTGGT AAA CTGTOC AAGC 1 1 IC'f 1 1 1 1 ATCATT AGG AAATGTC AAACCT 

TGGACAAATTTTGAOTTTT aacaaggacaaaaagttgaaagaaaaggcac 

AGTTAACAAAAAAGGGTGGCTAGATTTATCTIXXjGTGATGGAGGAAATGA 

gagaggaatgctcttgaaaggtggtctgtggatctgtctgaataoa^ 

GCACAGTAAGTATGCATTGCCOGAGAAAACO ICC 1 1 GAAOC r OCT T O TCl 

catgtgtatgatgtgc 

<^^ctcaacaagcagctgacttatg ri ttattggacattgtcatacagg 

AAC1 CI TTOCAGAGCTCAATAAGGTACGCGGGAAAGTCAACTCAGTTAOC 
1 CIU I I ro GTtnXnXTrATCACTTGCAGATOCTOTCTAOCA OCrTITC AGT 
GACATCCTAGAAG CI I CI CI ATTAOCACAGTAACTGGCTAACTAGATATG 
A ICI I lUX TAATTTTCATGAOCA ILI 1111 1 CI U ATATAAACCAGGOAO 
OOAAAATAACAAA U I roC T TC ACICTGAAOOAOTAI 1C1CCICI AGTACC 
TGCCCOGGCCGAC 



GGTACATATATGAAAAGOCAACATTCrAAAGTAGAGGTrcACTTAATTTT 
1 1 1 Ull I rCAAGAGAGGCl TCI I GO T A O! I IC ATCACACA GT GG1 1 1 >A 

ttaggggatotaaogattacagaaacatcgtattttttaacatatagtat 
tttttqaatatoatttgaattaatatagaaaaotocal'itl n ccagttt 
ttttagggaaaaggagatacttcaccaggaggataaaaaggaacaagagg 

GGAAGGGGAAATAAAAATTCCAGAAAGATCAAAAATTOTTGATGTAAGAT 



sl32 

3 b Table 2) 



GGAGGCACATTNT 



GGTACAAACAAGC1 HOI I A AACTAACOCTTGCCATCCTGGCTACTTTAC 
CCAA TT AACCACCCTAGCCCA GG A OGTTTGCTTT A TGA CATGTTCACAGT 
TTGCTA I I CI I rOTTCAATCrroTAACTOACTOCAACTOCI rCTOTGGOT 
CTCTOTI PCI I IATGAA OI 1 1 UCCAGGCCATACAAAAC I ICTG1 IAGOCT 
AICI PCTCn CAGTTrAATTGTGGAACTCAGCCAGGCCCTTAAGAGGATGG 
AGGAGAGTnTTCCCA CAGCAGTTCTGAATGGG ATGAAGTG AAAAATAAA 
ATCTCCCCATTGOCACTACACCACCTCCTGATGAGTCTTGCAGCAGAAAT 
ACCGTTTaa CTOTTTCTCCI ITTATTTTTITCTGATTATCATCCAGTTn 
ATATATTCATATCTGCGTGCTTTGATAATTATATATACATA CI ITTTTOA 
TATTA TTTACTTATTCTTTACATTGG AAAGGAAC 1 1 GC 1 1 IOIAATCTAC 

attcc ci i ic ctcctac a i i ii i i ma gi 1 1 1 1 1 i c atttgg 1 1 ici a at 
tgaaactaaaggtacactgactgttaattgaaaagagtttcagctttagg 
actttaa 1 1 1 1 1 ra ag cttci 1 1 c aa tggtccgg accta a tttcg aattg 

CAGTATTGTCCTGCCCGGGCCGCKXXjTTTAAAGGGCAAATT^ 

GGCGGGCGGTATTAGTGGATCCT 

> Sequence 829

ACTCACAAGCAATAACAGATTCATAGATCAGTTGACATrOGCT GGTC TOC 
AGGACAOGAATGTGGCCAAAAG G ICCT T TCI ATAGACGCGGOGCACTGAA 
TCTGTOTCTCCCCTOTTACCTA C I I 1 1 GOC A GTGA A A TTT AA GTTTT A AA 
ATACTTTCAGAATGTA 1 1 I I I ACTACTGCAAGl I I I I GGTCTTT AAAATG 
T CAAGTAG CATCTCTCT C i- riC I CICIG l C l Ul 1 ICTOTl ICTCTCrCCA 
Gl I I I I 1 1 1 1 1 1 1 1 1 1 ! AA TTTCC AT A TGGGCTA A A GAATCC A AAT ATTT 
TAAAAATCTGG C T C T Cr 1 1 ICI rCTCTCA TAAAGTGAATTA TTOCTC T TI 
IMG I I I l ATXrrAAGTGTATATATTCTT AG l TI I ICI I' G AAATCATTGTA 
ATGCTAA CI mCTTGTI IC AAATAICI IU GT O ATT O C1 ICATTAICIC1 1 
CAACAAAAAAAACCTTTAATTTTGCCATTGAA ACTGTAGAACT ATGCCAT 
GCTTTTATTAGAAGCAGTCCTCTGTGTTAACAACAAGAATGGTGTAATTA 
GAATTGGGATGGGGATATTTACTGT ATG ACAACACATTTACAGGTCTGTA 
ATGCAAGG A TGCA A TTTAAAA A TGTGAA GTA A TGA TGGGTTTTG AAAT AA 
CCTTTAAATATATGGACTTGAGGGCTCCTGGGGAACTA 1 1 1 1 I I A CCTAG 
ATAAAAGGGTT 
> Sequence 830

ACAAOOCATTGAATAAGCCTCTTC C1 1 1 1 III rGCTCAAACATTCCACAT . 
CCI IGIOGATTCOCCTGCA I TO1 HO I 1 1 1 A TATAACATTTGATA 1 1 PGT 
TGTAGCTTGTATATG AACATAA 1 1 1 I CI 1 1 AG AGGTAGTCACTG T 1 CTCT 
CCAGTATGACCCAG O I I IC I I OA CTCTOAGTAATGCAGCTTCTATAACTA 
TCTAAATTTCTATTGAAG C I 1 1 1 1 OG ATTATGAGTATGCTGA C 1 1 1 IC AC 
GATTGGCTGGTGCATCriTTAG^CTTAAATGTCATATCCTr^ 
AGOCAAAATAGTAACATCTCATCTCAGAACAGAGCTGTGACCAGATGOCA 
ATATA TTjTGTCA CAAAGTCT ACAT ATGTT A CA I ICC I rO OAAOTCTOCI I 
AAATGTTTCACAAAATGTCAACAAGC1 lOI I I lO 1 1 ATTGATATTTOOGA 

gaatgogcacatttaagacagtaaaggggaaaggtggtgaagatgctata . 

agaagatgctgtatcttgagaattgaaaaatgagaatctgacatcgttto 

gaaaatcatgiaaagotttatataaaggatgcatgtgtaggagccatttaa 

ATTCATAACAATATGTGCCCCTTCAGOGTTrAAATCTrATGAAGGGGTTA 
AGAGATAA O I C 1 1 1 OX jAAGTGGACAAAAOGATTTGAATTTAGU ITCTGTO 
GATAATTAG 
> Sequence 831

CCCTTGAGCCOCCGCCCGGGCAGG TACGCGGGCTGGAAAACTGLAACGTG a 
AGTCACCACTAGGCAAOCTGOCTG I AATTOAOC I FOCI I O TATATOACCA 
ATCA Acci r ro cTTOi IGAAGOGTTAGTTATCTA OI I ICCI 1C 111 ICI I 
TTTTGGAATTTGG ICI 1 1 1 AAGG I CI IOA TAA ICI 1 1 C I A GTCTAGAGCA 

tgtgaagagaacagaaggaaaatcaggactcagtttacttaatttaagca 

AGCATTGGTTGCTGCAGTTCAGGGGAGGTTAAAG I I GCTGGGCTCCACTC 
TCTTATTAGCATGGATGCrrAAGAACTTCAGGGTrTGGAGGTCAGCTGAA

CAG CIOI 1 1 1 rOCACTCTCOCI lOI 1 1 1 AGTAGCTGAO I ICI ATAAAAAA 

ATACCACTCGGGTAAATGCTAATATACTTAAGCCATTTITTACTTGATAA 

CATGCATAAAAAGATATTAGGGCTTTCATGGCTTCTXXXX C 1 1 1 [ GGCTA 

AAATtTAAAGGTAAAAAGAATGCCATGGTTCAAAAAAAAAAAAAAAAAAAA 

GTACTTTGGOGGGAACCACCCTAGGGCAATrOOCACAAATTCCCGCCGTT 

TTTTATGG A1CCGACTTGGGTACAAACTTGGCGTAA TAAGGGGAAAACTG 

GTCCCGGGGOAAAATGTTITCGCTTCAAATTCCCCAC^ 

AACTTAAAGGTAAAACCTGGGGCGOCAAG 

> Sequence 832

GGTACCCT AOGCAGGGACAGTCAAGAAAACTTCATGGATCTGTAGTGTAA 
AGCTAGGGAGAAAGAGGAAGAGATCC 1 0 1 1 1 UAA 1 1 IC ' IOI AACTAGCGT 

atctocagataatgcatgaacagccagtaaagatgaacgcagattattga 
tggaaagaacacacatggagaagagaaaaagcaagtccacagagcttttt 

AACA TACACTCCCTCACCCCTACCCNCAGCTTAGAAGGGCAGG AAOCTGC 
TGTCCAAAACAGGAAATATAGGAAATACCAGCTGAGAAACTATCCACTTG 
ACCTCCATGAGCCCAGCTGCXXCTCTC^ 

TOACACACAATCATG C 1 1 ICCI II 1 1 1 O CACCTGAAGGAGTGATGTCACT 
CCAGACrOA GTOCI I A TTAGAGGOGATGATGGAGTGA I' l l H AGACCTGG 
GAATGGTCTAAAA CC 1 1 1 1 IOOC I I AGCCTAATCATTOGATCCTTCAAGG 
AAATTGG^TATTTTGAATGCACATCCCAACCCGGGGTOCTTATCAATGAA 
CCCTTACCTTTAAGGCAC I rTGTTOGl 1 OAAAGGOGGGACAATGAAGCCC 
AOAATOACTTCT GGT TCCTCCCI 1 1 IGCAAT AAAAG CTrTGACCCAAAGCT 

ocaatggoggctttci n iiiiAcccc 

> Sequence 833

A CI I 1 1 1 1 1 1 1 I I I 1 1 1 1 1 1 II I I I I i og gtcaagtagaaatcaaacagt 
cct aatggaottcata tcttattjgcat^ 

TTTAGAGAGACIOI 1 CCAA TAACTCTCA TTT AATTGGTGAAAAAATT AAA 
TA TTGGTTT AGAT ACTT ACCT AAATA TTA CT AGTTAAATTCAAAGT AAA T 

gagt ctg tatctt taaaactacttggcagtaataatttrtaaaagtagat 
tt ttai rocttttci luaactaactagtgttcatacaacacaggtagttt 
tatttotcoctggaattaaggagtgagacacattigtaaaatgttcacaa 
tcaacgoctctcocattttaaaatctcacaa oi 1 1 1 ic i i latgattaac 
aca attc acaaaataagaaatggta rrrocTCATTCT C i oagttcaatct 

C IGCICI ACT AAA TA T AACTTGTGA GGAAAAAGTAAAAAGGTCAAGAGTC 

TAATTCATTrrCAGTTTTTAAAACTATATTTTAAAAAAGAATXiATT^ 

GT AAAAATAAA GAN 

ggta ci iniiininiiiiiiiiiinn go i i i i i i i atctga ccac 

TATCCATTCTCTT 



TTTTTTTTTTTTTTTT 
TTOCA G GAACAAAG 



tgtatcatccaticcatgtocagaacacattcacatc^ 
cctcattxikxtgaagcctgctggcrrogggcatagt atgaatacttococt 
catcatgoocatttcacagatgcataaacagaggocagtcagtatggctg 

CAGA CTGTGGATAGAGOOCGAACCCTCAGGTT AGGCAGCTTGGATOCAGC 

tstgaotoocagctaccggaactgagtcagoct^ 

g ui 1 1 rctoaoctclxaogtcggiatcatoatgctogci i lo oagggtag 

ctotoactattaaattaogcixiatccaog^ 

gggtttagcttgogagagttctiggctttgoct aggaaa taattcaaggg 

CTTCAAOOOCTAGCCAGIlKilGI IAGCAA CI 1 1 ICI IGA AGTGOCaCTGT 

GGTA CI 1 1 1 II II II 1 1| II | U n II 1 1 1 1 1 II 1 1 1 AATTCAATGGAAO 
AAAAGTCCAGCTT AA T AA CTTTAA TGGAG AAAGAAGGAAGCA GTATAAA 7 



TTCTGGAGACTtXAATCACA 1 0 1 CC I CCACTCIXXTACCCTGCXXXrAAA 
0TCT1 ICCCTTGATGGGAATGTGA 






TCAGGTGCGACATGGGCTCACAGCCTCACTGAGGCTGGATC' 1 1 1 1 1 1 1 It 
TGTTCCCTCTGAGTCATGGAAGTGTTCAAAAGGAATCATGAGGGTATTTC 
GTT ACTTTACTT ACTTTACCCCATCACAATCAGTGCACTTTCCTA 
GAATTTTATTTTOATTATCOGAAATTTACAGCTTCTCCTTCTCCAACT 
AA I I I I IC 1 1 IC ICCI b rrCTTACTA II I HC 1 I A TTACAA A ICIO 1 1 ICI 

T CTTAA TTAATTATTCTTCCA 1 1 0 III CPA IC O ICTGGGTOGCATTGTGT 
GTTTTTA CCTGGCCCG AGGOGGCGCCTTCAAAAGGCCGAATTCCACACAC 
ACTGGCTGGACTCTTATATATGGTACCAATtrTCGATACCThKKTT 
> Sequence 836

GGTACTTAGCAAAGAGACTT ACACATTAGTCAAAAATCTAAAATCAGCCT 
TACGTGGGATCTGCOCAAAGTATTATTTGCAAAAGTATCATTTTCAGTTT 
t a ACTTTT AGGGGG agcagggta ggctggggtg ACACAC ACA AATCTAGG 
CAGGCAGAGAGCI IOC 1 1 1 CCI CAGCTTCITACCCTTAGTAAG ACCACTT 
TAGTAGGACACTTAAGTATTTCAGTCAGCGGATTTGAATCTGA C I IC I IO 
GATGCATCTGTATCAAAACATAGCATTAGATGTGTTACAGAACTGAGCAG 
CATATCATTAGATGTGTTACAGAACTGAGTCCTACTTACAATAATTAATT 
T A AT T T CAA TA GCG A TCCOC ACCATTT A TGTCCTA GGCA T CT ACACAATT 

CTTTGGCAIOI 1 1 1 1 ACAATCCCGCGC 
> Sequence 837

ACI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GCAAACTTTAATAGG 1 1 1 ICI I AG 
CTTGACAACTCATICTCTATATTCACGAACAI C ICCTGACI IO I ICC I I C 
AGTGGAG ATACOC I I 1 1 C" I AGCCAG AGTTGGCAA AAGTAGCAAT AGCATG 
CATTGG C 'I lO I I lO AGAGGOCCTGGGTGAG CC 1 1 lOI IO CATAAAGTAGO 
AGO 1 CIO I IA I r CTCTTO OTAOCATATO OC I I C ATTATAA O 1 1 IOC C IC I 
TTCAAAGAATATTCAAAGACCAACACAAAAGAGAACATTTCCAGATCCAA 
GAGAGTGTATGTAGAAACAGTGACAAGTTAG AAAATCAACTTAGGT atca 
GA TaGCAGCCACAAAAT AT G ttc tga ocaaaaattcataocaatttataa 

CAGCTGAGAAAAAGAGGGAGGATGCGGGAAGGTAGA 1 1 1 IO ICA GAACTT 

actagactaaggatktattgcatattttttactaattaaatgttggogat 
gtcagaco roon gaaaataattaaaa g ictgoi I a aataagcctttttc 
accctagcttaccta 



actacaaaaataatgaagccagctaattaocatcaggttacaactttaca 
aagaagtgaagcagcaaagagctgaagcagaaatga<^taggaaaacag^ 
agcaaagtocttgagtcccaacagtccaoctcaaagagaaacatactaaa 
gaacaaaggoocctaatccaoctoct c aoocgccta ctttki I III I II I 
1 1 1 1 1 m 1 1 1 icc a oi i mo i l icaaai m 1 1 attatacatcatggt 

TGCACAA TTTGA GGCTGGTT AAAT ACAA 1 IOC 1 1 I IC AAAAT C T C T I IO A 
ATA11 1 7 CT OGCTT A TTACA TGCAAA TGAOCA TGAAAATA TTTGGCATTT 
TAAAATTCTCAAACTCraAATAGGCAXnTGCATC 

ttcatagatatocacatgtagaacagatgctocagcacatggtggtacc 

> Sequence 839

ggtaoggacaagggggcgactggca ioiooi t ICI I ICIOOIC1 rOTAGT 

CGGTTTGGAA 1 1 1 < C I AAGTCAGGGTGGGGTGGGGGGAC 1 0 1 OCACGAGT 

catgtgcagactggaacoca t c tccccctc g gtctgcaagttaaaacaat 

■ TGG OIIOICC l lCl l ^ GCATCTCOC ^IOIC I CllA CTCAATCrrGGATC - 
AAAAGGGOGTTGGAGGAGGAGGCTGGGAGGGAAATCCAGAjCAjG I IC 1CCG 
CCTCTGACATCAGGTtX ACCTGTTA GCATCGT GC I G PO G GTOOC PSAACA 

GGTGATTCTGATCCATGCAG<XXTTAGAGGCGACAC^ 
CATTCTAGGAAGAAAOAGOCCACTGCOOGGTGA C C I G 1 CTAGTTCACATC 
CACTCACCA 1 1 1 CCC 1 CC 1 CO 1 1 CC I ATTCTTAGA.AATAAGACTCTGACG 
CTCTC1 1 1 1 A TACAGCOrAGTOOOCTATAGGCATGTCATGGTGATTATTT 
GCAA TCC ICCTGACI I I CC * I A AGAAGAGATCAGACTTAGCAGGGTTaGTC 
GG TACAAA T AAATGTA TCTTCGC TT AA GTGCT AT AAAGGAA AAG AACA GG
TTCAATTXSAAGGAAAAATTAGAATTGTTGATACATGAATGGAAGTAAATG 
ACCCGGACTTCCAACTCTAAATCTCTGTtTCATTTtACC 1 C 1 1 1U1 AAAT 
AAT^TTOCTATTATGTrAAATATO^CAACTACTgrCA fTTCTTOTTTA C 
CCACT ACATTCTAAGCTTGGTGCTGACATCTTTGTATTTATTATATAAAA 
TTCTCAAAATTACTCTGCCCGTTAGGCTTTGTTATCACTT atttcaaatg 
caaaaataaggtocaggoaagataattatgtaacttg TTCATGATTGGAG 

AGCTAATAAGTXrrCAGAAATGAATrGAACXlAAAGTTGGTGTGACAAAGCC 

TCTGTTTTAAGCAAAAGGGAAAAAAAATTCTCATTAACTOCAAOCATTAT 

CA TTAGG AGTCCAACAGGGTTCCCAA TTTOGG AACTACTATTTCATTATC 

ATATGGCAAATGGTCCACTATGTTAGATGAGAAGGCAAAAAAAAAAAAAA 

AAAAAAAGG 

> Sequence 841

SSZ^E^E?" 1 ^ A A A A A T*^ 7 " A "TCrrCKTrOTT-CTA^TGCrrACTT ATT" ATT A TTCC 
CTTCC TTTC TAG AATGTA TCNCACT AA A A G TGTTT AA TCCTG ACTA T AAC 
AA TTA TTTG7TA A CTA TTAAAOOGGTAA TTAT A CrCTAAGCTTCCACTTT 
TCAGTTAAAACAAAAATGATTAATATGCCTATACAGAACTTTCTCCAGCA 
CTTGGTAAGTATTTTTTAAAGTGAAGTCTATTCAGACTGCAACCAGTAAA 
CTAT TTATGCTTA TAA ITTTTC 1CACGATGGATTTCTGTTQJ ITTCTTC C 
ATTGTTTG Iui i i ATTTTATGTGAT C n 1 1 1 I AGCTACAAGGTGGGAAAA 
TGA CAGTGG TTTAGAGATAAGAAGCACATGAATGGAAAGTAAATATGTGG 
AGA , GGCCACrcTTGTAACTACTATCTGAAGTAGTTTTAAATATTT 

tttagttggtaagaggatgtacctggccgggcgggccgtcgaaaggg 

> Sequence 842

GGTACAGTGGCGTGATCATAGCTCACTGCAACCTCCAC 
AGTGATCCTTX^CCACAGCTTCCAAATAGCTGGGACCACAG^ 
CACCACACTTATTAATGTAGA I I ICC I 1 1 GTAG ATGTAO A rTTCTTI I A C 
AAAGTGACAGCl 1 1 iCAGAGCTAGTXXrrATGTCTGCAGTTTCTCAGAATA 

TCTCCTCC^GTr^TATTTTCG^ 

GGGTTCA TTTTTGAAAATTGCTCTTO^GTCCCACTC G 
CCCAGGAATCACCACCTtmOATTTCCTAGGCATCTTC^ 

XS^^T^^P^ A TTTTTAAAAGCC AA CTATCTCACTGTGG TCTCTTAAT A 
GTC^CCCTCTGGGCTGOrtATTTC^ 

*^ *^^ G ^9T ^ GG ^ AAC ^^^^ TTTTTGTCTCCGG AA ATA CCA A 
55£ TA ±lL l 1 ** 1 ^^-^^^^^ A^SGAAAATTTAACTGGGGATrAATTCTG 
GCT>CATTGGTTGGGAGCCCCCAKTAGATTTTACTA 
TCCCGGGGGCCGCTAAAAG 
> Sequence 843

^«5J33ii*' ~~ 11111,11 •GCCTATTAATTGATTAGOAAAAATAG 

GTAC^CCCTGAGTGAAAGTAGAAAAGAACX^TTCTGGTA^ 

AGT AGAAAAGAACCTTTAGCTTTAAAGGTA1GTCITAATAGAGCAGTCCT 

AAGACAGGTGGTTAGGTATGTGAATGCATXK3CACTTAGAAAAGAATATGA 

AGGAOAAG GGACCAAGAAGOCAGATACATTGCCCCTICIATA a A n A Afrrr- a T 

AATAATO^llI^^ 

ACCTGCCCGGOCGGGCGGTCCAAAGGG GGCTCATTAAT 
> Sequence 844

y^J^^^^^^^^'^CCATAATCTTCAAGACCCCTCAAGACCCCACAA 

AAGTCCAGTGCTCTCT10CACTTTACCAGGTAACCTTCATAATACCGGATT 
GG AAA TCA A A C CTGTC AC T^ACTTTCTATGTOCCTG AG TGACTCACAACT
TCTCTCAACCAGC I I 1 ItCATGTaCCTTGGGCGCGACCACGCTA 
> Sequence 845

GGTACCAGG AAATTGGTTTG ATTGCCA TAGGCTAACCTTGG ACCAATCAC 
TG TCGC CAAATACATGAGGATCCTTATTGG C rC CT TCl ACT AGCAACAG A 
TGGTtTAGAGAACAGTGTATCACAGAGAAATGGGGATCACT ATTATAGGC 
AGATTGAATAATAAAT GTTTJACr CTACTACTCAATAAATA f TTGI TO AAC 
AAATCAAAGCTGATCCCI I 1 1 1 I CAAAATTTTTAATGTGACTCTTAGGGG 
ATGGTGGA TCCA GGAGAGAAGATTAGTGCX^CACTGAAAAGAGAATTTGC 
TOAGGAAGCTCTCAACICCTTACAGAAAACCAGTGCTGaGAAGAGAGAAA 
TAGAGGAAAAGTTGCACAAACtCTTCAACCAAGACCAOCTAGTGATATAT 
AAGGG ATATGTTG A TG ATCCTCG AAACACTGATAATGCCTGC ATGGA AAC 
AGAAGCTGGGAACTACCATGACGAAAC^GGTGAGATAATGGATAATCTTA 
TGCTAG A AGCTGG AA AA G ATGCTGG AAAA GGTGAA A TGGGTGO ACA TCAA 
TGATTAACCGGAACTTTATTGCCAGTCACrCTCAATTCAATAAAACTTGT 
GGTTG AG AA ACG AG ATGCACCCTOG AGCG AGG ACTTCTC A AGCTTACTGC 
CTTGCGTGG AACTG A TGGTCTCCGTGTAAGCCAA AGCCCCCCG A AG AGCC 
TATTCTTGAAAAAAGG 



ACTTTACTTATTTATTTATTTATTTATTGTlTrACTATTTCACAAAACAA 

AATGTAGC1 1 1 CI I AAAATTGTTAGTTAAAATG I 1 1 1 IT I I OTTTTCCCA 

ATAAAATGTAAAGTTTAATATGTGATGGCTAAACrCCTAAGGGGATAAGG 

AGGCGCTAGGAGAATAGGCAGGTTGGAAAAGGGTAGTCGGGACTTGTCCA 

G ATTCTTGTGTGGTAGTCTGGGTAGTCTGT A TATTT A CC A TATGGGCTAC 

AAGACACACACACACACACACACACACACACACACACACACACACACACA 

CACACACCCTrGTGAGCATTTATTAATTCGGAGTTGATGGTGCATAGTTT 

GCGGAGTrjGGTAAAGGATATGTTACTTTTGTAAGTACCTCGGCCGCGACC 

ACA CCTAAGGGC 

> Sequence 847

GGTACTATCGTGTGTGTGTGTATGTGTGTGGTGTGTGTGTGTTTTAAGTT 

TAGCC l I I Id I 1 1 IG > 1 1 1 1 GGTTGGCAGTAACCGATTTTAATGACTAG 

CTI 1 1 AA A AATACAGTACTO ATCATTCTATTTGCCCCTCTATTG ATCCCC 

ACCTCCAAATATCTCTATCAACAACCCACTAATCACCACCCAAC 

AATCAAACTAACCIXTTAAACAAATGATAACCATACACAACACTAAAGGAC 

GAACCTGATCTCCTATACTAGC^TCCTTAATCATTTTTAT^ 

AAACXTCTATGGACITCATGGCTTATTTATTTAO^CC^ 

ATTTA TTAAA C CCT A ACGATGGTCCATTCCCCTTATAAATCX3GTCTGCAG 

A AA T A TTTTGG TTTTCCGTTCT AA TA TTAA AAAA TTCCCCT AA TCCCCAT 

TCATAAT AA T AA GG TA AAATCTTCA TCTCTT AAA A CCCTCTGGTTGTTTA 

TAATTGAGAACTATACTTCTACTTATTTAACCAAATAATCCITGTCGTAC 

TTGCCCGTXJCTGTCA CTTTTAAA GGGCTAATTTCAACACTA CTTGGCTG A 

CCTATCCTTGTTGAAACXXiAGACTTGTrrACCATACTT^^ 

TTGGTAT AAA CTG A I 1 1CI I I GTGAATATTGTAATCACCATAATAATTTC 

CAAAAATACTATCAAGCCTGGAATtrrATAAGTTATAAATCACTGTGGTGT 

37i 



TCTTTCCCCTCATATCTATGTTATTTAATATTAA III 1CI 1 1 1 A ATTGTA 
^^ AT ^^ AT T^ A TGTATTAATTATTATt^TTCTATTCTTAATATATAAT 
TCA>WCCOCHACATGGTTTATCTCTGAGGCGGCrrCC 

CCAGGTGGTGAAACCAACTG<nGAACGCAC^GOCTACCTCCIGTATrAtt - 
GCCGAGTGOACXTGCTGTAAA<XXTGTGTGCGCTXrrGTG^ 
CCCGCTrTGTAGGAGA CCACCTTAC ACTCACTTGCXXjC C ' ICTC TTi AGTG 
GCTCTTTAGAGAGAAACTCTTTCrcCC^ 

J^ AAA ^9p A ^ ATC G<x ^^^ 

TCTTOTTGAC TTCT AACTTTrCAAATCA^ 
TGTTCCTTGATTTrAGAAAATACACAAAAACCCATATTTC 
CTGAITCv I ^AGATAAQAAAGTGOATTIGATCCCCAGTCTCATTGCTTAG

TAGAATAAATOnTJC^CCAG<^CAACACrrrOTAAATTTC 

TJJJ AA Tili^ X ' 1 1 1 AAAAAAGAA^TTTTTTAAACCATCACACTTTTTT 

TCCCTACCCTTT AGATT TTGATAAATGATAAAAATGAGCtXATTATCAAA 

AG AAAA A CTTGTTTTT ACTCCAAAA TGG AATAA TCTAAA TTTCAAATAAT 

GTACCCTGG 

> Sequence 379

S?S!^IH^ TATCTOT ^ TCT tnCGTATCTTAaJTTACTTm 

™ ^ATTTGTATATTATCNAK1>WNCNNGGGGATGGTGTCATAGAG 

GCGGCTACCGAGG>KKXXJGCCGAGGGACTGCTA^ 

AAACTCCA I 1 1 U I C 1 1 A GTT A T A T AGAACTGTG TTTCCA GCTT A G A AAAA 

GTCAAAOCAATGACTTGTAGAACAATCTACTCTCATTTTTT^ 

CTAGAACATGGAAGCTTTAAAAGTGAA7TCKJCTAAATAGGCAAGACCTTC 

TGAAAGTTAACATCTTAATGATTAAAAACAGTAAGTACGCACAACCGAAG 

CGT A G AGTCA CA CTTGCAACAAAA GGTTAC A A T A TTG T AA TGGGCTCTG T 

CCGGJTC I UC 1 1 G 1 CCAGCTGGACXIATCTATTTCATCCTCCrCCTCTGAG 

GTCTCATTTA^TTGCTXIATAACAGTAOAGATCAGTTGTCTCT^ 

^<^TATA TTTATG<^TGTAGGGTGTTCTtXATGCATGATTACAG 
CTGOGJ I IC I C I ACGTGTTCTTGATG ATCTGGAACAAGACATACCTCG AC 
CGGGCCA CCGGCCCCTT A TA TT A TGG A A TCTITGCTTTTTr.C.rr A C, a nrrr 
CTTTGC1 1111 ICAGGACACAAGGGCTTTTGACAGGTAATACACCTAACC 
TTGCAGTGACGGTGGT 
> Sequence 380

TCGTTCpTTTTTATCT^TTAAO 1 11 1 I C I 1 1 1 A TACTAGCTA TTGTA 

A T A 2j™ A in*r ixn ^ 

A TNN>nV TTGTC ANATTXnCTACTGAGGCX!XXjCTtXG AGGTA OOTTAdrT 
CATTTTCCCTTAACKXKKnTGTGACCTt^^ 
TTCCAACACTTGGTATACACrTGTAACCCAGCITTGTTAATGAG 
ATCAAAATCAGATGAACAATTGACXKXTIGTTTTGCAGTC^ 
TAG^ACAGTTGTAGCACTGCAGGCTATCTCCTGAATCjGCAGAATGACAGT 
TT^GACGAGCT AGTAATCTGAACAGGACAG AACTCTl J 1 I G TATTCCCTA 
TTGTGAT KST1 CACAG AACTACrrGTGTAGTAGGTTTTAACTACTACACC 
AATTGGTGGCTAAAOACT G T C G1C I CC I ATTTATCCTTTTTTAGCCTCG A 
GCCCGTTTATTCCCGCG1 1CCI I C<T CXKK3CTOGCCGTTCTAGAACTTAG 
TGGAATTCCTTGGGTXrTGCTTGAATm 

TAACTGAGGGGTTAATTAGCTXrrcCTTGT^ 

GTACCCTGGGTG A AAA TGTTATTCTTTTA CAA TTTACATT A CAACA TACG 

A TCCTGGCAGCTTT AAG TTTAAAGTOCTGGGTT 

> Sequence 381

TTAGATGGCTCACCGCXKrTrJGCGGCCGAGCTA^ 

JOCCTTTG^GCTAGCCTCGC^^ 

GCCTTCAGAACTTTrGAGACAA 

GGTACCTGCCCG 

> Sequence 382

CCTCTCCTCTCCTTAC^ 

TACTC1 1 lATATATTTATATTTGTATTATTTCTTATAA ICI 1 1 1 I A frcc— : 
TATTTTA TTACNANCAG GG I KnCCTOOTACCTC M Cl IO 0 OOGNGO0GOC 
CGAGGTACIillllJll OIUIbliUll l lllU AGACXMAGTTrCACTCT 
TGTGGOOCAGGCTGGAGTGCAACCACACGATCTCAGC^ 
7GCjCTCCTAGGTTCAAGCTATTC 1 CCC ICC I CA OCCTCCCAAGTAGCTGG 

^^ A ^S^TSE A ^ A S^ CCtXC ^^ 1 1 1 1 1 11 G GATGTTTA 
GTAGACCTGGAGTT7T£TOC^^ 

CTTAGGGGATtXACCTGTCTCAGCCTCC^^ 
TG AGCCAT AA CGCCCGGCGGCAA TAA TTCTT AACA G ACTACA TG AG TA A T
TGCATAAATGGACGATGTCTTICCTCTACrTTTAATTTC^ 
TTATTTATA AAATGATC1 C 1 1 1 1 » AAATOATCAGTTCCTACATTnTATT 
CC TTAOAAGCC ICI 1 1 ICCCI i II 1 1 1 1 I I CATCTGTCCCAAAATTTTG A 

CACci i rcrn i aat tcagttattaagcca ci i i icig ao i i i i n ica ta 

A T AACACCCTTTT ACGG A CCA TGTTAA TN 
> Sequence 383

ACXXICTCTTCT C T G I I C 1 1 1 ATTAAATTCCATGCTAAATTTACTTATCGT 
GTA CA T A GGTCTT A ATCTA AATTACT A CC TOGA TCCCCA CA T A TCT AA TT 
CTTCCKNNNhWNAAGGGATGTGCTCCTC^^ 

CCC A N A TTCGGG TG TGGGACA CGGCTCTCC A I rCTTCTTCl IG GCTTTAC 

AGGTTCGCAGGTCAAGAGCTTCACCCA TAA TTAAG AGCTTCTG AGGATG A 

TCGATAAATAAACACACCTCCTCTTAACCATCCTTGGGCTTCATGGGGGT 

GGCATTGAGGATCCCTACAACAGGCtXXTGCrrGCCGCTTCCAAAG 

TTGGAACTTCCTCCAAATAAGAACAAGGACACACATTGGTGTCAGGGTAC 

GAAGATCATTCAGTTTCCATATGCTTJAAAGGTTTTTCCACT 

CTTGTGGCGGTAACCTTTTTICAATATTAACCCCCAAATGTCACC^^ 

CCT ATTTCCTTCXAAGCTTCTTTTXriXXKXCATCI 1 1 1 ICC 1 1GAATCTG 

AGACAAGTCTGATCCAAGTTTTCC<KXXXJTCTAAA 

CCCGGGGCTGGAAGGAATTTCCAATATCAAACTTT ATCTGATACCOGTCC 

AACCTCCAAGGGGGGGGCCtXGGTACCCCAACTTTTTGTT^^ 

AAGGGGTAATTTGCGCGGCTIGCCGTAATAATOGGCATAGCTGGGTCCTT 

TGTGAAAATTCG 

> Sequence 384

AGACTGCAGGAGATGTGGGCCGTGCCAAAGAGATGCIATGAGACTGTTGCT 

GAGTTCATX^AGAGGACCATCTTGAAAATCCCCATGAATGAACTGACAAC 

A A TCC TG AAGGCCTGGGATTTTTTGTCTG AAA ATCAACTGCAGAC^ 

ATTTCXXiACAGAGAAAGGAATCTGTAGTTCAG<ACTTGATXX^TCTGTGT 

G AGG AA AA GCG TGCAAGTATCAGTGATGCTGCCCTCTTAGACATCA TTTA 

TATGGAATTrCATCAGCACCAGAAAGTTTGGGATGTTTTTC^GATG^ 

AAGGACCAGGTGAAGATGTrGACCTTTTTGATATGAAACAATTrAAAAAT 

TCGTTCAAGAAAATTCTTCAGAGAGCATTAAAAAATGTGACAGTCAGCTT 

CAGAGAAACTGAGGAGAATGCAGTCTGGATTTTJCAATTGGCTGGGGAACA 

CAGTACCCT 

> Sequence 385

TACCKXTTACCTCACCGTCTCG 1CIG1 C I ATATACTTGTACTATCTANTTA 
CTAACTAGTCTCGTCTTCTANCACACTCTCTCTTC 

ATTATCTCGTATTATATATCrCATATTATTNGATACTATCATTATAATTT 

AATATAANAAGTATCCGTTGTGCTTCTACXKXXKKKXrrCi^^ 

CGAGGTACTCCGTCTCAGAGGAGGGATGCAAATCITCGTGAAGACAC^ 

CTGGCAAOACCATCACCCTTGAGGTCGAGCCCAGTGACACTATTC 

GTGA AAOCAA AGATXXAAGACAAGGAAGGCATTCCTCCTC 

GTTGAICI 1 1 GCCGGAAAGCAGCTCXjAAG ATGGGCCCAOCCTGTCTGACT 

ACAACATTOIAGAAAGAGTCTACOCTGCACCTGGTGCTC^^ 

GGGATGCAG A I C 1 1 IG1GAAGACCCTGACTGGTAAGA 

> Sequence 386

CAGTGTOGGCCCTTTTOAGGTCGCGGTCCCC^^ 

AAAGGGGAATTTCCATGCCGTCTACAGGGATGACXnTji^ 

AG ACCGAG TG TCC 1 CAGTATATCAGGAAAAAGGCTCO^GACGTCTGGTTC 

AAAGAGTTGGATATCAACACTGATCGTGCAGTTAACTTCCAG 

ATTCTGGTC^TAAAGATGGGCGTGC^GCCCACAAAAAAAGCC^ 

AAGCCACAAAGAGTAG CrTOAGT TACTGGGCCXlAGAGGCT^^ 

CATGTACnrTCAOAATGT 1 1 G 1 CATATGGTTCTTGCAATGCATATTTTTT 

AATCTCA^CGTTTr^TAAAACCATTTTTC^ 

tcaaatcgagtaattcagaaaaactcaagatttaagttaaaaag 

GGACTTGGGAACAGGACTTTATAOC T L I 111 A CTGTAACAAGTACCT 
AAOCAATO pgtcoci aattgatctccacttcncacxxw

CCACCCGANfnTGATTCTTCACTOCTKAGCCATCGACOCCAGGCTCOCTT 

OTTAI rO TO OOCI I IC ItrrCTCAAATCCTTO OC I lOl I N I I C AAGAGAAC 

CTCTCTCGGTCGTCATT G CTOC A TCGATTOCATOCA G I OC I I CI I CA AAN 

CATTCTTCAAGGCACTTTAAKGCTAGCCTOAAAKOGCTTOAA 1 UX 1 1 GC 

TAATACTATlXX^GTGTXiATXrrGAGAGGGlXXn , AOOCTCThK«OOOGOCTC 

TANGAACTACNCn^GGAlTXXXKXNGAGGCTGCATTGGAATTCNGAATATC 

HAyMGCTTATTKGAGTAOOOOGGCNGACACCTCGACXKXjNGCGGGOCTCC 

NGGTACTCCAhKMTrrATTNGTTACAOCTTATAAGTNGACTGAGTITAACT 

TNGTCGCACChTTATAGGCNGTCAKTACAATAGTGTCAATAOGOCTTGTNT 

TGCCTOJGTTGTGAGAAGTTNGATTATOCTG^GTCAACTAATTGOCCACA 

ACATACAATAOC3GACGO000GCGCAGGCTATAANANGTOGTTAATAGCTC 

TGGTTGCTWGCCTHATCTCGAGGTGAGGCTAAAOCTCAACAACTTAAATT 

cctcgtcggttxxxacacttggcgatt™^ 

ctcgcxxk»tggagag agcgcgggttnacact atttag aggcgcttagttc 

tcg ci » icl l 1 ugactcaatntacl [ 1 cocttgcgcttcagggcgt atca 

«xrrtogcggccaaga<xxjtaatcatactctcatxrt^^ 

atagcgcgttatttcaacantatatcagtgggataacogcaagtaaataa 

cactttoagc acaacaggooogcacaaggccccatacccgggaaaagcgg 

AAAATATTCGCACCTCTAAGTACAAGGCG 
> Sequence 388

OOGCGCrrTACACATTGAGTGCTCCTrTOCOCNNCCAGNOGAGNA . 

CCCXZAGGGAGAGATCAAAAATCATCACCAACCATAATATATCATGGACTA 

AOCCCTAAAOCTTCTCCTTAATGAATTAACTACAAATAAOGGGGCAAAGA 

GAGOCACAGCTAATACCOCXTAAAGCACACTAGCTAGCTAAGAACAGTAA 

AAGAGCACACTCTTCTATGTAGCAAAACTAATGCCAAGACTTATATCTAG 

AATCQaGAAACCTACCTAGOCTO O TOATAGCTGTCTGTCCAAGAAAGAAT 

CTTAClTCAACrrTAAATTTGCCCACAGAACCCTTTAAATTDOCTOCTAA 

AATTAACTTMTAGTCCAAAGACGAACAGCTCTTTGCACACTACGAAAAAA 

CL I D0 1 1 AAGAAGAGTAAAAAATTTAACAOOOCATAGTTTGOOCTAAAAC 

GCAGTCACTCATTTAACAAAGCrGTrAAACCTAAACAOCCACTTACCTAA 

AACAATCOOCAACCATA TAACTGAACTT ACTCACA CCCAA CA TOGA CCA G 

ATCTATTACCOCTAAAGAAAAAACTAATGCTAAGTATAAAGTAAACATClA 

AAACAI 1 1 I 1 UL 1 LG 1 LATAAGCCTG ACTTCAGATTCAAACACCTG AACT 

G IC'T TT I AACACOOCAATATCTTCCATCAAOCAOCAGGTCTTTATTACCC 

TACTGTCAAGCCAACACAGCATGCITCATAAGAAAGGTTAAAAAAAAGTT 

AAGGAACACTGCAAATCTTAACOOOCATTTTACOCAAACACTTAOCTTTT 

AOCTTAOOCAGTATTAGAAAGAT0CTTC1 1 1 OOCAAGAAAAATGTTTAAC 

GGXjOOCrrAAAAACAACTGAATOOOCCGGCTTCAATAATTCAATAOC 

> Sequence 389

OGAG ACTAG TGGCGCTCrrGGAGGTOQCOGTTOCTCAOGOCTGTAATCTC 
AGCACTTTCGGAGGCIGAAGCAGGCXXjATCACGAGGTCAG<MGTTTCAGA 
CCAOOC roOOCAACATOOTGAAAOOCCO OT C TC I ACTAAAOATACAAAAO 
TOOGTGTGGTGGOOGGCACCTOTAATCXDCAGCTACTTGGGAGGCTGAGGA 
GAAQAA ICQ 1 1 It3 AAOCTOOAGOCAflAOGTTOCAGOGAGOCAAOATCAOG 
OCATTOCACTOCAGOCTGOOTGACAOGGCAAGACTC TOTC I OCAAAAAAA 

aaoaaaaaaggaaaaaaooctttcl lg atgctottoooca ft tci xx act 
aaaacgccioci 1 1 ic'i i aactccacaccg aaccaacctgaaatattttg 
goccagaatgccaacaagaattgaagaaaagatgctttacaaaaataaca 

ATATAAAAGCAAATTATATTATCCCTTTTATtrrCCATTCTTACATTAAAA 
AAAAAAAAAT 



ODCAATC1 1 1 1 » <X I UGCGAACGCGATCTCTCTGTACTTT ATTTAATTTT 



TCGCTTAOGGTGCGaTATTT 

GGGCGGOCGAGGTACOCGGGATGGGA1 1 ICIG 
ACCAI 1 1 GCOC I UCCTCTTGCAAAATAGGTCTAATGGCAGGATGGTGTCA 
TAATTaaGGCT ACCAAGACTOCOCATIX r rfOCAGOCTGGGCAGTTCATAA 

tgggggcagacaatagtgcaaaaaaattttacattttatctttagagtgt 
cagggtcaaattgatttxxatggttgaggatgtagccaagtgtggaatca 

GGTGGA AT AGGTGG agag ttgccca t agtgg tttgg a A A AG a g a a G a G g a 
ctttgaaaagtggagggctcattaggtgacccaaattttacctggggcat 

CCCCCCTTTAGGGOCCCAACTTAGTCTCTCAGACATC^ 
GGGTGC TGXX ^CCACTTTGGAATGGTTCCCTOCATCACTGAGGACXrrGAC 
TTAAAGI 1 1 1 IL1 ATCTCACTTAAAACAACCCTTTAACCCTCTCAACTTA 
QGCAATAATAAATTCC1 1 1 1 CaTGAATTCCCTTCAOCACCATGCACCACA 
CAGAOCACATGCOCGGaCCCTCTG AC 1 1 C 1 OTAACCTTTTCTGCATAGCT 
AGGTGGGGI I I C I O OCCT 
> Sequence 392

CTT ATATTGCCTTATATTTTA TTA ATACTATA 1 TTTI CI CA CCG I ' I n II 
ATCCATAAA 1 1 1 1CI I G TTAT A T A TXXTITTTGAACACTCA T A T AA TTTTA 
TTATNTTAMTATTA fOTTTOI AGCG ATTCACTCT 
> Sequence 393



CCGGGCAGGTACAGGACACAGGCACI Ij CI I \ OTCTGGTAGAG AGGAGG AG 

gggaaatggagctattccaggatac^ 

gcrrggggagctag^tatcagaaacctcaaggatccagcccxxrtttgtdc 

TACTAGTGTCTA" 



ct a gaaggga tg A GGTGCGA TCAAAAA T AACTTGGCTGGTGT AACAGTTT 
AGAGAAGGAAGTCACACCTGTAGCCTGXfCTGGCAGGCAGGTOGACATGAG 
GCTGAGAAGGGAAGOCAGATGTCAGAACATACTAGGCTAGCATGCCfGCT 
> Sequence 394

GGTGCGCTTACCGGGTGCXXjGCCG AGGT ACCAGGCTGGCG ACAGGTQCT A 
CCAGGAGTGGGCTG AGGGGAGAAAAACTATCTOCCA C I C 1 1 I IO GCCCAG 
GCAATGTCAACGACTTCCACAT 
GGTTOGGCTtrrGTATAAGGACCCTCOOCTl 
AGTGCAAATCAACCAACAATTTACTGGTGGAATGGCAATCAAAGGAAACA 
GTT AAACAOCAAACAA I UUI I AAAGCCAAAAA ATATTTTTCATGG AGTT 
GAACATTTTTCOAG IU I OTTTTTI 1 CAAGTGTAAAAGCAGTGACATTTTG 
TTCAAACAGAAGCAGCATCTAGGAATTCTGGCACTTGGGTTCTAGGGGGT 
TACAGGTATGCATCATOGA 1 1 C I IC 1CCCTCGTATTTAAAAAOA 

GG03AOCCTTATCIGGlTKxOGGCCGAGTACTTCATTTACACTTAAGCTAG 

AGAGTTAGGATClTAATTTATTrAAAGCCATAGATTCAGTTTAGCTTTAA 

CCTAGACAGAAAGTGAAAAGCATTTTACAAGTAGAAGAGGCAATGAGAAA 

TAAGG CAACAOAT AATACGTCAA AGCTGGAACAAGGGCAOAATCAOAACO 

TGTCTOGCTATCAGCmol 1 1 1 1 OACTACf AAOOOCAAO CT T TTTA TTC 

CTCIGGATGGTCTCCAGACCAAGTrCAGAATTTAGGCAAAAGGATTrCCA 

AATGGATCCCTATACATTTTCAGAAGATTCAGGTTGAGGAAGAAGCCACA 

GAGGGCTTGTGATGAAOOCAAAGGAATCTTTAAAGAAAGGGGTTCTCAAA 

ATGCATTOGOCAGGTAGA M TOOT 1 A ACTTGGCAOOGAAAACI ICIOC TO 

GGGAGC 

TACGGAGCCCGGGGAGCCATAAAAAGTGTTAAAAGGCCTGGGGGGTGCtX: 
TTAATG^MGTGGAGGOCTAAAOCTOCACAATTrAAATTOG CXj 1 1 1 1GOGG . 
CTCAACTGGOCNCGGC1 1 I'll OOCAGTACGGOGGAAAAAOCTOGTOCGTG 
> Sequence 397

CTCTTAGTGGAGGGGTTAAATTOGCG<X3GCCTTGGGCGTAAATCAATXXXi 
TCCAATAGCC I GG 1 1 1 1 COCTG TOG TGGAAAATTGGTTTATCCCGOCTCA 
CAAATTTGCCACCACAAACCATTAOCGAGGCCCGGGGAGGCATTAAAAGG
TGTTAAAAGCCCTGGGCGGTGCOOCTAAATGGAGGTGGAGOCTAAACCTG 
CACCATTT^AA^MLWU^^n^ 1 GGOGG^TTCAACTTt>GOCXtJL»C rTTTTC 

> Sequence 391 

GGGACCACTCACCOC?GCGGCGGC0GAGGTACAAAATTTAGAGGTITCtX3C 
JTTATCAACAAGAGAOOCAGGTGCCAGCATGTrACTACCAGATOCAGTrC 
ITCI 1 AGGACAGTGTGGCTCAAAGGG ATGAGACCTTCCAGACACTGGTAT 
CTGAGCATCTGGGCCTGCOOCTGAGTTGTCAAGAAATTTCTTATCTC^ 
AGOAGTOCAOACAOGAATGCTTOCACTPCTCGGTtXKrrGC 1LXJCCOCTCT 
TGCTOCTTAMfCGCOOOGCTCAOCOOCTTOCTAGCACAGO OIGI CI I A CA 
CACTTTATGGGAC 1 1 I ICTGl GAACTAOCTGAGGGCAAGAAOCATGTCOC 

actccl roci rocro ci ca aatattttataggaaagcagtccacagtctc 

ACACAGAGGAAACATGAAGTTTAAGTTCTAGGCCTATGA 



GCCTCCTTCGOCTTCTATCTCCCTTCCT 

T ACTCa TCTCT TC'T I CTT ATACOTATTCTATTATTTOGTTTCACGCTCAT 
AGTGTATNAC IC1 1 1 1 1 AATAAAATAATATATGGGTTGTGCGGGG AGGOC - 
G OCOAGTACrO GGGGAGAGAGGAAAAGAACACAGATCTOGCATGGTTCAG 
A 1 1 1 I ICI 1 1 1 1 AGGTOCAGGAGTAAGATATATCATACGAAAATGAAAAT 
TATAAI I CI I CI I GGATTCCTGGG AGCCACATTGTCAGOOOCACTTATCC 
CACAGOCrrCTCATGTtrrGOC^CCAATAGCAATGAOT^ 
AATAATGGTCAACTTTTCCCACTACAACTTCAGGGCCCACTTAATTC 
CATTCCAUJI I ICIC IGGAATTTTACAACAGCAGCAGCAGGCTCAAATTC 
CAGCACTCTOCCA GI ICIC1 1 1 ATCAGCTCTAG ACCAO 1 1 rOCTGCACTO 

ctcccaaatcagatacocttaacaggagaggccagttttocccaaaqgag 

OOCAGGCAGGCCAAGGTGATCOCTTAAGGTTTTAAAACACOOGCTAAGAC 
ACAAOCAGGOOOCAATCACGTGAAGOOCTATGTAT IC roc I rC AAAAAGC 
CTAAAGAGGCAGGACAGATGTTTAAATACTATTCCAGTTACATGGGCCTA 
OCCTGG GAAOOCO CTCAGAAACaGGTTCCAGGGCACCTTAAOCAAACAGA 

- aoggtatcig in ggggagoocattcca i n nurn aaago - 



I U 1 U I ATTGCOQAGGTACAGACAG ixsci I gatgttcataaaaaatacaat 
GCOCTCGT AATOTC TGCATITAACAATGACCCTGO C 1 1 rCTG G CTOCTC l 
TGATAAGGC I nSTGGI CGCIl CA T AAACAACAAOOCGGTT ACCAAG A TGG 
COCAAT^CATOCA^AAA^^ 1 CGATACTOTOACTOC 

AClCAATCAACTOATOa I IGTCI I CAAOTAOCTGCCOGGGCOGTDGAGOG 

OOPGOOOGGGCAGGTAOGCGGGGGCTAAOCAGGOCAGTCACAGAAATOGA 

TTOGA AATA CCAGTGTCTGAAGCTXfAATGATGGTCACITCATGOCTSTOC 

TGGGATfTGOCAOCTATGCOOCTGCAGAGGTTOCTAAAAGTAAAGCTCTA 

GA GGOCGTC AAATTGGCAATAOAAGCOGGGCrDCACCATATTGAGTGTGC 

CCATGTTTACAATA ATOAG GAGC A GGTTOGAACTGGCCATCCAAACCAAG 

ATrG GAAATTOGCA TTTTGAAGAGGGAAGACCTT AATTTCCATTCAOAGG 

C1TOGGCCCAAATCCATTCTACCOCOG OIGI 1 1 ICA CtXlAXXJITOAAGG 

GGGCCTCXAAAATATTTCATTATOCCATG 

> Sequence 401

GGTOGATCXJGCGGTGGCCGCCGGTTGAOCTTGT ' , 

GAGA43TCAGAGGATGAAATAGATQAAOCOGACCAIZ3CAGTTAATCACCAA ' 

catcaactactagocagaogggatoaaccacagcctcacacaatacagto 
I fOCTC I igtaa gtgtaacaacacactgcagctggtagtagaagoctcac 

CGGATACrCTGCOACAACTACAGCAGCTGTTrATGG^ 

G 1 G I U I UJb I GG IG IGCAACIXXlAAAOCAGTAACCTGCTATtXiCXAATrG 

TGAAGAGA TGGGAGTCTCOOOGTA TTGOOCAGGCCGGTCTCAAACTOCTG 

«3<^^ACX^TCritXX>SCOOCACT^^ 

G i UAGOC^CCGCAGOCAGOCAGAAAAAOG 1 1 ItAAATATTCGAAAACCTT 



Table 2 



ac 1 1 1 1 1 1 caatg aocatttttocatcaaocggtaacaggoacattaogc 
1 1 1 1 1 1 1 ici 1 1 raaacttocaacaggaaoggtcggaatttatcaagaca 
ttacataggagttagggcacaoccaogog TGG TGGT G GGG ago acatttt 
gcagocttattaacagggtttattataaacagggtggggocactacttgt 

CTAAOCTAATTCCAGGTCAAGATGTGT 
> Sequence 402

GCGATTGGAGCTCCCOGtXXrrCKXXXXXXXrCXX^^ 
TCTGTCXKjAAAAACTGCTCTCAGAGTGTGCACTCTCCOCACAAGCCAGCG 
CTCAAACTGG AAAAAGTATCTCAA f O I IX 1 OA ATGTGGGAA A ACCTTTAG 

CCCAAGTTCTTA rcTron oogcatcaaagaatdcacacagoooagaaoc 
ctcacaagtgcagtgagtgcgggaagggctttagtgagcoctocaaoctc 
actgoocacct acg aactcacacaggggagaggcoctatgagtgtgogca 

ATtnXXX)AAAAGCTTCAACCAGAGTTCCAGCXrrCATTGTCC 

gccataooggggaaaagccttaccagtgca t tg tc tgtggaaagagattc 
aacaacagttoccagttcagtgctcagogggg 

> Sequence 403

aggtacx^aattaagtattaaaatgaggattoaactggggcaaacaggtt 
attgtg aaaacagtcaatatgtaagci IX'I I caagggaaatcaactactg 
ttcctcaagattagaagatgtccaca c tcti t g cattacctocctaaagg 

A GGAAACACCCaTTAA TTTTCXXTT A TGG AATCAAT A TGG AGTGG AAA T A 
TGAAATGAGGAGATGTTTTAGAAAGCAGGACATATCTACCTACCATTACT 
GG AA TTAAAATCTATCCTC1GGGCCCACTCCATTG ATTOCG ATCTGAGGT 
GAGGAGGACTAAAAGCA GCAGCAGGTTACAGAAAG ACTGAATAAGATG AA 
AGTATGCTACGTATCrrCTAGCICGGGAAGGGGGGATCTGGAAAAAA 



TGGGGTGAGG 1 1 IU AT1 
AGGGATCOGCGOGGAAGCTAGCAOGCAGCCTACCCAACAGTCTACACAGC 

cgaccaaagcccxxccgtacccagaggagtcgctgctgagtgggagctca 

ACCCrGTrCAGTGCTCTGCTCATCAAGT C TC t GG AGAAGGAGGTTGCGGC 
ATTCTOCAGATACACAOCOCGCAGGAACATCCCTOC1 1 A 1 1 1 rCTO GCTT 
TGGTGCCACAGG AAG AAGAGTTGGATG AOCAGAAAATTCAGCTG ACTTCT 
CCAGGCTTCCAACTG ijICI I I I IACOCI 1 1 OC 1 GO I UATAAAAGG AAOAT 
GCCI 1 1 1 1 C I OAAAAAATTATGOCOCCTCCAAAAOCAGGGGGOCATGAAG 
AAGTGTTTTTT AAG AAAA TGCTTTTGCTTAACAA T A CAG AAGGTGCCA TT 
TTTAAAAATOO OOCTTS TCTOCATrAACCATTT^ 

TTGG^OCCTTGTATITTTATGGGAOOCGGAACATAGCAGGGTTCCTAACTT 
TOCOCAAGTGTGAAG CI I IG AATAAATGGCCGGGCCI C IC IG GGTGGTAA 
TTATAAGGGTTGTT G 1 1 ItXXXCCAAAAATTAAT I1 T I I U GAGGGTAATC 
T 

> Sequence 405

GXXMGTCT0TAGATOCCACT0CGCGOTX3GCGGCOGAGGT AOGOGGGGGGC 
GGOGGOGGAGAGA GC1TXjCTCAGGGCGTOOGCTAGGCTXDGGA£CACCTGC 

TGAGCCTCOCAAA OOGC TTOCATAAG 

CAGTGTTAGCTAAGTTTGGAAAGAAGGAAAAAAGAAAATI 
rTTTCTITTCTTCTTTG 

CTOI rCTCI 1 1 1 rA OAGGTGCTATCTCCA GTTOCI ro CA CTCCTCl I A AC 
AAGCAOCTCAGOGAGAGCAGCAGCaGOGATAGCAGOCGCAGAAGAGGCAG 
CGGGGTOGCCTAGTGTCATGaGCAGGGCGGGAGATCACAACOGOCAGAGA 
GGATGCTGTGGATCCTTOGCCGACTACCTQAOCICItj 
CTAOC1 l UOI U^ r iCICIC I C IA CmSGGG AG^ r UjGA TGTGGCAi LI I I G 
OG GTGTCTGTGT rT Xr rC GTAGAGCTCTATGGAAACAC OCTOCI I I G ACAG 
CAGTCTA 

IATGCTA 



TTCACA GTC C 1 1 GATGATAATAAATGGTTATTTTAC IGG M CATGTATTT 
*«. i a i A 1 i_A 1 ACT i r rrTTCATTACAGTGTGCTOCTTCTACTT ATGTAAA
AAAAAAGTTACCTCAGGGAGGTTJCrTCCTGAGGT^^ 
TTCTT A TCA TAG AAAA TGACAGCTCCA TX7TTJTTJTT ACTGGCCA TTACCAC 
CTTCCAGTCGGAAGGATGTGGAGGTDG^AAGCATACroATCATTTTGTOC 
CCGTCGAGGCCTAAGCTAATGTG I U I G I HUIOlLl t AGCTTTCAACAAA 
AAAAAGTTT AAAAAOCAAAAAAAAAAAAAAAAAAAGTACCTGCCCG 
> Sequence 407



CCATTTCTCCAAGGGGGG 



TGGGGCGTTGGCCCTCTCCCKXTGGCCGCCCGTC 



GTACCTOCACT GGCTG AAGTCTCTACATAGCTyrCAGGAACCTTCGGAAA 
GGGA 2°^ CTCT ^ACCAAACTTA^^ 

I CA IS!I CAGGAAAACX 1 ' ^- ILI [CCTTCATATACTCATGCTTGTGTTA 
I GG J? CCAGCCTACCG <^T^^ 

cc i£ G J ACCTOCCGG ^ G ccc^^ 

A CCTCCACACTCTCCCGG a ctaggtgg tc aa ca g GGCC ACA GGG TTGCTT 

TCTGTCTTTGGTGGGGCAGGGGAGTTGACAGGGATGAOGGTCCAAGGAAT 

TAGCATG AATGACA AGATA^CAAGGGAAAGAOTTAACCCTCTCACATAGT 

AG ^^ AA ^^^^ AG ^ G ^^ C ^^ G ^^ G J^CCTT ATTOG AACTTTCACTG 

GCTGAGCCAGATCACGGGAACCTGGGAGCTTTTACTGT^ 

AAAAAA TTAACA ACAATGTCAACTGGGTTGGATGATTTGTTAAGGCCTTT 

i AG j£i AC i mA ^ TAACAiT ^^ 

TGGCCGTTTAAACTGGGGTCOOCCGCCTGGG G 1 1 IC l FTC AA CI 1 1 IC l I 

CCCOACTGGG 

> Sequence 409

CCACTOGCTTCATCTATTTCTATTTATCCATATAC TC T G l'I G TTCrTXKjC 

gctatatatttgt gtat taacta l i rrrrmrci i c txactaattttgt 

GATeTTACCTAATATTTTCTTCACAA l Cl N 1 1 1 C I A TATTTTTTTTCGK AA 
T TTATTTTTCCTC ATCCGGTGGCGGCCGAGCACCTNATTTTTTTTATT^ 
Gc^. » i 1 1 i i OGCGGGAGTTAAATAAAATAAGCATCTCTTCATCCTTTAT 

ATACCAGACACrTCTTAAATGATTTCCCTTGGT^^ 

H^ili. 1 ^— tGCTTTTCAGGTAATTAACTCT lLILl 1 1 1 1 A GTTTGAA 

AAAAAAACACTTT ATATTATGCCAGG TX3 AGGTGTCAGAACCCTGGCTTCG 
GAAAGTOG TTGGCTCACCC OGCCT 

A li . 1 » " 1 1 ICl ICIGICCIG CATO Ol I ICl ILIUM Id IC 
TATnTTCXXXXTTTCTACA TAAAA TTCCACTTTTTCAA A TTTTCCCCATC 
TTGCCTTA1 1 1 1Q 1 rjAGI 1 1 ICICCI I IGTTTCCACTCTTGGTTGAATT 
!"" '*"" i CATTGTTJLI ICl 1 1 IX! I TTTTTACAAGTTCTAOQCTAT 
CCCAGOTTTTTAAAOO Ol 1 1 II I OJ 1 A ACTTTTTCCACTCGGTTATTCAA 
TT 



TGTACTGATGCGTCGGCCG^ 

ATTCTAAATOAGGATTATGGA 1 1 H ICI GGAAGAT TCTI 1 1 1 IC CTGTGG 

A ^7 G d^ GAAATC ^ A ^<^°Q<^^ 

A J A J A *^ ATATATO,GIU " A TGT^^ 

G J?^?Z AC ^ C ^^ CG ^ C ^ CA ^^ 

GCACAACTTC^CTCTATATITATTCTC I OCCTTCCCTX3GGGGACTGATCC 
^^■^^TCTTGTAGATAOCACATOOGGXJGGTGCTCATGTOOOCTOnOCC 
AATAGCTTAOTCCG<XniXK^ 

TCCTGGCCACATTTGTTAAAOCA 1 1 1 1 1 1 1 ICl 7 A ACCTACAAAATATTT 
^t?^Tt^$^ CCGCCCICT ATCAAA7 ^^ CTACTA ^TCAGG AA ^jcGCA 
CGAGAATGGTTGAACCCGGGATGGGGAGCTTCAG<KKXXXrrATAGCATGC 
TGTAGATCGTGCGGCGGGTACGCGGGGTGCTGGCATTACAGGCAOGAGOC
AGTGOGCCCAGCTOCCrorG 1 1 ICl 1 1 1 ATTAGCTGATCTGGACTG AGGG 
GCTOCn uAGCAGATXXrrtn"ATTATGGGGATAAGCCACACACTTTCTGAA 
CTp^GCOCGGTCAGOGGGGACATAAOCATITCCTGTGCCAOCOCATCAGTA 
^^Q^ ^^QTCAGCGAAGGCTCCTCCOCTGCTIGAGTAATGGCCACAG 
ATCTTGGCTCGGCACTC^^ 

A CTGGCTCG tgg ttcc a ttctcca GA TCCTTGGGTTGGCTTCTGGG tcca 
CTAGG AGATCTGA A ATGCTCTCA GGCCACCA GG AAAG TA CTGG AAGT A AA 
GTeTCACTCTAAAGAAGATGAAAATCTAGTAATTAATGAAGTCATAAATT 
CTCCCAAAGGG AAAAAACGCAAGGTAG AACATCA GACAGCTTGTGCTTGT 
AGTTCCTAACCACGCAAGGATCTGAAAAGTGTCCTCAGAAGACTACTAGA 
AGAGACGAAACGAAACCTGTGCCTCGAGCGGTCCGCCTGGCAGGTACAAG 
J^GTAGT AA A ACAAAGCTTA AAG TTTTTTCA TCTTTCTACAGCA A ATCGT 
CAGTTA TTT ATA A ACCT 
> Sequence 412

GTTG ATGCK^G CGCCGGCAGGTACTAGAGTTTTCAAGTATGTTCTAAGCAC 
AG AAGTTTCT AAA TGGGOCCAAA A TTCA GACTTG A GTA TG TTCTTTG AA T 
ACCTTAAGAAGTTAGAATTAGCCGGGCATGGTGGCCCGTCKXXX3TAGTCC 
CAGCTACTTOAGAGGCTGAGGCAGGAGAATCACTTT^ACCCAGGACKnW 
AGGTTACAGTGAG<^GAGATXXJTGCCACTGCACTCCAGCCTGGGTGACAA 
GAGAGACTTGTCTCCAAAAAAAAAGTTACACCrAGGTGTGAATTTTGGCA 
CA AA GG A GTG ACAA A CTT A TAGTTA AAAGCTG AA T AA CTTCA GTG TGGTA 
TAAAACGTGG II li l AGOCT ATGT7TGTGATTGCTGAAAAGAATTCTAGT 
TTACCTCAAAATCCTTCTrCTTTCCCCAAATTAAGTGCCTGGCCAGCTGTC 
ATAA ATTAC ATA TTCC l J I iGGTTTI I I I AAAGGTTACATGTTCAAG AGT 
G AAAATAG ATGTTCTtjGTTGAAGGCTACATGCCGG ATCIGGTA ATG AAOC 
TTGTAATGCTGTAT7TGCTT(^CGGCTTACTATAAATG7TACTTAATAC^ 
TATCA A CTT A TTA CAA TTTA CTA T A G A GGG TA TAA GT A A ATT AA TCTCT A 



TGG ATGTGTGGGCCG AGGT ACCTA G TCTAT A TG A G TTTG A TGCTT A CAGT 
CAA GGCTATTAGCAAATA TTCA GGAAAAGTAAAGCCT AAAGAAGAAAAGA 
GGGAATOAATAGTTTGTCTAGAGATAATAAAAGGAAGGTGAATTTTTAAA 
AAG ACA AAAA TAA GGCT A G AAA AG ACTG AGTGG A G AAA GCCTACAG AATT 
TCAGAAAGCT AAAGAAATTGGAAATTAGATTGAATATAGATAGAAATGGG 

aggacaatccagccaatgaaagactgtggggactaataaagggagagccc 

TGTGGTTTGGAAAGTGTCCCTTAATCAC^XTGCAGTGCTGC^ 

ACXXZAGAGAGGGTGCITOAGAATATACAAGAACCCTTGCGGTGGTGACTO 

AACAAAACtK>GCCAGGCATTTCATCAGAAGCATAATCCAT^ 

GA ^THT^*^"^ ^ t GCTTXjOG AGCTGGT AAO A TACACA CA GGCCCAGTGTCC 

AGTCTTGATTTGATATGCTGGTA 1 1 1 IOC I I C 1 GTGGTATTCTTTTATCA 

AGGACTAAGGGTTCCCATGTtXCTTCGAGGGCATATThnTTCCACCGACA 

<XnTXKXKiTCTAGGCCTACtXJTGGCTTTAACCT 



TCGAGATCTCCATOGGCGOCOGCAGGTACGCGGGATCCAAGATC 
AGAC AAAATAAAGAATOCAAAGTCATAGTCATGAGGACAGAATAAAGACA 
JTTTAJGOCI 1 1 1 lOI I 1 IG1 1 1 lit I 1 14C1CTTTTTGTGGAGAACAGGGT 
CTCTCJATATIGCCCAGGCAGGTCTTG 

CCTGC 1 1 c 1 uCCTOOCTAAGAGCTGGOATTACAGATTjTGAGOCACCATGC 
CCGG^GA^TAAAGACATTTTAAAACTAAAAAAAAAAAAAAA^GAGTT 
TGCTTTGCATTAA I C 1 1 1 1 1 1 1 ICU II 1 1 1 ICOI I II IATTTTTTAGTT 
TTTAi i 1 1 1 1 "i "i GAGACGGAGTCXCACTCTGTCAOCCaGGCTGGAGAGCA 
ATXKXZATGGTCTCGGCTCACOGCAACCTCTGOCTCCTGGGT^^ 
T A J£iE G SS^

> Sequence 415

S^ AA £U^i. 1 ' ,U,C,u< - , I CXJGCTAGCCGATTTAGTTA^CTCAAAGC 
TTCATATAG7TGGCCCCCTGCCAGTCTATAAAAAAGT 



PCT/US00/33312






TGGTGATCGAGACXnX^CaSCXKSTGGCGGCCGAG^ 

I^^3^??I3^ G 5 TG<mACCTCAC 

r^^^r^Tt-J^^^J^^^^ A A ^^^CGCG TGCCAG A GG ATA AA TA T A CTG 

gccaggtgjgatgccgaagaaaaagaagatgtgaaatct^ 
ga J3 aag £ ac ?J aa t tc ^ 1 ^ 

aca^atc^d a ctt aaaa ta tattttcaaaaag aaaaaaaaaaaa agtcct 

> Sequence 417

TGAA hrTTXj ATG CTCTXXGTCTXjCGCGGCG 1 1 111 I I II l i ■ i i 

1 ' " 1 ip^^OOQACTTTTG CICI 1 1 1 1 OOCCGGGCTGG AGTGC 

AATG^CGATCTCCXK^ 
TTXrTXXTXJCCn- AGCCT rTTGGGTAGCTTJ^ 
ATGCCTGCCCAATTmxn^A 

TCCCAAACrrOTTGGC*™ 

GCAAGAAGGGCCAAACOCTTroCCrrTCAATATTACAAA C ICl 1 1 1 1UAAG 
GAA/TTCAAGGAAGTTGAAAGCOCCTTC^ 

^f^SfrVr^i \ ^333277 'TAAAAAAAAAAGTTTATTTTTTTGGT 
A ^^"GGGGGGCCAAATCTT^AAAATTrrAAAAAACCC C ICl ICl 1 1 IC 

TG AATGAA ^TCTCAACAAAATGTCCTAGCCACTC^XXIACGCAAAACAA G T 
AA GATaCTGTTG^G^ 

^fI AAA i T I? X ^ A 5 OAAAA ^ OATCCG ^^ 

TGA^ATOGGAGGACTCAGGAAGGAGGATCGTCTTTACCTGAGG 

GCCAGAGGT€CX^GATC<XntKX»rKUCAACOCAG 

C^AAAGCAGACACAGGGATAAGAACA^GGA^^ 

CCCCAGGCACAGTGGCAGTTAGGATGGCTTKIAGA AGGATAGTTCTATC 

GGTTGOOCAAAAAATtnXiATGTGCTrcATOTTT^^ 

TOCTAGAGACCATAGACATGATAT^fiAXTTAACrTOCCCAT^ 

AGGAA^CCATG<nTATGACTTACCT^^ 

GAAACCATGTXXXTCATGAAACCTACAG^ 

> Sequence 419

CAGCCACCTTAGAOCAOATAGGACAAGOCAGATACTGCAGAGAAGTITCTC 
ACAGAGAGTGGTCCTGCTGGAAGAGGAGCATGTACCT
> Sequence 420



NCCCGATOCt^CrrACTTGAGCK^^ 

ttttgtctcggcac<:ggtggccgtagctcxatcgcattttatg^ 
cgagaagggaacggagttttcatcaggtagattggtttttgt 



s^^^ ^I C ^ ACCGGCGGCGCCCGCCG ^^ 

jgctggaagacctccaggatggagaagtgaggctgggtc 
ggc^ttcagcaacaatoagagaattaaaaacttcttcagag^ 

TCA^^AACAGCACaCGKTT^ 
TTCTCTGCTCCCGG<J<^ 

A^^TrS^^ G ^ AGAGAGCTACAGGGAGAAAC ^ 
A T^JS^^^ G ^ C ^ GTCAGT ^ GACTCTAGTATGGA 

^2S^5^ AG< ^ TOAACAGACAGACTC7T1 ^ 

^^^^^^^^^ A ^ GAA ^ rGZ ^ mG ^^ 

AGCAGGCCTGCATCTTACCTCTACCTTGCCG 

> Sequence 422 

GGCCTATGTGCANrrNI I J 1 1 IGAAm^COiAJ^CTTACCCXXKJTGGCXXKXX; 

S5^?^5 G ^ AACTGGGGAAT7T ^^ 

A r aaa^i r^-^ T.^^SITT'^^'^' A TGTCA TO A G A TCCTCTA CCTC A T A 
a T^rx^^^^Jin^^^^ G ACT AA GGCA GTA G CTCA A A GGGCTTTGCAA A 
^c!ta^aa7.^^ GA 

^SSaI^S^ 

> Sequence 423

TTTGGANTNGCCACTCCAOCGCGGTXjGCXjGCCGAG 

CTji^^^^^^n^r^^ ^^J^ A OCTCA A CA TAA G A ACCGT A TG TG AAAA GC 

a^ A <w G ^J^^^^^^^^^^^^ GG ^ AAAGa ^^ aaa GCTTTTCCCCTA 

G^G^Tr^ A ^ A ^ A ^ AGGC ^°^ 

GNGT ^TGGAAGTTCTAGCtUAAGGAAGTAAGCAAAAAAAAAAATCGAAA 

?i^yi A J A ^ A A AAcccTGGGccAGGTGc ^ 
a ^^^^ aggccgaggtggct 

CJ^CAGOCT^^^CG^^ 

AAAAAAA i^AAAATrAGCCIXXKX^^ 

GOGTGAATCTC^GGTG<U 

ACTCt>GCXXTCGC^C^GAGCAAGACTC 1C> 1CI CCAAAAAaXcAAAAC 

ggagagagaaccctcaagattacgcacacacacaca^Cgcocctg^^ 
A^TT^r^rrrr^A^S^^ 

AAA ^ TCCCT1G » ™ ' A G «XggCCATTAAAATTTWIAAAAAQAAgrrA 

g^cttSc^ AT ^ 1TCAAAAAAA ^^ 

> Sequence 424
GTGTGGTGGAGGACGACAAGGTGGGGACAGACTTCCTCGAGGAGGAGATC

ACCAAGTTTGAGGAGCACCTTX^GAGT^GTCGATATOGCAGCTITCAACAA 

GATCTXJAAGOCTGACTGTCGGTAOCTXjOOOG 



TX}OATX^TCAAOTrxrTt^CCCCXX}TtXXXXXXX^GGTACTAAO 1 GG 1 1 1 A 
AOGA TGGAAAAG AGCT AACAACTXiACAACAAA T ACAAAATAAGCTTCTTC 
AACAAAGTATCGOGCCTTAAGATCATCAATGTAGCCCCGAGTGACAGTGG 
CG TATACAGTTTTGAGGTGCAG A ACCCTGTTGGCAAAGACAGCTGCACAG 
CTTCATTGCAG G I I IC AGGTTGGTTGA I I ICI I OG G C I 1 IIU.1 ICA TCA 
TTA TAATAATGTAGTTCCTGATTTTCATAAATGTATATGGGTTGTTACAT 
CTTCTATAGGATAACATGAGTCCGACATCTrCTGAATCAGCAAATTCAGA 
GGCAATaCCaTCTCAAGAAGCCAOCATTGAGACCACAGCCATTAGCTCAT 
CCATGGTCATCAAGAACTGCCAG AGG AGCCATCAAGGCGTCTA TTCTCTT 
AAAATGAGAGGCAGGACTGGCTAGGG7GATGCCTAAAGATGATTXOCAGG 
CTTG ACATGCTGGTATTCTTA CA TATCTaTTCGTGGCTGTA TAA TCTGTG 
CGA TGAAAA TTCCAAAACCG AG ACAGG AATTCGCACTTGTTAAAGTOG AA 
GCTOCAAGCCTGAGATOCAATrGG 

"ACTGAA 

TGTGGGAAAGGCTTTTGCCAGAAAGCACACCTGAGCAAGCATCAGCGAAC 
ACATACAGG AG AAAAACOCT ATGA ATTTTAAGCAATGTGG AAAAACATTCT 

gtgtgaagtcaaacctcactgaacatcagagaacacacacaggggagaag 
ccctatoaatotaatgcatgtgggaaa tocttctxx xacaga tcagocct 

aatgtggg aaaaocttccctcag aagtcggccctaa i 1 l» i ) ca cca gag a 

ACTCATATAAGACAGAAACCCTATGGATGTAATCAATGTGGAAAATCATT 

CTGTGTGAAGTCAAAACTCATTGGACATCATAGAACACACAGAGGGGAGA 

AACOCTATGAATCTAATGGTTGT0GAAAATCATTXTATGTTAAGTCAAAA 

CTAACTGTACCT 

> Sequence 427

GAAATGATTANT^CCrcACCGCGGTGGCGGCCGAGGTAGCTTACTTAGCA 
GAGCACTTTGCAAACATATTACTTATTACCAG AG C 1 C T'TT G T A GACCTTC 
CACATCTGGCTGTCAGATCTTAAG G TI G IG AATTTAGGCTCCAGTTATAT 
TCACTGGAGAOCATAATCCCACAOGGGTTATTr ATAAATACAG AGCCTCT 

TCACAAGGTAAAACTTAAAAAGCCAAGCAAACAAATTGCTCrOCATTCTA 

CI 1 1 1 A ATTSGGCCAAACAGCATATGCTACAGTAGTAACAT G i 1 1 1 1CGG 

AGAGTCTAAAAAACT CIG TI TA CA T T TOO C TO CTOCGTGOOTTGATOGAA 

AATGTATAAAACTGACTGCTnrrOGCCAGGCTCAGACAAGAAGAGTGAGC 

TGCTOGT 

> Sequence 428

TCTACAOGOGAAC 1 1 J UCAOCTCTCTACATATOGTATCTAGTATCACTTC 

TAATTTACTTCATATCTGACTCTAXXTTCTATCATACAACTATrCO 1 CI AA 

TAAGTTTGTATACGATTATTAGGTGTGAGAGCATCATCATXATTACCACA 

TACAA>n'AAGGGGNN>MGAGTTGATTTOATGO(OCCTTOGCGGAGGOGGC 

CGAGGTACAATTCATCTAACTTGCGGAAAGCACTTTCAGGOCAAATGCAG 

AAAGGTOOCACATGCCGAOCAGGAGCAAGCT/rCAAAATGTTCACTTGGGG 

CATTAOGCAOAGTAATTXCAGGGA I U 1 1 I C I GAAGGOCTTGATGATAOCA 



TTATOCTCATTATAGAT GATtX ^OGGGOOOCroOGCTGQATAOtXKXlACta 
TTGCCTTTGACAGCTCTCA I FOOC 1 UAOAGOCATAGAOCT 



GTTTCTCAT1 



TTITGATAT C A TT CC AG OCTT TA AOGCT/ICTTA AGOAOCAAAACAOCTrC 
CAGGAA C I I UC1CA ATACGATGAOCTTTAGACATCACCAGTGCTGGTAGO 



GTACCTGCCAG 
> Sequence 429



TGGGGOGTT G T, IOCTAACOGCGTGGOGGOCGAOGTA C 1 1 1 II H H 111 I 
milll rOTC ATCTCAA CTGCI 1 1 1 ACCAAGTTGTGAATATACTTQGCC 

I I PCTGTCn lUJU ^AAAGCAATTTGGGATTA l 1 1 ICCICCI 1 1 1 I I I I 
C7GCATT 7XATCAT A AATAC T OTCATA7TCATAC ACAG TAGCA FCTTCTG 
CAAGGGCCTTXTGGATTTXXAGTTTGGTCrCTTTCA TGGOCTGCTTCTTA 
GCAQCTrCCCTCTGAAGGCl 1 1 C ACTCACAG AGGTCTCATCATCATCATC 
AGAATCATTDOCAAACACTXjATGGTTTTTGCAAAACAGGGTGCAACTGCT 
G TGTTTTCn I G GCAAAATAAGQ0CATACTACCT GC COG 

> Sequence 430 

I I I rOCGTTGTTOCrCATCOGCGTOGCQOCOGAGOT ACAOACAAAACTAC 
AGACTTAGTtrTOGTGGACTGGACTAATrACTrGAAGGATTTAGATAOAGT 
AT^TGCACTCCTGAAGAGTCACTATXAGCAAAATAAAACAAATAAGACTC 
AAACTGCTCAAAGTGAOGGG I T CT I GGTTG PCTCTGCl GAGCACGCTOTG 
TCAATGGAG ATGGCCTCTGCTGACOCA G A TGA AG ACCCAAGGCATAAGGT 

tgggaaaacacctcatt1gaccttgccagctgaccttcaaaccctgcatt 
tgaaccgaccaacattaagtccagagagtaaacttgaatggaataacgac 
attccag aagttaa tca tttg aa ttctgaaca ctgg ag a a aaaccgaaaa 
atggacggggcatgaagagactaatcatctggaaaccgatttcagtggcg 
atggcatgacagagctagagctogggcgcagococaggctgcagoocatt 

CGCAGGCA CCCGAAAGAACTTOCCCAGTATGG TCGTCCTGGAAAGGACA T 
1 1 1 roAAGATCAACT ATATC1 ICC IP I OCAT7CCG ATGOAATTTCAOTTC 

GOGGGAAGATN 
> Sequence 431 

GAAAGTTTTCGTATCGGGGGCGGCGAGACCAAACAACAGCCC7XCAACAA 
TGATGACCAGTGGAAAAACAATGGAGTCACCAAAACCTCWGACAGGCTCA 
TGCTCCAGGACAATTGCrcTGGCGTAAATGGTCCATCAGACTGGCAAAAA 
TACACATCroCCTrOCGGACTGAGAATAATGATGCTGACTA I COL I GGCC 
TCGTCAATGCTGTGTTATGAACAA P C I I CGAGCGGCCGCOCGGGCAGGAC 
GCGGGAGTTCAAGAAGCTG G TG G TCAAGGAGGAGGAGGTGGAGGTGCCAG 
TGGAGGAATTGCAGAAGCTGGAAGTGGTXATATGAACTACATTX^AGTAA 
CAOCTCAGGAAAAAAAAGCT ATAGAAAGGTTAAAGGCATTAGGATTTOCT 
GAAGGACTTGTG ATACAAGCGTA I ITTGC1 FGTGAGAAGAATGAGAATTT 
GGCTGCCAA I mCl PCI A CAGCAGAACnTGATGAAG ATTG AAAGGOAC 
TT7T1TTATATCTCACACTTCACACCAGTGCATTACACTAA C I I K, I IC ACT 
GGATr G TCTCG GATGACTTGGGOt^TATPC ACAA TACTTGGT AAAGGTA 

> Sequence 433

GACTCTGCG1 POTTAGCACn3CTTOOOGGGACTCTGCGTTGTTAGCACTG 
CTTACTCOGl rop CAGCAl i tCI 1 1 1CICI PCTCG1 1 PCCTGTAGATTCC 

TTOCTCAAGGGGG^AACGGATTCCAGAGCATTTGTTCGGGCTXATGTAGG 
AATAGATCTITGACTGGCCGGTAAATOCCGCGTACCTOOOCG 



. OOG ATOTOT . 1 IGJLAThTIGCHAOCTTXACXXXXiWOGOGOOOGOCCGGGCAO 
GTACAAATCTAOCTOCOCAOCAAA rOTCCTl AGAGGGOCAAAGATGGCCT 

mm rcn pca toataacatoo oci ncnrnnnn poagacacoot 

TTCA1 rCnnCAOCXAGGCTGGAOTGCAGTTGTGCATICATtXaClCAOCA 
CAOCTTOAACXXCCAgCCTCAOOTGAI V C I C IC ACCTCAOOCTXXXXAQT 
AOCTGOOACTACACXIGOCACAOCATCAAGaXCOGGTAA l 1 1 1 1 GAAATT 
TTTATAGAGACAGGATTTTACCATO I7TCOCAGGC PGGTCT1 GAATTCCT 

GCATGAGCCACCACACCCCACCTGTCTATTITACAA I I M UL II I G AOCT 
CI 1 1 1 1 I CCAGCAGTCATGAAGCTGGCAAATGGCAGAACTGG AGCTAGAA 
ACTGCTGACTCCCTTTA ICI 1 1 1 CCATAGCAOCOCAAGGCTAAAACCAGA 
CTCGCACAAATOGTACCT
> Sequence 434

TGGCTATAGAGAC 1 1CCI OOCOOTGGCOGCCGAGGTACTTTTCTAAAAGC 
TCATCCACTCTATC^TTTAGATATOCAATTTTXAGAATCTOX^^ 
GGCCACTCCATCTGCCATTCTTAA O I C ICC 1 1 GGGAG PC I OOCAGAAGAA 
TTA TGTT ACT ATTTjTCTTTTA GTTG ATTG AAA TATTCTGT A TTCCTCAAG 
GCACCATCA 1GI I POTI A AATACATGAATTA G I PCTOCT1 lAAATCCTTT 
GAGCACCCCCT ATQ AAAAATATAAATCT 1 1 1 GA ACAOGCTTTAAAAATTC 
TAI I IUI I GGATTTTCATATrTTGGAGCTCTTAATTGATGTCACTATTAT 
TTCATCATATTTGTAAATACATCTTTCATACTAGAGATCTCAAAGCACTT 
AAGTO CATCACA TTCAC CA TAGCTAAGAAGGGCTOGOAGAAGTAAATGAT 
TTTTTAGATACTATTTTAAATGGTAAAACAAAAGCOGGGCCCAGGGGCTC 
ACACCTCGTATCOCAGCACTTTGGGAGUCCAAAGAGGACAGATCACTCAG 
GGTCAGAGTrCGAGACCAGACTGGCCATATGGTGCCAACCCCCCTCACTA 
AAA T AAAAAA TTA GCCA CGTTTGTGGCA CGCACTC TAA T- 

> Sequence 433 

GGG A TGA TGTG A CCCTCTCCGCGC TGGCX3GCCGOCCGGGCA GG A CGCGGG 
GG TTGCTCAAA CCG AGTTCTGG AG AAOGCCA TtlAGCTtXiCTGCTT AAAA T 
TAAACCACAGGTTCCATTATGGGTCGACTTOATCGGAAAGT^ 
GA 

> Sequence 436 

CCCAGTGTGGAGCAGCCCAGCCAAGCACTCTCAGGA^ 

GCTAOCAACTGACTGCAGATClTX!AATAATAAGTGAGGGGTAGATC1G<X 

CAT AGAGC TCACTTTAGACCGGCCTATACTOCTACAAAGAATTGTGGTAG 

GATCT TTTACTT^TCCTTGGCACAATAGAATGGCCAATGCCCTTCTAA 

TGTTTGGTGAAAGTCTTGGAAGC^CX^TTTnX^ 

GAAA7TJAAGTCOCTAAGGCAACCAOCAGGGCTAAl\lGAGGCTGAAATTTC 
AACAAAACCCTATTGGGGGGGAAAAAOCCAAAAGGGCGGGCATATTTTTT 
TTTOCCCA A A A GGG AGCACAAAOXAATTAAATCTTTAAAACGGAGTGGG 

gggggcaaaatttattkkx^caatggcacaacctgggaaaaaaaatdctaa 
gg gcocg ggttatattooctataacoogtaataactocaaccacccggtt 

AA TnfTTTAGAAA CCTTAAAAAAG 

ttattgct aaggggcctttgcaaaaataaaggggaa GGACOC I OJCCOGC 



> Sequence 437

GTTATACTAGTTaTTTTATATTACTOGTAATATGCTTCGTA I IIX i I 1 1 CT 
ttatcttag 1 1 UTUTACCTTATaCTCATCTATCAGTTTGT AATTTACTAA 
AATTGTATCTATCATATAGTTACTATTTKTNKTA PCrroCTGTTGI OOCT 
TGGO GGOOGATGTA OC 1 1 1 1 1 AGAAGAGAAAAGAATCTTGAATTGTATAT 

ATTTAri rrecn I acagaaaaaaatog m CG I AAATAATTTGCCTATT 

7T 1 GGTrAACATAGCACATGGAGATAA7XATCTGAAAGTTATAGGGCACrrG 
GCACIGCTGAATCAGAGCATXtfOCAATATTIGAGGTGGCTCTGATT 
GGCAGCT GAA CTOGGGTAGTOCAGTGGOCT AGC TGG I CC I OCCCG 

A 1 1 1 1 CI AGTCTATAA I C 111 C 1 U 1 1 ATATTTATATGTATTTTATCATTT 
ATCTAGTATGTATCTATATATTAA J 1GI I T AATAGTATGTGATTACTCTA 
TTTAOTCTATTATHAA 1 1 1 PB1 POGAGTGTCTGCCGOOOGGGCAOGTAOC 
OGOGGAGCTGOCGCIOI IGC IGCltAi lb MGAATCTAGAACX3GTAGCCAG 
^^^PPCACTXjGAGG A CCaGCAAAAOATGCTTAOOGAATOOGGAGATCCT 
GAGC^GGAGGAAGAGGAAGUUXUGGAATTAaTrGGATtXXCTAACAACAGT 
GAGAGAGCAATGCGAGCaGT/PGGAGAAATGTGTAAAGGCCCGGGAGCGGC 
TAGAGCTCTOTGATCAGOGTOTA ICC ICI OGATCACATACaOAACAGOAT 
TGCACCGaGGaGCTCTTTQAC 1 ICI I gcatoogaoggaccattgogtogc 

OCACAAAC7XT/TTAACAAC1TGAAATAAATCTGTGGAC1TAATTCAGOOC 



AGTCrTCATCATTTGGGCATCAGAATATl I CCTT ATGGTTTTGGATGTAC 
CTG 

> Sequence 439

CTATGTACTACTCATCTC 1 A KTCTGTATTGCACT ACGTAC IP GTG1 1CAT 
AAATCTAATDC AT Cl I C T C I C 1 G 1 1 A GTACGTA C TTT G ATTOCTATTTGA 
GTAGTCATTTCA 1 0 1 1 1 ATATTTTATATCATATCCTATCWTATCNCANCT 
TGTTTGTTGCAGTCCATCrrGXjTGGCGG<XGAGGTACT 
TAGATTTGGAGAAGGTGAGGGAGG AAAGGC TO ICC ICI 1 1 GATCOCATAC 
ATOGCTGOCAGCATAACAAAATAAGAAGG AAAGA AAG 

AGGCCCAGCTG<KyiAG ATTACTTG AGGTCAGGA GTTtlAAAACCAACCTGG 
CCA TCATXXnGAAACCCCGCCCC ACCAAAA A T ACA AAAA A TTA GTGGGGC 
GTGGTGGTG TA TGCCTGT AATCOCA GCT A CTTGGGAGGCTGAGGCAGG AG 
AATCGCTTtlAAGCCAAGAGGCAGAGGGTGCAGTGAGCCGAGATCGTGCCA 
CTGCA<rrTXAACXrTGTGCGA(^GAGCAAGACTCTGGG AAAAAA AAA TAAA 



TTTCCAACAACCTATTACAAAAAAGAAATTiAAAGATGGAAAGTCAAAGAAA 

GAAAGG 

> Sequence 440

TGGTCTA TGT GCCTGaCOOGGGOOGGOOGAOGTAOGOGGOATGTC TAAA T 

atcttgtaaaaagtgttaaaataaacaaaoccagtcaattaaaaattttg 
actgttattgagaaaactxx^tgagggaaataataagatctataaaggt 

CTTAAGAAAAATATAAI Tl GAAAAAAACATGTGGCTGAG I GTGGTGGCTC 

ACOCCTATAATCCCAGCA C 1 1 PGGGTGXMXTAGOTGGGCAGATTGCTOGA 

GTCCAGGAGTTT AAGAOCAGCCTGGGCAACATGGCAAAACCCTGTCTCTA 

CAAAAAATTAGCCAGGTCTOOTGGGACAOOCClTTrAOTOCCAOCTACTCA 

GGAGGCTGAGOCAGGAGGATAGGTTGAGCCTGGAAGATCGAGG<rrGCAGT 

AAGCTGTGATXACACCACTGCACTrTAGACTGGGCAAAATAATTGTTTAA 

TGATAAATGAGGTTCCTGOCCG 

> Sequence 441

CGGATXTIGANNATTGATATAGCGACTCCaCCGCGG74GGOGGCCGAGGTAC 
ATTGTAGCTTTGAACTCA O 1 0 1 II A AAAA7/TCAATCTGGTTACACACTCT 
A TCTTCT AGAT0CCTTGAGACAC1 G TCTTOCTT G AATAAGGGCCAGCTGA 
AATGGCATTTCA GCTCTGGAAGCa I 1 1 ICIIX AGGGAA I ICI rGGTG ACC 

AGGCTGAI 1 1GI rCTTCAGAAGTCT POCtXaTCrCOOCDOGOGTACTCTTC 
CTGCAGGTTAAGGCAGGACP0GAAC1 GC1 C C ACAGCTTDCACATA O I ' 1 1 1 
CAG ATTCAACACT AA CTTCTCCGAGTTTAAGA TGTGCCTGGGCA GCA TAA 
AGCTGTCCTTCTT1TGT1TCTTGCCTTTT AAAAATGATCTTTGCTAAATC 
CAGCATATOGCAGGCAAGCTCTAGG 1 1 UCCA A TC TOC I OC I OL1CATTTT 
CTTGAAGAGA C I I UI II IC AAGGJ^CTGAATCATTTOGC AI 1 1CHCA GTC 

tta tca i men ia tcaiccici pocgagoci icao i i rcnc AODCic 

TTPC AT C TCOT C I I C 1C1CICI POOGOCTC1 ICA TTaGCAGCTATCTCAA 
CI 1 tGOCTTCAGGTGAI 1 1 C 1 LAGTAGC I UUL > GGGCTAOL 1 1GGTAATA 
ACXXX^TCTOCAGCT GGCTCA AAC TC 1 1 1 1 AC AGAC AGCVTAG I C IC C1 1 
CTGACTGGGAAOCAGC1 1 raOOCTGACTTCTMCl 1 IAGATOOG 
> Sequence 442

CGGCCATTXGO^TC^TATCTGCTCTGATTJCAAAG3^n nCAACGTCACTA 
ACTATGAGTCAOG 1 0 1 1 IGlAtCOOCI INIU^CK KQJAAAAKAKNHAAGO 
TGTOAAOTI CG I ATOCACTOCAOOOOCOOtXOOOOCOOQOOCAOOTACTT 
1 lOCIGCI UAGGAATGGAATCAAAAGAAOGTA G I C IC C I GGTAAOCAOCT 
CAGATCTCTATTA TTAGG*TTAGATGTTXXXXXXXrrOACTCtXXX^GCTTC 
I IGC ICIOGACOCTGCACTGTAACnTG ClJC I ICIA T TAGCAGGCAA GGAA 

AACCAACGTTGCTATGAAATATG^CTCACACTGTATAGC 

ATGATTAATGAATCAGAAT 
CCTGTTTCATTGGTGACTTGGATAAAGACTT i i rAATTTTAAAAAAAAAT
ATTCATCCAATAGCGTCCT 

> Sequence 443 

TGCHXiATAGNGTCCTCACOCCGGOOOGGOCGAGCTACATGAOAOACACTT 

T AAGCA GOCTCACAGCAATAGAGTOAGTOCGOACTCAGATTGTTTAACCT 

ATTrrcTOAACOCATTOCTACTGOGTlTAACTATTTTATTGGTTTCTAACT 

ACTAOCACAGACACGGATAOCTCACAGGTTOCATTATTACTCACAGCGTT 

GTCGTCCGGGTTCATCCKXATCCTGCTCCACGC1OTCATAATCCTCACGC 

A'TCCCCGCICGGGACCOCTCTTCTATAAGGGACATACAOGAGATCACCGA 

AAACTOCTCCTTTCTCCCATTGTTOCTATGAGGTGGGTGGGGACTOCAAA 

AOCOGTAGCTCCTGCCCTAC 

> Sequence 444 

TCGTTtrrCATACTATTATAATTGTATTCTACTATCTTACATTATCGTATC 
GTCTT AATG A TTCTAGTATCrATTGTTCTG AATATTTATTATCATAA ACT 
AATATCNNAKNNNNTTGTTTG TTTATTCTCATCGG ACTCCACCGCGGTGG 
CGGCCGAGGTACCCAGCCCCACCCAGGCAAACAC<rrcCGACATGTTrcGT 
AAGTGAG ACAAGCCAGTGCAAG I T 1 1 1 1 I TI I ICC I I IGI I I fTC OGCTT 
ACCTTCTTGCTTA A TOG AA TTGTTA TGOCT AA GCACA T A G AAGGCCAAAA 
AAGGAGTTTTTCAAACCCAGCAAATCAAGTGCTTGGATTCrGAAC^ 
AAAGAAAACTGCACTTOCCCrcTTAAGTAAAACGAAATCAGTTTCTTAGG 
TAAATGTATTCATCAGCCCAGATAAAAAAAAAACCAGTTATCTGAGCGTT 

agtcactgctcatttccaggaagatcaaacaaaataccagcccagccaga 

CTCACATGTGTGTATATATATATAAAGCAAAGAGCCCCGCCCACAAGCCA 
GCAGCTOGOTGAAATATCAGCTCTCCACGCCGTGGTATTIOCAATTOGGGG 
AAATTACCICCTTGG AAAAACTGGAA AA ATTA7TTGTTG AAAAAAAACTT 
ATTTGATAAAAGTGTnT 

> Sequence 445

TGACGATNAGATtXjGAGTCCTCACCGCGGTGGCGGCCOCCCGGGCAGOTA 
CTTTACTAAAATGACTGCATTCTTTOOATTCCTTCAGTCTATGGTTCAAG 
TCACTAAAGATTCATrTTl G 1 1 GAGTCCTTATC AGAAACAGCAGTATGAA 
TCTTGACCGTTTCTGCCCGTCCTAATGGCAGAGCTCTCTGACTTGGGTGT 
ATGCTACXAGGCTGGCTTCAAGTGAGAACTTCTGGTCAG I C I fCT GT GGG 

TT GAA G GTTCAATA tcaattctgtttcaa agcctttgtg atcctattto a 

ATCI 1 1 GC I (XiGTATATGCCACOCAGTGGTCAGTCroGGACCTAGOTGGT 
GAGCTATCCCATAOTrcATTCTCAACGTCTITACTGCACTGTTTAGG 
AGATACACATAT ATATA CAACTTTGGGTGAGCTCAGGAGTTTATAAGCTT 
TATGGGCTTGGTGTTTT GATTTATAAACAGGAGTTTATAGAACTTTATOG 
gn I ^T^^ Ml* I | ^CAGTrCCCTTGTATTTTTX^GCCCTTAAAAC 

GACCAGT7TCACA GTGTG 



TGATGATGATTOOCTKATCCGGTGGCGGCOGAGGTACGCGGGGAGACACA 

ACTTC CTGGGCTTAGATATTTCAGAATATCACAACTAAACTCTTAAAAAT 

TT^CTGAAGGCTGGACACCGTGGCTCACACCTATAATCOCAGCACTTTGGG 

AGGCTGAGGCAGGCAGATTGAC7GACKrrCAGGAGTTCAAAACCAO^ 

GCAACATGGCGTAACCTCGTCIO A CAAAAAATGCAAACA 1 1 IGCIOG GC 

TTGGTGATCTGTGCCTGCAGTCCCAGCTACTTGGGAGGOIX1AGGCAGGAG 

AATCGCTAGAAOCCATGAGGTGTAGGCTGCAGTGAGTCATGTTrGCACCA 

CTGCAGTeCAGCCTGOGTOACAGTGTOTATTA G 1 1 1 G 1 1 II C ATGCTGCT 

GATAAAGACA T AOCTG AAACTGGG AACAGA AA OA GGTCTAATTGGACTTA 

CAGTTOCACATGACTGGGGAGGOCTCAAAATCACGGTGAGAGGTGA^ 

AACCCCTGATAAGCCCATCAGAATCTATG AG ACTTATTCACT ATCACAGA 

ATAGCC 

> Sequence 447

ATTATACTT ACCTCTTA G A TTT ATTTA TCTCAAGAA T ATATCG ATTTCAT 
CTTTTATACTTA NTTGTA CATA TTTTTTA ATT AT A TATTCTA TTT A TTA T
T A TAC AAACN ATCTAATGCGTTGTA I C 11 C rOCCGTOGCGGAOOAOGTAC 
GTTTTGTCACAGGGAATAAAATTTTAAGAATTCTTAAGTCTAAGCGACTT 
GCTCCTG ATCTTCCTG AA G A TCTCT ACCATTT AA TTAAGAAAGCA OTTGC 
TGGTCGAAAGCATCTTGAGAGGAACAGAAAGGATAAGGATCCTAAATTCC 
GTCIX^TTCTAATAGaGAGCCGGGTTCAOCGTTTCCCTOGATATTATAAG 
ACCAAGCG AGTCCTOCC1 CXXIAATTGGAAATATGAATCATCTACAGCCTC 

tgccctggtcgcataaatttgtc 



TGGGGATGT GCCTCTCTGTGGGCGGTGGCGGCCGAOGTA C 1 1 1 1 II I I I 1 
1 1 1 1 1 II 1 1 OTTAGTGTTTTCTOAT G 11 1 1 1 fCTA ACAAATCTTTGOCTG 
CCCAAAAGTCTCAAAAACATTCTCACGTTTCTAGATTTTTAGCTT^ 
TTTGTGTTTGGG ACTA TG ATCCA TATTTAGTG AATTTATTTTTGGGGGGG 
CAGAGTCCATGTTGCCCAAACTGGTCTGGAACCACCACACCCAGCTAATT 
TTTGTG A ATTGCXXKJ TACXIAGCACACCGGCGCCGTCCrGGACTGCGCCTT 
CT ACG ATCC A A CGC A TGCCTGG AGTGG AGG ACTA G ATCATCAATTG A AAA 
TGCATG A TTTGA ACACTG ATCA AG AAAATCTTGTTGGG ACCCA TG ATGCC 
CCT ATCAGATGTGTTGAATACTGTCCAGAAGTGAATATGATGGTCACTGG 
AAGTTGGGATCAGACAGTTAAACTGTGGGATCCCAGAACTCCTTGTAATG 

ctgggaccttctctcagcctgaaaaggtatataccctctcagtgtctgga 
gaccggctgattgtgggaacagcaagcccgata< 
acggaacatgtgttacgtgcaacagcgcacggagn 

> Sequence 449

GAKITGTGCXrrCTCGCGCGCGGGGCGGCCGGGTAC _ 
AGCCCCAGCTGTTGGCTACATGAGTATTTAGAGGAAGTAAGGTAGCAGGC 
AGTCCAGCCCTG^TGTCGAGACACATGGGATTTTGGAAATCAGCTTCTGG 
AG GAATG CATGTCAC^GGCOGGACTTTTTCAGAGAGTGGTGCAGCGCCAG 
ACATTTTGCACATAAGGCACCAAACAG<XXlAGGACrGCCOAGACTXrrGGC 
CGCCCGAAGGAGCCTGCI I1GGTACCTGCCCGGGCGGCCGTCGATCTCCT 
TGTGTTCAA GCAACTTCTTGCGGTAGTCCTGAAGCGCCTTATCTCTAGGG 
TCCCKX^TGATGAGAACCCCGCGTACCTGCCCG 
> Sequence 450

TGGGATTTGCCCCTCCGGGGGCGGCCGACGTACTCCCTACGGCACTAGTC 
TACAGGOGGAAGGACOCTCTGTGCTGGCAGCOGTCGCTCACATGGCCTGT 
CTGCACTGTAAOCACA GGCTGGG ATGTAGCCAGG AL 1 1 GO 1 1 111 M C CC 
GCG TCAAG A G ATAGAAAG ACCAG I CC 1 1 G I G AAAGACAAGTCTGAATGCT 
CCA CTT7TTC A ATTCTCTCTCCATTCTTCA G T AAGTCAA CTTCA A TGTCG 
G ATGGA TG A A A COCA G ACACATAGCAA 
> Sequence 451

TGGCACCGTGCGTCTCCGTGGTCGAGCGGCCCCCCGGCAGGACAAATGAG 
TTT AOAAAT GTTCrrATAAGC^TGATCTGGAOCCAAACTAAAACAACGTTA 
ATCCTCTT CAA ATCTAATTTAA TATAC<XiAATAAG^TTATTGA 

mm icctca i i hlii i i iccig aac gi i i i i i ig tagaaaccatgg 

TAAAAAC<WAAAA GAAAC CTTTGACTGC<XX>COGCAGC<KKiAATACAAAA 
AAA AAATCC CTTGATTTTTAAAATATACTTGAATATCAAACTCAOAAAGA 
GTTA1 11'ITGTOAAAGAGGCAAAATTGXjICTIGAGCTGCTTCAGTCTATG 
TCTGAAGGTTTTACTGAAATTATCGTCX^GTTT^ 
GAAAAGTCAGATIGTAGATTTTGAGAAGGAAACTrCTGAGGTGCrrC^ 
CrCCAAGOTCATGGTTATOAACCTCAATGAGGGOCTGAA 1 1 GC I ICITC C 
ACAgATCCCAATTCAATGAGCOCCATTTTGCQA ICTT1 CTOAAAOAATTT 
AAAAGCXrrTCACTGAACATCCAGCTTCTATCAAAAGGTTC^ 
^^^^CAGAAGGGGGAATGTTGGAAAGATCAGAGTGGCT 

TGGTT ATXKj ACCTCA CXXXX?7X>GCGC<XXiCTAATCrTT A GAAGTTAAGTTG 
GAACCTATATTGTAGAGGAACAAAAGCCAATCAG IG IQJ 1 1 I T IG TCTTT 
TTTTACATAAACTTTTACTACAAAAATTAATATATGGATTTTGAATTTCC 
agtcaaaccaaattgtaaaactgtttcatttggttctatattatgtatac
a t aattta tctatt a ta t a ttta ca7t aaaa t a t a tgca t a t at aa tgga 

TTT AA TTTCCTTTTGGCACCCCCA T ATCT AG AAGTCTCTTCATAAATTAA 

taaataatctag<kx x>gc attatgtttgctagacctgoatttgg 
tacttaaagtt aaaag 1 1 jc1g1li rttttci i ggacttg aaactgccta 

GAGCGT CAGTCTCTCIOrTA l 1 1 1 1 1 1C1AI 1 1 111 1 1 1 KJUU CCATCAG 

ICt 1 1 lAGCCACTTGAAGCCAAAATTCTTA Ol I ICTGTCCTAGTCOATAA 

GAGTAAAAGGC^AAGGAGGAAAAGAjGTCCAGTCCCACTGGACAGTCA 

CTCICIGGGAAGGACOCATTACAAGAO^TGAGTCCCTCT 

ATTTCTATTTACATAAAATCTTTA 

>Sequence 453

CTTTATCCCTT ATA TA CA T A AA TATT ATT A TTGTT AA CA CAA CTG TT A T A 
TATAACATTATAATATAGTATACTCTATTTTGAGCAC^AGATGATCTCTC 
ATCCANNNAAGGG TGTTGTTAG ATTCCATTCCCCGCGGCGGC 
> Sequence 454

ACCACGCCTCCGCC A CCTGTTGTCATTATAlXrrcCTG ATCGCGATCTACC 
CTCTATCCT ACCHATCGACATCC<K}CAGCATTCTATAGGAGTTGGTATCT 
ATTA TACT ANT ANATAKAAC<kXXXXXXJTCACGCrCACTATAGCCGAAGG 
NGGACGTICCGGCCAGGNA CGCGGGGACXTITTCACGGC<XXXK}GG AGCTGA 
GGCTCC I GCCGACATCTCroATCCTTGCACCCTCGCAOGAAGCTGGTCOC 
GGGCACTATAACGGGAGGCCTCCACATATTCCAGAAAAGAAACCACTCTG 

cagtgccagactggaagaagtaacxk;tcactctgaaaacac<^ggggaga 
gctgcctcccl 1 "i gaacctctcccaggaccaactctaacccagggagggg 

AACTTG<nTCX}GTGCAAC<XXrrGGCTTGGAGACAGAAT^ 

agatacactagaaggcgctggggatacatcagaggagagggatactcaco 

CGGGCTCCGTGGATGAAGAGAATGGCCOACAGTTGGGCGAGGTAGAGCTG 
CAATGTGGGAATGGTACCT 

CCACCCCCTTATACCAGTTTACATAATGTTGTTATTTTGG 'I I I IC I IX. TA 
CATAAGTAGATCTTCTCATAI 1LI I I I CAATCTCTATATTCT ACCTGTAA 
TATCTAAATC>TlTGTTCGTTAGCTGCTGGCGCACOCGCGGTGCCGGCCGC 
C CGGGC AC^ATOCCCC^GGA^ 

CGTCCAGTTCCTCTC<^TTTQAACAGCTC^ 

TTTAG A GCAG AA GCCAGCAA T AACT AAA CGGTCAGGGA CAGATA G AAACT 

ATTTTCCKKnTCATGGGCCACACAGCCTCATTGTAGCTTCT 

TG TTG T A GCA A G AAAG AAGCCA TA TACCCTGTGT AAA CAAATG AATATGG 

CTGTG TGCCAA TAAAACTA TTCACAAACAT AAA G AGTGGGCTGG ATATG A 

CTCAGATACTG1 1GI 1 1 GACAACCCCTGATCTAGAGTAAAAATTCCAAAC 

TCTATAGCCTOCAGCCroGGAAACAGAGCGAGA C I [CGTC1 IA AAAAAAA 

TTAATi^TATATTAATACATATGAAAAAAATATATTCGAC^rrGGCOjTT 

GTGGTCTACTCTTGCATTCCAAC>CTTTGG 

TTAGTTAGGGGACAGAACAGCCTGGTCACATGGTGAAACC 



PCT/US00/33312



AATOCTA1 1LICJILATIGCIGG1 1 CATAATATAATATTATATATATGTCT 
^'^^^^'T^CTAATTTATATTATAGTAGTATGTAATTGCTGGTAATA 
TCTACATAATKTTTCKKTNTAATAAGTGCACTTTGOC^ 
TTCTCCCX3OAO<»a3GO0GAC<nACAACA 

CTA.VAAATGTGOCAGAATCCACCTGTGC<XXXlAATCG 1 G rTTGGTTCCTC 
1 1 iCiA CTCC ACiXiCAOATCACCAAACCTGTCOCt i ClTJCCA Cri IPCTCA 
CTGATATTGGGAGGAGGGCAAGGCCCAGCCX1AAGTICX1A 
CCAGGAGAAJAGGCACCGGC^ 

1 1 i^iM ■ 1 1 1 1 111 1 1 1 CCCGACAGCACAAAGAAGTAAGGGCAGTTATTG 

cacaggtgttatttaaacattctattgtaaatgaa 1 g itjttgtitogttc 
tactgcattgtgcagcatgcgcgggaagagaactgac^ 

t, v * r^v-r™™^ " * TTAA CCA GTCCTTGA TGTTGTGTG ACT AAA GT 
AAAGATGATAAACCCCCATTTCCTGCC<KJTGGTAC1^
GATTGGG AA AGCTTT CCATACtXTTGGCCATTCC C I ITTTTCTTTTTTT1 

c aaccccat ttttttaggaagggattgttaacaaaaac c 1 1 1 ci n I A AA 

CCTTTTTTT 
>Sequence 457

^^^^^^AGTCGACrcAGGGGGCGG^CGTATATTACTGTGCG^GAGGT 

AAAGGATATAGTGGCTAOGATTACGGCCTCTCT 

>Sequence 458

C<iAGAGTTGANNCAWrrTTGG^AAGCGCTCCCCGCGGTGGCGGCCGCCCG 
GGCA GO TACAG ACAAAA CTACA G ACTTAGTCTGG TGGA CTGG A CTAATT A 
CTTGAAGC^TTTAGATAGAGTATTIXKIACTXKnGAAGAGTCACT 
AAAATAAAACAAATAAGACTCAAACTG<rTCAAAGTCACGGG I fCl I UGH 
G TCTCTGCTG AGCA CC<7TGTGTCA ATCC? AG ATGGCCTCTGCTG ACTCAG A 
TG AAG ACCCA A GGCA T AA GG TTGGG A A A ACACCTCATTTG ACCTTGCCAG 
CTGACCTTCAAACCCTGCATTTGAACOGACCAACATTAAGTCCAGAGAGT 
AAACTTG A ATGG A ATAACG ACATTCCA GAAGTTAATXATTTGAATTCTG A 
ACACTGG AG AAAAACCG AAAAATGG ACGGGGCATG AAGAG ACTAATC A TC 
TC<3AAACCCATTTCAGTGGCC^TGGCATT^CAGAGCTAGAGCTCGC<KXX 
AOraCA^GCTGCAGCCCATra 

AAGA A GAGTA C A GCCCT A G A GGCAA A CCCrCTTTCAA TCTCTTGA TN 
>Sequence 459

C<XXiATAGTCNGAATCACGTGGCGGCCGCCCGGGCAGGTACCKXXjGTAGT 

GAGCTGGTTAGTGAAGGCTTTGTAGCTGAGCAGTTTCTAAATAACACAGC 

CACT CAA CTGACATA(XA TGGA TTATG TO AACT AACTTCAACGgTTCAOG 

AAGGAGAACTTTGTGTGTTCrTTCGGAATAATCATTTTAGCACCATGACC 

AAATACAAGGGTCAACTCTA I I IG I I O GTAAOOGACCAGGGCTrTCTTAC 

TGAAGAGAAAGTTCTTTGGGAAAGCCTACACAACGTAGATGGTGATGGAA 

ATTTCTIGTGACTXIAGlAATTTX^ATCTTCGACCTCCTTCAGA 

GTATACAAAGGACAACAAGATCAGATAGATCAGGATTATCTTATGGCATT 

A FCTC1 ACAACAAGAACAGCAGAOCCAAGAGATCAATTGGGAACAAATCC 

CC<JAAGGAATCAGTGATTTGGAACTAC<>AAGAAACTCCAAGAG<JAAGAG 

GACAGACCOGCTTCTAATACTATCACqAA^ 

TTCATCAAGTGGaG 






t AGACTTCAG GGAA ACAACACGTCCTCAAAGAAACATGATTCCCCTCAAG 
CCACAAAC<^TTTTCTCATCAAGTGTTTTCACC^^ 

CACAAG AA G A GG AG A GC A TTT A CTCAGGTAAAAA T AG TTCTCTT A GTCTC 
TTCC1 C i AGTTACTAATTTTTAATTTAAAAATACAATTAACTGATCTAGC 
TGATAAAAGTCACAGACAGAAATAAGCTAAG 1 1CI L1CI 1 LX.1 1 I A GGGA 
ACC<TTGGTGGCAATTCACCATATAAACTGGATGGAAGAATTXrrCCCA 
ACAT 

>Sequence 461

CTCTT ACCCTCGTCTCACTGTACTGATAAACATTTA 1 C I IGCICACATGT 
ATATTTTATACTCT ATCTATTC^TCTGTAACTCTC^CAAATOCACTOAAGA 
TTATTGTAGT AATAGTGATTA IGI I fCTCTCl ATAATTTGGGGGTGATTC 
CTATCAGTrGCCCOCGTOCC^X^GG^CGCG 

TCTAGCAAGTCGGOCA GGATGTCTAAGCOTGAC<rTTCiAGAAAGCTGCAGA 

GGAGGTTAGGCACCTTAAGACCAACCCATCOGATGAC< 

TCTATGGCCACTACAAACAAGCAACTGTTJGCOGACATAAAAACAGGAACG 

gcccggcatgt tcgac ttcacxxkk^c<xraagtt^ 
agctgaaagggact^ocaaggaaag^tgccatgaaagcttacatcaacaa 

^ ^T A 9 ^ G AGCT AAA GAAAAAA TACGGGAT ATG AGAG A CTGG A TTTGGTT 
ACIGIOCCATGTGTTTATCCTAAACTGAGACAATG U. 1 1U 1 1 1 1 1 1 ILIA 
ATACCGGGGATGGTTKXjAATTCGC^AAAATAACXAGTTAAACCAGCTA 
caaggctgcitaccatacgggtctaacagattaggggctaaaaacoatta
ctgal rnocttototaqtiti i a tctgaaatcaataaaaggggattggt 

ACCATAAAATTCTTrcTTATTCrroTOCCITGGOCGTTTAA 



1 rGTTOCTGAT] I OCX. I GATGTGTGGACGGATCACCAAGCO AGTGACACG 
AGAG CTCAAGGACAG GCTACAATACAGGTCAGAOACAATGGCTTATAAAG 
GTTTAGTGTGGTCTCAOGATCTXMCAGGCaGTOCAGCCTGACXi i ICTCC 
ACACTCCAGACAAA CI I O O C AGACAAGC I CC 1 1 fGTGOCTCTACGTOGAG 

agggcgtggaaagttatxacattaaaagatggaggatttaaaaaataaaa 

> Sequence 461 

aattactctacagtaaggactgtaagtactagaaattatatgtatgtaca 

G A TACT A C ACT A T>4G ATTTATACTAACTTTATATTAATCAATTTACG AAT 
TAG ATTATG ACATACTTATGGAGCTAATTTAI ICC I 1 lattactagttta 
GTTGGi 1 1 GATTCGAG f CN T CT ATCCCGGTGGC 



TGCACGATGATTCGAAGCCCTCAi 

llll IO C 



lGGTA 

TTTT 

rrrriTTTTTTn I a aaccgci gccaccaccatoaaag aggggccaccac 
a tttttattgcatactcaggggaataacttattatacaatgaacactcct 
ccattaggagaccatgcccacttacagaatgcagccgt aaatgcggtaaa 

TCTATTTACAGAGGTTGGGGTGCAAGATGAGAGAAGTATCACOCOCAGGA 
ATTTGAAGTGAGAATGATCTACAAATTCTOCTGACAAGGAGCAAOCGGGC 
TrGTGCTAGTGAGGGCTGAAAAAATTOCTGGCAAAACOTAGGGGGAGATT 
AAATCTOGGAATTGACAGCAAGTT TGGGGA CAGTGCAAGAAGAGAGGGGT 

ACTTGAG AGA TGAGTT AAATTTAGGGGTGA ICTT I A AC CC 1 1 1 ULACCCC 

AAATAAGG 
>Sequence 465



ATAGCACCATGCGGATTCGTCGAACTGTGAATTCTACCCGGGAAACTCCT 
CCCAAAAGCAAGC TT G C I G AAGGGGAGG AAG AAAAOOCAO AACCAGACAT 
AAGTTCAQAGGAA ICIG7C1 CCA CTOTA GAAG AACAAG AG AATGAAA CTC 
CACCTGCTACTTCGAGTGAGGCAGAGCAGGCAAAGGGGGAAGCTGAOAAT 
GAAGAGAAGGAAGAAAATAAG 

> Sequence 466 

GOGOGCTTCTCOOGAGGTGGAAGCGGGGGCAGTCAAGGGCCGGOGTTCTC 
TGCCGTCAOC C1 1 ICC HO C 

> Sequence 467

OOOGTOATOACTCATGACTATXCOGOGGTGGOGGOCGOCCGGOCAOGTAC 
U II I 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 gagacagau i o igctccatcaccc 

ATOCTAGAGTGCAGTOOAGTGATCTCXjGCTCACTGCAACTTQJ G O L I III 

goottxaaoctattctoctoccicaoccttocaagt aactogoattacao 
gcacatgccaccacococaactaattttgtatttttaatagagacagggt 



TOCTCAGOCTCOCAAGTGCTGAGATTACAGGCATGAGOCAOOGCGOCTGG 
CTOA IIOIOIItXIMllC ACAGAI IllG l l l llOIIU I G I MlCllOA 

acactcagctggactocatttcocag c i i c c c hucagttaagtcacaag 
tagcgctgtgactgggt/ictgcocggtagaaggtaagcagaagtgatgtg 
tattlacttcrratgtgtggcctcccaaaacctctaaacgtta i g » 1co0ci 
ci h i roccatctatoocctokaaaotgaaatattatooagoci 1 1 1 oct 

GAOACAOCX2CGCtTTACCT(XkXXXKrrCTAAACTA 
TCGGTGTGCIGTGCrCATCTGTCTTCCAAAGGAGGAACAGATCGGCAAGT
GCATCTGAGGCGTGGCCGACAAATCCtGTCGAAGAAAGAAATAAAAACCC 
TGAAACATGAGCGAGAGTGATCOAAACGTGTGCAAATG U. 1 1 CI I A AAGT 
TTATAAAAGTAAAATCAAATTACA 1 1 1 1 II 1 1 1 CAAAAAAAATAATTTAA 
AACTAAATGTACCTT AAA 



GGGATTGGaGCTCCAOGOGGTGGCGGTCGGAAGGAGAATGGTATCACTCA 

GGCTCTCAGAGTGACACTGAAGCAAGACACTCATGGGGTAGGACATGACC 

CTGCCAAGGAGTTCACAAACCACTGGtGGAATGAGCtCITC 

GCGGCCAA CTTGGT A GTGG AAACTGGGCAGG A TOGA GT A CCTTCAGG ATT 

GGCCTCTTA 1 1 1 1 1 1 1 1 AGAACTAAGTTCATCTTaAAAATTTAAGAAGGT 

GGACATTrcAACACCATXAAGTGCATTTACGTGACATGTTTAAGTTAACT 

TGACI JUL J I UAATOACCT AGTTAGT AAACTAGTCACTAGTAA TTCGGTC 

A CCAAGCAAA TCAAGCCTGCAAGA A A GG AAGCCA A T A TTCA A A A TGCCA T 

GTTACCATCTAAACOC 



TTGGAGCTCCCOGCGGTGGOGGCCOaGGT ACTGATTTTATTCTCTACCTC 
TCTGGACI IGC IUXAGCATCCGGACCAAAACCATCAGTGCCACAGCCAC 

gacagaaggcgaacoggaagttgacaacl nc ro o m iagatgccacoc 

iATGAAG 

TTTGTT 



AAOCCrACrrGCCCCCGAACGTACCTGCOCO 
>Sequence 471



TTGGAO 



.tac 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



TGGGAAGACACAAAGATTXAGACCACAGCCrACAGGGAGAGAGGATTTCT 
GAGGATGGTGGTGCACTGTGAGtCCACGCAGGCCtOCTGGGCATAGGATG 
GAGCAATICTATCTCACCTXAGGCCTAGCACAAAGGGCTtCAGTAAACCA 
GTGG AG I 1 1 L)L 1 1 LA1TAGGATTCCATOCCAGGATATOCAGAGGACAAGA 
CGCTGGCCAACTGCAGGATTAG43CTATGCTCCCGTGCTGGATATAGGCTA 
CaCGCAAGAGAAAGCTTGGGTGGGA ICICCIGATCCCGCGTACCTGCCCG 

ACTXACCTAACTTATATTCCTAGTTTATTTAAGTTATATTGTrACATATT 
AACAATrACTGATATCTOCTGACTAAATATCTACTACACrrCTCATACAC 
TTXAACACnGCTATATATTATATTGTATCTAGTGTATATTTTATNNAANN 
TCAGTr><GGTATGCTGATCCCGTTGCCG GCGW CCOGGCAOGTACTATGGG 
TGTAGTGTTACTATTACAGTTAAT ■ UO 1 IX! I ! IGIG1UCGCTGATAAATG 
CAGTGAGGATTGGAGCACTGTOCACTGA G I C IC IG t G CAACAACTTATCO 
GTC TGOCAGGGG7TT CCOGTGTCTGGCTCT0 A 1 C I' l G G IC GCT GGAT ACT 

OCGGOGT001 1 GGCCAAGTGCTGTCT A ATAATTTCAGGAGAGG ATACTTT 
GTTGCTGCTGCAGGA TCAGCCA TGGT AG ATTA TGGTTTTTG ACAACCAG A 
TGGGGCACACAATTTCTAGTGTGCOCATTTAACAOG GIC I I ICA AAOTAC 
CATG 



TTTATATAA CTT A 1 KXil 1 CA ATCTATTTATTAT A ICTCTC1 I ATATACT 
CATTGTCTACITTAATATCATACTTGTTATAATTATTCATACATATTATA 
ACAANAO0GATGGATGTTTCAT>rrAMTTA>«GCAGCACACCACCG<XKnGG 
COGAQ3AGOTACAAAATAAT TATAATG TATTAACTCATACTGOC HOTCI I 
TT A T A GGOG AAAAAAA T AAU_ J llll 1 ATTTTAAAOTTATAAGOTOOGTT 
ACCTTTTAGI 1UCI I GGATGACAGGGAATTAGGCTAOCOCATTTTGGTCT 
GGAACAGAAGACTTTCAAATTTAATATGGOOCAA G roiCTI C CTACTTAA 
GTGCAAGATCATGCrATGTGAGTTACCCAAGCTGGAATACCGTGACACGA 
TPGTGGCTCGCTACAGCCrCCATGTOOCAGGCTOOAGCAG 1 1 CI OULACC 
TCAAOCTTC CCAGTA GCCGGAAOCACAGAACCACA O ICI TCI CA TTTTOA 
AA AOAC ATGC1 1 1 I 111 lAAAGCAACAAAGOTOOTAGAOOAAATTTCTTA 
AACTTTGTCAACGAGTCATGTAACGTTACACTGGOCTTCATAAAGGACOG 
7TTAAGAAAGGOL 1 1 I UI I C ATCTTTTATACTATATT C T G nC T l GGCC
TCGGGGGOC llll I AAAACT AGTGGAT1 
TTCAGCTTTATTTTACXTTCCACCTTA 



>Sequence 474

TGCAGATGGAGCKTtlTACCGGGGGCGGCOGOCCGGGCAGGTAOGCGGGGG 
AGCTGAGCCGGTGGGTGAGOXXXKjGCACGGCATCCTGTGCtGTGGGGGC 
TACGAGGAAAGATCTAATTATCATGGACCTGCGACA G 1 T1C I' l ATGTGCC 
TCTCCC TGTGC AGAGCCTTTGCCTTGAG 

CGtGTAC TTCTAAAA tTOCACTTTA I G n rTOTAOGCTTOOAOCI ICT I G 
ATTATGGCn miOJ TTACAAAA TTCAACAACAGAA TCAATACTTTGCA 
TAAACATTATGGATG C rTTTTCTOTTTCI ACCT 
> Sequence 475

gtacg a ttgagcontttgg aagcccctctcccgcgg7ggcgacagggtt 
acattggtaagggtgacagttagaaggggaagtccttttagtgaaataga 
tgagaggttttaga 

>Sequence 476

llll II I CATCTCCCACTTTTCACTCT ATTGTGACTAACAC 1 L 1 1 llll I 
TC A 1 CI CTT AACGA rCTCGIT I ACTCTATTCTATATATATGATTATTCAT 
TCATCTTTNAI LKLI IU GAGTGGAWCTTTGGGAGGGCTKTOOGCGGWOGC 
TACACG<rrAGGAACCTTGCAGCTTACACTGACAGAGCrCCCATTCAOGAG 
CCCACCACT CATCTCO A TTTCT GGATCTCTAGGGAATGAGTAGAGCTCCA 
CCTGGATTCOCT I fTCCAGI I PCI 1 ATGTCCACAAGTCACTGTGCACAGA 
TAAGAGTGI I CGI 1 CI CAAAACTCACAGGGCTCAGGGTCAGCGTGGAATT 
GGTCCCCTrCACTXCTCACCTTCtXGCr^ 

CtCCAGGAGAAAGATGGGGATTXACAGCCCATGACACCTACATGTCAACA 

tgactcag'Tctccaatxtgagcagcaatccggggtccaggggagatctca 

ACAGTAAATC GI I I11C1 I G GACAAATTAATTOCA C C TC n I rCTGGT 1 1 
TCCCCAATCAAATTAAAC 1 1 C 1 1 cacaccacattaaattgcagaaatctt 
ocatocttataacaattaaagtggagagttggatttccagaaaggtggtt 

gaaattccca TAATCTAAATTCTTACTCCAAAAAATTTTGGG AGCTGG AG 
ACCCTTGCTTGGACCAGGCAATGGTACCGAGCCCCTrTTrGGAAGTTGGG 
GGAGGGATCACAGAAATAA04 

icici rcTCTtxn rnrrcn icgic i ia tcttta giii hi hi i iu ca 

CI I GUI llll l ll l llllll I mm — 



tagttatacattattntn 

GGTACT 



NTrTrnTGCTTTGAO 

1 1 III 1 1 I II I M I I 1 1 II 1 1 1 I ITTGGCAAAAATATTTATTAAAATGAT 
TTTTTTAAGTTTGAACTTTATTGGAAGGAGTCtXTCT 

ATCCT^AGATAAATGXXrrAAG^ACCACATATGGAATATAAAGOlTniATTT 
TTTAAAAACCACATAGTAGCACAGTGAAAGAAATGCAATrCTCCAGGGTC 
TTAGA OAATTCAAAGG GGCATCTTAGGTOGTCTAAGAAACCAATTACAGT 
ctcatgo 1 1 1 rocn I IGCTTCAAGATTAGAAGAGTCAGGTTAOCACTAC 
C IUI II I lAGAGXrrAGAATATGAACTTTCTACTAGTCCACAGTTTACTGG 
TCAGGTCGOCOCAACAGGCTTTTATCTTAGCCOCATTn^ 
GCXIAGATGATtrrCTATGTOCXIAAGCAGCAAGGC 1 1 ICIGC I U ATCTCTAC 
TTTAGATCCATGAACCGGTTTTC 
>Sequence 478

*^AOGAiTAHCCNAATATGAGXXTOCTAGCGCOGTGGCCGCCGAGXnACC 
TCCATCAGGCATAAGAACCCATTCOC C I UCC I \ O TTCCCGTGTGCTCTCG 
CCATTGCACCATCCATGAGACGCAj L il i m i A TAGAAGTAAAATTGCCTT 
OCTGaGAAAAAAAAAAAAAAAAAAAAAAAGTAOCTGOOOG 
>Sequence 479

TGCCGATGATCCGAAGCCTNACCGCGGT^ 

I it » u uCCTO C^TCTXaGCTGGXXXXXATXACTCT^ 

CATGOI 1 1 COCTTKlATAAAAATrTTCTAACTTGGGTTGAATATTTGTAGC 
GAAGTATCTCTTl ICATTII I IICACGTTATAAATAAAAATACTATGCTG

GCCGGGCCCGGTGGCTCACACCTGTAATXCCAGCACTT^ 

GTGGGTGGATCATGAGGTCAGCAGTTCAAGACCAGCCTAGCCAAGATGGT 

GAAACCCCGTC1CTAOTAAAGATAAACAAAAAATTAGCTGGGCTTGATGO 

CATGCGCCTGTAATOCCAGCTACTOGGGAGGGTGAGGCAGGAG^kATCGCT 

TAAACOZAGGOGGGAGAAGGTTGTAGTGAGCCAAGAATGGGCCT 

CTTTCAGCTrATCAATAGAAGGGAGACrGGCACCCCTTrAAATTACCTTT 

CAATAAATTGGTCCTTGCCCGGGCGGGCGCTTTTAAAACTAAGGGAACCC 

CCCCOGCTGTAAGGAATTCGATA TTAA AGGCTATTXAAATACCCOCOGCC 

TTCGGGGGGOGCtXGGGTCCCAAri | rOCI 1 1CI 1 1 1 AGGGGGGGATATC 



aaoggaggtotagactgtccagctgccaaagtgg^ 
gg aoca tgaaaggatcttaatgcaagtxatgaacttgaatgtgccgatga 
ggcctggcattct i g i ccagagacagagtaaggaagtgttggccacaccc 
ttagaaaacagaagggacatggaggcagaaaaaaaaaaaaaaaaaaaaaa 

AACCTACCTG 
>Sequence 481

CCTCCAC C 1 1 C II 1 1 1 1 rCATTGTfCATTrCTCTATACCCOACC*TCTTAT 
Ol I I I I IA1CI 1 1 I C 1 A TTATACCTCATTTAATAAT A 1 IGI ICI 1 C1 1 1 1 
TAAGTNNNKNKNAWNAATTATT I Gf I OG ATTCTOCGAOGOGAAAATCTGT 
AATC TAGAAGGCGAAGCCTCTATGTGTIGAATAACCCAAAT TT G r iCA TG . 
1 1 1 IG IUGCCAAGGTGAGGGCTGCAAGTC'1 IT1CTAAGGGTTGAAACATC 
AGAATAA AGGTAT GGTGGCAA U rCCTCCTTCl GCTAGGCTGGCTCGCAAG 
GCCCTATCTCrTOAOCTAGGTGGTAGTTACAAGGGTATTTATTTGCCTTA 
TAATAATTCACTAAACTA TGTATTTOAGTAGATTTTTA f C fOT OT G CI 1 1 
AATTTACAAAAAAAAAA AAAAAAAAAAAAG O 1 C OC I I 
>Sequence 482

CTG AG AGATCCCCTCATAATTTCCCCAAAGOGTA acca t g tg t g a ATAAA 

ttttgagctagtagggttgcagccacgagtaagtcttoccttgttattgt 

GTAGCXlAGAATOCCGCAAAACTTCCATGCCrAAGCGA^ 

CGM1 QUA inn GACTGKj TT AGCCTGGAAGTGCTTGTCOCAACCTTCT 

GATTAGCAGCATGATATCAAAACGCIXrrGAGCTGCT CG rrOGCCT 
GTAGGCXrrAGTCCGTAGGCAGGACTTTTCAAGTCTCGGAAG O I I ICTTC A 
ATCTGCA 1 1 CCA. 1 1 CGAATAQATATTAACAA O I IG I M G GOTGTTCGAAT 
TTAACAG 
>Sequence 483

TCTTATCTTtlArrCClTAGTCTAGAAATTTATTTTGA 
ACTO"AACTTA TTCIGTTTCTTCA ACCATCACAACTTTGG Oj I IO OI I A T 
AAA ATNA TATATTm IQCI 1 1 1 KCNXMATNATANACAGGGNNG IT GC 1 0 
ACATTTTACAAGCGCTCCAOCCCGGTGGCGGCCGAGGTA CTCTTCAAAAT 
TGTCAAGGTCATGAAAGACAG<!AAAAAGTGAAGAAT7/CTTACAAACTAGA 
GGAGACAAA GA TTGGAGAAGAAACAATGACTGG<rrGGC<^CCGTGGCTCA 
T GCC T G T AA TO CAC lTTGC CAGCACTTTGGGAGGCOlAAGAGGACAGATC 
ATCTTA*WTITGGG^AGTTGGAGaOGAGOCTGACCAAGGTGGAGA^ 
TCCCTACTA AAAA TACAOAATTAGC TO OO 1 0 1 QQ 1 VKi I GCATOCCTATAA 
TCCCA GCTACTT GGA^lGCCTCGGCAGGAGAATCACTTGMJOCCOOaACG. 
CATAGG 1 1 U I GG IGAGCCAAAATTGGGCCATTCCACTCCAGCCTGGGCAA 
CAAGAGCGAAA I ICTGICI CAAACKATAAA TAACT AAAAAAAAAGT ACCT 
GOCCGGA 

GGAGATGTGAAC^TGTG TCATTGCTCI CAAGAOAAGGATCTGGATGGCC 
GTAGTCACCTCAGAGATGGACATOGAGOGGOOfJCOCGGGO^GGTACACAA 
GCTTTATTGCCCAACACCAACC AGCCAOCCTGGCAAACAATGAAACT AGA
CTCCCTCAG AAACAOGAAAC ATCATATGTGTGTCATCACAOCA TOG AG AA 
TTTAAATCATCTXJGAAGTTCCTGCTAAATTAAAGCATACTGTGCCAGAGC 
TCCCCTCTAATCAAAAAACOCTGTCCTXW 

ACAGACUGAGAGATCAAOCAGTCAGGAAATCACAGACTCTTACATGAGTT 

TACAGTTAA OCXXACTGCACAAAAT aat aaatt agcca taatttggtttt 

TTTTCAAAAACCATCCCCOCCACCTGACCCCACAACACAACAGGTACTGG 

CATGCOCAGTTTATTAACAGATGGGCCTAAAACATGCTGGGGCGGAGAGA 

CXGATTACGGGTAATGCGCTTTGCCCGAGAA 

>Sequence 485

ACATTCCTCTATTATACCGTATTCTCTTATCTA 

TCACTTTCAACTATCTCCGGTATCrCTGTATATACGTCAACTTACTAAAA 
CATATA ACNNNAATACTCT TGTfnTTTTTGCGCACTCC^fGGTGGCGGCC 
CAGGTAC I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 GGGGAGGATACTTTCATTT 
TTATTT TATATCGTGAGGTA TTGTTTGG ATTGTTACAATG AACTTOCATT 
TCTTTTCTAA TGAAGAAAATAATACA G AGGAAATAACAAC AACTAAA CCT 
TTGGCCTOGATTA7X1ATCGGCTGGAAATTCATGTTGGATGCAAGTTTTT A 
TTCATAACAAGTTATTT1TTGOTITATATGCAAAAAATGTTCATTGAATG 
CCTCCTATTTGGCTGGCACTGCCTAGGCACTTTCAC^ 
AATCCrcACAACAGCCCTATGAGGTAATCATTGGTCCCAGTTTACAGAAG 
CCTTGGGTGGG A GATTATTGCTTG AT ATACTTCTATTTGCCACACATTTT 
TGTTGGCAAGACGTTt^TATCGGCTGGTGATTCACTGGTCAAGAGCTCTC 
ATTGGCCAGGAGTTCCTA 1 I IG I I G CTGTAAG ATTCAAATAATCAAAATA 
CTAGAATTTTTOOCCCACAAGAATGATGGGACCAGTGGCATAAGAGTAAA 
GGAAGG AG AGTCAGTGGGTCTtTlXXXTTGAAGCACCCATrG A GTTGCATG 
G 

>Sequence 486

TCACACCTATCTTCTCTTCTCATTTCTCOC A TATTATTAATA CGCTTATT 
TTCGTTOCTCATCGTTCTTATAAATCGCTGTTACTACTATACACTTACTA 
TAAGATGAAAACTTTTAGC>(NNNNANANhn4ATGGGTCCTGTGOGCCCTC 
CGGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGTGTGGATTGAACAGAAA 
A TTGG AAA TCAT AGTX^AAAGGGCTTCCXrrTGG TTCGCC ACTCATTTATTT 
GTAACTTGACTGGG O I G 1 1 1 >C 1CAJ 1 1 AAAAATTTCAATTCTCOTGGTAA 
CAACOCAGAGTAGAAGGAGAGGGTOACTTTACOGAACTGACAGCCATTGG 
GG A GGCAG ATGCGGG TGTGO A GGTCTXKXXnXiAAGGT AGTG ACTGTTTG A 
TTTTAAAAAGTCTC ACTGTCAGTTGT ATCT G T'l'GCf 1 *1 TCI C AATCATTC 
AGGGATACAAATGGGC I ICICI CATTCATTAAA A GA A AACGOGACATCTT 
TCT AAG ATTCTCTG TGGG AAAATG A CTGTCAA T AAAATGCGGGTTTCTGG 
GCCAAAAATrATAAATTTATGGAATATATAATACTAATAGAATAATGTTC 
CT 

>Sequence 487

TCCIC1 I t; I TA 1 I IC I 1 1 I IC I 1 1 A I 11 1 C I C 1 ATTAATCTAACCTTATT 
CTTAATAI 1 1 1 1 I'l l AATT ATCTTACATCATTXXnACTTCATTCTGAACT 
TTATAMAAAGANATTTTTGATGACACCTCCTAAGCTGGKXKJiCCGAGGTAC 
TTGTTA 1 1 IGI I ICI ATTATTACTOTTTGACTTCIOOOCAGGGTTCAGTC 
CTCAAGGGGCCATCCrGTOOCACCATGCAGTGGCCCTAGCTTAGAGGCTC 
CCTCAATTCCCCCTGGOCACCACCCXXX*^ 

G I C 1 1 G I U I GCATTGCTGTGAATTAGCTCACrTOGTOATATGTOCTATAT 
TG<KTTAAATTGAAACCTCGAATTXrrcGGGCAATCTATTAATAG 
AAAGTCAGTAACTTACCCTTA 
TATTCAGGGGTTT1 1 IGTGTACCTGCCCG 






G ATCGTCATrG TTATTACTrGATTCTTATTTTATTTAT O I rTTTGTTTTA 
CTLI i ICI 1 1 1 AAATTTCTGATGTTA 1 1 1 1 i I J 1 1 1 J 1 G I ATOGTTTATT 
TTTNAKNNAT TINTTXK XKKjCTATAGGCN C I ' I CI Cexr COOGOOOOCGGOOO 

TATTTTCAATATTT 
GTCTTATTAATATT I TTC'1 1 ATTTTATAATOCAATTACAACGGTTTAGGA
GACAAAACAATATAAACAAACGAATGTTAAATAG 1 1 1 1 1 i l l AAAAAATA 
C O IG I IG CI I G CAAGAAAGTCCATATAATCTTATTCCCOCOCAAATATA 

attttatactttgcactaaaccaaaatagcitatggaaaattagtattaa 

atagctaaacacagaaaacxtacagctataaataacataaaatacagttt 

^actttaatctgatgcltaaacaaagcaaactatgatgcaattroaatca 

actrcattaattggaccagtccagtggggcacaatrttgattagooctaa 

cccctcattggtggccagtgaaacctxxacxxxiagcaagggcctttctgg 

TCTTGGGCTA TGTCCA ATTCCATTCGAG AA AGCCACAG TTTTT ACA TG TT 
CTTGACI 1 1 1 1 1 ACTGAG AGG ACGCCAAN 
>Sequence 489

CATTCCC1 I ICI ICI I GCAT TTATATCTTACTTTATATATTTTATT ATCT 
ATGTTTAACTATTTATTTTTATT ATGATTTTATTA ITI"! C1CTATCATAT 
TTATTCTATTNNCIG1 1 IGl 1 1 CTGG ATATCATTCCCGTGGTGGCGGCCG 
ACCG A A ACCTGGTG A AGCCCTITGGGCGATTGGTG ATCACCCCTAG ATCC 
GTG A AAGCTGGCTGCCCCCCCATCCGGGCAAGC AGGGCCAAGG TG OCA TC 
TTCACATTCCT GGAA CCCACCCAGTAACAGCAGCAGGTA 1 1 ICI ICIG GT 
AAATGA GAGCCTTTfCGAAAACTTTCTGCCCTCAAGTATTTACCATAAATT 
CTCTTTAAAGTGACATGTTCAGAATCAGG^GCTI*AGAGTrTGAAGTAAAGA 
GTCA I 1 I CTTAGTTCACCTrTTCAATTTXjTATAACTTT AGCCTCTGCCCTT 
TTCAAAGATTTTTGGAGAGTCAA 1 1 mci 1 1 igi i ca ta ci IC I I I 1 IC 

agagg<>catgtrggctrcaatgtixmgcaccaccattcaatgaaaaccct 
ggagattattcttaccagcttocto<ictggcgatocaactgatctgccct 
tccagcttaatacccatgtccttaa atca 1 ictitntcagtaattggct 
tg attrccctggcagcttaaca 1 1 iigii aaaagtctttatttcatgtgc 
ccca agagtccttrgccgttrraaacrgtto acccccggctix3ttj 
tataaacctaaaaatctgtccctt 

> Sequence 490 

GGGGATATGTCGAGTCTtXXXKXjGTGGCGGCCGAGGTACCTGATTTTAT^ 
TCCAGTTTTCATCCt*^ATCCACTGGGGAATGGGACGATTTTG C I I HOI I 
TCTTGGCCAGGAATCGCTTA ATCCTGAAAG TC T r GT G AG AAG ACA 
>Sequence 491

TTOTAAGAGATGAGCTCCCXXGGTGGCG^CCGGCCGGGCAGGTACAAAAA 
AATAAAAAGGAGGCTGGTGGGAGAACTTGCTrGAG<*OCXlAGAGTTTGAGGT 
TACAGTGAGCTATGATCACATCACTGCATCCCAGGCCTGGGCGATGGAGC 
GAAACTGTCTCITAAAAAATGGCAGGCAGTrG^GGAGCTGGGO^GGTGCA 
GTGGCTCATGTCTGTAATC>CCAATAC^ 

CACTTGAGCCCAGGAGTTTGAGACCAGGCTGGGTAACACAGGGAGGACCC 
CGTCTCAAATATITAAAAAATTAATCATOCGTAGTCGTOCA 1 I CCI ig go 
GGTTCCAOCTTCTTGOGGAGGCTTAAA GTGCAAGGGTT A GCTTTGGGCCT 
TGGGAAGACACAGOGTTCAAGAGCXrriTGAArrTGAC^ 
AACCTOGGCCGATAAAATGAGAACC C 1 1 1 IC I I AAAAAAAAAAAAAAGGG 
GOAG43GGC?CCCCCCC CIGl 1 1 1 1 ATTTTGCGCAACAAACCAA I I 1 1 1 1 1 1 
AAA 

>Sequence 492



TTGTTACXnxrrCGAOCnrtXX^XXXrrGGCGG^XXJAGGT ACATGAG AGATAA 
TGTTATGACAAGAATAGl I ICI GCAACATTAAGTATGGGTCAAAAAAAGA 

GTATcACA^jG^ 

CCATCTCTACTAAAAAAAACACAAAACTTAGCCAGGCATGGTGGTGCACG 
CCTOTAATCCCAGATACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACC 
CGGGAGGTGGAGGTTGCAGTOAGCCGAGATCACGOCACTGC^TrOCAGCC 
TGGGCAACAGAGCAAGACTCCATCTCCCAAAAAACAAAGAAATXI^CTTTA 
GACAAATGCXrTTGAATGAAATTACAAAGAGGAGGTGCATTAAAAAATACC 
AGCAGT AAAATCTCTTG AAG AA TT AAAATG A CAGGCT AAAAATAAAT A TA 
AATG 1 J CI 1 1 1 1 AAAAAATACTAGGTTGCTGCTGGC 
>Sequence 493

GGGG>WAATXX}ATAGACKnTMCCGCGGTGGCGGCCGC£XXKKK^ 
CGGG«nXIGCG<KX3TTGGGTrGAG 

OTTC IG I G ATATG AOCAACAATGG ACCAGAAG ATITTATCTCTAGCAGCA 
TGCTTTTTTATXrrCATATTAGTGCCTG 

GTOC ITU GA AG ATG ATGATATCACGCACGTTG AAGGAAGTGTAG ATCCT 

A TTCGAC ATA TAGAA A TA ATAC ATG A AG AGCTTCAGCTTAAAGA TG AGGA 

AATGATTGGGCCCATTATAGATAAACTAGAAAAGGTGCCTGTGAGAGGAO 

GAGATAAAAAACTAAAACCTOAATATGATATAATOTGCAAAGTAAAATCC 

TGGGTTATTAGATCAAAAGAAACCTGTTCGOTCTATCATGATTGGAATG 

ACAAAGAATTGAAAGTCTTGAATAAACACTTATTTTTGACTTC 



TTAATTTGATCGAGTtXACCGCGGTGGCGGCCGAGOTACTCATGGTTGCT 

GTAAATTAG&CAGCCGTTCTGCAGGGTTTTGCTTAGCCAGGCTCCTCTGA 

GATCTGGCTATTCTGTCTTGTGGATTTTCAGTCCCCGCGTACCT 

GCGGTTCGA GCGC TCGTCGGGTCAGGTACATATACATTATGTAATTAAAA 

AGCGTGCATGTTTATGTATTAAAAATAATTGGATTTAAACAAATATTATA 

TATACATTATAACACCTAAACGCATAGGCTGTTGTTATTCACAATAGTTA 

TACCAATATTATTAATXJATGTGTATGAAGACACAATACAAAGCTGGAGGA 

AGTATTTAATAGGT ATAC TCAACTAATACACATAAATTCTAAGCAATAAA 

GTACGCAAATTATCTTTTTC<MTGAATTTTCAAAATTTGTCATAATAGAC 

TTATATTGAGTTAAAACTTGTATAATTTTTGGAATTTTAAACTTGTGACA 

AAAACTTTTTGTCAAATGTTTCTATTAAATTTAT 

>Sequence 495

GATCGAATCAATCGGCAGCGCTCGAGGTACGATGGGGCATATCTGCAGAT 
CTCAAGATCTGGACTTCTGTTXjAAAAATTTTCCACGTGAG<nTTACTTAT 
GTCTGTAAAaATGGGAAAAAAATACAAGAACATTGTTCTACTAAAAGCAT 
TAG AGGTCATCAA TG ATTATCATTTTA G AATGGTTAAGTCCTTACTG AGC 
AACGATTTAAAACTTAATTTAAAAATGAGAGAAGAGTATGACAAAATTCA 
G A TTX3CTGACTTGATGGAAG A A A AGTCCCG AGGTG ATGCCTGTTTTGGCC 
AAA CT A A T AAAAATnTCC AAG A TATCCCA CCCCTTTG AAC A GCTG G TTT 
AAA CTTT AAA A AAAA AAAA TG TT AA AAGGTAAAGGGCCCCCCCCCTTA TT 
AAAAAAAG AAGA AAAAAGG GAGGGGG O 1 1 1 1 I CI ITT 1 C OCTCGGCCCTC 
CCCAAAAAAG CG I G 1 1 I I riAAAAATTGGGGGGGGGGGGG^CGCTCTTTTT 
I 1 1 I GG III ICI AAAAAAAAAAAAAAATTTCCCCCCAAAAGAAAAGGCCT 

tccccccaaaggggaggtagggcc 



TGGAGATGAGCTCAOCGCGGTGGCGXX3NCGXjCCGGGCAGCTACCGTGAAA 
AGGCAC I I C I CC I I GAGAAGOCTGACAGTGTTCGTTAATGTOCTGCTGGCG 
CATG<rrGAAAATTTCAGGGGAACAGTAAAGCACCCTtrrTT AATTTCCCTT 
CTCCAAGCCCAAOCTrTTOCAGGTAACTGGAGCG C I ICC I C ATTTGCATA 
AT AGGCAG TTTCAA T AA CTGGGG ACTTTTCTTCAA G ACCACA CACA CA GG 
CTCTGGATTAAAACCCAGAAAATTAAATCTTGAATGGTX^ 
GTGGAGAATOGGACCTTGGCGGAOCTTGGGCGG 
> Sequence 497 

TGGAG CTCAC?OGCGTGGaiOCCGAGGTACTCGGAGC^^ 

TGTTGAGGTXrrATTGACTTTTGAAA G<XXXK>.TTrrGAACTCACAAAACTCA 

AAGACATGAGOGCAACAGGCACAAATGTATATrTAGGGTGAA€GTGAGAC 

CGCACATTGGA TG I C 1 1 G I GGAACATCATGAATCAACACACATAGTAOCC 

CAGCTGTGATAACGCATGGAGATACACATGGCATGGGGCTGCATATAGGT 

TGGATTTGAAGCCGAAACAAGaGGTCCCTACTGAAATGAGCATTGAAAC^ 

CACAGGTTTATTATXSAGGACCGAATGAATATATTAGAGAGCXXTAGAGTG 

GCCCTGCGCCGGA^CXXXXKIACATGAAGCAACTAGG 

CCTCTGTGAGTG^TAGC^ 
CAAGCI IIIHI I GGTTT COCCCTTATATAOOTTGG AGGGGOGTTTAAAA
TTTGGTCGGCGGCTTT^ 

CCTGTG 1 1 1 1 icCCTTGGGTGGTTGAAAAAATTATGGT ATATATCNOCGC 

TTTCAACCAAATTTTCCCTACAAACAG 

>Sequence 498

TGGTTGAGCTX^CCGCGGTCGCGGCCGCCCGGGC^ 
ACTTCAGCTGACTGAATTrAGGCAGTTCTOG 

CAGGCCTCCTGACOCATGGTATCTACGATGAGATCCACKTGTCCATTATA 
CACCGTCACGTTGATDOCTGCCTCCAGCAACTTGCCACAATGCTAATGAC 
TGGGTTrAAGGAAGTCCTOCCOCATGGTACAAAAACACX^^ 
CCTOOCCAGAATGGX1ACCTTAAGGAAATATTTTGGGC1 1 I I rTTTTTTOG 
GGGGGCCA TT A TT AA A CTGGG TTT A AGGGCTTCTTG TAGGGGG GG7T A CA 
AGGG<K}CGGACAAAAAACAAACAAAGC>GGGGTCTTTGTGGAAATTTTAAA 
CACCOCCTCCCCCTTTTTGAACCC^GGGGCXX^ I 1 1 1 1 1 1 1 AAAAAGGT 
TTTTTAAAGAGTT 

CCCTTTTTTATGGGGGGATATTXCCCCCCCGGCG 
>Sequence 499

TTCT r AT^ATA T OCTT AT ATAT TTTATATATATAA l I I I ICIT ATTACTTC 
TTTCATCTCTACTTTAGCTTTACTCT 

TTCTANATTCrATCTATTrrAGANAGCATGCCGGGGCGGCCGAGX3TACCTC 

TTC AG iv-i i «XAATCTGCTTTCGGTTCCCCAATCTAAGACAGAAACATAC 

TCAI 1 1 I UCCATCA CTGG ACTTCCAGG I IGI 1 1 1 CA ATTTTTAACTGTTA 

^*- ^^C^AAG GTGGCAACATTTATCTACAAACCTCTGGATATACACGTAGGA 

AGui 1 1 1 GCTTATTTCX1ACTAGTGAAACTGCTCAGTTGAAGGGT ATGTGG A 

TCTTCATXTITTAATAAATATTACCAACATCTGAAAAGCCXX^ 

AGG ACTGGC A AGAGTGM^ ACATGTGATGGGTTGTXKjAATGGGAGCTCACTO 

T A GCAGGTGCTGGGG ACTCA A TTTXHjGTCTTGG AGAA GCACTT A GTTA TA 

GCAAGAATGTCTCATAAATTKKjTTCTGATAGAGACCAGAGTAaTGKXXjAA 

T AAAAAC TAGTTGGCTAGAAATAATTATTGATCTAAAGTCAACAAAAAAC 

CCTTTTTAATGTAG 



« 1 1 1 i u A I C 1 1 CGCTITGACCGCATATCTTATTCTGCTTATTATCTTAC 

TCTAATACTATT ACCTTA TT ATACTATCTAA CTG 

>Sequence 501

CTCCGCCTTCT ATTA TACA TTGTTA TTTG A TTGTTA TCTG ATA TGTTTTG 

TAATGCTCTTCGCACTCTATCCAGATATATTTA 

>Sequence 502

ACTCGCGIH 1 CO 1 1 AATTGCTATCTTATTTGATTTCTTATT L 1 llllll 
TTCA 1TICIC1 ATTTATTA 
>Sequence 503

CCIC1 1 ICTC ATCATICT AC TTTCT TATAATATCTO 

ATTATAAI ICG ICI II IA ICI 11 ICI A CATCTANAATCG lGl ICI I A TCT 
TTATGTACGTATACTXXTACTATTAIC 1 1 C I GACTATACCATCKAATACT 
ACTTATGGTG A TGGTrCACCG CGGTG GCGGCXXSAGGTA C 1 1 1 1 1 1 II I II 
llllll 1 1 1 ! ri ATaAATTATTTA I ITICI IJCICA GAAAAOGAT G TO O TC 
TCCACITAGCAAGGCATGGGCATGTA I G 1 GG 1 1 1 1 1 G CATACTOCCCACA 
TGACGTGGGo 1 u I o i t I ATGACTGGTCCGCTTCTATGA^XTAGaTTGTG 
ATACTCCCTCGGGCCTGTCAaX*^^ 
TOGTAT AA<XXXTITOGAACCrrrT^ 

CCCAGGC 1 1 1 1 1 1 1 IGI I IC1CCI 1 1 ATATCATTCAGCGTGTTTATAATT 

T 0OCX-A CTX>CCTCIOTGOCOCnTATAATtX^IUIG^ 

Gi i rCTACCCTGGT IGG1GAAAAAA I 1 IGI 1 1 IA TTCCCCGGCTTCCAAC 

AAAH ITGI I uACCATTXAATCTATTATCTGAAGTTX'CTG TGGGAAGTCAA 

TAAATA 1GI I IGU IA ATAAGT C IC I I G GIGIG ILIGCIC 1GA AATTGCA 
CTTlXK^CTCTTrAAAOCTtXAOtXATTTAAATTTCGOOCTTTOGCCGOC
TTCAACTTGGCCCOCGCCT'T 1 1 II CCATACTTCCTGGGAAAOOCCTTTGT 
CTCTTCTCCCAGTCTTGCA 1 1 1 1 1 AATTCAAATTCGGTOCCACAOOCCTC 
CCOGOGOTAOAAGOGOOOQGCI 1 1 rGOOTA TTTOOOGGOGCO CTOCI 1 AT 
ATAACCTTGAT 

>Sequence 504

CTTAATtJAACTCATGCrrAACrrCACATTTAATTTOCCC^ 
CATGCTOCOCCTTTTCCACTOCGGGAAAACACTGGTOGCTGCCCAGCATG 
OCATTACATGQAA TTOCGO0CAAOG00GOCCGCTGACGACGOCCCCTTTT 

CAAGCTTTAACTTCA^ 

CAGTGGGG ATAA CCCCATG G AAA AG A A a Ca TTGGTC A GCAAAA A GGGCCC 

ACCTAAAAAGCCCCAGGTAACCCC 

>Sequence 505

C ACACACTTCATCTGT A TCC A TTA TCATTCA TT AA A CTT ACTT A TTTCAC 
ATA CA TGTT A TCT ACA TTATTCT A TGT AT A CTTGCA TTG TCA TTCA CTCA 
TCA GTCT A TAATTA TATTATT A TTTGAAGTAG ACCACTCG 
> Sequence 506

CACTACCTGCTA rCGTCCTl GNCTACNTGTATOGTCAGT ATCTACATCTA 
TCTNGACATCTATACAGCTT AT>fTATCGGTTCGTGTANANCTATNGTATC 
TGTACT G I G TGTCAGTCO ATATCTCACATCCOCG ATA I OGT TT CTO TATT 
ACGTCTCTCTOTCTCn ATTCATCCTATOTCATATTATAhfTN ATAATCATA 
ATOATTrTAGACTCACXXKXXrrGOCOGCXXJCCXX^^ - 
GGTGAGAGCtnGAGCTGCTGAGATTItjGGAGTCTGCGCTAGGCCOGCTTG 
GAGTTCTCAGOCGATGGAAGAOTTCACTCA TG 1 1 IC CACOCOCGGTGATO 
OGTOCm 1 CCCAAG AACAAGACTCTTGGCTATGGAGTCCCCATGTTGAT 
GGA TCCTGAGCTTG AAAAAAA CTG AAA G AGAATAAAATATCTTTAGAGTC 
GGAATATGAGAAAATCAAAOACTCCAAGTTTGATGACTGGAAGAATATTC 
GAGGACCCAOGCCTTGGGAAGATOCTGAO L I ICI I CAAGG AAGAAATOCA 
G AAAGOCTTAA G ACT AA G A CAA CTTG ACTCTGCTG A TTCTTTTTTCCTTT 
1 1 1 1 1 1 1 1 1 1 1 AAATAAAAATATTATTAACTGGACCTCCTAATATATACT 
TCTATCAAGTGGAAAGGAAATTCCOGGOOCATOGAAACTTGGATATGGGT 

aatitgatoaacaaaatctttactt aaaggcaaogttoci ro ooooro 

CACTCACOOTCGCTA rrCTTTOCTCTOI I a ttaaatgaogttcaatctac 
G TCA CA TTCTT A TTT AA TTT ACTA T ATT A TTTCTTA CATTTT A TTCAT AT 
ATAACTCATTTCTTATCT>nTCTClAAGTTTGATGTACGGGTGG ■ 
CCGGGCAGGTACGCGGAAA TCCOCTAAC U CC I I U CTATCrTCOCATOCC 
ATATTTAOGTTAGATATGAGAAG ITGTQ1 ATGCTGTGT ItHXTTOCTOTOI 
GGTTOCTACGC^CATGTG ACTGA TACTTGTAAT ACA CT AAATTG AAGT AT 
ATTOTQAATAACATATCACTOCAATTTAATGGAACAAACATTGGACAAAA . 
TTTTCATTTTAGGAC I fCICl A ATTCATAATGATGTATTOCA O 1 1 1 C I C I 
ACAAGCTTTGGCTATTTAGTA TATCTTAGCTACTTAAACATTTCTAOAA T 
TCTCfOOACATOGTI 1 1 ICICJOG1UOGAATATAAAGGTCAAGGGOCTCT 
TTAOCAAGTTCTAAOOCAOCTCC1 1 1 1 1 A AOCCTACOTCT ATOTAAACCC 
A GTTTAATAATCT AA TCATAACAA GGCAAGG ACCCCCTTTTAACGGTTOG 
TATATTTTTTAGTTOAACTOCTAAATAACAATOGATACCrTOC^ - 
TTTICICAGAAAATTOOCTCTAAOCACAATGGAAATTAOjOTOOGOGAAGG 
TTGAACTTAAAAGAATAACTTGGAGGAAAAGGGTTATGAAATTTCAGAAA 
TTATOGGTGGT TTAA TA HKIIUiI (XAAAAATA TTTCTTA TTOCTAOO 
GTTMGCATtjAATTTTAOOOCTTAAAAGGACCTAOCAACOCATTTAGTGAA 
ATAAATTGGAGCGGGATGTOTTAACCACATTOATTGTCAATAAAACAGGA 
TACAATCT 

>Sequence 508

CTCQCTCC1 1 iAICI TCI I A TI II ATOGATGTGATTGTATTTATCTTACT 
TATCGTTGTA

> Sequence 509 

CATOO IC IL II reATATGOCTCAAWWOTCAACWXXICOGTtXXiOGGOTTA 
TATT CTACT VTTNCA IC I 1 1 1 1 C ACITOINANOCAAACAOOIOCTCWNCT 
TAKNCTTT>0^MTCAATNCANTTNNOCrrAAT>m 

TCCA TT AOJCA>W AANHTNTNNNCA TTCANNN CCACAA TCCGGGGGOGGO 
GGTTWCTOGGCCACATCANCAAAAATCACATCCACCATTGQ^TCCCNCN 
TACCTGCCCG 

> Sequence 510 

AGGACATrCTCATTTAAACAGTTTAAANAGGCGGOGTGCGGGAGGOGGAA 
A AAAAGA A ATATACCCTGGCAGCGCTGGCGGOCGGAAAGCGG AGAGOG AC 
GCTAAG ATCaGCAAATTCGCCAGTTTGGATO C I 1 G 1 CC 1 1 1 I C CGCCCTT 
TTOCCCCCATTAAATCCAG AACCCGTCACATGATAATT AAGAGGCGGCGG 
CAGTTCCOGCTGCTCAAACGACTGCGGTAGAGGATCOCCOGOCTACCT 
>Sequence 511

caacttgt agcct agnctjggggcgtttxcxxlagchactacl i m itinnn 
nacahwownccc^nncccaaaoacttttcc^ 
agtgcagtaaaccaaatcagagctcactgcaaggccacacatt^ctattc 
cca gctaatt aaaaaaa i i rit ti'l il catacagatag agtcttgccatg 
ttgcccagactggttrrcaaagccocggaaccatgn i 1 ici i i o ggcgggg 
gccccccaaagggcngagaaaacagocaogacccacggcaccaagcnoga 
iwgagggcgggggagacgccgccaaaagcaaaacggcggccaaancngag 
ggagcaanncggggogaaaaggnaaacggaaccaccacgaaagaaaacca 

512 
513 

ngcgttaggagcactccgcggngocgctcoanngtttgatcaoqaggccc 
cgnag^icaccgaogaggaccagacgctgnnangaacatttattcaaagoc 
cacccggncacagoocnaaaggccaaccctttttggaggfkjccngggang 
caaacogaaaaaaagcnqgaaaaahwgaggagnngaagccaaacagociaa 
anmcngccaknaggaagngngnaaoogot1ticma g1 1 1 1 1 1 i nnoggtt 



TGCGT 



CCGAGGGGGGGCOCGGHAO 
GCGCGCCGGGGGGAAACAGGOGCAAAAGCGGGOOCGGGGGGAAAAGGGAA 
GCGGOGACAAGGG>AACAGCAAACOAGGCX3GGGAGGCAAAAGGGAAAAGC 
0GGGGGGGGCCAAGGAGGGGGGGGAAAO3AAAAGAAGAGGGGGGGGGGCA 



GGGGGAAGCCGCGGGAGAGGGGAGAGGCKXKiAGAGGGGGGAGGGGCAGAA 

GCGAGCGACGGACCGAGGAGGGCGOGGGGAGGGGA 

>Sequence 514

TCTTA CCTTCA GIC1 ICTC1G OCAOOCAOTCATTTATAT0C3 rOCTOCAC 
TCTTCA GIGICT lCAGCAAAGGAGAACTOCI CCAOCTCTOCCTGATAGAA 
CTTCTOACAGTAT ICT I lAAAOTCTOOAAGQAAATCACAO GlCl II IOC 
CAAAGAOTCTGTTGGCAGTTCTAAGCAAGT ACGCjCGGGTAAGCAGGAAGT 
GAAACXlACAGAGCTTtlAAAAAAAGAGCGGGACAGGGACAAGOGTATCTAA 
G AGGCTGAACA TG AATXXACAGATCAGAAATCCXIATGG AGCGGA TGTATC 
GAGACACATTCTACCACAACTITGAAAACGAACCCATCCTCrATGGTOGG 
AGCT ACACTTOGCTGTGCTATG AAGTGAAAA TAAAGAGGGGCCGCTCAAA 
ICTCC1 1 1 OGOACACAOGGGTCTTTCGAOGOCAOGTOTATTTOOAGOCTC 
AGTACCT 

>Sequence 515
GCGATTGG AGCTtXXXXXXK?TGGCGGTCGAGGT ACGCGGGGACGGCGGAG
CTOGCTCTCACXriTiGAGGCOGGTTAATTC 

AGACACACAAAGTAGACCTTGOGCTCCCAGAGAAGAAAAAGAAGAAGAAA 
GTCGT CAA AGAACC AOAGACTOOATACTCAOTTTTAAACAATGATGATTA 
CI I FCCTOATG1 1 rcToci I IAAGAGCTACATOCCCCTCTAAGAGTGTGG 
CCCATGGGCAGGCACCTGAGATGCCTCTAGTGAAGAAAAAAAA AAAAAAA 
AAAAAAGTACXTTGCCajCGCGGCCCCTCXi ACGTGG TCGCGGOCG AGGT AC 
AACTGCAGTAAGAGGGACOGTTAATTCACAGCTTCCAGCTCTTGGCGCCA 
G AGTCC GA TCCACTCCTGCA G A T AACGGTGATTTCGA TTCCGGG AGAA CC 
U-i i «-OAAAAACAACCCOOA TGAGACT A TCTGGCAAA TTGCA GCCCTTGG 
CGGGCITTTCAAATAGAGCGTTGACCXATCAAAGAAGGGGGACGTTACAG 
GCACTG AAAGAA T AA CC 
>Sequence 516

nTTOCTCl 1 C I AGOCCAGGCTGG AGTtiCAATGGC AGG ATCTCAGATCAC 
TGCAACCTCTGCCTCCTGGGTTCAAGCGA 1 1 II U. 1 UC I 1CATCTTOCCA 
CGTAGCTGGGATTACAGGCATCTGCCACAACGCCTCCCT AATTTTGTATT 
TTT ACTA GAG ACTTjGTTTCTOCA TG TTGGTCA GGCTGG TCTCAAACTCCC 

gacctcaggtgatcogccccoctcggccttjctaaagtgctgggattacag 
gcgtgagocactoxxjccagctatactct 

^ TT9^^^ *^^^^^ a ^^^ ta ^^^aaaagaacataagttatca 
to g to 1 i gggt aaa ttagcg aa a tcaa cccttcct a gg ttt agggg aaag 
tta1 1 1 1 laaaaacaacttaataaaacttacactcttatacaagagtgat 

TTCCOCI I AT TAGGATGCAT tSTTGATTAAACTCGAGATACAGCTTTTTGC 
AGATOOOGGGTTOGGTI I t^OTOT AACCTCTTT AACATGTCACAL' IGG11T 
TCAA GATTAAGAAAATA TTG AGTTTGAGTGTG TTT AATAAL. I I ICTO AGT 
TT7TAGAAGTCTTATTATTTTTAAA0AACTTAj\TAAAGGTCTAGATTGAC 
AAAN 

> Sequence 517

AGGTACGCGGGTGTTGATCCA G T f C I IOC 1 1 1 ICA ACGAGAAGGATTTGC 
AOGTCAGAGT ATGTCACiAAAAACGCACAAAGCAATTTTCAGATGCCAGTC 
AATTGGA 1 1 ICG 1 1 AAAAC^CGAAAATCAAAAAGCATGCATTTAGTAGCT 

C^GAGATGTGGGTCCTTCOT 

CTC1 «jCAOACCGCAGTIGCCGAGGTGACTTTG AATGGGGATA I rOCTTTC 

CATCGTCCA 

> Sequence 518

cacncagkaggoctckhaagcaggactaggcacanoc 

ccccgggggaagggnhgaagggagggctttgagggcngagggggaagccc 

ccgaaagnk>0*cotocanccagggcagaagagao^^ 

AAGGAGA GOGAAC AGGOGAACCAKCA CI 1 1 Hj I ICI I i gc ggggcackgn 

gcagggaccoodcacaaaaaaaaoaccnocooccaggaggggggggggca 
^a^tcgcaa^^ 

A tf ooo ogoooocaoqaaogosaaaacaaoocaacocaoodogcogacc 

°OGAAG<X»GGOGCCOXXjCOCCAGCA ta ggaaocct aagggg aggcgaac 
OCCOGCAACAGOT^CCAQAAC^^ 

CGGGGCCAAAGGCGCGGOCAACC^CCATAAAAGGCXjGGGGGCCCAGACCC . 

GOGGACAAGACGAAAACCGGGOGOOOQA . 

> Sequence 519

roOC TODO OCAOOGATOOOO CI I I CG AAGGTGCGCTTTGOCT C C G TI I AA 
A TAACTC AAGGGCGGAQAOGOI 1 1 ICtAJGGAGTCGGGTTTACCCTTGAAG 
ACej I n i AGCQAAATCOCCCAAAAGGCGGGAACCCAAAAAAQAACCiQ I TGI 
TCGAGGGTTCCATAGOJ 



TACTATGTTGAATAAAT 
G 1 1 1 1 1 1 ICCCI 1 1 IA A I 1 11 ICIOCI I C OCTAGTGCATAG AATTGAACT
GCTTAGGGAGTTTGAGGCTGCAGTGAGCTATGGTCATGTTACTGOGCT 

agcctgagtgatggagtgagaacctgcctcaattaaaaaaaaaaaaaaga 
aagaaaaaacagtgcagtggctcatocctotcatcccaacagt^ 
occaagg<^gag«attoxaggagttcaagaocagcc^^ 
caagaccttgtanrttrxaaaaactttaaaaatta g n ctgt ctggtgtg 

CCTGGCTGAGATGAGAGGA TTGC I ' I GA TCCAGGAGGTGG AGGCTGAAGTG 
A GCTATG A TTGGGGCACA GCAATCCAGOCTGGGGG AAAAGGG AACXXTTXjT 
CTTAAAAAAAAAAA AAAAAAA GAGACCAGGGCGCTTTAAAACTAGGGAAT 
CCCCGGGCTGAGGAA TTCAATTT AACTT A TTGAA TCCG TCACCTTAAGOG 
GGGCCGGTCCCAATTri IC I ICCTTTAATCOOGAAATTCCCCTTTGGAAA 
AAAGGAATAGI I 1 1 CC I GAG AAATTTTTATCGTTAAATTCCAAACATACG 
GC 



> Sequence 521

CCGOGCAOGACOCGGGCOOCTCTTAACGOTOGATCACTOGG C I C G I O O G I 
CG ATGAAG AACSCAGCTAGCTGCG AGAATTAATGTGAATTGCAGGACACA 
TTGATT^TCGACACTTXXlAACGCACrrGOGGCCCCCGGTTCCItXX^ 
CTACGOCTGTCTGAGOGTOGCTnXAAAAAAAAAAAAAAAAAAAAAAAAG 
GTCCCT 

AGGT ACACCTCCCCAAC^TCTCTTCCTra 

GCCIUCI I UCOCTTCGCCTTCCA CCAAG A CTGT AA U 1 1 POCTGAOOOCTC 
CCCAGC rTCCTGCAT GCTTCCTGTTJCAGCCTGCAGAAC^ AAGTCAATT 
AAA LCICI men IATAAATTACOCACTCTCAGGTA GI III 1CACAGCA 
ATGTGAGAACAGACTAACAAC>ATCAACTCATGCCTTTAACACAAAAAAA 
ATAGCTAAGTTCAAAATTAACATATTAOCACATOCAA C flCn I ATTCTT 
GAGAAAACAAAAAAGTCCAAAATCAAAGGAAAGCA CCCO 1 I 1 1A AACOCT 
CATA ICI I ICI CAGGQCTCACTOCAGTCTOQCCATATCTCAAGCAGCTCT 



> Sequence 523



TTGGAGCTCCCCGCGGTGGCGGCCGCCOGGGCAGGTACGCGGGGGAGTGA 
G A GGGAA CG AG A GT AA GAG AAA GAAAG AAGTGA GGGG A TGT AAACTCG A A 
TAAA TTTC AAAGTGCCTCOGAGGCATGCAACGGGCAAAAACTGAACTXnT 
CAGG CTTCAGATTGTAACTGACGATCTGAGGAAAAATGAG O 1 1 1 GTOTOA 
TTTTGCTAAAATGCATCACCAACAGOGAATGGCTCOC1TAGGGAOGGACA 
AAGAGCTGAGTGATTTACTGGATTTCAGTGCGAT G I TT T C ACCTCCIC1G 
AGCAGTGGGAAAAATGGACCAA CI ICI 1 1G GCAAGTGGACATTTTACTGG 
CTCAAATGT AGAAGACAGAAGT AGCTCAGGGTCCTGGGGG AA TGG AGGAC 
ATCCAAGCCCGTCCAOGA 
> Sequence 524 

AGGTACGCGGGG<rrCTTGAGGAGTGAGACTGCAGGAGATGTX3GGOCC^ 

caaa gaga toga tgagac roTTOCi g agttcatcaagaggaocatcttga 

AAAI POOCATGAATCAACTXSACAACAATOCTGAAOCOCTOOGATTTITTP 
T CroAA AAT CAAC TGCAGACTGTAAATITODGACAGAGAAAGCAATCTCT 

agttcagcactt gatcca tc ro ra i gaggaaaagcgtgcaagtatcagtg 

ATGCTGKXXTGTTAGACATCATTTOTAAOTGCTOOAGTGCAOTAACGCCA 
TCTCAGCTCACCGOGACCrcTGOCTOCTOGATl^^ 

TTAGCCTCCCGAGTAGCTGGGACTATAXSCAGTGCAOCACCATATATOCAA 

TTTCAT . 

> Sequence 525

GCGTTAGKAGCNCTGOGTtCTGTGGOUWLI IL^JUATCKTTOCa<GAGCTTT 
ACXK3T<XX>*0«KXXAGJmCACCCATT^^ 

ccoctooc ogaoo o^gcagaggaagcaoack g i n n IH AOOWOCOCOOO 

GCOCCAC>.CCCCAAAAAaXXX^M5aXX 

N>^GOG>*GGOCGCAA^ACA NCAA CKGGG AACA MCXIXXt^GGGAAACCGOC 
(.1 1 1 1 1 1 1 1 1 1 1 1 1 iGTTjTTTCGCAANKGAGGGJ1GCCK>*GOGCGCC1ACAA 
OAAAGACAACCAAC CC OCaaOGOGAOAMOGGCQIOCAGCOOCAACnTC 
• uTGGGOu ioirit.ii unGGGaCCaCaCA I ClTICCTI CCI GGTGGGCAAC
^^~^^CCTGCG<yrGAGCGAATGGGCACCTCAHT(iCACAGAGAGGTGGCTT 
CTOAGGACOCAOCTTOOCTCTOCAAAOAGTGCATCAT] rCCTTCTTCAAA 
GATXXAGGOACCCTOACCOTTOCT ACCTTTTTGC7G AAG AGATTTATGAC 
(XiGCAAGGTGGAG<XCCTGGGG<XTGGAATGAGCCTCTCCTC 
GCGGCXXGGAATT0CAOGOCCCrrG<KXX>G^ 

TrcOOOCCTGGGTTGGCTr AGGOCCTCCTGGCATTCTGGAGGGG^ AT 
TCT AA T A OCA GCCCTC A TCAA A TTGGGGCT A CA A CCCCA A GGCCCTCTGG 
ATC 

> Sequence 526

GCGATTGGAG<^CTACGOGGTGGGGGTTGAGGGACATGACATGOCAOCAG 
TA A AACTT AA TCTCTTCC I I I 1 1 I I 1 CTCACTCGCTTTTTCATAGATCGA 
CACATGT AAGCAGCATCATGGAGGTAAGTTT7TGACCTTGAGAAAATGTT 
HIGH I CACTOTCCTG AGGACTATTTATAGACAGCTCTAACATG ATAAC 
CCTCACT ATGTGG AG AACATTGACAGAGTAACA i I 1 1 1 1 I I G GGGA AGAA 
GAATCXTrACAGGGTCATGTTCCCTTCTCCTGTGGAGTGGGGGGGGAAGGT 
CTATOGCCXCAGCGATGGCCATATTACTCACCCTCTACAGAGAGGGCAAA 
GCAACTGCCAGTATGGTATTGCAGGATAAAGGCAGGTCGTTACCC^CATT 
ACCTGCAAGGCTTTGATCI MCI I CI CCCATTTCCACATTGGACATCTCT 
GCTG AG O AG AGA AA ATGAACCACTCTTITCCCTTTG ATAATGGOGOTTTA 
TTCTTTAGAC^GAAGAGAGGAGTTATACAGCTCTGGAAACATCCCATTCC 
TGTATGGGGACTGTGTTT0C5CTTTTAGAGGTCCCAAGCCCATAGAGGAGA 
TAAAGGGGAACAGAATTGTATAACTTGATATAATGATCCTAGATAGATGG 
AACTACAAGGGGCTCOAACCCAG AG AGAGOGGGGA l 1 1 PGC1 1 
> Sequence 527

AGGTACTCACAGTCACGC ICCIC1 UAACCATCCTTGGGCTTCATGGGGTT 
GGCATIXMGGATCCCTACOACAGTOOCCTGCTOCGTCTTCCAGAGCGCTT 
TGTGAACTTCTCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGA 

agatcattcagtttccatatcctgaaggtttttccact attcacactctg 

TGGCGTAACCTTCTTCA A TATA A CCCCAAATGTCACCC AA TCTATTTCTT 

CCAGCTTCTCTCTGGCCA ILI 1 1 iuli iu atctgagacagtctgatcagt 

TTT 

> Sequence 528
AACATGGAGNCACCA 

CtrrcTCOCACAACCANGCCN AA CNN AAACTT ATGNAN AAG AGNG A AANAG 
GACCCAAAAAGGACAAAAGGG)^NCANNCANAAAAACAAANNNCCAANAAN 
CCGOCCAANAANANNGCAAAAGNNCCCCCCA 1 Ml I I 1 1 1 1 1 1GTGTGTO 

aaaagggaagaacctaatocaogcttaactatcttaacagggtgggagtg 

CAAOAQATTOATOAGTCCAAATCTGACCAAGATGGTGATGT^ 
GAATTCTCIGGTTOOCACCTTTAAGTGGCCAGCCCTT^ 
GGG AGCAACCCGGCTAGGTACA TCAAACATG 



ATTGTTTCAATCrriTTGAAGCGTTTGGCACT 

AGGTACATTGTATACroCAGTGTOGTCTACATGGCATTGGACAGGACATA 

ATGTAAAACATAAAAGTGGAATTGTTACACTTACATATGATAGTGAATGG 

CAACG TGAOCAATTTl I GO I V I CAAGTTAAAATACCAAAAACTATTACAG 

TOTCI ACTtXiAT TTATGT CTATATGACAAATCTTGATACTGGATtXXIAAC 

ATTACTGGCUIGCI 1 1 1 UGI I IU CGTTTTGAGGG OJI I I lU GTGCTOOC 

TATTAATTACGGCGCTGGI 11 IGG1I IGTGTTTAATAOGCTTATTTATAC 

TATTGGTGTTTACATTOOGOATTACAOAATAC L ' III I C 1 1 A GGGQGATAC 

CGACATTCACTTATTGGTCGAGTTIXCXX^TTCTCA^ 

CACCG 

> Sequence 530

AGGT ACTTGGAAA OCCATriXXi ATTAATTAG AGOTCTPTCT G AAGGAGTT 
GAAGCTTTATIXn"ATGAACCCTTCCAGGGTGCTGTTCAAGGOCCTGAAGA 
A TTTCHD A G A GOGGTTAGTOATTGO AGTG AG AAGCCTCTTTGGACACA C A G 
ltSSJ^ JX ^ < ^^ A0 ^ TATCT ^ AA '^ C C^'^C-l C 1 1 GGG AAA
CGTTTGGCAGCAATTACAATGG ACAAGG AATATCAGCAAAAAAAAAAAAA 
AAAAAAAAAAAAGTACCTGCOCG 
> Sequence 531

>HTACATCNG ACVTTTCAANCNN CCTTTC NG A NCTrTTTTGCCCXXCCCAA 
" ACAGN AA TAAGG ANTKNN AA AAAACAA TTTCCA CCCGACA GTAGNCACC 
TTTACACNG A GG ANAA CGGG AACTTTA TTT AAA GG A TATTGTCTCA TTTC 
TAACACNCNGNAANCCANCCTTCCCTOATAATAAATCACTGGAGAACAAA 
AGCGAATAACAGCACGTCT LIt 1 1 1 I I I A TTCCAATTTCTTACATTTATT 
GCCAATG AAGAATTCAAATGCCAAGGGCCCTGCCT AGAAAGCCACTCTAA 
AGCAA CAAAGAGGTCTGCCAA TTCGCTTAAA AAACAAA CCCCCAA GAG AA 

aaaaaattcaaaaacccttatttaaatcaaacaagcaaactggggccocc 

CT AACCCTTCCTTCAATA TAAAGAGaCCCCGCCCOCGGGNNNCCAAATCA 

AAAAAA^^CACCTTTACAATt^CTt^TACTGAATCACA^ 

? AA yi A 5 I ™ TAA ™CGACACACCACATATIT^ 

ACCACTTMTATAACCTCACTCATAAACACACATCAACACTAGACAGACTA 
CATAATCAACATCCACAACTCATCACAACAAACACTTAAAATGTTCAACA 
^ I^^ A ^^^^ TA ^ C ^ AGC ^ TACTA< ^CTCATAT^ 
CAAATCTCGTAACACTCACTTATACTCTACAACT^ 
CACACAAACACCTCTTATTATCTUIX^TATCAATCAATAAT^ 

A^CT^C^OTCCG™ 



CG AA TGTCA TTG A A AA GGTCTTCTCGCGCGTTGAG AA CTTTCGGTGNNTN 

^^^^S^^^^ A ^ A ^^^^ A '^^ AA '^^^^G^TrCACACNNNNACATCAA 

A A TG TT A TTAACACTCTTA G AAG ACTGGTTTC TTCATTTGACA TTGGGAC 

GTGCACCAATTTTTATTACAAAAATCAAAAAAGTAAAAAT^ 

TTTGC AG A GT A TA ACCA CT A GTTGCCTA G ACAA A A GCTAATTTCT ACAAA 

ATC^AAACTTAATGCAGTTrTATTAAGAGAGTCAAAATTCTrTCAGTTA 

ACTGGATATACATAGTGGTATATATCTTAAAGCAGAAAACCOCAAAAAAC 

AAi^CAAGGAAAAAAGAAAATACATGTCAACAGTt^GTTAAATATTrro 

ACCTGACAGTTTCTACAAATAGTGATTTTCACTACATATAAAGGAATCTG 

T; A E A TGTGGT AA AACTTCCA GAAA CCAAG TA GG AAGTGTGG AATAAAA A 

CAATAAATT CAAAC GCAGCCCCAGGCTTJGCCCTGTTTTT^ 

AGACA CTG A TCTTT A TT ATTAAGG AGGG A CCACTGTG TCCACA A CT A AAA 

AGJTTCAACpACA TGGTG ATCTGCAAAGCTTT ATTTGAAAAAGACAAACA 

TTCTCGGG<XXXKXXGTTTTTTAAAAAAACTTTAAAAT 
> Sequence 533

GGTGTAGGGGCACTAGCXX5GGNNGGTTTrcGAAGNACGATCANNCCCCCA 
GCNGOjCWXXaK^^ 

CTACCCNGG A NCNNGGCCNN ACA TNNNCG ATTGNGNCACNGGGCGCCACC 

NCACGGGAGAAGGNCTWGCCGGKAAGGGNNNNC^CGAAGANCWGCANNNN 

^^^^^^OGATTTT^^ 

CACAGGGACACAAACAA GCTCA CCCAACAAA GCCAACCGCCCCTGCCCGC 

GTACCTCCCCGTTCTT 

> Sequence 534

GCOTrTT^GAACACX^^ 

TCAAAAAGCAGCTGAAAGGAGTC<jTAAGGCTGGACCAA 

1^?S5J2 A H A 5^^ 

GCTTATTATCGOTCGACACACACTGCCAATGAGXXKXXK^^ 

i^JH A i^ GA "^HISIT^** G AAATTAAAGATCA ctatttggattacttca t 

JCTTCCAAGGTTrCCAAAAGTCTCATTCTTAC^^ 

l^^^^ AA ^ A ^^SS^ CMmATA ^ A ^ aAA ^^^ 

AAG A CAGAAG A AGTGGTCCTGAAGAA GCTAGA GTATATTTATGCAAAACA 
GCAAGCACTAGAGGC



^CTTTCAGGCAACTOICGCXX>WG^ 

OCNGGGCAaAGGGAAGKAACAGACACACGTTTGNGNGGAAGGATGTAACC 

OOGGACCAOAGGO<CAG74GONGGGAGAGANCOCKK3CATTAC^ 

AGAAC»>K3GCCCGCCAGAGGCNNGAACNGAGAGAJ^GAN1^GG^ 

CNA>^AAAANANAOACA>WC>K^C>NAAGCCTTGT>^ 1 1 1 III I IN CC 

GGCGTGACCCNCCACXXX^GAAACANNNCACAANAGGCNGCCGGNNCAAA 

CGGGGGGGAGCACGGACTGTCAGNNC>ICNGGGAAGGGGNCAGCGOVNCCG 

GC*GGGO04OICCNCCttXX^^ 

OCCCCAGGGACGGCCAGGCNG^WCCAGOCAGGAAGGCCAAAANCAAGAGG 

9^ ga J^ nagaaato >^ aaa aaaagaaaaaggggagw 

^'I'^'^^^C^ACAANNGGANGAhfNGGCANAAAGGGWnvtAGCANGNCCCN 

co^nncokzacccccc^^ 

GAANGGGGAGGGCCCAACCCACAGGCGGAGAACCCGGCACCCCCAAGCAN 

NCAAGAAAAAGGCGCCCCCAAAAACAAACAACCCCCCCAAGGG 
> Sequence 536



GGCTTTGAGGCACTCtXXXXJNGGCCCTCGNCCNG 

G>MCAGCO<GAGACrrNGCTGGNAGACrCTTTTAhK^NC^ 

TCC^TCCATCNGCTX^TCCTTIXnC^ 

TCAATtXTTG TGTCAG ATACTGGGGCTGCCCTCAAGGAGCTTTTATAGAG 
TTCA GGGN A CCTTTTTOOC I I 1 1 1 111 



GGCTTTGNGC>IACTC0GCGC>WK3CtXrroGCAGTAN><ATCGNGGCOC 



GCGTTTTGGGGCACACOCCGGNGGOGTTCNGANGTAOGATCNGCCCCCGC 
^ A ^ A ^^ A ^*^^^ A ^^CAGACCtXTTGCGGCNGCCTC^TTAC^CCGWiCA 
AGCTNGAGCGGOGGGACAGGNCGG<X3GGTlTTCGAAACACTGGACTGGAT 
GCCACATGAIT^GAACTCCCCTCCGTT^^ 

A CTCATTCT AA TCA GCG ATCA CTGTTTT AA I II 1 1 11 1 11 N CCTATTCAC 
TATNNCACAGATCAGGCCTACCTCATTGGCATATTAAGAAAGTTGTCTCA 
AGTATATTTAGTGTTTATCATTTTACTATAGTTCTTCAAATGACTGACAT 
TCATC TTTTCCCT ACCTCTAAATT Cl 1 111! I 1 I it-ACATTATCTTTCTT 
GAT1GC 1 1 1 1 1 AAT AG AAAAA CAN ACAAA G ACA TGG A TTT A CTG TGCAT A 
TTAGCAGATCCATACTGGAAAATGCATGGAGGTTTCATATACACCACTTA 

cagaaagaataactcagagtataaagtogaaaagaaagaatctgaaatat 

TAGACHGI I C 1GGAATAAGCGTACCTAGG ATGATACCACTTCACTTAAT 

CAGAI 1 1 cccctttccactatttaacagggcaatataaaaaactggtagt 
taaatacacaagaggcacttatattactggctcctcaaccca 

> Sequence 539

OCGGGCAGGTACI 1 IC1 1 1 1TT ATA GI I rTTTTCTTTI IG I G ATTTTTTT 
Jill IOOI II I IG I 1GI 1 1 IGI G 1 1 II 1 1 HCCI 1 1 II II 1 1 IGGTTCTT 

agaaaatctgagacacgt^^ 

CCTGGCTGOCTGGTGGTTGA 1 1 1 1 HI 1 GA 

GTTTATTCTQATTGA 1 j 1111 1 Id IOOI 1 1 C I GGATAAACCACCCTCTG 
GGGACAGGATAATAAAACATGTAATATTTTTAAGAAGGAAAAAAAAAAAA 
AAAAAAAAAGGGOOOOGGGOC 
> Sequence 540

CCGGGOIGGTACTTTW 

^5^ A £T rG J AAICI 1 1 1 "^-'GGTGGAGGGI 1 1 I O I CKlr\ATTTTO0T 
OGCTGCTGACTGATCAOCGTGGTGGTTGOrGAAGGTTGGAG^ 
CAAiMti i AAAATAAGACAACAGGCTGGGTATATTGCCTCATACCTGTA 
AATC CCAGCACTTTGGGAGGCTGAGGTGGGAGAA 1 IT 1 1 Ili AGfitTAflfiA 
<nTTAAGACCG<MXTG>IGCAACATCrGTGAGACXX^^ 
^J^ A ^ AAA ^ AA ^ NANAAAAAA *^^

CCGJTCTAA A ACTA GGG A TCCGCCGGCTGG AGG A TTTAATA TCAGCCTATT 
CCCCGCGGCCCCTGGGGGGGGGCCC^^ 

TT^ScSSS^^ 
> Sequence 541

GGACGGTCAGAA CCGATACCACCGCGGCGGCGGCCTG ATGTA t 1 1 MIM 

MilLLi ' ' ,G ^ A ^GACACAAGTAGTCATATATX^CATCTGTTT 
AACTCGTGACC GI1ICI I J I f TTCAA Cl 1CI I 1 11 ICI 1 1 ICAOTGCTTT 
CpCTTCCApACXrrrTTCCTGATT^^ 
CTATCTTCTGGTAGCCACAGCrcAOT 

JCTG^CG^GG^ 01 ^^ 



TOCTCACTTTCCACCC^ 
CTTmT^TAATGC^TGGGGTGGA 

' i^ ATOA ^ OT ATTGTATCACCTTGGATACTTGAAGTTTA 
ATGCTC AGTTTXKTrCGGGTGGCATTTG AC7TGGAGGCTGGCATGTTCACC 

TGAATCACAGTCTTA AGTGG CAACTCACXjCCCAGCTC^ 
CCTGGACATGTCCTATTrrrATTCACTTATATGTGAT^ 

TX^SIII^ TA ^ MAA ^ TA7TX ^ A ^ AACG < J ^ 

atatatccctt 

> Sequence 543

^ a ^^^T^ cgcaaaacaaa " g ^ taa1 ^ ataa g 

AAAGAGGTAAGAGCAACGTGCTTrGCGAG<lAGAGA^ 

CATCTTGCCTGGATGAGCCAGGGGACACAGAA 

TTTAA TCTTT ACAACTCTCITGCAAGGTTXXXrrGGTTGTGAAAA 

^™ A ^^ TCAAGG ^ CTA ^' rCC T^^ 

CTCGGCTGGACGGTTTCAACAGAGACKXnTATT^ 

ATGAGGCTTCTC^A^ 

CXJL I UCCTA TGCCC AGTG TGCCCCTTCOGCTGTCAA TGCC ATCTTT AAAT 
GGTCCAATGTTCTCATrTGGGTCTGGA<^AAGTGCCAAT 



GAGA GGOTTOXX3GG|rGGCGG<X^ 

A T A CTGCTTCAAGGTATTTAA TCTAAAA TTTT ACCAA CTTTGA TTTGTtrr 
GGTTAGGATAI HIGH I IAOTGGATATCCTTTAATTCGGATCAATTACT 

^5^^.^ AA ^ TCAAATC7TG ^ 
^TIJ A ^ A ^ T I A 5 coc ^ 

AG E AAOAA £ AGT ^l G *V l 1 ,oi » GCTA TA TCCCCTCCAACACCTAAGGC 

AAAATT AGAA TCTAAAA GACOGAAGGCA TCTAAAG AA CAGTTCCCATCAT 
GCCACAGCTGAGAAATTGGAGACC ^^^nu^TCAT 
> Sequence 545

A ... u 1 1 ATACCGCGTCAOCaATTTDCAACACAAAjCTATACCGJUIGOCOG 

A^^^ A ^ G ^ AAAAA ^ CT ^^ 

AGGTGGAGGCTTAAACTTCAACATTTAAATTTTGCGXnTIGGCGGOCTTC 
ACATOCtXXiCCCCTTTTTCCCAOT^^ 

GGCACAGACTTGGCAATTTAAATTGGAAATACCGGGCOCAAAA^ 
CPGOGG^AAGAAGGGOOTO 

TTCGGCTTCGGTTTACCCGCTTGCCGGGC^ 
CTTCG



gcocgggcaggtacctgatgca GGGAA Tl 
AACOCAATAOGAT000C AT C I GCOOCATTAATOOCAGCTTGTOCAAOTGT 
AATTATTAACAG ITJPOOCCTrTCACICTCCAAAGAGTCCCTOTCCAQACA 
GGTAATTCTGAAACTCCCCTTCAAAATGACTCCOOCGTAAGGAAACTGGA 
CTtlAGGGAAGCAGOCTAGGTGGAGGTTrrGAAAGGGAGAAGGGCCTCAT^^ 
CAGGGTWXHTX^CCTGCACCAGCX^ 

TCTTGCOCAGGCTGAOTATCAAGAGAAGCAAGAAATCTAGATAAAAATCC 
AAAlCCAGAAACATCAGCGTTTTGAGGTTAACATTnTCGCAATTATTCAG 

ctttatqaaataaatatta i l i rrcrrn iciacccgcttgggagoctgo 

CAAAATATGCCCGGGACCCCTOC C I rCTTTO 
> Sequence 547

AAACAAGTTCACATAA TCATCAATTACAACTA ATAA TTTTACAATCTCAT 

cttactatttatataaatattoctaatctctataatttttattatattat 
atattcttatattaatttgaggccccgggcgccgagtcaggt aagccctg 

GCTtXXTOCAOCX^CTCXCACXK5AGACCAAAAGCCTTC^ 
TTGGGGGACAAAAAAGGGGGAAGGGGGGGCAOGAAGGCTCATCATTCAAA 
ATAAAACAAAATAAAAAAGTTATTAAGGGCGAAGAATAAAAAAAAATTTT 
QOCATTTACAT AATT TrACACXXIAAAAGCAATGGCITATCACCCI ilXt X ' 
TTGGTGTGGCACTTTGGAGATCAGGGACOCTGGGCCAATTNTNCTCCTTT 
AGAAGAGGAAAGTTGGGGGTGGGC7 1 ICU AGTGAATGNGGCAAGGGGAG 
CI I ncocTGi 1 1 AACAAACCGOCATTOCTCAATATTTTTGGGAAATGAAC 
CCTATTAAANNAAAAACACACAAAATGTGGCAAATCCTAAAGGTCCCTTC 
CGGCGCACCA T T TG II GAAAACCTTTTGTGGGGGNAATTGTXTrTCGCTCT 
CAAACCCGAACI HOOCTG1 1 CAACTCATTCCACCGTTTTCCCAAGTTTTT 

taaaaattooctggaggtocaaagcgocaaaaaaaaaaaaaaaaaoccaa 
aaccaaaaaacaaaaaaaaaaaccacattaaaagg 

oggggttgtcggaaocccaatttatagctattcagtcagaaaaaaacagg 
traoacaatc tgggg ctt goga ctggcattggaagtgggggaca g tt g i g 
cggggctcagccttcaaoctgtgggatctqaooctatcrctgggtaoato 
aa gtagaattg aacigggggacaoccagctggtgtocactgcagaatgaa 
i iuc i igclto atgtctaogoagcccgcagaattatagcaggaggfgaaa 

AGCAll^ l\J 1 1 ATTAGCAGTGGCAAGAGAAAATG AGAAGG AGCAAAAGCTG 
AAACTCCTGATAAACCAATCAGATCTCATGAGGCTCATTAACTATAACAA 
GAATAGCATGGOAAAGACTOGC 
> Sequence 549

AC 1 1 GATAGCOOOGTGOOTG<X^GTOAAOCTCTCAGCOOOOGTATGOOGA 
CCTGAACCTCACA TGOG TCTAACGTCTA TTGCATTCATGTCTGGTGAAA G 
AATtrrcAATX^TGAATGOGGTACCTAACAGACAOCTATCCTOGCTOGCGA . 
AAAOAAAAGAATGOGCTGC FC I CAGAOOGTAGACOCTAAAAGGAOCroOQ 
OTCTOTOO OCCCI G O ICCC I TOOCCACAOOGOOOAOCAACAACTACTOOAA 
CXXXXTTGGCTGT ATGAA T ACGAT A TCCA TCTT A TCAATCOCAATAACOCA 
CATGGGGGGCCrGOCOOOCATGACl POT 1 GOCTTTAGACAOGGTTACTGG 
CTXXXrTTOOCAAAGGCATGGOCATAACTGGG 1 GC I U I GC1 GAAAACACAT 
OOGOGTCCAATTTOOCACAOOGTACTAAOCXaAGAGCATATAGGGTGAACA 
OOPGOG'roOCTAAOGCATGAOCTOAAOCACACTAATTXKIATCATACTTAC 
TOOOCOCTCTOCAGTOTGAAAAOCTOTOCTGOCAGAOOGATXjCATOCAGC 
G 

> Sequence 550

Aw ITjGTI AOOGCOGTOOO I GCGOGAGGACTACAOOATGATCGGTOATro 
T0CTCATGG<n"ACOCAGCTGCACOCATCAACTAOGC0GAGAGACTCTTTT 
AGOC I U I GAGGGACTCAACCGTTATACTGAATGGAGAGCGOGAOCACATA 
CTXXXHXXlAAAGTATArTGCGGACACrrCCGGCCCT^^
GGAGAACCTAOGCACTGCACGCCATGCC rCTTTOCl ACTCA AGCCTCAAG 
ACTTCTAOCTTGATC1 G C 1 IG CC I I CC 1 1 G AOCATCTAOCTAGAACTAAC 

cgagtdocaoctoocaaoctggcati 

CCTGCCTGAAOCAT 

CCTTATTTAATCCCGCCTGGGAACTTAA^TTAT" 
CCATTTTTCAAAAAAAAAAACCTCCCCC 
> Sequence 551

GTGATGACGAGOGOGOGGOGGOOGAGGTACATTAGCAAAAACAGTGGACT 
TTGTCACCTTGAAAAAGTCATTTAACATCTCTGAACCCTACTTTCTAAGT 
CTCTACAAGTAATATATAGTGGGTGAG OIG ! I CI ITCTTTffl 1C TGTTAC 
TCGG A TGTGA AACT C T CC 1 1 f I GT AGA TG A AACCATTGCGTA AGTAA TAT 
A A AGACT TTTCCCTGTAGTTATCTTAiCAC ACTGGaG AG AGTGCTAGTG AA 
TGCTTT7GTCTTCAATGCCCATCTCTTGGAAATATTG AA GGTGG AGTAGC 
A A OCGGGCATTA TATTA 1 CI C1 1 TJGAA AAGG ACCTCAGCAATGGAGAATA 
TCCCGA TCA TCACAACTGTCATCACTCTGOCGCACGTGATTGTGG AG AAT 
ATXCCTCTCCATGTGAATGCAGAATGAGATTCATTTACAAAACGAAGCCA 
TTAGGGGGGAGC Ci 1 HUH I A AACCAAGAGAAGTGGGGGCA 1 C 1 1 ICI 
CTGGAAGATCTGGCCTCATTTGGGCOGTGTAAAAAAATCC 
> Sequence 552

TTC TG TGCATT A ATTTTA \ riTTGTT A TTGTGTI I AAATCGAAGATATTT 
TTTTT AAA TT ACGTTCGTT AGTT A TGTAAT A T ATGGT AGTTOCGTGGTTT 
ATTAI 1 1 I 1 1 1 1 AGAGATGAGACGAGTGGCCGGCCGCCCGGGCAGGTACT 

acaatgaticrgaagcacactgtat/tcagacagatacagtgaaccaagtg 
caatatgtaaggatgaa agaag aagagatgacaaagaaatccaagtaaat 
gcci itnctttgcaaatg 1 1 ittatattaaatcataagggaagggaacta 
ctgccttaaatgttatcaaaagagtittctaacaaggttaa 
tcttaacaj i 1 1 1 1 1 i cn 1 a tgtcta g i g i t n c atoctaccttggtag 

G AAACTTATTTACAAA CCA T ATT AAAAGGCT AATTT AAAT AT AAAT AA T A 
TAAAG roCTCI UAATAAAGCAGAAATATATrACAGTTCATTCCACAG AAA 
GGCA TTCCAAACCACCCAAA TG ACCAAGGCAT AT A T A GT A TTTGG AGG AA 
TCAGGGGTTTGGAAGGAGT ACGGAGGA AGAATG AAGGAAAATGCAACCA G 
CATCUTTATAGGGGGGTTCATTTTAATAAAAGTTOAAGGCACAOG 
> Sequence 553

GAG ATGA CCCGGGTOGCCOCCGAGGTACCCATCTCTCCOCaTCACCGCTG 
G AA TTTTG A TGACCT ATTGGAAAA G A TCTGGG ACTA TCTG AAACTA GTG A 
GAATTTACACCAAACCX1AAAGG<XAGTTACCXGATTAGACATC^ 
GTGCTTTXrTTACTTCGAGCGGCXXXXXXXK^ 

CTAGCTCAAGCACTGACGTTATTCTACAGGACTATGAACCTTCATATCXIA 
CATTTACAOTCCGGACAGATAAAGQAAAACAACCCAAATCGAGGAGGCAA 

tataaaaggaagagaacaaaa cacacattcatacact cacacttaaaaat 
aaatcocatgtggtaoct 

> Sequence 554

GAGATGtXXXKXTIGGCGGCCGAGGTACTCTTGAGATTOCT^ 
TATTGAAACAACAATACATTTTCCACTGTAGTAATCGKiAGCACT AACTCT 
TACAACAGTT AGTGAA I CG 1 1 1 1 A AAOAATCAGTTCAGTGTAGACATTTT 
G AAAAG A I TG I U CC » G I GC 1 C I ACAATAGCnTAGTGCAATGTGCACTTC 
TOTTTTACTTGCCAI 1 1 r O CTOCICTQT I 1 I C I C I U I GACATG AAGCAAC 
AG AA ACTOAGATCAAAGTTAAGATTATAT CC fOI 1 1 GT AGTATCAOATAT 
1 1 1 ICTOTC1 ACATTTACATTCAAGTTGATAACACTGG I G O 1 1 ICATTTC 
AATACAAATTAT OCTAG AGAACTOACATTTTCAOACATGGTCATATATAT 

GCTATTTGAAI IUCI I 1 atcttgatacagatcttgattgtgaa tctctg a 

TGATAGATGTGCAGCTAATTrGTCCCGAAACTCATGAAGAT 
> Sequence 555
TGAGAGATCCGGGTGGCGGCCGGCOGG^CAGGTACAAGACCATGACACGC
CCAAAACA C I IOC I C CAGATGTTGTCGTTGGAAAACTQTCGTCTTACAGA 
AGCCAGTTGCAAGGACCI IOC IOC 10 »C I IOG1 ITJTCAGCAAOAAOCTOA 

CACACC IUIUC1 1 ggccaaaaaccoca ttgoggatacagogctgaagttt 

CTXrrOTGAOOOCTTGAGTTACCCTOATTGTAAACIG^ 

acagcaatgcagcataaccaagcttggctgtagatatctctcagaggcgc 
tocaagaagoctgcagcctcacaaacctggacttgagtatcaaccagata 

GCTCXiTGGATTGTGGATTCTCTGTCAGGCATTAGAGAA 

cctaaaacacctacogttgaagaocta tgaaactaa tttggaaatcaaaa 

ACTTTTOANGAAQTGAAAGAAAAGAA TCCCAAOCTGACT 
> Sequence 556

GAGACTCCCXXKXJTCGCGGCCGAGGTACGCGG 

AGCTGCAGCAAATCTCAAAATAAAGAGGCAACGGC C 1 1 ICIC I ICCTCTC 
CATCTCTCTATAOCACACCTTTTA 1 1 I C I 1 1 1 CMC 1 1 1 11 1 1 AAGCCTC 
ACGAAAGATTTTACITGTAGATCAACrTTCAAAATGTAGGAAGTCAGAAT 
GGGTGACATCATCAGAAAAATATGTGGAGCTCATCACAAGAAGTGAAGAA 
CCCaGAGCAOjAAAGCGGTTGTGACTCCTGGGC^^ 

tctgggcttcagaggagccagcgcctocga g i rorci I G AAGTGAGGCTC 
tgctctagicci gi IL1 1 c io gctctaagatctgaatgttgtgaccacta 

ATTTGCTCI I 1 LC 1 OGAGGGTAACCCCACi 1 1 IGCtCCACAAOGCI 1UCIG 

cccaatcttttocaacagttgaacx^c^tctca^ 

> Sequence 557

tgagatgctcogcutggcggccgaggtactggatgtcaggtctgcgaaac 

ttcttagattttgaccrcagtccataaaccacactatcaocroggccatc 

atatgtgtctactgtggggaca^crrogagtgaaaacttcggttg 

ggtcogtgggaaaatcagtgaccagttcatcagattcatcagaatggtga 

gactcatcagactcgtgagaatcatcagtgtcatctaca 

> Sequence 558

ggga tg tgtctccacogcggtggcggodgaggta ci 1 1 1 

TT G I I 



II II I I II I 

TTOAGACGGA GTCTO 0CTCTG1 IGCtXAGTCTGGAGTG 



CAGTGGCATGATCTrGGCrCACTGCAACCTCCATI 
ATTCTCC 1 GACTCAGCCTCCCAAGTAGCTGGGATTACAGGTGCCTGCCAC 
CATGTCCGGCTAA 1 1 1 1 1 G I A T1TTTAGTAAAG ACGGGGTTTCAOCATAT 
7GGrCAGGCTGCTCTCGAAATCCTGA(XTCGTAA 

cccgcctacctgcccg 
oc tcccttt ^ ^ 

LCI 1 1 ICTOODCTG1 iu cagctgggagcggaogaagogogaagctgogat 

TTTTTACTG ICTCC1 GAAGAATTTAACACAAACATCGATATCAGACCAAA 

tcatacaatttatatcaacaatatgaatgajcaaaattaaaaaggaagaat 

T GAAGACATCC CTATaT G CCC I t* 1 1 1 I CICA ATTTGGTCA TGTGGTG GAC 

ataatttaagggaacggggctattccacx^aaaggocttggagg 

T ACAAGGGATTTCCCA TTTTT AJ 



GCGATGTCv ^CTCOOCGCXXrrGGCGGCCGAGGT A CT 1 1 1 1 1 Ti l 1 1 1 I'm . 
1 1 1 1 1 1 1 1 1 UATOGGCAAGCGAOGCTT agacaggcgtagocccgggagga 
ACCCGGGC^OCGCAAGTGCGTTCXj AAGTGTCGA TGATCA A T G T G TCCI OC A 
A TTCACA TT AA TTCTCGCAGCTAJ 
GAGTGATOCAOCOCTAi 

TAGCTACTTTAOGCTGIC I G I ACATTKTGTCCTATACATGACTACTGTCA 
TAATACTTTTGACAC I IOCIG1CICI AGTTTCTAATATTTATATTATAAC 



ATCACATTGATCTATAA 1 1 1 rorCI 1 1 1 ATTTTANANANATATTTGCGAT 
G<K^CCCCGGGTTjGCGGGCGAGGTACX>TGTTjGa 

tgttgcctttcaagatatacccaaaticccagttccagoccgto^ 
aaactccgctoocotoaaaoatoaoo i cc 1 1 agcccagcagctgcaaoga 
ctcgc0ctc0ctcaaaggqatgccao cci 1 1 1 atttagagatgaagttgc 

rTCTTTGT I A TTTG ACCCTAAQOAAGCOGCCACAATTGACAGGGACACCG 
tcttixkx:attggtgagcca ici 1 1 I AACTTAGAAAAGCTCTTGGAAGCG 
T l IGI ITTC I G GATGTTA C T G T I l'l I T ! I ICCC C CC I G I 1 1 f C I CnC I G 
TACCCGTGCICT lOCI IAACAGI I ICTGCATOTTOATOTATATTTTCAAG 
OG AAA GAGA TCATTAACACCA 1 OTOCT1 G G I GC I TGAAATGTTTATTAAT 
TTTG AGCCGCCGGCCG I C I GGAAOCTGGGGGCOCACTOGC 
> Sequence 562

AGAAACATTCTCAAGCAAATAGGCCXI AGTCAAAATGG CCCATTGATACCG 

tgcctcccx3ctgaacgtagtcaatcaactaaaaaattcc<iaccgcctc 
agottcagtagggtgtgccgaaaa^ 

cciic1 iccci 1g11 ictxiaacccci locccxxxnttacccggaataacc 

TCTTCATTAGCCTCACG 
> Sequence 563



GCNN AGCCCOGGGNNGA TOCHATCT AGKTTNCTN AGN A GNCNN GGCCGNN 

CCCGGCKXX^G>^AACCNGCGGG>04GGGGXXXXKKsAA^ 

AG AAAAAAAGGG 1 1 1 ICI I AAAGTTGGGGCTTTGGCACCGGGTAATTTOC 

CCOCCA AAAGC A G AC ACCGGGGG<X:CX:Cr^XXX: A A A ACKXir^GGGOGGOC 

GGGGGAAACOCTCCCAAATTTTGCGOCCOCTAATAGaGGGGGGGCGGTAT 

TTAACCCGGCCGCTTAATGGGGCCCOGGG G I 1 1 1 1 AAA AAOGGTGGGAAC 

TGGGAAAAAA AOCTG GGGGOGTfOCOCAAATTAAAAAGGCCTlTCGOAAG 

AAATACCCC1 CTTTTI GCCGGGGTCGGGGGAAATAAAAAAAAAGGGGOCC 

CtXACAAAAGGCGCTTTTTACAAAAAATTTGGCXXXC^ 

G AAGGGGGGGCCCCCJCC I 1 1 1 1 1 OGGCGGAA T A T AAAAAGGGCGGGGGGG 
GGGGGGTGGG G I I I Tl C CCCCAACOGGGAGGGCGCGTTATA Ti l 1T1G TO 
OGGGGOOCITATTAOOGAO OCTT T Tr KT>WNW CTtjl 1 1 IT T TC CCCCTCT 
AGGGAGC 

> Sequence 564

aggtaccaagtaggataattactactgocaacacacacatgcacgcatgc 

ACACACACACACAGATGTATI 
CTAAAAG 

NGGATTGGAGAATCOGCGGKGGCXXriTGGNNGCA^ 
CCNCCGCACTAAGCANCGMCACCCTCUANAGAK^ 

CCCNGMAGAAAOQilCNGCGCCANGAGTTTCAAGTiGGAGGAAGAAGOGACT 
GCGCAAGCKGAAGOGCAAAAGAAG AAAGA>IGAC<XA G aGG><CC AAGJ<AAA 

CAGACGC I CC 1 1 ATGAAAGTACCAAGAAGTGGGAAGCGGGGTGAGCTGCT 
GAAGA Mi ll G GTATTXiACAGGGATGCCATTGCACAAGCTGTGAGGGGCC 
TCATCAOCAAGGCCTaGGGOGGGTATGAAGTGTGGGGCGGGGGTCTATAC 

A TT OCT gagattctgggaaaggggctcaaaga tot - ' 

GTCTCTGCCaGGGGAAGATTCCTTGGaGGA ggccctgcagcgaca tggag 

GGAGCTGCTTTXXTrGAGAGTCTCTGTtXTCTGCA TXTTGG ATGAGTGCACT 
1 1 ICC 1 1 IGIGIGG GAGTGAGGGCaGaGGAAGCITjGaGCGAGGGTGCAAC 

aaaacgttccaagtgggacagatactggagatcctcaaagtaagcocctc 
GGA TGCCA TTAAGTATTTCAAGG AAAAAGTGAGCACACAGAATCTGCTAC

iCCTGCTGACTG 

> Sequence 567

TCX;ATTGGGCCCTNCGCGGNGG«Xrr^ 

CANN^mAAGGNOGAGGG>l^^40CCTX}GA>IGA^^^GGTTAKTCGGGOOOOOOC 
CGGGCNCNGCAGGOOGNCAfWANCXnTGAliJGCNCOCGCGOGCGCNGCCCC 
TCAAAACCCCGNACCNGGCCGGGCGGCTGCNCNAGAACNAGWGGANCCCC 
COGGCGG CAGOA ANNC0AGAGCAA U I I 1 1 ICH II I IOO I 1 I IC CCGAOO 
OGGGOCCCI 1 1 ' 1 CAAAAAAAAATCTCCCCCCAGGG AGGGGGOAGGGCGCG 
^^^^^^^^GGGCACAGOOGGNCOCCGGGGGAAANNGGGAAOCGC 

gggggccaacgagggagocaaoccocaccaaggg 

> Sequence 568

gcgattcga^ 

CATCi i k. i tCCGCTGCTG AAATTTNCTTGGCGGGCGCTTTA ACCCG AGGA 
CCCCTCCCCCCGCGTACGCTGGATAGCC I I I II I CCAGAAAG AGAGAGTA 
GCGCGAC^CAGCTAAGGCCACCGAGCGACWCATCrCGGCCCGAATGCrc 

tca gcttcagg aa txxxxxiCGT ACxrrGcxx mrcm h t 

> Sequence 569

OCGCTTTOGAGCNACTOCCGCGChfJKXKX^ 

CCGhOWAANCCAAGGG^AGC^NAhtCAAGCTCCTXJNGATNNACTAATAC 
ACAAACCCCAGACAGCAGNAAGGNCAGAAGAACCTTGGAGAACAOCAGAA 
GCAACACCXKIAGAACNCNGAAGGCNGAGAACACAAGNCAAANACANNNAA 
CNNAAAAACAAOGCNGAGAGAACACNGGGAAAAA I I III! I H 1 I AGATG 
TCCA CAAAAAA GGACATGTAAAGGGG AAGGTCAAGTTGTTGAG ACAGCTA 
CTTTATTCTTGGGATGACTGNGGAGGTGGTGGAGATGAGCCTTGTTTGCC 
^^ A ^^-GGTTCGTAGTrcACGAGTCGTTGACCCACAAGGTACCTGO0CX3 
> Sequence 570

GAAAAAGGGAGCCAGGCCCATTTCCCAGCCGATTAANCCGNGGGGGGGAA 
CGGGGKhWAAAACCG<KKiAAAATTTAAAOCCAAGAGGGG 
AGGCOCANGG<M3OCGGGAAACCCAACCCCAGGGGGGAAAAAACO0GGC0C 
CCCCGAAAAAACCCCXXXXXXXXXTTTTTAATTTTTTGGGCK^ 

corcoooccccoccci rrm iwkxkkxkxjaaaaaaaaaocccaagggg 

GGGCCCCCCGC 



GGACCTCTCATGGTTACAGGCTTTGACAACCCAGAATCAAACTGGAGAAC 
ATTOCGAAOC^GTTCITATAAGTGTCICCATCICTACCTGXSGCTGAAATG 
GAATGTGCAAATGTAGCCCAG<^rrGGTXXTTGGGTGTTGCCAGTTGATTG 
A7XJ ACTGGGAGCCAAAGTGGCA 1 1 1 ICI I IGACCTAAACGGGCGATGATG 
AAATAAATCG AGCGGCCGCCCGGGCAGGTA CATCTGTGAATGTGAATGCC 
AAAC OGAAGGCATOOClGAAAGTCCCAAGTGTCATGAAGGAAATSGfi AfiA 
TTTCAGTGTGC«JCGTGCAGCTGCAATGAAGGGCCT^^ 
TGAATGCAGCACAGATGAAGTTAACAG 
> Sequence 572 

GOOGNGNNNCIICACCGCGGNCO^^ 

CGCCAOGGAAAAGNNGGCOCCOGAGOCAGAGCTTTTGCAGOOCNNGNGAG 

GGOGNGGOCCGAGGCAANGGAAAGKGGGANGNAAAAOGAAGNACAGGAGC 

AGANNNGAAGAAfWACAAAGNGAANNGGNGClTTTXIAGTTTTTTAGAGAG 

AATTAAAOGAAGACTCATOCAA C I IGI 1 J 1L1GGCAATGAAAGCAGCAAA 
TTAGAAAATGAGTOCAAACTATTGTCATTAAACACTOATAAAACTTTATG 
> Sequence 573

goocagaaaaaaaaaaaaggcgctggggccaccccaganagaactcaggg 

GACAACCACGCGGCGGCGGCCGCCCG<XJCAGGAACANAGCCCrrCAGGGGG 
GNCGGAAACCCCCCANAGGACAGGACANAAAGGAAAACACAAAAGCGCAA 
GCCGGAGACACACACAGGACAGCGAAGGGCAACGAGACCCAACGOCGGAC 
ACAAGCCAAAACaCCAAAAACGAG 



CCGCAAAGGAGCGCCGCCGCGCGCGGCCGAAGAAC __ 
GCCGGCOGCGACACACACAGCAAAAACAACAOCGGCAOGCACCAAGGGGG 
AGAAACAGCCGCCCCCGCCGAGACGGGGGCGCCCCGCACACCAAAACAOC 

aagacag 

> Sequence 574



> Sequence 575 
> Sequence 576



ngcgattggagctccccgcgctggcggccgagctacgcggggtaggagcc 

tctctccctactgctgctacacaagaccctgagactcacctgcaggacga 

aaccatgaagagcctgatccttcttgccatcc 

> Sequence 577 

CCGCXK^AGGTACAGAGACCTCCTTACTTACCCCCC I ICICCI rcGOcroo 
A GCTCGGCG AGCG AG AGGCGGCGCTGGCGTTGG AG AGOG ACGGOGGCCCC 
CGCGTAAGCAGTGGTAACAAOGCAGAGTAACGCGGGAATGAAGAATCTTA 
GGCGGGTGCA CCCA GTTTCCA CC ATG A TT AA GGGTCTTTACGG AA TAAA G 
GATOATGTCTTCCTTAGTCTTCCTTIX^T^ 

AG ACCTTGTG A AGG TG AC TCTG ACTTCTGA GG AA GAOGCCCGTTTG AAGA 
AGAGTGCAGATACACTTTCGGGGATCCAAAAGOAGCTGCAATTTTAAAGT 
CTTCTGATGTCATATCATTTCACTGTCTAGGCTACAAOC 
> Sequence 578

GCGATTGGAGCTCCACGCGGTGGCOCGCCCGGCK^GGTACCTC^ 
GTTCAGTCAGTAGCAGAAGGATCI IClllCI 1U1 I CCTGATG ATTTCAAG 
GTCCTCAC^GTCCTCA TAATC TGGTTCTTCOCGAAACTtXXlAAATATCTA 
TGGAGAGCTGTTCT AGCTTTTGCACAGGGAACCAGTGGACAGAGGTATCA 
TTAAACATGTCXl\TGTATTCCGAAGTCTGAGGAAACTCAAG<rrCCTCCAG 
TCvi 1 1 1 AAAATCTTTGC AA TG T AGGG A T AA TTTTTCTGCAG AATCCTTG 
CCAACAACCTCTCCTCAAGTCCTTTGAAACTG^ 

TTAGAAAGGGCATCTACTGACCAGTTACTOCATAAAAGATTGTTGTACCT 

CGGCCGCTCTAGAN 

> Sequence 579 

NGGAATTGG AGCTT ACTGCGGTGGOGGCCGAGGTACTTTGGACAGTGAGG 

GTTCGATTCATTTTAGGGGTAGGGTTGGGGGTGGGAGTGGGACrrGTGGGT 

TGGCAGGAGGAAGAATGAGTCTACTTTGGAGACAATTAAGTCATGGTACT 

m 1 1 H 1 1 1 1 II I IU 1 1 1 1 1'lTITOGCTACATAGAC A T C I T l C IGATO 

TATT GTTACTAGAACAACTTGTATAOGGTTTTATGGTTTGGGGAAAACAT 

T7TTAAAAAATGG ACTTAT CTCTATTATACAGAGTTATAATATAAAAATG 

ATTTAAAG GCTAT ATTTTTCAGCATGTAGKn'AGCTACACTGTAATCCTGT 

TGAAGAAACITTCCTATTTAAGCTTATAGGATGAAAATATATAATTAAAG 

TCTTCTGATCATAGCTT 

> Sequence 580

AGGTACCATt^AAATGCTTCCCrGGTCTTC^ 

TCTGAGTGGCCTTTTCTGCACCCTOCCC I ICI I ICICI I IG AATGGAATT 

AAACCCAATTTGGAAACAACATTG^CCXIAGTCAAAAGCTTCT AATGGTTT 

CI 1 I I ICTTCCTCCAGTTTTAGTTTGCTTTTATTAAAAAAAGAAAATAGT 

GCATGGCCATAGCTCCTTCAGTrCTCTTATTGCAGACTAACCATCAGG 

GGTATCAAAGCACAAATACTTTGGAGGGGAATCCGTTGAACTGGGOCAAG 

TACCTGCCCG 

> Sequence 581

CACTCGGCACTCTCGGT 1 L I C I UCTATTTTAATTGTATTTTGTATAATAA 
CAATACGTATTTTACTACA 1 1 CL1 1 1 AATCTACATAGATATCATATACTT

ATTTATTCATTAAmTATATTATGGTTTAGTAGTGAGCTC 

> Sequence 582

GTTTTTAGAGATGAGCTCACCCCGGTGGCGGCCGAGGTACCAAATTGTAA 

aata ctogaa ggccttcaggaacctgtgactgatttacataaataccaoa 
acctai 1 1 1 ugatoagotaaaagacatgtgctcatctccaattacagttt 
^^^gctgcto tcgoccaaccctatcagcgocgaggccacaaagcataaga 

AI ICI I I 1 GGGATTACACTG ACATCXATA ATTTTTATCACTATCTTCCAT 
TACACT ATTGTGCACA ttaagccaattttctg A TCATCACATACTTGTTG 
TAACTGCTGCTGGGGGCATATCTAAGCTTTACGT 
> Sequence 583

OCGCTAGGAGTACTOGCGONGGCGGTTAGGGCTCT ACCGG ACCNCNGACC 

CTCCNGGAACCGAAAAGGOIGGNGCCKKJTTTCANCC^GGNCNCACTGAN^ 

GNCGGACCACANGAGNCAAACTTAGG>ICNAGCNCAGAGAAAGCCCGAGAC 

AGCAGGGCAAAAG<X>GCKNGCGCCCCGG>WGGAACANCGCCAGCCNCCTC 

AN AANCCANNNCCAG ACAAGCTTTCA 1 1 111)11 1 CAAATCCG ACATCTA 

CTCCAACTACATGATACACTAAAGTGCTTGCTGTGTGGGCTTOCAGGGGA 

GATGA AATGG TAAGTOGGGCTGCAGCATCTCTG 1 TCAAAATATACACCAA 

TTTCTGl 1 1 C I CAATGGCACTAATCATAACGCCTCGCCCTTTGGGATCCA 

CAGCTAAGAACTGGCCAGGAACGA 

> Sequence 584

GCGATTOGA GCTAC CCGCGGTGGCGGCTCAGGGACATTACGTATTGOTTA 

tacaacatttgtttaataaatgcaactaacaaagctacacaagactt aga 

TATTGAAGCAGAAAAGGTGGTITTACAGTCCCTGCATTAACCTCTAATrC 
TTACTACCCTGGCCAAGAAAGCATTTTCACCTCCTGCGC 1 1 I LC I 1C CTG 
TGTCCTTGTGG7TGG1 ICI I ICI IC1CAGG C I I ICI I ATTCTGATGCTGA 
GATAGTTCTGI I CACTTAGCAACTTGGGACAGTGACACAGGGTTTGTTCT 
GTACAAGCAGGTTATCCAAO AGGCATOCATACCCTGGO 1 1 1 ICCI C CAAC 
C ATAAGGA AAATTGATGCAGCTGTTTCTGACAAGGAAAAGAAGAAAACAT 
AC 1 1 C I II GCAGCGG A CAAATACTGGC 
> Sequence 585

TAG T ACCTGGGCCACCAA ACACAGCTGGACTC AA T AT ATGGGG AAGOTA A 
GTGTCCTCAGl 1 1 J iGGAGAGAGATTACCXrrCTTCCAAAAGAGTGCTTGA 
TTCTCGTAGTCCAAGCTCTCTCCCTCTGGTGGCACCCCAATTTC 
CTAOACOCACCTCC 

GCXmi loNGNCACTCCGCGGNGCNanTGKNG^CTGTACmGC^ 

^GGAGACGChWGNAG>KX>JNGNATTTTKNGNNGGATTAGGCTTGAAGACG 

CGGNWANGCNNNNCAGAGNCACANCATTTTTGGKCGAAANAGGAG 

CACAGAGGAAGG>WGAGGAGGCCNGCAG<n4ACCWCGGCCGCNCAAGAACN 

AG>K>GANCCCCCGGGCGC^GGAATTTTAAN CI 1 1 CTI AGGGGTTCCGNG 

GACX>KXCGGGGGGAGACGG^CCCACXXXXXXX><OCCG 

NAACNGCGCGCNNGGCGNAANCXNGGGCANAGCCGGTCNCCCGGGGGAAAA 

hKJNRAhKXXX^CAGAA^WCGACACAACAAACXlAGCOGGGAGCA 

> Sequence 587

GCGATIGGAGCTCCCCGCCGTCGCGCXnTOC^^ 

TAATGACTAATTA TAGGTGAA TAA TTTTACA GACAGTCTA T A TTCT AGO 
AGGCACKrTXrrAGGCGTTTrAATTGGAAATAAGCATTCTG^ 
ATAOCAGTOTAOAAAAATOAAGCTAAAAAAATrC/ULAGTGTTOA 
TCCTGTXX^CrGGGATTTTTATTTTAATC^ 

GCAGHAC1 1 IN U 1 1 1 1 1 1 1 1 1 1 1 11 rTTTCGGGTTTATTTTATGCACAA 
AGAGCCATCG 1UOI 1 1 TTTATTAGGT A GATGCXXrTGGAT AA 1CUH ICAA 

GGAAGATCACTTAGTXXAACTTAATGAAACCAATATOCTTOGCATACT 
> Sequence 588

pCmTTGGAGCACCCCaWNGGCGTITX^ 

G AAACCG AA G AGCCA G ANNGTTAA CTKJCA G A TT AA GA CN A G ANOGCCG A G 
GNNCGGGACAAGAACCGNGAAGGGTrGATGGACAGGGAAGAGACCAACGA

CTOGATCCTTTXXTCAGACTATGATCATGCAGAGGCAGAAGCCAGGCACC 

TGGTCTATGAATCAGACCAAATCAAGG II 1 1 1 1 I GC TOTCCAAGG AGG AG 

ATCGCTGACAAOTATGACTTATTTGTTGGCAOCCAGOCCCC^ 

GGAGGCCTTAGTACCT 

> Sequence 589

GCG TTTGG A GCA CA CCGCGGNGGCGTTOiGNGG ACTATCATCNGNCCGCA 
G A NCA G ACTNGCAGCCG ACCAAGTTATG NGGGG ATTTAG ACAAAANCCOG 
G AKNNC ACCNTNCCA CTNTNNGAGGACTTTCTCCAGGGTCTCTGGTCTAC 
COATGTCAAAGCAAATCAGCACAOCATCGGAATCAGGGTAAGAOAGGGGG 
CGGACATTXjTCATAGTAAGGAGAATCCGAATTTTTCCACAGGCTCAACTC 
TA ITCTI i GTG TCTCG ATTTX^AAACTCGCCGTGTAATTCTCAAACACTG 
TAGGAACGTAATTCTCGGGGAAGCAGTXXrTTGGCGAAGACATGGAGCAGC 
GCA G TTTTTCC A C ACTG A CTG TCTCCCA CCA C A ACT ATCTTGCA T 
> Sequence 590 

GCGTTAGNGNC>JACACCGCGGNGGNNCTCGWGTACNATCTGNGGGACAG 

CANCC^IACNGNC^AGAGCNNGhWTTAANNGNCNAGTTTAGACTNGCCCCC 

CGAACGCCGANCACCNC7*GAGACCCACCTTTTTCA>IAAACAAAAGGCCCA 

AGCCGGAACACWGCCCKGGACCNGNGACAW«^GGACNAIWNC^ 

AAKNGNGGCCGAGNGAACAACCATTTANA I 1 I H I lO GTGTTTGNNNAGC 

CCGC GAANNAA CTAAAAAAAAAAAACOCCAAAAAAGOGOGGGGGAAAGNA 

CCCGOCTTTTT 

> Sequence 591

GCGATTGGAGCTCCACGCGGTGGCtXJTCGCCCGGCAGGTACTCAGGT^ 

ATCTCTGCACTCCAAGTAGG A TG AATAGATAAG A GCAAAGGCTCATGTTT 

GCCAAGTCTGTCCTTTTGTAACAAAAAAOCCAOCAGCTTTATCAAGCAGA 

ATTCGACCTGTATTTCTTAACTTGCCAGAGCTGAGTCTCATGG 

TAGCAGGAGTTGGGGAGGTATTTTTAACAAGGCACATTATCATCTCOOCC 

ACCCAAAGTGGAGCTATTC^AATGAAAAAGATACAATGAGATGTTTATG 

AAA TTATCTGTAGCTATTAA TGTCAGGTTTTTGAAATTTACTG ACCTGG A 

AG AATA CTCAT AATOCAATGTCAAGTG AG AAGCA GG ACAAAG AACATTTO 

CAATACAGTT 

> Sequence 592 

TGGAGTATGCGAATGA GCTOCACCCOOGTGGCGGCCGAGGTA C lllllli 
1 1 I I I 1 1 I 1 1 1 I M 1 1 1 GCOCAGCAATTAAAAAAA I I 1 1 II I I I I Q TAAA 
GACTGGATTTTGCCATGTTGTCCAGGCTGGTCTGGGATTCCTGG^ 
OCAATTCTTCCTCCTCGGCCTCCCTA^GTGCTGGGATTACAGGCATGAGC 
GACCAT ACtrTGGCCACTTCTTCA TTCTTCTTGGCTTTGCGTCOOCGATTT 
AAAATTGGTGAGAAGTTCCTTCGGCTGGGCTGAGGACCCGAGGTCATGGG 
^GGATCTCATGGAGAGAGGGCCAGGACAGGGGAOCGGrrCTOCCAAAGGAG 
TOCTOCTCTC1 1 AAGTCTTTGGCCCAAAGTGTCGGAAGGGCCCCATAAGA 
GOOG^^CCACCO^C GM I IOIOO GACAAAAAAT UI 1 1 1 1 IN 1 1 IGGG 

GGGCTTTTTTATTATAAGTATATATCAGTCTTCCCCTACGCCACGGtCGA 

TtXATCTATATATATTTCATATTTTTCCCOCOCT 

> Sequence 593

GGGAAACATGGCAAAGATTGTCCTOOGGOAAAAAATTGTTCC^ 

^^ogccaaaaaaaactagoo ggggg gaaaaaaagtaaaaAagoccggcgct 
ccag^kxxxacccacacocci 1 1 1 jigggggggggccocctxctccccaa 

CTCGGGGGAOCOJLI 1 1 1UI llAXXXXTTCTAATAGAGTtrcOCCOCCGG 
GGGGGGGGGGGGG AAN AA AAA 1 1 1UI 1 1 ICI 1 CAATTATAAAAAAGGGGG 
GGGGGGGGGGGGGG 

TGTAGaGGATTG ACGGGGGCCCCGGGTCAGGTGCG ATTCTGGATGACAAA 



TGCCAGCCAATCTIXr^GGAATGTGGAGGAGAGAATGAATGG<^ 
TTAAAGATGAAAAGGCT lTGTOG AGCGCCCGOCCC< XiCA< XiTAC TTTNTT
J 1 1 1 1 « 1 1 1 1 ■ 1 1 1 1 1 1 1 i AAGGAGci 1 1 lAi ram IAGTAAICI IAAC 
ATAAOTAAAATAAGACACGGGAAATGACATCrCGAGATCTACGTATCTG 
GCCCATTGCAATTOAOCACA I I ICI IO 0 GTCTS1 1 ICTCTA T C TC TA AGO 
CCACTCTCAAAA CCCCACCTCAAAA T AOGACACT AACATG A TGAACATGC 
ATCAOCTTTGAAAAC [CCTCTOI AGTCTTATGATOATCTAGAAGAOCACT 
GTCCAATAGAA C 1 1 1 C 1U I O ATOATGAAAAGATTCTACTTTTGACCTATT 
CAA TANGGTAACCACTTATCA 
> Sequence 595

ACl mi ACTCTATAGTACTCTTAC fTGTTI ATTATTACTAATCTTATTT 
TATATTAATTATATTTATGTAAATTTATAATACATATTTATAATTTTTAT 
TATATTTTTTATTAGACCCA(XrrCAOCGGTGGCCGCCGCOCGGGCIAGGAC 
ATOGTCAGCATAGTA AGAA 1 GTGT 1 GG TCGACACAG ACT AC AA TOO TCT A 
ATAACTAGGTATAGGTTAATTCTTATCTGCACCCrrGACAATA TOaGGAA 
ATGTAATACAA GOG A TACACAATA1TIGAAGTGCAATGGCTTATTA AAGA 

gttaatcaaaqataa i l ien 1 1 ataaaacctaaaaatagctg aattttg 

TGAAAGCATCAATAAACTI 1 1 ACATGTTTATAGTT5CTTACCCTCAAATGT 
TTAATTGA ITGGTCTOCAAAGT AAACATTTGGOTGTGCATATCTCTAAAA 
GTTTaGTAAAAATTGGCCAATTATGCNCAAAAAATTTTTTAAAATAGGGA 
AATCACAOOCTTACAA 1 1 I 1 1 1 1 U IICI I AAATCAAAATT CACOOCCCT 

TTACAAGGTGTCCTGGGGTAGGGAAAAGAATTATGGGTCTATTCCAA^ 
TCGT ACAATTGGC AAAA CAA GTT AAGCTTAA TATTTTATGG AA GTT ATTT 

AAAGAAGTCTCCAATTATAGTCTGCCCAAGGGAGTGGGTTTTAATGAGAA 
TATTAA 1 1 1 1 11 1 1 ATAAACGAGGTGTATOCTTCGGGCACGGATTTTAAG 
AAG 

> Sequence 596

TTAAOAAAAGAACAAGGTTAACTAACTAAAAGCAGAACTCACTTATTTTT 
TGCTOCCTAGGCAATTAAAAATAAGTTCATTAAAAGCACTTGAAATTATA 
TATTTAAOCTG AAAAAAAGTTGCTAAAATTCCAATA TAAATCTAAATATC 
TTTAACTTGCTT AACCCAGCT ATXXXXAAAACAGTGTAGTGGGGCAAAAT 

GOT GCCTGGOGOCTACCCTGGG 
> Sequence 597

TGA1 roCTQl rOAGCTCAO 
1 n I II 11 II 1 11 I I I n I lU AGTTACTCTGA iOt 1 IA TTTTAATGCATC 
TTAGTOCACACAGTTGGTATAAAATCAGAAAATGCAAAGCAAAAACAAAA 
GGTCTGGAGTCTTAGCATCAGAAOGGC^GCATATATACATCTACAGTTGG 
TGGCCAATACAAGTCATTGCCAGACAGTtXTITGGAGGCACAGAACAGGCC 
AGAGOCAGOCAAGCTCTAGGAACTCAOGGGTOOCAGGGAGTTCTAAACCC 

rrorrci oatgctocaaocgtaaaaaaaatgtgggagtgatgaaggcttt 

ATGATTTACTCATTATOOOGOGTAOCr 
> Sequence 598



TGTAGTTGGCTTCAGCTCOGGGAGGTGTAGAAAGGCGCTGGGTGI^I^AA 
ATAGGCTCroCTGGOOCACGGCTOACTGTCITCCTTGTGTCTCTACAGTG 

TAATCTtXXJOCAAGTOCXKTrACAGTTACCTOCAACA GGACCTOOCTGACA 
AOOOCGAGAGGTTCA A T C I G T T T I XCT G TGTCTTGGGCTCTOCATGCTTC 
ATCGCOGGGAGACATTATTGGGAGGTACLAGGTGGGAGATAAAGCCAAGTG 
GAOCATAGGTGTXrTOTGAAGACTXACrrGTGCAGA 

TATTGGGCTTTTAOCTTCCCAATGACTGCOCTAGOCCOGGOJGACCCGaT 
TCACCGOGGOGGGAl 1 1 1 1 1 1 CGGCCATGATGCTQGGGAGGGCCTCCTTT 
ACAAAGTGG 



i m-MTo^occcccxrrccoGTCCGOT 

TTTGGGGTTC 



AOCATITCTTOGGGCACACTTGATTGAAAACTGAGACTTCTGAAGAGAAG 

gccagaagatacaaagacagaccatgccagttcaatgctgtcttocaaga 
acagaagaaaatgatccaggcgcaggaatocataacactggaggatgtgg 
ctgtggacttcacttgggaggagtggcaactoctgogcgctgctcagaag 
gacctgtaccgggacgtgaigttggagaactacagcaacctggtggcagt 

GGGGTATCAAGCCAGCAAACXXK^TCCACTCTTCAAGTTGGAACA^ 

AACAACOGTGGACAATTGAAGATGGAATOC^CAGTGGAGOCTGTrCAGAC 

ATATGGGAAGCKXCTTCATGCCCCTGGAACGC^^ 

GGGGA CAAA AGG AAAOCATGTG ATGGAC 

> Sequence 600 

G7TTGTGCGCACCAOGCTCCGATGGCCCAGGTGACCAATGGCCGCAGGCT 
CCATGGCGGCTGGCI ICI I OCAGGCCTTCATGTCACCGCGCTTCCCAGGG 
GGCCOCCGGCCCACOCTGCGGATGCCGAGTCAGCCT^^ 
TGCCTCCCAGCCCCTCCTOCCTGGCGCCATGGAGCCCTt^ 
AGGGGCATOCGAGGATGGGGGGCOCAATI 



ACCCAACTCCCTOOOCGOCXX^GGCCTGCCTOC^ 

CAGTTCGTGGCCCGTGGGCCAGCCOCAGTCGAAACTTCGATCCCCTACTG 
CTTCTCATIXCCCGGCAGCTACACCrc 

OCTGGAACACOCATCATGOCTAGGCCTGGAGATTOCAOCAACTGCAGCGA 

AAACATGTGCACTATCATGAACCCT 

> Sequence 601

TTTTOAGTACTAAGCTCGACGCGTAAAAAAATAATAAAATAAAAAAATCT 

gtgcaataatttaaaatgtcctcocaggaatagacacaaatgttttgagt 

ATCTTTT AAGCTGCA TTTTCCTTT A GTGATGCaTTTG TCAATTGCACTG A 
A TTT AAA TCTG AAAGTCAGA GGTG ATT ATTG A TA GT AC 1 1 1 PGTA TTTTG 

atatggacagtttattcatttgcatacagttattgaci 1 1 1 icccagctg 
atta aaaga tagtcaagaaattctgcaatatagctgccaaaatagacagc 

TACATTTT TATG ATATTGTCA1C1 1 1 ICIG N1 rTTTTTTTCrTTTTTTTC 
TTTA GCTA TTTT ACTT AAGCAT AAT AGCCACAA T A GGACAT A T AAAA GA T 
TATAAATACAGAGCTrTATTATCTTGACG7trTTGGGT L"1 1 1 1 AAGTATAT 
ACTTTTCTGAAAGGTATCCATTTTGTAGGCTTGGCi 1 1 I CI I A TGAACATA 
CGATGTTT 
> Sequence 602 

CACCACATTCTATATATCATACC AATGAAATCGA rTTTGTTCl ACCGTAA 
CATGTACCTCATACTGTATCTATAATTCrcACGTATCAATGATCTAATAC 
CAGTCAAGACATCATGAATAGACAAOCAAGAOGAGGTGCACACC1TGGGA 
CACCCTCATCTGCGTGGGXXXXXAAGATCGGAGCAGCGAGGCTGCGGGCT 
ACCCOCATGOCACCCATGAOCTGTAGGGACCAOCTCTAGATGGCTACTOG 
ACTCAAOGACAACACAOCATGTCTCCGCTCXIATCTGGOCAAGCTGAAOCA 
GGTGGGAAGACAACAGTCIt^CTTTGOCATGACGCACGGGGGGAOOGGAT 
TCGCOGGAATTGACTCCAGCTCTOCaGAGGTGAAAGGCTATTGGGCAAGT 
TTTGGATG£ATCTACTCAAAOCACOCATGAACTCACCATTTCCAAAAAAC 
^AATTGGCTGCATAAATXXXXX2GCS^AAG<KICGCX!IAACATTTAAAGAGAA 
CCCGOCAGAT1 

AAOGGCTCCTCTGGAAGGG00GGGTA 
TATTAQATC1X30CCCAAGTATTCTAAATAAAGGCGGO C ' 1 1 1 IUCI 1U1U A 
AAAAGOCATGGGGGGGOGACTAAG 
CATOTGTTCATCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAAC
CGTAA TC ATA fCTTTl 1 CCTATCTATCTGAGGCACTOOTOGAATAAAAAA 
OCTGTATATrrTACI f TUnXlCAGATAOTCIT GO OGCATCrroGCAAGTT 
GCAGAGATGGtXKlAGCTAGAAAAAAAAAAAAAAAAAGCOCTTTTCAGT^ 
GTGCACTGTOTATGG TCCGTGTAGATTC ATGCAGAA 1 1 1 fCTGAAATGAA 
ATOTITGTTTAG ACOG AGAAT CATACOGGGTAAAGCAGGAAATGACAAAG 

cttgc i j 1 1 ictggt atcttttjtagg atgtattgtgac i j i laactgtta 

tattaw attgcc aata ttaag taaa tatagoattataatattgtataggg 

gttttcacaaaoctttaoaoocttttacctttcagocaccocaaag 

TTCATATTTTAGAGTCAGGCATTGGTrATACATGTGTAGTTCCAAAGCAC 
AT 

> Sequence 604

TCGTATCGTAACTTATTTAATTCTTATATTAACATATACTCTATCTTATA 
CACTTATATTACA TCAC I T f TXT T A CTTT A TTATTTA 1 TTTTCT I ACCAAT 
TTCGTATTAATrATTTTTA CTA T CTTCn GATGGGGOG ACACGCGTOCG A 
GA CAA TA CAAA GTT ACATTTTTGG A CCA T ATT AAAA CTGCAAG AA G ACAG 

gggtcttactgaagatxttttagaaaacttaaatoct 
tagacatgtgtagaatgtagctcaattttttaaaaagtaactgagctaga 

GGGTGAAAGTTGAAACTGACACATTTTCAAATTAAOATTATOC^ 
TGTAACAGAA AACAATO TTTAAACACAAGCAG ATCT G ttgt A TGTAATAA 
GTAACACAGA G1 1 111 1 AAAACAAATTTAATTATTTAGCTTTATTGAAGT 
mgT TTm 1 CC 1 1 CCGAACCTGGAGTTA TCATAATTATAAAACAGCAG 
TTTTCACAOCAGAATTAGCAGTGCCC i I I ICI I 1 1 lUGTACATACTGGAT 

TOOAACC1 1 1 ICI 1 1 actgggtacctxxkmccacttttt atgttagtttt 

TGATGCATAAI ICI 1 1 OOAATCCCTTTTATACAAAACTTAAA T 'I Gl 1 GT G 
TAAOAAAAAOC r O TO OCI UGAATGTAATTAAG 

T^^AG^GCGTTGAGCTOCOCGCGGTGG<XXXXrGAGGTACCCAAATAOCAC 
TTCAGGAAATCTGGOCAGATCACCTGAATOCAAATGTrCTATTAATTCAA 
^ACACGTTATCAAOTCAAATCCAAGCAAACGAGAGTCrCTCTOCACAACG 

gagccatgatacaatgtgatggtcaaattcagatoccgaggtttcagaaa 
atcooocaggaaaggagctaaogaatcocctctocatcgtaatttatc 
^a ttaa tatctactocaacaagcaattcaatgcatggattgacttttag 
aoc cttaag aotgaagta tcagc acatcixaogtctoo^ cci ICI I AGG 

CTCA TGTTGATCCACT AAATTTTTAACG AA CTGGT ACCTGOCOG 
> Sequence 606

aaagaattgagctaoogoggtggoggccgggtacttaaaataattactgg 
cagtaggttataai iogicc-i I iaaaaataacattggaatacaggacttg 

TTCC CAATTCGGTAATTTTCATTA Ol IGTITTOTTTCTTI Hj ATTTOAAA 

cctcgaaatacagtaaaatttgactxtittaaaatgttg^ 
aaaaaaaaaaaaaagotcogggggggoggaggtcagggacaagatggtgc 

CACOGGTGCAGGTCTCTCCGCTCATCAAGCT 
> Sequence 607

TTT A1 1 PC I 111 I ATATTATTATATTATTATTTAAATAATTTATTCTATA 
TAA TCTTA T^A mATTi qTA CTA MCI 1 1 I^ATTATTODTiVrTTTTATAT 

AGAAGAAGGGGOOGAAAGTCAOOGTCAAOGTCTATTTTIG^ 
GGAGATGAAGATGTAGGCCGGGTGA 1 C 1 1 rOOTCTCl lUJ GAAAGACTOT. : - 
TOCAAAAACAGTGGATAAI 1 1 POTCGOCl IAGCTACAGGAGAGAAAGGAT 
TTGGCTACAAAAACAGCAAATTCTATOOTGTAATCAAOGACTT^ 
CAGGGOGGAGACITCAOCAOGGGAGA'n 
AAAAAAAAAAAAAAAAGGTACCr 



GAGAACAGA^^ 

TCTGCCGATOGACTTCCGCCTCXXjGCAG 
TG ATOTGGAAAG GAAAATAGAAATAGTOCAGTTTOCT agccgg acacgcc

AACTCTTCGTTCGATTATTAGCTTTAGTGAAATGGGCTAATAATGCTGGC 

AAAGTGGAAAAATGTOCGATGATTTCAAG^ 

CC TO 1 1 1 GTGGACACITXrrGATCGOCTGGOCICG TI AGCT AGAGATGCTC 

TGGTCCATGCACGOCTGOCTAGTTTTGCCATOCCATATt^ 

CCTGOCCG 



TGACTC AOCGGGTGGCGGCOGCCOGCCAGGTACTITJCGCL 1 fOCCC! I AG 
CI rCT GOAGAAOOTGC I ICI 1 ATTCCTGGCA GGCTTCAA GAACAGCTGCA 

cat gtgco gctaactgacogcgttgocattggcgacctggact^ 
caggtttattctaaacccagtgagacgtoagggogagtgatgaaagg« 

TCAGCTGTATTTOTGTG 10 IblUI GTGTG A GCACCTGA CAAA TCTA TG AA 
ACCGAGTGAAAGGAGAAATJGTTAOATTCTTTATTATTTTATTATATTTAT 
ATGCAAAGCTCGACTCTCCCT IT GGTAAGTCCGAAGCATCT TG TCTGTTC 
GTCCGTG AC 1 0 1 C I I CC I CAGGTCTCTGGCCTGTGATTTOCAGTCACCCT 
TGTAGTTACTGACAGG AAATT GACTGGACTGTCAI I I Q I OI rG AAGTCTA 
OGAGGGA AATCGGCCATTTTAATTGTATGAATTTTCGTCATAAGTAAGGA 
CI 1 1 1 1 1 1 ATCTCAOOCATATTAOATATATGTAOCTCGGOOG 
> Sequence 610

OAGCACCGCGGGCGCGAGA ClbN 1 1 1 1 1 1 1 \ LXJ I A TATAAAGTGATACTG 
AAATA TGCTAATTAATA TATTAA TTTT AGTT AAA TGCTGCTAA TATGCAT 
AO, lu ll ACITGA AGG TTTTTA ATATGTTTfGATAAClTTAATAACTTCA 
GGTOA TG T CI GT ATAATTTTTAAAGTGCAGCTCTCTCT A ACAAATGTGCC 
CTACAACTCCTOATTAAAGGGCGTCTTGAAGGTTCAAAAAAAAAAAAAAA 
AA AAAAA A GGT ACCTGCCCG 



A 1 I C 1 T A TAOOOGCACATATTAOO 1 1 1CIC-I A TTACA 1 I 1 1 ILIA TTTTT 

aattttacaaattcatc tcgatatttttattctg I C A TTTACAACTTT AG 

ATT fllilll ICCl I 1 1 1 AGTTC GACTCAOGTGTGCKXXrrCGAGGTACTT 
ATGAGAAATTGGCATGCCTTTGCTAATATTTA 

GTTTGATAGACATTATAGTAATGTTTGAGTAGGGTATTGTACrATATTCT 
A GTGGTCCT AT A COG T A A TAA T A TCT AA T ACAGT AA A T AOCTGCCTTGC 
CTTTTTCOCTGGATAGTTCTGTAATATTAOGAAATTGTTCTACT AACCTC 
OGAGCAATGTGATOCTCAA 1 1 ICK»lCjlG GACrrAGGCTTOGCCCTTTTCC 
AT GTGGG TGCAATAGAGCTAATTATTGGCTCAA G ITTCTG1 AGGTATAGA 
GA1 ICI 1 IClCJACCAAGTAGACATTATAGTCTCAGGGGGA Oj I 1 CIC1 1 
ClCILXjIOl GGGGT AACAAGGTrCACTAGAGGCCAOCTTATAATTTTTCC 
CITOOCTGGGCTTCAAAOOCTTGTGAAAAAGGOC1 1 1 0 1 CT ACCATTAAA 
TAGG TTAACC A TGGCTA GAGOTTTCA 1 1 1 IC I I OCC AAACAAGTATTCTT 

atgaci 1 1 l i aagtttt aataaacaacc 1 1uji iccci 1 1 1 caatataat 
gaaatggtg gtaataa ggaggtgaataaaatatttoocaacattatagtc 

CTTAAGO CAAGC 11 GT ATAAAOCTCAGTA TTGGTTC 1 1 A AGAAAGCAAAT 
1 iG GTAGGTGAACCTTAOCAATAUI ITGOTTOCTAGGGATGGATAT 
^j^ '^OAAGOGCCTAGGCA ^ 

> Sequence 612
ATAOCCOCCG
GAGA CAGTG AGCCACCGCGG TGGtriG< X^GCCC^ 
; ' ATGCAGTTCAAAATACTGOCAG 1 1 1 ICLAAGAAATTTTGTAAAGTTGAAC 
ATGGOCATXn-ACTCTTGCCTT AAAA C 1 1 1 IT L TCACCACAfXCACCTTCCC 
A CATGCA TGATATOCAAGOTCOACAGAOCTGGATrAOAATOCACTCTCAA 
GCnCicATGCAG TGOGTATTGT Al 1 1 ICI GCATAAGAAAGGGCTGCCTC 
TAG AACAC AGTAAGTOT ATTTGOCCAGTAGTGACATrGOCTACATATAGC 
C AAO 1U1 I ATAO TATACCAACTTAG TATATTrTTCAAGGAGAGCTAAACC 
AO. 1 1 1 rOTAATGGI HOP 1 1 1C1UACTGTTA ICI ICCt 1 fOCTA TAATT 
AArTTATTTTAATCTACAAATTGACATAGGGCTAAAAGCTTt^ 
ACAAAATATTAATTAA TGT AATTGTTCCCAA TTA TT AGAAACTTTTTTOC 
* T7TTTA4AAA TtKJTTCCCAACTTACA
> Sequence 613

cgagaac tacrc tagtocccockcgtatttdgtgagatcocacooocagg 
aacnccactttoanoaogccatcaaaaccaacggttataooocncocaca 

NNNCACNCNGA GGGGANCTTTTACAACNNCAOCCGOCNCCCOOCCTOOOO 
AAAGG AAACCTAA CTCCAOGTCTCTTOCAAAGGCCTCTCCTGGTATTTa n 
Tn^AOGAGACGCOCACCTTATOCAAAGAGCTATATCOOCTGGOOCGOCTT 
GATGGGCTTCACACAGTACCTGCCCC 
> Sequence 614 

GATGATTAAGCCLAATAGTAGATI>KHXXKKXGATGTTGTCAGATCTACTC 
ATAGKTAAGAAACATTCTTTATAAATITroATCOCTCOGTTCAAGCCAAT 
A^TAATTTAATTACAGATGGATATTATATGGTAACGGGTATTrACAGA 
AGGA^GGGTGJ^ATTACCGAAAAAGCTAACGGCACGACGTITATTTTTCC 
CCC ACA A TCTTTCA TACAGC A ACTA ACA A A TTG AACTTGCAAAA GCACTA 
AAAC A TCACATGT AA ACCCAGCTAACAG AAAAATAC ATTC AC AAGCGTTG 
TTGGTGGTG CTCTG TATXTIXjTGTGCTATGGGTCAATGTtjCTCAAGAAACA 
GAAGGGAGACTTTGCCACGGCTCATTTT^ 

GTTTACAATTAGGTTGattATAAA^GGAACCCCITATrCAATACCCCA 
ATACCAAATAAACO LI 1 1 I CM I C ACATACTTAAAAAGAAACCGGGTTAA 
CTAAAAAGAGGAAAGAAGAAGGCCC 
> Sequence 615

TAG AGG AGCTCACAGCG CAGCOCGGTGGCGGCCGCCCGGGCAfifiTA fmr 

' * 'Jill} 1 1 1 1 1 J I rrAATrTTXXATGTATTGGCCTTAATCAAACTAT 
AAGCTCTOQAGTCGCCAATATACTCCATTGTGATTAT^ 
TCA CCTGCCTTTCT A CT A TC AACTCTT A TTAG A TTAAAAGG AA A T AGA CT 
GAAATTGGGAGNGAGGTCAGCXXjCTGGCTGGATAAGATGTTGGGCTCAGA 

ATA TAACCACTOTGTCTG AGCTGGGTTGTGGATAGTCTTATTTCGCAOAfi 

G<*GGATAGCCATTATATTCTATGAACCT 

> Sequence 616

TAGAGTTKXjATCCGCCGCTCAGGCGGCCGAGGTACTGTGC^^ 

cactaggtgaccagagtggttttgactcctctggtgct^ 

CAGGGGTCTCTATG ACCTTTTCCCTCXrrGCAGTrCA , 1 1 ii i i t i 

Ol 1 1 II 1 GAACTTACTTCATTGGAGAAGCCCCTTGA I 1 1 1 Id ICCI 1 1 A 
TACCAGATCrGGCTTCACCAA^GCTGCATTTAGGTACCTGCCCG 
> Sequence 617

TCCCTTCGAACTGGAAACATAAAAAGGATGCC^ 

GOGTA TTTtKSAGTTAT ATAAGCT TCOCAAAAAGGCATATTOCTTrCAAAA 
TTTTCAAAATAAACAA II II 1 1| I ACTGGATTTTAAATGGGGGTGTGCCA 
ACTCATTAAAGGATITTATAATGGGTGGGGCCCCCGGGCCCG^ 
t^^^L^llJ^F^ ,,C 1 ' ^^AAATGAATtXrTGGGCTCGGGGGTCXXJCjTG 
TGGAAGGGGTITAGTTATTTAAAOGGGOGAAATGGG 
> Sequence 618

CAGCOT^G<raoOGO^ 
CGTTATCGATCATAA^ 

TAOCTCAATCATATATOCACACAA GIO C I l l ld 1UACATTTCTCCAAAA 

"roGGAGAAGAvAGAATAAAATTGTTTATOCTOCACAACrGCCTGGAG/DCCC 

TGAGACCAGCAGAAATCTACCACTGTOOAAGACAAATAAAATATAGCAAA 

^^GATTGTGCTATT^^ 

GGCCTTGGCTCAGTTGGAATTCAATC 

AG ^<^ C ^^ TA TGC^GTGAGAGACCATAACGTG 
S^ZZS^^^^^'^^ ^~*"^ A TATATAGCTTG AGTCCACXTIXXKKiACGAGGCCA 
GTGCCTOAAACGCATCCC<nACCATGGCACAG^ 
AGAAGCTTTATTGCCATTATTTTGTGAAOTTGGTGGAAGGC^CCCACCT

CCAC CTCA GCCACCAAAGACGGCAGTTGCCCATTjCCAAAGAAGATATTCA 

GCAGCTTCGCAGOCCCACCATOGTrCACACTCTATGATGAGGAGAATTAG 

ACTCCACAGTGTATATATTTTCGCATTTATTTTCTAAAAATAAACAAAAA 

TGG AA GCCAAA AAAAAAA AAAAAAA T A CCTGCCTCGGCGGGCG TCT AGAA 

C TAAG GAATOCCCGGGCroAAGAATrCGATATAAGCTTATGGAACCGCGA 

OCTTGGAGGGGGGCCCGACCCAGTTTGGTOCTTTAAGAGGTTAATtK^ 

TTGGGTAATATGGG 

> Sequence 619

TTAAGAGCCGG AGCTGCACCGCGOTGGCGGAOGAGGT AOCTACTATGTGT 

CAGCCATGGGGGATACAAAOATCTATAAGGCACAAGACCCTCAGTCTTGT 

AGTOGCCTGACAGCCAGCCAGCTACAACATAATGTGGAAAGGACAATGGT 

GGGAAATGCACTCAGGTCTTCCTAATCCACAGAGTATGCTCAGGCTGTGA 

CATAGG A ANGA A AAC ACG ATACTTTACCTTA ACACNGG ACTTGG AGGG AC 

CTTCA A A A A ACATGTG ATGG TCAGG A A ATCCA GTTTTAAAA GTCTTG ATT 

TAAAAAAAAGAAAACACTrTCTGTGGATAAAGATAGGCTGCAOGAAATGT 

AACCTATGAAATTTTCTtlAAATTAGCTTTCAGACACACACAAAAAATTGC 

ATTTGTTTGAGGAGCAGAATGTAACTTATATTAAAGAATAAACTACTATT 

TAGTATCTGAGTGAAGTACCTGCCOG 



AGTXJAACCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTAATTTTTATGA 

GAC^TAGATATGTATTTATAAAAAGATAGATGGAAAGAGAAGAAATTAAC 

TTAATTC TAAGA GCCAAATTTACir^GAAGCTITAGAAACACCAAAATTA 

ACAGCCAGI 1 1 ICI IGAI 1 1 ICI ICI I GAAGAAG AGATTGGTGTTG ACTA 

TGGTCAGATATACTATGGCCTTGAGAGGCAG7TTCAACTTGAAAAGAAGA 

TGCAGCTTG AGCA ATCOGAGAGGACTTCAAAGAAGCTGATGAOCTCTCCC 

GTGGACTT ACTTTGACAATGTTGG AAGAATCTGGCTGGCTAGTCTGAACT 

GGAGTGGCTTGAGAACTCTGGGCTTXXnTATTCTCAAAG 1 I CI II I I G TT 

TGCAAACCC 1 1 1 1 1 1 1 AGTAACCTGCAGAGGTATAAACTG ATTGTGCACA 

CCCCCTXXn'ATTCCCCCAGCCATGGGCATGGTCCCAGAATATA-^ 

ATGGAAGGGCTTCCCAGOAACTGGCACCOATGGTCCXXiATGGCATTGAGC 

CAA AG AG ATG AG ACGACGAAGGCATTATATTAGCAATAGGTGGGGG AAAA 

GCTGGAGGAATGGTTTCAGGTGGTACCT 

> Sequence 621

A ^ A C^CCCGCGGTGGCC<5CCGAGCnTAACCACGCCTCCCCATCACAGAGC 
CTAGG AAA TCGCGATG ACAGTTT ACAGCAGGTAAAATCCGGTGGA G ACCA 
^^^^^^CCCGAGAACKXXrtGCGATTGTTTGGGCaTATGTAA.CTCGCTG 
GTACTXTTTGCGCCAGGGGCGNOCCCGCATGCTAGNAAN 



TGATt^GACTGAGCTCACC<KX^rrGGCGGCC<MGGT 

TAAGGACAATAAGTTTACTTTGTATCTX1AACTCAAAACAAAGTAGTTGTA 

TATirTAACATTCAAAATTGGGATTrCCCAATGTCACACATCATGAATGC 

AAACCCCTCCAGCCCATCAGACXKX1AGGCTGCCTACTW 

TAGTATATAAACA TGTA AAAA TAGG TTGTAT1TTACTCTATGTATGATGC 

TAATCAATGAACACTTTATTTATTITACAGAGAAAACTTATXrrGTGAACT 

TTAC TATAT AICI GI 1 1 ATTTTACTTTA 1 1 1 1 1 1 1 1 ' I A AATAAAAAGGGG 

TTTTAAATGCT7TGC«rrTXrrra 

CCCGGCTTGA AAC1 I'll TTAAAA GAACGGGGAGAAAACXXTTOGTOTGCAT 
OCAAGTAAAGTT TTTTATCr AAAGAAAGGTICTTCAl 1QC1 1 1 I IT G ACA 

CAGTTGTGGCTCTGTT I II! NQAATOAAACTOGnTAGATATC 

> Sequence 623

TAGAGTGGCTCCCGCC<XXKXXKXGCCCGGGCA^ 

TTGAG 1 1 1 U I CI GGCTAGCAAAAAGCTGGCTGTGTTATGTAAATAAAGCC 

CCTA TAGT AATTAAAATTTAAAAAAA GI 1 1 1 11 AAGCTGGCTGTT1 1CCT 

ACCACTTCAGAGTCCTTGACCXXCTAATTTAGG^^ 

AGAGAGAAACAAACAACAAAACAGTTAAGGAAAACTAACAATXXiTCACAC 
AAA TT A T ACAA 1 1 ICIUAGTGCTCTAAGTCCAT1TMAAGAAAGCTCAAAC
TDCATAAAAACATCACCTGCCTTCCATCATCATGAAAGCAGGAAAACTTG 
CCTTCCTTG TTGNG A GCAAGTAAAA CTCCAAAAAAAGAGGTGTTG TACCT 
> Sequence 624

TGTrATGACTCACCGGGTt30CGGCCGAG<rrACO<XXKMXJAG 

T A COGCAGCT A GG AAT AA TGG AATA GG ACCGCGCTTCT A TTTTGTTGGTT 

TTCGGAACTGAGGCCATGATTAAGAGGGA 

> Sequence 625

TTTAAGGCGTTGAGCTCCACCG<KrrGGCGGC^^ 

GATCTTCATCAATGTGTOGTOCAC I PC 1 1 H C1GI I 1 CIO TCACAOTAGC 

TATAAACAGCTGTTTAAGGATATCCTTATCTAAATTTCTCCCAATOAGX3A 

CCAATCGATTTCTTCTCTX^GTGTCATCXrTTCCAGCT 

ACATAGAGCTCATCCCGCGTACCT 

> Sequence 626

CCG<KKAGGTACGCCKjNGATGAGTCCTaGGAGGCGCTGGCTC^ 
CTCGGAGGAGCGGCTGCTGCTGCTGCTGCTGCTGCTGGTGGC^XCTTTGC 
AGATGTA 
> Sequence 627 

GGAGACKIATGAGCTCACXXXXJGTGGCGGCCGCC^ 

CTTCAG^AAA ATTXn XXrTTCAG<IAAAAATGTCCAAOATAAGATGAATCAC 
TTAATACCGTATCTTCT AAATTTCAAATATAATTCTGTTTGTCACCTGTT 
TTAAA TG AA CCAAA CCAAATCA TAC 1 1 1 1 ICI I lU AATTTAOCAACCTAG. 
AAAC ACACAI 1 ICI I lOAATrTAGGTOATACCTAAATCCTTCTTATOTTT 
CT AAATTTTG TG A TTCT AT AA AACA CA TCA TCAA T A AA A T A GTGGCAAA A 

AAAAAAAAAAAAAAANNG<XMTACTCCCTGATAAAGGGGGAATTTTCCAT 
GCCGTCTACX^^ 

Li i n i n 1 1 I 'AAAAAAAGGGGGOGAAAACC7TTTTGTTAAAAAGGGGG 
TTTATACACACC<KKK)GOGGGOAATTrTCCGGAOOGG I 1 1 1 1 1 1 1 II 1 1 1 
TTAAA AAAAAA GGGC^GCGCCCCCCCAAAAAAATrTXnTTATGGACAACA 
CACAATTTTTT 
TAGTTGGACTGACCGTGGAGGCGGCCGCOCGGCAGGTACGCGGTGGAAGA
CGGAGGCC^TTCTACAGAGACXTTAGGCrGTCAGGGAGTCTTTA 
TGCGC I ICI GIT ''^CTCOOCGCCCCTXntiCrGCTCCOACrcACATACTCGT 
CXlAGAACXXKjTCTGAG<XrrCTCCGCGCAGAAGTGCCCGGAGCATG 
A OCT 



T A G A G TGG A GCT CA CCX3CGGTGGCGGCOG AGGTA CAGA CGACGTCACCGT 
ATATCTTCl 1 1 ICOGCCAGTGGAGGATATCACCGAAGAGGACTTAGAAAA 
TGTTC<XA TAACTGTTCG AGA TAAAA TCTATGAT AAA GTTCTGGGTAA CA 
CCTGCCATCAGTGTOGACAAAAGAOCATCGACACCAAGACA G T G I G I CCA 
ACAGTGCTGTGGTGTGCGAGGACAGT 1CIG I CiGACCATGCCTGCGGAACC 
GCTATGGGGAGGATGTCAGATOGGCATTTXTOGACCCtX^ 
CCCCCCTOTXGTtXKJATCTC<AATlTX^ 
> Sequence 630

TACAGTTGAGCTCA<XXSCCGTGGCGGCCGCCCCG<^AGGTACATAGT^ 
GCGAACTCAAATCGGCATTTAGATAOATCCAGTGGTT^ 
TTTGC TTATAAAAAA AGTGCAAAAAAG A 1 G I GO I i lACAAGTTAAAGCTA 
CAOAATCCCI 1 1 1 1 UC 1 G I AATTGCACCAGTTTTAAAGCCTJC3CGACAGA 
CCAGATCGTTTAAAAC I rTPTTTl I CI 1 AAAAGCTTACACTGTTTGOCTA 
ATTCTOCTCCCC Mill ACAACACGG<XXKXXX*AGGGTGGACACTGGTGG 
CAGCTT AAGG<1ATACTGTCACTTTAAGAAGCCTGCAGATTGAAGTGTAAA 
CATGGAGAAATTAGGGGCTGATTTTTTAAACntrrGTGAGATA^ 
^^^^T^^^^^^^^^^p^^^CCAAA.CAGOGATTTACACCGATTAA 
CACCOOCTTTATATA T TTTTTAC AAAAATACACTGAGAAAATAATCAAAC 

oi 1 1 icAicicici ioici irmroTTm IAAAAGTGTCAAAAGTCTA 
CAl 1 1 AAATATAAAAAATTAAAAGTTAA A ACTCT AGCCCTTCAGTXj AAGG

AGACCTAAAATCGC<nXKKn"AACAACAACTACCAAAAAAAAAGAAAAAAA 

AAAGAAAAAAAGGAAAAGGAAGGAATAAAGAAATAAAGCGAGTAAAAAGA 

AAGGAAAGAAAAAAAG<XjACAAAAGAAAAAATATGTTTGGCCAGTATAAA 

T ACGTTCACA T A T A AAA TGCATCTGA TT ACA TT A A CA AGG AA AAGAAATA 

CXJAGGATGGAG<ATCC^TGAGGAAAAAACACGTTTCTCATTTA(^CCT 

aggaataaacacaacactn 

> Sequence 631

AGGTCATCAGCrTGCCTCAAGTCTGGAAAGAAATTGGCT^ 
AGTTGAAGGG ACCA CCAAAAGA GCT AAG ATTTXrrTGTAATACTCATGTGG 

CTOGCTAAGAGCTCA^CACTCTC 
TTGCAACGAATCTTTTATATATCTCOIXTTCTCCm 

TTGAGAAGTGCAGGAATAGCATCTTTCTCTTGGGCCCTGTAGGGACTAC^ 
CTTCA CCTCC A CAGTTGTG A CAA TG TTAAA G TC A TTGCTG TTTGCC A TCG 
TTTTjTCXZATCrcTTCTACAACAGGTTGCATCTTTA 
> Sequence 632 

A *^TAC&ACAC1^AQQQCAGTTTCGAGCTCCTCTCACAAACAQTAAATCT 

aca caa ctttcaca gaga g tg tg 1 ccgcacac^ttx^ccatcagcttcaa 

ggaggggttccgata 1 1 1 og i ggtcttacaccgagggcaaccctgatcgt 

cx^kkxxx;tttccctoctacagactctcc<^ 

a gccacct acaa gccccxttccccgcgt acca cca cactg tcccaaa tta c 

CTCTTCATTACOCAAATCAAAGAAT Cl I 1CIO I I I ICCCAATCCTCAAAA 
^^^^^AAGAAAAACCAAAGAGCAAACTCAAAAGATGATTTTTACCATAA 
ACCTCAAA TGTGGCTTAA CAAGTACCTGCCCGGGCGGC 
> Sequence 633 

GCCCAI IGCIul HUM I GTTTGCTTGA AGACCAAGACGG AGTTGGGCCT 

CTTGA 

> Sequence 634

^^-^^^^^^^^^^^^^^C^CTTCCAOAGT^TAAACCAGCTCAQATCTT 
A TCTCTGGGGGAA TTA GTGTTOCCCTCA TTT A GCAACCTCCAT A CC ACAA 
GGTCTCTGlL, i GT AGTTACTGGGATTATCCAG ATACACTATCAATG ATAC 
AAATTCATAGG A GT ATT AA TGCA TTTCTTTAAACACAA CTTGATT AA G AA 
GCAAATATGTTAAGCAG1 1 1 ICI 1 1 1 J C I GCTGCTAAATTACAGTTAG AC 
ACTTCAGTATC 1 ICI C I 1 i ACATGTGTATATAAATTAGTAAGAACCTGCA 
TCCAAAGCAATTjTAGTGTTJTGTATGTATCTATATATATTTATTCT 
AC^CTTCAGAAGCCTTTTIGAGTTACAACAATATTTTAGT^^ 
CTGT A G AGGTAAAA TTTCT A TA TTACCAAGCTCCAG A GG AA T ATG A T A TT 
TTACAG<jCACAATTTTCTGGCTGTAGTOCCTG<3GGCATITA 1 1 ltXJ l IG C 
CTCCATGGGATGCTGTTA GAACAA TTGTT A GCCGGCAA GAG AA G AAA GGC 
TACCAGGACAGCATTTATAATTGTAGAATGGGAGCCITT lllCt 
CTCCACTTTCATTrAGCATGAAATAAAAAATAATTGGAAATG 
> Sequence 635 

NAGAAGT O ACTC A CCGCGOTGCCGGCCC^GGTACAGATG ATGAAGCTTCC 
AGAGCT/ TATCTG ATCTCTTAGACAGAACTCACATAAACACACAAATACAA 
GAGGTTATTTTXAAGACACACACTTGCAAGTAA IC I I ICI A TAGAAATGG 
CCACA GCA TTATAATATTCAAAATA TGGAAGA TTG ACACTCTGAGG ATTT 
^-T ^^^^AAAAAAAA TCAAAGGA CTTTJCCAAAAGGATAACTACATAACAGA 
TATGACAATCTACAG<3ACAAAAAGACAACAT<jrcAGCAAATATTGTTCAT 
^^^ACAGOGTTAATCGAAAACAGTAAAACACCTTrT AGCAOTGTGCATGT 
TAAGTCTTTTAGT AACATTATCTGTAATGAGGTTTGAAAGTAAATCACTT 

agtagacaaagtaaaccaccacagaaccaggaatagcaoccatcactgct 
GCI I I ^tca ctcca gaaagctgaaagtcaaccgaacaatcaaaaaaagtc 

AAAG AA GCA TTTCCCTTTG AA TTCAGTCCT AAAAA TA TGAATGCCTT ATA 
ATTAATTTCAAAATAAGTATCTTACAAG IOI 1 1 ca tgaaaca TT G ttttc 
CT AAAA GGCAAA TTCAACA TTATGAAAA T A TA T A TTTTGCCCGGT AGTT A 
CTCAGAAATCTCAATOCTTTCAACTCTAGAGAATGATGCKATCAAGTCCC
C I 1 IO A(XXXX>CTtXXXKnTCOGCCTGTrr>HXAI I lOOC l ICI O CATT 
CGCACCTTAATGCAGATGTA CC I TO CCG 
> Sequence 636

rCTTTTCI I A TACATATAATTCTTrAGTTITATATATTTATACTATOCA T 
CAGATCTGTCrrrATGTCATCTATAATATTAACTTTTTTTCATTTTAATAT 
ATTTATXTITAATATCATTATAACTCGACGATXSACTCAOCCGGTGOCOGOC 
CAGCTACTAAAGCCCAACCTTCACCACTACAAAAAGGAA C TT O TCT AAA a 

ccaagaattcaattaaogctcggtaaoaaaactcaaaacactaatcagtt 

CTCCATCl^GOCAACnxrrAAGAACCCCCTCAACTATAOCXXMC^ 

ACACTACGCACOAAGCCrTACTTGOCCA<rrCTGAATrTCTATTAACTAAC 

GOCAGAGTGAGGGAGAACAAAGAGCrACTTCCGTAACATTITAGTATCCA 

GATAGTACCTGCCCG 

> Sequence 637 

TGTGGCTTGaGCTCA CCGOGGTGGCGGOOGAGGTACAGGAAAGGG AAGCA 
CaGTTTGGAACA A CAGCAGA GA TATATGCCTATOG AGAAG AACAGGATTT 
TGGAATTGAG ATAGTGAAAGTG AAAGCAATTGGAA G ACAAAGGTTCAAAG 
TCCTTGA GCTAAG AACACAGTCAG A TGGAATOCAGCAAGCTAAAGTGCAA 
A I rCI r DOCGAATOTtntnTGOCTTCAACCA TOT CT G CAGTICAATrAGA 
ATOCCTCAATAAOTGCCAOATA 1 1 ICC I tC AAAAOCTOTCTCAAGAOAAO 
ACCAATGTTCATA TAAATGGTCGCAGAAATACCAGAAGAGAAAGTTTCaT 

TOAGAOCTTAATGGACAGAATCAAGAAACAGCTACGTGAATGGGATGAAA 
ATCTAAAAGATGA TTCTCn IXT1 1 CA AA TOCAATAGA I I I ! ICTA CAGAG 
TATTGATGATGTATTGAGAATTCAG C I CC 1 1 1 

> Sequence 638

TCTCGATCACTCACCGGGTCGCGGCCGCCCGGGCAGGTACGCGGOAGAAA 
ACTAAAOC 1 1 CA TTTACTGTGAACA IC 1 1 C I O ACT0TGGCTT0CAGATOC 
TAGTTTACAGAACAAGCaCACAGCAAGACCAAGCTT atgctq AGTTGACG 
G AACAATGAGT AAACATAA GG A T ATT A CTGTG AC1TTG AAATTCTGAAAT 

tu 1 1 ctttcn aacttttccattaaaatcacatttattttataaaataat 

gaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

aaaaaaaaaaaaaaaaaaaaaaaataaaaaaaaaaaaaaaaaaaaaaaag 

AGCOOCOOCNCT AAAAAAAAAGGOTTTA AAAAA 1 1 C I CCCIC TT ATTGGG 

TTTTTTACTOC CIXOJ OCAAAAAAAAAAAAAAA A 1 1 1 I C CC IC O-A TTAA 

AAAAAAAAAATTTTTGTATAAGGGAACCKrrCCCGTTTAATAAAAAA 

> Sequence 639

ATCOCATCAGATTTGTAGATCTTAAGCAGGCAGTCAOCGAGGOCrOGGAA 
O I CCC 1 ! I CAGCTCCAGCTITAOOCACATCAGCTGCTAGACGGGTACCT 
> Sequence 640

TCKJAGACGATCGAGCTCAOCGCOGTGGCG<XXXKXXX3GCAGGACG<XKXK} 
GCTG 1CI C AJCCGGTCAGACCTGGAAGCXX3GCGAG 
ACCTCCAGTOOCTGGCC1 rCOOOCAOCATGGAGTAOCT 
> Sequence 641

TOAOATTOAGTOQAGTTCACOQCOGTCGCGOCC^^ 

GTCTTCAQAAACCAOGC I OC 1 1 1 CAGGAACATIGCTOTGGATTCCCAGCT '- : ' 
TTCAGACAACACATOACTAAGACAQAATQAGACCACTCTAO I rOOCTCAT 
GGGAAACTCGOCAAAAGACTCCAAAAACAACA I ICI I I C iC CCTTTOGAA 
TTCTOGAGTTATAAGGCAGAGGTCCCCCATXrrTGCOGAACTGGCCTATTC 
CGCT AGAAGCAA GATGGCTG AACTCAATACTCATCTG AATGTCAAGGAAA 
AGATCTATGCAGTTAGATCAG fTGTTOOCAACAAAAGCAATAATQAAATA 
GTOCTXKn tSCTOCAACAGTTTOATTITAATGTGGATAAAGOOGTGCAAGC 
CTTTGTGGATGGCAGTOCAATTCAAGTTCTAAAAGAATQGAATATGACAG 
TT>1NNAAAAAKAAAAAAAAAAAAAAAAAG 






> Sequence 642

GCCGAGATC ACTCC OCCCCGGOjGOCGGGACTTGGAGAATATTTOCACAA 

TAGOCG ATGAC I fOTTCTTGI HG ACAAGACAAAOTTC1 1 I O GCTGTTACC 

^yCAATG ATAGTGAGGTCCATTCGCGTCTATTAAATGGAGATGATTOCAT 

CI i u l ^ i ACAG ACACTGAAATACCTGGCTAAAAGCCGCC 1 1 rOCTCTGCG 

CTGCTAOCA GCCCTGTCACA GGTXXXXKXXKTTCTACCTCCXXXiCGTACCT 

GCCOG 

> Sequence 643 

gttgagtgagctcxx3gcggtggcxkxxxiaggcacgagaagclxactcgct 
otcctaaaccaaatgaatggaaagcgccaaaagtgattttataocaaggg 
toca tccatacaaataaacaaaatoctatcc 1 c 1 ici 1 1 c i atattgtgt 

i i «. i i ACA mCTI ATACAAATAACAGAATGCTTCATTTTATTCACTTCA 
A TAGG ACAA AGTOCT TAAAG AAACACTG AAAAG AGCTCATAATCAAAATC 
CCAAATTrrATGCTTATTTTTGGTTTAG<XXrrATCAA 1 1 TTCI C ACATAT 
TAA CATA CGCAGGAAAACATTCIT^GTAAATTGAGCATTn^GTCTACAA 

atgtc ttgaagcactctggc aagttacatgtatcccat g tt g c n I rG GT 

TTCCCATCTCITCII FGCTTCAAAC00CCATGCAA GTTTC1 ICTTTTTTC 
GGGCAGGCTGTGAA TA TTCAACC I CC 1 1 1 1 l U GCTrTTACAAACGTCTOG 
CAGGCAACTGC1 1 1 t^JCAATTTTTACACCAAGCTCTCGAGTAGCTAGCTG 
GTTGCTGCGGTC 



TGACGACGTGGAGCTXXXXCGGTGGCGGCCGAGGTACACCCTCTGGCCTC 
TOCA AGCAA GCA GTC^GOTGTGCATTGTTAG AGGTGCAOCGGGAAGGGAG 
CTT OO 1 1 1 CGGACXXX^GG ACATCCTGTCCGCAAGCAGCTGCT ACTTCTT 
GGXJCTTCTCTaGAATATTGAGGAATTTCOCCCGTGTCATCTCTCTGGACT 
CATCCAGtXCCAGCTOATAGGCTAOG I IC'I U I AGGCCTCGAACCTTCTCC 
ATCAAATTACiCCGTOGTGAOACTeCCCA CI ICI 1 ICA ACATGTCGATCTC 
ATt^CGTTCTATXnXL^GCCATCCATTTGGGTGGAGAACT AGTAATAGGAC 
TTTTG AAGGAAGCTGCAAATTCAGCAACACCTGGT AATTG 1 ICIOGCCAA 
AGATCTGGTGAGGGACGGTCAAGTTTTTCAAAACTTAGCAAAGATGCTTC 
CAGATCnSTCCOGTCTGTGGGAGACGCCATCITCCAACOCATGTCAOGTC 
CCCGCGTACCTGCCCCKXKXXXX^riT^AGCCAG^ 
> Sequence 645

TTAGCGTtiAGCTCACCGGGTGGCCGGCCGCOOGGOCAGGTACTTCAGGGA 
GGCCTATATATTGOCACCCAAGOAATGCCAGOACTOCCACC1GC IOC ICC 
AGCGT TAGOCTCACTOO ICI GCTT ACTCACTTTCACTGC C 1 1 1 1 I UICIA 
1 I 1 C I GOG AGGTTGGT AGAATGAAAGGGATGCTCCAAGGCAA GCA QATGG 
CCTOTOCACCTX3CTATATATTCACAGTGCCAATGAGTCT AGAGTCTTGCT 
ACAAGAAACAAAGTCATGAGAAATGCCAGGtnTXXntrTTACACCCAAAGA 
CTGCTGGCOCTOCTACTCTATCC 



TCC ACTTC CCTTTC AA I 1 1 TO T1 A CTCATTATTGTTATTAATA ICTCTl I 
ATAT TTTGaCA TTAII 1 I M AAATTATA TGTTAGTTATTCACTTCTT A TC 
tatta men AGTAGTGTCOACTCACGGGTCGCGGGOGAGGTACCGGCC 
AAOOCTOGT^ O LM CMC 1 1 G GGCACTCTCTATGGG<XSGAGAAAATtXlA 

TCGTTGAA GATOCCAGCTGCGATGGCTrOGCTCACCAOATICrAO<XnTC 
CI ICICClCXjATGTCTOG<XrrAAACTTATCTTCAAATACAGACCATTX3CT 
GOCAAOGAGACCAGAAOCCATGGTGACAIAAGOCAACTTATCAGTTTGAT 
OCATGAGGATAGATGCTGTAGAGGTGAG<ncrcAGTTACATCTACTTCOC 
CCTAAAACTAGGGCTGCACX^AATGTAACXnTGATACCTG AAAAGCATCTG 
CTTtlAGCATOCaATTOGCTXJTXZACAACTCTGGGA^ 

GGGAGTOGAGCTCCAGGTTGGAAGAAATGAGCTGCOTTXrrCATG \ C I U I G 
TC TGCAO Cl Ql CCCAGTAOCACAACATTAAATATTAGGAGATATXjAAATG 
TATTTTT 
> Sequence 647
GOAOGAAGTOGAGCTAi
TTTTTTTTTTTTTGAGACA' 



ATTAAAOCTTT 



ITACACAATATGTCAAAAT 
TrTTAAATTGGCAACACTCAGGATATTGGGATAATTAA 



ttaggcaatoattcaaaoatgi nrooTn iaaaattcaaaaccctccaaa 

OGTCAAAACTCTOGAAAAAA1 1 1 1 1 PGG1 1 ICC CCCTOCCAC GI 1 1 1 1 1 1 ' 
TTTTAACCCCTTAAAAAAAAA^ 

GAAAAAAAAOGOGG OT 1 O CTC C I CC1 AAAAAAAAAAAAAA AA A G AGGGG< 
GOGGAQAGGQAAAACAAAAAAAAAAA TCTCTCCC1 1 i n ICI 1 1 1 1 1 1 IU ■ 
TGGGTAT AACCACGA GAAAAATATAA 1 1 1 IOIOII I ATTOTAATCAACAA 
CCCCACCACTCACTTATTTTA Hi III 1 1 II CC ACTATCAAAACAAOGCTG 
TTGTTTGTGG 



AGGTACT ITN 1 1 1 1 1 



GG A CGACG A TAG 

1 1 1 1 H I 1 1 1 1 II I I 1 1 ITT A 1 1 1 1 1 1 1 1 1 IATTTT 

* 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 rGTTTrm^TTTT i miiiiiiiiin ii 

cococacggggaccaaaacxrrtaacoccggggacc^^ 

ccacaacoocaaggctgttaaa acacog gttgoggggogggaaaaaaago 

ggggccaaaggggcccccccccctttttcccggggggagaaaaaaggggg 

ACCCGGGGGGGTTA 




CGGAAAAAAAAAAAAAAAATATTTTTGGGGGGAAAAAATATTm 

AATOCOOOCAAGGGGGG 

> Sequence 649

GAGATTOA GCTCCCGOGOTGOCGGCCOAGGTACAGATAGGAAGAATGTAT 
ATTCTCTGGTTGTTGGGTGGAGTGAATGTCTATGAGGCCCCTCACTTC 
TC ATTCAGGAA CAOGATTCAGAGCI rCTCCTGTOCAOTAGGGGOCATCAA 
TAGTTCAI I UC I 1 1 1 1 ArTGTCI G CTACCATTOCATTGTATGGATTCAA 
O CT AGT C TGTTTATTtlATTCTCCCAGGCTTTCCAOCAGGCCA I C7 Cm C 
ACTTOGGGGGCAOCT 

gtqagaatgagctcgocgcggtggoggccgaggtactcagtggggaagaa 

GGTAAGAAACAGTTGATT AACACOC 1 G I G I IC IOCCAGGTGGGATCAGCA 

atatgtaatocaactcagctocatgttcaagga i g i ooc ic i oactpcag 
aagagotggaai i roi 001 ggaaaaagcattgagcat g ttctccaagatg 

AATCTTC^OAAATAOC ACCI MGO I CI A Tt^G CnC I UOIIClCICCIC 
CAACGGAAGCAGAAAGA GICI 1 1 lO GAAGGAATCATAGC C rT C n CA GTG 
CACT AGATAAGCAGCACAATGAGGAACAGAGTGGTGA CGAGCT ATTGGAT 
GTTGTCACTGTG CCATC AGGTGAAC 1 1 CO i CATOTOGAAGOCAOCATTAT 
TCTAC ACaTTGTGI 1 1 OCCATCAAATTGGACTATGAACTaGGCAGAGAAC 
T OGTQAAAC ACTT AAAGGTaOOACAGCAAOO AOATTCCAAT AA TAACTTA 
AGT0GCTTCAGCATYGCTC1 rCTTCTGTCTGTAACAAGAN 

GAGAATGAGCTXXJCOGCGGTCGOGGOOGAGGTACTGCGT^ 
■- GTCCAGCCCCCI 1CCICITOCTGGAAATTAACATTGGCTOCAOCTTOCAO 
CAATTCCTGOACCAGGTCAACA I C I ICC 1 1 1 1 uaacagctttaatcagca - 
AGTOATTCTCTTTXACTGCAGCCCTT^ 

TtXTGGGGGTTGTTATGA 1 OCC 1GC1C IXX^TGAOGGTAAATGCCACCTG 
CTAOCACTTTTAGOCI 1 1 1CCI IGAGAAAATGCAAATTTATCTOCTAGCA 
C^AATCAAAGAAGCTTTGACTGTAAATTCGGATTCT^ 
AGCAGTATGAAGAAGGAACAATGTTCTCAC I CI ICIGACATTOCACCTOC 
TCAACTCAGAOGTCTCAATTA I ICC1 1 rpG CAOCOOCAAAOOCTDQAAOA 
CIOCI rGCAOCOCGAGCAOl 1 1CC1C C 1C4.IOCCI CC G OGT A CCAOTGAG 



GAAOGAAAOAOCATTC I CC I I I A GGGCAGCAATCACAAN 
> Sequence 652
GGAGATGGGTTGAGCTt 
GGAGGGCCAGGTCTCAGGGCTCCTGGAG 

caaatgcttgactcagtgatgcagaaoctttcagagttagctggVagc^ 

^CCCCTOOnCITG ATCCAGCCTGGATCCAGCCGGTGTGAAGAGOAGAC 
CCCI 1CCCICI iUlOGGGTTTGGAICCl Ol Ol 1 ICIAGOCT 
> Sequence 653

AACTXiAGGAATTATAGATAAAOCITAGGTCAAATCATTTCGCAATTCCAT 
TGGTGOTATTOAA AAATGATGAGA 1 7 I CTCTG ACAGAGAO C 1 1 I O ICCTA 
QTTT T^y ITCTTX^ TAGGTCAAAACTGCCAATA IlCTCnG T C l XJ CAAGA 
TAAA GTO TTTG I OCT TCTA TCAC CATATGCATGAACATGTAAGAATCAG A 
TACAA1 1 1C1CCI I CATCAGTTTCACATGTTCATC TTG TCACTGAAAAAA 
TG C A TCT A CTGTTT A T A GCTCCCAAGG A G ACCCXZA A A TCC I H I I I I C I I 
TTGA G ATGG AGTC ITOCTCTTGTI GCCCAGGCTGGAGAOCAGTAGCOCGA 
TCTCAGCTCACTGCAACCOOCACCTOCrGGGTTCAAGTGATTC^ TGOC 
TCAGCCT OOCCA GTAGCTGGGATTTACAGGTGCCCGCTAOCATGOOGGGT 
AAA f HOG 1 1 rTAGGAAAAAOOO OTTITO OCriTTGOOOOGCGOTTTTT 
> Sequence 654

GTGTOGTCGAGCTCAOCGGGGGCGGCGAGGTACCTGTTACCACTTTA AAA 
GTA AGTTC TCCATCCCATAAAGOCATTTAAATTC^TTAGAAAAATGTOCT 
TACCTCTTAAAATGTGAATTCATCTGTTAAGCTAGGGGTOACACACGTCA 
TTG1TOCTATATG TATGTGAC tTCCCIC OCCC 'IC<XAGAATA CI 
CAATTGTAG GTATTC 1 1 1 1 1 GG HI AATTTTTGCCAATGTAATTAAAAAA 

tggt a tgtca ttttt aaaa tttgta men I CA TTACAAATAAGATTGT 
tatctcag tattgt tattgg c rr ' i iugi atto cici i a acgtgaaocgtc 

TGTTCA IIG I I 1 II AC CIGIHICOIIIIA GCAAGTAGTACCTOCOCX; 
> Sequence 655

CCATGG ACTGGAAACAAG 1 1 C 1 1 CO I CtXKXXXTAGCGAOGCCCAACAOC 

TGTCCAAACAAAAAAAAAAGTGAACAAGAATTAAAAGATGAAGAAATGGA 

^TATTTACAAAATATTACTOCGAATGGAAAGGAGGTAGAAAAAACACAA 

atgaattctataagacxzattoocpggttttattataggctgcct^ 

GATGAAGTCTT ACT ACAGAAATTAAGAGAGGAATCAAG AGCTG TCf 1 1 CT 

A CAAAGA AAAAGCAGAGAACTGTTAGATAATOAAGAATTACAGAACTTAT 

GGTITTTGCTGGACAAACACCAGACACCACCTATGATTGGAGAGGAAGCG 

ATGATCAATT ACGAA AACI I 1 1 1 UAAGGGTGGTGAAAAAGCTOGAGCAAA 

AGGCAAGCAATTTTTCACAACAAAA GICI I IG CTAAA C fOC I IC ATACAG 

ATTCATATGGAAAGATtTTTCATCATOCA O I PCTTTAA 

> Sequence 656

GGAGGGAGTAGACTCA<XXX3GTCGOGGCCXOGOGGGCTGGTACGOCACAA 
GGCATrTAATOOGC ACAC TAACAGGG C 1 C 1 1 IOA CAGTGGCAGAAGAGGA 
CGGGACTAAAGTTAC I t T G T GCTGAGAGGOGGAAAOAA gcacaaagtttg 
GTCTGTTGCGTAATTGAATTTTTAACACTCT^ 

nocnnccro croi gt aaaagacatgagatatattacagattttcaaac 

AGGTGAGCATCCTTTTACGAGCTGGGCAGGTGGGGAGTCGCC 1 CC 1 1 1 lit 
ATGGA GTGAGGAGATTTQGATGAATGAACGCtAAQATGQCCAQACGCACC 
TCTrtJGATCGTAACTCTOCAOGCnJGOATTOCAGAGCTGCAAACAA 
TXjAATTCGATCTGTAAAOCTG ITGTCATTTQAOGTrTGCAGGCAGGGATC 

aacatttacattgagattcaatagaogctactactacaaaggagctttat 

TXTITGCACCTTAAAATGGTTGCTGCGGGAAXlACTG^ 

nrm 

GCTIUTCGATGAXTTCCCC GQGTGGCOCKX}^ 

TCTTCATTCTGATCTCCTAGAAGACAGCAAATaCCGAAAAATCTACTCCT 



WO 01/42467 PCT/US00/33312

Table 2



TTACTCi'l AAGCCTCGAA 
> Sequence 651 

CCTTCTGCTACOTC TGTA TTCTA I IUCI I G TCAAATO C I CI 1 11! 1 AATA 

TACTTCCTGTCCrrATTTTACGTGTTTTATTTCAGTTTTXXnTrATACTCT 

GGCTATCGTAATTGAAATGGGGGCGATGGAGCTCACGGGTGGCGGCCGAN 

^^^ ^^*C TtX3GQnTAG<nx^AATCTTCTTATACACTTTCACAAAACATT 

CTATCTTTCATCTCTTOGOCA IC I ICI 1C1 IG CCCATGGCACCTGTCACT 

'HTjCGGGGGTAGCGGTCAATTCCAGCCACCAGAGCATGGCTCJTAGGGGCG 

ATCTOAGOTGOCATCATCAATtnTCTTCAOGATGACAGCTTTGOGTCCGG 

AGTAOOGTOCAGCCAGOACAAGCACCAOCTTCCCAOG 

> Sequence 659

GGAGTCAGCrrCACCGGCTGGCCGCCCCCCXKXICTGGTGCGOCACAAGGCA 
TTTAATGCCCACAGTAACAOGGCIGI I 1 GACAGTGGCAG AAGAGGACGGG 
ACTAAA G TTA CTTTG TGCTG A G AGGGGG A AAGAAGCACAAAGTTTGGTCT 
GTTOCGTAATTGAATTTTTAACACTCTTATCCACAACAAACACTTTTTDG 
TG TCCTGCTG TGT AAAAG A CATCAG AT A T A TTA CAG ATTTTCA AA CA G GT 
G A GCATCCTTTT ACG A GCTGGGCA OGTGGGG A GTGGCGTGG TTTTGA TGO 

agtgaggagatttggttgaatgaacgctaagatggccagacgcaoctgtt 
cgatctcaactxttgcagcctgggattocagagctgcaaacaaocactgaa 

TTOGATCTGTA AAO L I U I IU I C ATTTGACGTTTTCAGGCAGGCATGAAC A 
TTTACATTGTAATTCAATAGACGCTACTACTACAAAGG AGCTTT ATTGTT 
CCA GCTTAA T A TGGTTGCTGCGGCA A CACTGA A AGA TG AAACTGACTTTT 
TT 

> Sequence 660 

GAGTGAGCTCACCGGGTCGCGGCCGC<XG<XXIAGGTACTATGAOCTGAAG 
AGGCAG AGGCCATCAC ItXTTGG rcCGGTCTCCACCTGGGGAAACTGAGGT 
TGCACAGTGTCTXrrGTGGTGACGAGCAGGGCTTCATCCAGTGCCTCTGTC 
CCCAGCGAGGGGACTATOOGAGACATGGAGGGTGTOTGAGCAACAGGTGA 
^^^][pQ^*pCAOCTOAAAACTGGGACACCCAOCCAGCCAACAAACAATGT 
CGGTC ICTGTCI I GGCAOCTGCAGGAAACAAGCTOCTACTTCCAGAAAAA 
GTGCFCC IGGGACTCCAGGATAOCAGGCATCTOGGTAAGCTACXATGCTT 
AAO CACTTAA CACAATCAGGAAGCAACAGCCATGCATTCGGGAAAGGAAC 
TTCAGTGI rGTGT OGCTCAGTCTCCAGACCTAA C 1 1 1CC1 I I lO GTACCT 
> Sequence 661

GGOGTpGGATCGAGCTCCCCGaXTIt^^ 

1 1 1 I I IV TCACCATGAATGTCACOOCAGAGGTCAAGAGTCGTGGGATGAA 
Ol 1 1 OC 1 GACSGAGCAGCItSCTAAAOCATOQATCGACTCAAOGCAAAOGOC 

> Sequence 662

GCCTGAGOTTCAGCTCCCCG<XXjTGGCXXXXGC^^ 

1 1 1 II I II U 1 1 1 1 11 1 1 1 11 1 I IAI 1 1 1 II IAI 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

1 1 1 1 II II I I 1 1 1 1 1 1 1 1 I H 1 1 1 1 1 1 1 1 111 1 1 lU GACAAAACAAACOA 

Ol 1 1 III I AATTTATTTAGGGGG AAGGAGGGG IXSTC1 1 IGGATATACCAC 

AGCGAGG 



GTAGATGGAGTTGAGCTCAOOGCGGTGGCGGOOGAGGT ACTTGTGGAAGG 

TAGTGACCAGCAGAGCCAGOGCCTT3CTCCAGAGAACTGCACATCA 

> Sequence 664 

GGCCTT ACTOGCTTCAAGAGCCGAA I lUX M CCAAOCAOoS^GGGG 3 
GAOOOCAATTAAGGGT1TOOOAO(X^CTATTTTTTAATAAOGOCAOCAOC 
TTAAAAT GCCTO GGAAOATtXnX!GTCATOCTTCGAG<XTCAAATATACTT 
TTOATAATG TrrGC AGCTTCTCAAGCTTTTAAAATOGAGAGCAOCXX^GA 
ATCTAGATATC I J GC I CA GATTGGTG ACTXXXTTCTCATTG ACTTGCAGCA 
CCACAGGCTGTGAGTCOCCA 1 1 1 1 ICIC1 I GG AGAACOCAG ATAG ATAGT 
CCACTTGA TGGGAAGGTG ACG AATG AGGGGACCACATCTACGCTG A CAA T 






ViAiCCIljl i AGi I f l GGGG ACGAACaCTCTTaACCTG i GCaCaCaaaCTT 
GTGATCTAGG AAA TTTGG AAAAG AATTCCAGGGC AGA TCTAC I II II IC T 
AAAGATCAAAGATTATTTTGAGTGCCCTT 
> Sequence 665

ggatagattgagotcccgcggtggcggcogag^ctaacaaggaaagcccc 
tggagctcctgtaataagaatgtggttggaagatgcaaactgtggatgat 

CATCaOCTCCATTTTCCTAGGTCTCATTACAGTGATCATCATAG 
GTCTTCCTGC AGTAACTT ATG ttg atg a agatg AAAATG A AATACTTG a a 
ttatcatcaaacaaaacattcttcatcatgctgaagattccagaggagtg 

TGTTGCTGAAGAGGAATTGCCTCACCTGCTCACCGAAAGGCTCACAGATG 

TGTACCT 

> Sequence 666

gatgactcacgggtggcggccgcccgggcaggtttaatctxaggtctccc 

T A T A C ACTTCTCaGCCTCA GCA CCT A A CCCTCAC A CAACACTCCAG T A TT 
GG ATGCAGTC A ATC1TGTATA ACATTTTTTG A A TGTCCAATGTGCA A AGC 
ACGATGTTGGAAATTATACAGAGGTGAATAAGACAAAAACTCTTGCTCTC 

aaagatg 

> Sequence 667 

TACGAGATTGAGCTGAGTTGGCGGC GG^XXjA GGTACTGGAGAGTCGGCTT 

ctagcaac^tgtctcatctgagcttcccagtgcc^^ 
tggggctgaagccagcgctgacggat 

> Sequence 668

TOAGGACTCACOGGTGGCGCKXXiCCCC^GCAGGTACl I 1 1 II I I I I I 1 1 1 
TTTTTCTGGTCGAAAATTTTTGTTGGAATTTTAAAGAAAANGAAAGGCAA 
AGTAGCACTCAG ATGGCC 1 H ill I GTAAAGTG AAGTCAACCTAATACTC 
TGGTGCTTACTTTGCAAATC Mil ccataagtcaagtattagtgttaaca 
atacacttaagaagtaaggataaacccatcaaggtocacagctaaataac 
cagcagattcccagaaactttatgtatttgggaaaagtaaaatatacaac * 

AG ACA TATCCCTGCCCTG ATTAA GAGGG TAG ATAAAAACAAAACAT AAAA 

CAATTTTACTTGAGATAGTAATAAGTTATTTGAAAAAAATACAACAGAAT 

ATAGGGAGAGAGAGCAACTACAGAAAGAAGACAGAAGGGGTTCTGCTTTG 

AATAGTAAGGCTTGGGAATAGCTGAATTGTAAAACAAATCTGTCAGTCCA 

AAAACGAAGATATTICATTCACCGCTGACTACTGAATGGGAAAC 

> Sequence 669

ggaggactgagcrccaccggggtggcggccgaggtacagagtaggatcaa 
taaaatctgtgtgttacagcggcagactgaaggaogggtgcctcttttca 
gccatgaggtagtccctgaccatctg a g aacca agcctgaccctg aa g tg 
atgctgoccgcattggtgcaga 

atcatoagaagctaggcattagcattcctgtctagtagggatacatagtt 
ggttgcgcctaa g 1 1 c c 1 i c tggcagaacccaaggaataaatttctccat 
atcgtttcctagtt accct aatctctgcacaaa 1 1 1 g i gtgttacag aag 
cagatccagagcttgaataaaatgtgticaaaccttctggagaaaatcag 
canagaggagcgagaatagcagagtgggagtatgatx3cgactggtggcta 
aacagagaagagagggatttaogatcactogaatggatgggtg 

> Sequence 670

TTATOACICACOOOTGGOGGOCCOCCG<XjGAGOACAT^ 
TTTAAACTTTTAGGGTCTTGGOCTATTGCATACTAAAGGGCAAAAGGCTT 
AGAGATATCAAAGGGGCTAATTTTTTATTGACAGACCATGGCGATGTAAA 
ATTAGCTGAC rTTGGTC ItXiCTGCAAAAATAACAGCTAOCATrGCAAAAC 
GAAAATCTTICATTGGCAOCGCTTACTGG^TGG^XXXAGAAGTTGCAGCA 
GTAGAOAAOAATOGTGOCTACAAOCAAC IC T O r OA TATCTGGGCAGTAGG 
AA TAA C^GCAATTG AACTTGGA G AA CTTCAGCCACCT ATGTTTGGATCTC 
CACCCAATOAGOG C1C1CI IC I I A ATGTCAAAAAGTAATTTTCAGCCTCC 
AAAACTAAAGGACAAAACAAAATGGGTCATCAACATTCCATAATTTTTGT
CAAAATAAGCACTAATCAAAAAAAAAAAAAAAANAAAAGTACCTCGGCCG

CTTAGAACTGANTGGATTCCCCGGCTGCAGGATTCGATATCAGCT 

> Sequence 671

GTCGATCTTGAGCTACOGOGGTGOOGOOOGAGGTACOOGOOO I C 1 1 CI CA 
TGCTCOGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGOCT 
TAAATGAGTGCGA 

> Sequence 672

G ATTjACQATC GAGCTCACCOCOOTOG^COOCCOAOOTA 
TTC mCmCl AATAAAA C TTTCTT ITI CGAACCTATACTGTCTTCTGT 
. AAA I ICI I CI I ACTACGCrATGAGCCGTGAGCCAACCACTTTCCGATGCC 
AGGGTTXTGACACCTCACCTGGCATAATATAAAGTGn tTI IT I I I ATAC 
CCTTCCACTTGGAAAGACTACAGAGGAATCTTGCACIXjCATAGTTCAAAC 
T AAA AAG AG A AG AGTT AATT ACCTG A A AA GCAAG AG AAAACA AGAAGGGG 
TAAATTTTG AACCAAGGGAAATCATTTAAGA AGTGTCTGGTATTTTTCAA 
A TTTCTG TC A GTTGTT A C ATTTG TC A T A ACT A A A TG TTT A GGAA TAA AGG 
A TGG A G ACA TGCTT A TTTT A TTT AACTCCCCCNNNNN A A AAA AA TAA A A A 
AAAGTACCTGCCCG 
> Sequence 673 

tgcgtgttgga ttg agctccccxkxxjTggcgoccgtccc ron AATACTG 

GCCTXAGTTCCGAAAACCAACAAAATAOAACCGCGGTCCT 

TCCTA GCTGCGG T A TCCA GGCGGCTCGGGCC I GC i I IG AACACTCTAATT 

TTTTCAAAGTAAACGCTTCGGGCCCXXJCGGGACACTCAGClTJtXjCGT 

T 

> Sequence 674

OCGCGTG ACTCA CCGGGGGCGGCG A G ACTG A GCCA C ACAGTGTCCGG A TG 
G AAGTCTGCATCTGaGG I I G CMCAGTGTCCXX3GTCATTCATTTACACATT 
TTAACTTGCATTAAAGAGCTG1 ICI I ITCTXjTGGCCTAGACTCTTI 1CAC 
TG ATCTCAAAATAAACTGGTTT I 1 1 I CAA AA AAAAA A AACAAAAACAAAA 
AAAAACACAAAAGCTGCATGTCTAAAATTAC^TGGAGTTACTGTCTATTC 
I I 1 1 IC CC CI 1 1 I O CA GCAAC TTACACAGCA 1 1 1 1 I AACAC CI 1 1 1 II I 1 
CTAG1 niTTTGTTCGGT T II ITnCCATCAGGAATTTCA G TlCTCT C T 
A A CCCAGCTTA CTGTGGGACA TA GG AAAACTCAGT AG AAA TACCTTTGGT 
GATCTTGTTG AGTTT AAGTCTG ATCTTG A TCTT AAACTCAGTAA GCCACT 
ATCTGCAATTTTGTACCTGCCCG 
> Sequence 675

GTTGATGTAGAGTTGAGCTCACCGCGGTGGCGGCCGAGGTACGCGGGGCT 
GTAGTT 

TCTCGTCAGOCTOCCGC 
> Sequence 676
1 1 1 1 IGTGI ITGAT 
G ACCACGCCATCGTCCAGT ATGAG 
AGTCGACATGAAGGTAACGCATGTTGTCACTGC 
> Sequence 677

TGAGTCAGCTCACCGGGT«KXKKXX3CCCGGGCA 

CTCTAGTATGTAQ OGTO1 1 ICI I A GOTAAA UllIll 1 I 1 1UCTACTGAAA 
GGGAAA1t3GTCTCTAAACACTOGTCACTGTAGCAGGTAAACACTACTCTA 
ACGTGGAGAAATCAOCTTCATGCTGAGGTAGTGG TT O CCTTAAAOCTGTT 
TTTTATOCTGTAAAAAOCAAAATGG G ITTCCI I LLC IG ATAGGTTTAAAT 
TAAAATT TGCCTTAT AOI 1 1 ICI 1 1 roOCICCGOO0CAGATAC»O0GOGA 
GClTTTC a-l 1 1 1 1 ICLXjGGTTTTAAAAAAGGOI Mill lAAAACGGGGO 
CCOCGGGTCCOCACC1 1 1 1 1 1 1 1GGATTTTCGGCOGGGCCOTTTTAAAAA 
TAO<3OQATO0000000CGGflAGGAGATTTCATTTAATAAA 1 1 1 1 1 ICCCC 



TTTAGAAGAATA 
TTC 

> Sequence 678



ATATTATCCGGOGA AJ 11 1 II I
gagagtgagtgagctccccgcogtggcggccgaggtacttgtggcagacg

TCGATGATCGAGTTCAAGGCTGTCTCCAGCTCGGGCAACATGA 

> Sequence 679 

TCAGTAGTCGAGCTCACCGGGTCGCGGCCGCCCGGGCAGGTACrGGTGTT 

OTGATCGCAACOTGTXX^TCCCCICI ICI CATCACTOCTOCTOCAACTPO 

ATTTATTACTOCGGGAATGGTCTCAGGGGGAAAACCAAT G T G TI'T A OCOT 

GCCTGCCCACXTG«KXrrGAGCACAACTATCCTGC^ 

TCCTGC^CAGGAAAC^ACCTTCCCCTCCCAATTGATCGTTCAAACACTGC 

(lACCGC TGAC TGGCCTGCATCrGTGGGTCTOTAGAACAGAAAGGCAGAAC 

AACTT A TTTTTT A GGA TTT AACG A CAACCGCTTG A AAAA A A CGGT AGG GT 

GTTJATGCTCACAGAGAATAAAGATTTGTAGAAAAGGTGCTGAACTGCCAA 

GGAAGGCATTTCTTGTGCCXrrGTCTGGAACCGTGTATCCTTA 

GGAGAAAGACACCTCAATTTGGCATGCAGTCTACTTCCN 

> Sequence 680

TG AGATG TGATTGAG<TrCCXXXXX>GTGC<rGGCCGCGTACAA GGGG AGGTA 
AGATGGGAGCrCCACTCCTTGGAOCACCAGCTGGTTCTGGACCGTATCXX 
CATGAA I C 1 G 1 1 1 G AA CGTAAGG AGG A A GTCAAA AAAGTTCTT A TTT AGG 
Gl I ICI I I OAGATGTGGGG<XACTTXXATTCCCA.CCCGG<^<!AGGTAGGC 
ACGGGCAT A CA CCG ACACTA GTGGGTCTOCGA TCCCTCTG ATCATGCATG 
TCAACCGGGGCAGGCACTCTGAAATTCCCGTTTTGGAGAGGAATTTGTTA 
CATTTC AGG ATGGATXX^CTCCACCT AAAA TCTTGGAATGAGTTCCCT 
GG AGGCAATCTTG AAAAACCAATTTAGGCA I O 1 1 I CC I I GGCCGTGTCAT 
TTGCATTCTCTGGAGAAAAGTGATCrGGTAAGACGCTGCGGCTATCCACA 
C^CATCGAAAAGATGCGCTCGTACCTGCCCG 

> Sequence 681 

CTCCTCC A C A TGT A TTT AATTTA TTT A TTT A TA T A TTT AT AT ATAA TTTC 
TACT AAAAT TTTTATACTATATTGATCATATATAAAATATGTTTATATAT 
ATACTTTTTA TTCAT A A A AAG AAG TGT A GG A ACTCTCGGGTGGCGGCC G A 
GG T A CCCT AATGT AGT A G T AAA TTT AA GGCCTTJTCG A GG AAA TTTT AACA 
CTTCCA A CA GGTG ACT ATA TCAGG AAGG AG AAA A CCAAGTGCTTCCTGCT 
TCACCTTCTGCTGCTTTTGGGAC 1 1 1 1 1 A TGAGCTAGTTAGCTAAGGACA 
AG ACCCT G AACC CA 1 ' I'M ' I TCACTGGGAG AGGAAAAOCACCAGGCTTCTC 
AGCTTTGGCTTGGCAACTCTGGAGTTCCTATGGCTTC^ 

CKiACCCTGATAAGTGG<!XrrCAGGCAGGAGGAGATCGGGAG03GATGGGAG 

AGCTAGTCAGGAAGGTXJGAATAGGGACCATCCCCAAACACGTTGGCGTAT 

G A TG A TTTG AGG AACTGG ACGT A GTTCTGCATGCTGCGGTTGG AGCTTTC 

GGACnXJCTtXAGGCGATCTTTTJAGGTCTTGCAACCGGCT^^ 

GGTCCGCATTATAACGGGTTCGCCGCAGATGGTTAAATTC 

> Sequence 682

GATGGGATTGAGCTCACCG<XGTC<XXKKXXiAGGTACICT 
CXXXTTCTTATGGCCAACCGCTCGGCTTG<X 

ATGTTCTATTCGGCrGGGTCGAGGGTCIX^GGCAGAGTGCGCAGGCTCGA 
CGGCTT A TACTTTGGGAA CGACA 
> Sequence 683

CA TGCAGCGCAAGTAGG TCTaCAAGACOCTACTTCCCCT a TCATAGAAGA 

gcttatcacctttcatgatc^ 

actaacatctcagacgctcaggaaatagaaaoogtctgaacta i iccrsc' 
cggcattattctagtoctaaagc<xxtcccatccct 

AAACAAAAAGGGGAAAAGATCCCC I ICCI 1 ICA AAAAAAAAAATGTGGCC 
CCAAAG1 1111 IGGCCCCTCCGGGGCCCTCGGGCCTTTTATAAAAAAOOG 
GGACCCCCGCGGTGTXXiAGAGAATATTATAAAAGTT 



GAAAAAACCCCCGOOGAAAAAAAAAAGAAATAr i I I T TC 



7TTTCCCCCCCC 
TATGGGG
toctcagcatcjTtaatogaag cl i rcn I GA GCTTCACAAAGGTTQCATTG .
AACATTTGAOGAAOGOGAACAAGCTGCAACACCT I rO OAACCTTTOOOCT 
CACTDCATTOATAOClCni AT ICT OA ftiACAAACXiOCAATTTOCGTTCro 
CAOOTAOGAGOACA 1 1 1 I U O UC CGOGG CX I UI TOO G OTCTC C I 1 1ACCCA 
TGTTGACAGATOOGCX7TCCAO0OCACGCTATTCGACGGTATTCTTCCCTG 
CTCOCAO CI 1 1 roCTC ACAGTCOOOCACA 

> Sequence 685

GAAGTGTACAGGCTGGCGGCCG AGGTAC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 FT 1 
1 1 1 1 IIO AGATGGA O I I IC O CIC M Ol I O OOCAGGCTOOAGTGCAATAGA 
GCGATCOCAGCTCACTACAACCTOOGOCTCOCAGGTTCAAGCAATTCTOC 
TGCCTCAGCTTCCTGAGTAGCTCGGATTACAGGCATAAGCAACCATGCCC 
AGCTAATTTGGA 1 1 1 1 1 A GTAAGATGGG G 1 1 1 I I C CA 1 1 1 IOGCAGGCGG 
GTTTTGACCOCCCACCTAAGGGGGGCACCCOCTCrGGGTOOCAAAAAGGG 
GGGTTAATAAGGGGGGGGATTCAATTOCXXOCGGTAAAAAAAAAAGAACC . 
COOC T O C CPGGGOTGGTAAA 1 1 1 1 I A ATATI 



ATTATTAACACOOCAA 



OCTTTAAAAAAAAAAOCAT 
ACCCCAGCGGGOGGG 
> Sequence 686

GTaCGAOGATTGAGCTCAOCGCGGTCGCGGOCGCOOGGGCAOOTACI 1 1 1 

iiii i iinii ii iiiiii i iuoi iiiiiiiii i iuii i mii 

1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 H 1 1 1 AAAACAGGAA 1 1CI I C AGACTTTATT 

aaaaaaggccctaaggggoctcttattaaaaaaggtataaaaaooocata 
'aattoggggooocrgggcfcggcagggttgatatooctl aaaggggagga 
a gggggggga tggggggtg aa ccoggg actgggg aag agga ocagggggc 

GGAAAGGGGOCCAOGGG1 rCTCACGCAACAACGGGGGAAGGOCGGOGACA 
MINI rOOCAAAATrCTOGGGGAAGOGGGTOOOCT 

TCTTCATATCGACTCOOGCGGOGGGACGOGGGGCTTTACATtjGCAACAAG 



AGGTGTTOOGOCAGGCTAAA 

oocaattttgaaaaagaagaaaggcgtaaagaacttaagcgacttogggg 
tgaggatacatgcattxn-acctgatgtgaatgagagaatrgaacagttct 

>ScqBcncc MM 

gacgcgtgagctcaoogcggtggogggogaggtacactcggcaggggttt 



> Sequence 689

TOAOOGTOAO 

CACTOOATAOGTA G I TOC TT TOO IPC I CAAOOTGGCTCTA OC 1 0 ICCI IO • 

AOC ICtCOTOT CACICOCrroOTOATOOOTOCA CACATCAGGOCAATCAO 

GAACAATATACAGATGCTOOCACIXjATCACAGAGAGAATGTAGITCTTAG 

ATCGAGAOGTCATTACTTGCATGGCAAGATCAOAGAAGOCATCIGCTGGG ■ 

GCCAOCTAGAATGACACAAGGCAATGTOATTCnTrrGAGAGAGCACTOGOC 

TCaTTOOCAGTCKTrAGGTCTAACTTATCCCTXrrCAGTTOCT 

a igici mem lU OAOAOOGAGOOCAOOAOATAAOAAAAATCAACACA 

OAOCTACAACTCT 1 1 1 I IX lOO 1 CATAAAACTATACACAOGTCTACTGCA 

CAAAATTAGGAATACCAGAAGAGCCAAAGTCXTTGCAGTCAOOCACAATTT. 

CTCaCAGTQATCAOCACTAACAOCAGGAGC 




TGTTGACTGTOaCTCCAGOGOGGTGGCGGOOGAGI ITOATTTCTTOCAGT
CCTGAGCGATGOAGCOOGOGCOTGCC TGO I ta t
AGATGCTTQO C1 IOI 1 1 1 1 C AAGAGAAC C 1 1 1 1 1 C GATATTCA 1 1 G CTOC 
ATCGATTGGATCCAGTCCTTGTrCAGAAAATTGT 
> Sequence 691

GGaGACGGAGCTCAGCGGGGTGGCGGCCGAGGTACTaCAGOAAGAAACTA 
OAGGAAACGGGAATTTCATOCAT G TtXTGTCTATCTGCTGGCAACAGGTC 
AGAAOGGCCAGTATGTTAT I OX - I OCAGGCTGCCTAGGG rGCTCTOCTCA 
AACAGATCA CCTG AG C C 1 CI I GCATCTATGAAGTTATGACACAGCAACCA 
GTT A CTCA GAGTCTG A TGAGAAAAACAGA TTTT AGGTTTGGC AAA TG GGA 

ttactgtaatttacacato^uuvtgcaaactxx^gctctgam 
acgctggggagaacttgatgctaagccacaggtacctgcgog 

> Sequence 692

g agtgagctccaccgcggtggcgggcgagg TACACCAAATGGTG ACATCC 
tttcaccaatatagattacttcatacx^cattgtcaaggaaaggactaga 

AG AATTTTTTG ATGACCCAAAAAACTGGGGGC A AG A A A A AGTAAAATCTG 

G AGCAGCATGG ACCTGTCA gca act a a gg a acaaaagt aatgaag a ttta 
cacaaactttggtatgtcttactgaaagaaagaaacatgcttctaaccct 
agagcagoaggocaag<xkxiagagattgccaatgccaagtccagagcggt 
tagataaggtagtagattocatggatgcattagataaaggtrgtocagga 
aagagaao atgooctaago cl 1 c rr c agactgctcaagaaagagctagac 
ctgg tgcttgaag aagagacatttttcg aagaata attgggcacaggttc 

AAGCAGGGGGGTTATACCTGGGCAOCCTA A TAAAAG ATACAATTGGAAAC 
CGA TCI I ro C Cl IGCl I I ATGGGACCCA I I tCI IA AACTO 
> Sequence 693 

CTGTGG AGACTAAGTGTGAGAGGCTAOCGTCACAATCA1 TT1 GGGGGGGG 
CC GGAC TOCCCAG CI 1 1 1 1 C ATOOOCTTTAAAOGOOGOTGTAACCAGGCT 
CCCTTTCATGTGAATATCCTDGAOOCTOCCCTTGTATCACGCGCGGTTAA 
TCTATATTAGTGCC Xi l I 1 l A TTA CCl IC IUCCCI AGTTAAACCTAAOCC 
CO i"i iGGGGCOTAGGG I'CX* l'lTAAGGOC^'IOGGGOCGOCCGTGA'] 1 !"] AAA 
1 1 r T C CTCT CAAAGGOCTGTAATTTACOGI F G1 1 1 I LCCACCTG AAACCA 
CGGGTGCG 



TTAG CGTGACTOCAOCGCGGTGGCGGOOGGGTACAAAACAGGGGCTAGAG 
CTTOCTACTAI 
GACAACAGGATAT* 



> Sequence 695

gaggactaccgggggggcgccggcggtctgtataatggaggctgaccaga 
gcagtttagg A gattgtaaagggaggttttgtgaagttctaaaaggttct 

AGTTTGAA OG TCGGCCTTOTAGATTAAAACGAAGGTTACCT AAA TAGAAT 
CTAAGTGGCA7TTAAAACAGTAAAGTTGTAGAGAATAGTTTGAAAAAAAA 



CGGTDGAOACTX^AOGGGTGGOGGGOQOOCGAGGTACAGCAGGOTOCCTCA' 
GCAAGAGAGGACTGAGTOGA 1 1 TIIX'1 I AGOG ATA TTTATOAACCTT AAA 
GCAG<]AG^TTAAAGGGAATTTGOOOCATATTAACCACTTAGGTCATGATA 
AATOATTACATTTTTOGACATT1 rOGTGTCI 1 AATGTCAGCAAGGGTTGC 
AOQATAA GTITTG ACATGCATOCAtOGGAOACATGTAGAA ATTCT AGTTA 
CTTACAAG 1111 1 OGGGAAGAAGCCTGGAC<X>GATGCCAGCTTTAAATA 

acagggcaotctaattacttctaaattoctcacataogga o i 1 1 ro oc rc 

TGOATGG OC I OC 1 1 U ATOGTCACTA OGGTGA I C 1 1 1 IUCCCCTTTTATTC 

GCCCGGGAATAATGGGGGAAAAA I 1 1 1 liUGOOOOOOCATTTTAAAAAAA 
AAACCT TTTA TrTGTGGGGGAAAATGGOOOOOOOOCACOOOGGGGGGOGG 
CCCTATTTTTTA
AAAAAA1 1 1 1 1 1 1 fCTCTCCCOCGTAAAAAAAATA

TTTAGATGAGTCTCGGGTGGCGGCCGCOCGGCAGGACGOGGAGAGACAGC 

CTCACGCGCTTGA I 1 1 00C1 O AGTOOOCCTOOCICAOCTOOOCaGTOOOC 

ACGI rOGTAAGAAGGCAACAAGTPCI 1C1CC1CIACAGAAGGATTTTGCA 

AACAATTCGGCAAGTTOCAAATGATTCTGATCGCAAATAC^^ 

GGGCAAGAGAAGAATTCAGAAGAAACAAAOGTGGCAOOOAAGAGGATACA 

ATCCXXiATGATGATTACTCAAGGCAATATCCAGCTCATGGAGTTAGAAAA 

AACACTTGXTTTTAGCAAAATCTTAACTATAGCATTATTCTGAAGGATTTT 

CAAAGTCTCCA IQTG1CI 1 1 GCTGCATTTAGGATTAACAATGGACAAOOC 

CAATCOQCAAGCTTACTAATTAAAATAACOCTGGC 

> Sequence 698

CGGTTGATGTTATACCTCACCGOGGTGGGGGCOGAGGTACAGGGGOOGCT 
CGTAGGGATCOTG1 1 1U1 1 (X IXIAOGAOCCTACGGTAATOCAGOOGG AGC 
ITCTTl PCCGTAOCTCOGGACAATC1 rCTCTCCTT GC TCl rCATGTDGTG 
GAAGAGAGGGGCAGAGTCTTCCTCTGTCAGGCAGGATGGAGTGCAGCGGC 
GTGATCTCACCTCATTGCAACCTOCACCTCCTCGGTGCAAGGGATTCTCC 
TOCCTCAGCTTCOCAAGTAGCTGGGATT ACAGGCGTGCACCACT ACATCC 
AGAGACTGGGACTACAGGCATGGATTTTCAGGTTT ATAACATOGCAGAOT 
GAATTCTCKK^CACACTGAGTGATGCTT UICAATGGCCACTATCAGGAA 
TTTAAAACAAGATTTGGAATTATGACATCTCGACAAACCACATATGCAAA 
A CCTA CC I 1 1 1 G GTCCATTCT OCAGGG C I I CI I L IG CAG C I rCTGGl ICC 

CATCATATTACTTGGGNCOCGGG 



TITCTCACAACCTCATTTTWTATTAGiCAATT IC I rCTTCATOCATCTAGT 
tatttctataactacaattctacattgcaagtag I ICTO1 ATATATTCTG 
rCTTCTTCATA Tn 1 1 I A GTTWTCK N I I C C fCTTC It i ATTCACGT GG TGG 
> Sequence 701

AGGT AOGCGGWOOAGAGAGGAAAAGAACACAGATCroGCATOGTTCAOAT 

1 1 1 ICI III lAGGTCCAGGAGTAAGATATATCATACGAAAATGAAAATTA 

TAATGGCTrcTTGCATTOCTGGGAGOCACATTGTCAGCCOCACTTATCCC 

ACAGOGTCTCATGTCTGCAGCAATAGCAATGAGTTACTTCTTAATCTTAA 

TAATGGTCAACTTTTGCXUCTACAACTTCAGGGC^ 

TTOCAIX ITTCTCl GGAATTTTACAACAGCAGCAGCACGCTCAAATTOCA 

GGACTCTOOCA O 1 IC 1C1 1 1 ATCAGCTCTAOACCA G 1 1 rOCTOOACTOCT 

CCCAAATCAGATACOCTTAACAOQAGAGOCCAOI 1 1 fOOO C AAOOAOOOC 

AGGCAGGOCAAGTTGA tcccctacagcttcaaacaccg>*cct aaacacaa 
ccaggoodcagtcacgtgatgooctatgta rrcToen caaaatgoctca ' 

AGAGCAAGGACAGATGTTTCAATACTATCCAGGTTACATGGCOCTAOGCT 
GGGAACAAOOCTAGCAAACAGTITCAAGGGCAGCTAAACAAACAGGACAG 
CAACAGTTTTGAGGAGCAGATACCATTCTAGOCTCAAATTGGGATOCATT 
CCACAACTAOCAACAAOCTGCI 1 1 1 A TTAAGGAGAGGGGCACT 
> Sequence 702

TGTGATOACTAOOGGGTCGOGGCOGAGGTA CI 1 1 1 1 1 1 1 1 II I II II 1 1 1 
tttatgaattattta i men I C IU AGAAAAOGATGOOCCTCCACTTAG 
CAAGOCTOOfiCAGGAl G1U0 I 1 C JU CATCTGCOCACAOACGGCGTOGTTC 
TAGA

> Sequence 703

GTCAG CTJ GOCGG PGGCCCCGGGCAGGT^ 

TOCTC CAGATG TTGIOGTTCGAAAACT G IP G TCTT A CACA A GOCAGTTGC 
AAGGA OCI rGCTPCrGTCnCGTrOTCAGCAAGAAOCTOACACAOCTtnO 
CTTTGGOCAAGOAACTCITATTTOGGGATACAOOGOTGAAGI I rCTGTGI 
QAGGGCTTOAGTTACOCTGATTOTAAACTGCAOAOCI mOTCI IACAOCA 
ATOCAGCATAAOCAAGCTTGGCTGTAGATATCrCICAGACGGGCtXCAAG 
AAGOCTGCaGCCTCACAAAOCTGGA£TTGAGTATCAACCAGATAGCTCGT
GGATTGTCGATTCTCTGTCAGGCATTAGAGAATOCAAACTGTAAOCTAAA

acaoctacggttgaagaoctatgaaactaatttggaaatcaagaagctgt 
tggaggaagtgaaagaaaagaatcocaagctgact 



atocatgatctgcagttct 

JTACCTTCCA CAA GTACA CGGG 

TCTATTTGGCOGTGA CC rTOCTCl g gaoacoatoatatcccttcagcctg 

A GGGAATTGA TGTTOATGAAOCCGGAGGCATC AGTTCGCTCATAATCAOC 
CTGCACGTTCATGCTCACCAGCTOCTCATTGTT^ 




i i i i i i i racTGGi I AACAAATATTTTAATTCA 

TT A A A AT AA AA CTTA A AA A TTACA TGCTAGTCT A CA CAAG7TT A ACTTAC 
TTTAGTCACTTAGTGAATTGTGAATTCGCTCOCATTAGTGGTCAGGaGAA 

tgtatttggtotagaaaocaaataaatcaagctattatcgocttgtgagt 

ACC 



actgggaagtgcacttggacgaacaata 
aatataaatatttattaaaaaaatatattttaaaaaatagggaaaaaaaa 
aaaaataa aaaaa aaaaaa tt i oct i g g aaaaaactaaaagagaatca aa 
GA TTGC GG O 1 1 1 1 1 1 aatttttaaccaatttat agttatttttaattatt 

ACCTTT AACT T I I TT G TGA AATTTGA L ' f T IT T TlT'l G AATGTAG OCr rrT 

i rrn i agtgtgai i i tttatgtgttocaaatta gi mini iattgaa 

TAAAAATTTTTTTTrTOGAAAGTTTACl ICI 1 1 IC AAATG TTTTAT OCTT 
1 1 1 1 AAAA TT TCTAGAAA TTTT AAATATTT AA TGGTTTTAAAAGT ATTTT 
AAA TTTTTTT A T AAT A TTCAAAAA A T AAA TTTT AAAA T A A TA T A TTGAAT 
TTTT A 1 1 1 1 1 1 AAGCGOCATAATGTGCTAAGOGAOGA 1' 1 1 ICATTCATTG 
TTG CTTA ATTGAGAATA TGGGGG TACAATTGGTAGCTrATAAATATrTGG 

ttattttattgtatttca 1 1 1 1 1 1 uc i u l g a taataggtttatatt ctta 
caatgtttcaaaccgatttaaaaggttatatgttttaaatatgatgacta 

TGGGAG ACTATT AAGAGXjTAGATACGGTGGAGTAATTGGGTTTOGATTAA 
TTTAAIUI 1 1 1 1 OOATGTOATTATTTGAOGGTTATCTGATTTATAAAATT 
GGAAAACATGT G GTGAAAOCAATATGTTTAATAAOGCCA1T 1 TCG potto 
AGGAA 



GGTaCACA 1 1 1 1 fCTG AAA 1 G I OOOODGTGATTAACTTGTGAACAAATGA 
ACATGOCACATGTCAACAAACTGAACAAACATGGATTGTTAGTX3ACTTAG 
AGGTOCAOOGAGGGCTAOAOAGAOGCTAO C rOTCTTOOTCl GOCAATCTC 
CTGTGTOOCACACTGGCTACAAAAATACAAOCACTGGGTAGGTAGGGCTC 
ATCTAGAAOCAAAATTAGGAATAAOGATTGAGAAGAAAACTCAGCAAGGG 

tgatgaatgagtttcagctcattgctggagttagctcaagaatgaa 

ACACAGTGGATGAAOGAACAAGCTATTOCGGGGA CC 1 1 1 rC AAOOOCTOO 
GACCTCACATCDCTAA TAGCTAAAGAAGGATAAAATGGA tgtaagccagt 

> Sequence 1013

* GGTACCAOCTAACAAATTGGAGGAAATGAAAAGAGGAATCAAC^ 
T OGAGA A AAAATT TATTCrACTTCTAGAATTrCATTA^IljCAA G TG W CTT 
AGTTCTTOGTI I t^JTAGATGAAGTGAAATCAAAATTGGATATTTGGAACA 
TTAAATATGGGAG<AGAGAATCrcTGGAATTATTGXrrG 
AAATTTATTGAAOAAAAAOAATrOCTAOCTOGACTTGATA Cl I CM HCA 
AAAATGTGGAGAAATTTATAAGAATITGGCTX$GAGAATXrrCAGAATATTA 
ATAAACAGTATATGATGGTGAAATCTGA 1 G U I U I ATGTATAGAAAAAAT 
A T A T A TAA TGTG AAGTOCACTCTAjCAAAAA GTGCTGGCA TGTTGGGCT AC 
TTATGTGGAAAACCTTOGCTTACTA

AAGAAGAAATTTAAGAGGT AOCTOGOCCGGCGGGCGATCTAAAGGG 



A CI I INI 1 1 1 1 H II 1 1 1 1 U 1 1 1 1 1 H IO AOACAOOOTCTCOCTCTATC 
AOCTAAACTCGAGTOCACTGOTtM^ATCTtXXKnT^CTCCAAOCTTCACA 
OOOCAGGCrcAAGTXnCAATCCTCOCCCCTGAGTAGCTCGAAOCACACCT 
GCGCACCACTAAACCCAGCTGTTT AATACACCATTTTTAACCCAAAACAT 
TAAOAAAAATATACGAACACT AAGTAGATTACATTTTGTAAACAGACAAG 

cttacaa oi 1 1 1 l i ca aatatga aactcatactaaactgogaoactotta 
ac 1 i ci i g a tggggtt a a tctct aat a tg a agccaca gtca t a gct aact 
acaaattacatatacaatcocaaaaataticaaaaataacattttttgca 
ccttaatcattacaaatoctaaccagcataaagacactogaaagtttcag 

AATCTOCTCATCACATACTTTCAAATATCTTCCCTTTACTTCCATGAAAT 
TGAACGCCGGATTCTATCTAAGTG ATGA C 1 1 G I LA AGGTTCCAGGTGTAT 
CTTAACTTA A ACTA A A G A ATGCCCTA ACTT AGATGGGTTTTGAGGCTATA 
CAATTOGT ATTGGTTG ACCCTTA AGCTTTACCTCTCTT A ACATGG AGG AC 
GAAGAAAGCTGACCTTGGOCCCAACCACCTAAGGGCGAAATTCAACACAC 
TGGGGGGCGGTATTATTGATACCACCTGGGACCAACTTGGGGAACATGGA 
> Sequence 1087

A G ATGGGCTCTA GTA TCTGAA CA ACG AAATT AT ACTT A T AG ACT ACTTTC 
TTTTCACAG AACA AATG AG C ITTCT I GGCTTTTA ACA AA ATTATCATTG A 
AAACTACAAAATTAAGATCACCCATAATCCC 
> Sequence 1088

ACATC CI ) I roCATGCTCAACAGCCCATTCl 1 1 1 CA TCATTCGG AAGCAA 
CAGCCGCACTCCXXnGCCCAAGTTATCCCACrAGCTCATTGCTATATCAT 
TGCFGGAGTGATCTATCAGGGACCAGACTTGGGATCAGTTATAAACTCTA 
GAGTGGTAAGTGTCTTCACATTCTTTAAGCACrAAAGAAAAClTTTAATT 
AGCTAO C I rOCl 1 C CAOTAATCAAACTAGAGC rOCTCl UCCTTGTGTAAO 
TTGCTATAAAGT ATTGACTATTAGAATGTCTTG AACTTTGGTTACTGTG A 
GCCAA GTCGGTGCTCA AAGT ATA TTTC AT AGTCTCAA TT A T A T AGTAA TT 
TAGGTTCTG AAAAATAGO 1 1C 10 I C 1 1 1 G CATATGGAATA 1 1 1 lOlO AGT 
A TTTACT TrGOAAAGTTTOGTCOACCrrATGATAAATTAAGAGTTATTTT 
T C I I' I T! A CAACCTACTGG<rnXXiAAGGGAATTCAGTCAG C I 1 I 1 IG ATG 
AACGTTTCTAATACTGACGATATCA 1 I C C 1 1 C CAAAGGGG ATTTGGTGGC 
ACTTTCAAAAAACATTGAAGAGCAAGGGTAAGTTATAACATTOCTTACCC 
TTCTATAAACAAC 1 1 ' I ' I ' ITGAATCT IT 1 G AAG AAG ATAA ACCGTGATCTC 
GTATAGTATAAAGGGGAGTGGGAATGATGTCTATAATGGGGGTCACTCTT 
CTCAACGGTGAAAATGGATOAGGATCAACATGATAGATATCTGTCOCTGA 
TQATGGTGG ATACGATAO ATGACGATGGTGG ATGCTGGCCGGCG 



acatatccctatctactatgtaaagacaaaaaggcaaatgaaatcatgta 
atacaatgaactcctcagaaaataactctgt aaaatctcag actgcctgt 
tt atcatatgctag a g t aaactt a cat7cctttcttgtt ag ag aaaaatg 
atggtaaaatccatgcattaatcaaaactaaaaacatgaaaaggcaagcc 
aactacaagag aaat acagttogocctto aacaacacag atttgaactac 

ATGAGTCCGTGTACC 



ACCGT GCAGAAGAAGCTAOCAAACAGCAAATATGGAAATAGTCAGTTTTT 
- 1 1 1 1 1 H I A AAGCCTCAGTAGAAGAGTGCAOAGTTACA C I O ICCTO 1 1 TO 
OGGTGCCCCCCTOXCCTrrCOACCTAAOTOCTGAC^ 
CTCAGTTCXTGCCACTGTX^TTTATATGCTTCCACAATGAC^ 
TCFGTTTTTTCATACCCATCACATAAGTGTTGGAGCTATGTATACGATCT 
TAGTGGCTTATAO 1 1 Hi I'l l I I I ATAAATTAGAACAC 1 1 TTTGTTACAT 
ATTAAAAAAT 
> Sequence 1091



TAAAAGGGTCCTT agtgcctcttactc 

C7GTGTAAAATGACAAAAGTGCACAATTGA7CATT 



acgcgggatctaaagttc<kknt3ggaaggaaggagaaaaggggattgatt 
ttagtggaagaacaagaatc^ttctgaaattgattgtgatggctgtataat 

CCTGTGAATATACTAAAACATTGAGTTXnTXACTTTACATCAGT 
TGTCGTATGTGAATTTATATCTCAATAAAGCrATTTTTAAAACGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAGTCCC 
> Sequence 1093 

GGTACCA GGTCACCTGTATtnTG ATCACCAGAGAGCACACCAGGCTGG AC 
AGCAGCACCATACGCTACAGCrTCATCTGOGTTTATOCCACOGGATGGTT 
CCTTGCC ATTG A AG AACTCTTTAACCAGTTOCTG A ATCTTTGG A ATTCGA 



TTC ACCAGTCA A AGCG A ACTACTA TACTCAA TTG ATCC AATAACTTG ACC 
AACGGAACAAGTTACCCTAGGGATAACAGCGCAATCCTATTCTAGAGTCC 

c 

> Sequence 1094 

ACATGCCAAAGACTTCGCCATAACrrTTCAAGTTAATTACACCTCCTACT 
GTTrcACTTAGTGGCACTTTXTCTTAACCTGTTATACACAGAAGGGOTTGA 
GAAGACAAAACACrGTTAACTTCATTATACCTTTGACAAAGTAATATTAT 
GTGACATCATGTGTrTrCCCCAAAATATTAGAGCTGCAGATTTAGCTGAT 

tcaatttatggg ac aat it if it atgtg atct aacaatttggcatataatc 
tagaaagcagctttatgatcaaaaattgattttatatatatacatataaa 

TC 

> Sequence 1095 

tttttattatataaaaatgtaaaaatccagcaaaaccagaaatacgaata 

TA I 1 1 1 1C TGGO CI I ILA CAI I lU l lG AI I M I A TTCOCOAI CI ITTTC A 
ATACAATTTACACCCTCATCCCCATTICCAGTCTGATTATACAAGTOCTA 
AGTGGCAGAAAGGTCTGGAATAAATACATCAAAAAGAAGAGGCAAAGCTG 
TG AAACTAAGTTGCATGCAACAGGTTCTATG AGGG TGGGGG AAGTGTCTG 



> Sequence 1096

ACAATCTGATACAAAATCIGAAAGAAAGAACAGTCTTGTAATCTTTACAT 
ACTTGTAAAAGCATTITJTCAAATTTCAGCTTACITTCAAAATAAAGT^ 
TACTOTCTAATATG C 1 CTCTTl AAATTTATTAAGTATTTTAAAAATACCC 
TGGCTCTTTA TCT AGTTTCAA TCT AAGT A TA G AAAAGCATTCTCTGTAAG 
GCTG TCTTAAAA AAAA G AAAAA AA A A A A AAA A AAAGT ACC 
> Sequence 1097

A CA TCTGCAG ACATACTG AGTGTCACOGTTG AAG A GAGTXKJ AGTGGCTTT 



ACTCAGCT> 

CGCCCATCTTTATCACCAGAATGAGOAACTCCTGGAAGTTAACTXK1ACCA 
TCAGTGTTGATATCCAA CIC 1 1 1 OAACCAGACGTCTGCACCC 1 1 1 1 IUCI 
GATATACTGAGGACACTCGGTCTCTAGCAATI ICI ILAGGTCATCCC 
> Sequence 1098

ACT AOCA TTCCA T ACAA TGGAAT A TTACCCGATG AAAAAATAAAGTTG AA 
CACATGCTACAACATOCATOAA C ' l 1 1 OC 1 1 A TAAGAACATTGAAAAQAAA 
ATC<XAAAAGAAAAATGAGTTTTAGCTCAAAT^'.-nTAAGAGC<XTAGC 
CTGCTCAAGATATCCTGTTi 



GGTAC C 1 ' 1 1 0 CA CTI TT1 CT AAGGOCTCTTAOTO Cl ' I ' I T AACTAOAAAGGG 



TCCCA 

T ATCT AAGGTG AAAATAAAAAACA TTTTTAAAAG TTT AA TATAAAG AATG 
AAATAATITCAGGTCAAGTTTATTATACAGAAATTATATTAATGGGTGGG 
> Sequence 1099

ACGCGGGGGA GGTCTCCA TTCAGT AGGTGGCCCGGG A TC AAGGCCGTGTT 
GGGGCTAAACCACACTCTGGAATICTGTCAGCAAATTCCTCGCTGTGTGA
ACITGAGCAAGCCATTCAGCTTTCTTAAGOCA 1 1 1 1 CI 1 GATATTTCACA
GAGCCTCACCAAGTATTCAACGAGAACATGTAAGTGAAATGCTTCACAAA 



AG TCI CI GCCCTTTAATGTACC 
> Sequence 1100

ATAATTAAAATAAATATGATATAAAAATAAAATGTAAAAAAAAAAAAATT 
TTATAAATAAAAATCCAAAACATTCAACTrcATTACTTCATrrTCATAAA 
1 1 1 1 I I IC 1 1 AGTAAATCATTTATTTATAGTA ATTAACGAACCCCATNTA 
GAAGKGGCCGAATTTGGGGGCCCOCrrcTKAAAATAGCCATNGCCTTNCG 
ATGGAATCTGCANAAT1 



AAAACCACGC-GCCCG 

> Sequence 1101 

GGTACTTGTTGGCTAGGAGCTtjAGCTTATCACAACAAACAACAGCATTAC 
A GG AA TTGTCTT A T A TG TGGTCAGTTGTAA A GCTG A T A A AAATT ATTCTG 
TAAATCTTGAAAACCTA AAAA ATTTACGCAAGAAAuAGACATCACTTGTCT 
ACTGTAACATCCAAAGGCTT iGCCCAGTATGAGCTCI Tl AAGTCCTCTGC 
CTTGGATGATACAATCACAG<^TCACAAACTGCGATCGCTITGGATATTT 
CCTGGAGTCCTGTGGATXMCIATTCTTCAAATCXXrrCCAC I CI C I IC AACT 
GCAACTCTOAATATTAAAGTCGAATCAGGAGAGCCCAOAG G TCCT TT G AA 
TCATCTCTACAGAGAACTGAAA I I IC1 ICI ICil 1 1 IG GCTCATGCTTTGA 
GGACTGGTGTCAC1GAATGGCTCGAGCCCCIGGAAGCAAAATCTTGCTGT 
TGAACI'llil ICAGGAA I 1 1 C IGAATGACTTAAATAAGCTGGATGGATTTG 

gtgattctacaaaaaaagacactgag ottgaga ccttoaagcatgacact 

GCTTGCAATCCATCOTTCCOTCAAGCG I u 1 1 I icAAAGnCGGGGTGATC 
TTGATATTTJCTGAACAACTGTGGTGCAAAATGAGCAGTAGTGTGAATTCA 
TACCAAGACTTGGTGAAGTGGTTCACATTGATCATTCAAATCTACAACGT 
GGGGMATTCAAGCCTGGCTTCATAGGGGAAGTACAATTAACTAGTAAGCT 
CA T TC TT A AGCTATTT 

> Sequence 1102
OGTACOCOOOATTOCCCCAT OT TITCTTC1 A G AA G TrrT A CAGTTTTACG 
ATCT ACA J 1 1 1 1 O GTCTATG ACCC A 1 1 1 I U GTTAAA I 1 1 PGTC IAAGCTA 
TGTTATACATGTGOAAGTTCA 1 1 1 1 ITIO CATOTAAATATCCAA1T 0 1 TT 
CAACACCATTOGTl OAAAAGACGGTAT O 1 ICICCI I IG AATGCI ICTGCO 
CCTCAATTAAAATCAGTTTACTCTATCTCCATAAGTCTACTT^^ 
TCTACTCTCTTTC A TTG ATCTGT ATGTXTTG TCCATTHTCCAAT A CCACTG 
ICI I I A TT A CTGTAGTTTCATAGT AAACCTTG AAATCA TAA TTCT AT AG T 
AA GTCT AAAAA AA TCACA CA GGTTGG A AATGCA CAATT A GT ATGCT AAAT 
CAG AGCAATC1 T GT lli'l 11 CAAATGGTTTATGGGAGAAATATTAGCCAGT 
GGCTTAACATGCTCATTCATGATAACCTGAGCTTATCTGAAA^ 
TTTAAGCTTTGCCGCTATITCAAvAACCACroGCTOOGCTTGGT^ 
AATATQACQACCATTTAOOAAATOOCATTCATGAA I C IC C M C ATCTGAA 
TATTTGGAAATTGGCCAACTTGCGGTCTAACTTGGAATTAAGTXnTTrAA 
TGGAAGAGCAGTAAGOCGGT ATTOTOAAAAAAACGCATTTTATTTGCCTA 
TGTGATTTAAAAATGATCTTAGA 1 1 i 1 1 CCATTAAATGATATATGGCTAT 
GAAAATAA TTTC ATT 

> Sequence 1103

OGTAC1 1 POI rAOCGTCroCCTGTOTATXXIAAAGTroACAAAAAATCGCA 

TG AAAA GATC ATGATT OGA 1 1 1 ICI 1 1 lA AACCT O OCC'l 1C1 G 1 A AAA AA 

TAG1 1 lATATAl 1 1 1 1 AAATTAGTAGGTATGTGTGGC'I ICCT1 11T1UCT 

AACATTCCCAGCAAA 1 1 1 1 1 OC 1 U CTAAGACTATCACTGTTAAAGTGAAA 

ATTA CAGGGA AAAATGTG A TGAAT ATACXXrTAACTC^ 

ili lAAAATCAcici 1 1 lATGCri IAGGAACTGG11GGTCTCCACI 1 1 OA 

TTATTAOTGTAAACAOCC TOAGTATACOTGOA 

TCCCCACT ATCTTAACACGTAATTCTGATTTATG CC 1 1 1 AGTTTGACTTA 



TTTTTACCAAA ATAT TACAAGTTATGCTTTAAAATGTTAAATOTGGACTO 
AAA 1 1 1ICAICU I IC 1 1 1 G AGAATCTATGAAGGGTATTATATACGTGGC 
CTAAAGGCAGGTGTGG A I ' 1 1 1 G i l' ATTCTG AAATTOGTTTGCATCTGGAC 
AAAT A CT AAAT A TCCCGTGGCCC 1 1 1 1 1 1 1 1 1 1 ATTTTTAAACCOGGGTA 
TCCTTCCATCCTTTGGCCATTCCTAGTAGK3CAAAAAATTTGTAAGCO^AC 
TTCATTTTCCAATTACAG ACTG AAAAATTTCGCXC GTr I I I AAG AAGTTT 
TAATTTT 
> Sequence 1104

CATTAAGAGAGAGACTATACATTCACAGAGGTTGGGAGCTTCTGTCTAGC 

CTGTTGTCCAAAACTGCTTATAAAATTrAGCAACTAATTATCACTTTTGA 

CAACTATTTTAATTCTAGAAAATAGGTTTATAAAGAT7TTCTTAAAGTGT 

TATCTATCCTTCCAATGACTTATTATAAATTTTAGAATGTATTTCTATAG 

GGTGGAAAAATCTCCrTTAGTCAGAATTGAACAGTTTTCATTjAAGAACAT 

GTTAC ACCATGTAG A A ACA TGGGTACC 

> Sequence 1105

GGNA CI 1 1 I 1 1 1 HI 1 1 1 1 1 A l 1 1 1 I J 1 1 1 I 1 1 1 A TGGCAATATTTATAT 
TTATT r TTGCAATTCCTTGG A TAAAAACCATTTG AA CAATGTTTGG T AAG 
GTGTTATTCTCATAAAAA C 1 IC I 1 1 CAAAATGAAGC L I ITTCTATTTTO C 
ACAAAAGTTAAAATTACATGCTGAAACAAAGTrAATGACTTAGGTAACAC 
A ATACAATGTCAGA A A ACTGG T AT AGT A AG ACCAGCTAATG AACTACGTT 
TTTT AACT AAA CA A TT A CACAG AA CT A G AAAACTTG ACTTAGTA GTA AAA 
TATA TAAA TCT ACAA TTTT AAA TAT AAA TTTGGTCCATGGAGACACTAGG 
GCAGTATTTACAAATAGTAACAAG ATTATTTCTAACTTTAGGGGGA ATAA 
CTGCACTTGTAAAAATGGATGAAATATTATATCGAGAGAGTGTTITATTT 
T A CCG<XX7TTCTGTTTTT AAA A GTT AT A TTTTTT A TGCA A A ACCTG AT A T 
A TTGG CTACAAATACAGCTTATAAAATTATTACCACTACATAACCA AACA 
TrniClGGIGTllUGGAAA TICC TT A ACCTTCTCATATAATOTOAAATT 

tgttttgttt aaatt aca ttacagcxl\aaaaaaataccctcacattccca 
tatttttagaccatacttgtaaatattataatcctgttttaatttcacca 

CATCATTTAGTTATATTTCACA G 1 1 1 1 1 1 UC AAAG AGGGCACAACATTGC 
tca tgat attaatacttgcggcgcg A ATCCTATG A TTAG AG AGGTTACTT 
GTCAGTGATGTAAAATAACT 
> Sequence 1106

CCCTTAGCOTGGTCCTTTGTTC AGGT ACAAACCTGCATGGTGTTTATGCA 
CA CACvAGA TTTG AG AACCATTG TTCTG AATGCTGCTTCCA TTTG acaaa g 
TGCXiTGATAATrTTTGAAAAGAGAAGCAAACAATGGCG I I ICI 1 1 I A TCT 
TC AGCTTATAATG AAA ICTGTTTGT1 G ACTTATTAGGACTTI G AATTATT 
TCTTT A TT AACCCTCTGAG 1 1 I 1 1 O OATGTATTATTATTAAAGAAAAATG 
CAATCACGATTTTAAACATC^AAAATCAAATTTTGGATAACTTTAGATGA 
CTTCAGTGAAA 1 1 1 1 CA CGTAGTCTGAGTAATAGAAT O 1 1 1 1G CCACTTA 
GAATAGCATTIGGCACTTAGTAATTTAAAAAATAATTGGCGGAGAATTTA 
ATGGCAGTTTTGGTCACTTGTTATCT AATG ACAAATTAT AAA GCtTTT AAA 
AGGXXrTTGGACCACATTTATlTTGAACATAGGTTTGCACACATrTAAGAG 
ATACATGTAGCCAAAATGACTTTATACCACCG ATGii 111 Tl 1 GGAATTTG 
TAAAAATAATACATTTATATGTGTAAATTCATTTAGAAATACGCCACATC 
TTTTGTTrCGCC<riTCACACXjACTCXjCTTC 

GGCCGCCCGCTTTAAAGGGGTAAATACT ATGCACATGGGGAGA<X>GTTCT 
TGGAGATCCCG O IC I 1 1 UCU I CA TACTJCGGTJGCACAAGTAGGGCGATACO 
G C^TT CATAGGGCAAAATGATACrCGTCTCCACGCTCCCCACCCTTACCT 
CCTGT 
> Sequence 1107

CCC1 1 1 CX^C<XKnCGTCCGGGCATGTCTGTGAGATXrrTACCACTAGTAT 

ttggaaaaagaataaaaatctggcogggcgtggttjacacatgoctgtaat 



GAGGTTGCAGTGAGCCAAGATTGCAGCATroCACTCCAGCCTGGG^AACA
iTAcrrr

ATTAGTTXTTAATTTTACACAGTTAA 
CACTGAAAAATX1AATGATA7TTAATCATTGTCACTTACTGAGAAGCAAOA 
ACAATGAOTOAGCOCAAAGGAGTCTACTACCATACCTaTTAAOTGTaGGG 
AAOOCTTTAAOT ATTTTTTACATACI T I ' I CT I C I 0 1 CA TTCOAAAAACAC 
COOCOCATCTOAAATOGACAGAAGAAAATTTTOOCACG lull 1 1 A CTCTC 
ATCAGAACAGCTTGGGGGCAGTGACCTTCACACTGTTAGCTTGCC^ 
ACTGC1 1 GAAOGOCAOCAOTTAACAGCTGCTAAAGGOA O PCTC1 1 I A AAA 
ATACAATTtn^CGAGATCOCACTTOCAAAAGGTATXXjAOCAATCCTTTTT 
TDCAACAGCAATOAATCXrrCGGGCTGAAAAOCAAACITTACAGOOOCTXjO 
CTTAOGOCTOGOOAOQO<MOAAOCAOAGCATCATTATTCCGAOTTTOGAA 

> Sequence 1108

CCCTTTCGAGCG GI [CCTI 1 G GCATOTATAATG AAA ItSTCI 1 1 1 AAAAAA 
AGTTTGTTOTAATTCTCTATtJTAATTXTTOACAGTAATTCAAAACACAAAA 
TCACACAI 1 1 1 HJL I A ACTTOOCATGTTCTGGATCTGGGOACTGC AATAT 
TACAO AAA TATCCAAAAATAAOTTTACTGCTCAG ACATAAATAATTTTTC 
TTATTTCAATGCATCAATCCGCAAAAATTTCAATTCAAAAAAGCCAACCA 
CTCCTATATCCAAATAAATAAAACATTTXjACAACACTTTTATAATCAAAC 
CCAACATTATACAAAAAATGTCTGGCAOGTGCACATACATGTGCATATCT 
CTATOCAATGCCTATTTTAOAAAAAAGG fOT C Tl g atgaaaato attttc 
aaaatactcactgacacacattatatacaaaaccttttatataaaaaatt 
aaactattttcaatgaaattccatcttcacactctattcixmcaatkk^ 
aactgaatcataaataggtctactaaccaaatcatggttaaggcagtatt 
ttttacaacog1 ntcttl i atatcacatatctcact aaa tact actoca 

GTCA AA T ATACA GA TOCAA TCTG ACATGCCTT A TOCTTATT ACCTG AA CT 
TTTTPGACTGTATAAAGGGAGAAATACATTTTTTACAAAAAAATTTT 
T A T AAAAA T A CTGOT A TCATTTTCCAAAAAAOCA TT ACTT AAACATTAAA 
GGTT AAACGTCATCTTATGA TGTAAAACG 

> Sequence 1109

GGTACATITTCGG0CTTTAAT00CATCTAAACAATTTGCTGTTAAOGAAA 
CTCAAAAACA GAAATACCTA T A TTTTCTCGCT AAA TCCAA TTGTTACCTA 
TG ATGAGTAAAG ACACT AG A TCTGCAGGTCCT AGTACAA TCTAT ACAT AA 

aaoaaaaaaaaaccttctacacati iccncrn I A TCTOCAATATGAGA 

AGOAATCCTTTCTAACTCTAATAACATATTAACAAGAATTAAGAACAQSA 
TTCTCGGXX«AACTCAGATGTTGGCAAAGCTfANAAATAAAAAAACAA<KjG 
CTOGGTGCAGTGGCTCAGGOCT ATAATOCCAACACI I rOTO AOOOCOAOO 
CA GGA OG A TTGCTTAAGCCCAGG AGTTTOGGATCAGACTGG AGAACAAA G 
TGAGVUXXXrrATCCCTATCTCTOCA^ 

AG TGGTGTGTGCCTGTAGCCCCAG<n'ACTTAGGAGGCTAAAATGGGAGGA 
TGCCTTGAGTCCAAGAATTTGAGAATGGCGTGAGCT ATTMTCAAACTTCA 
ATTtlAGOCCGGGGTGAACGAAGCCAGGGGTTTTrAAAAAAAAAAAAAAGG 
OAAAAANWAAAAAAOGOGOAOOTTDOOC1 I O CGOOCOOOOOOQOOOOOOO 

GcccGOG onnii i 



ACIOOol ACAGCCGTGAGOCACCGCA 
ATTCCACCAGATACCCTAAATC^ 

TCTCTAGA GCAO GOGCAQAATOCTCCCA O I C I C I 1 1 GCTAAAGCATAGCA 
AAAATCACC'I I IGCTGCTCCAGI ICCCAATAACJ riOCTCAICICrGTTGG 
AGAOCACCTCAACCTOGACTTCATTGTOCATATCAAGATPOGCA 1 1 1 IGG 
TCAAAGCCATTCAGCAAGTCTCTAGGAAGTTGCAAACTTTCC^ 
C tJIGICI I LIlLILO ^CCCTraU^CTATTTCA^ 

AGXrGCAAAGTTACTOGCACATT>nX^GGTATirrTTACAGCAGCAAC03G 



a ii in i i ce 




Table 2 

CTCTACCGGT 
>Sequence 1111

GGTALI 1 1 M ! ATGI 1 I IAA TTI T 1 IGT AGAGAATGGCTCTTGCTATGTT 
GCCCAGGCTGOTC1 1 GAACTCCTCGACTCAGCTGAAGTGATCTGGCCACC 
TCAGCCTCOCAAAGTGCTAGAATrACAGGCGTCAG<XACCACTOOCAGCC 
TGT AGCCT A TTTTTATAAA TG AACTTTT ATTGGAACATAGCCATGOCTGG 
Tt^TTTACATACGTCTATGG<rrTCGTATGCAATATAGCAACAGAATATAT 
TAAACATTTACTACCTGGCCCTTTGCAGAAAATGTTTGAC^ 
GTATAAAGATAAAATCTCCCAAAAAATOCTGATATTACCCCACATGGAGA 
AA CACTG AAiXCCTCTTCAG AAA TCAGA TGOCAA TTTAAA TATTACTATC 
AGACAAATACACTCTGATl 1 1 1 1 1 1 rCCTATTOOCTI III I 1 1 ATTTTCT 
TTTTTGAG ACAAGGTCTTCCTCCG TT GCCC A AGCTGGAATATG ATGGTGC 
CATCATAGCTCACTATAACCTCCGAATCCTGGGCTCAAGTGA 1 cc rcTTG 
CXrrCAACCTNCTGAGTAGCTTGGACT atgggogtgtgccgccgaccctgg 
CTAATTTTTGGGA [ 1 ITt AAA AAAAAAAACGGGOGGTTTTCCCCACCGTT 
TTGGGTCCAAAAA CTTGTXXSTOCTTTO GAAAAA OCTTCTTI IGTGAAACC 
CLI I IOCGGTGGGAAATAOCCTrCGGGGG<XXXXX^AACCC<X"1 1 1 1 1 1 1 
>Sequence 1112

COGCOGCTCG AAAGOCCTATACTTAG CU 1 ' 1 II I A ACTT ATATWTCTGTGC 
TTTWNNNWICTNNKGGGWAAGTGGGCGGAATGAGGAGTG^ 
TACGCGCATGTCrroTTCTCAAl rCCCCI I AOGGCCCCGGCAGACCTTGGC 
TTG A CTG TGGTCT A NAGCA CAA GAATA TGCT AGGCTGCA CTCTGCT AATC 
AGA TGTGTG A A TGCTCCTGTGGNG TGT A TTG AATGGG AA GCTTTTGCCCG 
GNGAACCAAAGCTCTCATGGATGA TG I GGTGA AAGCCACTTCTAGGGGCT 
GATCACCATCATAGGTGGTGGAGAGACTGCCA 
> Sequence 1113 

OOTA CV T r 1 1 ICn 1 1 TTTC1 1 I I I m*TTTT I G AGACaGAGTCT C TCTC 
TGTCACTCAGGCTGGAGTGCAGTGGCATGATCTCAGCTCACTGCA 
CACCTCCTGGGTTCAAGCAATTC 1 CC I OOC I CAGCCTOCTGAGTAGCTGC 
GATTACAGGCAGGCACX1ACCACACCCGGCTAATTTTGTATTTTTAGTAGA 
AACGGG OI I IL I L CAIGn GC TCA GICIOOl I > IG AAC I OULAGCGTCAG 

cacogcocccagocac 1 i c 1 g i atttttaaaaaagtggtaagatttgagt 
attatactgggatagaactgaagttgggggcttaatttcatctatcagct 
ta ttoaa aacaaggaocttttaagaaato g 1 1 iip 1 1 aggttggaaaagt 
gagttttaattcgtcatttaattagccaggatgttgal ll hit 1 ggtga 

AATGTAOCTGOCCCOGCGGCCGGTCGAAAGGG 
>Sequence 1114

GGT ACCACATTG ACCCAA GG ACCTCT A GG' I G I G 1 1 1 GGTGA GGCAGGTCT 
TTGTCAATTTAAGTAATCCTGTCAGATOGTCTAOC^ 
GACAAAG CAC I G 1 1 UC I G AGATACTCTGATTTA 1 1 1 IU-1 1 A ATGGGCAO 
TTTTTTTATATATATACGTTCCA 1 1 1 1 CAOACAOO fO Q TO CI 1 1 GAGTTG 
AATTTGCA AGTTCAGTOAA ACATGG AT CTCTT TTTTATT^ 
ICTOCTAAOGTGCTT AATTTCCATGCTTGAGATCGT 

>Sequence 1115

TGTACAOAAOOCTTTCACCATCTTCACCACAC 1 OG 1CI CAAACTOCTGGT 
CICAAGTGATCCATCTGCCTCAGCXrrcCCAAAG 

TGAGCCAOCGCAUJL I G I LIXATCACTTTATATTTTCAAGAAGGTOGTGA 



ATCTCTQGTAATGTC1 1'IA 
>Sequence 1116

TCTAOC^TCCCATGGACACAAGTTTCCAGGCAGCAGCXnXX 
GTTAGAGATGTCCCATtlACTTATGCCCCTACA L'I Gill ACATCTGQACTC 
TGGATTGCAAGTGTAAGGAAGAAAGTGAAAATGAAAGAGAAAGTOGAACA 
AATATTGGCAACAGAGCCCCCAGAGGACAG 1 1 G I OCCTTTTOCAACAAOT 
TAAGTCGAAAATGCTOTTGOCATGGOAGT 



rcr/vsoonxiii 
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CIII7 

AAAAAAACAAATATTTTTTAAAGGGTOAAAAAAAAAAAAAGAAGGGGGGG 
OAAA TCTAAACTTGGGAAAAAAGGGGG CCITAA AAAAAAAAAAAAAAAAA 
TTTT AAAAA CAAAAAAAAAAAGAGGGCCTTTTTAAAAAAAAAAAAAAAAA 
GCCCOCOGGGGGCGGGOCAAAAAAOOOCl 1 1 1 1 1 AAAAA GGGTTT AAAAA 
AACACCOCCC I COCCAAATTTAAAAAG 
AAAGGGGGT 



TTGGGG<XiAAAAAAAAAAAACOGGGTTAAAAAAAAAAAGG<XWAAAKTTC 
CAAAAGTAAAGAGGGGAAAAAAAAGG G I G 1 1 1 I I G GGGG AAAAAAAAAAA 
AAGAGGCOCCOCAAAAAATTPGTAAAAACAAAAAGGGCAACTTCAAGGGG 
GTGAAAAAAA AAAAAAA AAAATTCOCCOOCCAAAAAAAAAAAAAAAAAAG 
OGGGGOGAAATT1 1 11 ICI 1 ATTTGGAAGAAAGA AAAAAAAAAAGGGOGG 
GAG1 1 1 1 1 1 lAAAAAAAAAAAAAAATTGGGGGGGGGGGG - 



>Sequence 1118

TGT A C 1 1 1 1 1 HI 1 1 1 » 1 1 1 1 1 1 1 n 1 1 11 lA AAGAAAAAGTTCGCCCAG 
CCCCAGGGAATAAATTTTGACTGCTCTA A ACAACCACA GACCAAGGGCCA 
AATCTCGCCCTCTGACITJTATAAATTAAGTTTTACTGGAATAAAACCAGG 
TCCATTGATTTATCCATTGTCTACATACOCTITTTAG 

ACTOTGTCACTAC AAAAGAGO TTATCT AG AC^AA AAGCCTAA AATAT TAC 

rGCCTCTTTATGGAAAAAGI 1 1GCCATTCCC1 AGTCTAAGGI 1 1 AG 
A TTCTG AGCTTA TCATGTTATOCTAOCCCCGCCCGOGT 
>Sequence 1119

ACAATATGGAAAGGTAAG ATCCATACOCAAAGTTAGGTAA G I U 1 1 IGA OT 
TGTCCCATGTAAATAGTTTAAACACTTGTAGAAGTATTAGAAGAGATCCT 
TAGGGAATGATGCAAGTGGCATTTGAGCTATTCATTTAGAGAANAGTTTA 
GAAACATGCAGTCTAKNAGGAAGAGATAGAGGCAATAGGAAAAAATATAC 
TTAAGATTAACaGCTCTTTATOCCCGA C I POCT I AACTTCKGATGTNGTG 
TCAG AA A AGCAACAGTATGGGCT AGAA CAAAGTGGG AATGGCGTTTTAA G 



TGTGTTGOIAGTGATTAATCATAAGCTTATTCTTCATCAAAAGTATATAT 

ttctttcacactacnctaagacagt attatacatt 

AKTGAAACT AGAACTCTA TATTTAGGGAGTT A GCAAAAAAAAAAAAAAAT 
A CXTTTTJCTCGGCA CCAACCTTGGGGAGAA TT ACTTCXXACTTGQCTGOCG 
GGTCI 1 1 1 1 1 GATGCAACGCTCGGTCOCAAACCATTGGGTGGGAAGCAAA 
OG COTOOGTT AAAC TTGG L I 1 rOCTTGOOCTGGAAAAAAAAAAAATTTTT 
TCCCCXj i 1 1 lCCGGCCTTTA 1 1 1 1 1 1 ATTITTTCCCCCACCAAAAAAAAT 

iiiii i 



CTTTTA 

GGTACACAC ATC1 1 1 1 I O AGATOCTACCTTCA G fTCTTI I OA OTATATAO 
CCAGAAGTOGTATTACTAAATCTTAOGATATTTCT^ 
AGGAACCACTGTA G T I 1 1 1 CA TAGCAGOGCAOCATTTrACGTTCTCACC 
AAGAGTGC^CAAGGGTTOCGAGGTTCCCACAIC^ 

1 1 1 ILIGC1 f 1 1 1 1 I ACATIXXAGOCATCATAGTGCGTGTGAGGTGACAT 
TTCAI ro roOl 1 1 1 GATTTGCATTTOCCTAATGAGGAGTGATTJCTGAGCA 
ILI 1 1 rCA TA TCCTT ACTG GTCA TTTGTA TGT rOTC l I IGO AAAAATOTC 
TATTCAAGTOC7T1 GACTATTTTAAAAATTGGGTTATTAGAGTTATCGTT 
GCTGGTCACTTGTAGGAG1 1 PCI I iClATATTCTGGATATTAATOOOCTA 
TTAG^TATATGATTTOCAAAgiflgCI IL ' ll 'l L ATTOCTAAGGTTACTTTTT 
<JLI 1 1 IGG1GAATGGGG1L1L IGATGGATAGAAGTTTTTAGGTTTGAAAT 
AAGCTAAAT TA IL IGG I 1 1 1 ACTTlTGGGGtX^GGOCTTTTGGGGCCATA 

GCTTCAAAOOOQAATTCAAOACACTTOOOOOCO OTI 1111 po aatccagc 
1 OGG 1 UGAAACA TGGCGATA TAA TGGGA TAACA TGGTACAGTGTT AAA TC 
>Sequence 1121



OOCTTAGOGTOOTOOCl I ICG AGOTA Ll I INN I II 1 1 1 1 1 1 1 1 1 1 1 1 1 A 
TATTTAGTAGAGACGGGGTIIt^CXXrrOGTAGOCAGGATOGTCTTGATCT 
CCTGACCTCGTGATCCACCCACCTTGGCXrTCCCAAAG^ 
GGCGTGAGCCAOCGTG^CGGGCTGAAAAATAACCCTTTAGATATCTACAG 
CTTTAAAC P0TG1 GCAGTCATGAAAAGCAGACATTAGAAGTCATTGGCAT 
TT AA TAAA TTGCAGTAAAATTATACAGT A A AT ACATTACAATCA TT AATA 

AATTTTATT L IG 1 1 C TCAATA 1G1 IG CTO CILI IH I ATCAAATACTATA 
A TAAAACT A T A TG ACT ATTA T AT AGATTTCAGGAGCT AAAAAAAGCCTT A 
TATTITCAAATTAAAGAACAATATTAATTITGCAAAATACAATGAGCATT 
ACTGAAGTATAAAGGT AATATTTTGGATTAAAATATATGGTCATTTAGAT 
AOOGGCCTTAAAAOAATAGAAATCTTAATOA 1 1 PCI I rCTGGCTACAGTG 
A GCTT A A AAA T ACCA CCCCAAA A TTT A A T AAAT A TGT A GCACTTCAA GAA 
ATTTTTTAACAACTTt^TAATGTUAAATTGA 

GAA TTTG AAA T AACTGCTGOCA TTCTTTTGGAAAGGG A CCTTT A GGG AGT 

TCCTTATCOGACACGGAT 

>Sequence 1122

OAOCTOOACCAGTTOOCTOAOOAOAGC1CTTCAGCAAAAGC O C 1 1 G I LAG 
TTTAAAAGAAGGAAGCTTATCTAACACOTOGAATGAAAAGTACC 
>Sequence 1123

A CC1 1 1 lA TCCCTCAAAGGAOCCI PCI 10 0 01 1 1 PG AATGGAAG OC 1 1 I A 

ttoccgttaagatgtt rrci ICTAI 1 1 POOCACTTOCATCI 1 1 1 1 1 IGIG 
GCCCTCGATCCTA Mil PCOCTO ACTOCATG C I IGG 1 1 G GCCCTTATAAA 
ACTTCTGCCCAAAAGATTGTGGATTAG A C 1 1 1 COG AGOACTTAOCTOTOC 
T A GGGGA GTA GGCAA GCACTTCCACTAGOGAGGGGG TGGGGGAAAGGAAT 
GACAG^TGACATACATGGCATACA^TTAAGCAGTTGATCATATGTXrTCA 
CTGG CTTCCA G I 1 IL I IGG GAAIGI IGGICU XI IG I ILA GGCTTGCATA 
TTTT AAA CTAA-AAA TTTCAGTCTA I POTT I rTAGTAACTTCATTTATAGT 
CCTCCATAACAAGTTAGAAGGATGTATCTGCTACCATTrATTOCTATAAT 
TTTAAAAAGTTGGGGCTTGACATTA TACTC ATTTAGTOAOAGTAGATGCA 

GAGGGCAAGCTTTATCTTAACCATGTATGGTA< 

taaggg 

>Sequence 1124



CCC 1 1 1 CG ATOXKXXI O ODOGGCAOOAOGOGGGTAGGGCAACTTOOATOT 

atgcttagggttcg<^aaaagtaaacaaaaatagaagggaaaaaaattat 

TOACAATGAACT GC I 1 IGO TAGTGAI I IGIU A I I UG1 1 1 1 1 ILI IGATT 
AGTAAOCAACAGCACAGCCACCAAGAAA 

ggtacaglaaaaagacacatttagataaactgaagcagattaaagtgactt 

TATAAGACAACA ILI 1 IG1 1 1 1 1AIGI 1 1 AA 1 1 1 GAAGT ATGGTT AAGCA 
CTAATTTAATTCA OTO C1 1 PCTOCTTAI ll-IUI I PCTA OTAAC TC1 IA CA 
GAAACAAGTCTAGTCAGTAGCCAACATACATCCATGTCAGOCTATATATG 
ACTTACTAGGAGGGCTTAG 1 1 III 1 A AAAGAGATGAAAAATAAAGAGAAG 
GTCTAGTAl 1 1 1 OC I O OCACATTOCAACAOATCATTTTATGTO OOOOC 1 1 
TGGGTGAGCACATTCCA IG 1 IG 1 AGACCATTGATCATAGTACTCAGAGCA 
.TGGAGCTCTGGAGTTCAGAANAATAATTTTATTATTG^ 

aataattaocatgaaaaaaaaaaaaaaaaaaagt 

>Sequence 1126

ACTTTACTG1 IC I 1 1 1A AACCTOGAGAAG<XTCTATOGCTTA I ICULI I A 
GAAGCAACAAATGAAATGATGTATAAACCATCAAGTCAAAGATACAGAGA 

actggacacattxactaattgttatgacaatcaaa 

AATAOCTAAGGG ITGTC1 AAGGCrATAAAGGTCAATTTGAAAGOCAGTTA 
GGGATCCA OOGTOI I I LATAAAAGTGTCTTACACTCA IGI I IOGC I 1 1 LA 
AGAAGTGaTA TGCCTACTAA^GCTtrrTATTTTGAGA CTATCCCGOGTAJDC 
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^Sequence t 

CCCI I tocaoocccoottoqgocaggtac 

TTTTGGCCTCCAATTCCATTTTAATi rlCi I 1 PCTTTGT1 rGICll ICCTC 
AAATATACAGTCCA TCAOCTTGGCTCA CTGCATGTCAOCAaAAATTCTOC 
AGGGATTTCATACTCTCOG W M TG1 G GCTOOOCCAOOACTATCCAT GCAO 
GGAGGOCTGCACCTCTGACAGTOGGCTGCAGCTGGGGGTGOOCATCTTTT 
GTGCTCTGTGGTACTCCTACACACATAAATTCAGGAAATGACrAGATGAG 
CCTGAGTGGCTTTATCATTATTGTGCAAATACAGTTTCTATACCCACAAA 

cccaa ATTAAATTATTATAGGGACT aatggctgtcaggtgggtgtgggag 
g AAAAA ATTCACA agcttgtgtaccaattacctttaccatgaattttatg 

TACCCTTGCGCGCTAOCACACTT AGGGCTATTTTCTOTCACACTGOGGGT 

ccgtatcttagggaatoccacttgggtcocacatcatggatgacacctgg 
taattaactggtrccctctcataaataaaattccgttctacattcaacac 

A A A ATTACGTACCGTACTGCAAAATATTATATTCTTCGGCGTCCCACTCA 
G A TG A TCTT A C A C AC A TCT A TTTGCT A CGCCTT A TTGTTTCTTTA CA A TT 
ATACAACTTATTCGGATAACTT C T C T A AACTAACTfTACACCCCTGCGTT 
AGGGCGCTTATCTATTCTCCA TCATTCTCAACCGTTT 
>Sequence 1128

TCTATTATCATTGAGTAGGGAAACTTACTAGGTTAAATAGAGAGTATATA 
GAATGTA 1 1 I GO I I A T A G AT ATGTG A A GG AA AA GGCAT A A TT AT A TGGTC 
A TCC ATGCTGGGG A ATATTTTGTAGGTATGTTTTGTTG AG AG A A ATCGAT 
CAT ATTGG ATCAATAG AATTAG AGA AATATCTTGAGCATCAAG AG ACCTG 
GAAACATGGG AATC ATAAAGAGAG AAAAACTGCAGTTTCG AOGTl CT7 GA 
GGCCACAA GAG AG ATGG AGG AATG A GGGTCG TGTA T AGG AA AG A G AA AT A 
AG AAATTGTGTGGG AG AG AAAG ATGGTTTATTG TO A TGGTCAA A A TACCG 
AGCATXXX3AGAGCCAATGGACAACATTTGAAAAATGAATCAAATTGATAA 
AGTACCTTCGGGCCGCACCACCCTTAGGGCCAAT 
> Sequence 1129

A C A G TGGCCC AA TCTTGGCT AGTGT A A TTCA GTC 1 1 1 1 G AATAAATGGAA 
AAAATAAATTGTATGTTATTTTTATACAGAAAAAAAGGCCTTAATATCAT 
A AGG TTTTTTT ATAGCCCTCA AAACTGA TTTTT AAA TOGA GG T A GGCAAC 
TGAG A AA ATAA GCATTTA A ATT AG TTTTCACXXX1AA AGCCCCCCAAAATT 
TTGCTTACAAAATTAGGGTACC 
>Sequence 1130 

A C! mnTTTTTTTTTTTTTATTn PCCl llll I A TT A 1 111)11 1 111) 
TTA1 1 11 1 1 1 lA nHT IT l NNNAANHTTTTA T l TTrTrTA TMWWTATAAA 



AAATTATATACNAGGGGGGATAAAAAAAATAATAAAAGGGGGGGTGGAAA 

AAATAAAAAAAAAAAAOGGGGGCCAATATAGCGGATTGGGGAGAGGGAAA 

AAAAAAAAAAAGGGGGGnriTGAAAAACAAAAAAAGGGGGGCCGGAAAAAA 

AAAAAAGAAGATGOGATTGTAAAAAAAAAAGGGGAAAAAAGAAAATAATT 

GGTTTAACACAAAAAAAGAATAAAAAAAACGGTTGGGAGOGTTAGGGGGG 

AAAAAAAGTGAGGGGGGGAAAAAATGGAGAAAATGGGGGGGGGGGAAATA 

TAGGGGGAAAAAGGTGGGGAAAAAAAGGTGGGGGGGGAAAAAAGGCGAAA 

AGATTGTACTAGGAGGAAAAAAAAGTTATTACGGCGAACATATAAACAAA 

AAAAATAGGGGGGGGGAAAATAAAAAAaAGGAAACOGGGGGGGGGCAAAA 

GAGGGGGGACACTCCCAAATATATXK7TGGGGGGGGAAAATGGGGGGGAAT 

AAAAAAAAAGAAAAAAAGTGGGCACATGTGAGAAAAAAAAAAAAAACATA 

CAGGOGC^GGGAAACAGGAGTTAGATAAAGAGGAOGTATATTAAATGTrT - 

AAAAAAAAAAAAAA^AGGAAAAAAGATTGGAAAGGGGGAGGGGAGTATG 

AGTAAAAAGAGGAGGGGAGAAAGGGAGATATAAAAAAAGGGTGAGAACCO 

AAAAAAAAAAGGGGGAAAAAAAGAGGGAGAAAOAOOGaAATAAAT 

> Sequence 1131

ATAGGTrGATGGTAGATAACACATTTGCTTCATTGAAGTAATCTGAAAAA 
CCAATCCTCAAAAGACCTCICAATTAGAATTCTTAAATO^ 
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i I if TCA TATA TTTG A G A G A TTO ATTTAA_*.GAAJLAA TAATC-C T T G ACTA T 
CTGAAATAATATTTTAACCCT 

CTA C HIGH ACTGAAATCTTGAACCTGTAGAACATTACrrATCACTGTG 

I ro c 1 1 1 C 1 AATGGGAAAATAATAATAAACACTTCCAGAGTATTKTTTAA 

AAGTTlTTAGCTTTAAAAAAAAAAACCCTGTOCCrTACA 

TTG A GTTG ATACTGATT ATOAT AA TT AG A TGGT ATT ATA CAATCATTCAT 

TCAGCAAACATTC^CTTACroAGCACCTACTAATGTCCAAGTAC^TTCGG 

NCGCG ACACGCTTA GGG 

>Sequence 1132

ACATCACATTjGTGAAAGCAGGAGCAAGAGGGATAGAGGTCCCATACACTT 

TTAAACAATCCGATCTGACAAGAG^CTCACnGACTATTGCAAAGATAACTC 

CAAGCCGTGAGTGATTGGCrC^CATOAOCTGAACACCTCCCACCAGGTCC 

TACCTTCAGCATTGGGGCTG ACAAAGCAACATG AG A TTTOGGCAGGG AT A 

AAT A TCCAAATTATATCA TTCTGCTCCTGGCCTCTCCCA A ATCTCATGTC 

TrCTCACATTGCAAAATATAATTATGCCTTCXTTAACAGTaXX^AAAGTC 

TTAACTCATTC^GACT>n"AACrCANAAATTCAAAGTTGGCCAGATTjCAG 

GGCTCACACCrATAATCCC^GCATTTTGGAAGGCCAAGGTGGGTGGATTT 

CTTGAGCXXAOGAGTTTGAGACCAGCCTtXXn'AATCTGGCAAAACTGCAT 

CTAAGGG 

> Sequence 1133 

GGTACTGAAACT ACACCTrCTGAGCCACCATCCCTGGCTTAAACATTTGTT 
TTTAATTAGCCAGGCTTGGTGGCC^CATCn^TAGTCCCACCTACTCAGGA 
AGCTGAGK>FGAGAGGATCAC1TGAG^X!CAGAAGTTCAAAGGGGCAGTGAT 
CACTCCATTGCACTCCAGCCTGNGT AAC AG AGTGAG ACCCTGTCTCGCCA 
AAA AG AAAG AGGTTAAGGAGGAGAAGACTCTAGA CCA AAAG AAGTAACTG 
AT ATT ATTG A AAAT ATTTGA T A GCAA TCGCAATTATTTCG AT AACT ATTT 
TCA CAT A TG T AAGCAAA CCA AA T A GGGTCTCA AAA GTTTCAG ACCA AATG 
ATTCATGTTCTCT A CTTCAACCTT AA AAAAAA G TT A AA G A ATTCTACAA T 
TA CAAAAA G AACAGTTATTCTAT A GTT ACAAAAA G ACTTGAAAA CTTTC A 
CCTGAATGCA 1 U 1 1 1 1 G TTACAAAACCATTAAAGGAGGTAGGGGGCAAC 
TTCA TG A TTCA TCAATGCT GCCTGC 1 TTTTI AACCCAGGAAATCCTTTAC 
AGCAAGAACCTGAGGTGTAGCTGCCCGGCCG 

>Sequence 1134

A C IIINII1II Mil I1T1T I GTTTAGGAGCCTCTGGTTACCTTI TCT TG 
TATATTTACT T I CI CATCCI TTCICMnCT rACCCTTCCATCTTTG ACA 
TCCTTATCTA TTCT AGTGCCA A CCCCTCTCTTT AAAAAGTCN AGT AG TGT 
NNAATATAGTTCGCrCNTTTTTATTTANNAAAAATTTTAAAGATTGGG 
A TTTGCn 1 ACTTTATCATCTTACC<}AG<XXXnTTATTTATAKTNNGTGT 
ATTACANNAATATATTraTTAACCrACCCTAGCAAATATnNTATGGGTA 
ATAACTTTCGCCTATTThn"AATATAAAATCCCTGGGTTTTITAAAATTCT 
TG AAAATGGCTCCATTTTAAAGTAATAAGGG AGACAGGGGTG AAAATTGG 
TKTOCAAGTTTT ACCTAOCTACAAOCAAGGAAATAAAGGGAAGCTCTAGA 
TTCCTTGGTO Cl 11111 1 CCAAAAAAGAAAAAATTTTTAAAAACCAAGGC 
TTATTTGGAGGTATAGO 1 1 1 G ATTATAAOCCTATA 111! U OACATGGTCC 
CTTCKKKXOGGOACCACCGCTTAGGGGCGAAATnXACA 
GGCGCGTTACTTAGTGGA 1 rOOCXIAQOCTCXXnAOOCAA<XXTCOOCqTA 
AATAATGOGCAATAAOCI U G 1 1 ICC I G OOOACAAAATOOTTATOOCGCTC 
CCAATTCCACCACAACATACCAAC<XXXjAAGCCTTAAGATCT 
GGGGTGCCTCAAGGACGAGCCTAACCTCCCATTAA1 1 G TG Tl GCGCTTAC 
TTGCGCC UI 1 lU CCAATTGGAAAAC C 1 1 1 <J I O QCCAACTTOATATATOGA 
AATGCCGACGCGCGGG<iGAAGAGCGG^ 

AT 
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GOTACAGAGG AA ATGGG ACTTTGCAATTATA I J 1 1 1 C I AAGTGGTCTGAA 
CITC^rrCTCACTACCCACATCANOCTTCAAT^^ 

GACTGCCCCACGGGCTAAACAGCTTjATCCGCTCTCTGAAGCCAGACACiTC 
TTATCTGGGAGGTCCTTTACAGATGCCACTGTTGAAGGCCCGGAAGCrGA 
AOAG AGT TiAGC TCCATCCTCAAGTAOlCCl 1 I A TG CI ULI 1 IO GAACAAO 
CTTTGCTOTrTTGGGCCXXK^TTTGTC 

TTTANAAAGAAGCCATGGGTOCTTTAGGTAATGAAATAG i GTGTTGATGGT 
GTTATGGGTGATGATTjGAACrGAGTGCGGGTGTATAAAGnxriTCATCCT 
CCCAACTGGGTGGTATCTAAAATOGGCTTGXjGCTTT^CATrTATA^ 
GAAGGGTCGGGCC AGCTACCTAAAGGG AAAGGAGGGAC C rTCTTOCTTA A 
^X^A^TCCCVOG^A^CGCAAAACGGCAaGGGGGACAACACCTGGT 

ACTCTGGCCXKK>GTGGTCGTTTTTAATAGG<KrrAAATTCTTATCAC^ 
CTA TGCCGG TC ACT ATAA TGG AA TCCG ATAATTCGTT ACGG AGACCTTGG 
CTCA ACCATAGG ACTAAG ATTGTATTCCTGCTGTGCA AACAGTG ATTCGC 



ACAGATGAAGATGTGTTAAATATCTCAGCAGAGGAGTGTATTAGATAAAT 
OG AATTATCATATATATGATATACAAAl > I 1 1 1C t ATTT AA AAA TAT ATT 
AA TTJG ATCAACTTTA A AATTGTTAGTTGCCAGTOATCITTTTTGG AAAAC 
AAAAATGGGGCA 1 1 IOT I L. ATTTATTTATTTTCCGTCTCTAATTAGTTAC 
CTCAGTTTGATTGAAGCCAGTGAAGTTGTG C1 1 1 1 tXJ l C J A CTTCTACTT 
CCTCTCCCCGACL I 1 1 1 1 IC1GCCCAGTGTAGGTGTATTC 
>Sequence 1137



TGTCTCACCXTCCCCAGTAC^rrGGGATTACAGGTGTGCACCA 
TGCTAATTTTTG I G I 1 1 1 1 AGTAGAGATGGAGTTCACCATGTTGGCAAGA 
CTGGTCTTGAACTCCTGACCTGAAGTGATCCATCCCC 

AGTGCTGGGATTACAGGGATGAGCCACXX3CACCTGGCCCTGTCAGGGTTT 

TCTTAACATTAGCAACTGCATTTTGATTCTGACAACTGTCACA^ 

GCGC CAGGTA ACt 1 1 I G<J 1 OGCTTG TG CCCTGTAA G A TTTT AGCAGCATC 

CCCGGC 1 1 CT A OCCACT A GA TGTCA ATA A CA TCCCTCA GTTTTG ACCATC 

AGAAATC ICTCI AGATATTGTt^AATGTCTCCTGAGGAGCAAAGTTGTCC 

TCCCTCGGGAACCGCTGATCTAGAGTAAAGAGCTGGAAATGGAATCACCA 

GGTCATAGGG<XTGTGOGTTr TCAGT TrTrCC^ 

TTCACATTTATXXTCATTTACACTTTCATCAGCATAATTT^ 

GATTCATGTGTTAAGCTGAGCTTCATTGCCTTCAATAAGTGATAATGCCT 

OAGCC AAGGAC AATATGACTC43ATTTTCACACCAAGGTGCGGGTTAAATC 

ACCCTA T 1 IT"I1GGACTTCGG AATCAAGGGATTCGGTTAATTAAGGGAGG 

TTGCCCTTACAT 

> Sequence 1138

GGTACAAAACAOAACAAGTCTCAGTTTTCAGTGCAACATTTC^ 
TATATGCTGCAATCTAATAATTAAAAGGAATTTTACCTATTATGAAACAT 
ATT A CATTTTTTAAGTTAG ATAA TCAGTTTCAAAA GG AGTATTCAGGTTA 
TTTAAU 1 1G1 111 I AAATOGCTGCATCAG AAAAAAATGTCTATT1TTTT 
TTATT AAAAT A TTTCA TCACTTGTTAAAACAT ATTTTTGA TCTGAGTTTO 
GrTAAAGTATTATTTTAOCTGCTGriTtSCOCTGCCCGG 

G 

>Sequence 1139

GGT ACTATCTCGAATGAAGTTAAAAACAAATTAGAGGGAAAAGGTCAGGT 

TAGCATGTTTTAGAACTATTGGTAACTATAATTCATTJGGACATTATATAA 

TCAAAAGATTAATATTTTAAGCACTAAGTTATAAA GGGTTTA CACCCATO 

AATAAAAAGATTAOCATCACTTACTATCAACCACCATTCCATGAATCCAT 

GTAGCTOAACACTCCTAATOAAAAGrrrTAATTATCCTTCAACCTGTAGTT 

GAAGAACTCAGTTCATGTTCATTCAC^GATTro^ 
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ATTG ATGTTA C I HC I 1 I GACACTATATTTTATATAGATATATTAAAATT 
G AAAA CTT AA TGCTGTTT A G AA GGCTA TT AAT ATAA CTATT AATTTCTG A 
AAGCTTTGA O U 1C 1 UAAAAGGCITTTAAGATCAAAATTTCTGAAACACT 

ecACACA nc i re el ca cccacatttaattataaatcaatgttatactoa 

TAAAAGGTTCTATACACACATTTAGAGATATATOTGTGCGTGGGTGTGTC 
TGTOTGTGAATATATATATATAACTTATTCCCCXXAAGAAGGAGTCT^ 



TAGTAGGTGGAGGGTA 
ATCACCTCACATTTGAGATGGGGTTTT 
ATTTTG^GArr^TGCATTO 

ACCCACCTCGGGTCAAACTGATGACAACGGGCAATAACACAGTGAGGTTT 
AACCCCAACTTCTACCGCAATGGGAAAGTCTGCTTGAGTATTCTAGGGTA 
AGAGGAGACTTTTAAGTACXXAAGTCGGTTGTTAGCAGATAATTACTCTA 
GGTCAGCCITTATCAACCGGAGTCCCTCATCTGAACTACAGAACACAGAA 
AATG ATTGAGTG AC I CI 1 C 1 LAAA TCTCCTC A GG A TGG T ATGTG ACTAG T 
ATCATTCTAGATGCAOAGCOGAAAAGTTAAT1TATTACAGTGGTAACTCT 
TTOAGAAGTGO ITCTTTTI AAGAATGTGGCCCTGAACATCTGGGA ACTTG 
TTACCGAGCA G 1 1 1 1 1 04 3GGCCCTTATTT AGACCT ACAAAGG AAACTTTT 
GGGTTGGGGCCCAAAATCTGGTATTTTOGGTGTTG A 1 1CI IOI I 1GATAT 
GAG ACGG A IC1 IOC1C1 GTCOCAGCTT GAGTG AGTGGCACTAACTTGGTT 
ACTG AAACC1 11 GC TT GAG ATlTAOCTATTnCl GTTCAGTTTCCACTTC 
TTAGAAACTTGGCT 

> Sequence 1141 

A C1TIITA I 1 1 1 11 1 I I 1 G ACGGAGTGTGGCTC T CI* r G CCCAGG ATGGAG 

TGAAGTGG<^(XiATCTCGGCTACTGAACCTCCACXrTCCT 

ATTCTCCTGCCTAAGACTACTGAGTAGTOGGGATTACAGGTGCCTCCCAC 

CATGTCTGGCTAATTTTTGTOTTTATAGTAGAGACGGGGATCCACCATGT 

TGGCCAGGCTGGTCTTGAACTCXnX;ACCTAATATGATCCACCTC 

OCT ACTA CAGTGCTGGG ATTACA GGCA T A GCCA CCG A GCCTG ACACGGGC 

ATTTTTAGCATGCAAAACXrTCAGGAATOAATGGCTGTTGGTC 

AA TCAT A CTGGCTACA TGTTGTG AAA CCTGAA GTTTTTGTT AGATTGT AT 

GAGAXATGATCTCTGGACGCAACACACCCTAAGGGCGAATTTCAGACACA 

CTOOCOGOCCGTTAACTATCGGATCCGATCTTAGTTACAAAACTAGGCTG 

AACACATGGTGCAAGGCATGTTACATGAGTCAAATTGGTAACCCGCTCAA 

GATTTCrAAACAACATACGAACCOGGAGGCrrATTTTGCTAGAGAGGGGG 

GTGGCACACX^CCCGCGCCACCCCCCACTrAra 

CTTTTTTGTn 1 GGACACCCCCCCTGTCACCAGC 1 1 1 1 JGACTTGTTCAA 
CCraXECOATGAGGCOTATO 

> Sequence 1142 

ACTATTATCAA C 1 G TGATGATGATGATTTJTGAATCTTATTTTC1ATATCTT 

GGGTTITCTTACAGTGAAATA 

> Sequence 1143

acctacacacatatatocatatatggtataatgtatcaatatttacagag 

ATAGCTATTTAAAGTG<ri!ACAATATCGGTAAAAGATTGCTT^^ . 
TTTGGG G1 1 1 111! 1 1 1 U I CAATGATCTATA 1 1 1 1 1 AATGTTANNCTTT 
AAAATAGA iT^^CGTGGAATGTCGATTCAACTTTAi 
AGACTGATATGAAAATAGAAG 
G<XKK>GAAATTCCAGCTACACTTt>GCGGGCGCGA 
ACGAAGCTACGGCrrACACAGAGCTTGGCGACGATAATCAATGC^ 
GACTGG1 rGACCTTGTOGTGAAAAl 1GG 1 1 AATCTCGCTrCACGAATTGC 
CACAACANA CAATA CTGACGCTCCTGAAAGGCAT AAAAGA TGATAAAAGC 
TCTGGCCIGTCGCGCTTAAGTGAAGATGAGCTGTAACCTCAACA^ 
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OTOGGOTATGCTGCCTACACTGTGCGCO ft* I C 1 1 1 tC ACACATGCWOACC 

CCICACACATOCTOGCCGACGAAQAACGGOCTX7TATTTAGCCGATAATAT 
GGNGACNGCTACKrATO0OOTC1TATCATTAN<nCCAACTCGAACTACGC 
TTCnXjACATOGACCACCATrrcCC<XrrTOGO(XXrrOAACOOCCNTAATCA 
TTCTTA AOGATC AAOAOGCOGGCTTAAATACTGOGT TATrCTO CACAOAC 

atca tgc rci gatataccgctagatgaaaaaaacca rron gaacacaga 

G A GTGCCTGCCAACAA TGGCTCATG AACCCG A TG A AAA NGGGGCGCCTGT 
TTCGTTGATGTTI^ATTACAATCACGGTTCCAGTACTOCGCTGGAG 



A CTATAAGTAGTTGGTTTOTATGAT ATGGT TAAAAAGGOCA A AGATAA AA 
GGTTTCTTTTTT7TOCTTT7TTCTCT ATGACAGTTG ACTGTTT AA 1 1 1 1 1 
TCTTGGCACTG A TTTG A TGTA I rOlO 1 GAAAACAATTGTTGTCCAACAAC 
TAAACAGCAATTTTA 1 1 1 1 GC I G AGTTOTTCTAAGCTA A AGATAAAAATC 
CAAAAATAATGGTCCCTCGGCCGTGAOCACTCTAAGGG 
> Sequence 1145

GGTACTTGTGTTTGCTTAAACAAAGTGACTGTTTCGCTTATAAACACATT 
GAATGOOCTTTATTOOOCATOGGATA TGTGG TG T ATATCCTTOCAAAAAA 

>Sequence 1146

GGTAGCAAGGTG AAATTTG AATGTG TG AA CGC A T 
TTTCAAAGAGATTTCAAAGCCA CAA G TT AG AT AA GCOCAA G AA GTAAGGC 
CAGAGTGAGATOOAAGTAGGUL1 1 I C I TT1 AAAAAATAATAGCTTTTATT 
TTATCTCAGT ATC1 I C 1 1 IA CAAATCTAACCTTCC C I 1 1 IC AGG C I 1 1 1 1 
G AAAAGATAGCTA AAATTCAGTG TGI I CCTCTT A TT A T A AAGG A TTGGGC 
TAATAGTTAAGCATTTCAAAACATTTCAGTTrCGTT AATC AGA AGCTGCA 
GTGGC1 ITGTT1 IATAGCCAG1 1 1 LA. 1 1 1 IAAATTTGGCCATGTGGGCTA 
T AAGTTCAAOGTA 1 1 PGTG1 IC1U 1 I A UG I 1 A C TCT CTOCAO AATATT 
ACCCAAA CTGTCA A G I I G 1G 1 1 1 ATGGGGATCGCAAACATTCATTCTATT 
CGGAGGAGITI ICAGICICTGCGGTIGC 
>Sequence 1147

ACATCTGTCAAA A ATCATATGTATGTG AGATGTGTCAATACTAG ACTTGT 
GTCA TTTATGCTACTTAGAAAC AA G A TAAAGAAA TATTCC IOC 1 1 TOOAC 
TCCAAAAAAAAGAAAACGTCAGCTCCCTCCTGCAACGAGTAGGAGGATGC 
AAACCCTTTA TAA TTGA TAATCAGAAACCCCTCA G A CCCAAAGT AAAAT A 
AAAAAAAAGATATGTAACATTAGGCATTGATGCAAAAGGACTAGATCCTA 

gtataagcatcctaataaaaggagaggttcaaagacgctctccagaacca 
gtaattcagactttatatgataaactaaatgtgccagtacc 

> Sequence 1148

OOCrTAOCCGOCOCCCGGOCAOGTACTATTGAACCAACAGGATATCrrTT 
. TTATTATTT CCATO AGTTAATOCTACAAACAAAATTAAATACCT C 1T T IA 
TAAAACATC1 1 1 rOCAGTCI 1 C 1 AATTGATGGAGATGCGG ATCACTCATC 

tataaaaaatgacttacagcttca gcttaatcag ttgct ataatgtgaaa 
acaogaatgtgta i f n t'ltcaactaggtaaaaggtgcatataatttoaa 

TTGTTAAATGl 1 1 IATTAATGAACAAAGTAAACCTTTTAGTAATTTTTAA 
ATTACTOGTC1 1AGGTGTTTGAAACAAGGTAAAAGTATACATTCCAGTTT 
TGOCCAAAAG^O^CTTAAAATATCTACAAATTATTTAA fCTOTOTOl OOT 
AACAOCATTA 1 1X8C1 IXAATTTCTGOAAAGAGTCT ATTrTCAAAGTTTAA . 
AAAAGAGG^AAAACAGGAAAGTGGtTTAACTITTGCAGTtKIAAAGAAAAAGT 
GTOCTTCA TGGGTTACACI 1 1 LATATTTTTATGCAGCATTAAGTTATCTA 
CGTTATOGOGAACTTGOOTTTrATTCCTACTCATGCATOA FOTOOTl LAG 
AACITACTTOCTGACATTIt^GAGAACTTCTTACATTA 
CTGAGGTGCAA CTTGGACATATTACAATTrACTCATTATTTGCCATGGGG 
GCTAAGTTACTATACTGGTCTTGAAATAAAAGTCCCATTTGAAATACTAA 
GTTAGACTCATAGGAAGGGGGGGAAGGOCTTAATATAT 
>Sequence 1149
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GGT A CCA T A TTGTTCTTGTT ACAGTTGTT ACTGTCTCAG A TA TAA TTTTG 
CAATGGCGGTrrCGCAACTAGCAClGTCCTGACCCTATGTATCATAACTA 
ATCTAGGTAAACAATTAAAATAGGTAAATGTAATGTGATAAATACTTGGG 
GACAAACTGGTCATAATITAGAATCTCAAGCTATATTAAATAATAGATAT 
TTCATTATTTGGGTATTTTCCAATAAAAATGTATTGGAGGAAAACC^ 
CCAAAAAAAAGTGTAACCTTTTTAAAAAGGTGGAATAA 1 1 1 1 lOlLl AAT 
TCAAAGCTTATTTAAAGGTrATGTGTAAAACAAGGGT AAAGAA CCTTT AA 
A TAA G A A AG A TGTA A A TAAA GTT A CCAAA A A T AA AGTG A GGG TTTTTGGG 
1 1 1 1 1 1 1 I I GGTAAG AAGGCTTAAAGAAAAA TATTTTATATGAGAAAGAA 

tcttgtatataaatttagacct AGAATTAAATGGCTGTTT AAGT AAGAGG 

TTAGTCCAATCAACGGATTTTTGAAAACATGAAACATACATTATTAAAAG 

ttcctg gccgggcggccctttaagggggaati 

TTTTJTGGAACXjGGCCTGTTACAACTTOGCGGAAT 
GTGTG AAG AGTTA TCGGTA AAA TATCCA C AGA TG GACCCGCCCCCTA TGC 
GCCCCT 

>Sequence 1150

'ATTTTTG A 1 1 fTGTTl 1 AACAAAAATAATAGAGGAGA 
AGCTGGGCACAGTGGCTCATGCCTGAATCCCAGCACTTrGGGAGGCCAAC 
TCAGGAGGA rTG C TTl AGGCGAGGAGTTGAAGACCAGCCTGGGCAACAAA . 
AAACAAAAAAATTACCCGGGCATGGTGATGTGTGCCTGTAGTCCCAGCTA 
CTTGACAGGCTGAGATOGGAGGATCCCTTOAGCCCTGGAGTTCXAGG^ 
CAGTCAGCCATGATCTCCOCATTOCACTCCAGOCroGATGCCAGAGCAAG 
ACACAGTATCAAAGAAAAAGAAAAuAACACAAAAGAGGTGGAAGGGCTCAA 
CXAGTGCTTTCCACATTOGCATTOCCTTAAA^ 

TAGAGGACTTTTAAAAAACAGAAAATACTACTTAAGTAI 1 1 1 IC! IGTGC 
CCATGTGAAGTCTGGCAAAAOACAGAACTTTAAATAATTAAAGAGGGCTA 
ATCACGAAAGAOOCCTAAGTTTGCAGGAAAAGATGAAAOGG G fCTTTCTA 
CCAATCCTTAAAACCTTGAGGACTGACTTTACAGAAGACAATGGAAACaA 
CT AAGGGCTTA CAACTCCTCTGAAAAAGTTTGAAATAATATTCAATTAAA 
AAI 1 1 1 1 I IT1 ATTGAAGGCAAAAATTTAAATCGGGTTTTTAATGG AATA 
ACCCCAACACCCCTGGTATTG 
>Sequence 1151

GAOGGAA^CTTCCTCTCToS 

TTGTATTTTTAGTAAAGACGGGGTTTCACTA I U I rOGCCAQOCTOOTCTC 
G AACTCCTG A CCTCGTGA TCCACCCACC1 1 G GCCTOCCAATCTTATTTGC 
TTTACAAGTCCTGC1 ICAOGGTTAOC1 roOCTOAOCACTGCTOOCTCOCT 
OOCAGCA TTTGCCAAGG ACTGTCA I 1G O C1 1 1A GTTA 1 1 1 1 1 IC1G I 1 1 1 
O OI 1 1 1 1 1 II IG GC O I 1 1 1 1U1 1 1 1 1 1 1 1 rGAGACAGCGTCI TAATCn I 
CCCCAAGGOCTGAGTGCATTOGCACAATCAAAAGCTTGT^ 
AAC TTCTG GGCTT AAOCC A AI ICIC I I A CC CTCACAACTA TOOATACCGO 

ACCTCAGOCTCTTOAATA L1 ICOl I 1 AAAAATCCCCGAGGG ACTTTTTAT 
TCCI 1 1 1 ATACCAATGG AACACrrAAGTCATCTTTCGATGGTAAAGGC^A 
AAAAAAAAAATT 
>Sequence 1152

acaAocaaoa ci i roci I i aatattgataaagaattgagtatcatgtato 

CATTC0C1 nTATGATATACAATTAATTGTAAGTTA 1 1 1CCUL ITCTATO 
CAAOCATOCACA TTTT1 1 1 IC fO AO C 1 1 1 PCCTCAAG 1 C 1 1 ACAACTACT 
TTTAATG ACTGCATTTTGGAGGTGGTCCCAGGAGAACAO A 10 1 1 Q OLI IA 
TAATGGTOTT1TTOCATTTTTATCI 1 1 UATTGGGCAAGGGGGTTGGAAGT 
ATTATTTAGTCATTATATGGA T I CC IC1 AAAAATTGTTCAATAGAATATA 
TA TTCA TTT ATTCA CrTACTTATTGTTT ATTT A TTGCCTAG A GTATACCC 
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aacactgaggatacaataatgatcaagacaggtctaatttctgtcccata 

GAGOCTTAA TTTG AATTAGAAA CAA Tl Til 1 1 ATT AAOCCGTGGAA AAA A 
A GAATCA T AATATAAG 1 ICC1 1 1 OCCCTG A OCACGCT AAGG 

>Sequence 1153

CGTACTACATAGAAAGGGCTTGGAAGTCIGATTCAOGAAAGGAAATCAGG 

AAGTATGAAAAAATATGAAACAGATAACAAGAAAGTAGAGGAGATTCCAA 
AAAATACAAOCCAGGTTTTCTGCOCTCATTCTA TAGAG 
>Sequence 1154

GGTACTGCAACTATCACTTGTCATTTGTCTAGGAAGGTAAAATACAGOAA 
OT TCCC AACTTAAAAAT CGGCT^ 

GTGATTAGTATCCAGTCAGCCCACAGA^GTTATTCAGTCTGTTGCTATAG 
ATGAAATTATCCTTATTTTTACTTCCCCrrCaA 

TCTTCIGAGTC TGOTCI 1 1 MCI I 1 » C \ CX7TATTCCCTCCTCAATCCTCT 

1 1 1 1 1 1 1 1 1 1 ii 11 1 1 1 1 pctgg2 1 1 ic i i ca ttattctctaa 1 1 i ci i c 
ttgtctcataatacttcagttctattotggtacktagatttagggactag 
tttgaaatoctac i i fo cl g atggaaaacacattttgcagoccttaacaa 
gaagc1aaagggt aga tagggtttagct aata cttagtaaggggctt acta 

TtnXKXATXKrrOCCTGCCOGGGCGGCCCGTTATTCGCCAATT^ 
ACTGGOGGGCGGTTACTAATGGAATTCX^AGCTTGTTAACCAACCTTGGC 
GATAAAAATGGCAAAAACTGTATCCTTGGTGGAAATTTT^ 
AATTTCAAAACACATTACAACCCGAAACAATAAATGATAAGCCCCGGGTT 

ccctatoagggggccaaactaccattaat 

>Sequence 1155

ggtacctgcaggaacaatattcc tc tagccatggaagagggccaaggctc 

AGTCACTCCI 1 GGATGGOCTtXTAAATCTCCtDCGTGCAACAGGTCCAGGA 

GAGGCCCATGGAGCAGTCTCTTCCATGGAGTAAGAAGGAAGGGAGCATGT 

ACTTGCGCTTACTTTGTAGCCTTCATCAGGGTITCCTGAAGATC 

TATAGCCTGAGCAAGAGGTGGTGAGGTTGATCGGNGTTTATCGATTACAG 

AACAGGC fCClCI AGAGGGATATGAAGCCCCGCCT 

>Sequence 1156

ACGOGGGCA I I 1 1 I G I ATTGCTATTAAG AAATaCCTGAGACTGAGTAATT 
TACAAAGAGTAGAGATTTAAATGGTTCAAGGTTCTGCCGGCTTTACAGGA 
AOCATOOTOCCAGCAICIOCI CAOI 1 1 C 1 G GAG AGGCCTCAGGAAGCTCT 
TAATCATGGCAG AAGA TGAA GGGGG A GCAAA TT AA TCACA TGGTGAG AGC 
A GG A A C AAGAGA GAG AAAGG AG ATCTACATATACATTATGTAA TTAAAAA 
GCGTGCA TGIG1 ATGTATTAAAAATAATGGTATATAAACAAATACAATAT 

atacaataaaacacctaaacgcagaggct g ctcttatccacaatagtaat 
accaatagtattaatcatgtgtatgtagacacagacxaaagcagcggacg 
tajtaataggcagacacacaaaagcxaagcaaaaagcacgccagtaatgg 

TGTGGATGCAGTA TCAAGA TTTGTCATA TAGACATAAATCCAGT AGACAC 
TGGAATAG1 1 1 1 IGGTATTTTAACTTGAGACAAAAAATGGTCACGTTGGC 

attcactattatatgtaagtgtaacaa i rococ ITTTATGTTTTACATTA 

TGlCCIOl CCAATGG<lATGTTAACCACACTGCOATATACAATTTACAATG 
C 1 1 1 1 1 AAAA I Mill I CT 1 AAACA TTTT AAAAGT A TT AAjOCTCACCCTT 
T7TAAAGGTACCTTCGCOG 
>Sequence 1157

'XrrACA C GC TI X lGC C I I rAAGAGCACTCn 1 1GCI 1 1 I O GGGCAGAAAG 

TCGAGCTGGCTGCATCACAAGCAGAOCT AGGTTGT ATAGTGGCC I'l I G1 C 
TCAAGACaCTCTCCAGGTGGGAGAGOCTTCCA TCAGGGACA T ACTTTAGG 
TTGCAAATTGACT C T TCr CI C T IGAGGCAA rCI C CnGTG GGAGAGAXjTT 
i c rococ 1 GGAGCTTCAAAGTAAGCACCTAGTTAGA TAAGCTTCCAGTGT 
AGTCAG 1 G 1 L I GO I UAAGGGAAGGT AAAGGTTATGATTGCaTTTCTGAAG 
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AG^T^G^^AG^^AJ^GGG^^^TAAAAAAAAAAAAGAAAAAAAGTTOCTGC 

AODCTAAAAAGAjOOOGO CT I 1 O C I G OCCCTTTATCTAAT AAOGTTDOGCA 
CCCACCTGAGGAAGCGTATACAAAGAATCGGACCCCACAGTGCCAAAGGG 

ggcggatactccgocogtggctataatggaatccao 
gaaca tttcct acgwggcctctccoci igacccg 

>Sequence 1159



ATTAAAAATAAATAAACATTGGTCAAAAATCATAAAGCTGTATCAACTGT 
A T ATAAATAATTCAA TT AAAAT ATCA TGCAT AAAA TCTGGGTG 
>Sequence 1160

GGTACTOGG ATTACA GATA TGA ACTACCGTGCTCCCT C ATACOCTAAATA 
TTTATCAAAATTTTTCACTGCTA TT7TCCTCA TOG A TT AAAAGGGCT ATT 
TA TT A TTTTTTA TAA CT ACA GCTG ACCCTTGAACAACA TAGGGGTT AAAG 
GTGCAGATCCCtXGTGCAGTAAAAAAAAAAAAATCATAAAAAACTTTAGA 
TTCCCAGAAAACTTCACTATTAATAGCCTACTGTTCACC^ 
AAACAGTTAATACACATTTTGTATGTTGCATGTATTATATAATGTACCTG 
CCCGGGCOGOCGCTCAAAGGG 
>Sequence 1161

GGTACTATAAAG C 1 1 1 ICMC ACACACACTCTGAAGAATCCTGTAACCCC 
CTGAATTAAGCAGAAA GTC l TCA TGG C T TTrCTG G CT rCG GCTOCTCAGG 
GTTCATCTGAAGATTCGAATGAAAAGAAATGCATG 
>Sequence 1162

GGTACX^ACCCTA TTTT ACAGATGGGAAAACTGAOGCTCAG AGA GGTTAA 
ATCACTTACACAAAGCCACACAATTTTGAGTGGCAAGCTGGAA 
CCaGGCAGTCTGACCCTGCAGCTT ATGTOCl rAAOGATACTCCCTCT C AT 

gtgggcaaaggatggcccaggagaaaggcaggcccagattccaaatctgg 
cttgaccg tctaa g a ggctg a gtctt aacxtctctgagcctttgctgttt 

CA TCTGTAAAGTGGTCXrrCCTGACAGCTGCCTXXrrAGGGTro 1 1 1 1 O A GO 
ATAAAGTGAAGTAATGGAGGGCCCTTGGGATATGGTACCTGCCGGGGCGG 

ccgctcaaaggg 

GGTACCTTTTTCACOCTCTG AAA TTACT AAGCAGGC IGIGGGGTGGTGCT 

CTGAAACT AGGTAGAAGTCCTCaCCCCCCAACAAACCTTTACCAGTGGTT 

TTAGCATGCAGAAGATTCTGGCCTGAACCAGTTACTACTACAGAGGCTGC 

AAAATCATGA 1 1111 1CA TTCA 1 ICI 1 1 IGIA AATACCGGGTA 1111 1CA 

CAGGATGAATGT 

>Sequence 1164

NAcrmm 1 1 1 n 1 1 n 1 1 1 1 1 1 1 1 1 ici ici ia gcagg gici cactct 

>Sequence 1165

COCI rAOCGGCOGOOOGGGCAGGTACAAACI I ICI I C AGTTCTAAI I IC I 
AAGATXnTTCACTCTTTAAGTAGAAATGAAAGTCATCIGACTCAAAATTA 
TAGCAGTA TCTAA 1 1 U 1 1 1 1 ICATAACT AOCCAAATTCACAAATGTOCTO 
GATATATTTCTXiGACAATGTAGATGCTGATATOCTTGGATTTAGOTTATA 
CTGACT 1 1 1 ATCTTTACCAAACCATATTAACATTTGCATTTTATAATTGG 
AATOAj^AATTTAGA4TTAA<MGATCTOOATOWTGCAGGCAOGCAACCATC 
AACCAACAA TACTTTTATCTAOC 
>Sequence 1166

GGTAOGCGGGCAGTGG7 1 1 1 CC I C I A TACCACTGAAAAOCACTATAACAT 

aa ttgtto i ocatgatactqaagc tl icuxtcacttotaggttgattac 
attcagagctcta tcaataagaggaatacata ttacagtgaattcgacaa 
ccgcacaagttggcagtaggtatcoccaacctaatttatcttggtaaatt 
CACCCTC1 1 1 UL I agtcctgctcgataaaagagtgtttactttttattgc 
tcttagacagagtagtctagataacitttcaatttatcaacatag 
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A C1 ICMG TAAGTGGAATGATCATTAGTAACTCA 1C1 1 1 1 1 U 1 1 U ATATAA 
TTGGAAACAGAAACGAGG<rnATTGCTATTGCAGAAATCOCAAACTGGCA 
AAGGOCAAGTTATATGCTATTOCATAATATAACCAGCrnTGAAACTTAT 
GTOCTTGOATTAGTGC C TT CIOG n A CCAGTATTGACTCT G TT A CTTCGA 
CCTTTCGGTCTTAACACAAAATroOAATATGTAA 1 IX, ICI 1 AAAATTOGT 
CGAACCTAGTOAATGOAACTAAATCCAGOAATTCTACACATAATTCKjTCC 



CTAATGGATTCCACCCCCGGACCCACGCTGGCCTAATATTCGGTCTACTC 

ccgaggag 

>Sequence 1167



GCTACI I 1 1C 1G ICI IC I AA 1 1 1 I I AAAATTATTAATG I C I IC' lA 1 1 I 1 1 
CTAAGGCTGAi n i l 1 C I AATGTCTG A I 1 1 1 ICC1 1 1 1 1 1 ICACATCTTG 
ACATAAGTAG AGTTC ATTTATTTTCATTTA TT C TT G T ATA ATAAAATTAC 
TT A A GGTTAGG A ATA ATTA AGTTTTGCTCCCATGTTTTTATGTGTA AC A A 
TCTCA ATGTrGTATGTCATCTACTrCA A AATTTCAAGCTTCC CL 1 1 1 AAA 
A T A CTGTTT AAA AAACTTTA TG AAACCAGTA I II C" I C I CAACCTTTGTGT 
AATACCTGGTTTTACTTTAATGTGGTCAAATAATTTAACCTGTACTGCAT 
CXH3CAGTGCCTTCGGACTGTCTATTTGACCTGC^ 

T IC ICCTIT1G ICI CI AGTTCATTCTCTAACCACCAACCATGAATTTCAG 
GGAACICI I I ICI C A I 1 C IC I I IG 1 1 I IGTGGCCA CI I IC ACAATGTAGA 
AGGAAAAAACCAAATGACCCCACTGTGATGTGAATGGCACCCAAATCAGA 

CTCTGAATTATTTATTCCAAAAGTGCCAAAAAT^ 

AA A AA CTTGCTCTA CTTTTTTG AAATG A TC AAAAACCCCTAA T ATTTCAT 

ACirTATACTTCrGTAAAAATAGATTTTCCATIt^CAAACTGTCAGAAAC 

TT AAAAA T A OCTGTCCGG 

> Sequence 1168

ACGCAGGGATATACAAAGGTGAAAAGAAACCTGAAATATTTGTTGATGGC 

TGG AA TATTT ATTTTTTTG A TCAAA TA G AG AA CTGCCT ACCT ATTGGTCA 

GAATGTGGAAAAAATACAGAATCTGTTGGGCAGTrATGGTTOGGCCTTCT 

TCG TTTCTA CA CA G A GG AATTTG A TTTT AAA G AA CA TGTT ATTAGCA TC A 

GGAGAAAAAG ICTGCTI ACAACTTTTAAGAAACAGTGGACCTCAGAATAC 

ATTOTTATTGAAGATCCClTTGATTTGAATCATAATCTCGGAGCItXiATT 

ATCAAGGAAAATGACAAATTTTATAATGAAGGCTTTTATCAATGGTAGAA 

G AGTATTTGOTATrcCTGTCAAGGGAATTCCAAAGGACTACCCCT AAAAA 

TGGAATA CU 1 1 1 1 G A TCCACAAGTCTTAACTGAAGG AG AGCTTGCCCCC 

AATGATAGATGTTOTCC^AATTTGGGGAAAAATCGGACACTTCATGAAAG 

ACTTGTCTATGAAGAGAAAAAGTAGAACGCGGGGAAAACAAGAAGACGCC 

CTTGAAGAAAGAATACCTCGGACAAGGGAAAAAGAAGCCAOGAGGCCAAG 

AAATTCCCAACAGGirjCTTTGOCCGGGACACGCTTAGGGCGAATCCACCC 

AAT 

>Sequence 1169

GGTACACCTGGTrTCACAGAAAACAAAGCAACTCTrAAACACCAGCTGGC 
AAAATGATAGGGC IIULLII 1 U AATACTCACCACAGGTGTGAAAGACAG 
AATGACTAATCCATCTGATTAAACATAGACCTTTTAGAAATCAATAACCT 
TATTTACACAGATGACAACIGCTACTGTrOCAAGGCTCCTAATCATGGTT 
CAGTTCTCAGOGCCTCAA O ICI 1 1 1 lO CATTCCATCGCAOAGTAGT 
>Sequence 1170

GGTACCGCAGCT AGGAATAATGGAATAGGACCGCOCTTCTA I ITTGITO G 
1 1 1 1CGGAACTGAG 

GGGAAGTCATAACCCACAGATAGATCAACCTAAGAATI 
CACTCroCACCATOCAC<MCAAAC ATCTTCIC AACCAClCAACOTAOAAT 
GCTTGGG AAA TAGTCA TAA TTACCCACATATAGTAATTAAT AG ATGGTAA 
TTAATTG ATCCTTGATGTG A IO 1 ICI I I IO CATA I I ICCI IC ATTCTAAA 



GGG AGGCCA GGA CA G A TCC CTTC*. GG TC- 1 *. GGA GTTCG .*. G A CC*. GGGC A GG 
CAACATGCX^G A AACC^TGTCTC^CT AAAAA T A CAAAAATT A TGGTG ACG 
CCTGCCTGTAATCCCAGCT ACTCXXKjAGGCTGAAGCAGGAGGATCOCTTG 
AA CCCATG AAG TGGAGACTGCA GTGAOCCGA T ATCGCACCA CAACGCTTC 
AGCCTGG TOGA CA G AGTG AGACTTCA TTTCAAOAAAAAAA T AAAATTAAG 
TT G I r C TC I 1 AAGAAAAAAAAAAAAAAGTCCTTGGCCGGCGGCOGTrCAA 
AAG 

>Sequence 1171

ACAGGAGG AAT O 1 1 rGGTTGGG AGAATCACAOCTTT ACAAGGGTCTTTAT 
ATTTGA TT1 G I G I T T ATATTTOAGGCAGGT ATTGTAATATAAAGGAATCC 
ATTACCATGTOCTATAAATGACCTCTAGCCATTTTATGATT A 1 G 1 1 CI CT 
GTAAAACTCTTCAAGA CTTCAATG AGAAG 1 T 1 C 1 1 1 ATAAGAATTATCTT 
CTCATACCTTTCCI I O I GA AG AGCGTATTCTG I 1 1 1 1 CTATCAGTTCG AC 
ATGAACrrCCA<UTCACATGCT G TT Cn T T C TAGTTACATGATGTOCCrT 
>Sequence 1172

GGTA CCA ACCCT ATTTTACAGATGGG AA AACTG A GGCTCAG AG AGGTTAA 

ATCACITACACAAAGCCACACAATTTTGAGTGCCAGAGCTGGAATGTGAA 

TCCAGGCAGTCTGACCCTGCAGCTTATGTGCTTAACGATACTGCCTCTCA 

TGTGGGCAAAGGATGGCCCACGAGAAAGGCAGGCCCAGATTCCAAATCTG 

GCTTG ACCCTCTAAG AGGCTGAO ACTTAACCTCTC TGAGCCTTAGC I GI T 

T<^TCTAGAAAGAGGACXriCCroA(^GCTGCCTACTATOG1TCTTATGAG 

GATAT 

> Sequence 1173

ACGAAGACAGCATOCTTCAATCCCGCCAGCrCATGTGCATCTGAGGGTGG 
GGCTL I G I C M CATGCrAGAAACCAAACTGCTCTCACAGCrnrCTGCTAA 
A TC ACCACGGCT A ACGG A T AA GCA GAG A CGG ACT ACCCGCG T ACC 
>Sequence 1174

TTCTrAAAAGGCGAATTTCTATAAACGTGTGTATATCTTGAACAGATGAG 
CAGCTCTGCAAAGATGTGTATAACTGCATTTGAAAAAGACAGTGAAAATT 
TTGGGTTACTGTAGATGTCCCACAGTCTGGCTTGGAATTrAGTTCTGTGA 
CT A A A GG A GGCTTA CAGTTGCICX1A ATTTTGGTTCTG TGGGGT A CCTGCC 
CGGGCAGCCGCTCAAGGG 

> Sequence 1175 

GGTACATGGTCACAACAGATGAGCAACTGATATCACTCACACATGCTATT 
AAGAACTGTCCTGTGATAAATAACAGACAAGAAATTCAGGCATCAGAAAG 
AGCCACAGGTA GA A G AGTTATGG ACAGTCCA G AGCGTCCA GTTGTAA 
ATGCCAATGTCTCAGTGCCATTGATGTTCAGAGAGGAAGTGGCTGAATTC 
CCACAGGAAGAGTTGCCCGTTAAACTGTCTCAGGTGCCAGACCCTCCAGA 
TAACATGAATCTGGCCAAGAATTTTCCAGCACATA I TIT I GAGCCAGCTG 
TGTTGTTAACACCACC 

> Sequence 1176

GTGATGCTAGAGGTGAT Qri 1 T T GGTAAACAGGCGGGOT AAQATTTGCCG 
AGTTCOCCOCGTACCAATGAC I G O 1 1 CCATGATCCCCTAAGAGAACACAA 
CTTAGGAATGTGGATTCTAATGATAGCTTTATACTGCTT AGGCAAATTTA 
CI ICI GAGCCTTATGTGCCTrCAGTGGTGCAAGCAAAT* 
TTAGAGAGGTTGATTAACGAGTACC 
>Sequence 1177



TTTACACT 



GGTACACTGAAGAATTAAGCTGTAATGAGG<2AAGACGCCTGCAACTTATT 
CTTTAATAGTTCAG AAATATTAACAATTGGGTAATTTGGGTGAAAGGT AT 
AAGGAGCTATAAATGTTATTTCIOCAACTTTrATGTAAATTTCAAGTTAT 
TT A AAA TG AAAAGTT AAAAAGTTT AAAACAT AACAG AATAG AACATAACC 
TATTAAATAAATCTGAGTGCAGGCATTJACACAGTGGTTCATC<XrrGTAAT 
TCCAGGG AGGGACTGGG AGGCCGAAGTTXKjCAAATCACTTG AGGTCAGG A 
G 






>Sequence 1178

actaaa ttgttttagaagcaaactacaggacttaaaaaaaggtgattttt 
1 1 1 1 1 io gcttx ^ agta ggcacttatigtaaittttattcatgctatgaa 

CTGATGATTTTCCCTTTATTCTCCI 1 1 Xj A TCCT A CTT AAATAAATTTATA 

gagtattgaataatatagaaccaagataagaaccctaagagactttagat 
gtttatttgttcattagcactctgagtacc 

> Sequence 1179 

GGTACTTTN 1 1 1 11 11 H 1 1 1 1 11 1 1 1 1 1 1 U1CI 1 1 H 1 11 1 1 II I I I 1 
1 M M I IT 1 I 1 1 I 1 1 1 1 I I 1 1 1 1 1 1 1 1 I'll ATTAAAAAAAAACTGCCTCC 
TTT AATGGCAGG AATACAA T 1 CC I I GG I I AAG AG ACCCOOGG AAAAAGGC 
AGGTGACTTTTTI GGAAAA CAAATTTGGGA GTTTA AA AAGOGTGTAAA AT 
A i rCTTCCGGOGAI rTTTTGTAAAAATACAGTTTTATGl 1 1 TCI 1 1 1 1 1 1 
GCGACACCCAATCTTTAAACTCTTGAAACAGGTTTTTCCCC II I II 1 1 1 1 
ACAAACCCTGGTTA AAAAACCAA 1 1 1 1 I 1 1 1 1 
> Sequence 1180

ggta c i ni ru i im i n i i i 1 1 1 nn - mcn rrn ri TTTTi iii 

1 1 1 1 1 1 1 I 1 1 I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 C CCCCAGGGGATTTTAT 
TCXXTTTITAAAAAAAGGTrrrTAACAGAGGGTTT TTTTAA 

CCA AAAAA TGG TTTTGGGGGCCTG AAA TTT AAAAAAAAATTTTCAACCGG 
G ATTTTGGCA CGGGGGGGGGGGAAA AAAAAGGTT AAAAA AAAAAACCCTT 
AAAGGGACCGGCTTAAAAAGGGG 
>Sequence 1181

ACTT AGGCTTTCA T A A AAAT ACAGCAGGGCA AG AGO ACCA AG ATGO AGGC 

AGTG ATCAGGG AATCTCAATG AGGGTG AGACTGCG ACAAAG ACTTG AA A A 

AGGTTXiAGAAGCAAGOCTTGTCGGTATTTAGGGTAGCAGTAGTOCAGGCA 

AGGGGAACAACTAGTGCAAAGGCTCTAGGAGCK^TCTGTTICAAGTGT^ 

TTAAGAACAGTAACKJAGGCTAGTATGGTTAGAACAGAATGAGCAAAGGGG 

CAAAGTGGTAGAAGGTGGGATCAAAGAGGTAATGAGGCCATTGTGGAGGC 

CCATATGGACTATTGGN 

> Sequence 1182

GGTTCTAATGAAAGCCAGATAAAGGGATGGACGATCACAAGGTGAAGTCC 
CACAGTA GGCT A TTJTTJCAA GCTG AGG A GCAGG A A CCGCC A GTC A AA CTCA 
AAAGGATAAAAGGG>KX3GAAGCCGACAGGGCAGCCITCAGTCTGTGGCTG 
AA GGCCCT A G A GCCCCTGGCG AACCA CTGGTG TAAATCCAAG AGTCCAAA 
AGCTGAAGAACrrGGAGTCCAATGTTTOAGGGCAGGAAGCACCCAGCACG 
GGAGAAAGATGGCCGGAAGACTGAGCCAGTCTAGCATTTCCACATTCCCC 
CGCGTACCTGCCCGGGCGCC 
> Sequence 1183

A C I H I CI I II I O I GTATTACTTTTCACTTAGCATAATOTCCTCCAGCTT 
CATCCATAGCAOCTTCATCCATAACTTCTGGGTOTAGCCATTjGCAAGGCT 
AAACrGATATGGCACACTGGTGGGCA lGIC'1 ICI GGAG AGCTCCTTCCAA 
CTCTTCCCTGTTTTAGCTAGTCCTCAA 1 1 1 G 1 C I G ATGTCTGAACCCCAC 
TGCCAGAGTTGAGTCTTGCCTGCTGAGTT^TGTCCAGACro 
AAGTATGAAGCATAACTGGTGTTACAAACACCATCTTCAGAACAGTGATT 
AACCTTACGCT 
> Sequence 1184

ACXy OGGGG GA AGCTX ATTCTATACCOGAAGAGCAQTCTCACAAAGCAAGA 
TTA C I1I1G1GI1III1 AAAAAATGATTCTTTAATGTA I 1 1 1 1 C I A AACA 
T7CTGATTG<3AAGTAGTGC^TTCCTAAATOATTCCAAAGTCATCTGTAAT 
TCTTCTG I 1 1HOIU IUI IC1GICI 1 1 ICI 1CAH 1 1GOCTTTGGGTGG 
GGGOA OOGOC AGGTOACACANAGGA 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 AATTTTT 
GGAATCTTTTCCAATAAOCAGCTAAAGATITGCACTGAAATACA^ 
ATGCCTTTTGCATT 
>Sequence 1185

ACTCCTGTA I 1 IGMCI 1 ATG AAATGACTATCTGCC I IC I COIATCTAGT 



CI I ICI ICI OICA AATTATATOOTTA I I I 1 1 1 A 
TATTACCACATCAGCATTATATTAAAAGTGTrTTTAATAGTTGAATGTAT 
TTTOCCAACTACTAGTATAGACTCAAATTTGCTATTTAATTTTTAAA^ 
CAATTTAT/rTTGTAAATCCTTTAAAAAATATTTGGTTAGTTTTGGATTAG 
AAATGATTTATGTTAGCCATGTOTTCAAGATOAAATTGGGATCAGTCTAG 
ACGGTGCTGATTG 
>Sequence 1186

ACAT A TCCCT A TCT A CT ATGT A AA G ACAAA A AGGCAA ATG AA ATG A TG T A 

ATACAATGAACTOCTCAGAAAATAAGCTCTGTAAAATCTCAGACITXXrrG 

TTTATCATATGCTAGAGTAAACTTACA I I C C 1 1 ICI I G TTAG AG AAAAAT 

GATGGTAAAATCCATCCATTAATCAAAACTAAAAACATGAAAAGGCAAGC 

CAACTACAAGAGAAATACAGTTGGCCCTTGAACAACACAGATTTTJAACTA 

CATG AGTCCGTGTA CC 

>Sequence 1187

ACTCTCAAATAACCTGTGAGTTGGGAAATTCCTCT C CI C TTGAGGTCCCA 

AGATGGCGTGGGGTTCCrGGGCCTGTCGGAAAGTC<^ 

CACAGG TCAGGAACCCTGCACAGG AACTG TCT AG ACAAGGTATGAGGCCA 

GTTTTCCC AAGG AACTTTTA TTGGCTCCA TAA GTCAAG TTTGAGTCCTTA 

AAGG AAA GCA CA CCA TTCXXIA TCAAA CnXXTX3GTAAAACAA.CTAGTTTCT 

CT AATTG TG TCC TG TTGCAAAA G AAAACAG A TTCTT ATTGCACTTGTGC A 

AATG 

>Sequence 1188

ACA T A TCTT A CTTG A TT A TTTT A TTTTCT A TCCCA CCAATCCACACCTTC 
ACTXK3 AA AGTAAGTTCCATAGAGGCGGAGACTTTTGTCT ATTTTGTTCAA 
TG AAC ATCCCA AGCACCTAG AACAGTTTCTGACACATAA GA AGTATTCAA 
TTATGTTXrrGGCTGAATG^ATGAATTAATAAGTTYiAGATTCGATXACr AG 
TTGAAGTATA AATATATATTTTTGCAAGAATAAATGCTACAGT AACTGAT 
TATOACAGCTAATTCT G TG1 ACC 
>Sequence 1189

GGTACAATGGCATAGTTGAGTAGTCACXACAGGACCTAGCTGAAATCCTA 

AAATATTTATTATCCCTTTATAGGAAAAGTTTGTTAATTCCTACAATAGA 

CAACGAACTATCA GAATCTATCAT ACA^ 

GT>WAKTGTNTTT>fTTACTGTGACCATGTNNAAAAA^ 

ATTAATGACTGTTTAAGTG<TTAAAACGATG<XTGGAACATAGCAAGGTC 

TGAATAAATGTTAGCT 

> Sequence 1190 

GOTACACCTGGTTTCACaGAAAACAAAGCAACTCTTAAACACCAGCT 
AAAATGATAGGGC 1 1 1 ICC 1 1 1 G AA TT A GTCA CCACAGG TGNG AAAGACA 
GAATCACTAATCCATCTTjATTAAAGATAGACCTIGGAGAAATCAATG^ 

ttatttacacacatgacgactggtactggtccaaggctactaacg^ 
tacgtgagcaaggoctc^gacttatc^cagagggaggaggagggatgcc 

TAGCXIXXjGCGT>JCGGTGGGAAGGGCAAGAGGTAAGAGACCCGCGAGTX>C^ 

ggggagatggg 

>Sequence 1191

GGTaCTTCTACCA 1 C 1 1 1 rCTCTACl 1 1 CO igacttaaactgocatctgt 

GATACATGAGGACITACCTAAAATTjTCTTjAGAACTGACIT 

accaatgttttggagtttataaagctcaattctaacaga^ 

tataaaaataatcttaaaaaataaaatatgatggtataotaataaagtaa 

aaataaatatggt 

> Sequence 1192

acaaaacaaatctgaaatatcttattaaacaagaaagt aaaaatgttatc 
aaaaactactgtcgtcttjatcaaaaagatttjagaagccaattt 
ctt^acactggacacaaaaataatttgagcttcaa^ 
ttaaaacacataaattgtgttaaaatocacaacttt^taatg^ 

AAAAAAAAAICI IO I IGGI 1 1 CC 1 C I AGAGGCTACTAGAAAATCAGCTCA 
TTATTTCTGATA fTGGTTl AAATAGAAGAAAGAAAAOCAAOCATC 
> Sequence 1193

A CI 1 1 1 1 1 1 1 1 1 1 1 1 IU 1 1 1 II M l I I I n I rCATTCAAGAAAOATAAT 
TTTACACTTA TTCTTTGAAAG AGAAATTCTA TGGAA J 1 1 ICnCTICI AA 
TTTAATTOCAAAATACATTCTCTCAACOCTATCCOCTCATACTACTAACT 
TXlATtXriTACOGGGTAAGTAGGTAGTACTAAAAGAACAGAACGGGAAATT 
GGGGOAGCA GA AAACGGAG AAAAA GAAGAAAA GGGAAOCTTCT AGTTTCC 
TAATAAAAAACCTAOAGAATTCCATTOCTGAAAATTAAAOATATT 
>Sequence 1194

GGTACA T A TACA TTA TGT AA TT AAAAAGCGTGCA TGTGT A7GTA TT AAAA 
ATAATCCTATATAAACAAATACAATATGTACAATAAAACACCTAAACGCA 
GAGGCTGCT G TTATOCACAATAGTAATAOCAATAGAATAAATGATGAGTA 
TGTATACaCAGACAAAAGCACAGGACGTATTAATAGGCGGACCCACAAAA 
GCACAGCATA AGCCAACTAGTGTAGCGCACCGGT AGTGTTGGTGGTGCGG 
GATCTAACATCTO AACGATGAAAATAAAG ACA gcoca tcccgg ACG ACCA 

cc 

>Sequence 1195

GGTACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGA TCCAGGTGGT 
TTAAACGGCACO 1 1 111 I OC I I ATAAAAAAAGTGCAAAAAAGATGTGOTT 
TACAAGTTAAAGCTAOCAAAAOCCI 1 1 11 rOCTOOAATTOCAOCAGTTTT 
AAAOC C I I I I O GACAGAACCAA I 1 1 I I CGI 1 l AAAA Cl IIOH 1 1 ICItl 
AAAAGCTTACAGGGGTTTGGCTTAA I C 1 ItXl I C CCTTTTTCAAAAAACG 

>Sequence 1196

GGTACTAAAGGGAAGTTGCTAGGAAATAGAGCATGTAATCTTATTGTTAA 
TTA TTXIAAACCA TGCCAACA CAGTAAA T A TTA TOTCTCTT AA 1 1 1U I CI I 
TCA GTCA TTTTTTGGCA TGA GTGTT A TGGAA GAGTAAACAAAATTAAA CA 
CAGTGAACTCTGAGTCATTTGCTACCCG AGTTAGTCA 1 I IIC1 H fO AAO 
GTTGTCCAN ACAAACA C 1 H 1 1 1 M l 1 1 1 IAI 1 1 1C1C CCCC-|C 1 1GIOT 



GTATTT 
>Sequence 1197

ACAGGAAGTGTCNCGGAGGAATATATAGAAAACTXjCTAGGCTTAATTCTC 
AGAGGGAAGATTGGGTGTTTGGAGTGGGAAGCAAACATTTTITACTGTAT 
ACACTTCTAOC 
>Sequence 1198

GGTACATOGCCCGCTCCCCCGTCCATTCCA O 1 1 1 llX lUC CCTCTACrOGC 
CA TGA OGGTCA TCA CA GTGCCCTCCTCA TTCCT AACTTTT AAAT ACACTT 
GACACCC<CTGATTAATCTTGCACT ANGAAAAACAGAACAATACAAACAA 
GTAACAAAAACAAGACACTCACATACAATXjWI 1 1 1 AATGCTTGAAAAGT 
>Sequence 1199

GGTACCACA l TOC 1 O C 1 CAGAAACTOCTCA C 1 11X 1 1 A AA I rGTCTTTTI 
TOOOOCAGOGTGAAATOTATCCATTTATAACTOCCTATT G CCTGrTCrAT 
TAGCATCCAAAAATGTOGAAGGCCTOCCAACCACCATN FCTOC rOTOTNC 
TTAG<SATGTCCAG>IAAAAAAT ATAG ACCTAACA C-GTT A TGTTATAGAA TO 
GCTTTATTTACl 1 1 00 1 UACTGTTTATGAGTTTTAAAT AAAAOACTGAAC 
A 1 1 1 ICIOO AAAAAAAAAAAAAJIOAAOAAOAAAOTACCTGCTDOOOCCOO 
CCGCTCGAAAG - 

GGTACTTACAAAAAAGCAAGAGAaAAC*GTGGTTAAGGACG<J I OAJL' 1 C I O 
GAGOCAGA TTGTTTGGGTTCAAA TCCTTGC I ClGl CT Cn ACTGTGAOGA 
TTTTAGGCAAATAAOCTAACCTOGC 1 0 1 GC C ICAGTTTCATCATCTATAA 
AATGGAATTTATAATAGAACCTACATCATGAGTTGGTGTGAAGATTAAAT 
ATATTTATATCOCCGCTGGGTtKXKnXXXrrCAACOCTGTAATOCCAGCAC 



CG 
AOGGAAG AGTAAGTGGGGAGGGATCGGAATGOI IO C I 1 GAG ACAATCTTT 
TACTACAGTAGATGCTTCATGGATGGCAGAGTAGGGACTCGTGACTTATT 
TATAOO CnCTCl 1 1 1 A AAAAAGGACCCAl I IC1C1CI I U AATOOTOTGO 

> Sequence 1203

g oiocmmih i nm i i i iiiii i iiec ii i ii iiii ini i iiv rr 

rTTTTTTl I H 1 1 1 1 1 I TT 1 11 1 1 HI 1 1 1 1 1 A ATTCAA A AAACA 1 I I 1 1 
TAAA C1 1 1 1 I 1 1 lO CCAAAAA C 1 1 1 ICI I IOG AAATOCAAAAAA TAAA AA 
GO I ICC1 1 1 1 I C I CCCCTTA^C«GAGCTAAAATTTTAAAAAAAACTTTT^AA 

ACTACTCCATTCTCAC ACTGCTATG AAG AAATACCTC AG ACTOAGTAATT 
TATAAGGGGAAGA GGTGTA ATTC ACTCaCAGTTCTTjCAGGGCTCGCGAGC 
cctcaggaaactt ataatca TGOCAG A AGGTG AA gcaagca TGTCCTTCG 

catggcaatggcagcgagaagtacc 

>Sequence 1204

C-G T ACTTTTTTCCT ACAAA TGA GT AA TTC AA GA ATTTTGTTT A GCCAG AC 
CATTTAATTCTCATCAATTCCATAATATTTCTAGTTAAATCO 
TTCTATATTAAGTAACATTTTATTCAGA TCCATATCTAAATAGCAATTTT 
GTGAGATTTACTAAGAA 1 1 I 1 1 CCTGGT A TGTATGO 1 1 1 l O OTCTATTGG 
A A TGTA CCTGCCCGGGCGGCCGCTCAAGGG 
>Sequence 1205

GGTACCAGAA GCTAA TCCCCACCGGGG TTGGTTTAAA t agggactaact a 
CTTTGGAGGACATCOAAGATACCTCAAGTTTAAATGCTTATAAACCAAGG 
CTCAGCAA T A TTCTA GTT AA T A CTCT AGA GGAATCCTTGCACA GTGCCCA 
AGAA GGTATTAAA AG AATGTTTATTCAGGTGTTATTTGTCATAGTG AAAT 
ACTGGAAGCA CTGT AA CGGTCCATTACAG AAO AA CGG A T AAAAA CTA TTG 
TO ACTAATTTATATAACAGTATAGCATACGGCAG AGAAAT 
> Sequence 1206 

GCATGGTGGTTTGTGCCTT™ 

A GCACTGCTTG AGCCCA GG AGTTTG AG AAT ACAGT AAACTGT ATCACACC 
ACTACACTCCAGCCTGGGTGAGAGAAGAAAACCCTTnCTGAGAAAAAAAA 
AATTAAACTGA GATGCA I I I LXXXXXTTTTT ACACTAAG AAACAG ACCCTT 

GCAGATGCAGCTGCGATATCAGAAGAOCOOG 

> Sequence 1207 

ACCrTGATCTCTAGCAACGAGGGAAAATAAGAAAQATCAAGATTATTGTO 
TCTAAAGAAAACTCGGAATATATATACTT13ACC©G^ 
TTGTCTGTCTGA 1 1 CI 1 CCAGGCATT AA TTAGAATTTTXAACTOCTAGCT 
GGGCACAGTCKJCTT^TGCCrrGTAATTCCA 

AAAOCGCATCTCTACTAAAGGTACC 
>Sequence 1208

GGTACCCATATTGCTAATOCrAGGATGAAGATACCACATAGCCAGAACAA 
GAAGTTGAAGGTAAACATAGAATATTTTATACAOGCACTCACACCTGOCA 
TTTCGGAAAAGGATTAGGAATCCAGATGOOGTGAATTTA^ . 
CAOGCI lOlOLIOCAATATOCTCTGOAOCAA C I I OCC I UCAGAGATTTCT 
GTATCCACGG ACA TTT AAATATCC^^AAC^XTTATCTCCAGC^^GTOTATOT 

rcci rroci ioica toccoooot 

>Sequence 1209

ACC<XKKXXlAOCTCTCCATTCACTA0GTGCOOO0GOATOAAOOOO0 1 G i r 
GGGOCTAAACCACACTCTGGA A TTCTG I CAGCAAATTOCTCGC I GTGTG A 
ACTTCAGCAAGCCATTCACCI 1 ICI 1A AGCCA I 1 1 IC I 10ATATTTCACA 
GAGCCTCACCAAOTATTCAACXiAGAACATGTAAGTGAAATt^^ 
ATCOCTGGTAAATAATAGATGCTTAGAAAATGGTAGAGAGAGAAAAGAGC 

A GTCTCTGCCCTTT AA TGT ACC 

>Sequence 1210

GGTAC ATTOTGAGAACTCTGCAATTATTATTTTTATTTCATTATTACT 
ATTTTATCTOACTAGAAOCCATTTATTACCAAA^ 

GTTGAAAACCC- 1 C TO IQ ACAAC C1 I CTC1 b GCCTGOATOGAGATCCAGOG 
Cm 1 1 1 1 1 1 1 1 1 rGAGGCAGA GTCTTGTTCTOTDG OCAGOCAOOAOTOC 
AGTGGCACGATCTCTGGTTACTGCAACCTCCACCTCCTOGGTTCAAG 
TTCTCCTCCCTCAGCCTCCCGAGTAGCTGGG ACTACAG TC 
>Sequence 1211

GGTACTCCTGCCAAGAGGG^OCACAAGTTCAAGCTGAGT aagggggaaatg 

AAGGAACTTCOGCACAAGGGGCTGCtXZAGCTTTGTGGGGCATTCCAGAGA 

ACCAIOIOCIGTGAGGGCCTTCCGAGTOCA I CI GI I 1A ATCCTOTCATTG 

GAGACTTGAGAAACCACAGOCCAGAAGGGAAAAGTGATTGTCCCAAGATC 

ACACAGCACTXKlAGAAAGTGGATGAGGAGOGGCTCAAGAAGCTGATtJGGC 

AGCCTGG A TC AG AACAGTCACCAGCA GGTGGACTTCC 

>Sequence 1212

ACATACAGTTTACATTTJTGGTAACAAAGTAGGACATGCTATGAAGGGCCT 
TTGAATTCCCTTGACAAGAATGACAGAGATCTACTAGACCCAATTTTTAA 
ATAATATTOCTO OI 1 1 1 IOCIL AACATGAATTAAAATATCGTGGCTAATG 
TCCAGATTTrACATTTGGAGAACTTTAATTTTCACTATTAATTAGAA 
GTTTAATATTACAAATGCATTTAATGACACTT aaaattgtacc 

ggt accaa t aa c<at acct acagttcag attttgg tttctaaa tgccatt 
ctccaattaaaaaggaatcaaagcacctcaoataaatgtttaattccagg 

GCTGGGGCaGGGAAaCTGAAAGAGAATCACAGAACATCCTGTAATGACAG 
AAAAAAGTCACAATAAATGGTGGGATTATCTCAAAAGGACATGGGATTCA 
ACTTG AAA GA TCTTCCAA T AGCCAAATCTG AG AAAA GTT AA GCAACAAAA 
AAAATAACAAAATCTTATAATCTATAGAAAAAATATGAATGTATA 
>Sequence 1214

CCCrrAOCX>GCCOCCXXX}GCACGTA C l n I fTTITTTTTn 1 1 1 1 II 1 1 1 
TAGAAATTC^XXXXAGTTTATTAGTCACAACTC<TCACAC<XjAC<XjAC<jT 
CACCACATGGCATGCGGGGTCACAGGAGAG1TGCATTTGGGAATAGAGTG 
AACCAGTAC<jGGCrCTGGAAGGCAGC<rrTTC<AGT AACAAGAGGAAGAGG 
CGATTCCTOGCTCCTCCAGATCTOACAGO C HO I I I G AATAATTTTOCAG 
GCTGGAGGGAAGTGA GOCAjCGTTGA g acccaa GGAGGCT A CCTCGGCCGC 
CAOCAOGCTAG 
>Sequence 1215

acaatt aattct g ttc 1 1 g i GA CCTG atga itttttgaaaai n oc i 1 1 1 

CTCTTT AAGAAATTTAAGTTTTCAAGGC<CCTATTACnTATCTAAATATT 
TrXXjOCTAATCTTGACTTATAAATAAATAAAAATTrAGAAATATATTCAT 
GATGACAA 1 1 1 i Ul I ACTTACACTGCCTA TTCTTT A I rTCTTTTl IA GTT 
CAAAGGTGAAATTTTGACCTT7Xn"ATTAACAAAGCCTCAAGAAAA 
ATTCTtXXnTTTAAACAIlGL.il I tULMOCATT 
>Sequence 1216

ootacatogaggaagtggaggtaaatccaaacogaoct^ 
^ncfgaactcacatca0ctagcactttgatogttcaacaaac(mgocttt 
aa tacc oggcrgcaocatcaooga 1 ul oc i uatccgacatcctggcctgt 
aagcct attq(x gatctqgactctagaataggatttggcr(iatatoccta 
0*^tt acttc tgocoacoggcaagttattg<1atcagai i ttxb i atagtac 

TTOCCCT CICACTGGTOGAGTCTTACCATGT 

GGT ACCACTGTGCTCT AGCCTrGGTGACAGAGCGAGA CI G 1 C I I AAAAAA 
AAAAAAAAuAATAAAAAAGAATTTATTAAAAATTTAAAAAAAATCAAAAAA 
AOCTGCATGCl TOGl nTTCTTTl 1 AGTT ATTCTACATTCTTGOCaTTAT 
TACCAAATATTGGGGAAAATACAACTTACAGACCAATCTCAGOAGTTAAA 
TCTTACTACGAAGGCAAATGAACTATGTGTAATCAA CCTGGT AGGCA TTA 
TTTATTGAA TT A TCA TT^TTCCATATGTCCAGCACATTTTTAATAGGAAA 
GT 

>Sequence 1218

GGACAATCTTAAATATCTGA C I mCl ATDATTTOO C 1 1 II C I OC 1 1 O AO 
TAACTATTTAAATA I C TGCO I XiAl C 1 1 C 1 1 1 A ATTTGGGCTA CI 1C I AGA 
ACAAA AC AGAGCT A TTTACA ACAAA CCA crTTGCXlACAGGGOCTTTCAAOC 
GTTTACCTA AG TCAAGTG T AA TG AAAAA CATAACCAAA TGCA CCA TGGGG 
TTTATraTTAGATAATAAAAGGCTT AAAAA GCCCCTAGACCCTAAAAATG 
CCTGC<iATC^ATGATTC^TGCTCATATGCTACTTCAGCATGTA 
>Sequence 1219

GCTAC CI 1 1 II 1 1 1 1 1 1 1 I I 1 1 1 1 CG TCAAAGTCACTATTTOGGCOCTAA 
CATAATXXTGCTCAGAGCGACGGAA A AAAGGCAAGCC 1 1 1 1CAAACATAA 

CICIC1C I acaagccagctattatggcaagggaaaaaag aaagcatctag 

ATAAATATCTATCAAAATTAACTTT AAGAGAAATA C T C T C TTT OC TT A AA 
AGCCCTTATTTTTTAAGACACTAGAAAATAAGTTACTATAAAAAGTOGTC 
GTCTGGGCGCTAAAAACAAAACAAAAAAAATCC I C 11 1 I C I ACATTTTTT 

acttttc 



GGTACAGAATTATCAACTGATTTGGTCAO I IOCI ICCAAIGCIGOTIOAT 
TTCCCTCATTGTGTAAACaTTCACAGXjTATGTGACAAATGGGA^ 
TCCAAATAATAAAGTG ACATA ITOGTOTI CATAAAAAAAAAAAAAAAAAG 
AAAAA AGAA O I CC I 1 1 1 1 1 1 1 1 I H I I I 1 1 I I I I I OI I ACTTAATAAAAAA 
ACTGA GTTTTA TT I CACATGTA ITT I GTTC<r>rrCCXXACCTTn CCA TGT 
TTCACCACCGCTACTACTTAGTCCTATCATAACATTCCATACATACTTAA 
AACC 

> Sequence 1221 

GG T A CCTG A GCCAGGCCAA TCAAAGTG TTTCTCAGG AA TT AGGAATTTCA 
CACATAAAACCTGCAGAG ATAGCACATG C I C 1 1 I C I I IC1 ICI I O GACTC 
TCAGCTGTACCT GC OOGGGOGGOCOCTAAGGO 

> Sequence 1222 

A cii i imiiMimn i iMiniiiAiiniu ii inn i uni 

III I l AACAAACC CIOl ICI 1 OGGTCGO I O I OGGT A TAAT ACT AA GTTGA 
GA TGATATCATTT ACGGGGGAAGGCCC 1 1 \ 0 1 OAAGTAGGOCTT A 1 1 ICI 
CTTCTOCT1 1 C CT ACTGGGCTtX? AATACCTAAAACTACGTGT AAAA TGTA 
AGTAGTGACCAA TAGAAAATAAGGTT ACCTT AAC 11 CC I rTTTTOCTO GG 
GGTTCT AGGACAAAA GT A taggcattgccaaggtcaaagaatcaa 
>Sequence 1223

ACA CTGAACAA TTTGTT AA GA T AGA TCTCACC I IP IOI I CI IA CTGAAAA 

AAC ICC 1 1 lllll rCCl lACATCCAGGAAAl I IOC IO I I IAI 1 1 fO AAAA 
GCAAATTTAAAOCTATTTA AGGGAGAOAOAGCTCI I O I AAAAA TTCATTT 
ATTAGTTCTGGACCAATGTTATTTATAAGCT ATTATTTCAAATGATAAAA 
AATAAA TGCA TAA TACA TTTGATGaTaGAACA I 1 1 I ICI 1 1 1 1 A 

GGTACTTCTCAAGAOCTCACrn 1ATCTGTGAAATCTG0GCAAGGTTTAT 
AAGTAAATGAATGAGGGGTGAGGTTGTAOCATAA 1 00 SO C I TGAAGTAAT 
ATTTOTGGATAGCTAAAAGCAATTTTTGGTTTATAGGTTT A1 ICI I IGG T ■ 

ttattttcgtattoatca i rooi igi icq ra i attoaaat* otcataaaa 

TAAGATCTTJTATGAATTGTATTA 1 C IT CI O' I ATCTTAAGT ATTAGTGTAA 
AT ATTGTA TTA TGGAT A T A TGA TGCAGA TTCAGATTCA TATAT AGACAGT 
ACCTCCO OGGOC OGCCGCTCTGAAGGCTOGATTTTCAAAAAAC 
CO I rTCTACTIGGATOCGAGACTOGAACT AAATOCTGGCGT fGTC ATDGA 
CATAGC1 1111 1 COCO 1 OTGAAAGTGTTAATOCGATGACAATTCCACACT 
AACATTCTAACCCGTAAGCATATTTTTTAACAGCCITGTTaTaCTTGAGG 
TTOTGCACCACACTCACATAAATTTCTTTA* 
CAOGGTTAAATATATAA CIC I 1 1 U CGTCTTTTAATGGAACATTGCTACG 
TTTCGCGTAGAAGG 1 1 H I 1A TCTCTGTTAAOGCGTTTA ITTC1 IA TTAC 
AACOOOGGTGATACOGCCTATTCCCACAATATATGATTCTTTCCAOCCTC 
TATAGTAA CICICUIA AAIIOOI 1 IC TGCTGATGI 1 I IX I AAAAATCTA 
AAGGAGACXJCCCTGTTCTGAGTT ATTTTATTATXjTATCCATCOCCCTTTT 
CTCTGTCAAATAAAAOOOCTTCATAT 
>Sequence 1225

GCTACATCATTTCATGTATG 1 1 1 I IQH I 1 1 I 1 AACATAAAAGOATTATA 
TCCTTTTCCOGCAO CTO TT I I CACTCAATACATTGTGAAAATATTTTCAC 
ATATCTTXX^TGGCTITTCTATAACATTTCAAATCACTCOCAAATA'TTTCA 
CTCTATCATCATCATTTAATATTATTATCAATTTTKn'ATATTTAACTTAG 
A ACTT TTOCATTACCATA AACATCATTATGAATG AG C 1 I ICI lOAACTGT 
ATTTTA ATATACTTCCTTAGG ATA A A TOCTTA A AG T AATAA 
>Sequence 1226 

A CA TATACACTATG T A A TT A A AA A GC C TGC A TGTG T A TGTA TT A A A AATA 

atggtatataaacaaatacaatatatacaataaaacacctaaacgcagag 

gctgctgttatccacaatagtaataccaatagtattaatgatgtctatgt 

agacacaaacaaaagcagcggacgtattaatatgcaaacacaca aaagc a 

cacaaaagcaaagcaaaaagcaogccagtaatgttgtggatgcagtttca 

agatttgacttatagagataaatccagtagac^ctgaaatagattttog 

>Sequence 1227

ACCCG ATA TGTATGTIG AATTAAG AGG A TTTTTAAAAATGACCCTTAAGT 
CTTTG ACATAACAGCCCCTGTCA C 1 I CI I GIC ACAGTTTOTATGTGTTGT 
T AATGG A A TGTCTA TTTCTTT AAA G AGCA G AG AACT ACA GTT A CAGGGG A 
TACAGTXjTGAGGGGTGACACATTGCTGGATTCTGAGCTCAGGCAAATCTG 
TCTGTGTCTATATTAATAGAGGTCTA I L I II ICI 1 A ATACTG AATGCAAT 
GGACCATTCCAACCTAAGTTATCCTTGATATCCTGGGATTACAATAA 



GGTA Ll I 1 1 1 1 | HI I 1 1 1 1 1 11 1 1 1 1 II 1 1 1 1 A G AGACAGAGTCTCCCT 
G TGTTGCCCAGGCTGG TCTCA A ACTCCT A CGCTTG A GCA A TCTTCCCCCT 
TGGCCTCCCA AA GTGCTGGG ATT ACAA GCA TG A GTCA CCATGCCCAGCCA 
ATAA.TGATTTCTTGATTGAAGGAATGAATGAATTAAAAGGTTCATCTTTG 
GACA CAAAGGCAGACAAAAGTTTC ACAAAAGGCATTTTTGAAACT AGGAC 
CTTTATTATAATATTAGTCTAAACAGGGGGACCCATGAATAAGCATGG 
>Sequence 1229 

CACAGAAAAAAAAT^CTACACCAGGTAACACTGGAGGATGCAGGGCTACAT 
TTGCCACTGAAG AAACATTO 1 1 C I C 1 1 OCATCT0AATTOCAOT0CTTTCC 
AAATAGATGCGTAGATGATGAAAAATGG AGCAGCT TC 111 I ATTTCTTCT 
TCTTTOCTCCTTGAATTCTAGTA C 1 1 1GI GAACTGTFGAGGTGTCCCTTC 
CTAAGTCACAATICACACTGATGCATACACTATAGTXIAAACACTGGCTTT 
AAGAAAACTGATTAACAGAAAACCGGCAA7TGTTATTTATTTTAAAT 
>Sequence 1230

CCCTTTGAGCGGCCOCCCGGGCAGOTACAGGTrCTAAAACGAAAOTATTT 
GOGTAG TCCACTTAGTG A TATTATGTGGATG TGT AGACAAT AATATTAGT 
TCCTAGA TTAAA AGCTATATXTTATAGAAOAACAAAGATTAOCTGCAAAAA 
GGAGA1 lOI 1 IGAAAGCGAAAACAGTGGGTATGGCAATACTGAAGTGGAA 
ACTTC^C^GATGGTACAGGTGGAAGGGCGCCATGAGACTGAAACACCATG 
TAGTCAGTATAGAGGTGGAAGTOGAGGGGGGTATAGTGAGTTACAAA 
>Sequence 1231

GGGCAACTTCTAAATATTGATACAACCATTAATAATAATGCTGATAGGGT 

CAAAAGACAATTGTTGAAGCACTGAATTCAGTAAGCTGGGTCATGGTCGA 

ATTITGCTCACTACTTCATATCTTITATCTAGATTATrCCTATAAACATG 

TIXXCTAAATTtX^CATCAGTTTGTAAAGTCAATGGATrAAATTATTCAA 

ATGTAGCTATTTAACGGTCAGTAACAATOCCTAGAAACCTATTTATTCAT 

CTGTAATA TTAAAAAGCTG AATTTG A TG A TCTTG AAA AATCCTTTCCA G A 
TTT ACA A CTTTT A TTCTAACA GT AAAA G AAA TACA G AAAAAG TGTCAACA 
AACTTGGATGGG AAGCT AAT AA TTGT AA GTCACACCGAG A GA TT AAGCTA 
ACAAATTTCAATAACAGCAGTCaTATCGGGAAATCCCCAGAACTATTTAG 
AA AOCAA TCITTTGAATGCTAAGAACTACNACTACACOOCATTTACTTAO 
AATlTllTICATAGAAAAATTAGGGCCTTCAAAGGTrATGGCC 1 1 I LI 1 1 

attggaatggataaaagatgcttggaaactaaa mTGcn IUGCGAAAA 
aatctgaaaaaaactgcgctcataaggataatgggaaaaaatgxxaatcc 
tttaaaggcttagaaccaaactgggctttggacctaaagttt 

>Sequence 1232

CTTACGTGGTCGCGGCCCGAGTTCATTTAAAAGGGGGATGTTATACTTTA 
A A AAGGTTT AAG A AATAAATTTAA AAA GCCTTGGG AA TGGTTTA CGGGAA 
GGGTCCGCTTACTOGG C I 1 I I O GCCAGGG ACTTAATTTTAAAACTAAAA 

> Sequence 1233 

CCCTTTGAGCGGCCGCCCGGGCaGGTACTGCATAATaTAA I C 1 1 1 1 AAAT 
GGGC A ACTTCTA AATA TTG ATGCA A CC ATTAATAATA ATGCTTATAGGGT 

a aaag aaaatttttgaagcactgaattcagtaaccctgggtcatcggcca 
attttgctcactacttcatatgttttatgtagattattcctataaacatg 
ttccct a a a ttccaca ttcatttgta a ag aca atxkia tt aaatt attcaa 
atgtggctatttaa«xxx^gtaacaatgcctaoaaaoctatt 

> Sequence 1234 

CGTACAG1 I 1 1 1'GCGGATTGCTGAGAGTGCCCCATGAGGGGGGAGAAAAA 
TAAATTTCTATTTTTATTTNCGATCTAGGCTAATTCTA 1 1(J1 IC ACTTGC 
CCAATTAGTATTTCCCATTGGCACCGGGTTAATCCACAATAAATGGTCrT 
TT A TT A TGG TTCCTGCA A TC AA TTGAAACCl 1 1 IU GG G1 1 1 I O TTCCCCC 
AAACATTTTATTTGTTAA ATCCATAAAA ACCA 1 1 1 1 U I GGTTAAAA AAAT 

aaagici mrocn i ggtaataaaaagcacaatccttttctaggcaacc 

A TGG A G ANTTCAAT AAA CC 1 1 1 1 C 1 1 CCGAGGGAG ACAAAAAG AAAGTAC 
CCCT A TGTA AAGG A TGGGG A T A TTTTGGCX-' 1 1 I lO GGTCCGAAAAAAGGG 
TTTACCCCTTGCXXCTGGOGGCCQGGCCACTG C 1 1 » CCGGAAAAAGTGTG 
CC GAAA AI 1 1 1 1 CCCCAAGCCAACCAACTTTGTGGCCGGGTCCCGGTTTT 
AACTTTAAG 

> Sequence 1235 

CCCTTCCGCCGCCCGGGCA GGT ACTCTGTAAGTCTGG AAGAACAGGTCAC 

ATTTATTCAGACTTCTOCOCCACAATTTTTAATCAAGCAGCTCGCAGTAA 

CAAGTTATTTAATTAGATCGATTTTAAGTTGACAACAGATGTATCAGATG 

AGGAAAAAATTGAGCATGTGTGGTGTGATTATATAATAGAATTGGTTTCT 

ATAAACCATTTATAGTATTCAA C 1 1 1 1 ATAGTATTA C 1 1 1 1 IC AGATGTA 

TGGATATATAGACTATTATTrACTAACTGAGGCTCTOCGAAGTGTAGTGT 

ATT 

>Sequence 1236

GGTACrOGGATCTATGATAGAGTAGAATAAAAOCTCTTTOGTCTACAAGC 
ACAACTTATTCAAAATrGTGTGGCAACTCACACTAGCGATCATACCTGCT 
TATA TTCTCCTA GTTAGTCCCTGGGGG A TT AT A C C T I T I A A ITCTTTC A 

> Sequence 1237 

GGTACTTCTGACTAAACTGGAATTATGAGTGAGGAAGAGAGATTACTAAA 
TAAA TGACTGGGGCAA GCAAAA TTG ATGAGGAAATTACAAACTGTTTG AC 
AAACTTTTAAG AGCTACTTTGAAATAACAGAA G TT C TTOATTAAATATTO 
CA CAA T AA TGGCG T A Q AAAA OTA TGGTTTT AAACTGGG A COCTT ATTTA T 
OCACTTTTTAAAAAATTAATTTCTCA GTATACTCCATTAAAATACCATTG 
TTTGTTAAAAAAAAATITCATAGATATTAOCAAGAATATGGAAATTAAAT 
AAAAAAATTGTATACTTCCCCTTTCTCAATTATTAGATATTTATATGAAT 
TATTA TTT AATT AT AT A CT AA T AT ATTTTTA TCTCTTGTOCCCCAG AGGG 
TAAAAATGGTGGCAAG 

>Sequence 1238

CTACACCGAGACTTTATTCTTCTACAACCAAATTCCTCAAACXCA 
TGAACAGTAGCAGTGAAAGGGAGTTTAAGGTGGGGGTGAAGGAGAAAGGA 
GT A A TA TGGTTTTTT AG TAAT ATAGT A ATTT ACATTTTCA AAA A TCTTCA 
GTGTTTCACTATCTTAAATGAATATTTATTCTGGTtKKKXTC^ 1 I ICI 
C 11 C ICC 1 1 G 1 1 AAAAAAATYOACTGTTATTAACTCCAATCTCATGTGG ATT 
TGGGTTTACTCACCCTCAGATTGGGTAG AACA I C I O ICI ICAAAOAAAAA 
CGAGTGACAI ICI 1 1 GCAAATTCCCCTAAAG ACCCCGTAGGACAAGTATA 
TGAAACATTTAATAAAATCACTTCAC^XXXAACTCKTrACTTTGCACT 
AATTAATTGTC^TACTATTATGGCCCAAGTTC<KXrTGACACTTGATTTCT 



TTG TTTTACCTA CAA CCAGATGGCCTTA ACTA TTCA AAA 1 1 1 1 II 1 1 C CA 
CTAAAAAATTGCCACAAAACCACTTA TTTCTTT AAA CAAAAAC 1 1 ICC I I 
TTAAA GTC l IA AACTA I I II I I AAATTCCTTTAAGGGCAGATG O I IOI 1 1 
T AG A AA AA A AATTAG AATTGCA CT 

> Sequence 1239 

CCCTTTGGCCGCCCGGGC AGG T A CGCGGG GCGGT A TGTGGG GCCA G AGCA 
TCCGGAGG AACACAACCTCTGTGGTCCTGTAGG AGCCACTATG AGG AGGG 
CCCTGGGAAGAA TTTGCCATTTTCAGTGG AAAACCAGTGGTCGTTACTAG 
CTAAGATGTGTTTGTACC 

> Sequence 1240 

ggtacccgggctaccaaacctxxatt'aaaaatttcgg 
ggagcagaacccaacctccga gciagtaccattjctatattggtcactg tag 
ctctgtaaca tagtttg aagttgggtaattngattcctctagctttgtta 

GCTCTG TTGTTTTCACTT A GT A TT A CTTT A ACT A TT A GGGCTCTTTTTTG 
GTTCCATA TAAA TTGTAAAA TAAA H i l l CCA GTTCTGTG AAG AATCTC A 
TCGGTAGTTTCATAGGAATAACATTGAATCTGT 

> Sequence 1241 

CCCTTTGAGCGGCCGCCCGGGCAGGTGGATCACTTGAGGAGTTACAOACC 

AGACrGGTCAACATGGCGAAGCCCCATCTCTACTAAAAATACAAAAATTA 

GCTGGOCGTGGTGGGCOTGTGCCCCCT AATAA TCCCAAGTTA C 1 1 IG OGA 

AAACTTTJAGGCCAGGAAAAATTCGGCTTGAAACCCCGCGTAGGGTGGGAG 

GTTTX3CAAGTTGAGTTCAAAGAATTGGCACCAAGTGGCACCTCTCA 

TGGGGCCAAAGAAGTGAAGACCTTCCAATCITCCATATAAATATAATATA 

AAATAATAAAAATAAGAGGTAACCTTCGGGCCOGGCGTACCCACCGCCTT 

AAAGAGGGCCGAAATTTCTCAGGCAACAACCTTGGCCGGGGCCCGTTTAA 

CTTAGGTTGGGAATTCCCGAAGACCTTCC<KrrTAA 

GCGGTTAAATTCAATTGGG 

>Sequence 1242 

CCCTTTGAGCGGCCOCCCGGGCAGGTTjGATCACTT^ 

AGACTGGTCAACATGGCGAAGCCCCATCTGTACrAAAAATACAAAAATTA 
CKTTGGGCGTGGTGGCGTGTGCCCGTACTAGTCCCAGCTACTTGG 
GAAGCAGGAGAATCGCrrGAACCCGCGAGGTGGAGGTTGCAGTGAGTCAA 
GATTGCACCAGTGCACTCCAGCCTTXXXIAAGAGTGAGACTCCATCTCAA^ 



>Sequence 1243

GGTACA CAATTCAG rriCTGGGGAAAGTGAAGCATGAAGGGAATCATAGG 
AAAAATTTGAI 1 1 1 1 G I U I ATGGTGTAAGAAAAGAGTCOGATTTCAATCT 
TTTTGCACATGO ATTATCCAG C 1 1111 1 CCAACACXXATGTTATTCGAAA 
GAGAACTTA CL I 1 1 1 1 ICUL1 1 1 1 IUIG CGAI ICCI 1 1 GO 1 CATACCCTT 
OTITGGAAAflOTCOC CITTIOClTITTOCCrTTC AACaCICA 
TCACAATAACAAAGCTTGACCCCl 1 UA G1 I IC A SCTIO CAATOCnTCIC 
TTGAATTCCAAAAACCT Gl I C 1CAGI I CGI 1C1 1 A AAATTTCCCCCGGGA 
GTGGAACCGTT TTGGGT TCAACCTTGACTTCCCTICAAAA 

TCCCTTTTTTGTTCCC 
>Sequence 1244

GGTACAAATAAAGTCTrCCAAGGGTTCAGAATAGAAAATGA T C IC1 1 CCA 
GCTTOGGG AC ATTTGGGAAATTGGGA ' I ICC II 1 G GGG AAATGTACGTAAT 
CACTATATTCTGGGAAAACaTAGTAGAGAATGAATAAATAAATTCCATCG 
AATTGGGAATATGTTGTCCATTC ICCC rGTAACTAATGCTATCAAGATAA 
AGTAGAAATACCACATTTCAGAAACAGCTGGAGTAGACAGGTCTTCATAG 
GCT AGCTTGG A AACCT AAT A GCT A TTAA TAA TG AAATTOT AATTATACTC 
TGGATTCTAAACAATGAACACACAGTGATCTITTTGACrTGCIGCTTGTT 
TATAAGTTG GGGATGGTCTG AATTCATTGTTG A 
>Sequence 1245

GGTA CA G A TG TGACCITTCTT AT AGTC A GTCAA TGCTGGGAAGT AA CAGG 



GTCGTACCTTAGGTTTTAGATGGACAGTACTTTGCAT 
AATTCT AG G T A G TTG TTTCTTTGGTTG AAG A GG ATGG TAA CCGG T A CATC 
CTTTCTGGGTGACCT AGGGCAAACTTAG AATGTATAGGTAATTCCTTTGT 
AGGATAGCCTGGATTGGTTCAAGGGAGATGGGATCAGCAGGGGTTCCACA 
CGTG A AA ACTTCCAA C 1 1 1 1 I A A ATTATACTTGGATGTTCTTGGTGGGAT 
GGTTTG A A A AGGTGCCA ATTTTGG AGATCTTTTrcAAATTG AAOCATTGT 
TAAGAGGGGGGGACAAAAATGTTTCTTCTTTGGCCTTCOCAACCCATTCA 
ATG 

> Sequence 1246 

GGTACT I I I I ITTTTTTTTTTTITTGTCTAATTACTAOCri 1 1 ATTCTAA 
TGTG A A CCA TGGGCCCTG A AAGACTG ATAACAAGCTTTGGCATG AAGCAG 
AGGGGGAACTTAGGGGTTCGGCAGAAAAAGGATTTAATGGGGTGGGAAAA 
AC ATTTGGGCTl CT TTCCTTTGGGGG AG ATG AATGCCTGGAGGAAAATGG 
GG AA GTA G A AG TTGG TCTTCAAAOCCTGCCAA GGTAAAAA TTAGGCCT AA 
GAAAAAAGGCTXAAAGGGGCCAAATAGGCTTGGGATAGGGGGTAGAAGGG 
ACCAG G TT CT AGCATT GG TTTCAGACCCCTGGGGGI MC I I G GGATTGTA 
ATGGGGTTTATTTCCCCCTTTATTCCCCTTGGTOGCGC A 1 1 ICC CCAATT 
TTTCGGTATCAATTGAACCCCCTTAAGTATCTTATCGGGAGACC 

>Sequence 1247 

CCCTTGGCCGCCCGGGCA GGT ACTTTTTTTTTTTTTTTTTTTTTTTTTTT 
TTTAGTG AAAA AACTGTAATTCTTTATTTXi AAA CAATGCATTTTJAAAAGA 
A A T AAAAA CCA CTCA G ACTTCTT AG AACA TT A T AAAGTCAAAAAAACGTT 
G TCAAAAA TTTTGGCAATT A GT AG AA T A AGT A T AAAAGGGGTT A ATCAG A 
TA CCAGCCAG AATT AAGGGGT ATAA CCTCCAG 1 I CI I C AGACAG AAAAAG 
AGGGACATAAATTTTTTCATTTTTTAAAAAA C I IC I I IOGAGATATTATC 
CTTAAATTTTTGGACACCTATTXAAAGATAAAATAATTTTTTATTTTCTC 



TATCCCAAACTAGACCCTGGCAAATI 
CAGTCAA CA AAGT 

ACTA TCCCTATGAGGCA TAA TTATAACAAGCTCCATTJTtKZCTACGACAAA 

CAGA CCT A AA ATCGCTCATTGCA T A CTCTTCAATCA GCCACA T AGGCCCC 

TCOTAGTAACAGTOCATTTCTCAATCCAAACCCCCCTGAAGCrTCAAOC^ 

GGCGCAGTTCATATCTTCATAAATCCGCOCACGGGjGCCTTT 

TCATTACTTAATTCTGCCCTTAGCAAAACTTCAAACCTTAC 

CTTCACCAGGTTCGGCAATCAATAAA 1 ICCI ICI rCTCAAAGGOAACTTT 

TAAAAACCTTCTAACTCCCCAACTAAATAAG^ ^ 

A 

>Sequence 1249 

GGTA CT ATA TGTTGCTXTICTCAGTXKXAACAATGAAG II 1 1 1GCAATTCT 
AGAACTTGGA l 11 1 II 1 1 1 1A ACAAAAGTOCCAAAACAOCAAAAATGTAA 
ACAA GA TAAG A G A TTAA T ATTCT AGTGGTGT AATTTAATTAAAGTTATA T 
TTTGGCTTAATTTTAACAACTGAA G I CI 1 A II G 1 1 GAAACTTA I 1 1 1 1 A A 
CAAAA CTGTGCAOTTAAA TTTGTA T ACCTATTCACA TACTGAAAGATGAA 
CCGTTAAAA TAGCACTTAA MUG I G I I I C I l CA ATATGTCTTGATAATA 
actttggtocaatttaatattacccatgttaaggttga 

> Sequence 1250 

TGTACTTAGGTGCCTACAACATAAACAGAACCTGGTAAATTAAGTATTGA 
AAAAATOCTATGGOGCAGAAGAAGAAATOCTTACCC I ICIOI OAAOAAGA 
GAACACAGCTATGTTACACGAGGTOAATAAGAACACAGCTGCAGCTGAAT 
AGAAGAAAAGGTATACGAATTGCTAAGGTGTGAACATTCCTATGACTGTT 
TCAGCCACTAGCAAATCTCAAACAGTCTACTCT ATAATAAOCOCATAGTG 
ACATOACTT AAAGGTGGATCCTTCATGTGCTTTAAATCGTTAGCGCCCCT 
CTCGOGGAA TTCAGAGAAGCAAGCTTCCCaGCGATTTCATATTXXTOGCT 

aaactictcatcttataggagtgaggcoogataatctta tataa acgaga 
tatcaactcaaacact00c1 1 1 cccgtt attattgag aagttttaatata 
cacatgtcgcaatta tcttggckia aattcgtattaacgcttocctttctg 

TCAATTAAGAGAAATATGGAGTGTCTCCATTGGTAAATGGAAGATAAGCT 

CTATTATCAGCTTrCAATTGGCTCGAAAACATTTCAACGGTAGTATCTGG 

TACCCTCGCCCGGGGCGGTCTGaGAGGCTCGATATOCTGCCCCATCTGGG 

CGGGTCCGGTACCT ATGTGGAATCTCTGGCTATGGTCGCAAATCCTTGGG 

ATGTCATTAAATCCGCCCAAATCTTGG III CCTTG FG 1 AAATACGCGTGA 

TATCTCGTATAACTAGTCGCOOCGCCAAATTTOCOAACTCCTOCAGCCAT 

TAGAGGGTATCCGCTAGGGGTGCCTATATTGATGAAGCTGCCTTCCATTA 

ATATGOGCTGOGOCTOACTTOGCTCO C 1 1 1 1 tAGTTCGAAACTCTGGATT 

GCCCGTATTCATTAAAGAATCGGGCATACGCGCGGTGGAGGGCTGTTOGG 

GATTG 

>Sequence 1251

acgcgggcaacagttaaatcaacaaaactgctcgocagaacactacgagc 
cacagcttaaaactcaaaggacctggcggtgcttcata tccctctag agg 
aooctg1 r c to taatcaataaaocoogatcaaoctcaccacctcttcctc 
aocctatataccckxatcttcagcaaaccctxjatgaaggct 
gcgcaagtacc 

>Sequence 1252

ACCTATTATTATTTCAAATTTAAAAACTTCTTCTTTTTTAAGAGATAOGG 
TATCACTATCTTOCCCAGGCTGATCITGAACTXrrTOGCXrrCAGATGATOC 
T0CTGGGTTCAAGTGA1 1CI 1 CTGCCTCAGCCTCCCl C I rATTTGCTTTA 
CAAGTOCTOCTTCAGGGTTAOC 1 1 U X I GACCACTGC I GCC rCOCTCOCA 
CCATTTOCCAGGGACTGTCATTGCCTTAGTTTA 1 1 1 1 1 ICIU I I HO I 1 1 
1 1 1 1 1 1 IG IC UI 1 1 1 IU 1 1 1 I II 1 1 IGA OACAO CU I l l I AGTCTOTCGCC 
AAGGCTTCAGTGCAGTGGCGCAATCAAAGCTTCCTGCAGC 
> Sequence 1253

OGTA C1 1 1 1 I I I 1 1 1 1 II H 1 1 1 1 1 1 1 1 U 1 11 1 A CTTTAGTAGAG ATGG 
G OT! 1 1 ACCATGTTGGCCAGGC'I G OICIIGAAACTOCTGAOCTCAGGTGA 
TCCACAOGCTTCAGOCTOOCAAAGTGCTGGGATTACAGGOCTGAGGCACC 
AOGGOCAGOCTAAATATTTCTTTATAGCAATGCAAGGATGGOCTAACACA 
CTCOCTAAATCAAAATTGCTATTCAClTCAAGGGTATrCATTACCroACT 
AO C1 1 1 1 1 1 G OOTOCATTTQAACATAATOTAAATTTTATGGCTGATCAAA 
TOTCATTACTATGAAGATACTCCCTATGAGCTCACAGAGTCAGGACAT 
>Sequence 1254

ACA OIO 1 1 1 A TCTTCGGATAAAATGQCTAGATPAOTATGGACAOOOAOG 
CAPGGCAOATACA G I OC I FOCI ICTOOI 1 1 rAOAGTTCTTCTGAAOCACA 
ATCAA CI ICIUCAAACACOCAOCl 1 1GICI IUI A OCACAATAOOOOTCAO 
ATCTATTQCTGAC 1 1 1 ICCTDC A O CI 1CTC1 ACATCAGCAGCACCTAGOG 
OAAGAAATOTTATTGAOACTATAOCTAAAOGAAOAACA I T CIO CI C TO 1 1 
GCACACTATTATCCAATTGGATAGAOCX^CATCTAAATGTCTGCAATTAC 
AGTAATGTCAGCTOOGCATGGTGGCTCATGOCTGTAATOOCAGCA IC I IO 
GGA 

>Sequence 1255

OOTA CI 1 1 11 II 1 1 1 1 1 1 HI 1 1 1 1 1 1 1 1 ICI II 1 1 A A 1 1 1 1 1 1 1 I I 1 1 1 

TTTAGAATAACAAAAAATTTTTTACTAAAACATAAAATTTOCAGAGGTTT 



CCAGACAAGCCATACAAAAGGGGCACAAG CI 1 1 1 1 1 G G AGGGGGOAATCT 
ACACTTGACAOCAA TGTTATTAGOGAGGGCTGGG A TGTTTGGTT AATGTT 
CCCATTTAGGGTCCAACAATAAAGOCTGTTCATTTAGA GTGTOCAAA TCA 
AG1 1 IGACTTCGC1 1GAGCA! 1 1 PCTGAAGACCTGGG1 fGGGTGGI 1 1 lA 

CCTGGOG
>Sequence 1256

GGTACTG GI rTTTTTTTTTTTTTTI I I I 1 1 1 1 1 1 A G O 1 1 POC1 1 1 1 I AAT 
GAGCTCAOOCTTTA ACA CAAAAAAAGCAGGGTTG ATGTA TTTTAAAAAAG 
GAAGTGCAAATAAAAAAATCTCAAAGCT ATTTGAGTTCTCGTCTGTCCCT 

GTATGACCAGAATT^TTXJAATTTGCTAGAACTGTGGAAGC I 1 « 1 ACTCCT 
GCaGTAAGCACAGATCOCACTGC^TCAATAACTTXjGTATTGAGCACGTAT 
TrrGCAAAAGCTA H 1 1 I C CTA O 1 1 1 1 1 AGTATTACTTTCA 1 0 1 1 1 1 AAA 
AATG 

> Sequence 1257 

TTG AAAATGAG AGGAAAACAAAATAAAATGA TTTACATAATCAAAGG ATT 

AACTGA t acag actttt a ttctaaa tgctcacaagca ca o aaaccaacaa 

GAAATCAGATCTroAAOGAATTTATAATGATTCTTCCAGGAAGCACCGOG 

gcagocacataaggcgct g ttcacacctggctgt g tctgccaagttagtc 

ctcaaagagaaaacaaggaggaaaaagacaaaaaaaaaaacaaaccagca 

aacccagtgtgcttaaaacacagatcacca7cagaggtttatttcacagc 

AAGG 

>Sequence 1258

GGTACC rTGCTOOn AATATAACTAAGA 1 11 1 GOCI 1 1 A TTGGGTTAGGT 
A TCTTT TTl f TA TTTTAGCACCTG ATAGC fC TCI ICI A CTGAGTAAAGAA 
TTATAACTTTTAGATGTCACAGAAAATTAGAGTATTTATTGTCAAAAAAA 
AAAAAAAAAAAAAGTT 
>Sequence 1259

ACTTCAACAATTCCAAAAGTTTT1XIACTOAAATAAGCAAACTCACT AATG 
ATT ATGA ACTG A ACATAA CCAACAGG C I G 1 T1 GGAGAAAAAACATACCTC 
TTOCTTCAAGTAAG ! 1 IGOCATGOCTACCATATCTGTGAGTGGTATTCTG 
G AATGGCCAAATGGCCCTGOTAGGACTATGC G I O C I GAAGTCGTOCTGCC 
TGGCTCTOGCCACAT00C rGTOGT G Cl 1 1 1 ULATCCTGATCTACAG ATAT 
TCAGAACTOCAGGGAO 1 ICC1 1 1 1 A G I C C I UGCAATCTOAACCTOATTTT 
TGOCTCATCCOCAGAATAGCTOCATAAAAATGTGCAGCAGGAG 
>Sequence 1260

ACTGGTGGGATTGTTAGAOCATOOCAAAAAGGAAGTGCACCTrGGAGTCT 
GTGGAGCTCTCAAGAATA ICTC I 1 1 1 O QACGTQACCAGGATAACAAGATT 
GOCCTAAAAAACTGTOATGGTGTGOCTGCCCI lOIOCGAI 1UC1 1UOAAA 
GGCTOGTGATATGGACCTTACTGAAGTTATTACOGGTOA G I ICI AGGOCT 
AAGGAAAATTGCTAAGTCAGTGTTACTCTCTAGTGATGTTGAGAACTAGA 
GGGATTTOCAGACCl 1 1 IAL I I 1 1 rO ATCAAAOGTTO TO AACTGG I G GCTG 

toogtcaaatocatctcacaga rrrom goatcacacaoca 

>Sequence 1261

GGTAC1 111 I I II 1 1 1 1 1 1 III I I II 1 1 1 111 1 1 rO OCTOCTCTGACTAT 
ATTTTCAAATAOTCTOTCI IIAAOOTCAGTAA I IC I I 111 I CIO CATGAT 
CAACTCTGCTATTAAAGGACTCTGATOCA 1 1 CM ^GTATGTGAACTGCT 
11 11 rCAGCTOCAGAAl I ICTOCI IC A TTC1 1 1 1 A AATTCA A T CICTC1 1 
AAATGT A TCTOGT AAAA TTCTGAA 1 1CCI rCTCl 1 1UI 1A TCTTGAATTT 
CTCTGA GI I I CC 1 C ACT ATTTPGAA I 1CTG1CIGAAAGGTCACAA 11T IG 



ATG 

> Sequence 1262 
GGTACACTCCATt 



TAGGATGCTGOAC I IC I A GCTTAGT 
gagaatgcagtatactttttcaaaacttcgtgcaggaatocctcaaatgc
tgtaactaggaatoggtcagtgaagttcaaaogac 1 1 1 icc i ig agogag 

GACTGAGAAGCCAGAGAGCTTGCACCrGAGGCATCTCAGCGGTGAGAGTA 
ACAGTCCTAGGAAAATAGATGGGGGCTGGGGGTAAGGAAATGTGCTGAAG 
ACAGAGCTA TTCTGGA 
>Sequence 1263

GCTA CTcm nm i uiiii i i n 1 ih iagog om nci i pgta gag 

ACAGGGTCTCACTCTATTGOGOCAGGC I O OICI I GA ACTCATGGGCTCAA 
GTGATCCTCC IGOCTTGGGCTCATG AAGTGCTGGGATTACAGGTGTGaGT 
CACCATGACTGACCTATATTTAATTTTTTAAAGATrAGACTGGTOTTAGC 
TGTAAATAGTTTG AAATACCTC TCTG ATAOOTGCTAGCTTATOGTTACTC 
TTA G rOCTTCT I O CATTTGCA TAGTCAAAACTTGATA C rTTTTG fG AACT 
TTGAAAGCATGC 
> Sequence 1264

ACTTTGTGTTT A A G A G A A ATTOCT AAACTGG A TATA TGTGGCAGGCTG AA 
AGCACTGTGAGTTGAAGTCAAGGGGAGAGGTCCAGGCGCAGTGGCTCATG 
CCTOTAATCCCAOCO C1 1 1 U OGAGGCOCAGGCGGGAOG G T T GC 1 1GAGGC 
CAOAAOTTTGAGAOCAACTTGGGCAACATAGCAAGACC P0CTCIC1 ACAA 
AAGATCTAAAATTAAATATTAATATAAAATAAG O 1 1 CC 1 1 GGCOGGGAOC 
ACGCTAAGGGCG 
>Sequence 1265

AOCTTATTGTTAAAGTGAGTCAGATAAATCTTCAATTXXTOGCTATTTGG 
GCAATTGAATCATCATGGACTGTATAATGCAATCAGATTA 1 1 1 1G1 1 IC I 
AGACATCCTTXjAATTACACCAAAGAACATGAAATTTAGTTGTGGTTAAAT 
T ATTTATTT ATTTXATGCATTCATTTTA 1 1 I CCC 1 1 A AOGTCTGGATGAO 
ACTTCTTTGGGGAGOCTCTAA^AAAAATTITTCACTGGGGGOCACGTGGGT 
CATTAG AAGOCAGAGC ICICCl I XA GGCTCCTTCOCAGTGCCTAAAGGOG 
CTATAGGAAACATAGATCCAGCCAGGGGCTT 
>Sequence 1266

OCCTTAGCGGCOOOCCGGGCAGGT ACTCAACACTGATTTCAQ AAGAAAAG 
TGTG A TTTGCTT AOCTXTTG ATTTTGAGACCTATA TAG TO AAGGTTTGTTG 
CCACTTTTTAGI 1 1 CC 1 CAAAGATGCAGAAGTAATGAGGTTTGACAGAGA 
CATGAGACTATAAGA TG I C I g r CATTGCTGCCAAGCATGQAAAAGATOTT 
AAGATOTtXAGCTtXXXlATAAAATX^TATTTTXlAAAGTCTGAGACACGAA 
GAATA1CI I ICICI IATTTGGAAATATGCTGAAGATAGGAATAAAGAAAA 
GGA TTACACT AAAA TGGAGaCGAGAGATACaGTAAAGCAG AAA TCTATAT 
GCC 

>Sequence 1267

t^ ^ATGjJ^A Ci^O O^ 1 10001 I L IUIA AA CI 1 11 A TTTTA 
TT ACAAA GGTT ACTTTGTTTGAAAAGATCGT ATGTT AAGGTT A GAT AATT 
TCGAAAATA rTTCCTTCICI AGCTAATaCOCACAGTTTATCTTTAOGCAG 
ATOC TATAAA A1TAA AAATGGCAA CGI 1 1 ti I AGAGOOCTT IT CAGAAAAA 
TCTTATOGAOC1 1 1 IL1 I GGAATTTTTAATAAAAAATOGCAA I 1 II 1 III 
TTTCAA TT A TTGAAAAAGAAAACCAAAAAGCCA TTTTTTGG T AAAAAAAA 
TAGGAOCATAT 1 1 GO 1 ICI 1 1 AACAAOCAAAAAATGGGGTTGTIGGAAAC 
CCCTATTTCGOCCI 1 1 1 ATTATTTTATTAAGGGGOCATTATTA TTG 



PCT7US0Q/3J312 



ATGTGATTAAGCAAAGGAOCOCAGATGGOGAG 



ATTATTTTGAATTAOCTAGCrrGGACTCCAOGTt^ 

ccaaagagatctgagaatxiaaaagcacaagtgagagcaotgggatagoca 

AATITTAAGAGOGTTGTGAGOCAGAGAATATAGGCX.'GCX I C I AGAAGCTG 

AACCXJTTXXCACTCACGGT AGAXTCTTZX^ATXnTXXXjGCTGTAGAATAATA 
CATCTGTGCT ATTTTAAGCCAC I O I 1 IG1G AI lOlCIO I I ACAOAAGTTA 
TAGAA
>Sequence 1269

GGT ACA TTT AAAA GGTG A TGCTAA T ACTTTAAAA TGTTTAAG A T A TAG A T 
TTAAAAAOCATTGTAAATTOTATACTQCAA IG TCGTCTAACATGGCATTT 
GG AACAA GGACA TAA TG TTT AAACA TT AAAA GTTGCAAATTGTTAACACT 
TAACCATATGGATTAGTGT AATGGCATACGTTGACCCAAA 1 1 1 1 II IG TC 
TTAAAGTTTAAAAA TT AOCA TAAAAACTT ATTTAACA GCTGT A CTTAA CT 
G GGAAT TTAATGGTCCTAATTATAGACAAAAATACTITCGAATATCTTGO 
CATTTTOCACAAACAATTTAACTTGGOCA O 1 1 GCC 1 1 1 1 1 1 1 1 A G CI 1 1 1 
GGCI 1 1 1 FG GAOGTOG OC1 1 1 1 1 UGATOTTXXTTAATGGGOCTAA TTT AAA 
TAAA OG TTCOOGACTAG A1 1 1 1 I T1G1C I IGIGC TTCTAACATA 

GGTACTGCAA GC AAC A GTT ACTGCG A CGTC A GCAGCA A CG A AGT A TCCTC 

toctgaaattattaggcagcacrtgggtcaaccactcogccgtgacccat 
accaaagccgtggccttgggcaccgaataaacacagacgactatocagcg 

ACCAAGATCAGAGOCAGaCAGOGGAAACCOCTGCCACACCACTAAGTTTG 
TTGCACAGGAGACTTGAGTGGAAGAGGGCCTOCAATTOOCTCAACTGCAT 

caactacaacocattagtta 
CGATATTTOCAGCACCAC I IPG I og oooooottactaa 

>Sequence 1271

GGT ACAA M i l I AGTCAAGOGA I IG1 1 IG ATA CTCl 1 1 AAGTTCACTGCC 
AGGCCTACCACTTA TCTCTGTOGCAGG AGGAGAG ttcctgtaatgagagg 
1 1 1 1 rAAGACGTCCn r oT TCi gggatgaatcatagggaatgactgoctt 
ggagctcaggatattaaactgagtc^ 

AA i n oc agcacacttggocgooogtt ACCTAGTGGATTCOCAAG C 1 IC I 
GGTTCCAAA IC1 1 1 O OOGTTAATrOCATGGTCAATAG OC 1 0 1 1 1 ICC1 I G 
TGTCGAAAATTGTTTATCOOGCTCACC 
>Sequence 1272

ggtactcaatgtcacattaacataggaaaggttatatatacactatacac 
ttcagocitgaaatgtggacccaaaaaacattctatttttgagtaatoca 
ttg aattcggtga oggtoccacaccctcaaatoct aatttatcaca gcac 

GC ICT T CA TGAATTGGATGCCAGA G m C G T G ATGA I CC 1 1 ICA ATCTTA 
ATACCATAGACTTGCATOTOTAGGGATGA CTTCOCTI I g caoctgctaag 
gttgataagaatcggacctgcacttggcggccgctct 

TAG AACACTCG C I G TT 
> Sequence 1273

act 1 1 1 1 1 1 1 n i a t it 1 1 ii ucm i icm 1 1 1 ai imm mn 
1 1 1 1 1 1 1 IAA 1 1 1 1 1 1 1 1 1 aaaacaaaoogoctaaatcaaaaaagcoooc 
AAAAAAAAAAAA ATAAOO CTTTTOOCAAAAAOOCXXXJC'I ilcc aaaaacoc 
OOGGGAAAAAAAACCGOCAAAGOCAAAACCCAAAAOOOOGGAl roooocT 

TTACCCAAAACOO^^ 
AOCCAAAC 

>Sequence 1274
GGTACTACAAACAACAGAAATTTA I I GI CI CI CAG TTCTGGAGGCTAGAA 
GTOCAGAA TAAGGTA TT AGT AG O 1 1 IOGI IC'l I IC'IG AGOG CTG IG AAGC 
AOAATCTGTTCGA 1C O CTC1CT ICTTGICT ICA TC T GTTC T AT GT CT GIC ' 
TTTCTTCAAA 1 1 1 UCOC rTTATATAAGGATAGCAATCATATTGGATTAGG 
OOCAGTOCTAA TGAOCAGATCTTAACATTTGCAAA GGOOCTA 1 I 1CICA C 
T AAGGTOGT ATTTACAGGTA TAAAGGGTCTAGACTTTAACa I CI 1 1 1 I GG 
GGAAGACACAGTTCAA TCOGTAACA G ATGGTT Al 
AT 
CCCCTAACCCTGCTCCCCGGCCGAGGTX2CATTTAAAAGGGGTTGCTTAAT
CCTTT AAAAAGGTTTTAAATATlTGATrrAA AAAGCCCTTGG AAAATTGG 
TTTCCTGGAATGGGCCTTTTACAAG<XKlATTTGACCAGGGACATTA>TGG 
TAAAAACAATATAAACTTGGCAAA 1 HO I I 1 1 ACACTTTAACATTATTTA 
TAA G TGAAA TGGGTCAAACOTTG ACCCAAA I I II I IO I H 11 1 AAAGGTT 
TAA AAAAT ATCCCAAAAAAAC 1 1 HTI I ACCOOOGGGTCATAAAOCTTGC 
GAAT TTTT ATT G TCC 1 1 ATATATGGACA AAAAAA I C 1 1 1 1 I GCTTACACT 
CX ITATITT ^XAC XX^A ATAATT n i C li ll T G CCCTCGCCCA Crm il C 
TCTTTTTTAGAATTTTATGAAGGATCTCTC > D i li I ACTGAGTGACCAT 
ATTTCCI I HI I IAAAAAAAAACCCTTTCCTCTTAT7TTGATTTATAATA 
TCTAC 1 1 0 1 1 1 GTTCATTATATATAACAAACC 
>Sequence 1276 

ACT ATA AA AGGTTG AG T A A AAACA GC A A AGCC TGCT A T AAGTTCAAA TCT 
GTTCTA TTAOCCTAAATTAG ATTA A ACC A ACCTGA ATTATAGTAG ATTTC 
TCAAT AG ATGAGG AACTGAAAAATACTATGTAAAATATCTTCCAAAATGC 
1 I I I I AT AC I 1 1 I 1 1 I A TTTGT AA TTTGG TCT A TCTAA AA TG TTCG TTAG 
CTTAACTT AATGGGCGTTATTGGATTCATATGACTAACGTTTCCTCAGTA 
TTGTAA TGCTTGAAA T A TTTG AAAG AAAAAATGTTCTTTTTT AGTTQAAA 
CTGGTATATATAAT^rcAGTGCTrcGCAGGTTAGTATATTTTTATGCATTT 
TT 

>Sequence 1277

GGTACCA ACACAATTGTTAA7 7 I CC I UACAGGCTCAAGGCATTCTGGG AA 
GCTATACAGGGG ACAGGAAGCATTTTGGGG AGCCT AAGGGGAGCCAGTTT 
G<jAAGAOACAGCATTCTXXrrGGCTAGGACAGGTGGTGGCGGTGGCCGGGT 
^^GGTTCTO^AAGGGAOCCTTTGCAGATCCCGGGGCCCTGTTTAtTC^ 
GAGGACGTGAAGATGAGTCACXTAGC1TGGTGGGAATOGCACCTGTGGAG 
CAAAGCCCTACA CACA CAA TGG TGGTGTTT A ACC AGCTTT AT AGCGA CTG 
IOI 1 1 GAGGGGGACTGGTACATGTCACrAGGGOAACATGGTATAGGTGCA 
CCTGCTT 
>Sequence 1278

GG TACTAA AACT AAAA CTG AGCAGTTTAAAA CA TTCATTT A A AGGGATAT 
CTA ATGT GTTTATT A TTAACA T AAAT AATGTTTTA TG AAAA ATGTAACCT 
TAG 1 1 1 1 CCAAAACAAAAATGTTTAGGGCAAGAGTAACATTATTTTACAT 
TATTCCATCTCAGTGAAAAATAAATGGCAACAAAATTCTTATATCI^^ 
CTPCAGTTAATCTGTTCAI I HOI 1 1 IOGI I OAAG T AT ATGAA GG AAATC 
TCTCX! ICACACAGTTGTGTAGTGGAAAAAGGGGGACTATTGT AACAGC-CT 
GTGCACATA A TTGTGG A TG A I 11 ICI 1 1 GATACAACAACAAAACTTGGGG 
GATG 

>Sequence 1279

A CAATGTG ATTTATCAATTAATTAAA TTTG AATTCCATGOAATG AAATA T 

aagtcaacaagtatgacao i i iixjC mom attatggaacaatcatta 
at aatttga taa tt a a atggtcctg aa tggtta gccatgttctccgcatt 
taaataaatagtataaacataaatgaaaatattaaagtaatttcaacgtg 
atagagaccgcttatttttajgttcaggtagagttccaacctaatggtaat 

TAAGATTCtlAGATtXXiAAAGATC-TCATGTGAATATTG^ 

aaattaaagctttcttaaaoatgctgtgtagggctgagaogt^^ 

TGTAOCTCO 
>Sequence 1280

GGTAC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 WCA S \GGCAATTTAATAAGAT 
TTOAGCATAGATATTAAACTTAGCATCGACAGAGAAACTTA 1 1 ICI I O GG 
GG ACTG GCATAGTTJAAAGAACAGAATCAGTATGACCIXXAGAOAGCAQAAA 
AACTTTACAACAGCTAATACTACTTGCTACATTGCTO 1 1 G CI II AAGATT 
TGAGGGAGGAGGTACTAGACCCTG<XTGAGATCCT1TTGAGGTC 
GAATTTAAGCC I 11 IC I 1 1 1 1 1 1 1 IC1H 1 ATT AATTTGAAATTTT AAAA 
TTATATTTTTXXXITGTTCTAATTATCACTTAAAATTTCTAA 1 1 1 1 I ' ICTI 
1 1 1 1 AC 1 1 1 ATACT I"! IT

> Sequence 1281

ACCTCTGACTTTCTAACAAATTACCATAAAGGAAGAATA J 1 1 1 1 CO 111 A 

CTATTGTTAGAACACCTTAGAACXIATCAAAAATATAATTACATGGCTAAT 

AGAAAAAAAAGAGCAGTTTTAAAATATGTTTTATGTAACCTATrTTCATT 

GTTTTTCA 1 1 HOI IO I I OCCG AATAGTAC^TGTTCTAAGTAAATACAGG 

TCTCAATTTCACTaTGAATAAAAAAAAAAAAAAGGAAAAAAAAAAAAAGT 

ACC 

>Sequence 1282

GGTACTC TTl C I l Al III CI 1 AATCAA T A CAGCTAAAGGTTTGTCAATAT 
TGTTGAT CTTTTTAAAGAACTAAAA 1 1 1 IOI II »Oll GATTTCCTTTATT 
I I I II II ICIOl 1 1 1 A TTTATCACC AC I C 1 I A I I I 1 1 A GTATTTCCTTCC 
TTCTGGTAGCTTTGGGTTTAGTTTGTTCTTAAGTTCCTTAGGTGTAAAGT 
T A CGCTGTTG A A ATG A G A TCTTCTT A TTT A A TGT A TGCA TTT A T AGCTCT 
AAA I II ICICI I AGCaCTGTTTTCACTGCATGCTCTAAGTTTTGATAT 

> Sequence 1283

GOT A C I 1 I I I H 1 1 H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ICI 1 1 1 1 AATTAAAAACCGG 
GACTTGGTGGGTTGCCCAAGCTGGG<rTTGAACTTCTG)G^CTTAA^ 
ATACTGGTTTGGCX^CCCAAAGCACTC^XiATTACCGGGCTGAACCA CC^ 
ACCCAGCTTTTAAAACCACACTATTTTTAGGGCAATATTACACACCTGGC 
CCAAGGACTTACAGGGGGGGGAAAAGCTTGGAC 1 1 1 I O G C I I I I 1 1 1 1 I I 
TTTGG ACCCAAGCACCCTGG AACTTCCATTICCTTCCTTCAATTACGTTT 
AAAATC 
>Sequence 1284

ggtactcacaaataacaagacaaatttgacctgttcaataaatagaaatg 

AAGTGGCT AAAAA TG TTT AAA TGGAAGTGG AAAA CAGTCGTC I ICI 1 IG T 
ACTTGGT CTCTACCTCAGATAATT C 1 I C II I gagcttttgagtagcttct 
CCI 1 1 1 JCACTTAGTTCTACATGTATTCTATGCAGTGAGGTTTCAGATGC 
AGACAATCTTGACTGAAGCTGTTGACAATCTAG G ICI I It I OATGAAGGG 
TTOCCTGAATA H C I I I 11 ACTCACAGATTCTrCATTATGTTTCTCCT 

> Sequence 1285

CCCTTAGCTTGGTCGCGjGCCGAGGT^C^ 

ACCCCIG 1 GOT GGTGT GGCTACATTCTTI GAGTTTAGAAAACGAG ATAAA 
GAATTGCTCATATCTTCCCAAA TTGTG TAGTAT AAAAAG AATGCTOTCCT 
GO I IOI II IT I OTAGAATATGGAAGTCCCTOCAGTAAGT AGGCAACATGC 
TACCCTTCTATTCAACACAGCA CTAG AACAAGGCAAGTGGGACC1 11G 1C 
GACACATGATTCGATTrCTTAAAGTCATTGGCTCTGGAGAATCTGAGACA 



TCTAGGCCATACACCCACGTAGCA fCTTTCTCI AATGGT 
> Sequence 1286

ACACAGGATGTGATCAACAAAGTrcTATTrrACAGGAGTATGATCCTGTC 

GATACCTTGCXXJTAGGTTATCTAACATGATTGGAGOGC^ 

TCTTGCACAGATCGAGAGTCAGGOGTATTTTOTGACATTACACAGCATCA 

^^AGCCTGGTGCCTCATCAGGTGTAAGTTCTTATAACCACICTTGGCAAA 

TTTATTAAA GACAGGAA CACA GTCAATCTGTAACTCATAGT AGCTCT ACG 

TTTACTTGAATTCCAGAATCCCTAACCCATCTGTCCCT 

GAAAGATGACATOCATOGACAGTCAACAGAAAGCKIATGAAAGCCAGGATT 

CCTOGGATGAACAGACAGTGGCAATTAGGATGTGAAGACAGGrrCACAACC 

TATTACTATGTCTAAAAACGACCAG AOCAGAGAGCCAGAAAGAATAAGCC 

TGAAGTCACXTCCACTCAAAAGCAGCCAAACTCCCTCAAAGGACT 

TT AAAA CCTGGA TCT AAA CCTGAAGOXjGCTAAAAAO 1 0 > C 1 0 1 1 IClOAG 

1 1 1 ICI I ICC 1 1 AAGCTCATGAAGCAGATTjAACCTACATTTTTATTOCCA 

tttcatatcaaatcttxhjtxxitataa<xttag 
aagtgtggactaaatatl g i cci i cg<xxxxiacacgctaaggcgaattca 
acaacitggcgcggtacrgtggaccgagctcgtacca 

>Sequence 1287
GGTA CA TTCCAG TTCTTTA TCTGAA TACAA GCO 1 1 1 IOC1 1 1 l A TTTCCA
Ol HCI I G^ACCAGAAGAATAAAATACATAAG ACATCGTrTCTATATGGT 
CATATACTATATAGAATAAAGAATTGTTATGTAAATTATTAAATGAGTAT 
ACAG ACCTTT ACATAAAAACTAA GGTAC 1 1 H 1 1 1 1 1 1 1 1 1 II 1 1 Ul II I 
■ 1 1 n i 1 1 1 11 H I II II 1 1 1 1 1 II H 1 IKrnTGTTGTTTTOATTTTGT 
GAI 1 I IOI II I HOIOI lOATTGTGGAGTAGGAGAAATAGTGAAATTTGA 
AGGTAGAGG 
>Sequence 1288

GGTACXTTGTGCAGACCGCCTACCTCATCCTGT^ 

C i iX : i uGGAATACAGACCAGTAGGTCTCAGCCTTATTTTACCCAGCCCTT 

GCTACA TTCAAOAAGGAATCACTCTGG 1 1 C I AATGCCTCCG ACAG AATOG 

TCAGATTCTCAGACTTJT AAA GCAAAGAAG ACTA 7GTTCAGTGACAGCAAG 

ACTGTTGAAGAAAAATAAACTCGAATGGCCTTCACGAGCTATTATCAATA 

A AAA C A G T A TA ACTTAT A ATTA TCTGTTG TGTTACAA TG AAGT A TATC AT 

CACTGCT 

> Sequence 1289

ACTAAGGTTGTTAGCCCTTJTGCTGGAAGAGAGTGTATTAGTTXATrrTCA 
CACTGCTGATAAAGACATACCCG AGACTGGGTAATTGAG AAAAAGAGGTT 
TAATGGACTCATAGTTCCATGTGGCTGGGGAGGCCTCAC^AT^ 
AAGOTGAAAGGCACATCITACATGTTCGCAGGCAAOAGAGAAATGAGAGC 
CAAGCAAAAGGGGAAACCCCTrATGAAATCATCAGATCTT^TTAGACTTA 
TCCACT ACCACAAGAACAGTGTG<KXKjAAAGCA(XTCCATGATTCAN 
> Sequence 1290

^CTTTGAGCGGCCCCCOGGG^AGGTACATAGGCTCITiCCTATCTCrGTG 

GCATGCATCCTaCATCCACAACTACACATTATTTAT™^ 

CAAATCCCAATTCCCCAGAAATGGTCCTCACCTCATTG^ 

AGAGCCAAGGGGGAAACAGCAACITJGGAAATGACTATGACAGACTAACAC 

^^^^^A^AG^ATGGCrCTT^TGGGATGTAGGTCGAAGGAGAGGCCTCT 

GGCATTGGCAGCTCCCTACCAGAGGTGTCCTG<XXrrCTGTTCTCTTGGGG 

TAAGGGAGCCACTGGGCAGGAGTAGGCAG 

>Sequence 1291

CCCTTTGAGCGGCCGCCCGGG<^GGTACATAAGCICTG^ 

GCATGGATCCTACATCCACAACTACAC^TrATTTATTTATTTATTTT^^ 

CAAATCCCAATTCCCCAAATATCKJTCCTCACCTCACTGACATAT^ 

AG^GCCAAGGGGGAAACAGCAACTrGGAAATOACTATGACAGACTAACAC 

AAAAGACAAGAAATGGCTCIT^TOGAATGTAGGTGGAAGGAGAGGCOCTT 

GGCA TTGGCAGCTCCCTACCAG AGGTGTCXrrGCCCTCTG A TCTCTTCGGG 

TAAGGGAGCCACTGGTCAAGAATAGGCAGC 

>Sequence 1292

GGTACAI 1 1 1 II I IC CICI III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' I A ATTCTGAQATTT 
CCCCAAGCTOTG GA1 ICI 1 CC I ACTCCTTAAGAAAAAAACTTTGGGTTTA 
TTTAG CATCT ACAC* I li'lGTCAGTTGTCTCG C T OI 1 1 IC CACCCATTTTA 
TTATAC ICI I AAAAGATGTAATTG 1 1 0 1 CA TTTTOAACAGTTAAACATCT 
TTGGGTATAAAAAGAACCCCAATtXnTATGTT 

1 1 1 1 1 1 OGTTTTACCTAAATAAACTTTCAGCTAATCATATAAGGAAAGAG 
A CTG I C 1 1 1 I'll 

OCTACT AOCTGTTTAAGGACATA CCAGA AAAAAAGTATTGATTrrTATOC 
TATGCTAAACAG 1 OC 1 0 1GATAAOJ 1 1 IOI ATCACTTGGAGAATGCTCCT 
Q^^^^^ ^QCA ACA CTAC TAG^TAACCOCTO<^TCAAAGAGOAAATCAAA 
AGGGAAATTTCACACTGTATTGTAAAGAGAGGAGACTTTT ATGCCAAAAT 
ACAGTAAO I C 1 1 1 1 AGTCAGATAAjCATTAATAATCTTAAATTCCATTCAT 
OTT AA^GAAGAAAGAC^TTAAGAAATCTGACACTAATCAGAAGAAATTA 
GAAAACGAATAAGTAAAAGAATCTGAAAAGGAGAAATAAAAA 
>Sequence 1294

GGTACAGTGGGAGAGTGAGGTGGG AGAAGAAGAO IOICIOCjI 1 1 ioioio 
CTT CAC IO ICI ICI I GC<ATGAGCTATGTTTTAATTTGGAAAGAGTAGGG
CCGCTT CAGAG CCTCCTACAAAAGTGCTAGGGCCAAAGA C 1 1 ICI I A GCT 
TPA ACATTTGTA TCTGACTAAAATTGACTTGGGCAGCCCrr^ 
TCACTTTGTTTTTGGCCi 1 1 1 1 C 1 GG 1 UGGTGXtCCCTTATG AGTOGTTCT 
TCGGI 1 1 1 1 ICI 1 1 C AACAAT TTGCCCCCCTTGAAAAATOAATCCACCAT 
GGTGTGCAACCTOTCm 1 1 I'm TGOACTAGGCCCCAATATCACCTGAT 

caatggta a i i i i i m e unc n io gggggc ci ict i ic aatgaaaac 

C CAAA TTCCITTOGCCACCTCCAACAA I 1 1 ICI I IGGGCCCGGCCCTTTT 

CCTTGG 

> Sequence 1295 

ACGCCKXKrrCTCTCCATGGGTCrrG 

TGCATCTCTTAGTT T T 1 ICC 1 ' I ATTTCCTTTATTCTTAGTATCAC AGTCC 
ATGATATCCACTGTCCTTGOGGCXKXCAATTCATTGTGCAAAAGCATTTA 



A ATCAAA ATACCCTA TTTGTTA I'l'ITI'I'l AAAAA GTAAAGTGGGG ATGAC 
A AGTCAAGTGG A AAT TTATCCC AAAAG AGTGGGG ATTACTGTG ACTATCT 
G AGGAGTTATACTTGA 1 1 1 1 1 1 IOICI G ATTTTAATGG ACTGTAGG ATCT 
> Sequence 1296 

ACAATGCACATG«XiAAAGACXrrTAATTrTGGATCTG 
TATG CCTGGAATAAATGC C I HCI I 1 O GG ATGTAACTTGCTTAAAT ACTA 
TTTGCTCCTTJATCTTJTTJTGAGTTACTTTAATTriGr 

CTA TG ATA TTCTGG CTTT A CTA GTGGTGACTCATCTATCTGGGTAAGAAT 

AGACTACATACTCA7TTTGGATGTATTTTGATTTTAAGTTTGTAAACTGG 

TCCACTATTTTAAAATATTGTAGAGTGCTGATTACTTr^ 

TGTAGC^CCCTGTGTTTATCAGGTAGGTAGATTGGATANTTGGAATTGGA 

A T A TT A AAA TCT A TAA T AA ACCAA TGGTTTTACAA A TGCCTTT A T AAATC 

TAACA tTGCGCTTCATCTAGATAG AAACTTTCTGAAAGTGCTCCTGTCTC 

TACTTGGTGTATAAAAGCGATGACATTTXnTACAGACCAAWATATTGTT 

cgttact aggatattatctgtg atcatcgtcctcg 1tcgtcaacaaggaa 
aagaattccattgtttaatgaagaactatgtggattagaaoagnataaag 
aca>iccgtcacaggggtcgcattagt aaa ttc aaatc ag acatggagcat 
attt aaaa tgtcagaagatgtg tgaaa tgtaaatccatgactactcggtg 

G TCG A CTTCCG TCG TTGT AATATCCCACATACTGTAGTGG ACAA GTTT A T 
CATAGCAGAACAGTGACGGAAATAGTCTTOGAGTCTCAGTGAGTAGCTAA 
ATATCGCACCCTTGTCATCGAACATCGAGAACTCATGATCAACTAGGATG 
AAA T ATTA TCG TTGGTCA TTGAA GGG ACACT A CA TATTG AGATGCATG AT 
ACC 

>Sequence 1297

GGTACAT1TAAAAGGTGATGCTAATACTTTAAAA TGTTTAAGATATAGAT 
TT AAAAA GCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCATTGG 
ACAGGACA TAA TGT AAAA CATAAAAGTGCATTTGTTACACTTACA TATTG 
ATAGTrjAATGGCTACCTGACCAA ll niGlCICAAGTTAATTTCCTAAAA 
CrATTTCAGTCTCTACTGGATTTATCCCATATTACACATTITGATATTAT 
ATACCTAAATATTACTGGCATA 1 1111 lOCI II HI 1 1 IOI I O GCTTCAT 
ATAGTTTACATTT 
> Sequence 1298

CCCTT COCKXCa3XKXX>GCTAaXXXX^ 

ICI 1 OCAGAGACTGCTCCATTCCAGTAAAAGGTXiAAGGTTCAACTGGAOA 
CCTTXAAAGTTGGCTXKJGCCTACGGTTTGCC^ 

GCACAGAGAGGGAAAGATJTJCATGCCATTiGTGATAATAAAAAGGOCCACC 
TGGGGTATGATATTGGGGACTAACGCTTGTT ATTXXCAACXXrrTTGGGAG 
GGCCAAGGTGGGCGGATCACGAGGTTTA O 1 1 1 1 ICI A AACCAGTTTAGGT 
CAACAAT 1 G I G T AACCCTGTATATTTCtTATGTTGCTAAAAAAAAAATAA 
T 

>Sequence 1299

GGTACTAAACGTGATGAAAAATATI 



<^CGCCTGTAATCXXTCCAClTTGGGAGGOOGAGGCAG 
catcaggacattgacaocatccoggctaacacaotcaaaooctctctcta
ctaaaaatacagaagaaaaaagaaaaaaaaaaaaaaaagg itci rrom 

ACTGCA G I O 1 O G I CI ACATGGCATTGGACAGGACATAATGTAAAACATAA 
AAATGCAATTGTTACACTTACATATGATAGTCAATCKXAACGTGAOCAAT 



>Sequence 1300

ACATACAAAAAAATCATTAACTCATATATTTCAAGAGTAGGAAATOGGAA 
CrOGTOTTAAAA C 1 C I I A TAACATATGTCACT G TCTT A AGGGACAGTGTT 
TAAAAAOGCATACCTGGCOGGGOGCGGTGGCTCATGOCTGTAATXXX1ATC 
ACITTTGGGAGTGCOGAGGCTXXjGCTGATCACAAGGGTCAGGAGAATCGA 
GACCA rCTCTOCTTl ACACAGATGAAACXrrC AGTCTCTACTAG AAATTAC 
AGAAAATAATAAAAAATAAAATGTCTATTGGACTGAAAACAACACTAAGG 
TGCGAI I IOCAGTTCACTOGGCGGTAC1 1 1 1 1 1 

ACATTTAAAAGGTGATGCTAATACTTTAAAATGTTTAAGATATAGATTTA 
AAAAGCATTGTAAATTCTATACTCCAGTGTCGTCTACATGGCATTGGACA 
GGACA T AA TGT AAAACA TAAA AGTGCAA TTGTTT ACA CTT ACAT A TO A TT 
G TGAA TGGCAACXrrGACCCAni I IGTCIGAAGTTAAAATACCAAAAACT 
A 1 1 ICAGTGGTCTCIGGATTTATCTCTATATTGAAAAICI IGGTTAC1GI . 

attccaaa ci i i rci i g ocataocatatttqa i mcmrGi roTGCi i 

TTGAGAATATGGACTATT 
>Sequence 1302



OTDO I OGOAOATOCOOCAOGCAAOGCACAOAGGAOCAAAAGTGOOOCACA 
GACAGACAGOCATGTCOTTGCAGCAGTCCOTOAGACCTGTGTGCCAGTCA 
CTQAGCTGOOTCTGGTAOCAOCT G GT G G TGGOGCACTGGGGCTGACTGGT 
CACAOOOTAGGACATAOC 1 1 1 ba 1 1 1 CACC 1 1 GTCGTGCATCTCAAACT 
GCA T C1 roCTO GOCCTCAOP AG CTCGCGI IGGGGACOGCAGAAGTGGCTG 

CGATGOCAGTGA
> Sequence 1303 

GGTACTCAAAAAACAAAACAATGGAGTATGTOCT G TTGCTAGAAAAATTT 
GAGCAACAAAA T AAATAAAGTAGT A TA GGA TT A TG ACOCCAAGTATAAAA 
TAACCATCTATGAGTCCATACATATATAAATAAATGATTGAATAAATATA 



TTTATAGACAGCTGOGCTTTAAGAAAACAGAGCTACTOAGTOTOGTCTAC 
AATTAATGCTCOCGT 



GTGCGAAATCGTAATGCGTTAACATCTGGGGCGCCTAATGCGTAGACGTA 
CCTICATGAGTTACGGCTGGCGTGGGCTTAGGAGTTAOCTAATTCAAATT 
TATTTOOOC TOO O C1 fCATTTOATCAATlTTCCATACGCGAAAOCTOTAO 
CGOCATCT1TATTTAAGAAATCTTCCAATCGOCOGGGAAAAGCCGTTAGC 
TTA TTTOOOCATTTATCCG OATAOCTOGGT rATCTTTACT OCCTAACACT 
CTGOCI I ICOGOTTTOGTCAAGCAGTATTAGTTTACTTCAGAGOGGGTAT 
TTOGGTTTAOOCIXXsAAATCAGGGAATTOCAOCOCAAAAAACTTTrATOC 
•AATAOGOCroCAAAOGCOCTAAAAOOCTAAAAGGOCTGTTTATATGTCTC 
TTTC 

>Sequence 1305

ACACTGAAAACTGGACATTATAACATTAATTTTATTAi 
GAGCTACATGATGTIGTOCACTGAAATTACCCAAAT 
TTTOCTGGATGAGCTTCAGAAGGAGTTCATTACTACTTATAACATGATGA 
AGACAAATACIGCTGTCAGACCATACTGTITCATTGAATTTGATAACTTC 
ATTCAGAGGAOCAAGCAGCGATATAATAATCCCAGO icrcm CAACAAA 
GATAAA ICfTTCI GACATGCAGACGGAAATCAAGCTGAGGCCTCCTTATC 
AAATTTCCATGTOCGAACTGGGGTCAGGCAATGGAGIX^CATCA 
TCrGTTOACTCTAAAGGTGCTGGTAAGA rTTCTTCTGCl CAOCAOCOACT 



OGAACCAGCAACTCTGTCAOGGATIGTAGGATTTATCCTTA O ''I CI 1 1 1 A T 
GTGGAGCTCTGAATTTAATTCGAGGCTTTCATGCTATAGAAAGGCTOCTG 
CAAAATGATGGGGAAGATTTTAAATACATCATTGGCA 1 1 1 1 1 L I lO OAAC 
CACAGCCTGCCTTTAOCAAGTAA G I 1 1 II I I IULI 1 1 IA AGAAACACTTA 

ccattattgtttactttaaggatcaagtctaacaattgggcattttaaat 

aaattitaaaactccaaaaaaaaaaaaaaaagggagctoggoggaaccco 

cttaagggaattc 

> Sequence 1306 

GGT A CA CCAGTGG AGGACACGAA TTCTaTAOCTGT AGGA CA GTGCA TGG A 
GAAAAACCTAATGCCGGC 1 0 1 COC fCAAAAOOCIGGGGOCAGTGCCTOGO 
CTGTCA CCTCA TCCA TGCTATCA GTCT AC I 1 rOOCTCI I A GCCACAGAAA 
GCCCTGAAGAAAGTGGCATAAAAATGACCTGGCTGGGCACAGTGGCTCAT 
OCCCATTATCCCGGCA C 1 1 1 O GGAGGCOGAGGTGGGCAGATCACCTGAGG 
TCAGGAGTTCAAGACCAGTCTGGCCAACATGATCAAACCCCGTCT^ 
AAAAATACAAAAATTAGOCGGGCATGaTGGTGGGCGCCTGTAACCCCAGC 
TACTCANGAAAGTGAGGCAWGAAAA TCTC1 1 GA AOOCAGGAGACGGAAGT 

gttgaa i i iccci i rci i nS AAcroo cn i rcATOOTCCi iaaocattct 

C I CI I C AAATG O nC I A TOGATA 1 1 1 1 1 1 1 ICC 1 1 1 lOA GGTTGOACCTT 
TTAATCTACI PGGm nOOCTGGI ro c lO CAAAGTOTACCCTOGCCOGGG 
CGCCCGGTrrAAAGGGCGAAATTOCAACACACTCGGCGGCCGGTTCTAAG 



GGATCCCAO L rTGGTI oo caacttggcggtataaagggcataccgtgtoc 

TGGGTGAAAA fO I I I ll CC IIA OCATTACCO 
>Sequence 1307

GOT ACCCTTGTT ACA AAT AT A OC ATCA TCATCAGGTCTG AATGGGTTTCC 
TCTACCCCCGACACCAGCTGATATGCTAAATCCAAGTTCTGGATCCTTTT 
CAACCCTCACTCGAAT CICT I C TT rTGCCAOTTCATGGCCTTGTCTAOOA ■ 
GAACAATGGGGCTGTGTATATGGAGACTGGTGGGCCACTTTCAGCATCAA 
GTAATCAATTA O rTCTTCTC I A GAGGGATGCCTTGCCACAOATGCCTOAO 
GOGGGTGATGTATTTGACTATAATTTGOCTGAGOCCTGAGAGGCTGGOCC 
ATCTGTCCA1TACTCAAAGGCATCTAAGAAAAACATGAAGTATCTTAAAA 
TGAOCAATAATAATGTCTTATTTCAAATATTTGG All 1 C l' l C T TG OAGCA 
TTACAAAAGC ACTAGA GT1TT CACA TTCTAATTAAGTCAAACAATAOCAT 
GOCACTT ACTA TTTTTCT AT AA TTTTAAAACTTAAAAG AAAT AAGCTATT ' 
AATGTCTTAATTCTAAGTI 1UCIGA GTO C 1 IOI 1 U f T ACAC TCA C I Mil 

>Sequence 1308

TTTATTCGOO CTTATCGTGGTCGCGGCCGAGGTA C 1 1 IO GO G1 II 1 1 1 1 1 
1 1 I I ITim GGGTCACAGGAGTOCTGACTGGGAAAA OCCTGAGCTACAA 
AAGCAAGATTTTACTCAAATTAATTATTTCACAGACAGACTGGAGATCAC 
AGGTCACTGAAAAGTCATTTCACTOAACAGAGCTAAGGATCTAGGATAAA 
TTOTAATAACAGCAAAGTGAAA I Mill I AAAGAAGAGCAAAACTCAAAG 
TCAAAACATCACATACTCTTATGCCrrrGGAAAAGAAATAATAAAAATAG 
AAA 1 1 IGCCTOCATCAAAATTATAATACTATTTCTGAATTCAGGGAAAAG 
ACAGTGTTATTAAAGGAATTAATTAAATATATCAAATTTCTACTCTATTA 
T AAACATACCAAG AAAA TGAAA CAAAAAATTAATTAAACAAATTATTTGG 




ttccaagggataaagattacaggtogtoga cl i nTGCnrn 

ATTATAGAAAAGAGOOCCCl 1 1 lOG GCCATTGGGGAAGGGCAAAAAAAAT 

tgggggggtgggctgcttt 

> Sequence 1309 
ACTTT 



i i iuniiiimmiicuim i iiiiinni 
ACCCAAGGGGTrAAGTCTTATTAATTGAACGGCCTTrrAAAAAACTGGAT
TTAAAAAGOTTATTAAAAAAAAAATTGGACCCC G 1 1 1 I G AAAAAAAAAAA 
AAAAAAACCGO 1 1 GGGAAATTGCTTATTGAAACQAACCTGGGCAATTTTT 
TAATAAAAOOCTOOCCAAAGOGGAGAAGGAATTTTTAATTTAATITTTCA 
AAAAACOCAAGACCTTGGGCGGGAACAACCTTAGGGGGAAATTCAACAAA 

atggggggccctaattagggaato^aacttgggaccaaagttgggggaaa 

T AAGGG AAAAAGGGGTTCCTGGGGGAAAA TGTTATTCGCTTAAAAA TTCA 

CAAAAAAAAAGAGCCGGAAACAAAAAAGGGTAAACCCOGGGGOGCCAATG 

GGGGAGOCTAOCTCCATTTATITGGGTGGGCTCTATGGCCCCTTTTCAAG 

GOGGAAAAACTTOTGOGGOCACn G nTTATAAAATOGGGCAACCCOCGG 

OAAAAAGOG Ol 1 1 I OC 1 1 1 1 l UG G CC I 1 11 ICC I I I IC1 lOA TAAAAAAT 

TCTroGC CTCGAA GTroGGTTGGGGGAAAGGTrTAATTT^ 

GAAAGOGGTITTCCAAAAATTGGGGAAAAOCCGGAAAAAATTGTTAAA 

>Sequence 1310

UJC I I II^GCCGCCTIOOOOGGC AGGTACATACCCTTGT AGGATAAT^ 

CCGATTGCCTAAGGGATCOTTTGACTCAGACrrAGTGAGAGGTC 
GACATOCTTACACTCTGOTATCATGCTATATG G I IOI HjCATOCAGTTOT 
ATAGGGATGGCTGAGCATATGCATAAATTAGCXTTCACTGTATG ACAACT A 
TCTA TATAATO OTTrAATTATACTTGGACGTOGACCCTGCATATACACTA 
TATA 1 1 [CI I UGAGCTATCTTAACTGCGAAGAAATTGTAAAATTTGAGAC 
GTCTCfCATGAAACATATTGAGATATTXrOGATAATGGA 

rccTCcntn iggtataaaatatgcacactgagctcaagcgcatctaaca 

CTAAATATGCAATTGGTAAGTT ATGAA J C 1 C I C- 1 1 Ci I OCATATATAATAT 
ATACATOAOTGGAOTOTACAAOTGGUI rCTCTCTCTOTOTOACAOCTAGC 
ACACTTATATGCACGTCCATATATATACCTACATATATATGTCAACAGAT 

gcocatataagacatatatacctaggtgatahgtctatgcatagagtttg 
agagagtcacagtacatcacttgttjacatatgcagtacgagtgtatacac 

ATaTATATGGTTCCACATGATGAGATCTAAGACT ATA TGGT ACTTT ACAT 
A TGTACC^TGCGCGATAAGTCGACACTCATGNTGGTO A TT A TGTAGTGTGT 
TATATGTCTACTTACACATATGTGACTAGATGTATACACTGACTGT AAGC 
TCCTCXATACATGTGAGTAGTGTACACTCTATCCi I G I G CGTAGTACACTA 
CGCTTXT ATATGTGGTCTXj TTATGTA TCTI CTU I CATACATAT AGGTCGC 
GGTGCGCGTT ATATCTTACGTGTG TGATATACCGHAGCACTCGTGAGAT A 
COCAATGGCATTOGTO ICG I GACATATCATTTOTQACTTATOTAAAQKTA 
Q^TATTAOGTGTOOTOTCATAAAACAKTOGATATGTCTCAATGTCTCrAO 
TCACTATTGATACCACACACGACA roTCl GG I G CO O OA TtSATAAATOACA 
. TATTATGTTCTATATAANACATCTCCTAGCACTACGTCATGATTGACATG 
CGAGT 
>Sequence 1311

ACAAACT AAAA TTA TGGGAGAAGAAACTATGAGTGAAACGATGaGAAAAA 
CCTAATGCATGATGTAGAACTGAGTGGTG TTAATAGCAGAGCACTGGAGG 
GAAGGCCCACAAAACTCTTCACCCXIAAGGTCT ttctagaatca 
ICCTA CAAGCCrAGTTlTCATOAOATTCAOODCTATTrTA 1 1 IC1 FCCTC 
TTG GAATT ATA TOAA ATTACGAA II ILIGIUIUI I O I CAOCTGTAATAGA 
ATAOOCTGOAATTT TATTTACTTTTAA 1 1 HOI 1 1 ATTTATTTATACTTA 

t ctgccatc i 1 ci catgaaaaagaggcagt atcttaaaagtttgagttca 
• gattttctg atotag ataaataagctaaagaagocacggtoaagtotoat 
at atgaoaai t 1 txlaoagcagggtattcgtaacttctraagtatttaxjtoc 
aaottooctcicocaacacattttacactaflaa 
AT GTGG I ggctcacgoctgaaatcactttgggaggccaagocaaotogat 

G GCTTG AOCOCAOGAOTTCAAGACCAGCCTGOGCTACATGGCGGAACCHC 

atctctacaa aaaaatacaataaaaggtaactgggcatg g t gg t g I G I GC 
ctrjtattcccaactacttgggaggctaaagtggg 
agocaggtcagoctgcatggaaocatocttgtaoctc 



>Sequence 1312

ACAGTAAGCCAAGATTGTGOCACTGCACTT^ 

ALIC IG 1C1AAAAAAAAT AAA TAAA TAATAGAGGTOAATCTCTOCATTAO 

GATCAAGACAAGAAOAAGACAGACAATCACTTTCGAATTCTGAGACTACC 

TtXAAGAATCATCCACGGAAOGATGTCAGCCATTTAACCAC 

TCAAAAAGGAAAAAATACAGTCAGTGGACAAGTAGAAGAGTCTCCTGAAA 

AATATCCGTA TTTC AAAAGGCAGCAGGAGTTGATAGAAAACATAACT AAA 

AAAGTAGAAGACACTGTTAAATTTGAATCTGGATCCTATAT 

>Sequence 1313

GGT A C IIHIHI1H1I1IHII11M O GTNA lllllllllllllllll 
M M 1 1 1111 1 1 M 1 1 I I 1 1 1 1 1 1 1 1 1 lAAAAAAAAAGGGCAATTTTAAA 
AAAAAAATGTAAATTro CQOG GGTAACCCCCAAGGGAAAGTOCCTGACCC 
CCCCCAAAAAACCAAGGCI I ICOC1 1 1CCCCAAATTTACXCU11TTCCAA 
AAACCAAAGTTTTAAAACXTTTGAATTTTAAACCCCCCTTCT AACOGGAA 
AAA 1 1 1 1 1 1 CAAATCCCCTTACGCCAATITAATATAACCCAAAATTATTT 
TTTCTAAAA TAAAAATTACCCCCCCCAA TTTAAO 1 1 1 1 1 1 I A GCCCAAAA 
TTGAACCAAATTAACOOCCGGGTTCTAAAAOCOCAATA I OC I G 1 1 1 1 101 
ACCCA TOCAAT CA^ 

GGCA TAAA CAAGGCCA TAACC I ITfCCI rGGGAAAATTTTTATCCCOCCA 
CATTTCCCtCCACITACTGjro 

TTTTCCACAGGGGAAACC 1 1 ICG I CO0 CACTTTTATTAATAATACGODDC 
CACACCCOGOAAAGAOCCGGTrACGGTTATTGOCCC CC TT T I CC GTCTTC 
CTCCTACAAGACT 
>Sequence 1314

CCCTTGAGGGGCCGCOCGGGCAGGTACCI 1 C I lA GAAAOCTAGACTCCAC 
AGAACAC 1 G 1 1 1 GACA ACCACTGC^GTAOAACATAATATATCAAGATTCT 
ATGAGTGGGI TTC1 1 TCI ~t CA TTTTT ACATGTTGTAGAATAACATOCATA 
ATCAAAGCTAAT AATA C I G I G 1 1 1 I C I l lA C I CII 1 1 ATTTGOCTCTAAA 
GACATCCACACATAGTOGTGAACTTiATTTTrAATGCGTITrAAATAACAA 
GCA TTGAAAAA T A TT AAT AA TTGT AGTT ACT AAAAGT A 1 Tl CTCTTT0 CC 
ATTCTCl IA1CIGIGI 1 1 CCAGACCGGTTGGAGGGTGACAGA TCAGAA TG 

cici gg i caagagaatgaatatgaggatgaggaataataaa c rcrci I IG 
ocaagc actta aatgt tctcaa atttg tataaoacatttattatattttt 
ttc titacaoag ctttagtgcaattttaaggttatg g 1 1 1 1 log agtttt 

ICCCI 1 1 I TTTTGGGATAACCTAAC AI IGGI I ro OAATG AT T CTGTO CAT 
GAATTTGGGACATTGTATTAAACAAAACT ATCAGAATGTTTTAAGACTTT 
TTGCCGTTjTATGAAGAGTGCT AGAAAA TGCAAAGTGCCATA TTTTCCCTA 

accttcaaatgtggaaccitgattcaatggtgaaaataatttc 
tgaa atgtt0c1 icaaaataattctacac1tgcatttgoa a1g 1 1g1 icc 
ttttatataaagaga ctgg ttgt 

>Sequence 1315

occi 1 1 gcggccgccogggcaogtacai rroci goaotttgagaccagoc 

TGGGtAACACAGTGAGAC OCl OTCTC I A AAAGCATTAAAGCATTAATCCT 
CGCATTT CGATAGGGCrA TGTAGCTTTTAAGTAAGCAATGTTAGAATGAG 
TTGTAGAGTTTT A 1 1 1 1 1 G I GAATATAGTOAGTGACAGATGGCAATTACA 
TGAGGATATTTGAAOGAAGGTACC 

GGTACCAAACAjCACTTATTATTCTAACATGCATCAAGT 
GGAGAGAGGCTGCGG IG1G1 GGGTAGGGGATGCAGGAGAAGCTGTGTAAG 
GTAGTGGACAGC 
>Sequence 1317

ACTTNK AIGlLilllllll l lll im illimil 1C1I I 1UU1 1 II 

ttttttaooctga ctcagaaa ai i i i i i i aatagttacaaaa i rmm 

1 1 1 1 1 1 1 I I I 1 1 1 A CAGAATCAGTTTAAAATAGOGGOGGA 1 1 rCTCCA TA 
A TT A TCAAAA 1 1 1 1 1 ! 1 1 ICnOC-CJ! 11 iC^KTAJ^GC-GGOCCTC.*-*-*. TC

CTGTTGATATlTCGCATOCTTACCOCTTATGAAATCAOCCCTTOCrT AAA 
^^^A^AAAAAAAAAGTTCCTTGGGCGGGACCACCCTTGGGQCGAAT 
> Sequence 1318

CATCAATAAOTTTATTTATGTTCCATCACACAATACTTACACAAOCATTA 
AAAACACATX»CACACOTOTTTATTATaOCATACATACAAACACACATACA 
ACTTAATATTTACAACCACATACAAOCACATACAAACATATAAACAACAA 

acaaca ctaatt taacatacatacaatacttacacctta ujI rm ig co 

TTTAGCACTI GTAG A GGTA GATOAOGCGGTOGGTOTAG C I 1 1 ICO I 1 1 1C 
CTAATCTAAATTTTGGTTTGGOCATTAATCC I OC 1 ICIAGTAAAAATTTC 
CGTCC I AAAGO AAACTO A TCTAGG TCTGCAG A AAA C 1 1 1ICCI I I A AATA 
TACTC OCCAAAA AGAGTATTTTTTAAGCGCAT X^I 'I C l lT AGGTGCTCCAA 
CTGTA I ION I I GACAACACATTGCCTCCG ATCTTACAAAACCTT AACTA 
Tci 1 1 1 A AGGGGCCTCCltKjGGGAGGTTGTAA ACCAAAATTTCAGACTTC 
CA AAATAGTGG AATGCATAAAA IP 1 1 1 1 1 I A TTTTATAACGTTTTCAGTT 
AATGTTATTrrCAAAATTGTAAAAAAAACTCTAAAAATITTTCATTCT^AT 
GOT OOAGOTA CTTXXXjCOGCAAOOCTCTrAAOGCTAATTTAGCAAACTTG 
AGGCCTTTCTAAAT T O 1 1 C Xj AATTAGGTCTCTAA I C I I G II ATAAATTAT 
ATATATCTGAATG 
>Sequence 1319

GGTACATGAAAACATCAGTCTGACAGTTAATATTAAATGTCAACTTGATT 
GGATTGAAGGCTGTAAAGTCTTG1 1 IC1 UQGTGTGTCAGTG AGGGCGTTG 
CTAGAGAAGACTAACATTTGAOTCAGTGCACTCGGAGAGGAAGACCCACC 
CTCAATATGGGTGGCX^CCATXXACTCAGCTG<XAGCGAGGCTGGAACAA 
AACACGAGGAAAA/^^ 

itrn i ^j GOCTT TOGACTCTCAGACTTACACCAGCGGTITOOOGAGGGCTC 

ttgg occt ttoo ccacagactoaatgctctacaotgttggcttccctact 
ttixiaggccttixkjactxxfgactgggocactactagcttcc i l u cic c ic 
agcitgcacgtggoctataatgggocticaccttgtgaacatgtgagcca 

ATTCTACTTAACAAACGOpCCTTTATACA TACATATATCCTATTAGTICT 
GTCOCICrGGAGAACOCTATACAClCGATAAAATOTCAATAAAATTTTAA 

>Sequence 1320

OGTAcn i i i 1 1 1 1 1 1 n 1 1 1 1 u i u 1 1 nci i u m 1 1 1 1 n 1 1 1 n 
iiini mii i i iiiimiuimiuuniiiiA A iui iiiiia 

A^'^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATAAA 

AAAAAAAAAAAA ^AAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAA 

AATAAAAAATACAOCCATACCCAATTATTTCAACTAAAOCAAAAAAATAT 

■nTNAAAAAAAAAAAAAAAAAATAAAACTTAAAAATAAAAAAAAAAAAAA 

AAAAAAAAAAAAACTAATTTAAAAATAAAAAAAAAACATTTAATCCTTTA 

TAAAAATAAAAAAAATTAAAAAAAATAAAAAAAAAAAAAACAAAAATAAC 

ATTTTCAAAATCTTCATATOCATATAAOOCTAAOCAAAATAAAAAAAAAA 

AATTAAAATTACAAAAATA 1 1 NCI 111 11 1 IAAAAAAAAAACAAACCCCA 

AAAAAATCAAAAAAATATACAAAAAAAAATAATTTTTAAACAAAAATCAC 

^jTCCAAAAAATAAAAAAAAAAACAAAAATAAAATAAAAAATAAAATCAA 

T A CTCAT AAA T . 



>Sequence 1321
GGTA C1 1 II 1 1 
HI 1 1 



rm 

Tl I'l 1 1 1 1 M 



HIlllllllUllllllCllll 

iiiiumin 



iiiiiii 
imiiiiiii mini 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lAAAAAAAAATCC 
CCCAATTTAAAAAAC000O0COCAATTTTAAAAACAAAAAAAATTTTTTA 
AAAATTCOG AGAGGGG AAAAAAAAAAAA AAATTTTCTAAAAAAAAAAAAA 
ACCOCGGGGlil 1 1 1 1 1 1 1 1 AAAAAAAAAAAATTTTTTAAAAAAAAAAAAA 
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A AAAAATAA rn I T 1 GAAAAAAAAAAAAAAAAAATTTCAAAAAATGGAGA 

TCI 1 1 AAAAAAAAAAAAACTAl 1 1 1 1 1 1 1 1 1 1 1 1 I lAAAAAAAAAA 
AAAAAAAAAAACCCOOCTCATTTTTTTAAGAAAAATTtXKKKKKMAAAAA 



G A GG TT AAAG AAAA TA TTTT A TAAAAAGA CCCAAAAAATTTTTTAAAAAA 
AGAAAACCACAAAAATTTATATAAAAAAATATAAAAAGGATCTTCTN 
>Sequence 1322

ggt acag agc1tcttcctattaagtgccta aactataggcaaactttggt 
gttcccact aaaa cacaa g a gcctcaca ca atta ggaaaaaaaaatcaaa 
agaaacaaggaaactgagaatggaagttaotctaaatctctgcatttggg 
gagttgtcattaactxxagagcccagcatagtttccatggagccctgaag 
ggaggggacctcctgccacaaagag 1 1 1 co 1 1 cc ag ago agtcgtagcag 

TGGCTGTAAACAGC A TTGGGG AA G AA GTC A ATGTCTG AA A ACT AATTCCT 
CCAGGTTTCATCATG ATTCTACGGG A AG AG AAAG AG ACTACAATTAGCAC 
CTtTAGCCATGGGGCAGGAAAAGGGGGAGGAAGGGACAGCAATGCTTrCT 
GGTCTCCTTAAGGGAACAGGGTTCTACAGGT 
>Sequence 1323

Aciin unnii i ni i uii i niiic iiiiii i iiiiiii i ii i i i 
ti 1 1 1 in u n 1 1 1 1 1 rj'iTiTni 1 1 1 1 1 1 1 in n i i mi mm 

i I 111 I 1 TTTT AAAAAA TAAAAAAA A CCAAAATA AAGGGGGGGGAAAAAA 
A I 1 11 1 A AA A AAACTTTCCCAATTGGGGTTTTTTAAGGG AAAAAAAAAAA 
AAAAAA AAGGGAAATTCCCCT AAA AAA TTTTTACXXXXXXXXXTTTCCAAA 
AAAAAAAAAATITT TTTTAA AAAAAAAGGGAAA C 1 1 I HI 1 1 1 AAAAAAA 
AA AAAA AAAAAAATrrTTTGGTTATTTTTGGGAAAGGGGGGGGAAAAAAA 
AATTTGGGGGGTTGAAAATTTAAATGGGGTTTTAAGGGGAGGGT7TACCT 
TGGGGGGGGGGGGGGGAAA AA CCOCCAAAAAAA 1111111111 AAAAAAA 
AAAGCtXAATTTGGGCCTTTGGGCTAAAAGGGGCCCACCCTCTCCrTAAA 
AAAAAAAI I Id 1 CI '1 AAAAAAAAAATTCTTCAAAAAAAAACGTTTCTAA 
AAAAGAAGGTTGTIGGATAAAAAAATAAGCGCCAAACCCTCAAAAATTTT 
GGGGGGAAACTN 
> Sequence 1324 

GGTACTTGGTTT AGTTATGGC1GT1 1 1 1 1 GCCTCTAACACTTTTATTTTA 

AAAAGAAAATTAAAATAGGTTATTGGGATCAAAGATATAGOC I II I IP) 1 

A CTTTOA ATGATTTTTOTAATTCAGAATATXX^CTTGTTATTTC 

TAH 1 1 fATAATTATTGGTAGAGTTCATCTAATTACCCTATAAATCCCTG 

GAOAAAGGTGGCCCCCATATACTTT A 1 1 1 C 11 GG 1 1 A TATGTATAAAAAT 

cagtaggcaatgtaaaaa mi t rn gtgtu aatttattjTGaottataatt 

CTAATTCTATCTCAATATTGACCTCACATTAOCACATOAAAGCTCAGTCA 
OCAACTATGOCTCATACTGAAATACCCACTGATTAAATCAGTTGACAACC 
AGCrXXTTATCGTACCTGCOCGGGCGGCCGCTAAGGG 
>Sequence 1325

GGT ACTTATTTTA 1 1 1 1 1 1 1 1 1G1 1 1 1 U AGGCAAAAGGTAAOGTGAAAAA 
GCAGGCATGGCATATAAGCAAG i. 1 1 1 1 1 1 1 A AGGCTGAGTG ACTT ATGTG 
GCTGATAGAGGAAGGATAGGAGGAAAGGAAATATAGTGAAAAGGAACAGA 
GAGGAATAATAAAGCTGGCAAG7CACAGACAACATAATTAGACTATCAAA 
AGAAAATTTGGAAGAAAGGCATGG ACAGGAATAAAGACC I CC 1 1 C I A AAG 

caaggtagggagagcaactogatgtagattgaagagaaaaaggaaagaaa 

aatgaatgttcatattigat(kioctatgattatctgg<ncatgtaaaagg 

caa agta acatcttcaagtoatgaaattgggtaactttaogataatoato 

ACaGTTCATACAAOCAATTCTXjAGAATAGAAGAGAACACTGACTATATGA 

atagaaggattttagaoactttacgactcagccgagtttaaattaaatat 
tootaogcattcagtcaataagggtataca 1 1 1 II I icc gacagctttag 
ccctttatgaaaatagagcctaagaaaaaaccocaaaogtatgggtaact 

ACCTGTG AA TC TOG Tl AGAACTGTTAGAAGG AACTTCCTGCAAAAGTTGG 
TAGTAAAATGCAAATTTAATCAGGTAAATGTOCAACATTGAATGGATATT 
GCTTAAACCACAOCGTAAATTTTTGAAAATGTAAACTTATTTAGAAAATA 
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AT7TAATTTTTAAGGGATT 
>Sequence 1326

A cgcggga TA ttt ATTTACAAAACACTTCATTATTTA taaag aatttact 
AACAGTTT ATCTTATTTATACCCATACATCTrGCTACTTTOGOAOOCCCTT 
TACAT AOAAAAC AOCA1 ICI 1 1 1 1 OCCAAATATGACCAAATTACTTTTAT 
TTATAA I I 1 1 1 GATTTATGTTTCAGCTAGATCTAAAAAGCATCTG AAGGA 
A ^^A CAATGAAAGATACCTATCCAATAACATTTAGGATAATCTTrGACA 
^"J"J"''GGAAAAATAAGAATTGAGGAAAAAAGTGTATCriTCAAGTAGATGC 
AAAGCATTATAATCUCTGACACTTGTATCTAACTCCAGTCTrACAGATAA 
CT AAGGCAAAAAGCTAAATAAACAATATGTAAOCTCTAACATTTOGTAAA 

aggaagtatactggtctgrraocaoagacaaacl i i i i i i agaattgaag 
tctgaaacaaacaaaagcaattcaatgtcaatagacattaagcaacataa 
^^^^^^^^"^tcctaagggaacatttcttacagctgcjoctttoctca 
actgtgct' i iggaagataagctctgtcctoaatccaaaccaagct 

>Sequence 1327

TATACGGCGAATTCCAGCTOCACCOCTGCGGCAGCCGAOGTACATGCCGT 
GGAAG AGACTCAAGTAGGAGCOOCTGCCCGAGCTGATACTAGATGTGAAC 
CTTTCACCATGAAAATGTT AAAA GATATAAAGGAAGGAGTT AAA CAA TAT 
GGATCCAAC I CCCCTI ATATAAAAACATTATTACATTCCATTOCTCATGG 
AAA TAGAC TTAC I CCI I ATGACTGGOAAATTTrGGCCAAATCTTCXXTTT 
CATCCTCTCAGT ATCTACAGTTT AAAACCTGGTGGA TTG A TGG AGTAOCT 
GCCCG 
>Sequence 1328

CCGGGCAG GTACCGGAA TCTCCAGATCGCCAAGATTTIO'ATAATGATXX; 
CCTCCTCACG 1 1 1 C I C I GG AAACTGG IT G I GAACTTCGGAAG AGGCTTCC 
GGAAGGAAGACATAAATWWOCNANAOGAGGGGGGACATAGGAGCTOCAC 
^ A C>IWTKTCTTCTATTACTCGGCANCCCCCTGCAAGO C I C I C 1 1 CA TCTO 
GGGCXATTCrTCAGCAATNAAOAAGGOCAAACTCTCCAAAGTTCATTTTG 
OGTAGCCA O AACCAGGGGGCTGCCACTTCTGGCAAGOOOCTGGGG AGCOC 
AGGCAAGGCCTTX^TXXlAAGAACCTCTTGGCCAAAGCrrTGAAGAAACXjAA 
CAATCATATGCCTGNCATGGGGAGG^ 

CTKK K^GT OAAATTTTOGATTATCAAAGGCTrAATlXXjAAATAOOOGTCC 
AACCCTTOGGAGGGOOGOOOCXXXGGGTAACXXAAGCTTTTTCGTTT^ 
TTTTAGTIXSAAGGGTrGTAAATTGGCOGCCCCTITGGCGG^ 
TCAATAGGCTGO 1 1 1 UUCIO IAGTCGAAAATTOT1TATT0CGQCTCAACA 
ATTTCCCACA CAA CCATTACAA GCCTITKjGG AGCCAT AAAA GTGGT AAAA G 
CtXTCGGGGTGGCCTAAATGAGTTGAGCCTAACTT^ 
TTGGCGCTCACCTOCCUJGCI 1 IC CCAOGTCOGA 
>Sequence 1329

^^^^^^yTTCXSX^TTCAG€^TCAClTCCAGAGATCTAACAATAGGTGG 
CTCAGCTXXXATCTATGTCAAAAACATTCnCCCCCG^ 

AGGATGGCCGACTTAAGGCAGGAGACAGACTTATAGAGGTAAATGGAGTA 
GATTTAGTGGGCAAATCCCAAGAGGAA O l Uk 1 1 1 C O C l it 1 1 UAGAAOCAC 
CAAG ATGGAAGG AACTGTQAO CC I ICIUGIli I lU OCCAGGAAGACGCCT 
TCCACCCAAOGGAACTGAAAGCAGAAGATOAGGATA 1 1 G I ICI IA CACCT 
GATGGCACCAGGGAATTTCTXlAC^ 

ATCroCAGGOC 1 I UG IG 1 CAGTGTCAAAGGTAACCGOTCAAAAGAQAACC 
AOGCAGATTTGOQAA1 C 1 1 1 O I CAAGTCCATTATTAATGGAGGGGCAGCA 
TCTAAAGATGGAAGGC 1 1 1 COG 1 UAATGATCAACTGATAOCAATTAATGO 
AGAATCCCTGTTGOOCAAQACAAACCAAOATOCCCrGGAAAACC^ 
GGTCTATGTCTACITCAGGOCATAAAOGAAGAATCATOOOGCCrTCC 
>Sequence 1330

ACXXrTGTTTTGATAGTTGACTAACACTGACCTGTA^ 

CTOCACTTACTT ACACTATCTTAGGTAAATAAGACrTTTATTCCTAAGTG 

TGAA 1 1 1 rCACAGGAGGAOAAATCTGGCAGATAGATCCTCACCATCATCT 
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GAACACTCG AACTGGA CTTCCTTTTCTG AAT 
GAAAAGAAAAAAAATATGACCGG 
> Sequence 1331 

GGTACTCTTTGCATTAATAAATTAAA GCTCCATA' 

CTGTGTCATGCCCGCCTCTTCACGGGCAGGTCAATTTAC^ 

AAGAGACAGCTGAACCCCCCGCGTACCACTCTAATCATTAT7CCCAATGT 

TATGATTACATroACAGATAACTOCAGTTTroCTAACCTOAACTGA 

ATGGCC ATAATA TGTTGTTGATTCATGGCAAATGGTGATGTtmSAGTTAT 

GATCCICI 1 1 1 1 C I C ACAATCGTGGTGGAGGCCGGGAGCTTATATGTTTA 

TTTATGTATGAA TGAC GATAGTAAGAGATGGCATATAATCAOCaGACTGA 

tcat att ggat tctttggggaacggagccggaagggagtaaacagagaag 

cttg actcttt a tata tctgt aa tctgcggcttttt ac aa tg agc a tggt 

attttaatatttttaaatatctgattaaoaaacttatgaaagagccgtnt 

tttgaggtttacrrgctaaaataacacttaaatgttatttrttaaacaatc<: 

aactagtctx#ggtgaaagagaccataaggcoctttaaaaccatctattgg 

actcagggaaaaccatgctcccaggggggaatgaaatxttagrggtccttt 

agt aagtcttraaaagacccttcaaaaaattttttgtgttcactt^ 

TAAC CCACACCCTCTTCCCAAOATTOCCTAAAOGOOTGOGGATGGTCXKjG 
CTTTATAATATTrCGGCAATGGAATTTGTXKlATAACGTTT^ 
AATCTTTGGG 
>Sequence 1332

actg gattt ttocaagccctctattraaaattccccagaaattaaataag 

gaggctttggagggaggaatgoxtagacaaattgtckjagtgggtttgtt 

ttgtttatggagatggtctttaaagtctaaattxrt^^ 

gcccaattgaagaggggctgaactcagctgggagggaggggatggttgtc 

ag cctaca gcttttagttgaaaccaagtccattctggggccaagaagctt 

ccatttttagcaaagagagaaaggcgaaaaatatacaaacxncgtacctc 

ggcgcgcgaocacgctaagggg 

>Sequence 1333

acttaattcattctac 1 1 1 G I o r I aactatctttttatgtgtaggtctca 

TCAOCCCAACCAGACTATAAA 1 1 CC III G I CATTATTTAAA TOC A TGCAT 
GG AACTCCCATAG ACA TCAA CCAATCACCAAT A G ACA A OCCTTA GAACAT 

gtattacaggaaaaatagagtaacacatacaactaatacagaggaaoaac 

AATTGACATTAAAGTAGAAAAAAAAATTAACACTCTTGGAGTCTATAGAA 
AAATtn-AAAGAGAAAOAaAATTGAAC^TAATACGTCAACTTAGAAATATT 

CTTTTA AAGCOATTGTTACA ACTCTC TGAOGTGCT^ 

rici 1 ggcctgagactgaaacttrtattcagccattggctgggtaagag a 
atcaattaa gaga ttaatgcatcgcx>gcataaacagaagactoccotggt 

GAGAOGTAAC I I IG I GACATTGTGCTAXXTITrTCATATGGGGTGTGTTAA 

GGGCTGCAAATAAATGTTTAGCATTGTAG 

>Sequence 1334

GGTACAAAGTTCAACAAAG 1 1 lOI ICI I UA TTAAAAAAAAAAAG AA TGAA 
TATCTAATGTATAAACAACTCCAACTTAGATITCX^AAAATCTTGCATTC^ 
TTCACA1 1 roTQcrrcn I C 1 ACACAOCTGTCATTTACATTCCTAGGCTT 
GTATTTCACTATGTAAAATGGGAA TTT AATCTTT AT AAATOAGGCATTTA 
TGTAAAAAAAAAAAAAAAAAGT 
>Sequence 1335

ACAATAAAOCAGOCAAAGAAAATAACCAGTTAGCACTTAAATAAGAATCT 

accatgtaaaaaacacaotatoooacactacaaogtagtatttatatatt 

TTTTAAATGACTGAGCTACAGTACC 
>Sequence 1336

C^tTAGCCCCCCCCCCGCCACCTACATCTATCTGAOCCCAGAGTTACCC 
TTTTCTATCATGCOCXXXTTAGGATATTGCCrGGGGACACCTGACAACAGA 
AAGTCT AAGGTTTTCATCTAGGATTGGGAGTT ACCCCAACACCAGCAGGA 
TTAACrOAOCOOATGO 1 1 GCC I CA ATCTGTTGATTCTTC 
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AGTCAGTTAGCTCAGA T l ITGTCCAGGAACAG 
ACGTATTOAACTCTACCAAOGCA IC I CCTCACTACAAAACTCCrGCAACO 
TCGTCATAGCAGAAATTCTTGGGAAAGTTCTCAGCATATTAAAAOAGAAA 
TTTTTA 1 1 1 CC II CATGATCCaCTOCTACAGGGAAAAATAAATGGCAAAT 
GAACCCATGTATGTCAG A CTCTCTAAT AAACATCAGTGAGA TCACAGTGT 
CAAGAAATTTCAGCCTGAATTAAAQATA* 
ATAGACTTAGAAA I ro iC OCl roOCOOCAOCAOOCTAAGOC 
>Sequence 1337



GOTA C I I II 1 1 1 1 1 1 J 1 1 II 1 1 II I I || I PC AAACCTTATAAATAAAAAG 
TOG TATGC C AGTAA AGTTTCAATTTACA I IICICI fCTOA ATGAAACTOA 

ocattttdca una ic ctaoattcttaogaag cc i i iota tctoooat 

ATAAGTTA Cll I C I CC I I C H rOTCA TG TTGI 1 I A A C 1 I rCCACTTTCI I 
TTT AAAA OCTGCA GT AAA TTTT AAA I C 1 1 I IC ATTCA CTG C1 rCTGGTl 1 

tcaaatc acatacagaaagaatctooccagtcacagggtgtgaccacact 
C1GI rCTGGTGC I ic i atggcttcatctttcacatttgaatctctgacgt 

AG TTGG A A TTT A TTCTGGGCT A T AAGGACCXXi ACTTT A TTTT A A GAACAA 
AA 1 1 1 1 1 1 1 AACAAATGTTAAACTTAACTTCCTAAAGGCAGATTATTACT 
OCCACCATOTXmiACrooCATOTCr ATBI 1 IOC 1 I AOGAACA I ICI I CC A 
GAAGAATTTGCXATCXnGAAAGGATGATGACTCAGATCG^ 
TCTTCAAACATTATTGTAATATAGN 
>Sequence 1338

GGTACTTTTGGTAAAAGATTTTAAGAAGGCATGGGAATATGAATTTCTCA 
CCTAAGTTTAOAOGGTTAAAGGA ITS T GTI A AGTOAGOAAGOAAAAAATC 
TAAAGGTTTAAACAAGTTGTOAAAGGTTTATAAAAAATTAATGTGTGCAA 
ACATA TOGGCTAAAGTTAAA G AGGT A TTA T 1 C 10 M 1 1 POCATAAATTOA 
ACATTGGAATAAAAGTGCAACAGAGTTTTOCTAAATCA TTGTTCTGC TC T 
TTAA CAAAAAAAATATTOTAAAGGGTTATAAAAGGTTTATAAGAATCTTA 
CCTTATGGACAAACTAACTAAAACTGAATGGATTTGTAAAATGCTATTAA 
ACTAAATTAAGGCTOOACOTXXnTMXrTCACACCTOAATTTCAGCACTTTG 
G<MAGOCGAGGCAGG^CGATCACTCTGATCTTAOGAGTTTGAGACrAGCC 
TGCCCCTATGGTGAAACACT G TT CC 1 C I AACAATATGCCAOOGTCTTCCG 
CTOGCATOATOTCCAOCTGC1 IOI lAGGATOCOCTAGAGAATTCCCCTAA 
CIGTTATOCTrroATCGTOTCTTCTN 

ACTAAAAATTTOCACTATCAGAAGATCCTtMTTAAAATAAAGAAATACAT 
AAAACTCAAACAGTAAGTCAATOTOATTA1 HOI I ILA TTTCAGAAGATC 
TATOGXTTOOCACTGOOCGCCACA CGTGTCTCC I OG I IC ICAACQAAGTGT 
GAOCAOCTCFICIUAAGAOGTAGGGTCAATCGOGACT G TGn GI C A AAGT 
CTGCCTTCOTr^TCCCA-n 

TOGGG^GG^ AGCACGCTAAAGG^^ 0 ^ TTGTGGu iiLiA.ni lGGTACC 
>Sequence 1340

OGTACfTTTAACTATl I OI 1 1 C T TCTACOATAA I IPG 1 1 ICTTQTOACTT 
TATCTACCTAGAOTAAATTTTGCCAATTTGCA 1 1 1 1 ICTCAAAATAOTTT 
TTOAATT TAT roTOTAAA ATTGCTCAAAATAOTCAATTTAAACAAATTTC 
_ ' V i i . ''^CTATT^OOCOCTTGTCATTTAAA 1 11 1 IO I ATTTCTGCTTOC 
TOOOGCOT 
>Sequence 1341

AC1 I 1 uACTA TT m TAOCAACAAATTACTTTIGACACACAGCACAATTG 
ATTTAACACTTOCAATTTTGGAACTATTGGATAAATAATGA TGGGATTTA 
AATAAAGCAATCCOATTCTACTATTACAGCATAGGGTCTC 1 1 G 1 A OTOCT 
CTTAOTAAAAACTATTOTOACAC I ICO I ICI 1 1 C I 0C AAATATTOOO0CT 
GOAAAGACCTAAATACAATOCAOOGATTGAATCAAATTCACACATTTTTT 
TTCCTACGGAAACAACAAOCTl ICI I GCTTATATTTAACAAAAACTAGTA '- 
TAGATT 
>Sequence 1342

CGTACTATAGCOAGtOGAOCACGOGTOOGGTGGTAOG I OGTGCGOGATCG 
AGATTCCGOGCTATGOOGOOCAAG G 1 1 1 1 1 CO I cagtactgggatatccc 
CGATOGCAOCGATTGOCACXXjCAAAGCCTACAGGACCACCAGTATT^ 
GCG 1 COC 1 GGOCTGACCXnCGCTGCCTACAGAGTCACACTCAATCXriXXXj 
GGCACC I ICC 1 1 G AAOQAOTGGCTAACGTTGGACAATACACCTTCACTOC 
AGCTGtrrCTCGGGG UJO IO 1 1 I OG OCTCACCACCTGCATC>GOGCCCATG 
TCCGCGAGAAGCCCGACGAOCCCCTT1AACTAL II CC I ICO I G GOCTGOGC 
CGAAGGCTGACTCTGGGAGCACGCACGCACAACTAOGGGATTGGOG00GA 
rrACTTTGGCATACOOGOC rTCCTGGTCAAGAATGQCCGOC 



CCGGGGACCTCAGCCTGCAAAATGCCTCOWJAAAT^ 
GTGCGAAAAAAAAAAAAAAAAAAGGGCCGG 

> Sequence 1343 

CGTCTTATCGAGTCGACCCACGCGTXXXJAATCK^GTGAAAGTG^ 
CTGAOC 1 1 CAAGaCTAGATCATCAAAGGTGCTACAGC M UGC 1 1 IPGCT 

GAGAAC1TCTATGCTAOCTGGAGAGGCCTTCTATAGATATTTCAGTCAAC 
AGGOCrAGTTAAAGTTTCAGOCAGCXTrCAAOCACOCAACATCTGGGT^ 
TGAACXXrrCAAATGATTCCAGCTCOCAGCC 1 1 lOAOl CI ICCA GTTOOQO 
TCCCAGTCATTGAAACAGAGTCAAGCTGCOCOCGCTGTGATTTATCTGA^ 
TTrCTGACOCACTOGGAGCATAATAAATGATIG I'l'l I ATGTTNAAAAAAA 
AAAAAAATAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 

> Sequence 1344 

TGTACTATAGGGAGTCXiACCCACXXXrrCCGTCCAGAA 1 1 ICI A GAGTGGG 
TGGG<lATGATTCCAGTCAATGGGGGAC»GCC«mrrCT 

aggagago agogagatoaogtca i ron ioicattgaoici rcrcrcAGA 

ATCAGCGAGCCCAGCTGTAGGGTGGGGGGCAGGCTXXXXATGGCAGGGTC 
CTTGGGGTA COCCI IT1 CCTCTC A GCCCCTCCCTGTCT GC GGCCTCTCCA 
CCTCTCACCCACTCTCTCCTAATtXX^ 

GGTCATCCCAAACCC I I CIO 1 1 ATTT ATT A QGGCTGTGGG AA GGO 1 1 1 11 
CCI ICI 1 1 1 IC I lO GAACACTGC CCC I PI IC I ICA CACTGOCCOOCATOC 

cttaaactcatacagattgtccatcatggggggcatggg 

GGC 1 1 CJC 1 1 AAOOGCGGGAGGCCAAGGCAATTGGTAAAGGAAGCACrTGC 
coocci i i crooooo ci ICI 1A AI C I 1 1 A ATAAAAAACOCGO CI ICI I AT 
TTTTT AAAAAAAAA CC ' 1 111 I O I IACAAAAAAAAAAAAAAAAAAAAOOOC 
CGC CCCCT TTCACTTATCTTAOAGAA AAAAACAT TTOCAAO 
GAACCTTGAAOCATAAAAGAAATCCA'I Til IPG 1 1 U I AACCTCTTATTTG 
CAC1T AATAAGG GTTO CAAAa TAACAATA I CC 1 1 CO C AATTTTCCATATA 
AGOCA 1 1 1 1 1 1 1 ACTGOCTCT 
>Sequence 1345

CCKXXXXrrCTACTGOOGCCAOG<UOCCCAGGTC^ TC fOCTCAOOCTT 
CACCAAGGGCCCA 1 COO 1 C 1 1 CCCCCTGCCACCC I CC 1 OCAAGAGCAOCT 
CTG<XXXK^CAGOOOCOCIGGGCI G OCTG G TC A AGOACTACI fCDOOG AA 
OOGGTGAOG O I G 1 OG I OGAACTCAGGCGCCCTGACXJIGCGOCGTGCAGAC 
"T A CA GQCCTT A QGACTTT AC 11 CC 1 1 AACAGOGTGO 






OTTOTOOOCAAATT 
TAAOCTCTGGGG 



GAATTAOCAGOCCATt 
TTTGAOCAAGATGATACAT1 
GGGCCOGCAGT Ci 1 IULTI nC OOCAAA 
> Sequence 1346 

GGTACTAGATTGGG1 G I OTP I ATTAAGACAAAOACAOCAOTCAAAOATAO 
TTCCAAAA C I I I I UAACAGAAGACTOGATGAATACTOTTTACTOAOATOQ 
GGAACACTTAOAGAAAAATGCATITGGAAAGCAGAAATACXiATCAAGACT 


tdcatttttcatacattaagctrcgtatgtrtaattcatagctatataga 

ggtattaaattggcagoacaaaatcatagctaoagataaaaatttagagt 

tcaccagtgtaaagatoatattrgatggcacaggatggac 1 1 ici i ciu g 

gatttgagtatacatagaggaaagatgtgaggattgagcaccaggggact 

t caaca ttgacagoctcaacagaggagaattcocaagaggatgaggttoc 

aoctttaggaoogocaaagaagacttoocagacaagtacctggccgggc^ 

gccgctaaaggg 

>Sequence 1347

GGTACTTTTAACTA TT I G 1 1 IC I r CTACCATAATTG G I I PGTTOTOACTT 

TATCTACCTAOAGTAATTTTGGCAATTTQCA 1 1 1 1 1 C I ca aaataotttt . 

TGAATT TAT TOTOTAAAArrOCTC AAAA TACTCAAm 

IGT n I ACTATTTOOOCCTTG TCATTTAAA 1 1 1 1 1 G I A TTI OTGCTTOCT - 

COCGCGT 

GGTAC AAATTACTCTGTAATA I ro c nTCI A TTAAAAGG G rOTOGTTTTl 
TTTTTTG I IOI I llll I III 1 1 1 I AGCTAGTCCAGTG G 1 C 1 1 1 1 I GATGT 

tggttcagctta g rocnci caaccctggaacaa CCCGTA GA CCCACCTG 
gggagctcttaaaattatcagtgcctaoooc^cct 
taaatoctota oioi 1 1 1 ia agocaccocag g tcattgtaatgtacctgc 
taaaggg 

>Sequence 1349

OOCTTAGOOQOCGCXXOGOCAGGTAC1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IPG 

qui i ill i i i i n 1 11 nun n u mi m i n hi ii 1 1 i n m ■ 

I HI 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 AAAAAAAGGGGTAACA TTAA 1 1 1 1 1 1 1 1 1 • 
TCCCCCCCAATGGGCACGGGGTTT AATTCCCAAATTTTTAATTTTrGGGA 
AAAAAAAAAAAATACCATTTTTAAAAACCCCAGGGGGG Ol 1 11 1 1 1 1 1 1 A 
AA AAAAC TTGTT AAACCT ATTT TTTTG GGGGO GGTTAA AA T 1 1 1 1 1 1 1 I I - 
I IOOOCCAAAAAAAATTXCCCCCI 1 1 rT TTCCCTTTTTAAAAAACGGAAG 
TGOGGOCTGCTTTTTAATTCACCCTTTT AAAAAAAA TTCTGGAGGGTTTC ' 
CCAAI riTriTTAAGGAAATTTCCCCTGGAAATTTTTTAAAAAAGGGAAA 
A AAAAAAAGG TTTTAI M II I IP I AGGGCOC lCC ACCX^GTroGTGCGAAA 

AAAAAAA II 1 1 1 II I A AAATCTTTTAAAA AI IOOIOOI 1 1 IU AAATTTAA 
CAAACOOTTTOTTAGCCCCCI I O I AAA 1 101 1 1 CCA AACCCAAAAAAAGG 
1 1 ICICCCCCGT AI I IC I 1 lOGCCGGAACCACTTAAGGGGTATATTCCOC 
^TCTGGOGGOGTTTTATATAAAATTCAATTGTTAACACAATTTCGGAAA 
ATAGGAAATAATTG 
>Sequence 1350

GGTACT I CO I CTTC1 AATTTCAAAAATATAACTTAAAAATGTAAATATTC 
TATA TGAATTT AAA TATAATTCTGT AAA TGTGTGTAGGTCTCACTGTAAC 
AACTATTTGTTACTATAATAAAACTATAATATTGATGTCAGGAATXAGGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTAOCTGKX^CGGGOGGOCAA 
GGG 

>Sequence 1351

ACAAOT ATTATOTATCCATAAAAATTAAAAAATCTTTAAAAATOCATATG 
CCGGTCAGTAtXiAAAAGAAAAGAGAACCAAGAGAGCTG 
CAGCI IOCI I lAAACATGAGATOCAOCn^UnGATCATGCGGGGOAAAAG 
O O00CG C A 1 1 PC I OOAACTCCTAATATTTAAAAAGATGATG^AAACTTGA 
AA] 1 1 1 ATATTTAA ICI I C I CATTTTTAAGTG TTGOCAATOTATTGAAOA 
C I I IgAACOCTCTCroCTGGTCAAACAACATOTATCTOTA^ 
A CTtX^CA OCroOCCAOTTTOAAAACTOA A I CCTOCTAOOCTTAATTTAA 
Al 1 1 1 1 1 IAAATTTAA I I IOCI 1 1 PA TT OC T GC ACTOCTGC T C A AAAAAA 
I CI rCAATG GCIOCOCACf GT C TISC AAOOTAAAATOCAAA Cn IPICA CC 
AG I C C n CAA GCAACOCATGACTA TATOCKGAOCCCAAACCATATTrCTA 
CCTT AA T A TCAG I LI UZATCTTTTXACCGCACCAGAATGATAGTTGAAAT 

gtai — 
>Sequence 1352

ggta ci 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 111 1 1 1 1 1 1 i i 1 1 acaottatactctoo 
aaagttattcaaatttcaaatttatttaca p i p 1 1 1 o aaaagcacacaac 
agaagatcttcatttatgcaacaagtcaatcatttgcagtatgtatggaa 

A^TAAAAATCTAAGGTAAGTCAAACATACAAACTCTAO CIC I IOCI I ICI 
CCATTAG AATA TACA C ATTGGAAATCT AAGTrCCAAAGAGT TtXrTCT CT A 

TGCTTACA 
>Sequence 1353

gttccaagttataggaggttitccaattggcagttcgttoaaaga 

tcttaagaoctggaatcaatacaagggagtgtgtctcggttaaaataaao 

go^ttgtggagatcaaggttcrtattaggcagatgaagcctccaggtagc 

AGGCTTCAGAGaGAATAGATTGTAAATGI I ICI I ATCAGACTT AAAAAGG 
TCCCAGACTCCTAGTTAATTTTCTAGTGGATCAGX2AAAAAGACCTGGACA 
GGGAAGAGGG 
> Sequence 1354 

ggta ci 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 room 1 1 1 1 1 ui i rn i n rn r m 
Minn m in 1 1 ii ri iMiiiiMiii rrni mi n 1 1 1 1 1 1 1 1 

TTTTGAAAAATACAAAGAATTAATAAAAA A 1 1 1 I I I 1 1 1 1 A AAAAATTTT 
TTAGGAAAGGGGGGAAAAAAAAAAAATT AAAA AAA AAA 1 1 1 1 1 1 1 H I 1 I 
TTTT AAAAAAATTTTTA TTTGGGGGGGGGGTA TAAAAAG AAAAATTTTAA 
GAAAATGGGG 
>Sequence 1355

GGTACAGAACCTGCCTGAGTATGACCTCTCCACCTTATAGTTTATGAATG 
ICI I O I I I O I OA AAGTG ACTATA ACOCAAA C I 1 1 1 1 1 1 1 1 1 1 A AAGAGGA 
TTTGGAAGTTGTATGGA T 1 1 1 1 IO 1 1 A I CI 1 CA CTTTACTOCATAGGAAA 
CAATCTACCTCATCATTTAAAATGACATGGGTOTCG O 1 1 1 1 O TAO ATCTT 
TOP 1 1 1 1 1 1 IOI CAGGTTTAATTTCAGTT AACAAAATGTAAAACATGACA 
TTCCCTGCAGATArT G rTGrATAOCAGTATG PI I IC I IC I C 1 1 IC I 1 1A A 
A 1 0 1 1 1 1 I P GCCATCAAGTAGN 
>Sequence 1356

CA Ciiuii i iHii i ni ii imi i o A oiiiiiin i iiiiini i n 

1 1 1 1 1 1 1 1 1 1 1 1 I III It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lA CCCCAAA 
AAAAAAAI 1 1 1 II 1 1 A CCCAAAATTTCCCCTTATOCCOCTTTOOOOCTTT 
TTAAAAAOCCOCCAG O 1 1111 1 11 1 IA ACCCCCCI IC CC C CGTTAAGCCC 
OCTAAAACCCTTTCCOCTGGOOO COC I I I ! I ITA AAAAAAGCOO0O0D0 C 
CCOCCCCAAAAAAAAAA I 1 1 1 1 1 1 1 1 1 1 1CCCA AAAGGCCCTT 
>Sequence 1357

AC AACACI TTAAAAAGTGAATTTTAAGCTATOTGAATATCTCAATAAAAA 
CATTTTTTAAATAAAAACAATTX30CAAAGGCCTGGAAATTCAGGAACATA 
A TTCAAAA TAATTTATGGATCAAAAAA TAAATGA T A T AAA GA TCTGAGAA 
CTACAATGTAAAAATATAGAAAAAAGTCATAACAATATTAGAAAAAAATT 
TGAGCTGGATAACAAAAATAGTACC 



GGT ACTTACATGGAAATAAGTGTT AAGAAAAGGA 



GGTACAAAGAAAAAGCTAAGOAAa3GTATGTATATrAATCXX^T TATTA A 

AAA TGTAAAA AGCCAAAAGCAAGATACAOGCACATATOTGOCAAA^TATG 

TA I 1 1 1 1 1 1 1 ICC I PGAACAAATCACAAGAAATGTAATAACAGTTACAGT 

GAGAGX2AGOCTTTGACAICICI I ICI AAACTATTTGATATCATTTGTATA 

CTAAOGATCT 

>Sequence 1360

GGT AC GOGGGAT AGGCC I ICI IP I IATTATTTCAAAGAAAGAGACTTCAC 
GTTTTATCAGTGGGGTT3GATTXrTAGGTTGAGCAGAACT AATOGGAGAGGT 
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GCTGGCT AG AGAAAGTTAAAAA 1 1 ICI GI I AGCTTTGCATTGAGCTTTTT 
AATATCAI 1 Id 1 CATT7CACCAGTTC>G AGO ATTCOOGGTGATCGCCAC 
AA CAGAAA TGATGG AA TATAGGCCAAA TGTTA CA AA T A GA TAAAATTACC 
TG ACCAGTGAAGTG I G 1 1 CCTCAGTCGOCATGOAGCTCAGATTGGACTCC 
CAAAAAAAAAAAAAAAAAAAGTGN 

GGTACT ATAGCTTCA G T G 1 G O 1 1 i AGTAAACTTAGCCT AGG AGGOCAAGA 
TGTCT<XCTAAAACTTAGTCTCPCTCCTATTrACTTTG7TT ATAAGACTG 
TC ACCTAACTTOOCATGGCCAATICAATCGACTAGGTTA tctttactcca 
ATGGAOOCAGGOC rriTOO CAGTCAATCCATXjTOCAAOCCTTCATCTOCA 
GCGTGATCACTCAACTCTTCAACTTGCClXKrTTGC^^ 
CACCAOCATNCTGTGCTTTTCCCCCTAATCGCCCA TG ATGCCOCCCAGTAA 
AAATAAACTAAAOOCACTTGAAGTGCC 
>Sequence 1362

CGTACATGAAA ATGGCTGTTTTTCCO CACATTAGTCAGCTCTGGATTTrG 

CATGTGTG GGGC I 1 1 J 1 1 ill I » rTTGATAGTTATTTGTTTTTTATTITA 

AA AATTTATTITGCCA ACCCACTAGAG AACAGCTGAGCA T C I TCICATGT 

A TTT A TTGOOCA TTTGCATTTCTOCTGCTTA TTGGOCATG T ATTT ATCTG 

^^^^*00GTCTGCTGTGAAATGTCTTACATTATTTXiOCCAT7TTTCTA 

GTGATAAAACACTGAAGCACAtTTTTAAAGACTTCTCATCATTTTTATTG 

TCAGAT 

>Sequence 1363

GGTACATTTAAAAGGTGATGCTAATACTCT AAAATGTATAAGATATAOAT 
GTA AAAAGCATTGTAA ATTGTATACTGCAGTGTCGTCT ACATGGCATTGG 
ACAGGACATAATGTACAACATAAAAGTGCAACTKnx^CACTTTACATAT 
CG A TG A GTG AA TCGGCAACTACGACCAA 7TTTTGTCTCAAGTCAAAA T A C 
CAAG<>CTATTGCA(^GTCTACTGGATTTAKrrATATATGACATATCTGG 
ATACTXjCATGCACCACATTATTGGCGGCCTTTTAGCTAAGCTGTAGAGTG 
CT ATTGTGCGACCOCTTAGTGATACT ATTCCTGGT 
> Sequence 1364 

GGTAC TCAAA CTCTTTCAGCCTACTACTGCACACXn'AGGCTATGTOGTAT 
AGCTACCTTG TATATCTGGTCTGTCACTGACTAAAACTTTGCTACA CA GC 
^^*' 1 ^^CCCTACTATTCAGOCTTGAGAAGATGGAAATGCTGTCATTTGCA 
A CAATATGGATGAACCTOGAGGAC^TrAAATrAAGTOAAATAC©<XAGGC 
ACAGAACGACAAGTAACACATAATCTCACTTATATCTAGAATCTAACAAA 
GCTGAACTCATAGATXXIAGACTTAGATCATOCTTACCATGGGCTGTGAGG 
AGGATTATGAGGGACGCAGAG 
>Sequence 1365

GGTAC1 1 1 1 1 11 1 11 1 1 1 II 1 1 1 1 1 1 1 1 1 ACTTT A TTTT ACTTT AA GTTC 
CAGG ATACA TCTXX^GAGTATGCAGGTTTGTTAC^GGTATACATGTOCCA 
TOOTOOI 1 1 GCTGCACCCATCAACCCATCACCTAGGTTTTAAGOOCCACA 
TGCATTAGGTA 1 I IU ) 1 C 1 AATGC I C 1 OCX I CCCCTTAACAGCAGTTTTT 
CTA TAGGTCAAAACAAATTTGGG AA CCAG AATTGTCT ACTGTCTTT ATA T 
AAATGATCATTACXM,TTTGGGAGGAG GT 1 1 1 i 1 1 1 U GTCACATTTGATAT 
GATTAGTCACTAAAGCATGATCT 
>Sequence 1366

M^CACTCCCOOACTCCTCCATtATTTCA<TtAATACT<X3CTGOCCJAGTT 
TTTCAGGATACATXZATGCAAATAAO 1 1 C HI lATTTTTCAAAIIAJJTTA 
1 1 va. 1 AAAGTATCTT TAA 1 1 1 1 11. II 11 1 Uti 1 ' I ATACAGCTTATAGAATA 
AACAAGTCACAAGAATCTTCA1 1 IUI 1 1 C I AAAGTATATAATTCTACAAA 
AG TTXj ii tl ACTCAATGTG AATTAAAATTTCO^CnTTAAAAAAATAAAA 
AAA TTTT AAAAAGTAAAAAAAA 
>Sequence 1367

ACAATATA TT ATGAAGCATC ACCACTTTATTTTGAAACTTA GCAA TTGTA 
T I OCT GGGGTTT ATTCTATCTOT AGCATGTCACTGA TTATTTCAGTTA GT 
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TTTATAATGA 1 1 1 1 1 AAAAAACATATCI ATTTGGAATAAGATACAGCAAC 

AATCATTGCTATTGACITGTTCAAOCCCTTAGTTACACTGTATGATCAAC 

ATATAACAAOATACAGTXXiAATGGCCCATACAGTATATTACTGTTGTGTG 

ATGATTGGCTTTCGAAC^OrrTG^TTrTGAAATGCTTTC 

TGACATGGAACAAG 

>Sequence 1368

PP^^ ^CCGGCCGCOOGGGCAOGTACATATCATtXXXKXAATOCACAATA 
^•imATCACAATCAALI ITTICn ITfl ATCCCTATTTCAATG AGCAGTC 
AGTCTCAA G A GGTTACTOCA7TTCAGTTCTAACTAGACATTTGTACTTGT 
OATCA CACTACOGGAATC TC I G I GGTATATACCTGGGGCCATTCTAGGCT 
111 tlAAGTGACTTTTGGAAATCAAC Cl 1I1H ATTTGGGGGGGAGGATG 
GGAAAAAGAGCTGAGAGTTTATGCTGAAATGGAT>TrATAGAATTTTTGGA 
AATCTAi 1 1 1 1 AGTGTTTGTTCGm 1 1 1 1 AACTGGTCATTCCTT 

> Sequence 1369 

OGTACAGCTTICrcTCCCTCACGTTrCAAGCTTAATGC^ 

CA1CI 1 1 LC ACA TCT A TTTCT ACT ACA TGCTGCT CTC 1 1 rCTCTATCTTA 

CATCTCXpCAGAATGTTTTATTTCAACAAATTGCTAATCTGTGCCAGCCAT 

TGTTATTAGCAAAATGATAAG<XCTGCATGTAGCAAAGTTCC?TGCCTTCA 

CTGCATATGCATTAACAGCTCTOATTAGTOCACTTAAAAAOCATroTTOC 

CGTCATGCAG AACTCCA TTGCCAA GCCCCA CAA CA CCCAGCCA GTAGGTT 

AGCA AGCTCCCTGAA GCAA GGT AAA CATGTTG 

> Sequence 1370 
GGTACTT 






— liiiiiiii i ni fTTTATm mm 1 1 

i A ^^^OG^j AAAAAAAAAAA^^^^^^^^^ 



hi urn i i 
mmum 



"~^™™~«nuVJU-LJ l A#VlAAAAUlJUlll.l I iTATOGCA 

CAA T AA A AAAACC AA AGGGGGGTCT AAA GGG TA A TCGGGCA TTTTTTTTG 
GGOGGCCGGGAAACTTTAAAATCTATTTAAAAGCCGGGAAAAAAGTTCTT 
TTTTA ^^AAAAAAACCCCCTCAAAAA TTCCCCGGATCCCGGTTAAAAAA 
CGGGGGGGATGGAAA 

> Sequence 1371 

ACTGTCGTTTCCTTCCTACCTCG^ 

TCGAGT GTGA ACCTTA CI 111 1 1 C CCGTTCTCCTCAAGGCAGTTTG AACG 
ACAC^GGTTTGOAAGGAATAGTTAACTCTCCAGTATTATrGGAACATCTG 
G ACACCACCAACAAAAAATCTTA G AAAA GGGTCATTTAAGGCCTATAAAA 
AGTGCCACCTTTDCCAGAATrAATTCAGAGAGAAAAATCTTATCTOCCTC 
CTGGCAGCTACAGCGCAG AAAGT ACC 

>Sequence 1372

GGTACTTU 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IU G- 1 1 1 1 1 1 1 1 1 1 U IU 1 1 1 1 n | 
• 1 1 ■ 1 1 1 i 1 1 lATAI 1 1 I I IT i'l l 1 1 1 rn rri AAAAAACCCATGTACCC 
GGGCA CAGAA GOTCCAGCATCClTTGAAACATGAGTrrTrACCAACAAAA 
GCAAACCTTT ACCCCAACACCTCATCTTTAACAGCAGG AAGGGAAACAAC 
CCAACCCOCCTTCACCTAAGAAAATA I I rTTTCCCAAAACCAAOQAOOCA 
7 OCCCCTCAAG GGTT0CACAACCTTGAAACACAAAGATTCCACAAC00GT 
W 1 ICCI I lOCCCTGGCCGACGTTATAAAAG 
>Sequence 1373

GGTACAGCTATTCTCAATXX^TAATTtn'ATAAAATATTTAAAOAAGAATC 
AACACCAGI I C 1 CXJACACTCTCCTCTAGAAGAAGAGGAGGATGGAATACC 
rroCOOCITAATITATOACXXXAATATrACCCIO^ 

AAGATATI G 1 tXXXXIAAAATAAAACTAACGATCATAGATAAATACCTCTT 

ATAAATTTAGATGCANAATCTTAAGCAHAATATATTAiai^KAATGAATTC 

AACAATGAATAAAACATATTATACACCAAGTGCGATTTATTCTAGCTATO 

CAAGACTAC^TTGACATTTGAAATTGATTA 

>Sequence 1374

CCCTTTGCGGCCXKXXXXjGCAGGTACTOGGAATACAG<3CA7TJAGCCACCG 
CACCCCGCCAGAAATTATAAATCTAACCAGGATTCCCAAACCTACAATAC 
aatggaaatatcata 1 rcfcrci I A TAOO O 1 1 1 1 IO GGTTTTAACCCAAT 
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CTA TTTT AAAAGOCGGCAOATTCAAGCGATTATGCG TTTATATGA GTGG A 
TTCTCTGTTTGAATACTGAATCAAATCnTCACTGGG A G AACAACGCAACT 
AATTTGCAAAAGGCATACTCTATGCCTTACATTAAATGTGGA I C C 1C I I C 
TAAAACTAGAATAAGCATCAGTTCAGTCACXXAACGTGGGAAGAAAGAAT 
AGAGGTGGCAAGAAATGAAGCTCAAO 1 1 IUAAACAAAAOAAGAAA 
CTTATTTAAAGAA HI IC1 U AGTTACT ACA T AACAACCA 1 1 C 1 1 I T AATG 
OCAGTTT AGCTAGTCCCAGGGACCAAAAAAACAAAACCAAAACCAAAAAC 
CAAAACGGA TC CICI 1 1 1 1 UGGCCCTA TTA TTGGGGCTT AA TCCT A A ACT 
GGAAATTCTATCCTTTAAGAAAATGAATTTGGGTAATTA 1 1 1 1CCCA ATG 
QUI I OGTTTTITAAAG A AAGG TTAAAATAAACCAATTTAGC LG 1 11111 
TCTAGGGAAAATAAATTCCmAACCTCACCCCITT^ 
GTCAAAAATCACAACCCGGA 1 1 1 1 ICC'II ATTTTTTAGAAACTTTTCCCC 
CATTTT 
>Sequence 1375

AOGCGGGCGATATGATTCKXXXXXXjAATCGTGGTT C IUI 1 1C CTCCTTG 
GCTOTCTGA AG ATAGATCGCCATCA TG AA CG ACACCGT AA CTA TCCGCAC 
TAGAAAGTrCATGACCAACCGACTACTTCAGAGGAAACAAATGC^TCATTG 
AATXMCCTTCACCCCX3CG AAGOC TACAGTGCCTAACACAOAAATACGGAA 
AAAACT AGCCATAATGTAC 1 1 1 IUI IX XTTTACCACCCTGAGGOCCCATTT 
TTTOCCCACT AACCXXJTCGTTACTAGTGGAGACTAGCGACX3ATTCCAAGT 
TGTCA TATTACTCG ATATATCTTA 

act i ic i i mi i c n 1 1 i c i 1 1 lo ri A i n 1 1 1 1 rmrnu AGACAGo 

GTCTCACTCTTrrCACCCAC<>CTXX}AGACAGAOC^ 

AACAACAAATAAATAAACAAAAATGOXAACAAGGAAGAGAACGGGAAGT 
CATA GGCAA TCTCATTCATG AA CATA GA TTAAAAA CACCTG AA GT AT A T A 
CATACOCACACCCCC<^CATXlAATACATATGAGATGTGTAAATGTGAATA 
CTTACA TGT A TGTA TA TG AAAGCAAACCAAATCA AACAATGTTAAATA AA 
AAT AACACA TCA TGA CTGA CTXKK^ ATTTOTCCCAAG AATGC AAAGCTACT 
TG A G AA AATCTATTAA TTCATCAATTTAATACTTTAAAAG AGGGGAAA AA 
GTTTATGAACATCTCAACAGATGTAGTAAAAGTATCTXiATAAATGCTCTT 
AGGGTGGATAAAAACTTCCTAAGTTATAACAGAAGGGAAGTrrTTTAAGT 
^^^^^^^^OTTTACCAAAACAOCTTCAGCAAACATTACACTTACTGAT 
GAAAC CCTG A AAGCl I ICI I UAAAATAGAAACAGGGGAGGG AGCCCACTT 
TTACCACTTCATTAGO ATTACACAACAAA 1 ICI I CCAAGCCTTGGAGGAC 
AGGAAAAGAACTTAAGACC1TAAATGGAAAGAAACCAAACTACTTTTTAT 
ATTGTGAATGGAGTAAACAAG 
>Sequence 1377 

GGT ACCATATAAAAACATTOCAGTTjICAACaGCACTTT aaattttcacag 
TAATATATGAAAGAACAGACTTTACAC I 111 1 1 lO CACAGAATTATCTTT 
GCTATCTTTT AAAATACTTAAOAAATAGAAACAAATTTAAGAGAGTTTTC 
ACCTTT AAAATTT ATTACATAAGCTATACACACAAAATGAAA TCCT AGTT 
ATAAAAGATGCA TCTAG AAGAAT AATTT AT AA TAAA CCAACAAAAATGAG 
AA I u lo > ATCTCCA.GGAATATAAA TAT ATTT AAATGTTCTt^GTGACTXKji 
CA1 1GCI 1 1 ATOCATTACATAAGATAGTATGT 
>Sequence 1378

GGT ACACAGGOGCrrTXaACTTTTTCAAC I ICCI I ItXJI I IGi PGGAOTCAA 
v AAAGAAC CACl 1 XJ ICCTTC TAAAAGCTGTXIAAGGTOATTTAAGGGCOCAG 
GTCAGCCA CIGH IGl 1 1ACAAAATCAGGTAACTAACTGCATACACTTTT 
TCTCTTTCCATGACATCAAGACTTTOT 

CAGAAGCTACTGCGATGCXXXXKXlAGTTAGCCCXXnGGTAATA 
ACTTCCAA 1 1 1 CI AGCCATACGCTCAGCrCATCCATG<XTCAQAAGTGCA 
TCTXj GAOAOAACAOOTTTCTAAXK^TAAAAGATCAAAGAGCAGTTGGACT 
TTTTAAAAATTCAGCAAAG J GO 1 1 OCC I CTCTTAGGGACAGTCAAAACCA 
AGTCACTTAGGTAGTACXriGCCCCGGCGGCCGCTAAGGG 
>Sequence 1379
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A «**°<?^ 

AAlrTAOC^^AA^AGTroAATGATATCCTAA 
>Sequence 1380

ACAGTAATTrTGGAAACCTXnTTOA 

GTTCTTATA1C1UC1 1 IJ IGAA7CGATCTATCGTAATGTCGTATTTTGGCT 
GAAGTATGTt^GACAATACTACOTACAAAGATATGTAT>mCAAAAGG 
A AATACATATCATA A AGTTTGACAA AGCCA GTG A GTG ATACTAAAGTTGT 
CACGATG GATGGT GTCTATCTGGAGAGCTGGCACKjGAACAGCCAAGCCCC 
TTGAOCGC1UCI IUCAOAOCAAOAACTATGATTOA1U IUI 1CITACTCAC 
TTGTAA ATTGCGTXSA TTGTGQCTCTATTTCAATAACTTTTCAGAAG^ 
GAG ATATTCTTCTTTGATACTACATCAG AACTTTGGC AAGT ACTTCAAGT 
AGTAAATTTCGG AATA TTTCTTTTAAATAATGTGG 
>Sequence 1381

ACAAGCCATTGAATAAGCCTCTTC C TT I 11 n iOCTCAAACATTCCACAT 
CXnTXrrGGATTCCCCTGCA l IG1 1 IG1 1 1 1 A TATAACATTTGATA TTTGT 
TGTAGCTTGTATAT G AACA TA A 1 1 1 I C I HA CAGGTAGTCACTGTTC IU1 
CCAGT ATGACC CAGG 11 ICI 1 OACTCTGAGTAATGCACCTTCTATAACT A 
iCTAAAl 1 1 i ATT GAAGC I 11 1 ItKUTTATCAGTATGCTGACTTTTCAC 
GATTGGC 1 OG i ^CATGTTTAG ACTTAAATGTCATATCCTTXATGTCTCAA 
AGCCAAAATAGTAACATCTX1ATCTCAGAACAGAGCTGTGACCACATGCCA 
A TA TA TGTC TCACAAA G TCT A CA T ATGTTACA TTCCTTGG AAGTCTCCTT 
AAAIOIl 1 CACAAAATGTCAACAAG C 1 1 GI 1 1 I G I 'I A TTGATATTTCOGA 
GATTGGGCACA7TTAAGACAGTAAAC<KX)AAAGGTC<iTGAAGATGCTATA 
AGAAGATGKXnGTATCTTGAGAATITlAAAAATGAAAATCrrGA^ 
?^^J^ T ^ OTTATATAT ^°^<^ TC TCTAGGAGCX^TTTA 
AATTCATAACAGJATtrrGCCCCTTAAGCXrrTTr 

AAGAGATACATTTTCGGAATTTtLACAAAAGGGATTTGAAATTAGGTCCTG 

TGGAATATTAATGG 

>Sequence 1382

AC £ A j^IIE A J T ^ < ^ GAAATACATArc 

AATCCCATCTCT A CAAAA CA TAAAAAAAAAAAA A TTAGTCGGGCA TGGTG 

GTGCACACCTGT AATCCCAGCTTGTCAGGAGGCroAAGTXJGGAGGATCAC 

CT^GCCCAGGGAGGTCAAGGATOCAGTGAGCCATGGTCTCA 

CTCT AGCCTGGGTG ACAGAATGAGACCXXXntTTCAAAAAAAAAG ACGAAG 

TCGATAATCTGAATAGCCCTATATCTATAGAAACTTAATAGTCKrrGGGAG 

ATATAGCTATTATTATCCTCATTTTACAGATOGGAAAATTGAGGCTCAAA 

GAAGAAAAGTCTATTGCTCAAGGTCATGTGGCTAGAATATGGCAGAACCA 

TOATTCAC^T CCAG GTCrrCTGAATCTTATTCCA 1 I CI I U I AGCAT 

ACCATGTTGCCTCTAAAGAATC«CAGCTTCTTAATTACTAGAAAAT^ 

CTGGCCAATTTAAATGTX1AAATTTAACCCA iTITTTD I A AGCACTATGGT 

OGGGTTTTAG AAGCATAATTTAATGGCTTTGG AATACAGGT li 1 1 I GO 1 1 1 

GGGAAAGGAAAAAAAATC 1 1 1 1 1 1 1 CCAA CTTGG ACTTCG ACCAAACTGG 

GAAAAAAAAAACTTAAAATGTTGCrrC<nTTGTACC^^ 

CCATATGGGCGG A C 1 PCTCTA TTTOO 

GGTAc ITTO TOn ^TTGGTATCCAAAATTAGGACTCTGACtATl CTTGTC1 

A TI? A ^^' ITmAOTAOaAAA ^ oaACA ^ 

AATAGTTATTACATGACAAAATATGTAGATCTGATTTTTAG A A a mm * at 
J A< ^ Tt - : '^ AAAA ^ AA ^ ^^GTXXKjAAAAGCAGTAAAAAGTTCTICTTGAA 
TAT1 OC 1 GTTGICATCC AAAGTATTCTTAT I ICIT TTAGGTGAAAAATTT 
OCATTAc rCTTTTGl GATATTCTCAAAAGAAAGTTT AGGATTTTACAGTG 
JTCTGAA>^ACTGAATCTTAATTCAATATT^ 

TrCCTAAAATATAATCCAAAGTAGTTT AAAAAGCA GAT ATTT AAA CCAAA 
GTATAAAGA 1 C 1 1 1 1 1 1 C I GGAGCATGCTA TATTT AACA GTTTTTTCCT A 
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ACATGAAATA ITTGCTTCT1 A AAOCAAAC 
ATCTAAAAAATTT 



AAT1 

TTTAATTCTATTAC 
AAGTTAAAT1 

TAAAGGGGGAATTTCAAOCAATGTGGGO 
OCGTTCTAGATGACCCCGGTCTOG 



OOCrroACOGCOOOOOOOGCCAOOTAOCItaCICATCTC ATOCT IG OCTC 

AGOCCTGC TGCTTAGTATTTACTATTTATTTTACTAAGATA 1 1 rCTCTCI 

CTATCATCCTCAOACTTCAACrOATCTCG C I rOTC A TI 1 1 1 CA GT AATAA 

AAAAAGTTACniAAlTTAATrGTTCAATATCATGCATATCTCATTCATTA 

OOATTTATCACAAAOCAAAOATTTAAATTOCCTAOA TTTCTOG1 Id I TC 

TCTTCCTAAQTTOCCAOOOAC 1 OL I 1 1 CA AAT ACT ATTTTCT AAATTTCA 

CCAAAGGAGCAACGAGGATAAAACAACACTCCATAAAGGCCTCTTGGGAT 

CTCAGAAATCTAAAATCTAAAACAAAACAGACACAGAGCAAGACAATAAC 

ATC ACAAGCTAAAA OCCACACAAATTTAAAATTAOCAACATOCI rCTTCG 

ACTAAGACACTAAATATCAGOCnTGCAGCAAGACAGCTCrOAGCAGCTOT 

GGGCAAAGAGGTAAACCAGTGGGGGTGCAAGGAGACTGTCTGCAGCTTGG 

GGCAGAAATGGTOGGATYXAACTTGTGAAATGCITCATGTTTTACAAAAC 

CAAAAG7XIAGGTAGCAACAAACTTATTGTATGTCAAATCAATAAATGTTA 

CTTTCAAAAA A A AAAAAAAAA A AA AAAGTTCT AAGACCTT A GCTTCA TTC 

AAGA CTCAAAGTGTGTAGGAAAAAAGCTTATTACCC 

>Sequence 1385

TTAT TTTA I I I II 1 1 I I 1 1 I A 1 1 1 HI I H 11 1 IAI H 1 1 1 AAAAAAATT 
TTTTAATTTATTATTGAAATTTGGAAAAAAATTAAAAAAAAATGGGGGGG 
GGGGCAAAAAAAAAAAAAAAATGGTGGGGGGGCCCCAAAAAAAGAAAAAA 
AAAAAAATATAAATGGGGGGCCCCGCX3CAACAAAAAGAAAACTGGGGGGA 
TAAAAAACAGAAAAGGGGAAAAAAAAAAGA 1 1 1 1 1 M I IC CCACGGGOGA 
AAAAAAAAAAAACCCT I TTOGGOCCAGAGAGAATTAAAAAAGAGACACCA 

ta roi oo 1 1 ogcaagattataatacaaaaaaacaaagaaaaga 1 1 rocro 

atcatgaaaccaaaaatatttgagaaaaaaaaaagattatgtggcgccca 

caaatttgaaaaaaaaaatagagaaaaaaaaagaccccocaggagtggag 

aatcattttgaagagaaagaaatacacx^cgtaggcgggagaacccgttaa 

aac^ckmgaacaccaccogcacgaogaggacotggataataaagagaaga 

caaaaactxmtcaaaaggtggaagaaaaaatatgaggagaacaacatgtt 

ccoocaaaaaaaaaagatcctcgcocaaaaaaatatcacaaacagaggac 

ccccaaaaaaaaataaggacaacccgtgggggggttcaaaaaagagaaca 

tcaccaaatattggggcocgocacoccgaoctattaa 

>Sequence 1316

cccitgagcggccgcccgggcaggtacgaaaocagtcatag^cagtatgt . 
AAACAAATOAOTOCAOCTQTOT nXAATAAAOCTTTATTTACAAAAAOOG 
GOlATGAGATCXIATTTGGCCTATGGCCX^TCATTT^ 

agaacaaooctgocatg^ 

GQAA CTTTrTTTTT AA l I III 11 I I 1 1 1A AAGA I ILLI 1 1 1 1 1A TTATAT ■ 
TATTAI 1 1 1 1 1 1 1 1 AATAAATAAATATATTAAAAATTAAATATATAATAA 
AATAAATTTATTAAAAAAAATTTTATAAAAAAAAAATGQWNAAAAAAAA 
AATTATATAGOGAAAAAAAAAAAAAGrrTifiOr.AGAAAAAAAATTATATAGA 
AAAAAAAAAATTATAAGAAAGAAAAAAAAAAGGGGGGGAOGAGGAAGAAA 



TTTTTTTAAAAAAAAAGGGOGAAAAOAAAATTGG AACTAA AAAAAAAAAA 

AAAAACACACCCGAAAAAAAAAAAAATGAGCU 1 1 11111 1TTTTGGGOGA 
AAAAAAGAGGAGAAAAAAAAATTGTGAAGAAAAGAAAAATAAATAGATAA 
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AAAAAGAAGGGCCOCGTATAT 
AAAAAAGAGA I 1 1 1 rGTAi 
GGGGGGTAAAATATATAAGAAAAAAOAAGAGTGTGTATAAAGGAGGAAAG 
GCTTTTAAAAAAGGGAGAAGGTGAGGG 
>Sequence I3U

GGTAC1 I IN I 1 1 1 1 1 1 I 1 1 1 1 1 1 ITTTOGTAGTAAAAATATCCCAATCTC 
TT A AA TCTA T AGGTGAAAAATACTTA GTTTCG A A ATGATTOCTTAAAAAG 
CAACAATAAAAATA L ILI Id I CACTTGAAAGAAAAAACCCAAAAOOCAO 
TGTTCATACAAAGTCATGAAOAGAATtTAAATTAAOGTTTTOGTTCCACT 
7TGTCTCAACTTTAACTTTTAACA G T TCI TT A TAGGCTTTTGAAA CCTAC 
TTTGOAGAAOOAAAAAAAGT AGGAATAAC I O n C 1 1 LA AAAATTTTACAA 
AAACAGTTTGACTCAGCTTCAGTTGTTAAA 
>Sequence 1389

CCCTTAGCGTGGTGCGGCCGAGCTA C 1 1 1 1 ! IL 1 1 1 1 1 1 1 1 1UAGACGOA 
GCATCGCILI 1 1 L I OCCAOOCTOGAOTGCAATOOTOCTATCTTGGCTCAC 
TGCAACCTOCACCTtXXXXXnTCAAGCTATTCTCATGTCTCAGCt. I IL LA 
AGTA GCTGGGACTACAGCTGCCTGCCACCATGCTCAGCTAATKTTTGTAT 
TTTTAGTAGAGATGGG GI I ICACCATG1 I' GO TCAGGTTOGGCTCOAACTC 
CTGATCTCAGGTGATOCAOCTGOCTCOOCTICTCAAAATGCroGOATTGC 
AGGCCrGAACCCAOCTtiCCTGGCCGCATGTTGCAT , l 1 1 T7 T TGGGTGTAA 
TACCOACOOTATAATTOCCAOQTATO Cn 1 IOTQ QACAAA CI IL I 1 1 1 1 1 
GGGOGAAAA AAG OI I rOCTTCAA ICTI 1 1 ICACTI 1 ICG rO C A OTTOOGG 
GTTA TCC CIO 1 1 1 1 1 1 I ' m OG A CT ITT CC TT OC TT G GTACATGGGGAGT 
FTTTCn GAGGGCTTTAAAGOCAAGTICITGGAAAAATOGOGGGTAGGGGA 

acatcaaatccocttttqqg g remrrm atacccaaacctttacccoct 
aacaccttccggggtttgaaaatggaaaagataggggtttttocctaaaa 
atgaaccgcgtttaaaacacttt 



TTTC 

taattggcagcaataaaaacaattagagcggccccg 
gggctttataaccaccaacacct 

> Sequence 1390 

CCCTTCGAGCGGCCG TCOGGGCA GGT A CTCTCAAAA GCT AGGGCTXiATG A 
CTGAOCAACTACAGAGCCTCACTCTC1 1 IC1 ACAGACAAAACTAAAGGAG 
AAOACTOAACAAQAOAOCC I ffCTOC lO AGTACCCTTGCCAAGATGTCTGC 
ATATG LI 1 IUCLOALI I O I L ' I A CTOAGTT AG ACAAACTTQ AGA HOI 1 11 
OCTTTC1 1 AACACAGATTTGAAATGTAGGAGACAAATGCTAGOGGATAAA 
ATGACATATCTT^AATATGAGTACTGTGGCAGTCTCACATTjGAOGTCA 
AGTAACaGACTCAATGOCTAAATAGCTGTTTACCA C I OL 1 1 1 r C TO AGCA 
ATATTTAACTTATACAGACTATAAAGGAGCAGGGCAAATGGTGAGGTTTT 
TAAAGGATGTCAAACACTG1 1 1 ' I AACAATACGAAGCCCCATAGTTTAAAA 
GGATATTCATTCCTATTCTCAAACAATATGGAGAATTATCCATCTGAGAA 
TCTATTGGGTCTACAGAOCCTT AGTAAAGGAGGGGTOCTTAAAAAATTTG 
TATGGAAAAOCCAOTTGCCCI I AATTTOCTAACCATTTTTATOGGATGTA 
TGCAGATCGAGGGGCACCTTATGGGGGCCGCATTCCAAAAATAAGGGTAA 
C^OCKIAACOCOCAAATTTGAAAATAGGAAAAACTTGTGGOCGAAGAAAGT 
TTTAAII 1 1 1 0 C 000C I C TAATTTCGCAAGGTOA G 1 1 rCGTDOCTTTOTC 
TTAAAAAAAT 

GGTACTTTGGTrGTOQGTCGAbG O IU I 11 1 AAATAACAGCTTTACAGAGA 
OA TATCA TTCATAATTCATAGOTTTTJPi L 1 1 1 1 1 1 IC1 1 1 1 1 1 IAAQACA 
AAGTTTTALL I JC1U1 CACATTGAAAAATCTOCTATATTCTTGAAGATTC 
T GAGCAA TACATTCAOGACOCAGGTTTGGGATTTGC^TACTATTGGAGAA 
ACTOTTTOCTGAAGATAAACACTTCAAGAATTTGAGAAAATTAAAACTAA 
AAACCCGAAACATGAACACAAAGGCAGAAAAA£ATTGGCCTAACATTGCA 
AAAAATTAC CTTAAAATOCTGGA I 1 1QL1 1 O GAAGAAOAAAATAGTTTTT 
AOGAATGGCATGAACCTTTAAAAO 
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GGGGAACAGCCTTCCAAATGGAGAI 1 1 1 1 1 1 1 ACCTTTAGAAGGATTGAG 
CTT AAAAAAAAA TCTGGGTTTAAAATTAAAOAATTA Ml I I I 1 1 1 ACAGT 
OGGCAAAA AAATTAAAAAAGA ACTTAA OG C 1 1 1 ICCTTCC OGAOAAOOOC 

TTOCA CI 1 1 roGTTAOACCAAAGOOOGTOGACTAAAAA rOTTTTI lA CCC 
TTCAAAAAAAATTTGGGAATTCAAAGGGGCTAAAAGAAAC C 1 1 1 IOG 1 IC 
TTTGGOGGGCTTOCAATCAGGGAAAAAAACTTAI 1 1 1 1 1 1 1 IU lUCCAAA 
AAAGAAAAATTG 



TGGCG GA ATTTAA AAAATOCOCCTCTAATCAGAGOQTGGAAACOCCTACG 

GTTTTAAAGT7TT 

>Sequence 1393

GGTACAACTGCGCTACATTACTGCCTAAAGGCAATTTCCAOACTACACAG 
ACAGAGAGGAAATGCACATAG AGOOCAACT G T C TCI C AAAAGAGACAAGA 
GAAATCTAATTTCTAGGGGGTAGCTAGAATTTTCAGAGCAATGTCCTGGA 
GAACAAGTTGTOC^CXAGAAGATATTCAAAAATCATCATCaG 
TGAATGTTTGGGAGAATACAAATCTGCTTATGTATAGGGTGAAAGACCAC 

agagctgatcaaagtgcaactatcaggatcaagaacaaattaaogcgaag 

gtgaagtgtaagctoaataattcocaaagctcacatgaagttgcaaattg 

agttcocagocaacagaatgaaaagatcttattgaaogtggaggaatttt 

aacggagccttaaaaggatttttactggggaactacactxkktrctaaaca 

caaaggttctttaaaccaacccttaacatggcttaaaaataagcccgaaa 

aagataagcttgtcctacaggaaacttaatggoocaactaaaaaaaagaa 

attcaaaagggtttaaaggaaggcataaaagccaaacgcccaacaattag 

aaatttogaaggttatacttaaattaaaaaaatactottcctoccgggog 

cccttcaaagggggaattcaaacactttgggcggtataggggtacccagc 

tag aaocca tatgta tag 

>Sequence 1394

GGT AC 1 I I III II I M HI 1 1 1 1 Tl ' m C AGTATGGGGTCT G TGTTGCOC 
AGGCTGGAGTGCAGTGACTATTCATAGGGGCAAGCATTATGCACAACAGC 
CTCAAACTOCTGGGCTCAAOTOATOCTCCTOOCTGAGOC rOCOGAOTAGC 
T GGGACTAT AGGAG TCCAC CA CCAC GCCAAGCTGGCA iTC TCTOT TTT C I ' 
TAI 1 1 LCI GATTCTA L 1 1 1 1 A G CmC IT A ATATGCTGATA IO I 1 1 ION 
TGGTATATCATATATTAAAAAACAGTTCATCTCATOOCCATCATCTATCT 
CTAAGAAGCACOCCAACCATTTACACATTAGGTAAACAATGGCAGGCATA 
AOOGA CrOAACATTCCATAGOCTOCI 1 1 1 GATAAOCCACATCCTTAOCTG 
CTTTTACT GTCAAA AAGTAAATCTACATTGGG rTTCTGCTOCTAATAATC 

AAACTT AA Ll 1 1 1 1 1 1 1 gaaggaattggtoatocaataaatgccttcatt 

TTTTTGAAATCTTGAAT GAAT GAGCGGATTGAATCAATTAAGGTT^ 
TATT AGGTT AAATACTTTTTTAAAA ILI I ILI lUAGAATTCAAAAAOOCC 
TTGGGTTTT AATCTAAGAAACCATAA 1 II 1 1 1 ULAAAAATTAAGAAATTT 
111 ILI I ILA ATTGCCAAOAA I 1 1 TCBC1 1 A AGGOL I tj CCC 1 1 lO GGOOA 
CACATTAAATTGGGG 
>Sequence 1395

GGT A f OC O GGGGCGGAACTGGGGTTGOGGOGTCTAAXilG 1 1 fOOG GTOOA 
IT CCCAOGGACIUTCCQAOOTCTTOQACTCTOACCl GC C1 A OGCTGGTCT 
OOGHAAGATGTTCTAOCATATOCTOOCTAGAGCAOGTAAA roCTTOCTOC 
AOCTC0q3CTACTOTCITXXXXAACTro 

TCTTT^OOGAGGTGGAGGGGAflCIGCACAOGOAAGTATGGCTTTGTAATT 
OCTGTCACCAOCATTGACAATAI rGGTOCTOGTOTOATDCAlOOCAGCOOO 
ACCL II rCTTCl I IATOCAGTTAAGTAjCTATGTOACIGATGAAACTaCTT 
GTTGAGGCTGCTGGAGCAGGGGCAACTAACI 1 1 1 1 iggcatcaaattaaa 
ACTGACA CATTATAATCC1TAAAGAATTCA1 1 ILI II 1 1 rOCTOGTCl 1 1 
CTCTTTGATCATOGGATOGAAACTTAGGCTOTTTAAATOOA Ai 1 1 1LILI A 
CTAAAAGTGAAAACCCCTTTTATAO L I LL HI AAAAAATAOOATTTGTTA 
AATCCATCOI Mull I IXiAAAAAAAATTTTAAAATTTTGGCTGGACAGTT 
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CCTTTGGGAAA L 1 1 1 1 1 1 A CTTATGOOCCAAAAACCTTTGTAAAATTTAT 
LI 1 1 1 1 OA AAATTIT TTAGG OCCCCTTATTTATAAGGGTTCOCAACCGOG 
GATATTATTTTTATATTTTTTA 

> Sequence 1396 

LtXIlA GOGrCKirCGOGGCCGCGGTA LIl l llOI I MA I IllM A II I II 
TTGAGAGGT ATGA I I L I I ILI AGAGA 1U 1 1 1 L I LA TOOCTACTATAGAT 
CAKOAATXjGCTGATTGGAGATTATTAGATTCTAGGTTAACTTCT 
TTACOCTAATACATAAAA L I 1 1 1 1 LL 1 A AATAAATGATGGAAGOAATAAT 
ACTTCGTTACCTCGCATTATTTTTCAGTAAGAAAAAAGCTITACTAACCA 
CTACA TTT ATGGAAAATTGTAGGGGT AA GT ATTTTA T AGGTCA T AAAAAA 
CACCATAATATAACGAATCTCA I U I LI 1 1 AAATGTGAATTAAATCCTAA 
CAGTCA TCTTTAT AAAATG ACCATAGGCTAAAA TCTTAOOTGTAAGT 

> Sequence 1397 

CACATGTGTOGCClTAAATCAT«^CClTlt> 

GGCAGTCTOCTAGGTTCCAGGAATCTGGGGCTAAGTGAATAAGATGCAGC 
1 CC I ' I ACTTTAAGTCTCGCAAGGAAGATTjCA I I 1 1 1 1 AOGTATCTTOCAC 
AGTCCATTGTGAAACATGCCATAAGGAAGGGATAAACACTGATOACAAAG 
TAATTGCCAACTKTTACTAA 1 1 1 I O I LA AATTTCAGAG AGGTACC 



ACAAGTTOTAACCCCTGA 1 TCTG I CAATGTQACL 1 1 FCTO GAAOTACGGT 
CACTGClAGATGTAATAAGTTXIAGGATCTCAAGATGACiATCAT^ 

CAGGATGGGACCT AACGATAATGGCTGGCGTCTI iataagagaaaggaga 
atgagatttgagacgcagacatgcagagaggaaagcgagatggagacgga 

AGCAAAGCCTAGAQTXXnTAACCTACAAACCAACGGTTGOCAGCTGTCAC 
CAOAAGCTGGAAGAGAGGCATATAAGATTCWTCCTTAGA G TCTCTGGAAG 
AAAOCAACCCT G CTG G I GTCT I G ATTGCAAGCTTCTGGCTATTAAGAAGT 
AGA L I IL I I O CCTTTTAGAACAGATATCTGTTACACACAAATGOOTAGTA 
ATTTTTTAATtnCANGOCTTAAGAAAACTGATATATTAATGTAAGTTTCC 
CTGTGGGGCTCTACGGGCCAAA I ILI 1 1 IOOI I L AAGAGGCTTGGTAGAA 
TGGTTACAGGTA CC LI I IGGCCGGTAACCACGCCTATGGGGCGAATTCCA 

taatgggattcgcgacc rccn a caaac 

ATTGTTA 

TCCCGCTCCX^ATCCCAAACAATATAACGGCOCC^ 

AAAAGOCGGGGCGGOCCAAATAAATTGCGCAACCACATAAAAGCGGTTGG 
CC 

G&ATAAATAACTTGTTACCCTGGGACTTTAAGG^ 

G<X^A CCn A AGGGGGGTTATCCGTCGCTTT Xt rACA C XrrO GG OGATAACX* 



GGCCT 
> Sequence 1400
GGT A C 1 1 1 1 1 1 1 1 1 1 1 I I 



II 1 1 1 1 ILI I IL I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

n i uiini i iuuimii i nn mimum milium 
iiiimum 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 AAAAAAATTTAAAAAAAA 

AAATTGGGGGGAAAAAATTCCAAAAAAAAAAAGGTAATAAAATTAAAGGG 
AAACA 1 1 1 11 11 1 A AAAAACAAAAACAAAAAATTC AI 1111 fO OCOOGCT 
TTAACAAAAAGTTTTGAAAA TCCAAATTT AAAAAAAATTTAAAAAAAAAA 
AG O 1 1 1 1 1 1 UAAAAAAAATTGGCAAAAATITTAAAAAAAOCCCCOCCCAA " - 
AA TTTACCCTAATTTTAAAAwtCCTTTGGAAAAAAAAAGCXXXnTXSCAATA^ 
AAAATTCCAAAAAAAAAAAAAACCCCl 1 lOGAAACAAAAAGGAAAAAAAA 
AATTGGTTATTTTTTAAAAAAAA 1 1 1 1 1 1 1 1 lOCACATl 
AAAGAAATOCAAAAAATATTTCTTATAi 



GGTACTCAATCAGATOTTAAA 1 ILI I L AATGTAAATGCTC1G I CATGCCA 
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TCCTAOCTCCTOTCTOOOOCAOOCC3CTCACACACAOCTAAAAGCACTC 
> Sequence 1402

ACTTTT AAAATATATATTTTCTAATTTTGAAACATTCAAQCTGCGCATAA 

^ ^T^^ CA0CTCTAATCTICk5CTACTroAGAC<XrroAG<X^CGAGCATG 

CCTTGAG<XXAGGACTTCAAGACCACCATtrrCCAATACAGTOGGA 

TCTGTATTTAAAAAAAAAAAAAAAAACAAAAAGGAAGGAAATGTTCAAAT 

ACACAGAAAAGTTGAAAOAATATTATAAAGTGAATATCTGCATACTTTTC 

CCCTAGGTT ACCTGTCACCTTGACATGCCTTCTGAATTGTAOC 

> Sequence 1403

ogta c i 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 n i n n n 1 1 

1 1 1 1 1 agaaogctcgtattataacatttattaaaataatgctgooootta 
atagaaaccgccaagaaccaaagaattaaaatgcaagctatitaaaatcc 
caact aaaacoc^ aaagggtctaatxjtrrtcattcattaactaactaaaa 

COCCAAAAAAGACACTACAOOCAATGTTATAAAGTTCCTGCCCQGGCAGO 
CCCCTCGAAAGOOCT 



ACTTTAA TTT TTTCTA 7TTATCAA TTCCTTATTTGCTTTGCTCATTTCTC 
TAGTAAGCTGCI1 1 TG7TAATTTCTGAGTAATTTATTCTAGCTATCAGGC 
CTCTGGCA TGTTTCAAATTTCCTAGTG TCTTTGTCAAAGAGAAATTTTTA 
ACTTCAACATAACTAAI IIGrCATCTTTGTCCTTTA Gl 1 1 I 1GIU ATTTT 
AAGGACATAATATCTATTAC7TTAAAAGTATT0AAAGCTGTATGTATATT 
CTTCAACTAGCCaCC TTA 1 1 I C1U1 I CTAGAGTTTOAATTTCTTAACTCC 
AA AAA CAC A CA A T AA TTTTTAAAGTCTTG A TCAAACTCTGTT ATCTTCTG 
CATAGTCTATTTTTCAGCATTCCATTAAATX1AATTGAGAAAAAGGAGGTA 
CC 

> Sequence 1403 

ACCTGGCTACAGTAAATGCTCAAGGCC C II I OrrATTATTTCAGATGGTC 
AAG AATA AA IU 1 1 1 1 1 C AAGG AT C I ICI 1 I IIL.I AGACAACTGTCTACTC 
ACAGTTT AGAGTtXnAAATTATCTGCCTGGCAAGATACTTT 



GGT ACA TACAATAG ACTA TTA TTCAGCCTTAAAAAGG A TG AAAAAA TCCT 
G ACA TOCTA AAAT A T AAATGAATGTTGAG AACA TT ATGCTAAGTGA AA TG 
AGCCCATCTAAAAAGGCAAATACTGTATGATTTCACTTAACTGTGATATC 
CAGAGTAGACAAATTCATAAAAACAGAAAGT AG AATAGAGGTTTCCAGGG 
ACTCGG AGTTAGTTGATATAGAGTTTCAA TTTTCCAAGATAAAAGAGTTC 
TGGATATTGGTTGCACAG<^AATATGAATATA(riTAACACTACTGAACTGC 
ACACTTAAAGATGG TTAA GATGGTAAA i 1 1 IOI 1A Q GUU1 1 ICI IA CCAC 
AATTrAAAAAAAAATTTrAATrAAACGAATTAATAAATTTACAAAATACT 
^'T 1 GATCATTGTGTTTCCAGTTTAATATTCAACACAGCAGTATTTCAGGT 
ATAGTAATTAACTTACTTTCATTTGAAAA GATGTCTATAGCTTAATAAAT 
ATCAAACTCTTATTCATACri' n U1TGATAATCTAGGAGAAACCAAGGAC 
CCAAATG CAAA TGGGCTTCTXACTACTTCACCTGCCCACCTIGAAATAGA 
AGCOOGATTTCACACT ACCTATAG 
>Sequence 1407

GGTACT AGAAGACCrrCCTCGCCACTCT^CCACATGAGAGAGTCAGCTG 
C^iJ±i<XXCnQCCTCKlCAGGAAGAAClVlVl ICCATGGCACATCTC 
AOCTCCTCATTGAGGGATAG mi CmGATAAGAAACCTGGAGTOCATT 
T ACTC TGAOCICICI 1 1 AAATCTATATCCA GAGCCACT AGCGCAGGAAAA 
ACTTjO GGTGAOCXXn rAATT TCr CTT C TO CnXJ C ro TO C l rT OCTC T TA OQ 
COCXACCOCAACTOOCOTAAATT/TTACAGGCITATGACAGTTTGTATGT 
GXTT CAGO CAATGAGKIAGAAAAOCTGGAAAGAATTTCTGOACrTTAOOOCA 
OCAO 1 1 1 G IC 1GG 1 1 GACT AAOCTGC TG AGAGOTAAAATTGGCACCCATT 
GCCOCGTGXXTTCAGGCAGTCnxrrGKX3GO^GUUn'ATGOCACCATOGGAA 
TATC AG<^AClXMGT0OGAT0TTKXnX3ATG<riCACATGACTGGCTAGAGC 
TTTGGCGG TGGGGAOG<KSO GOTAACTACTATTTATTroGOCATGATCTCT 
TTAOGCCI ICI 1 1 J ATTTTTAATTAAAAAAATOGATOCAATTAATAATTC 
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GGTACCCTTTATAGGAACCCTCAAATTAAAAAAAAATGTCTTTTAATGGA 
TGAGAGGGAACCACT ATAACATGAGTCCAAGOCCAOAAQACl I C 1G T C T A 
TACAATA I I 1 1 1 I I I 1 AATTTTGGAGATAAAAGCTTTAAGAAACI'1'ITIG 
AGTTAATTATACTCATAAAATGA O 1 1 TCI 1 1 A ATAAATTAAATTTTATTG 
TGTAAA A TGTATTATTACATAAAATGTG 1 11) lU AATCAATOCAGTTTGG 

ggatgaatataattaaaatatgtttaataacttagaattcaactaataaa 
aatttagccacacttacaagggggaggaagtcoctagtttaaaatgtata 
actgagtggtagatcagt 

> Sequence 1409 

GGTACTATGTGTGTATTGTTACTATTACAGTTAATCCTTTCTTTGTGTGA 
GCTTGTTAATGCAGTCIAGGATTGTAGCACTGTCCACTGAGTCTCTGTGCA 
ACAACTrAGTGGTGTGGCAGGGCTTTCCGG TG TCTGGCTCTGA TCTTGGA 
CGCTGGATAGTDGTCTGAGTATCTTCAGTGCCCAAGGCGACGGCTTTCiGT 
TTGGGTCACAGGATGGTGTGGTTGGCCAAGTGtTIGCCTAATAGTTTTAGG 
AGAGGATACTTATTTGCTGCTGCATCATCAACACTGGTAGATTATGGTTT 
CTGAGAACAGATGGGGCACACAATTACTAGAGCGCCCATTAACAGGTATA 
TACAAGTACCTAGCACGCGOGGOCTGCTCG I IGGGOGAATTTCAAAT 
>Sequence 1410

GGTACGAGCCTATAATCTCACCTACTCGKKJAGGCTGAGGCAGGAGAATTG 
CTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCGGGATCATGCCACTGCA 
CTCCAGCCTGGGCAACAGAGCGAOACTCCATXnTAAAAAAAAAAAAAAAA 

aaaaaaaaagagagagagagaaggaggggagaaagtgaagtcataagtgt 
agaccactccttctgagggagaatccaccccaccttcctcctagcttctg 
gtgcttgc tcgc aatctttggcgttocctagcttccagatgcagcacixx 
aatccctgctttca1ci ici 1 agggtggtctccctatgt 

> Sequence 1411 

OGTA CI I II 111! I Hllllll l II HIIOllUUIIHUllllllll 

1 1 1 I 1 1 1 1 IA AOGGAGTAA GI 1 1 1 1 1 A ATCCACTTAAAAATACAAG AGCA 

CAAATCCACATTTATTTATTGATTTTTCGTTAGTTTAAATCCT 

TA CI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 1 G GGGGAAGATAG AAAATGTGCGGC 

TGATAATTOGCACATCAGTGTTTrAATCCGCCCCC 1G C 1 ICI GCGGAGAA 

AAATCTTTTGATOTrAATACTACCAACATTA GI IC I II I A GGGGGTGAAA 

GATTGCCCCAATGGCGGGGGAGGAGATCAATrTTrTGGGGGOGATTlT'IG 

AGGGGGGC<X»TGATCAGCAACAACTCTTTAATAATTGCCGG<rTTTTATGA 

CGCCTACTGGGGGTGATAATTTTCTA CI 1 1 I ICI ACGAG G I ' l l 1 11 1 ACT 

AGCG AAAAAA GAGTT GCT L 1 1 1 1 1 lU GACCAACAAATTAATCAACGAGGT 

GTTT A>GGAGGTTITGGCGGCATATTAGAAGGTCGAATATTA1 T I TTGGT 

GAACACCCTTTATX^CCATCCTOGGTOTGTTTGTOOOCCCTTATIGCTGG 

CGGCCCGT 

>Sequence 1412

G AGCGGGCCGCOCGGN GCAGG TN ACGCCGOGGGGTGG 

ACCGCTGGGACCAICCIOCI till IUCI 1 agcoaaaaaaatacccctttt 
ggotaggaagacccccagaagaaaaggaaaaatttaaatcaaagoooooc 
aagaaaaattccccccacaaaagtcaat 
gcccoocxxaaaggcocaagt 

>Sequence 1413
AC! n 1 1 J 1 1 1 1 1 1 1 



rrrTTrrrrTTGTTrrrnTi 
ttt 



nun 
liiimiiiiiii 



IIIIII IIIIIUDII I 

ttrrttgggogggctoocaaaaaal 1tttttggggaaacttic0caaaat 
ttaaaaaaaataccgatttttaaactcaggaaaagttaaaaaattittta 

AGOCCCCTGAAA TGAAOCAAAA AAAAA I U 1 1 H II 1 1 i 1 1 1 1 A CCACCA 

atgaaccaacctagagggaaattccacoccagtggggggggtaaataagt 
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aatoccctccacagacacacattggacaaaaatagaaaaaaaactgtttc 
tcagaaaaaaaataattgcccgaaaaactcaaacaaaaaaaaaacccgaa 
ataataaattggacacxxxxkxkkttcacaaaaaatgacacaacagtcat^ 
attgcggtagcccaacoogacctttttaaaaagagaaacactattctgcc 

CCTTTATTTA^T^^ 

>Sequence 1414
GGTACGCGGGTCAATTA 
>Sequence 1413

CCCITCGAGCGGCCOCCCGGGCAGGCACAAC^^ 
TCATQAOCATAG1GI 1 1 1 1 1 1 1 CCTATTACTCTTTCACTTACTGACAGGA 
TT ^CCCA TCTG^CTCATCIU ICC1CCICCCAGA C1C1 IC I lOATCTT 

TGCAAGGATGCAGACTTTTTTATXTTGCGATGCTACTTTTATAAAA^ 
ACCGT AACA TAAATAACTCTTTAATCA AAA CTCAGAAAAAT ATT AAA TCT 
A TTCT T A A A A GGG TTT A GAAA G A AAGA AAA G AC A GCTGTT AGGTT ATTTG 
A TTTTCA A GTTTA TCAAA TA AAA TTCAAATAGAATTGGCAAATCTTTAA T 
GGCATATGAAT ACTTCTATCACTTAGTAATTAATTTGAACAG AGATGTTA 
TTAGGG 1 CC" 1 1 AGTATCACnXXATCXnTTCXXTTCCATCTrrrATACAAAAA 
AGAACATACAGAAATTTAACAAAGATATATGACTTACTCATATOTTTTAT 
A AAAAGTA7XACCTAGCAGGTGTCTTtXATTTAATCTAACANAGGTTTAT 
GTAGCANAAGATACATGAATGAAGCCCTAATCACAGAATCTK 
> Sequence 1416 

GGTACACGTO 1 i I I CC I GAGTTCTGGGGACAGCTTTAGCAAATTAATCAA 

ACCTAAGAAGGGGGTCATGGGA ACACTGACTTGAAGCTGGT fGGCCAGAA 

GTrcTGGATGAGGCCTGGCCTTACAACTAGTGTCTGAAGTGQGQQCAGTC 

TTGTGAGACTGAGCCCTCTICTCAOCCTGTGGGATXTAATGCTATXTCC^ 

GTAGATAGCATGAGAATTGAATTGGXTTAGAAGGTGCTCAGCTGGTGGTA 

TCTTC TGCAGAACTOATTGCI ICI 1 GT1 GGTGGGG AG AAATCCCCACACA 

TTTGGTCACAGAAGTCTACTG1GTTOATGATTGTGGTGTAAGAGCAGAGG 

AAAAGCAATTTGAni 1 1 CI CCACAAGGGGAAGAAAAT/GTTTCATGATTC 

AACTAATGATTTACCTTTCATTOTAAGGTTATCATG<rrcAAGTATTAATG 

TAGGAAGGCTKTTITGATGCAAAGTGTCTCTGTGTCTCTGT^^ 

TOl'GlOIOl I IGGAGAGGGCTAACTTAAAAAGGGGAATGTATAAGGAAGA 

AGAAATOGTGTICTAAACTrAANACOCATKn^TCTGCT 

GTGAATTN 

>Sequence 1417

ggtacagatcacaoctttaagatggtcctccaaacaaaagattctacaac 
tttagtt a tttagaa tt agctttga g actttgggcaggtcacaatttttc 

TCTATCTGC TATOCTCTAACTTCAGAACC AGACACACTACTAACATCATA 
ACATCCAAACI JOG 1 1 1 1 IUI 1 1 1 1 1 1 1 1 AACAGAT AAAAAA TGTGACTG 
GGCACAGTGOCIXATGCCTGT'AATATCAGCATTTTGCKJA^ 
OAAGATCOCI I GAOOCCAGO AGTTIGAGAGOGGCCTGOGCAACATAATAT 



ggtgtggtattgagcactgt a . „ _ _ 

agga1 iuci luagcocaggagttcaagaccagcctgggggaaagrrttcta 

CTGGGCTOCAAAACAGCATCTAGCCAI IlilCClCI I C AATGT 
>Sequence 1418

GGTACTAATTTACAGCAACAGGTGAAGTITOCrAGAAGAGTOGTCAACrG 
GTAACATGG<jATTAOCT0CTAGAGGGACTGAGGACTCTAAAGAGAACATA 
AGCAGCAAA TTGCAAGAGO^TCTGT AACTCCTG<KXrTAAGGCAGGOGACC 
CAGGAGGGAGCAAATCCAGKiAATGGGGTGGCTCCOCAGG^ 
GACCTCATTAAACAGGATTTGGTCAGGGCOCACTGGATAGfrOGGGAAGOC 
TOTOGGCTTGTOCATCTOaTOOCTOGCAAOCAGOOGiX. llA .1 1 ICIGGGC 
C IGC fGGTOGAAATCACTAGACACTTAOCCTGTGGXnOOCTCCAACACTT 

tctgggogttataaggaagatcgoctctagtgtgctagtggaa c rcTCTG 






GAAGCTACCTGG AGGGTGATGOCAAGAGAA 1 1 1 GC I G GG AAGOCATGCTC 
TGOGGAACTGGTGG AACTCCCTAGG AAACT G CC 1 0 1 GGGTATOGTGCCAC 
TOAAATTCACTGCGAAACCiCCIfCT G <^ llllClllCIICCIIIIIIJL 



TTTC 



>Sequence 1419



TTTTTGGCTTOCAAAAAAA 



GGTACACATAAGTTCATTCTTGGCTTrTTAAATTTTATGGA^ 
TACATTTG1 GIXTI ATTAATCAAAATATGAATTTAGAAGGAAATAATTTTG 
TGT AA AAA ATTGT A TGTGGT AA AATTTT ACCT AATTTA AAATTGTTGTTC 
CATAA T I T J ITITAAAAAGAAAAATTACAG AAATAAGACTTGGGGGGTGGG 
GGTTGAAAAGTGXTTGAAAGAACTAAACAAGTAGAAGAGGAriTCTAAAGC 
ACTGGTCTCATGAAAAAAGTTTCA7GTGTGACTGGGTOCACTGAGATTO 
A AA G A A ATTGTTT A T ACG AT ATTCT AAAA ATT AAATXj ITGCI G1CAGGGA 
TGACATGATACAGGACCAGAGTC I G I O I A A ACAACAAAGTTTTCTTAAAG 
TA TTG A T A C ACGCTTTTAAAA A TTGCA AGA GGTTTT AAGTTT AATTCAAA 
AAICIGI'I IAACAGCCATTTTGT 
> Sequence 1420 

GGTACACCTCAGAGAGGACTrGTATCTAOACCAAGAGGACTATGCCTGTG 
GGCC AAATCT AG CCCAAGG TCTTG 1 1 1 1 1 G 1 A AA GTCCCTG TG AGCTAAG 
AATAGl 1 1 1 CAT AC i 1 1 1 1 AAAGAGAGAGAGAGAGIG IGTGTATGTGTGT 
GTGTGTATAATGTGACAGAGACTTTATATGGGCCTCA 
>Sequence 1421 

GGTACGACGTAACTCCAGACATAGGCTTTAGACGTTCTCATGCCACCCTA 
TCTTCAAAACCACAGAGAGTTCATGAGCCAGTCITGCCCATCTCCAATCA 
GGGAACTTCTAAAATAAAAATCTTAGCAA IC I CCI I G GCCCAAAACTTCA 
CXXCATCTTGG AAGGGAGGGG AG AG AG AATGTTCTG ATCTAT ATCTGATG 
AGG<K^TOTOGTTGGGACCTGAGCATCCTCCTGGTTGGGCTAGTGATGGG 
G AGAGAGGGCTGTTACTCACG AC 7CCC TOCAACAGAATACCAG AAACAGO 
CAGGCAGCTXIAGGTGTA TGTAAOG ATGTGAGGCJCAAGAAA OCAGC CCTCA 
CCAAGTT A CCCCTGT AAA TCCTTG TCTCCCCATGCACCTCTA CTTTG A GT 

cagaaatxksattca ttgca ggctcag i ig htg i attatgtgaatgaact 
gaacgtaaccaagcaocaagagagccct aaagacacagtagaoctcctgt 

AGAAGGGCTCTGATGGACCTTCAAACATTGCTCCTCCAACTNTATGGTGC 

ACACAAATCACCTGTGCATGTAAAAATGA 

>Sequence 1422

ACGAAATTCTCrTATCAGTCAGGGTTCAACCAGAGACACAGAACCAGTAG 



CCCAGG<JATGAGTAATtXK3AGGGGAGCAGCAAGCACAGGGAAAAGATGAC 



CCTCACCTGTAAAACTGGATXAGAAATCTTACAAAAACAAAAAACAAAAA 
A CCTCTTCA GT A TTTCCCTCAAA CA GGA TCCTCCTCACA TCTGTATTTAT 
A TTTAAAAAATAAAAA CAGAAAAG AAAAAGAACCAGCATGACATCATTAG 
OTOTGTG TACC 
>Sequence 1423

ACATCATAOGACTACTCACTTCTGCI 1 ICATGGATACTGCCTXX3GTGGGG 
GTTCACAACACTTATAAGTTAGAGAGTTTGAGAGCCAGTGGAAAGTAAGT 
GGAA G I ICI I L 1 GAAATAAGCCCCTGGCAATTTTCTGCAATGAAAAOQAG 
CAGAGGTCAI 1 1 ICI I A TAATOCTCAGOCTCAOAGATAGAACAC TGCCCU . 
. COTACTCTGGTTCGGGTTCAAGTGAGAGG C 11 II C ATGAAAATCTTAGGA 
TTGAAGAGCrCTAAGTTCAGGATATCTXAATGTTCAGAAAG 
AAGAAGCCAAACCAAAACCATTTAATGTGAACACAAAjCC ICI 1 1 ICI 1 1 1 
AGTAAGTTNTA CTTTTAATACAGA CTCAAAGAAAATAAAAATTTAATAGG 
CT AAAA CAAGTCAAACACCCATTCT ACA CTTGATAAAACCTTCACAAAAG 
TCAACTGAAGTAATCCGGAGKnGAAACTGAATtGGGCCCAT^ 
AATCACAAAAGTCA TGTAACACAAACAAAAGTCGATT AT ATTTACACA CT 
CACAAGCCCTCTAAAAA TG I GCCCCAAGAAGCATTTAOC 1 1 ICI 1 1 IO GC 
CATTCTGAAGATTTCACATTTTATTTTTAGATAGCTTAACATT^ 
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OOGOrrCTCTACATOOOCTATQ Ol ITCTOCI IO GOCCGAACACCCTAOOC 
CAT 

>Sequence 1424

ACTAWT TTTTTTT A AAAAA Arrni T OGAAAAAATlTAAAAATTTATTTT 
TACTTAC H ill I ATTITTTATTATATTAAATATAATATTTTAAGGCAAA 
AAACATGCTTTAAAAAAAAATTATTTTAAAAAATCOGGCAAAAOOTTTAA 



AAAAAAAAAATTTAAAAAAAAAATTGGCTOTTCAAATTAAATTATTAAAA 
CAAAAAAAAACTCTA TCCAAAAA TTT AAA TTTT AAAA AAAAATTTAAAAA 
AAAAGCTCCTTAAAAAAGGGGCITAAAAAAAACAATGTGAAAAAAAAAAT 
CCA TTTT ATACAAAATAAli I 111 Hi 1 1 AAAAACATAACTTGAACTAAAAA 
AAAAOCCTGGAA TATTT AAAA T AAAAAAATT AAAA GTTCTC A AA T AA A TC 

A CACAAA CTAAAGCCAA AAA TTCAAA AAAAT A TCOGCAGT AA CT AAAGA A 

GACCAAACTC1XMAAAAAAAATCCGAAAAAAAAAAAAAAAAAATCATTCC 

TTTCAAAAAAAAAATAATCTCCCCAAAAAATCAACAAAAAAAACTACGCG 

AAAAAAAAAATTGAAAAAAACCAGAGCOCTAACATGACAAAACTCAACAA 

ACAATAATGTTOCCTAACCAAAA 

>Sequence 1425

ogtactaocatcttaacaatattaao ictici gatocatooocaocaaat 

CT CTI fOCACITATTTGCCTCI ICI 1 TAX} I 11 1 I FC AACAA TC1 1 1 1G T 
ACTTT OCAC AGTAAAACTTTTATCC 1 1 WrrCGCTAAACTTATTOCTATCA 
AA l IU I 1 1 1 C ATOCTATTGTAAATOCOA I POCI ITCTTTTTCTT1 TCI 1 I 
I 11 1 I I I rrOOAGAGAGGOTCTTOCTCTOTOOTOCAAOCTAOAOOOCAOA 
AOTOCAATCTTOGCTCACTGCAACCTACACCTCC T C OCCTCAAOCGGTDC 
TCC1t!OCTCAOOCTCOCTAGCA<nTOGGACTACAGCCACATGTCAOOCAA 
AAAAAAATAATTKTTCTA 1 1 1 1 1 IOTAOAGACAGGGTTTCACCATCTCGG 
CTAGGAACCTCTTGA ICTC1 1 C AOCTOCTCATCTOOOCAGCTOGOOCTTC 
CAAAGTGTTAGGATTACAOGOGTGAGCA G 1 1 1 I C 1 1 1 1 QCTA HOC 1 1 1 A 
QAAATOOAAT TTCCCTC TOCT 

CCTO CCAAT ACCTCCGATTACAGCT^ 

TGATTTOCAAOCTTC 

QOTAOOCOCTTCAOOOOOCTOI I CA ACTAAGCACTCT ACTCTCACTTT AC 
TCCTAAATCCACCIXXMCCCTTAAGTTTCATAAGGGCTATCOTACT^ 
TOGGCTAGAAAATGTAGCCCAI 1 PCI I U CCACCTCATGOGCTACAOCTTC 
ACCOCCGOGT 
>Sequence 1427

ACATATTOCTTAGAGCA GTOCT TTCAQATATGAATCATTTCrAOAATOOA 
TT ATAOA AOOATC GOAOC TTT AGTATTTAGT AG 1 1 IXXTTTCTTCTOCCT 
AACTTTACAATCCATTTTAAAAAATGAATOAATTAAOT ATCTCOCAAACA 
AACTOGCAAT ro CTC TO AAOACAACTTTAGCAA 1 1 1 00G I C AAATAATTC 
TCrOOCTTCOOCCAAGCCOUnXUTTOATTTCT 

iTAGCTA 
AQAOCOCTAAACTOAA 
OTCOCCAATICr O I VI A CTCTAOOGTOCTGGACAAAACTAOTACC 
>Sequence 1428

ACACTCTTATTTTCAGOCTAAAGAAA TOOACACTTCTCACCATAGGOGGA 
CCTGATTGGTTGTGGACAA I IX 1 1 1 1 CC I AAOCAOOATCCATAAT ATCAC 
AOACAAGCTAATATAGCACTXnOAAOO ATCTOTCl I ICI FCAAATOOAQC 
CATOAOAOATOGTOGTTTrTTAAGTTGATTTOATG 
OCTOTO OOAG AOAATTT TTTTAA ATAAAAAATACrCrrTTA 
TTCTAACTTGATXrrCTACCTTTTCCCCTC 

CCAGCTACACTTTCOCAGTTTGAAATAATGAACAAATtX 1 1 1 IOCTG ACA 
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GACCAAACCTTAGTTOCTGTGGGCAAATGAG 
ATGAACAA 1 1 U l l rO AAAAA GTCTCTC AAAOA TGI ICI lA TTGGAATAA 
COCTTCTTAAA CCCAAACAGCCTAAATGATTGGA TAAATGTTOCACAAGA 
TGAAAGAGCCACCCAGGAATtTTACAGCTOGACTICATTACTTC 
TTGCAAAAATAGAAGCCAAAGCTTGACTT AACTGGTAATAG ACTTAAAAT 
TTGAGTTTGCTCTTGGTGCKXrrGCTTATCCC Xr IT I TT ACATOACAGAAGT 
GTTGGAGTAATTTTATGTGGCCTGGGTroGATGG C 1 1 I I 1 1 1 ICTC ACTA 
TCOCAATAATTTTAG 
>1.1

GCGGCTOAGCTACTJ ITTTTTTTTTTtTTTTTI I GGACAT ACTGAGAGAA 
TTTGGAATTATATGTTATGGTAGAATAAAGATCGAG^TCCATTTfTCTAT 
ACAT GAAAATTT AAATATTTAGTTTGGGATTTGAGACTT CT ATTAGGCCT 
CTGT Al I ICI I rCTAGI I 1 1 1 1 00C I ACCATTCTTT AATCGGAGT ATCCA 
AGC OCAATCACTX rrGTATCCTATGTCCTAAAGCATCTTGAATTGCTTGTT 
CATGI II I I ICI I CATGTGGAGTGTCTTTTGCCACCCT CTT AGCCTATCT 
GATC CCACTT AGCCTCTGAGGTTCTGTTAAGTTCT CACCTTCTTTATGAA 
TTTTCCCCAGCCAT AAT GAT CTTTTT AACCTCTTT GAGCTTTT ACT ATTT 
ATACT CTTT ACCTAACCAACT AAATGGTTTTT 
>2.1 

GGCGGCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGAGCCGGAATC 
CGGACAATTATGTCCG^XaAAACCAAGTTGGACTTAC^GAGAGTTCCAAGA 
AA CT AT GATCCT GCTTTACATCCTTTTGAGGTCCCACGAGAATATATAAG 
AGCTTT AAATGCT ACCAAACTGGAACGAGTATTTGCAAAACCATTCCTTG 
CTTCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCA 
GAGAAGCTGGCTACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAAT 
TTGGAATCT AACTCAGCGGAATTGT ATCCGT . ^ 
>3.1 

GGAGAGGAGTCCTTTACTT AGAGTCAAGCTGAAGGAGCATCACAACCCCA 

AAGACI G 1 1 ATGTTGTGAAATTT AGGCTGTGTTTT AATAATACTGATGAT 

GAT AGGATGAAAT AGT AATTTATTGATTACT ATATCT ACT AT ATGTCCGT 

AAGATAGCAGGGTCTTTATACTCGGAATCTCATTTGATCCTCATA^ 

TATTGGTTATTATTATCCTCATTTTAGAGATACAGAAACTGAGGCTTCAG 

AGAGGCTGTGT AATCAAGAGTTTGT ATGCCTTT CATCTGAGGAGGTTGAG 

GACAATOCCAAGTTAGAAAAATAAATGTCTTTAGCATTATTTTTCCT 

TGTTTAGAATATTAATAAGTTACTCAGATAATCTATTGGAATTTCTTCAT 

GGCAGGGGGAA 

>4.1 

GAGGTACTCAGTTTCCTT ATCT ATAACATGGGGATAATATTCGT AGCTAC 

^Jj^JTC^ATGA^^TCAATATCT^ 

TCTTGTACT 

>4.2

CTCAGAAATTAAGGCAAAAAGTCTTACTGAOCATGTAAAGGAAATCCAAC 
^^ATAAACAGTCrTCTGCCTTTAAGGAGCTTATAGTCT AGTT AAGAAAC 
CAGA 
>5.1

GGOGGCCGCCCtkSGCAGGTACCATGGAAAC 
AATTAGGTT GAACCTCOVGGAGOCXXTCAGAGTCTGAGGA 
TGATGTCT AGATACGACGACAGCAAGGCTGCTTAGAGCTAACAG 
GCU 1 1 1 CACTAOCGGACTCTCCTTTGCAGCTGCCTTGCT 
TCAGCATGTCTCTAACCCAGAGCCAGGCT falfaC I I ITTTTS1 
>6.1

**^I GA 5^ T CTACATTATTTTTATG^ 
AAJGTGGGCAGTAACTrGC 

IE^^ 1 ^?9^^ CrcGCACTCC1 00 1 CTAGGTCTCACCTACACGCTGGT 
11 u i t*GGCGGAryiGGCAG€TTG^ 

GCATGCCI ICI rOCTCl FGTGTCIC1 1 GGCCACCTTTTCCAAAAAGTCAC 
CAGTGACCAATTCTCCCAGTbl I ICI I IGGGACTCAATGCCTTGGGCTTG 
GCATTG^AAAGOCGACTGGOCAG^^ 

GTCXXSGTGTGGACCTCT GCOCTCCCTGCTCTGCGGAAGC I rCCTCABCCT 
TTGCTTCTCACT ATTTACTATTTGCGGG^^CTGGGGGTAC 1%XJGMXCT 



^^r^r^J. r^T ^ TAC GGATCAATTCCGCTGAGTTAGATTCCAAATT 
C 3 AA £CTCTTXATCACACGCOCCAGAAAGGAC^ 

atgctttggcaagcaattgact covtcacggtgaccatccagcgaagcaa 
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ggaatggttttgcaaatactcgttcx^gtttggtagcatttaaagct 
atatattctcgtg ggacctcaaaagg atgt aaagcaggatcat agtttct 

TGGAACTCTCTGTAAGTCCAACTrGGTTTCGttXSGA^ 
TC<XXKrrCAGCATCTTCACCTT CATCTCGGTTGCT CTTC 
>8.1 

GCAACCGAGMTGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGT 

ccgcgaaaccaagttggacttacagag AG ttccaagaaactatgatcctg 
ctttacatcci 1 1 rgaggtcccacgagaatatataagagctttaaatgct 
accaaact ggaacgagtatttgcaaaaccattcctt gcttcgctggatgg 
tcacc gtqa tggagtcaattgcttg gcaaagcatccagagaagctggct a 
ctg i cc i ii ctggggcgtgtgatggagaggttagaatttggaatct AACT 
cagcggaattgtatccgt 

>9.1

ggcggccgaggtajcgacatgcactgatagctctctttgt atgaacagagc 
t ^ggcaggccctatgccagggagaaagt aagattggaaaagagcttacc 
aaggaggtggcatttgc^actgtgcttaaggggcaagaaaaacgtcttcca 
atcaggagccacaaatgcttggctgaagtgctactgctctitc^ 
agctggaacagacgtcacgagtc 

>10.1 

tggcggccgaacatccatgttttaactagcacagacaaa^ 

act atcaaaataaaatttagaaaaaca ai I mci i ataaaattttctgtt 

tgt atttggagtacataaactggcttt aaaattgagaaatatgccct AAA 

accataaggaaa aaggc aagagaaagaacaaaaagatcacagcaattagg 

ccgttct attcaattttg ccatgagct aaaaatcacattcttcacaaagt 

AAATTACGCCCT GTTTTTTATTCTTAAGCACTAG^TTAGGATTGTGATC 
TGAGCTTTACTAAATCG^AAAAGAAAATCTCAATTATAGAACATTTAGTr 
• TATTTATACCTTAATGOCCGGAGAGGTAATATTIT ACTTT AAAATGCAT A 
ACCCATGTGACATGCTAGGTCTTCGAAAAC 
>11.1 

gctggcctggt tctcx^agggagctgagatcactgaagctgtggtcgctgc 

CGTGATGTGGAGCAGGCAGAGCTCAGATAGAAAAGGAGGGAGTGACACTC 

AAGCTGCAAGCAGTGACAGTGCCCAGGGCTCTGATGTCT 

>12.1

1 1 IC IC I 1 1 [GCATCATCAGTCTCTI I C I C ATCACTGAAACCTACAAATA 

TTTTAAAATCTTTCCATTAAAAAAATTTTGCTGATCATTC^ 

AATT ATTAAGAGATACTT ACTTTGT ATGAAAAATTTTGTOGAGATGT ATA 

ATCCA 1 1 Mil ICTGGGAAGAGAGTCAGTT 

>13.1 

ATTGCGTCCGAGGT ACCAGCTGTCATTCCTGCAGCAGGATTTAACAGATG 
CAGATCTGGCCCCAGTGTGAGCATCl GIGI I AATGGT ATCAGACTTAAAG 
AAGGAAA GACCTGATT TGACT G C1 G I P S G1 1 ItSGTAGTGI rCCCTGATOC 
GGAGOCA U I 1 1 lU I U GGAGGGAGTCCCAAAGCAGGTTTGAGCTGTGGTAA 
TGACCGAGTTGATCCTAGAAGACAAAACAGTAGAATCGT 
>14.1 

CGCGGTGGCGGCCGAGGT ACGGTATTCT CTT AAACAAGAGCAAGCCCATG 
ATGATGCCAI I IGU1CAGMGCI I GGGGGACAAACAAGAAGGAAAACTCT 
GAGACAGTGGTCAGAGGCTCCCTAGATGACCTGGTGAAGGT CTGGAAATG 
GOGTGATGAGAGGCTG^ACCrGCAGTGGAGTGTGGAGG 
GA GTGGT GICIGIGGACATCAGCCACACCCTGOOCATTGCTGC^ 
TCTCrTGATGCTCATATTOGTCTTTGGGACTTGGAAAATGGCA 
AAAGTCCATAGATGCAGGACCTQTQQATQCCTGGACTTTGG 1 1 1 ictc 
CTGATTCCCAGTATCTGGCCACAGGAACTCATGT^ - 
TTTGGGGTGGAAAGTGGGAAAAAGGAAT A I 11 1 i IGGCCACGGQAGGAAA 
ATTCATTCTT AGTATTGCATATAGTCCT GATGjGGAAAT ACCT 

ggtactgctccctgcacgatocagtcagcocci gcccggctggttatgta 
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acaaacaagtctgtgtctgtgtggagt6ttgcagoac 5 *gtgg.a.** t g.*g 
tgtttccaagttcatggcaattcagaaggcccttcagccagactggttcc 
agtgcctct ccgatggagaagtatcttgtaaggaagcaactt ccat aaaa 
agggtcagaaagtctgttgaccgatca ci ii i i t i c i i ggataactgtct 
gcggctgcaggaagagtcagaggttcttcagaagagt gtgatcattggag 
tgattgaaggtggag atgtgatggaagagaggct gaggtcagcacgagag 

ACAGCCAAGCGGCCTGTGGGTGGCTT CCTT CT GGATGGTTTT CAAGGAAA 
TCCAACAACCCTGGAGGCTAGACTACGCTTGCTGTCATCAGTCACTGCAG 
AGCTGCCGGAGGACAAGCCA 



TTTCATGGG ACACTTGGG AGCTAGATTAGAAG AAGCCAAGACT AGAATCG 

gggagatgagttgcagagggaagtggtgaaggtctgaaggaaggtaggaa 

aaggtcggacacattccagacatatttaggggtggaggtggttggatatg 

gggagtttaaaggggaaggaatgtggggtgatctgggtggtgag 

>17.1

tggccggccgcccgggcaggtgactttagtcctcactctgtgggcagggg 

cattacaggataggggtcccttttgtcagggatttatgatggcatcacac 

gcaggattca 

>18.1 

CGCGGTGGCGGCCGAGGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
tcggtcattaccacagctcaaacct GCTTTGGGACT ccct CCCACAAAAC 

TGGCrrCCGGATCAGGGAACACTACCAAACCAACAGCAGTCAAATCAGGTC 
TTTCCTT CTTT AAGTCTGAT ACCATT AACACAGATGCTCACACTGGG G CC 
AGATCT GCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGT 
>19.1

CCCCCGGGAGAGGAATTGGGAAGAGCAAATTGGTGCTGAAAATTTCTACA 
TTGATCCAGACAAACAAGTTAGAGCAGGCTGAAAAAGAACCCTTGGTGTT 
TTT ACT GTGTTCAACCAGATCAACT GGAAAAGTAT AGATACCTT AATTAG 
CACTGTGCTCTGTGGGATTCTGGTCAGCCTGGCCCAGTGGTTTTTTT 
CTGAACACGCCTGAAAGGGGAGCTCATAATGACTGCTGTGCAGG7GGGCG 

TTATTGGTAGCCCCATTTATACATGGTGGAAAAAAAGT 
>20.1 

GCCGAGGCACCACAAI I I II 1 1 AAGTT CT AAGGT AGCTTTCTCAAAGAAA 
ACCATTrCAGGGTGTCCATTAAAAGAGCATCTGCGAATTGTTTTTGCAG 
G ACTCCT AATCAGTCAGGAGAAGTAGAATGT AAGCAAAGTCACAAACCT C 
CXX5TAAGAATTTGGTTCACCAGGACACAGCTCCTCTCTT ATGAAGGGATG 
AGA^CAGACCCCAAACCCAGTGCCACAGTCTCCCTGGAAACAGCAGCAG 
GCTTOGGGAATGCTTC<iAAAAGGCTATGCCATTCAAGGTCT CAGGTTTTT 
TGGTT 
>21.1 

TGGCGGCCGAGGTACGATTCTACTGTTTTGTCTTCT AGGATCAACTCGGT 
CATTACCACAGCTCAAACCTGCTTTGGGACTG 

CTOGATCAGGGAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCC 
TTCTTTAAGTCT GATACCATTAACACAGATGCTCACACTGGGGCGAGATC 
TGCATCTGTTAAATCCT GCTGCAGGAATGACGCCTGGT 
>22.1 

cgcggtgggggccgaggtacagagtagagagagttctgcagggatgaagt 

GGGAGACGTTGATAGGACCAGACCAGACCAGGCCTTGTAGGC^ 
GACTrTGGATTTTACAOCAAGTGC^ACAGGTAACTGCTGGAGGGAATTCA 
^^^^^^^^^^^^^^^^OACAATTTGAACGOOCAC^CT ggctg 
CCATGTGGCAAATAGATTGTAGGAAGAAAAGAAGAAA 
gTGGAAGCTACT^ 

>23.1 
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GCGiaTG^GGCCGAGGTACy^CAGTAGAGAGAGTTCTGCAGGGATGAAGTG 
GGAGACGTTGATAGGACCAGACCAGACCAGGCCTTGTAGGCCATGGAAGG 
ACTTTGGATTrTACAC<>AGTGCAACAGGTAACTGCTGGAGGGAATTCAG 
CAAGAGAGTGACAGGAGCTGATTGACAATTTGAACGCCCACT CT GGCT GC 



TTGGAAGCT AC TACT GTTGTCCCAGAAATATGTAATGGTGGCTTGGCCCA 

GGT 

>24.1 

CGCGGGGCGGCCGAGGTACAAAAAAAGCACAGCCTGGCTCTGGGTTAGAG 

acatgctgactgatgagatcaccaaggcagctgcaaaggagagtccggta 
gtgaaaggcaatgcgctgttagctctaagcagccttgctgtcgtcgtatc 
tagacatg^ gccag cgtctcctcagactctgacgggctcctggaggttc 
aacctaatttcctttcaatgaaagagtgggtttccatggt 

>25.1 

ggcggccgcccgggcaggtacxscgggaggcacattcttttctacgtgaag 
agtttrgtaaactgaactttgttrrcagttccggctccagccatcctggg 
gt^cfrtgccaatagatgaatcccactcgtttgacccatgac^sctccttc 
tmcatttctccctctttc<xx^cagcagtgcatgtccaccataccacc 
tg agagtctgtggaatct aa7tttctgttatactt ctttccttacactcia 

TTTTCCTGTCTTTATTATGATAGTCT aactttttc 
>26.1 

GGCGGCCGAGGTACGGATACAATTCCGCTGAGTTAGATTCCAAATTCTAA 

cctctcc^tcacacgc«;cagaaaggacagtagc<^gcttctctggatgc 

tttgccaagcaattgactccatcacggtgaccatccagcgaagcaaggaa 

tggttttgcaaatactcgttccagtttggtagcattraaagctcttatat 

attctcgtgggacctcaaaaggatgtaaagcaggatcatagtttcttgga 

i^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^rccG 

GCTCAGCATCTTCACCTTCATCTCGGTTGCTCTTC 
>27.1 

CGGCGGCG«»^GTACGGATACAATTCCGCTGAGTTAGArrCCAAATT 



ATGCTTT GCGA AGCAATTGACTCCATCAC^jGTGACX^TGCAGCGAAGCAA 
GGAATGGTITTGCAAATACTCGTTCX^TTTGGTAGCATTTAAAGCTCIT 

A I ATATTCT ^ gg gacctcaaaaggatgtaaagcagg ATCATAGTTTCT 

TGGAACTCTCTGTAAGTCCAACTrGGTTTCGCG^ACATAATTGTCCGGAT 

tccggctcagcatcttcaccttt ATCTCGGTTGCTCTTC 

>28.1 

GCGGCCGAGGTACTCAGTrrCCTTATCTATAACATGGGGATAATATTAGT 
AGCT ACATCGTrGTTATGAGGATCAATATCTGTAAAGCTCTTAGAACATG 
CA 

>28.2 

. CTCAGAAATTAAGGCAAAAAGTCTTACTGACCATGTAAAGGAAATCCAAC 
AATTATAAACAGTCTCTGCCTTTAAGGAGCTTATAGTCTAGTTAAGAAAC 
CAGA 
>29.1 

CXaCGGTGGCGGGCGAGGTACTCA G I H C U I I ATCT ATAACATGGGGATAA 
TATTAGTAG CTACA TCGTTGTTATGAGGATCAATATCTGTAAAGCTCTTA 
GAACATGCATTTTTCTT CT ACT 

CTCAGAAATTAAGGCAAAAAGTCTT ACTGACCATGTAAAGGAAATCCAAC 

AATTATAAACAGTCTCTGCCTTTAAGGAGCT 

>30.1 

GCGGCCGAGGTACTCAGTrrCCTTATCTATAACATGGGGATAATATTAGT 

^ctacatcgttgttatgaggatcaatatctgtaaAgctcttagaac^ 
C I C £ G ^V AAC ^I C £ AA ^^ 
aattataaacagtctctgcctttaaggagcttat 

>31.1 

5^^^^?^5^^ tact ^ g ^ c ^ at ctataacatggggataa 
tattagtag ctaca tcgttgttatgaggatcaatatctgtaaagctctta 
gaacatgcatti I icttctact 

>31.2 

CTCAGAAATTAAGGCAAAAAGTCTT ACTGACCATGTAAAGGAAATCCAAC 
AATT ATAAACAGTCT CTGCCTTT AAGGAGCTT ATAGTCTAGTTAAGA 

S^^^ A 5?J^ TATG ^^ G ^ G <^ T CTAAGCAGGGACAATGG 
£ AG IL C A TATCATGAT ^ AC ^GATTCTCTGACCA^ 

tgtagccaggtctggaagacagagctgggttaaagctgggtgggagaagt 

^^r^^^^S^ ^-5^^*^ GTTTACATTOCTACGCX3 G«uAAAAGiGLA.TGTAACACGGG G C 
?^~]^^J < ^ G ^ AA ^ G ^ A ^ G ^^ GG ^ G ^ C AGGGAAGTGGCTGCCAA 

£??I9IL GTAGGAGAGTAATAAATGAC ^^ 

caagtgggaaggggagtgggctgtaaaatagtttaagagactctctcagg 

^Z^^ TAA ^ GATGTGTAGAAAGGTAACAGTCA ^ G TTCTCCT^ 

caagacagcttwaagcagcagctatagtggagcattc^gaggcctgct 

GVCAG^TjCAAAGC-ATGAATGiTG CA.G\ACTGi GTCCTCTTGVCCC^G CG5 TTTCTT 

J^^tg^gcacatgttatattttagaggcaagttcagttct 

>33.1 

cgcggtggcggccgaggtacgtatgcacttgcttgccatct^ 

CAATGGCAGTrCATATCATGATGTTACTTTGATTCTCTGACCAAACTG 
SI G I^^ CCCTGGGCC ^ mc GTCTGTCAAAGGC^ 

GAGAAGTGAAAAAGGTCAGGTTTACATTCCTACGCGGAAAAGGATGTAAG 

CTGCCAAACCT GTTGT AGGAGAGTAATAAATGACTTGAGAGTAAGCCTAA 
GC^AAACTCAAGTGGGAAGGGGACTGGGCTGTAAAATAGTT^ 

™i A 57^ c I AG ^ 

AAGGGCTAATATGAGTTTTGTCTCTTACAATTTATCTGCATOT 
CCCCCACCCCCC^C^ 

C ^°I G I GG ^ CTG <^TATTTGGAAAC^^ . 
CACTCCACTGGGA^^ACAGTA^^ 

ATCAGTTACAACCGCTGGGTGTGTATTGGTITCAAAAGGACCTACCTAC^ 
GGTTATATCAATCCATTGTCXiAATTTGAGAGATTTTrTCT 

£ A ^T AA ir™J GGC 2 

I^^CTAGTGCCCAAGAAAGCCAA^^ 
JCCAGATCCCCAACAGAC^^ 

>35.1 

CGCGGTGGCGGCCAGGTACGGATCAATTCCGCTGAGTTAGATTCCA 
2 A j^ C I CC *I CACA CGCC^^ 
ATGCTTTGCCAAGGMTTGACTCCATCACGGT^ 
GGAATGGTTTTGCAAATACTCGTTCCAGTTTGGTAGCAT^ 

^attctcgtgg^ 
^^Z^TZ^^^9^ AGGTATTTGGACAC ' 'CAATACATTTTTA 

TTTAGCACTTATTCTGTGTCTGCTGCCCTGGGA 

>37.1 

acx^tggc^ctgtggcttostgcgattcgcgtccgagctcagacgagcrr 
c ^5^ gacccct ^ Cac ^ g ^ g ^^ t gaggactatgattttgaag 

^1^3^P!^S£ GA ^ TGGGG ^ GATA ^ GCTC ^ C ^ GTA GCX3^TTATGAT 
GATCTTGAGCCTGAATGGCTGGACAGTGTGCAGAAAAATGGAGAGCTGTT 
TTATTTGGAATTGAGTGAGGATGAAGAAGAAAGCCTCCTTCCTGAGACAC 
CA^GTGAACCATGTCAGKSTTCAGTGAAAATGAGATTATCATTGAAGAT 

>38.1 

CCGAGGTACTT AAGI I II ICI I CAGTTACAGCTACCATGTGAAAATAATT 
CTCTGCTT ATCAAGTTTACAACTTTAGAATTTCCTGTTTTAAAGTTTTCT 
CATTTACTT ATCACACAGTCATCTT CTTTTTG CC AAACG CT AT AGTAGCA 
CATTAAAAGGAGACTGATGTGAAATCAACTCT GTGCAAAAAGTATTGGGT 

^^^9 G I AGAAGTCTATACAGAAGACACTGGAGAGA 

TGTCCAGGTGAGTTGA 
>39.1 

GGTGGCGGCCGCCCG GGCTGGTACGCGGG AAAGCAAAACGACAAGCACGC 

CCTGAGCAGAGCCCCGGGAATTCAACCTTTAAGTGG^ 

G GTTTGCCAA GGAAG CAQQGCATCAAACAGATGAAACAGCXrrATTGTCCA 

TTTCAACAGGATTTTTCAGGAGT^^ 

^ AA T3 AmAATA7mGATAGTCAA ™CCTAAGACACGGCATC^ 

ACTGACCAATCAGAAGAGATGCCAGTAGTTGGGC 

>40.1 

CTCCCCGCGGTGGOGGCCGAGGTAGAGTrTAGAAAACTGTGGGGCTGAGT 

^3^I? A ^^^^ A ^ AGA ^~'~^ G ^ GT ^^^^GTCTCCrrrAGT 
GATTAAGT CTGTGAAGTAGTCCTTTGCATAATCAAATCCATG C I 1 1 I CI I 
TGATGCCATTGCG^CAAACAGTGTAATTATAGAAGCGAGAATTCTTGATT 

AA JS9 A ^ GC ^^ CTrocCACCCAGGGGG ^ 

ATTGAGGT ATTTTCCAAAAAAG G CTGTTCT GT AGCCAGTGTTGTT AAGAT 

A T AGA 5^ CA j^ G Ii^^ 

CAGTTCTCGTTGTTGGTGTAGA,CATTGTGATTGTGCACATACTTCCCGGT 

e^TGGAGGACTCTG^ 

TTGATGAAAGTGGCCCCCCCATGTTCCATAAT 

>41.1 

GGGGGAAAAAACTCX^CTCAA^^ 

AA l GG ^I AA i AACA If^^ 

?I ^ AC T c ^^ G ^TI ACCAAAAAATGCCCAAGGG ^c^ 

AAGCAAAACGACAATCATCAGGAAAACATGCAAAAGCATAAAACT 
>42.1 

AGATGCTGAGCCGGAATCCGGACAATTATGTCCGCXI^ 

CCCACGAGAAT ATATAAGAGCTTTAAATGCT ACCAAACTGGAACGAG T AT 
TTGCAAAACCATTCC \ IGCI lUiCTGGATGGTCACOGTGATGaAmrAAT 
'TCCTTGGCAAAGCATOCAGAGAAGCTGGCTACTGTC^^ 
TGATGGAGAGGTT AGAATTTGGAATCTAACTCAGCGGAATTGTATOCGT 

ATTGGAGCTCCCCGCGGTGGCGGtSCC^^ 
AGATGCTGAGCCGGAATCCGGAC^ 
TT ACAGaGAGTTCCAAG AAACT ATGAT CCT gcttt acatocttttgaggt 
CCCACGAGAAT AT AT AAGAG CTTT AAATG CTACCAAACTGGAACGAGT AT 
TTGCAAAACCATTCCTTGCTTCGCTGGATGGTCACC GTGAT GGAGTCAAT 

TGATGGAGAGGTT AGAATTTGGAATCT AACTCAGCGGAATTGT ATCGGT 
>44.1 

AGGTTTTTACXTGATGATTGTGTAGGTTTCT(XrTAGCTCCAAAGTATCCG 
GCTCCTACGACTCT AAATATAACCTTCAAGGAAAGTGGAGCTGGTTTACT 
CTTTTCTGA 
>4&1 

TGTCAGTGCTATTrATAGTAATACAAAAATTTAATGTAATTTTTGTCAAA 
ATCTCAATGGTATATTTTTGCAGATTTTTCAAATTATATATATATGATTT 
ATAAATTATTGTTATAGATTCCTGGAAAGTTAATCCATCTCACCATTA 
>47.1 

CTAACCTCACATTTAATTGCGTTTGCGCTCACTGCCCCGCTTTTCCAGTC 
GGGGAAACCTTGTTCGTGCCAGC 

GCCGCCGGGGGGAGGAGGGCCGGGTTTTTGGCGGTATTGGGGGCGCCTTC 

TTTCCCGCTTTCCTTTCGCCTCACTTCiAACTTCGCCT 

>48.1 

GTGGOGGCCGCCCGGCCAGGTACAAGGACATGCT GGATGCCAAGCAGTTC 
aSCCCTACCGTCTCACTGCCCCTCAAGACTTCAA^^ 

AACATCAGACTACAGATTTAGGTGGAAGAGCAGCCATGTTTGAAGGGCAC ' 
ATGTGATGAGTGGGGGGCAGCAAGATGGCATTTGTGCATCrrCCCAGAAGG 
GATGAGTCTTTGTOCCGATG CAAGCCCCCT ATTCGTTGGG CT CCCAGCAG 
TGCTTACCTTCTAGAGCGTTCACTCA 1 1 1 1 G TTCTTTCCCCCCAA C 1 1 1 I 



>49.1 

GCGGCCGAGGTACAACTAATGGAGCTCAGAAGCTGTCAAGGATATAAGCA 
GTGCAACCCAAGACCT AAGAATCTTGATGTTGGAAATAAAGATGGAGGAA 

gctatgacctacacagaggacagttatgggatggatgggaaggttaatca 
gccccgtctcactgcagacatcaactggcaaggcct AGAGGAGCT ACACA 

GTGTGAATGAAAACATCTATGAGTAOCTGCCOGGGC 
>50.1 

GCGGCCGGAAGAGCAACCGAGATGAAG GTGAAGATG CTGAGCCGGAATCC 
GGACAATTATGTCCGCGAAACCAAGTTGGACTTACAGAGAGTTCCAAGAA 
ACTATGATCCTGCTTT ACATOCTTTTGAGGT CCCACGAGAAT ATATAAGA 
GCTTT AAATGCTACCAAACTGGAACGAGTATTTGCAAAACCATTCCTTGC 
TTOGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAG 
AGAAGCTGGCTACrrGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATT 
TGGAATCT AACTCAGCCGAATTGTATGOGT 
>51.1 

GCGGCCGAGGTACCrrCAGCATATATTGGAAGTGTTTTAGAGTTGGTGAGT 

TCCOCGTGCCTTOCAGAACTGAACGCTAGGAGGAGCAGOCAGTGAGGACA 

GACGTCTATGCAGAAACATGGGGAACCTCTGGAAATGACACACTCTCCGG 

GCACAGGGGGOCATTCGTCCATCTTGAGGTGGACTAATCATGGAGATTCT 

CGCAGGGCCGGCTGCTATCTCAGATTTTCTAATCGGAGAAGGAGAGAGAT 

CAACTTCCATCGACTCCAGTCTGTCGGGGGCTGATGAGTX2AGGTGGCAGC 

AGGCATXIXXiCGTGGATTTGTTGAAACTGGACTTTTT ATTGTGCTGAAAGC - 

TGCTrGTTGTGATGATCTCATACTTTGTAGTTGTTCT ATCTGCAGCACTG 

ACTTCCTAAGGGATTCTTCCAACCTAGAAA 

>52.1 

ATTCTG 



AAAATTCATGAGGCTGTGTmAGGTGAGGCTATTTCTTCATTCAC^ 
OGGGGCAOCC AACAG GCTCTTAATATGAAGACTTGGGCCCTrCCTGAGTT 
CTAGAAAAGCATTTTTACTAGTTCTTCAGTAAT! 
CTCTGTTCTCTrTTCCTCGGACTCCAATTGGATCTrGGGCCTCTAAGTAT 

AGGCAAGATCATGTTTCT AAAAAGGTTCTTAGAGGGAGGGAGTTCCTGGG 

AGTGTTATGTGGGGTGGTGCAGAAGGTGCTAACAGGTGGG7TTCTCTTTA 

GGATGAGCAGGTGGGATGCCAACTGTCAGGCTGGGACCTTTCCCTCCAGT 

GCTAAAATGAAAGTTTTATTCTGGTCCTTTGACATCCACACCAGAAGTCT 

TGACTTTC 

>53.1 

GTCX»CGAAACXAAGTTGGACTTACAGAGAGTTCCAGGAAACTATGATCC 

TGCTTTACATXXrrTTTGAGGTCCCACGAGAATATATAAGAGCTTT AAATG 

CT ACCAAACTG GAACGAGTATTTGCAAAACCATTCCTTGCTTCGCTGGAT 

GGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCT GGC 

TACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAA 

CTCAGCGGAAATGTATCCGT 

>54.1 

GCGGCCGAGGTACACTGGGAAAATGAAGAACTTAACTACATAAAAATAGA 

GGGACAGTCyVAAACTTCACAGGGGGGAAATCAAGTTAAATTCAGAGCTGG 

ATTTAGATGATGCCATTCT AGAGAAGTTTGCTTTCTCCAATGCTCT ATGC 

CTTTCTGTAAAACTGGCAATTTGGGAAGCATCACT GGATAAATTTATTGA . 

ATCTATTCAGTCAATTCCTGAGGCTTTAAAAGCTGGGAAGAAAGTGAAAC 

TATCTCATGAAGAAGTTATGCAGAAAATCGGTGAACrrcrrTTGCTCTAAGG 

CACCGT AT AAACTTGAGTTCAGACTTCCTGATT ACT CCTGATTTCTACTG 

GGACAGAGAAAACCTGGAAGGACTTTACGATAAAACGTGTCAATTTJCTTA 

GCATTGGCCGAAGAGTTAAGGTCATGAATGAAAAACTTAAGCACTGCATG 

GAACTAACAGATCTAATGCGGAATCACCTGAATGAGAAGAGGGCA 

CGGCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTTGAGCCGGAATCCG 

GACAATT ATGTCCGCGAAACCAAGTTGGACTT ACAGAGAGTTCCAAGAAA 

CTATGATCCTGCTTTACATCCTTTTGAGGTCCCACGAGAATATATAAGAG 

CTTTAAATGCTACCAAACTGGAA(X»AGTATTTGCAAAACCATTDCTTGCn" 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGA 

GAAGCTGGCTACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTT AGAATTT 

GGAATCTAACTCAGCGGAATTGTATCCGT 

>56.1 

CTCCACCGGGAGGCGGCCGCCCGGGCAGGT ACGCGGGCT ATTGTGATTCC 
CAGTGACCCATAGAACAGGATTTCACTAGTCCT ATGACATGTGACTGGGC 
TTGGGAAGTTCGGGTGTCAGGTCCAAAAATCCT AAGGTGGGATCTT CGCT 
TTGTGAAGGAAATTAATTACACAAOCAAATATTGCCAGATTCTTGAGGTC 
TATTGACACAATGGGAACTTCAACCCCTACTTAGCTT AGCATTTTTTTTT 
TCAAAGAGTGAAAAGTGGTCCACGTAGAGCACAATATAATTTAAGT AAAG 
GAAGATTAAAACATATTTTTATCCATTTCTTATGGTGGGAAATTAACATG 
TTTTAGATTTGAGGTCOCOCT C T CAGGAAA CCC 1 1 rC AACTTOGTATTAT 
TCACTCCTGAGT AGT ATGGGGTAGAAAATGAGTGGAAATCAGTTTGGCCA 
CTATTTCCGAGTCTTTTGCACTGCAATACTTTCATC^ 

TTCAGTCCTGTTTACAGATGvGGGATCACATCAGGCTC . . 

>59.1 

GGCGGCCGAGGGACGCGGGAAAGATCAGTTGTTTT ACCTTGGGATTCAAA 
GAG 1 1 MCI 1 1 GACTOCCATGGTTCTCAAAGCGTGATCCTGGTCCACCAC 
CATCAGCATGGGGGGGAACGTGTTAGCACTGCAAATTCTGATTCCTCCCT 
AATTTTCTGAATCAGAAATTACGGAGGT 

>59.2 

TAATTTATGCTTTGAGAACCGCTGATCTAGTTTGTCCCTCT CATTTTGCA 
GGCAAAGAATTGAATTCTAGAGAGGTTAATTGA 

>59.3 

CCCAGGATTCATAG M I I C 1 11 CTAATACTCCATATTTaACTTGACTTTT 
TTACAAGTTGTAATTACAAATAAGTCTAAGATGGGAAAGTTGTGGAAAAC 
TTTATA6AGAACATGAGATTTGACTGAACAGTAAACATTAAGTAGAGAGG 

AAAGAAAGGGGTGTTCT AAGC 

>60.1 

GCCGGATCCGGACAATATGTCCGCGAAACCAAGTTGGACTTACAGAGAGT 
TCCAAGAAACTATGGGGGTGCTTT ACATCCTTTTGAGGTCCCACGAGAAT 
ATATAAGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGOKAAACCA 
TTCCTTGCTTCGCTGGATGGTC ACCG TGA TGGAG TCAATTGCTTGGCAAA 
GCATCCAGAGAAGCTGGCT ACTGTCCTTT CTGGGGCG TG TGATGGAGAGG 
TT AGAATTTGGAATCT AACTCAG CGGAATTGTATCCGT 
>61.1 

CGCGGTGGCGGCCGAGGTACACGTTACTGTTCCGTCGTATTTTGTAGTCT 

CTGTTCTGC CCTTT GGAACATCTCTTCGGTGTTCCTGTGGGATCTCTCTA 

CTGCATTCTACTTTATGTAATAATCTGTTC^ATAAATAATTTTTAAAAGG 

AGACAACAACGCCGCAGGTGATCTGGAGGCT cct ggaggacctcagcgac 

tcaggtccagtccaaggagggccggagatcaggctgaaggatggatccac 

atgttt agaggagatcgagaaatgcagaagagagatgcagcagagaaatg 

ccacagaaaggggagctggagagaatcaaagcatgagaggaattcaacct 

gctgtcact ggaaggggtccagatggaacgcttgagaagaaacgtgtgta 

gcatctaggagtaaagactcgccctggctgacagctagtaaggaaatggg 

aacctcagtgctgcagcctcaaagaattgactttaacccacagcct 

gcacttagaagoggatgcattcac 

>62.1 

aaaga^tttgagcaggacgaccctc^^ 
cctgtgcagtaacaacgtaggctcggaggatgggt 

>63.1 

tgagtgagcctaactcacatttaatttgcgtttggcgcctcactgcc^ 
ttttccagttc 

aggagggogggttttgccgtatttggggcggctcntt occgccttccttc 
ggccttcaacttgactt cggcttgc - 

>65.1 

VJli 1 1 1 1 " > ' » 1 1 1 1 GGAGGAGATGGACAGTGTCAGTCTCCTGATAA 
GGGGGTGATGGGTAGGTAATTTAAAAGCTTCT ATTATAAAATCTAGTCTC 
TCTGACACTGCCCTGTCCACTGCAGTCACATCTCCCAATACTGAAGGATC 
CTGAGAATACGAGCGGGCATGACACTT ACTCACGTCATTCACCAT 
>66.1 

JATCTTCOCATCCCATATTTAGGTTAGATAGAGAAGTG^ " 

TGTOl gtgtgttgctcgcacagtgatgaactgtaaacataaatgaagata 
^sgaaaaatacatcaattaggacaacatgacaatttcattagactcctat 
caaagagt atcagt tcacagtttttatagatactagtataaaattcagat 

^^^^^JTCTtSGGGATAMGCMGGCTTf ACAATTT AGCAGTCTGT 

^cctagcttgaaacagt aaaacaacaacagcagagcctt aagt gt atttt 
tgtgacct aaaacatgaactcagggtttccaaattt^ aacaatgaatag 

>67.1 

^ttggagctcoccgcggtggoggcogaggtacrreaaggataagaaatt 
actgt gtcaaattacccacaagtraaatgcccatgttccagacctgtggc 
tcttaot atcaggcttgtgatagagaaaaggctgctatgaattctactca 
gtgtgcttagaocaaaggaaaccaccacagggatttcacaggc 

IvXXX^GOGGTGGCGGCTCGGGTACTrGACAGGATAAGAAATTACTGTGTC 
- ^I1*CCCACAAGTTCTATGCCCATGTTCCAGAC^ . 
^^^^^^^^^^^^^^^^GCTATGAATTCrfACTCAG 1 G I GC I 1 

agaocaaaggaaaccaccacagggatttcacaggc 
>69.1 

ttggagctcx^cgcggtggcggccgggtcccatttcatcttgcacccgca 
t accagggattgttg cgaagaatcagttgtgttatattgtccaaatcatc 
aaagataccctgaggtaaattacttaggttattattggacatatccagtc 
gatagagctgccttagataagaaaaagcatttgggggcacccgattgatg 
tggttatcttgaagataaagcttcctcaggtttgtgcctggaaggtttac 

ctaggttgaagaaaactitgtcacctaaaccatgat^^ 

ccatctagaaccaggcgttttagactagtgagaccttgaagagatggtga 

tgaaatagtggatatgcgattatcatccaagcgtagttcttctatagtcc 

tgggcaaacxxx^gggaattgtgctaaggtgattacgggacaggaaaagc 

agtcggagatagttgctgtctcggaatgctccctcttctatgct aactgc 

agagacagagttgtcatctaaatgtaattcttccagatagggaatttttg 

aaagtgaatcataagtgatagtccttatgttattttc 

>70.1 

ttggagctccccgcggtggcggccgaggta ci > IG AATAAAAGGCTTTGQ 

TTTCTCTGATGTCnTCCAATCAATCACACAGAGCTTOCCCTGATACTCAG 
CCAC ACAGTCCAGCAGACCT ATATAGTTTAAGGTTTCATGTTGAACAGGA 
CTTTCAAGAGCTCGCACTCCACTGACATCTTTCAGAATATGCTGGACACT 
TTCAATGT AACCAGACTTGAGGAGATTTTCATCTCTCTCTTTT AAGGTTT 
CCTGGGGTGAAAGTATGCTTTCCAAGGCTTCGTGGAACCGTTT O CC 1 1 G I 
AAAAAGACGTTTGAAGTGTATrCTTTAAAGCCATCTTCTOCCAGTTCCAG 
AATCATCCGCTG TTTCCACCTCT CCAACAAGAAAACCTGTTGTTTTGTCA 
TGGTCTGCTGAAGGACTCGGGTCACACTTGGTATCACATTCXrrTTGCAAG 
GGGATTTTCAAAGGAACTGAAGGATCACTTGCATTTGGTTTATCAC 
>71.1 

GATTGGAGCTCCCCGCGGTGGGGGCCGAGGTACTTGAAGGATAAGAAATT 
ACTTGTGTCAAATTACCCACAAGTTAAATGCCCATGTTCCAGACCTGTGGC 
TCTTAGTATGAGGCTTGTGATAGAGAAAAGGCTGCTATGAATTCTACTCA . 
GTGTGCTTAGACCAAAGGAAACCACCACAGGGATTTCACAGGC 

>72.1 

ACATATATCATTTATTCAAGAGGCAGATTTTAAACGTTTTTGTAAAAAGC 
TAAATAACACCCAGAGTGACTCAAAAAATTrCTCAACTTTGCC 

>73.1 

CTGGCTTTAAAAGCT CAT AACAGTAGAGATCAGTTGTCTCTGGTTGCAAA 
TCTAACATATATTCATGCAA 

>74.1 

ACCTTGTGAGAAGAGGAAGAAGGTGATAAGAACTAAGATCAGAGCATAGT 
AGAGAAA GTAG CCCTGTAAACAGAGGAGAAGCAGAAAGAGAGAAGGGAGG 
ACAG^AGCTTTTATTTTGCTCCAGGTT AAAAAGAAAAAAAAAGCACATT AC 
AACTCT ATGTCAG I G I C I G I CCCAGGTCCTAGAACTGGAATAGaCCAACC 
AAGCCCAACCCTTCTT AAAAGTAAGACT AGG I G O I I O C I GATTAT AT ATT 
CAACTGCCTGGAAGCATGCAAGTAAAATTTCCTTGATGGCATTTCT 
TTCAAACA T ATT CTTCCT AAAAATGCATTT ACAAAAAAT ATT AAGATTGT 
GTTTTn 1GGTTTGGACTTTAAAAAAAA1 TGI 1 1 1 C AAAACCATAATTGG 
GGCCTACCCCAAAATGGAT 
>75.1 

TGGCGGCOGAGGTGOGCGGGGAGGCGTTGTGGGAGGAGGTGCGGGGAGAG 
AGGAAGGGGCCrGTGCACTGAGGAGGGATGAAACATTAGTGGATGGCCTT 
GCGTCTCAATCTGCAGTAAAGAGGAAACTAATCTGAAAGGGAACGATAGG 
ACTGTGTGTC1 1 1 1 1 

AGGCTGA - 

>76.1 

GAAGT AACAGACACACGATGTTGTGAAGGATGGTTTTGGGACT AGAGGCT 
CTGAGGCTGTAACTGAGAGAAAGATTCTGGGGCTGTCTTATGAAAATATA 

>77.1 

CACATTAGGCTGGTTGGTTCAAACTTTTGGG 

CAGG ACG CGGGGAGA CAG CAGAAGGATCACT GGGCTGG AAGCTCT AACAG 
GCATTGCC AGCCTAGCTACCTGCAGTTTGAGGCAAGG GCAGGGTCACTTA 
SS^?J5^T5 T HL^ G ^^ < ^ GAGG CTGCACTCACTGGC 
o^Z^^^^^ 0 ^ 10 ^^ 0 ^ 0 ^^ CTCTGGCAGGTATG 

G iJ AGC 5I GG r ^ ACC ^ 

° AC ^TGTTTC^GGAC*^ 

CCGAAGTGGAACCACTGGACCGGAAGCTCTAGCAAGTGTTGCCCACC7 

T^CCmA^GTGAGGGGTTAATTGCGCCGCTTGGGCCGTAATCATGGTCA 
>78.1 

>79.1 

G |^ G ^ A ™ CTGTGCGAGAQGTAA ^^ 

>81.1 

G ^I G j? CGGCCGAGGTACAGGGA ^^ 

ggtgggcagatatgcaqagggacttggggctttgccaaagggtaagcac^ 
agggajgctactttgcaggaacctagcataactttgtgtgacgagactgca 

CCTCT GTATGAGTGTGCACCCAGCTGAAGAGAAGAAATGGAGAGCAGCAA 

GCCCAAAGCAAAAGCTAGAGAAGCAAGTGGAGAAATGCTGGAG/^^AGCTG 

GA «^ACAGGCAACC^<>CTTTAAAAA 

AAG6AAAAAAG 

>82.1 

TGGCGGCCGAGGTACGCGGGGGAGTCAGTCTCAGTCAGGACACAGCATGG 

>ea.i 

GTGGCGGCCGAGGTTCCTrGTTGCAGCTCTTT ATTTCTTAGTCCCACTCC 

CCCGAGGTAAGACAT^CTGCTTTTTTAGCTGT^ 

CACCTTTCTAAl 1 1 1 1 GATTCAATCACTTAACCACCGTTACATACTACAA 

J^S A £ G J^I C ^ AAG ^J^^^ 

TTAAAATTTTCTT AGT ATTCTATGT AACTATTACtXlATXTTTTTCCCCACA 
>85.1 

ATTCT ATCCTTAGGTTCTGGCGTGCAGTG AGCGAGG CAAGG ATGGCATTC 
aagatttcattcctttgttccac 

>86.1 

cgcggtggcggccgaggtacatocctgtttatcccattccatccaccgag 
gcccaacagcatggatgatctgtttgcagggaagcctccctgctcccgtg 
acagct atctcaccagctgacactttaccatatctggcaacaaactgttt 
gctctci ici i ggatttcaaatccaccagctttt accagggccagggcca 

GGCCTCCCCCATG CAGAAGATCTTCATTGGCTGCATTCACCAC AGCATCA 
ACAGCATG TGTGG TGAGGTCATCTTTCCACACTGATAACTCTATCCT AGG 
AGTCAGCAI 1 1 1 I CTGAACACTTGCAGAGA7TTGCTGTTGCCTTCCTGAA 
CTGGAGAGACCAG GGT AGAGAT ACAG CCAAACTTATTCT GGAGG ACTT CA 
CACAGCTGACGCTCATTATTGT7TAAAATTTTGAAGTCATTGTGGTTAAT 
GG GAAATTTGCCAACTATAGTTTTCT CCAAG 
>87.1 

CGCGGTGGCGGCCGAGGTACTCTrCAAAATTGTCAAGGTCATGAAAGACA 
GCAAAAAGTGAAGAATTCTTACAAACTAGAGGAGACAAAGATTGGAG 

>87.2 

AGTTGGAAG ACCG AGCCCT GTACOCAACGTGGAAG AAACCTCCCATTCCT 

CTACTT AAAAATA 

>88.1 

AAAACCGTAAAAAAGGCCGCCGTTGCTTGGCGTTTTTTCCATTAGGGCTC 
CGCCCCCCTTGACCGAGCCATCACCAAAAAAATTCGACG CTCAAGGTCAA 
GAAGGGTTGGGCGGAAAACCCCCGACCAGGGAAC 



ACCGCTCAGCCTGCTTGGTTGCATCCTCCGGATGGCGAGTCAGCTCTGAG 

ATCTGAAGGTCAGCATGCTTACG CTCGGCCTCACATGTGTCAAAGTG ATT 

CTGGATCTCCTTAAGTCGATCCAACATCTGCAGTTGCT GTTTTTCCCCAT 

TCTCCAGTTCACGTGTTAAATTCTCTACTTGTGATGCCAAATGTGCTTTC 

TTCTTGTLI 1 1 ICI 1 1 CXATGCACCGTTTCACTTCCTCTAACT CAAATGC 

CATTGCGCTGAAGTTCAGCTGCACTCTCAAAACTGACATTTGCT 

>90.1 

TTGGAGCTCCACGCGGTGGCGGCCGAGGTACGCGGGATCACAAAGCAGAC 

AAACAGGAAAGACTGAACCATCTATTTGAAAAAAGTGACTTCATTCAATT 

GGTTCAGCCACCCG T ATCTG T AATCTCTCCATTCTG CCCTCTTGATTTT A 

ATGCAGCT ATAAAGGAGAGT ATTTT AAAAGTGCCTCCCAGTAGGAAGAAC 

AGTCACAAGGCACTGTTATATCAATTCAGTGTGACACAAGCCCTGATTAT 

TTAATAGTAT AACAGCAGTGAATCAGAGTTCTTTCATCTGACTTTGCTGA 

GATTTCCAGCAGCTGTATATTTAATTCACAGTTAGGGGCTGAACAAACTA 

CAGCCATTGATCAGAATGTAAGCAGGCATGCTTGAGCTTCTTCTAGGAAC 

ATATACAGATGTGCACAAAATTTTCATTTATTCA 

>91.1 

TTGGAGCTCCGCGCGGTGGCGGCCGAGGTACGCGGGATCACAAAGCAGAC 

aaag^ggaaagactgaacx^tctatttgaaaaaagtgacttcatt 
ggttcagccacccgtatctgtaatctctccattctgccctcttgatttta 
atgcagct ataaaggagagtattttaaaagtgcxjtco^gtaggaagaac 
agtcacaaggcactgttatatcaattcagtgtgacacaagccctgattat 

TT AATAGT AT AACAG CAGTGAATCAGAGTTCTTTGATCTGACTTTGCTGA 

H A II35^ <>GCTGTATA ^ AA ^ CACA ^ AG< ^ 

CAGCCATTGAJCAGAATGTAAGCAGGCATCC^ 

AAATACAGATGT 

>9*1 ... 

ACACAGCTCCATGAGGTCACCAAGCATCCCATCAGCC 

catggcaatggctgccaggc>atggcacatcaXIlatcc^ 

GAGCACTGTGCAATTGAGTCAACAAGGTCTGAACTACTGACT GGCT AAGA 
T G ^ C ^ GCG 3?]^ CCAAG ™^ 

A ^ T GGCCTGAGCACCTGGACTATCATCTTGACTACAAAGT 

ACCTGTATGATAACATTGC^TCAAACATATCTTGTGACAGGAC^TTTT 
T! G ^ GGG ^ GGAGAA ^ AGAGG A^ 

CTGTGTGAGCTGTGCGCTCCCGACTGGGAAATGTCTAACTCCATCXSAAAA 

AC J G Z G J G J AAC 9I mGCATAmTCA ^^ 
^ C ^TT GTGGGCTCATCCTGA ^ AGAAAAT ^ AAA 

J^ G ^ AA ^ G ^ GA ^^TATAAGGCATGCATGAATAATAATTTTTGCTTCTT 

aaatgtcgccaaactgccgtcttccctcctcggccgctgcgacaaacacc 

CCACAAAATGGCGGCAGCGCCGTCGCCCTAGA^ 
>97.1 

I G ^ GC I C ^^ CGGTGQCGGCCGAGGTAG CTTCCC^ 

caaggagggaaagagaatgcagagaagaggctggtgatagacaagtttca 
aaagaaatcgagaattggg 

>98.1 

agctccccg^gtggcggccgaggtaccagcagagatggcttcaagatga 

^ AGGAC 77 GGG TCAGTAGCACTTACT^ 

JS^HJ^ IM -"C^" 1 1 1 FATTCTCTGGC^TTTCTCCTATATAACTAG 

? C ^ C 2TJI* AAC ^ TA ^ 

A JT^ G ggCCTGAGTCCTTACCTAG A I » N CI 1 C ICI I CI I A CTCCTGG 
A ^ G ^ GGAAAGAGA TCTCCACGTTAGGCTTCTATTTGT AGCATCAGACCCA 

gag1 ^9j^ gtg ^ ggag ^ agataggctgag "1"t GGATGCTCTGCAGGCCT A 

TATTAGAi 1 1 1 1 ATCTGATGTTAATATTTCT ATGTGGGATGGAGGACTTC 
>99.1 

GGAAGAGGGCCGGTTTTGCGGTAATTGGGGCGCCTCTTTCCGCT^ 
CGCTTCACTGGACTCCGCCTTGCGGCTTCGGGT . 
>100.1 

J£ G ^I A ™JS CAGGG ^ 

CCATTGAATTATTGTTTTTAAAAGTTGGGATTCTCrrACATAGAACATATT 

T! C 7 A G ^ A X^ CAAGAATA ^ 

AAG^ATCCAATGTGTCTCAATCTATTCCATAATATAATCAATGATAAA 

GATTCACATGTATCACCAAATTCGAGGCAGCTTAGTTGAAAAAATTT^ 

ACAGC JJ AC ^ AA JI C J^ 

^^l^^^C>T^GGAGTTGGGAG(SCGGCCTGCT GGGGTTCCCTGGGTGGCA 

>iot V^^^*^ 1 ^ ^^ t ^^^^^^ GGAAGGG ^ GGATGG1 ^^^^^ 

GCGGCCGAGOCCAATTCTTGAl T ICI 1 1 CCATCCCAAACTCTTTAAACTC 

CTCTGAAGATACTGCTCTTCACCCCTCTGAAGGGGGTCTCCTCAG 
>102.1 

ACCATAATMTGCAATTAACAAAATCCAGGATTTAAGG^ 

™ AAAA ^ TGAGG ^ G ^^^ 

CAAAACTGGTGAGAGGAAATGACCCCCTGATGAAAGA^^ 
GC 7 G ^ GAT X™ G ^^ 

tgatgaagagtcactgaagtgtcacaaagaaaacaagatttgagaaagat 

I C ^ G ^ AC 3 CG Z GCATAGGAATGAACT 

agaactaggccatgagggcctagtatccagaatgaggcagagggagggac 
gctggatgtgagca 

>103.1 

gagctccccgcggtggcggccgaggtactc c i M CI igtttaaagcctca 

cc^gaccaggaagtcttgatagagccatctagtaattcttaagtccta 

cctcatccaaccttgttttgactcctgcagtgagcacagcttgccctcac 

cc J CCCCTC I CTATGCCCTCACCmGGAGGAG ^ 

CCAC ATCAGCTCT cagaccaccaaag caagggtta i i r r t 

>104.1 

agctccccgggtggcggccgggacacgtaacagggtggttgcatgcattc 
ctcaagtct gt atgactctaccaagatactgtgaagttgtccttctgatt 
gcacatggggagaaaatgctgaaactagtggccacagatgtctttaattc 

CAAAA 

>105.1 

ACT n C J AG ^ TATATCATGTGCCGTAATGTG CTCCTAA^ 

^ ag 2I!T^^ gaaaag ^ A7 ^ g ^ Gaaagggagga ^ a ^^ gg ^aagtgcat 
AGGC ™^ AATCATAC ^ GT ^ G ^ 
AA T AA TC TT GTGCAAAACCTGAGCTGA ^^ 
caatactttctgtgataatgggtgcaaaacacaaggtatactggt^ 

j^ C J G ^ GG ^ G ^" n TTCTTCTTAGTTTC^ 
>106.1 

G Z G ^5 GTAGTOGGGAGGGATGAGGGG ^ GGGGCTG ^^GCTGAG7TGTGTC 

aggtgagagattgtgagaacttggcttgcagggtttgggcatcagctgcc 
cattgaggggccgttcattgtctcaaagtgaatgtggggtggtttgatct 

GCATGTGTCATTrGTATCCACACAAGTTAATTATTCTGCTTTTGTTGTAG 
I AC ^ G ^II QTGAAGCAGAAGCTACCAGGCG ^ 

A I CGC J C J^33 AAGTAAGATGAGGA ^ CACTC ^^ 
A X^ A r G ZJI c J7 CCACACAAA ^ AGGC ^ 

ACAGTGGGGGTTTGGAGTAATATTCATACGGCATGGACTTT accaagatg 

GGGTATTTAAGTTTACAGTT 

>107.1 

J^tgcagag^ 

cac i ga z g ^i cg j ggaag i tgatggra 
I A Z c ?J GC ^ c ^ GACCimcTA ^ 

GGTAGGTGAATTTTCCAAGTGTTCTTGGAAATAAGGAAACATCAAGAATA 
ATGTAAAAGCCT CATAT ACAAT AATGAATAATAAAGAAT AATGTGAAGGC 
TTCATTCAAGGTTGGGGTTTGCX^GATACATTGCAACAAAATGACAGAGC 

AGGGAAGG T A T r ^ aggatagtggccaaaggattgtaatgatggcttatgg 

aagtgtcagctggataaagagtgaaaatgaataaaaactaatg 

>108.1 

atggattct acatcaggtgtctgtgcctcgctgctgaaggataacccaga 
gtgcaaggtcatctttgttgctgaacagggctggacctgtcgcacttaag 
CACA CTT aaagga ttctattcttcattcaggtcccccagagaaattggct 

CCTTAI 1 1 MCI I lACCT ATTOCTAGA l l I ICI I I 1 G TCTAGAGCCAGTT 
TTGCAAAGGGCACI 1 1 1 ATCCATCT CAGTT ATTCCCAGAGGTGACAGAAT 
G^TAAACCATATGGG^CAAATAGCATATATGAGCTAAACCAGTTAACTG 
TTAACCAAGGCACATGGTCAATGCCTTAQT A J I M 1 I I MMI 
>109.1 

CCAC ^T CCG * SA £ AC T^ 

AAGGAAAGGAGAGAGATGCXXrrTTGAGATTAATGAAAATGCTCTCAGGCA ■ 

aataaaatctaaaaatagcctccttgtgatacgaacgcgtggcccctaag 
GGTCCT AAAGAGAGAGCTAGGGGAGGTTCAGCTGGCCACAGAG ATGCTAA 

^GTCAGG>GCAGACTTTTAGGGTTTGCTGTTTTATAGGTTTAAAGACCA 

G GT CTGTGTTTTGAT AACTGAACTTGCTAATAGCTGGCCACTTGAGTTGC 

TTCTTCCAGCTCTTTGTTTGTTTTAAATAAAGAGATTCAGCGAGTAATAA 

TGGGAAGAGCTGCAAATGACTTCCCCAGTTGGGAGTGCCTGCTTGTTTTT 

CCTTCTGCCTGGGCATGCTGATGTGC 

>110.1 

gtgctgcctgcactgtgactaagactttctggactatcatcatgtttagg 
agttgatgagattatagtttcatgtaagtgtatgattagatgacaactct 
acatctttaggcatggaaacaaacatttttcctggaagaaaaaaaagtga 
acatccaacc tcca tttaaacaaatttgattgtttctttgct attaagaa 

actcggtg ctcttt ct GCCACT CTATTATATTGTCAAAATACATCTGG AG 
ACACTATATAAACTTmCTCCTTTAAATTACCTGGTTTATATATTATCT 

CCT gtagcctg catatagat aaaggtt AAACATAGAG g atttaggttgtt - 

GGTAATTTAATAAATA 

>111.1 

gtcgcgggattggaccgacgcagccatggtaggtccagatcccgtagaag 
ggagcx^ggtcccataggttacggccgattcctggagcttctggactgag 
ggccgcggtaagcagtggtctgggctcccgc 

>112.1 

GTGGCCGAGCGGTTTGCATCGCCAGCTCGCGCAAGGOCATGAGGTTGGTC 
TGGGTG AAGAACG CATCGATGGCGGCACG GGGCT GTTCCGG CACG TAGAC 
CTTGCCGTCACGCAGACGCTCCAGCAATTCGCGCGATGGCAGGTCGATCA 
GCAGCAGCTCATCGGCTTCCTGCAAGACCCAGTCAGGCAAGGTCTCGCGC 
ACTTGCACGCOMTGATGCCGCGCACCTGGTCGTTGAGGCTTTCCAGATG 
CTGGACGTTGACTGTG GTGAA TACGTTG^TGOCGGCAGAGAGCAATTCCT 
GAATGTCTTGCCAGCGCTTTT(X3TG»GCGGCTGOCGGGGGCGTTGCTGTGG 
GCCAGTTCCTCCACCAGCACCAGGTTGGGCTTGGCGGCGAGCAGGCC . 
>113.1 

GGGCGCGGCCAGCCG ACTGGACCCCTTAGCCT CGAGGCC7TTGCTGAAGC 

TCATGTGAGGGGtJCGAGTGCCCCTGACAGGTGTTGGATTCCAGcfGCTGT 

^CC^GAAGGTGGGTGGTGGGAAGAACGGGAGAATGAAGCXJWiCCTTGG 

GAGAGGTAGGACGCCA6CCCGGCCCAGCTGCTTCCAGCATCTGGATOCAG 

CCTCACCT GAAGCCAGCCACCTTCTGGACTGCAAAGTCATTGTCAACACC 

GAAACACAGGGTTTCTGACCATTGCAAOCCAGGGTCCCGGCGTGTC^^GG 

CTGCAGACCCTGCAGACOCCT ATGAAGATGGTCCTGCCTGCCTTGCATCG 

GGCCTCTAGCTAGGGACTGTGGTTGCA 

>114.1 

TGAAAAGGAAAGAAAAATTTGGAATGCACATCTCTGTCCAGCTCAATTry; 

TGACTOCTTTTTTAAGATGGAGAGCTGTTAGGTTTGTCTAC^ 

AAGACCTGATTAAATAACAGCATGGAGCC^ATCT^ 

>115.1 

gcccocagggcccagctactcgaagaacagccaatggattggaacgtcct 

aggacagatgccacggctttgacccaggctggg^ 

ggggttagttggtcggagggggaagcctxiatgggtccaccaggatgaggt 

GTTTAACT CT ATCAGGGT 
>116.1 

CGGTGGCGG^X^AGCGAATTGGTGACGACGCTGATCTTGACGTTGCGCCCG 
CGGATCTTJGCGCATCACCTCCAGCCCCGT^^ 

TGJ^CGCACGCTGTCGACATCCAGCAGC^ 

GTCTT GCCGATCACGCGGTCAGGCGAATCGGCATACGCCTCGGCGGTGGT 
ccagatcaggccgagcttgccggcgtttgaggtcttcgaccatcgggctg 
tagccgag 

>117.1 

tgagctcaccgcggtggcggccgaggtactctaatggagccactcaggac 

tgtcttaa aaaga caaaaatacctcctacagttgttatcatcaacgtcag 

ttgctgggttttcctaaatttgtcttctacctcagatctaaaccatttga 

taa catt agggcaatatcatggcaatcgtggcccagtaaaaccatagcaa 

atgttttctcxx;taggacactatctgttttcacaggaaaatttttctcat 

agaaaaactgtaggaaaagccatggatgagctgagaagaccaaacctatc 

tcttggaaaacaacagtagggagcgtggattagaatgtcttgggtgcgtg 

aaacaggcagacaatcctgaaacatcttttctggggacgtaaggcatgaa 

aaatttctatacacttaggagggcttctaggaaacaggaaacgaca 

>118.1 

GTGGCGGCCGAGGT ACGOG GGGAACCGAG GCAGCAGCGGACGTGAGCGAT 
AATGGCGGATATGGAGGATCTCTTCGGGAGCGACGCCGACAGCGAAGCTG 
AG CG T AAAGATTCTG ATTCTG GATCTGACTCAGATTCTGATCAAGAGAAT 
G CTGCCTCTGGCAGT AATGCCTCTGGAAGTGAAAGTGATCAGGATGAAAG 
AGGTGATTCAGGACAACCAAGTAATAAGGAACTGTTTGGAGATGACAGTG 
AGGACGAGGGAGCTTCACATGATAGTGGTAGTGATAATCACTCTGAAAGA 
TCAGACAATAGATCAGAAGCTTCTGAGCGTTCTGAOCATGAGGACAATGA 
CCCCTCAAGATGTTAGATCAGCAGAGTGGGATXAGAAGCCCCTAATGATG 

atgaaga 

>119.1 



cagtgaaagggtctaatataaaacacaccgaggctgaaatagocgctgct 
tg tgag accttcctcaagctcaatgactacctgcagatagaaaccatcga 
ggctttggaagaacttgctgcaaagagaaggctaatgaggtgctgtgcca 
ttgtgtatgtctgcagatttccccagggttgggatgggttcatcctacaa 

CGGACAAGATGAAGTG GACATTAAGAGCAGAGCAGCATACAACGTAACTT 
TGCTGAATTTCATG GATCCTCAGAAAATG CCATACCTGAAAGAGGAACCT 
TATTTTGGCATGGGGAAAATGGCAGTGAGCTGGCATCATGATGAAAATCT 
GGTGGACAGGTCAGCGGTGGCAGTGT 
>120.1 

CGCGGTGGGGGCCGAGGTAGCGAGGTACCAGGCTGTGGAATGAGAGCGGG 
AGC1 1 1 1 ICGTGCTAAGATGCCGTTACGGAAACATCGCTGTCGTTTCAAG 
AGCTATGGGCATT G 1 1 I C ACA 
>121.1 

CCGCGGTGGCGGCCGAGGTACAAGTTTATGTTTTCCTTGGTGT AAGGCTT 
TAAGAGTTCCCACCTTTCAGCTGCCTGGGCATTGATTGCTCACCT ACCAC 
T ATGACTAGAT ATGATTCCATGTGCTTTTGACTAGATTCTTTGTCTCTT G 
TGTATGGAAAGTGAGACTTTAAGTAATAGTTACTGCTGAGAGAAATAGAA 
GACOTGA CAACG TTTGCTTTCCCATTCAGT AGTCAGOGQTTGAATGQAAT 
T ATCTTCGTTTTTGGACTGACAGA I ITGTTT1 ACAATTCAGCTATTCCCA 
AGCCTTACT ATTCAAAGCAGAACOCTTCTGT C 1 1 CI II CTGTAGTTGCTC 
TCTCTCCCTATATTCTGTTGTA 1 1 II II I CAAAT AACTT ATTACT ATGTC 
AAGTAAAA! IGI 1 1 1 ATGTTTTGTTTTT ATCT ACCCTCTT AATCAGGG CA 
GGGATATGTCTGTTGT ATATTTT ACTTTTCCCAAATCAT AAAG ITTTTS Q 
GAATCTGCTG 
>122.1 

ACCGCGGTGGCGGCCGAGGT AGACACTGGATCT CCTT ACT CATTTTTAAC 
CCTGACTGGGACACCAGAGACATGCTGCATCTTGTATTAGGTGTTTCATC 
TTGCAGAATGGCTGTGCTCCTGAAATATTTCCTGTGAAGAAAATTGTT AC 
AATC CCAT TACATCACTGGCTTTTATTATTAAATTGAATG7TGGCTGGAA 
ACAATTTTAACCCCAAATTGTGAX^AAACAAAACTATATGGAAAAGGT^ 
>123.1 

cgggtggcggccgcccgggcaggtacgcgggtgtgcaactgcaaaccagt 
aacctgctatggc 

>123.2 

agactccaaacagtaaggtcagaatttatcaagacattacataggagtaa 
gggcacagccaggggtggtgggg 

>123.3 

GGAAGGA(^TTTTCG^GCACTAATTAACAGGTTTTATGATTCACTAGGTT 
GGCCCAACT ACTGTTCTCACCTAATTCCCAGGCCAGCGTGTCAGGAGGCC 
AAATGACAC 
>124.1 

CTCX^CCGCGGTGGCGGCCGAGAAATGTCGCCAAACTGCCGTG 1 I C CC I C 

TAGAATCCCGCGAGTCGCCTCTCCCCGCGT 
>125.1 

ACAGACTTT CATTCAACAAATATTTATGCATCAGCTACATGCCAGGATCT 

GTAATAGATTCTGGGTGTGCAGTAGTGATTACTGCAGAATGCAGACATGG 

TCCCTGCATTCTTGAGAGGGAGACAGCAACCAAATAAACAATTACAAAA^ 

AGT ATGTAACT AATTAACAAGTGGGAGAAGGGAGTGGGATT ACACAGCAG 

AAGT6GAAGGAAGGGCCCACTTAGAGTGGTCAAAGGCTTCTTGAAGGTAA 

CATGT AAGCT GAGACCTGAAGAAGGATGCAAAAGGGCCAGCATG T AAGGA 

ACAGAGAAT AAACATCCCAGAAAT AGAAAAT AACACACAAAAACCT AAAG 

7CATTAAAGAAGATGATCATCTTTCAAGAACT AAOCCTTGAGATCAGAGT 

AGTTTGATTATAGAGGAAAGGGGTGAGTGCAATGAAACGTTAAAAATAGC 

CAGATCACGT AGAGCTCTCTA 

>126.1 

^*CTCCCCGC<MCCGGAAGAGCAAOCGAGATGAAGGTGAAG^T6CTGAGC 
CGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACTTACAGAGAGT 
TOCAAGAAA CTAT GATCCTG CTTT AGATCCTTTTGAGGTCCCACGAGAAT 
ATATAAGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGCAAAACCA 
TTCCTT GCTTCXiCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCA 
>127.1 

GGTACTGAAAGTGAGGTGAAAAAACAAGAAAGCTGAGAGAAATCAACATG 
TTCCCAAGTGCTGTATGTGAACAAT AAATCTGAGAGAT ACCT CT AAGGCT 

tttgcagagacaagaaagctctcaaoctgtaaagaattcctgggagatga 
ct gagagcaatgagaactccaggcagaaggttagcagatatagtgtagag 

CATACACAGAT AT ACT ATAGTTCAT AACACTGGTGtGCTT AGCTGT AAATC •. 
ACAAAAT AGCACT GGAATT AT ACTAGTGATCATAGCACATAGTCCAAGAA 
GAAAAAATTTTGATCTTGTTCTT AAACTTTGTGGAGCCAGTGGTGAAATG 
AGTCACACAAAGATGCAACAATG 

>127.2 

ATGAACCCAGCtXrTGTTT AGACT AAGAT ATTCTT GCCCATCACCAOCAAT 

ATTACAATAAAAATCAAGACACATGAAGGAGCATAOCT 

>128.1 

TTGGAGCTCCXXXaCGGTGGCG^50CGCnjTGAAACAATGCT 
GAAACGACAGCGATGTTTCCXrr AACGGCATCTTAGCA CCA 
CGGTCTCATT(XACAGCCTGGTAGCTCGGT 
>129.1 

TCTTCATArGCCTOCTTTTGG^^ 

AAGCTTCTCT GGCTCTCXX^GCCTGGCCCAGGGCTCCAGCTATGAGCTTC 
CATAACACCOCTAGI 1 1 1 OC Y UACATTGCCCTCATAGT ATATGGAATTTG 

GCTGCTic i gggtattgcttgccatcaaggccxttcacacccaacctaatg 

cctgggocagagtaggtgc 

>131.1 

^^gc^accgosgtggcggccgcccgg^ 
T CATT AG C^GTTTTTCXiAAACAAGCGACTTTTAGCAAATT AACCGTTAAT 
TTTMTGAGATTCAAAAGTTAATAGCX^rTCTTAACGTTTTATAATTAGA 

* g ST9ttatataattagagctggacacccacatggagaaact aatttgac 

agacccttgggagttgtagggagttaaagctgatcattatatactattat 

atactt agggatacaacccaagggcaacccctggccttt atgaaaacctg 

gagtgagttattatttcctggtaatacaattctctgccagccagttgctg 

catcaaaacagttctgatacacacacctaaagtcacc^ 

ggtccccaataaccctataagcctctctccttgtag<5tgacctctgccct 

gtgaagggttggctcaccccaaga 

>132.1 

gtggcxsgccgaaaccgtggtggccgtgatcgtgccgttggcggacggaac 
ctt gaagatgttctgggcgg^xagcacaatcgccgccttgccgacgatga 

CATTGTTGGCCTTCAGCCCGTCAATATCGCCtrTTGATGTCX^ 
CTCTCCTCATCATGGCTCAGCGCAATGGCGGCGTTC^ 

TCAGGTTGCOCTGAAGCAXXXJCCTTCTTGTCCTGGGTX^CATCACCGCGC 
AGCCGCGTGCCGCCGGCAATGAACTGGATATTG CTCAGGCGTTTTTCGTC 

^CCAGAC^GATACCTTGCOGTCCGCGCGTCCTTGACAG^ 

gaaggagaacgcgcc i roci gagcttgccccggaaagtttgccatcc 

>133-1 

gtcgcggccgaggtacgataattcatgcx^tttctttgo 
ttctgatataataggttacaaagcaaaattgagatgatttttaaaatxscc 

^ < ^5I rArnTTJCT QAAT AACAT AAATTTT AAACAGAGAGCT GAAAA 

TAACTGCCTTCTCTTCACAAXxAGGCAATCAGAAGGCAGGACT ATAGTTTT 
CTGTGI UCI 1 1 1 C CACA GGAGAGAT AATT ACATTTCTAGAGACCCATAG 
AAACAATTOCATAGTTTTAATTTCATCTCTCT atctct 
>134.1 

G i^IS G ^ CCA ^JJJir ' 1 ' G I caagattggctccatgctgttatttaatc 

™ I^TJT^^T 07 GTA ^VCAAACCTAACAGCTCrCCATCTTAAAAAA 
GGAGTGAGGAATTGAGCTGGACAGAGATGTGCA 

CCTTT CAT AAAGACTTGATCGTCTT ATTT ATCTGGATTGGCCAT ACACAG 

T AATCTCACT AGCTGACAG TTGCTTOCCGCGT 

>135.1 

AGCTCCCOGCGGT GGCGGCCXaAGGTACCTCTCCTGC^GGGCCCTCCATTC 
AGGGTCnTCCTGGAAAACCCCCTGGAGGAAGCGCTCCTGTI GCAGTCGGA 
GTGAACACCCGTCI IGI I rAACCACCAGCAGGGGGATTCCl I fCTGOAQA 
GTm^AS3TCATCATCTCTTTGM0CTCTGC AI I I I UCUCCAGAAAGGC 
GAGCATG TTACT T GTCATCTTGGGATCCGAATGACAAACTCX^MSCAGATG 

ATTTGACAGACTGCT(XACAAGTCATCA 
TTCTT AGCATTTCT ATAGCTGAACTTCTTTAAGT 
>136.1 

CGCGGTGGCGGCCGAGGTACTT AAAAGT AT ATCAGGGCAG TTTCATGCCA 
GGGAGCCAGGGAAGGCA CCC AAGGAAGTGATGGAAGAGTAGAAGTTCACC 
AGGTGCAGCTCAr^AAAGGGCTCAGCAAAl I rCTCTGTAACAGGATGCAG 
ACCCCGCGT 
>137.1 

GCGGOCGAGGT ACTAAATTTAGCAACTTT ATTCATGAGGAACACCAGTCC 
AATGGTGGTGCTCI rOTCCTI CATGCTTACATGGATGAACTCTCATTTTT 

' ^^^^^^^^^T^^^CCAGACrrrCTJpGAXnACTTTGCTnT AATTT 
>138.1 
CAGTTTGCATaCA i GC i aaaCaGAGaAA i G i CCTCaaaa i 1 CAGl I aC i a 

aaaattactgatatctccatgattagaaccacactgtggttgtgtgtgta 

GTCAAAGGAGGAGAATTTTTAATGCTATATAAGCATAACTGAT AACT GCT 

ATTACAAATAAATATTCCACAAATTTGGAAAGTTATTAGAGGAAGAATTT 

TTTTTCCTTGTAATTTCCAGGTGTTTATATTAGTTGGGCCATAGTGAAAA 

TTACATGGAGGAAAGAAAATAGGAAAATAAGTCACAGAAAAAGAAAATCA 

AAAGAAA 

>139.1 

TTGGAGCTCCCCGCCGTGGCGGCCGAGCCCAATTCTTGAI 1 1 CI I ICCAT 

CCCAAACTCTTrAAACTCTTGACCTCTGCAATTCAAGTTGTGAACATGAA 

ACTTGTCTATCACCAGCCTCTTCTCTGCATTCTCTTTCCCTCC7TGCTAT 

GCTAAAACTT GGATGG CCTCTGAAGAT ACT GCT CTTCACCCCTCTGAAGG 

GGGCTCCTCAAGGGAAGGT 

>140.1 



CAGCa^TGTTTCCGTAACGGCATCTTAGCACGaAAAAGCTCCACGGTCTC 

ATTCCACAGCCTGGT AGCTCGGT 

>141.1 

tggcggccgagcccaattcttgai i icm i ccatcccaaactcttt aaac 
tcttgacctctgcaattcaagttgtgaacatgaaacttgtctatcaccag 

CCCCTTCTCTGCATTCTCTTTCCCCCCTTGTTATGCT AAAACTTGGATGG 

CCTCTGAAG^TACTGCTCTTCACtXXrrCTGAAGGGGGCTCCTCAGGGGAA 

GGT 

>144.1 

CTCCCCGCGGTG GCGGCCGTTGCCCTTACATCTCTCATTTGGAAGTGACA 

GGTATTAAATAACGGCATATGAAAGCTrAAAAGTCATCAAATACAATCAC 

TGGGTACTTTCGATTA(XXiAAACCAGGCACTrTCCTAAACTCX^CACTTC 

TTTACTTCTGCGGTCTCC I llt l ll I ATTCCCCCGCGT 

>145.1 

ACCGAGCTACCAGGCTGTGGAATGAGACCGTGGAG C 1 1 1 1 1 CGTGCT AAG 

ATGCCGTTACGGAAACATCGCTGTCGTTTCAAGAGCTATGAGCATTGTTT 

CACA 

>146.1 

CCCGCGGTGGCGGOGGTrCTGCTTAGCCAGTTTATTCTTT A I ! II 1 1 > A C 

TGGAGTCATTGCCAGTGATGGAAACGGTGTTTGC I rCTCTI ICAGTCAAG 

ATCTGCACAAAGTATAGCATTAGGTGGTATTTATTGTTTATATTATGAGT 

TCT ACATTCATCTTTCCAGCACTCT GAAGTT ATCAG CAAG TTCT CAGTCA 

GTTCAAGGCATTGGATTCTGCTTGA 1 1 I C I .I 1 1 1 AATTCATT G I fTTTO A 

CCCCTTTGAGAGTTTTAATAGAGAGGAGTCTGGAAGGCAGAGATCTCCAC 

CaCCTAAGCGTGAGAAATTTGGAACTAAGGACTTGCACTGGTCCCCAAGT 

TAACAGTGGATATACTTCCTGCA 

>147.1 

ACCCAAGGTGGGCATTTTTrTAAAAAACCCATGGAAATAAATGCTACTTC 
TTGTTAGTGTTGTTTGAAAATAAACAAAGAAAATGCAAACAAAACAAAAA 
CCATGGTCCATTGAAGCTCAAGAGTATTTAACCAATGCTCT gttgcctct 

taaaggattggtagctatttccccatctacaaatacatgacaattaacta 
agcxx^aattctttaaaactatctggaattaggtcaaaattatctaatttt 
tttctgatttaattatggattacgtaatccaatagttggcaacattataa 

AACCCTAACTTTA<Xn^C^TTGTTTGGCTATACCAGGTCTCATGACTCTGG 

ACATAACCACCA 

>148.1 

gtggcggccgaggtacctatgtgcgccgtggtagaaaagcacctgggtcg 

GGTGCAGACTGCGGAGCGGGCCCTACCGTGTGCGCAGAAAGAGGAGGCGC 

tggacttatcctaccttaagtttgaag^^gaccagcaa 

CAATCTtX^ACAGGCATCTTTACTCTGAGCCAAGGAAG TG1C 1 G 1 1 C I I Gl 
GCTGAGTTTCAGGGGCCTTCAGCTTGCGGGAAATCCCGAAGATGGCCAAA 
CiACAACTGAACTGTTCGTTGCTTCCAGGGCCTGCTGATTCTTGGAAAl G 1

GATTaTTGGTTGATGCGGCATTGCCCTGACTGCCGAGTGCA 

>149.1

TTGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCCCCTGAGGAGCCCCCT 

tcagaggggtgaagag<^gtatcttcvvgaggccatccaagttttagcata 

ACAAG G AGGG AAAG AGAATGCAGAG AAG AG GCTGGTGATAG ACAAGTTTC 
ATGTTCACAACTTGAATTGCAGAGGTCAAGAGTTTAAAGAGTTTGGGATG 

gaaagaaatcaagaattggg 

>150.1 

GGTGGCGGCCGCTGTGAAACAATGCTCATAGCrrCTTGAAACGACAGCGAT 
GTTTCCGTAACGGCATCTTAGC^CGAAAAAGCn"(X^CGGTCTCATTCCAC 
AGCCTGGT AGCTCGGT 
>151.1 

CCGCTCTGAGCCATGGAAGATACTGG^GTTAACAAAAATTTTATAAACTA 

TGAATTCrTATTCCTGAAAGAATGGATAATGAATCAGGAGATGAGCAAAA 

ACGTATCTTTT ACAaAGCTCTAGTCTTCCAAAAGCCTCTAAACTCAAACG 

AAACCTTTTTAAAGTAGTTTTGTAAAAGCTCAAGGTATGCCATTTCCAGA 

AAGTTGCAGATGAGCACM^TTGGCATTACCCAAATTCTGTCACACATTGA 

GCAATGAAATTCAGGAATTGG ACAATG ACCTCTT GGCATATGAAAGAATT 

AAAAGAGGGC 

>152.1 

GCGGCGGGTCCACCTAAAAAGTCACTGCAGCAGAGAAGAAAACATTGGAC 

AAAG AAGAAAGGCG ACAG AAGGCT AGAGAGAG GCAGCAGAAATTGCTTGC 

GGAGTTTGCTTCACGA(^GAAAGGCTTTATGGAAACTGCAATGGATGTTG 

ATTCTCCTGAGAATGATATTCCTATGGAGAT<^CCACGGCAGAA<XACAG 

GTTTCCGAGGCAGTATATGACTGTGTTATTTGTGGACAGAGTGGCCCCTC 

CTCTGAAGATCGACCTACTGGATTAGTTGT 

>153.1 

GGTGGCGGCCGAGGTACACCT GCAACTGTGCGAATGGTCCT GTTGCCTCC 
TG CATTTTGGCCTCTGTTCTATAAAGGhAAGAGTAAAGATGGAGCTCCTCC 
TGCCTCCATCACGAAAGCACATATCATCTGTCCCTTTGGATTTTACTTCC 
AGGACGCGTGTCGTCCCCAGCGTGTGTTGCCTT ATGGTGCCGGCAGAGCC 
T(^G CT ATCTG CCTG G GAAGTCGG ATGTCCTTGG AGAGAATTTGGAATGC 
AGATA A IIIIIUI1 ATTTCTTGAGAGCTT ACTTT AATCAGCATGACACTA 
CCTAAACACT GAAGATGGCCTT AT ATTAGT AAGATTTGCACAAAATTAAG 
TATACCT ATGCAAACTATTACTTTGGTTTTTAGGAGTTTGGTCAGATGAA 
GAAGTAATG GGATCACATATATATGTAAG 
>154.1 

TCCACCGCGGTGGCGTCOGGOCCCCGC C 1 1 1 1 CT GCGGCTTTCAGCGCGC 

GTITCAGGTCGTCAATGAGGTCGTGGGCATCTTGGAGACCGATGGACAGG 

CG G ATCGT G CCCTGG CTG ATG CCTGCGCCCGCCAGGG CTTCGTCG CTCAT 

GCGGAAATGCGTGGTGCTGGCCGGGTGGATCACCAGGCTGCGGCAATCGC 

GGACGTTGGCGAGGTGGCTGAAGACCTTGAGGGTTTGAATG>>GTTCrTTG 

CCCTGCTCGCGGTTGCCCTTGAGGTCAAAGCT 

>155.1 

GTG GCGGCCG CCCG GCAGGTTT AAAAAG AACATGTATAAACG CTTAGCAA 
ACCCTTTTT AATGTT CTGAAGTCAGTCTTTGTAAGTGAAATOGCTGGAGA 
CTAGAAAGTATGAAATGGCAGTCTACCTGGGCAACCTACAAAAAATTTAG 
CTTGAAAAGACTTCAGTCnTXXaCTCCCCTGTTGATCTCATGGAGTGGGGA 
ATGGGAATTGAACCAGAACTGGAAAATTATTrAGGAAAGTTTGTTAACTA 
CTCTTTGTTGATCTCATGGAGTGGGGAATGGGAATTGAACCAGAACTGGA 
AAATTATTTGGGAAAGTTTATT AACTACTCTTTCTGCTGAGTAAATTTAA
ATGTGTTCT GGACATTGTTGAGGTCT AGAATTGTCTATACAATGCCCTGT 

>156.1
ACC GGGCTGG CGGTCGCCCGCTCTGGTGCTTGCATCTTGGCTTCCT AT AG
CTTTCTTTTTT ACAGAG G CCATGAAATG C AATCCAG CTG AAGT ATT ATCA 
TCTTGTAGCATTTCAAAAGGAACGTCGAAGTCATCCAAAGGATGGGAACC 
ACAATGTTCTTGTTGTTCCTTGGG I HC l I AATGATTTCTGAATCATCAT 
TATTAATTATGGAATTCTCTGGTCGAAAAGTCACATl 
AGTTTCT CACATC 

CCTTTTTTT ACTGTCC I 1 1 CCTTGTCTTACTT CAGGT 
>157.1 

CGG GG G CG G CCGAG AAATGTCG C 

CCGAGTCGCCTCTCCCCGCGT 
>158.1 

TGGCGGC£X3ACTCGCTGACCAGACCAGGCCCCCAGGGCCCAGCTACTCGA 

AGAACAGCCAATGGATTGGAACGTCCTAGGACAGATGCCACGGCTTTGAC 

CCAGGCTGGGGGTGCACGGATCTCACTGGGGCTAGTTGGTCGGATGGGAA 

AGCCCCATGGGTC<>CCAGGATGAGGTGTTTAACTCTATCAGGGT 

>159.1 

ACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAG 
CGCAAAAGGCTCCAGACTTTGATGTCAGT GGATGATTCTGTGGAGAGGCT 
GTATAACATGCTC^TGGAGACGGGGGAGCTGGAGAATACTTACATCATTT 
ACACCGGCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGG
AAATCCATGCCATATGACTTTGATATTCGTGTGC C I 1 1 1 1 1 1 A TTCGTGG 
TCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTG 
ACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCT CCTGAT 
GTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAGCCAGGTAA 
CAGGTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTA 

>160.1

ACACAGGACX^UKTGCTGCXCATCCACATGGAATTTACAAAC^TTCTACAG 

CGCAAAAGGCTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCT 

GTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATACTTACATCATTT 

ACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGGG 

AAATCCATGCCATATGACTTTGATATTCGTGTGC C I I 1 1 M I ATTCGTGG 

TCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTG 

ACTTGGCCCCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGAT 

GTGGACGGCAAGTCTGTCCTCAAACTTCTGGACCCAGAAAAGOCAGGTAA 

CAG GTTTCGAACAAACAAGAAGGCCAAAATTTGGCGTGATACATTCCTAG 

TGGAAAGAGGCAAATTTCT ACGT 

>161.1 

GGCCGAGGTACCATCCTATTAATACTAACTTCTGCTTCTACATACTGTAG 
ACCTTTCTGGATGATAGAAATCAATGCAGCGGGTGGGACGAGGGCACCAT 
TT AT ATTGGACTGACTGAT ATGGCTTTCT AT ACCAAAGGTAAATGCTGAA 
TGAGAAAATCCTGACTCTTGCAAGTATCTATATACCAAGAAGTTGACCT 
ATCACTGCTTATACT CATCTTT ATTCCCACTT AAAOCATGAGGT CACACC 
ACAGGATATAACCCATTGGCAGTGCATTGATGTGGGGATGTGCAACT GAA 
TATCCGGGCACCGCCAATCACAAGTTGCTGTTGTTGATGCT GGAAACGGT
GGCCTTCAACGCCGCTTCOCCCTTCCGGGAATCCCCGCG 
>162.1

GGCGGCCGAGGTACCT GGCCTGCTGGCATAGTTCTTTGACCGG1TCATAT 
TTGGGCAAGTGATTTGACTGTTGGATATTCTrGCTGGATTCCTCC ITCT I 
ACGTAGAAATTTGCCT CTTTCCACTAGGAAT G 

TTGAGGACAGACTTGCCGTCCACATCAGGAGGTGTGTCGAGCC^ 
ATCCAGGATCGTGGGGGCCAAGTCAATGTTGAGAACGATCTGTGGGACTA
TTG ATCCTGGTT C7 ACACTTGGACCACGAAT AAAAAAAGG CACACGAAT A 
TCAAAGTCATATGGCATGGATTTCCCCTTGACCAGTCCAAACTGCCCAAT 
ATGGTAACCATGGTOGGCGGTGTAAATGATGT 
>163.1

TGT ACATTGTCTT AAAATCTG TG GCTTGCCTGTTCAATTCATT AGTGGTG 
TTTTGTTAAGCAG I 1 1 1 t AATTTTGATG AAGTGTAACTTATTCA 1 1 1 1 1 1 
ATTATGGTTATTGCTTTATGTTrCAGGTCCCAAATTTTGCCTTCTCACAA 
ATCACAAACATTATCCTATGTTTTCCTTCAAAAATTATATG 

>163.2

TACT AAAG AAATTTGAGGGATTTGCTAT AATGTT AGGGA 1 1 > 1 TC TAGAT 

>164.1



AGTATTTAGCCCAGAGGGTTGTGGTGAAGTGT 
>165.1 

TAGTAATCAAC CTGTT AATCCAAGGTCTTTAGAAAAACTTGAAATTATTC 
CTGCAAGCCAATTTTGTCCACGTGTTGAGATCATTGCTACAATGAAAAAG 
AAGGGTGAGAAGAGATGTCTGAATCCAGAATCGAAGGCCGTCAAGAATTT 
ACTGAAAGCAGTTAGCAAGGAAAGGTCTAAAAGATCTCCTT aaaaccaga 

ggggagcaaaatcgatgcagtgcttccaaggatggaccacacagaggctg 
cctctcccatcacttccctacatggagtatatgtcaagccataattgttc 
ttagtttg cagttacccct aaaggtgaccaatg at 

>166.1 

tgatgagctctctaat<>gcaggaccaaggtgtgaagtgggaatgaacat 
g gat ccat cgcattgg atggagaagaaaggtggacagcctgttcgt ct ct 
catgtcagcctagggctgggaacagtttgtgaggacttatctgttgtacc 

T 

>167.1 

AGCGCAAGT AGGTCT ACAAGACGCT ACTTCCCCT ATCAT AGAAGAGCTT A 

TCACCTTTCATGATCACGCCCT 

>167.2 

CCCCT ACCGCCAATCCCTTTTTACAATAAAACAGGACCGAAGGGTCCAAA 

c 

>167.3 

ACCrTTGAAACXXXrrAACCGAAGTTACCCrrCGGGCCCGCTTCTTAAGAAA 
CTAAGG 

>168.1



TCCTAGGAC^GTTCGCGGAAGAAGTGGCTCACGGCTTCCAGAGCCACATC 
ATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGGCCTGCA 
GGT ACCTCGGCCGCTCTAGAAC 
>169.1

GGCCGCCCGGGCAGGTACTTCCACTATTATTGAATGTATTCTGTATTATA 

ATTGTATATTTGATTGCCTATCTCCCCTCAACTGCATTATaCATTTTCAT 

GGGTGAGCCAGTGT Cl I 1 1 1 CACTCTATTTCAGTGCCCTGCACATTTTCT 

GGCACATAGTAAGCAT 

>169.2

AAAATACTAAAATCCGAAATGCTCATAAAATTCAAAGCTTTTTTGAGGAC 

CTGACCTCGTGCCTCAAAGGAAATGCTCATT 

>170.1

TGGCGGCCGAGGTACTTAGCTGTGTTTTTATTCAAAGTCTAC^ 
TAGTGGTTAATGTTTGCTGTTCATTAGGATGGTTTCA jCAGTT 

tcx^gggagaaaacaagggatgaccattgttggttgggagactgaaggtg 

ATTGAAGGTTCACCATCATCCTCACCAACTTTTGGGCCATAA
A CCGT TTGGTGGAGCCTGAAAAAAATCTGGGCAGAATGT aggacttcttt 
ATTTTGTTT AAAGGGGTAACACAGAGTGCCCTT ATGAAGGAGTTGGAGAT 

cctgcaaggaagagaagg^gtgaaggagagatcaagagagagaaacaatg
aggaa catttcatttgacccaacatccttt aggagcat aaatgttga^ 
taagttatcccttttgtgctaaaatggacagtattggcaaaatgatacca 
CAACTTCTTATT CTCT GGCTCT ATATTGCTTTGGAAACACTT AAACATCA
>171.1 

GGOGGCCGCCCGGAGCGGCGCGGAGCATGATGGAAGTCGTAGTAGGAAAT 
GGCGTCGTGGCATTGAGGGGCATCCCTtXJTAGAACCTCCAGGAAAAGCTC 

gcggaagacgaggttctgcggagagagaggct CCAAGCAGTCTGGGAAGT 

gtagtccagttggcttagcagtagtttcgttgggggggagccgaggttcc 

ggcaaggggctaggccggcttgaaaagagattatgactgtacctcggccg 

tcgagcggccgcccgggcaggtagaacttttatacaactcaggagattaa 

aaaaaaatctccacaagaagaagcaactcagcaggccctggcattaaaac 

atttcccagaataaacagatatgcattgcattaaaggtaattttcaaata 

tttaagttacaccaagatttccctccaatatgtgcctttctcaaaccaat 

gca acta arrcattgctaatactggggcatgaatttttggcaaatgttta 

TGGTTTTACTTTCTTCATTAATCAAAAAATTTTTTAAAGTGCT accaagc 

agcaaaacatgtcgcatcagttctctgctcatggcagaagtgcccactgt 

GAAA 

>172.1

gcggccgggtacagatttaaggttgatggactcagggtaaggatagctac 

agct gtgtg gggctgaaggtctgtggcactgagctactggggaaggaggg 

ctctgttttcattgtgacacactgagttaataaagcacttactgagggag 

ccagagcccaaactctaaatgtgctgtagaaaaagggccaagtcattgac 

tgcaccactccttcagccagaggtagaaaggatttactcttcagccatct 

ggtagagccccaagaacaaqttacatgtggacaaagggagggagaggtat 

catggtgattaataaattcaaacaaagctgaatgataagaccccaggatg 

aagcaggtcaaggaagatacac ggaggcagagggactgaagg 

>173.1 

tggcggccgagtacgcgggataggtggaaaaaaacactgccattcacaag 
tcaaggaacccagggccagctggaagtgtggagcacacatgctgtggagc 
acacatgctgtggagattgcagtgtgtctgaggtttgtgtagtagtggaa 
gattttaggtatgtagagcaagttgaaatggattgagactgcatgggggc 

ATAAATGAGAAATTGCCTGTAGCATCTAGTCT ACTTGAAGGAAGTGGAGA 

cataaggagagacaaaaacaggtttgtgccataaagtattttttcaaaga 
caocaagatgtgggtaaatgaaaattattagttcac 

>174.1 

GG7GGCCGAGCGG0CGCCCGGGCAGGTACCACTAGGGTGTT GTTAAAGGA 
CTTGATAAOCAGCTTGAAGAGGTrCCTACTGACGAGAAATGGAATGAAAT 
TTAAGCATCAATAAGGGTAATAACTGCAAGAGACTGACATCCACT ATGGT 
TTAAATCCATGAGGTCACAATGATACTTAATTTTTCATTATTCTGAAAAC 

cagtaaataaaggctaagattcaacaagcatttatccagcct7tcctcaa 
tgaaatatatcttaagagaaccgaa 

>175.1

aggtaccaaaacctggggattaagctaagaagtctggtggagagactctg 
tggacgtaaagaagggaatgaacacagagaaactttcagccagattcctg 
agtgtcacctgaacaagaaaagtcaaactggagtgaaaccatgcaaatgc 



GAGAGCTCATGCTGGACACAAACGATCTGAGTGTGGTGGGGAATGGAGAG 

AGACGCCCCGGAAACAGAAACAACATGGGAAAGXXTTCATTTCCCCCAGT

AGTGGTGCAC<>GCGCACAGTAACACCAACTCGAAAGAGACCTTATGAATG 

CAA 

>176.1 

AGGOGGGGTGCTQTGAAGAQCTTTGCATTGTQGGA 1 1 I UC 1 M CTC 

TAAGGCAGGAAGATGGCGGCCGCACAGAAGACGAAAAAGTCGCTGGAGTC 
GATCAACTCTAGGCTCCAACTCGTT ATGAAAAGTGGGAAGTG

TATGAATTA 
TTTATTTTCTTTCTCAGAAAAGGATGCGCCTCCACTTAGCAAGGCTGGGC
AGGATGTGGTTCTGCATCTCCCCACAGACGGGGTGGTTCT AGA 
>178.1 

TGGCGGCCGCCCGGGCAGGT ACCAAACCATTTTCACT AGTT CAGGAT AGG 
AATATTCATCAGATTGTCTCTGTAAAAGTGAATCACAAAAATTCCACCTG 
TGTAG GTGTG GG ACTGG ACAGCTGAGTGACAGGGCCCTGGQAAGAACAGA 
AACGAC1 1 1 ICCTCTTTCCTCTGAAATATCAGAAGTTAAAAATCTACTCT 
GAGTTATATGTGCATCAATTTTAGACATATTGCTGATTTTATTATGAAAA 
TGAAGTGCT AAAGACAAAGG ATATTTCCATTCCTCTGGACAGGCAGCCAC 
AGACCAGCACTGCTTGACCCATGTGTATACACATGTGTGCTTTGT 
>179.1 

CGAGGTACTCACAGTCACGCAAATTGACTGTCTGGGTGCACGGCTCT CCA 
TTCI ICI I CI IGGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAAT 
TAAGACCTTCTGAGGATGAGCGAT AGAT AAACACAOCT CCT CT GAACCAT 
CCTT GGGC TTCATGGGGTTGGCATTGAGGATCCCTACGACAGTCOCCTGC 
TCCGTCTTCCAGAG CG CTTTGTGAACTTCTCCAAATAA GAACAAGGACAC 
ACATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCT GAAGGTTT 
TTCCAGTATTCACACTCTGTGGCGTAAOCTTCTTCAATATAACCCCAAAT 
GTCAOCCAATCTATTTCTTCCAGCTTCTCTCTGGCCATCTTTTCCTTGAT 
CTGAGACAGTCTGATCAGTTT 

>180.1

GCGGCCGAAAACTGATCAGACT GTCT CAGATCAAGGAAAAGATG GCCAGA 

GAGAAGCTGGAAG AAAT AGATTG GGTGACATTTG GGG TT ATATTGAAGAA 

GGTT ACGCCACAGAGTGTGAATAGTGGAAAAACCTTCAGCATATGGAAAC 

TGAATGATCTTCGTGACCTGACACAATGTGTGTCCTrGTTCTTATTTGGA 

GAAGTTCACAAAGOGCTCTGGAAGACGGAGCAGGGGACTGTCGTAGGGAT 

CCTCAATGCCAACCCCATGAAGCCCAAGGATGGTTCAGAGX3AGGTGTGTT 

TATCT ATCGATCATCCTCAGAAGGTCTT AATTATGG GTGAAGCTCTTGAC 

CTGGGAACCTGTAAAGCCAAGAAGAAGAATGGAGAGCCGTGCACGCAGAC 

TGTGAATTTGCGTGACTGTGAGT 

>181.1 

GGCGGCCGAGGTACTCACAGTCACGCTCCTCTGAACCATa^TTGGGCTTC 
ATGGG GTTG GCATTGAGGATOCCT acgagagtccogtgctccgtcttcca 
GAGGGCTTT GTGAACTT CTCCAAAT AAGAACAAGGACACACATTGTGT CA 
QGTCACGAAGATCATTCAGTTTCCATATGCT GAAGGTITTTCCACT ATTC 
A CACTCTG 1 GGOGTAACC 1 1 CI I CAAT ATAACCGCAAATGTGAOCCAATC 
TAI I rCl ICCAGCTTCTCTCTGGCCAT CI I HCC I IGATCTGAGACAGTC 
TGATCAGTTT 
>182.1

GCGGOCGAGGTACATGGATACG 1 fCTC I I CTGGGGGCGGTCTCCAGTCCT 

TTCTCATXaAGGGAGCACACTXXTCTGCtrrCATTGCAGTGGCCTCAG 

ATGGAATTAAGATCCACCTGGrrGTGATGAATAAACCCAGACTCTCAGCAA 

CGCAGGAAAAAAAACAAAAACTGGCTGGCGATCT GGAGT AAAGGATCCTC 

ACATC^ACGTGAACCAGGAAACTCTGTGCCX^AAATCGACGAAAAAAAAAC 

ACTGGGAGAGCCGAACTAAAAGTCTTTTAGCACGGGT 

>183.1 

gtggcg6ccgaggtaogcggggagcggaaagggagactgtgg<jgaactag 
gagcaacagcaggcatggagcaaagcagtgaaggatgtatgaaaaagatt 
agcagtgtgaatcttgacaaactt ataaatgactt ctcacagat agaaaa 

GAAAATGGTAGAAACCAATGGAAAGAACAATATACTGGATATTCAGTTGG 
AAAAAAGTAATTGCCT ATTAAAAGTAATGCAAGCAAAGGAGGTCTCCATT 
" AAAGAAGAATGTGCTACTCTTCATAATATAATAAAAGGGCT ACAACAGAC 
GATTGAATATCAACAGAATTTGAAAGGTGAAAATGAACAACTAAAAATAA 
GTGCTGATCTTATAAAAGAGAAGTTAAAGTCTCATGAACAGGAATATAAG 
AATAATATTGCCAAACTTGTAAGTGAAATGAAAATCAAAGAGGAGGGATA 
TAAGAAAGAAATAAGCAAACTTTATCAGGACATGCAGAGAAAAGTTGAAT 
TAAATGAAGAAAAGCACAAAGAACTAATAGAGAAAAAGGAGAT
>184.1 

GGCGGCCGAGGTACATGGATACGTTCTCTTCTGGGGGCGGTCTCCAGTCC 

TTTCTCATGAGGGAGCACACTCCTCTGOCTCATTGCAGTGGCCTCAGGGA 

TATGGAATTAAGATCCACCTGGTGTGATGAATAAACCCAG ACTCTCAGCA 

ACGCAGGAAAAAAAACAAAAACTGGCTGGCGATCT GGAGTAAAGGATCCT 

CACATCCACGTGAACCAGGAAACTCTGTGCCCAAATCGACGAAAAAAAAA 

CACTGGGAGAGCCGAACTAAAAGTCTTTTAGCACGGGT 

>185.1

GTAC GCGGG GGTGTCOGGGGATGGGCACGGGCATTTCTTC»TTTATAGCT 

GTCTGTTTGCATTCTGATTGGGAACACTGGGATCATTTTCATCATGCCGA 

CAGTGGTGCTAATGGATGTATCCCTTTCCATGACCCGACCTGTGTCTATT 

GAGGGGTCCGAGGAATACCAGCGAAGCACTAAGTAATATGGATGATTATG 

ACAAAACCTGCTTGGAGTCTGCATTAGTTGGTGTTTGCAATATCGTTCAG 

CAAGAATGGGGTGGTGCAATTCXrTTGCCAGGTTGTCCTGGTGACAGACGG 

CTGTCTT GGCATTGGTAGAGGGTCACTGGAACA 

>186.1

CGCGGTGGCGG COGAG GTACTCACAGTCACGCAAATTCACAGTCTGCGTG 
CACGGCTCTCCATTCI I CI ICI IGGCTTTACAGGTTCCCAGGTCAAGAGC 
TTCACCCATAATTAAGACCTTXrrGAGGATGATCGATAGATAAACACACCT 
CCTCTGAACCATCCTTGGGCTTCATGG GGTTGGCATTGAGGATCCCT AOG 
ACAGTCOCCTGCTCCGTCTTCCAGAGCtaCTTTGTGAACTT CT CCAAATAA 
GAACAAG GACACAC ATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATA 
TGCTGAAGGTTTTTCCACTATTCACACTCTGTGGCGTAACCTTCTTCAAT 
ATAAG CCCAAATGTCAGCCAATCTAI T 1 CT TCCAGCTTCTCTCTGGCCAT 

CI 1 1 IOCI I gatctgagacagtctgatcagttt 

>187.1 

ggcggocgc ccgg gcaggt accagagattccagagagtggtctttggaat 
ttcccaac lixil 1 1 gcttcagtgccctgatctct gaact aacaaaccaga 
aagaagtggcagcatggacttatcattacagcacaaaagcatactcatgg 

AATA7TTCCCGT AAAT ACTGCCAAATCGCT ACACAGACTT AGTGGCCATC 

cagaataaaaatgaaattgattacctcaataaggtcctaccctactacag 
ctcctact actggattgggatccgaaagaagaat aagacatggacatggg 
tgggaaccaaaaaggctctcaccaacgaggctgagaactgggctgataat 

gaacct imycaacaaaaggaacaaggafwactgt^gtggagatatacatcaa 
GAGTCCGTCAGGCCCTGGCAAGTGGAATGATGAGCaCTGCTTGAAGAAAA 
AGCAGGCATTGTGTTAGAC 
>188.1

AC1 1 1 HI 1 1 1 1 1 II 1 1 1 II 1 1 1 U I AACT ACAGGT GTCAGATGCATCACA 
AAAGCAGAAGTGUXIIICAG CJCIIC1 CTGTGCC AIJC CI IGI CAATTT 
CATGCTGCCTACAGCAACAGCATAATACTGCAAACAGCCATGATGTCA
>188.2

rraATTraAGGGOAG^^ 

agcggatgggaagtgatactaggtatgtaaaggatggtcagttacctcta 
aatgtaagttagaccaggacagccag 

>189.1

GAA G G AAAGCA G CTGCAAACTTCCCATCTGCA G I G 1 1 I GI I nSTCTC GGC 
TCCGGCC^TCACTGCCACGATTACXXXrrGGATGAATTCX^CAGTGGAAAT 
^IpAACAAGACTCAGCCCACCTGCACCCAGG^^ 

" "** * " ' ATGGAOGCGOGGGACATT 



TGGTGCCCTGACTTGGATtAGGGGACC I UOC 1 1 G GGAGATCAATCCCCTG 
^^^T^ 1 ' * * * 1 ^^^^^^^CCAOCTACG^^CTCTG^IGC 

tcagaccaaccagcccaaggaacatctcagcaatttttaatcaaoaatat 
TCTGTGAAAAAGACT AAG AT ATCAGAG AAATT ATT AGTGCACATT ATT AG
AAGAGAGCTTCAGATGAAAATAAAGATCAAGAAAAGACTCTTGCTTTGAG 
AAGACAGAAAGAAATCACATCATCTTATTGGGATTACTGGC 

>190.1

c^tosccgtcxx^ttgctcacagggactgggaaggcgatgcctggcggga 
gct gcrrggtggagagactcgggatgactcctgct cagattcaggccttgc 
tcaggaaaggggaaaagtttggtcgaggagtgatagcgggactcgttgac 

ATTGGGGAAACTTTGCAATGCCCCGAAGACTT AACTCCCGATGAGGTTGT 

GGAACTAGAAAATCAAGCTGTACCCTGATGCTACAGACGAGGACATCACC 

TCACACATGGAAAGCGAGGAGrrGAATGGTGCATACAAGGCCATCCCOGT 

TGCCCAGGACCTGAACGCGCCTrCTGATTGGGACAGCCGTGGGAAGGACA 

GTTATGAAACGAGTC^CTGGATGACCAGAGTGCTGAAACCCACAGCCAC 

AAGCAGTCCAGATTATATAAGCGGAAAGCCAATGATGAGAGCAATGAGCA 

TTCCGATGTGATTGATAGTCAGGAACTT 

>191.1 

GTACTCCCTGGAAAGTCCAGCTGAGAAAGCGATCCTGCCCTCTGCTCCTC 

CCAGGGTTACCCTCCTGTAAGTCTT CTGCTT AGTGTTCAGAATTGGGGGA 

TGCTGGGACTGGGCAAGGACTTGTAGGC^CACCCCATAGCCTGCTCATG 

CCTGTrGGGTT GCCTAT GGATCATTCCXn"GCTGGGCTCACTCACCGGCTT 

CGTATAAGGTCC1 1 1 1 1 GAGGTTTATTATTTCCTTGTCCATATACTTGAT 

GCTCTTCATTGGCTTGTCTGGGACCTGCCTTAGG7TCTCCX5AGGCATAAA 

AGGGCCGGACAGCCCCCGAGTTGGGGGAACTCTGAAGC1TCTT GGTGGCT 

GGAACCTTGGTCATCTT AAAAATCCTTCAGGTTTTAGCCTGTGCCCCCAA 

GACAAGGATTTTTCCAGAATCTTCTACTTCAGTAGTTACTGGTATGAGAA 

GTTTCGGCAACTTCTCCCTGATCCCCAAGTCCGAATTACA 

>192.1

TGGCGGCCGCCCGGGCAGGT A C 1 1 1 1 M I T 1 1 1 1 I I I I I 1 1 1 T TTTTTT C . 
TGGCTTGAAATACAGCTGAAATAACTGAATTTTCTACTTGAAACGTGTGT 



GGTCTCACAGATAGTTGCCCCCGCGT 

>193.1

TTTTCTCTTCCTTCGCTAACGCCTCCOGGCTCT C 
C - 
>194.1 

CGGCCGCAGCGGCAGCT ACAACAACCGCG TCGCT CT CCGCT CAATTTCCA 

AGAGCCAGCTTTGAAGCCAAGTGCCCCCGCGTACCT 

>195.1

CXXaCGGTGGCGGCCGGTGT GCTGTGCTCAGCTGCCTTCCAAAGGAGGAAC 
AGATCXjGCAAGTGCTCGACGCGTGXSCCGAAAATGCTC 

taaaaaocctgaaacatgacgagagtgttgtaaagtgtggaaatgccttc 
ttaaagtttataaaagt aaaatcaaataca 1 rmm i c aaaaaaaaaa 
aaaaaaaaaaaaagt 

>196.1

G<iCGGCCXlAGGTACTTTGAGCTCATAAGCTGGTA7 AAAAT ATCAAACATT 
TTGACTGTTTAAACAACT (^AGATATGTTTTGCAAAATT ACAAAACATT A 
TACAGGTGACTTAATTAATATCTACTCCAATTATACACAACACATCATGC 
TGAAGATTTAGATTTAlTTGAAAACACTrAGTCTAATTTATATTAGTGCA 
GAAAAATCACATTCAATAAACCACAATTGTAGAAGAGACAGATAAGTGTG 
TTTGTCACATTTTCACACAAATATAATTTGATATTTAATTAAGGGATGAT 
GAATCACAATCACCATt 
CCTCACTCGGATCCCCCCGCGT 
>197.1 

.GCAGCCCGGTATGCCGCCAAACGCTrCCGCAAAGCTCAGTGTCCCATTGT 
GGAGCGCCTCACTAACTCCATGATGATGCA 
>198.1
UllG CTCAGCCTTTCCAGGCCCCTCTGATGAGCTCTCT AATCAGCAGGAC

CAAGGTGTGAATGTGGGAATGAACATGGATCCATCCCATTGGATGGAGAA 

GAAAGGTGGACAGCCTGTTCGTCTCTCATGTCAGCCTAGGGCTGGGAACA 

GTTTGTGAGGACTTATCTGTTGT 

>199.1

GACCAAGGTGTGAAGTGGGAATGAACATGGATCCATCCCATTGGATGGAG 
AAG AAAGGTGGAC AGCCTGTTCGTCTCTCATGTCAGCCT AGGGCTGGGAA 

cagtttgtgaggacttatctgttgt 

>200.1 

aagatggccagagagaagctggaagaaatagattgggtgacatttggggt 
tatattgaagaaggttacgccacagagtgtgaatagtggaaaaaccttca 
gcat atgg aaactgaatgatcttcgtgacctgacacaatgtgtgtccttg 
ttcttatttggagaagttcacaaagcgctctggaagacggagcaggggac 
tgtcgtagggatcctcaatgccaaccccatgaagcccaaggatggttcag 

AGGAGCGTG ACT GTGAGT 
>201.1 

gttcaagctcaacaagtcagaact aaaggagctgctgacccgggagctgc 
ccagcttcttggggaaaaggacagatgaagctgctttcca 

>201.2 

ctgatgagcaactt ggacagcaacagggacaacgaaggtggactttccaa 

GAAGTACCTGCCCGGGCGGCCCGCTCTAGAACTAGT 
>202.1 

TTGGGGCACAGAGAGGGTTTCAGAGGATCCTTGTGAAACACTAGTTAAAA 
GATGACGAGTGGGGAGAAGTGCGAGGAAAGAAGGAAATTAGTCTGACTGG 
CTTTCTGTCCT GCACGATTGATTCAATGGAGACTGGCGGGAGGAAATGGA 
AGACTAGGGTTGGAGATGGGATGGGTGGGGCAAGGGATGGAAAGGAAAAG 
GCAGACAACT AATGCGTTCCATTTATAACAAGTAATATATATCAAAGACT 
I^ G ^^^ AA ^ C ^ T ^^ TM ^ <M ^CTTTAATTCTTAG 

>203.1

TCCTTTCTCGTTCCCtXaGCCATCTTAGCGGCTGCTGTTGGTTGGGGGC 
TCCCGCT CCTAAGGCAGGAAGATGGTGGCCGCAAAGAAGACGAAAAAGTC 
GCTGGAGTCGATCAACTCTAGGCTCCAACTC^TTATGAAAAGTGGGAAGT 
ACCT 

>204.1

OGCGGTGGOG^CCGAAAACTGATCAGACTGTCTCAGATCAAGGAMAGAT 
GGCCAGAGAGAAGCTGGAAGAAATAGATTGGGTGACATTTGGGGTTATAT 

*S*gaaggtt^cc>^ 

TGGAAACT GAATGATCTTCGTGACCT GACACAATGTGTGTCCTTGTTCTT 

TAGGGATCCTCAATGCCAACCtX^TGAAGCCCAAGGATGGTTCAGAGGAG 
GTGTTpTTTATCTATCGATCATCXn'CAGAAGGTCTT AATTATGGGTGAAG<2 
TCTTGACCTGGGAACCTGTAAAGCCAAGAAGAAGAATGGAGAGCCGTGCA 
CGCAGACTGTGAATTTGCGTGACTGTGAGT
>205.1

ocgggtggcgxkxx3aaaactgatcagactgtctcagatcaaggaaaagat 
g g cgagagagaagctggaagaaat agattgggtgacatttgg ggtt atat 
tgaagaaggttacgcxagagagtgt^ 

TGGAAACTGAATGATCTTOGTGACCTGACACAATGTGTG^ 
ATTTGGAGAAGTTCACAAAGCGCTCTGGAAGA^GAGCAGGGGACTGTCG 
TAGGGA7CCTCAATGCCAACCCGATGAAGCCCAAGGATGGTTCAGAGGAG 
GTGTGTJJTAT^ 

TCTTGACCT6GGAACCTGTAAAGCCAAGAAGAAGAATGGAGAGCCGTGCA 
CGCAGACTGTGAATTTGCGTGACTGTGAGT
>206.1

CGCGGTGGCGGCCGAGGTACTCACAGTCACGCTCCTCT GAACCATCCTTG 



CTTCCAGAGCGCTTTGTGAACTTCTCCAAATAAGAACAAGGACACACATT 

GTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCTGAAGGTT7TTCCA 

CT ATTCACACTCTGT G G CGT AACCTTCTTCAAT AT AACCCCAAATGTCAC 

CCAATCTATTTCTT CCAGCTTCTCTCTGGCCATCTTTTCCTTGATCTGAG 

ACAGTCTGATCAGTTT 

>207.1 

CGCG6TGGCGGCC GCCCGG CAGG TACA TGGTTCTTCCT AGAAAGTGGTTC 
TTCCTTAATGTG I rTCTTI 1 IACCCCM MCI IC1 ICTTCI I CACAGATG 

tttcttcttcttctg ccactttttcttctt cctcttcttcaact gaat ag 
ggtaagtgtaaaggcacaacaaattaacactgtatcagatctcattcctt 
ccaaaaacgtttgagtcctag hi 11 t 1 c i gtcattctcatcaactaccc 
aatgtttgttttgtttattttataattgggaaggttctccaaggcct acc 
actaactttaacgaatgatatagatagagctcagagcaatcttctcacga 

TCATG AAGTCATGTAT AAAAATC AGGATT AAAAC AAAGGTCATCT GATCT 

ccaatcattattgggaagaaagtcaattatattagaaatggttaagagct 
tgcactctgaagtcagacggcctgggtttaatct acctgctgcaaccctg 
aaaaattgtatttacccttggtgaag ctcccta 

>208.1 

ACATGGTTCTTCCTAGAAAGTGGTTCTTCCTTAATGTG 1 1 IUI I i 1 1 ACC 
CCM II CM CM CMC I I CACAGATG I I ICI ICI I CGTCTGCCACTTTTT 
CTT CTTCCTCTTCTTCAACTGAATAGGGTAAGTGTAAAGGCACAACAAAT 
TAAC ACTGT ATCAGATCTCATTCCTTCCAAAAACGTTTGAGTCCT AGTTT 
TTTTCTGTCATTCTCATCAACTACCCAATG I ML.M I I GTTTATTTTATA 
ATTGGGAAGGTTCTCCAAGGCCTACCACTAACTTTAAaSAATGATATAGA 
TAGAGCTC^GAGCAATCTTCTCACGATCATGAAGTCATGTATAAAAATCA 
GGATTAAAACAAAGGTCATCTGATCTCCAATCATTATTGGGAAGAAAGTC 
AATTATATTAGAAATGGTTAAGAGCTTGCACTCTGAAGTCAGACGGCCTG 
GGTTTAATCT ACCTGCTGCAACCCTGAAAAATTGTATTTACCCTTGGTGA 
AGCTTCCT ATCT ATAAAACTTAAGAATGTCTTATCTT ACTGGACTGTTAC 
TGATTTAAAAAGAT 
>209.1 

CGCX3GCGGCGGACGAGGTACACGACATAGGCACATGTGCAAAXACAAAGA 
AGGTGGGCTGCTGC I I C I M C 1 ATCTGCCCCTAG^CCAGGCTOCTTTGCT 
TCACGTAAGATGGAGACTGTCCCATTCCTCTGAAGTTGCTGGAAGGACAT 
TTCCCAGGAAGAAAC^TTCCTCACTGCCTATAAACTGTAGTCACATGTG 
GGATAGTCAATAGAACATGAGAATCAGAACAATCTGGGCAAATGGGTATG 
GGAAGAATGGGAACACCACAACAGGACAGATGCGAACT CTCATTCATGCC 
AGGCCTTTTGGCATATGGGTGCCTTCTGTGT C I I C1 11 C CA 
>210.1 

GGCGGCCGAGGTACTCACAGTCACISCTCCTCTGAAC^ 
ATGGGGTTGGCATTGAGGATCCCTACGACAGTCCCtrTG^ 

gagcgcggtgtgaacttctgcaaataagaacaaggacacacattgtgtca 
ggtcaogaagatcattcagttf ccatatgctgaaggtttttccact attc 

ACACTCTGTGGCGTAACCTTCTTCAATATAACCCCAAATGTCACX5CMTC 
TATTTCTTCCAGCTTCTCTCTGGCCATC 1 1 1 MCM GATCT G AGACAGTC 
TGATCAGTTT 
>211.1 

CTCACCGCGGTGGCGQCCGAGGTACTCACAGTCACGCT 

CCTTG«GCTTCATGG«GTTGGCATTGAGGATCCCTACGACAG 

TCCGTC1TCCAGAGCGCGGTGTGAACTTCTCCAAATAAGAACAAGGACAC 

AC^TTGTGTCAGGTCACGAAGATGATTCAGTTTCCATATGCTGAAGGTTT 

TTCCACTATTCACACTCTGTGGCGTAACCTTCTTCAATATAACCCCAAAT 

GTCACCCAATCTATTTCTTCCAGCTTCTCTCTGGCCCCATCTTT 
>212.1

tggatgacat tggcg gtggtccttgataccagataagccctcagtgtgaa 
gcagctctt atttttccttgtcttg agattgctct ggaatggaaattagg 
ctttttt g aaggtgtgaccxstttttgttgatttcttcagc>gttactttt 

TAATTTTTAAATGTTTGACACACAGTCTCTGATAAATGATCATTCACCAA 

tcaccgattactctccttgctctgttaagtgtgacactgtcicctttgaga 
atctggcgacagctatgtatcccataaccacacaccccaaaaaaaaaa 

>213.1 

gg cggcc gtttgagaagccagcgctcacccacccggggtctctgtgcatt 
gacctttgggtgctgacttggagaaaagcacaaacacgaccagtcx«ccc 
gcgtacctcgg 

>214.1

ttttaacacaatatacctaacatatttttatttcaatatctaaccagtat 
aaaaatttacttgttttgccctctagagatagtaagctcctt aagtaaac 
agaagtaatacx^gattaattagaattcxx^ccgtcatcaagtgtgtgc 
ttatatagaagaaacxx>gtaaatgtttgttgattgaaagatattaatac 
tcttgcttggatgagagtgaggaaaaaggtattagtattg gcttttcac 

>215.1 

GCGGCCGAGGTACTTTGGAGTCCtXrrGGTTTCT AAGAATTGCCGTTGACT 

CI MCI I tgg gttctgctggcacggtaaccag ACT ccct acaact gcact 

CTTrGTCTTTGTCATGGAAGCCGCG^GCGTAGAGGTTCCGCGTGCTCTGC 

CGGACTTGAGCAGGTCACTGGGTCCTTTACACTTGTGAATTCXjyKAGCTTG 

CCAGATGTATCCTCAATGCATT6CCACTTCTGCGCCGGTTGTTCACAGGC 

TGTCTGGTACGAGATCTCCGACCAGTCTGGGGGCGCTGGCGGCCTGCGCA 

GCCACCTCAAG ATCACAGATTCTGCTGGCCATATTCTCTACT CCAAAGAG 

GATGCAACCAAGGGGAAATTTGCCTTTACCACTGAAG^TTATGACATGTT 

tgaagtgtgttttgagagcaagggaacagggcx;gata<^gaccaactcg 

tgatcctagacatgaagcatggagtggaggcgaaaaattacgaagagatt 

gcaaaagttgagaagctcaaaccattagaggtagagctgcgacgcctaga 

agacctttc^agaatctattgttaatgatcttgcctacatgaagaagagag 

aagaggagat 

>216.1

CCACCGGGTGGCGGCCGAGGT ACTTTGGAGTCCCCTGGTTTCTAAGAATT 

GCCGTTGACTC 1 M CI 1 1 ggcttctgctggcacggtaaccagactcccta 

CAACTGCACT CTTTTGTCTTTGTC^TGGAAGCCGCX^GCGT AGAGGTTOXl 
CGTGCTCT GCCGGACTGTGAGCAGGTCACT GGGTCCTTTAGACTTGTGAA 
TTCGAAGCrTTGCCAGATGTATCCTC^TGCATTGCCACTTCT 
TGTTCACAGGCTGTCTG^TACGAGATCT CXXJACCAGTCTGGGGGCGCTGG 
CGGCCTGCGCAGCCACCTCAAGATC^CAGATTC1GCT6G<XATAT^ 
ACT<X^AAAGAGGATGCAACCAAGGGG>AATTTGCCTTT accactgaagat 
TATGACATGTTTGAAGTGTGT T 1 1 GAGAGCAAGGGAACAGGGCGGATACC 
TGACCACTCGT GATCCTAGACATGAACATGGAGTGGAGGCGAAAAATTAC 
GA 
>217.1 

GCGGCCGAGGTACTATCAAACAACATGATACAATTTAAATGTGTCATAGC 
AACTACTAGTGGTCACCTGAAATCCATTTTCCCCTCC 1 1 CACAGTAAGAG 
TTTT AGCTGAATGAGTGGCCACTCATAGAGAG^TTGCATTTCT ggcttcc 
CTTGCAGCCATAGGTAGCCATGGGACAAAGTTCTAACCCAGGGGGGGTCC 

AAIC1 1 1 1 ggcttccctgggacacactggaagaagaagaattgtcttggg 
ccacacataaaatacactggcatcaaggatagctgatgagcaaaaaaaaa 
aaaaaaaaaaaaagt 

>218.1

cggggtggcggccgaggtaccatcctgttccacagagcca 
ctaaattgaatccgactgggcgtgcccctcctcggaacacaacagtagac 
cttaatagt ggaaacatcgatgtgcxttcccaac^tgacaagctgggccag 
ctttcataatggtgtggctgctggcctgaagatagctcctgcctcccaga 
tcgactcagcttg g attgtttacaataagcccaagcatgctgagttggcc

aatgagt atg ctgg ctttctcatg g ctct gg g tttg aatgggcacctt AC 

caagctggcgactctcaatatccatgactacttgaccaagggccatgaaa 

tgacaagcattggactgctacttggtgtttctgctgcaaaactaggcacc 

atggatatgtctattactcggcttcttagcato 

accccxiaacctccacagagctggatgttcctcac^tgtccaagtg^ 

cagtggttggcattg gccttgcatatcaaggg acagctcagag acat act 

aacgcgcgact<x>ccgo^tcttcx^cctac«gcctgcgag^c^ 
ccgcgtacctcggccgctctagaact aagtgggatcccccgggct 

>220.1 

gcggtggcggcx^aggtaccatgatatcatgtatcctgcttgga(^tttt 
gggaagggggacctgctgtttggccaatttatcct acaggtcttggacgg 
tgggacct ctt gag ag aagatctggtaaggtcagcagcacagtggccatg 
gaaaaagaaaaactctacagcatatttccgaggatcaaggacaagtccag 
aacgagatcctctcattcttctgtctcggaaaaacccaaaacttgtt gat 
gcagaatacaccaaaaaccaggcctg gaaatctatgaaag atacctt agg 
aaagccagctgct aaggatgtccatcttgtggatcactgcaaatacaagt 
atctgtttaattttcgagg cgta gctgcaagtttc^gtttaaacacct 
ttcctgtgtggctcacttgttttccatgttggtgatgagtggct agaatt 
cttct atccacagctgaagccatgggttcact atatcccaatcaaaacag 
atctctccaatgtccaagagctgttacaatttgtaaaagcaaatgatgat 

GT 

>221.1 

ccgggcaggtacagcaacaagaatcagatgctctttagagatcctccatt 
tcattactct aacattcttcaatgtggttccagccacgcatagtcatata 

GATACT ACATATTCAAAGATAACTTACTGAAGCTTGTTCACAGAACCAAG 

CTTTCTCCTGATAGCTCTTCTTCCCCTACCCCGCACTTTTGGAAGTATTA 

CCCCAAATGCTCTT CAG GATTT AAAT AAC AATTTTTAAAAAGAC ACTTAA 

CACCACAAAATGGAATTTGCTGGCATGAGGCGAACAATACGGTTACTCCA 

G ATG CTGT ATTCAAACTGT ATG G G TCCGTTGAAAAAAT AGATAT AACCAT 

TTrrCTCATAGACAGCATCTACTTTATCACCAATTCCTGGGAAGTCTTCT 

TCTATTAGTCTCGGATAGTCTTTATCCATAATATGGCT AGT ATCATCATA 

TCTCCAGACCT GGTTTCCTG>GAACAGG AGAGTCTTGCCTGTATCCTCAA 

AGT GAAC^GC TGCACTTATCTTCTTAA CI ICI 1 1 I GGAAGACCCAGTTCA 

GATATTTTTTTGGGATAACCTTCCAAAATGTCATAACCAT 

>222.1 

ACGCGGGGAGTGTAACT ATGGCCGGCCTGCGGAAGGAAAGTGAACAGGAG 

CCGCTCTT aggcgacacacctggaagcagagaatgggacattttagagac 

TGAAGAGCATTATAAGAGCCGATGGAGATCTATTAGGATTTTATATCTTA 
CT ATGTTTCTCAGCAGTGTAGGGTTTTCTGTAGTGATGATGTCCATATGG 
CCATATCT CCAAAAGATTGATCCGACAGCT G AT ACAAGTTTTTTGGGCT G 
GGTTATTGCrrCATATAGTCTTGGCCAAATGGTAGCTTCACCT ATATTTG 
GTTTATGGTCTAATTATAG>CCAAGAAAAG>GCCTCTTATTGTCTCCATC 
ZI^ 1 ^^^ T ^*^*^^^^ < ^^ CTATGCATATCTT CACATCCCAGC 
TTCTC ATAATAA ATACTACATGCTGGTrGCTCGTGGATTGTTGGGAATTG 
GAGCAGTTTTTC 
>223.1 

GCGGCCX5GACTGATGCCATCTGCAGTTTTCT 

CTTCGAGGTCAGGCCATTAT CTTTA ATCCTGAC I MM I G T G GAGAAACT 

CCGACATGAGAAACCTGAGATTTTCACTGAGTTGGTGGTCAGCAAT^ 

CAAGGCTCATCGATTTACCTGGAACTGAGTTGGCTCAGCTGATGGGGGAA 

GTGGAGCTTAAGTTGCCTGGCGGGGCTGGCCCAGCATCAGGA 

GTCTCTCATGTCTCTCAAGCGAAAGGAAAAAGGAGTGATATTTGGGTCCC 

CACTGACGGAGGAAGGCATTGCCCAG^TATACCAACTGATTGAGTATCTA 

CACAAAAACTTGCGAGTAGAG GGTTT GTTTAGAGT 
GGCCGCCCGGGCAGGTACTCCCTGTAAAGGGGAAT7TCCATGCCGTCTAC

AGGGATGACCTGAAGAAATTGCTAGAGACCGAGTGTCCTCAGTATATCAG 

GAAAAAGGGTGCAGaCGTCTGGTTCAAAGAGTTGGATATCAACACTGATG 

gtgcagttaactrccaggagttcctcattctggtgataaagatgggcgtg 
gcagcccacaaaaaaagccatgaagaaagccacaaagagtagctgagtta 

CTGGGCCCAGAGGCTGGGCCCCTGGACATGTACAGACTCTCATTTTATGA 

tgtatcctactgcatcaggacatttgtgtcaatgtcaggtgacgagggga 
aatgaaagtgatgagacgatgagaggagtgaaat accaaggacg ccatac 
taggaaacccaggtctatttgttatcagagtaaggatcaagccagatagc 
ctgttatgtaatttctccgataaaagattttgaaagcaggtgctgtgggc 
atctgtatggggaatcgcactcatagaattattttcatttgtaaatattt 
ggtatcaggccaagcaagggaaagaagcttt actgtattaccatcttt 

>225.1

cgctxxxxx^gtggcggccgaggtactcacagtcacgcaaattcacagt 
ctgcgtgcacggctctccattcttcttci iggctttacaggttcccaggt 
caagagcttcacccataattaagaccttctgaggatgatcgatagataaa 
cacacctcctctgaaccatccttgggcntcatggggttggcattgaggat 

CCCTACGACAGTCCCCTGCTCXX5TCTTCCAGAGCGCTTTGTGAACTTCTC 
CAAATAA6AACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGT 
TTOCATATGCTGAAGGTTTTICCACTATTCACACTCTGTGGCGTAAOCTT 
CTTCAATATAAOCCCAAATGTCACCCAATCTATTTCTTCCAGCTTCTCTC 
TGGOCATCT 1 1 ICCTTGATCTGAGACAGTCTGATCAGTTT 



ACGCGGGATGGATAGCOGCTTGCAGGAGATCCGGGAGGGGCAGAAGTTAC 
GGOGACAGCTCCTCGCGCAGCAG TTGGGAGCT GAAAGTGCCGACAGCATT 
GGTGCOjTGTTAAATAGCAAAGATGAGCAGAGAGAAATTGCTGAAACAAG 
AGAAACTTGCAGGGCTTCCTATGATACCTCTGCTCCAAATGCAAAACGTA
AGTATCTGGATGAAGGAGAGACAGATGAGGACAAAATGGAAGAATATAAG 
GATGAACT AGAAATGCAACAGGATGAAGCTTATCATCAATTCATTGTATA 

aaaataaagagattttcctgagagaactgatttcaaatgcttctgatgct
ttagataagataaggctaatatcactgactgatgaaaatg 

>227.1

acggaaagtgattcagagaaogctggggctcacaggcgctgtagcaaacg 
tgcaactcttgaggaacacttaagacgccaocattcagaacacaaaaagc 
tacagaaggtccaggctactgaaaagcatcaagaccaagctgttactago
tctgcgcatcacagaggggggcatggtgttocacatgggaaattgttaaa 
acagaaatcagaqgagccatoggtgtcaatacoc1 roctacaaactggat 

TATTAAGAAGTTCAGGGAGTCrTTGGGCACAGAC^AAGCCAGGAGATGGAT 
AAAATGTT AAAAAATCAAGCAACTTCTGCT, ACTTCtGAAAAGGATAATGA 
TGATGACCAAAGTGACAAGGGT 
>228.1

AGACT7GGCTGTTGGGAGGGGCGTGTCTTACAOCTTAGGAAGAATCCTTA 
GCTGTACI 1 1 CCTGTC I C ! CO I GGAGCTCCCTCCTACCCOCT AGCTGAGT 
AGGCCAGGTTTTGGTGCAAAATCTCCCACATTGGCAAA G T 1 CC I GCATAT 
GCTGCGCAGT A I G I GCC 1 1 G AATAAAAATOCTGAAGATTAGATGGTTCAG 
G^TG^TCATCC CAAAGCAAAGAGCACCTCTTTG AAGCTCAC GTGC CCGG 
GOGGCOGAGGT Anlri iitiimII'IMMMIII IQAGTATGTAGCI I T AA 
AACAGTTACATATAACATGGAACAGTATGACATGAAAAGAGAGAOGTTTA 
TAGAGGGAG 
>229.1 

GGCGGCCGAGGTACT ACAGGATGATGGCTTTCTCTTCCTCTGGGTCACAG 
GCAGGGCCATGGAGTTGGGGAGAGAAtGTCTAAAOCTCTGGGGGTATGAA 
C3GGGTAGATGAAATTATTTGGGTGAAGACAAATCAACTGCAACGGATCAT

CTT A 
>229.2

TGTTACATTGGTCAGTTTTTACTTGTAAAAAGTATTATAGAAGA6T7TTA 
TTGGAATG TTATTT TATTAAGCCATTTTCATGGGTT A 1 1 1 HI 1 1 1 AAAG 
TTTAAGAAGTTTTTACAACAGGCTGGGGGGGGGGGGTTACACC 
>230.1 

ggcggccgccgggcaggtacgggggggagtcagacccagtcaggacacag 

CATGG 
>231.1 

TCCCCGCGGTGGCGGCCGAGGTACGACGTTTCCATCAGCTTGTCTGTTTC 

ATTCCCTGATGTTACGAGCAATATGACCATCTTCTGTATTCTGGAAACTG 

ACAAGACGCGGCTTTTATCTTCACCTTTCTCTATAGAGCTTGAGGACCCT 

CAGCCTCCCCCAGACCACATTCCTTGGATTACAGCTGT 

>232.1 

AAAAAGATATTTTAATAT ATTCAGATCCACAAATATGAAATAAAACT AAG 

TAGAGCTGGTATTCATTTACACATAATTATCTTATACCGTTTGGAATAAG 

AATTTGGGGCACGTTAGCAAACCAAAAGGCTCAAAAAGACGTOGAGATAT 

TTAGTTCTTGTCTCCCTCTAGAAATGTGAAGCACTCTTTTATCCGGCATT 

CCTAGGGGAGTTCCT ATTTTCAAATTTGGAAATCATTTCTGGTGCTAAGC 

AATCTCAAAAAAAACATTTACTAAAAACCAGAGGAAAAAAATCTTATAAC 

TTTGGGAG 

>233.1 

GCGGCCGCCCGGGGAGGACGCGGGGGCCAG 1 I C I C HCGGGGACTAACTG 
CAACGGAGAGACTCAAGATGATTCCCTTT7T ACCCATGTTTTCTCT ACT A 
TTGCTGCTTATTGTTAACCCTATAAACGCCAACAATCATTATGACAAGAT 
CTTGGCTCATAGTCGTATCAGGGGTCGGGACCAAGGOCCAAATGTCTGTG 



AACTGGTATAAAAAGTCCATCTGTGGACAGAAAAGGACTGTGTTATATGA 
ATGT TGCCCTGGTTATATGAGAATGGAAGGAATGAAAGGCTGOOCAGCAG 
TTTTGCCCATTGACCATGTITATGGCACTCTGGGCATCGGGGGAGCCACC 
ACAACGCAACGCTATTCTGACGCCTCAAAACTGAGGGAGGAGATCGAGGG 



>234.1



TGCCTGI 1 1 IG 



ACGC 
>234.2

CACAGATTTT ACATATATATATGAAACTGT ATAATGT GTTOGCTTCAGTG 

TCTGGCTGCTTTTAC7CAACATTGTGAAATTAATTCCTGTTATOGTATAT 

GGGATTAAAAI I IGI 1 1 GCCTAGTTTTTGCCTTCTCATTGCTTCTGAATT 

GGGGCAGCTTTGCCCCTCAAGGGAAATTTAGCAATGTCTGGAGACATTTT 

TTATTTTCATAATTTGGAGGGACATGGGGGAGGTGTGCT AGAGAACTTAG 

TAGGT AGAGGACAGGGTT AGTGCTGAACGTTCCACAGT 

>235.1 

CCTCCCAATTATCCCCAATTGAGAGATGAAAATTCTGACAAGCTCTGAAA 

cgttaactgacttgcccataaatgacagttccaaagttataaggct AGAA 

CT TGAATOCA GG1 CTG M AGAAATCTAGGTTTGAGAATCCATATTCTTTC 
>236.1

CGGGCGCCCGGGCAGGTACCTACGCGACAGACAGCCAGAGGGAAAGOGAC 

ccagacaggagcocctcctcgacaggcccaccctgcagctcaggcaccaa 
gaaaacagccgatactggcagccattgcagctccaaactqcagaggcaag 
gccaattttaacttttcaatttacagtcgattttgaagagcttct 
tcggttatgtaaattcatatatgtatttttggaatcagttcttataaaca
gctcgattcagttttagctaaatttatagtctaggt agt atgtt acattt 

GAACI II I m CI I A AGAAAAGTTGACTGTTCAGATATTTTTCTACTGTAA 
AGAAATATACTTTTCTATTAAAGATCTGT 

>237.1 
GCAGTTTTGT GATCTGCA ATGATTCTTCOGTTCGAGGTCAGCCCATTATC

TTTAATCCTGAC! fill TGTGGAGAAACTCCGACATGAGAAACCTGAGAT 

TTTCACTGAGTTGGTGGTCAGCAATATCACAAGGCTCATCGATTTACCTG 

GAACTGAGTTGGCTCAGCTGATGGGGGAAGTGGACCTT AAGTTGCCTGGC 

GGGGCTGGCCCAGCATCAGGATTCTTCCGGTCTCTCATGTCTCTCAAGCG 

AAAGGAAAAAGGAGTGATATTTGGGTCCCCACTGACGGAGGAAGGCATTG 

CGCAGATATACCAACTGATTGAGTATCT ACACAAAAACTTGCGAGTAGAG 

GGTTTGTTTAGAGT 

>238.1

CACCGCGGTG GCGGCCGAGG T ACGCGGGGATTGTGTG CAAAATCAGAGAG 

GGGTGCAAGATCCTGATTTTTCAGGAGTTCAAGCGACAATGGCAGCCCAA 

TACGGGAGTATGAGCTTCAACCCCAGCACACCAGGGGCCAGTTATGGGCC 

TGGAAGGCAAGAGCCCAGAAATTCCCAATTGAGAATTGTGTTAGTGGGTA i 

AAACCGGAGCAGGAAAAAGTGCAACAGGAAACAGCATCCTTG<3CCGGAAA - 

GTGTTTCATTCTGGCACTGCAGCAAAATCCATTACCAAGAAGTGTGAGAA 

AOGCAGCAGCTCATGGAAGGAAACAGAACTTGTCGTAGTTGACACACCAG 

GGATTTTCGACACAGAGGTGCCCAATGCTGAAACGTCCAAGGAGA 

>239.1

CGCG GTGGCGGCCGAGGTACCAGTTAAGTGAACAGCTCGTCT AGGTCTGC 
TTTTGTAACACCCAAATACAATTAGCACTTCTCTGCTGGTATTCCCTGGG 



TATGTGCTTTGTAACTGCTGATTAATTCAGTTTCTTAACTAGGCAGAGCA - 
GGTCATCAGTGTATCT AATTCACACT ATTAATACACT GT C I TG C1 GAAGA 
GTCTGACCTGCCCAGAACCCCGTTATGGCT AGCCCAGGGAAGCAGTAAAC 
TGCAAAGCAGAGAAAAGGGGCAGCTAAGATGAGGCTAGTGCTGGCTGAGT 
CCCAGTT aggtct GTT ACT GTTCTGTTCCAACTATAAATCCAGGATGACT 
GTTACTCAGATTCAGTGCTATGTAGAAAATAGAATGCACAGCCAAAAACA 
TAATTTGGGGATGACTGGCAGCAC C I 1 1 II I I CCCI 1 I CI I 
>240.1

GGGGGCGG CCGAGG TA U I II III I I I I II Mi l rGGTATGACTATGAAG 
GC TAGT GGTCTTTTT ATT AGCT atcaagttcattt aacagacaaaaaatt 
CA QTTCAAT GGGGGCATTAAAATAGGAAGAATTAACAATAGTTCATTAAT 
CAATCTTTCAGC rBTTOCI ATTTTATCACAATAACTTTTCCTATAATTGA 
GAGATCCATGAGGAAGTCTTGAAAAGAACGTATG IIICIM CAATTCCAT 
■ AAAACATTCAGCCAAAATAATAAAAQAGGOGCTATTACI I IG I I I TGGGT 

gaatgatatgcaggctaggctttgctgtagt 

>241.1 

ggcggccggtgtgctgtqctcagctgcci iccaaggaggaacagatcggc 

aagtgctcgaogogtggcogaaaatgctggcgaagaaagaaataaaaacc 

ctgaaacatgacgagagtgttgtaaagtgtggaaatgccttcttaaagtt 

TATAAAAGTAAAATCAAATTACA 1 1 1 1 1 1 1 1 CCAAAAAAAAAAAAAAAAA 
GT 

>242.1

*ctgtctcagatcaaggaaaagatggccagagagaagctggaagaaatag 
attgggtgacatttggggttatattgaagaaggttacgccacagagtgtg 

^TAGTGGAAAAACCTrCAGCATATGGAAACTGAATG A T C I ICGIGACCT 
GACACAATGTGTGTOCI IGI ICI 1ATTTGGAGAAGTTCACAAAGCGCTCT 
GGAAGACGGAGCAGGGGACTGTCGTAGGGATCCTCAATGCCAACCCCATG 
AAGaX^AGGATGGTTCAGAGGAGCGTGACTGTGAGT 

CC 1 1 tjGGCA GATGCTGTATTATGGGGATAAGCCACACA L 1 1 1 1 IG AACTG 
OO006G7CAGGGGGGACATAACC AI I rC C TCTGOC ACCtXATCAATCOOC " 
ACCT ATTCTGAGTGTAGGCTOCTCCCCTGCTTGAGT AATGGCCACAGATC 
TTGGCJOGGCACTCCTAAGCTGCATGTTG^ 
* GGC1 i G i GGTTOCATTC T CCAGATCCTTGGGTTGGCrrCTGGGTGCACTA 
GGAGATCTGAAATGCTCTCAGGOCACCAGGAAAGTACTGGAAGTAAAGTC 
TGACTCTAAAG AAGATGAAAATCTAGT AATTAATGAAGTAATAAATTCTC

CCAAAGGGAAAAAACGCAAGGTAGAACATCAGACAGCTTGTGCTTGTAGT 

TCTCAATGCACGCAAGGATCTG 

>244.1 

gtaccaccacagttgctatctcttgaacatctttcattaaaac^ 
tcta gt ttgag aata c 1 1 1 1 aagcctgctggc c i cc i i iggggcattctt 
ttttctu 1 1 icagcacgcatci i ici i i iccacttactocgtaagcttt 

TAGCGATGTTTTACCTTGAGGGCCGAAGTTAACTTCAGCGGGAGTGAACG 

acaggggtgggctccactttatccagtgcactcggaagccggagggcccc 
caccaaaaagagcaaggggaaccctcgccctcaacaaggcctgcatctcc 
ggactggagctgaagtatag 

>245.1 

acaattgcttgagtgagttcatggtccgtaggaggatgaccactagccca 
ccaccttccactg 1 1 i c i acagtoctggocagcaagtttggagttaaggc 
ttgaaaatcctgcagcacacacatgccgaaggtattgcccaggatcttgt 

GGGTCTCGTTGTAGTAGCAGTAGCGAATGTTTGTGGCTGCTATGAAGAGT 
TCAAAGGGGTCGTCCTGCTTTATGTrCAGTGTTCCATTCTTT A 1 1 1 I C I I 
CTGCAGCTGTCGCATT C I 1 1 IC I I ICG 
>246.1

OGGGTGGCGGTCGTGGGGATCAGCGT AGGTGAGCTGTGGCCl 1 1 IGCGAG 
GTGCTGCAGCCATAGCTACX3TGCGTTCGCTACGAGGATTGAGCGTCTCCA ■ 
CCCATCTTCTGCGCGGGAOCATCTACATAATGAATCCGAGTATGAAGCAG . 

TTCTGTC 



AACTCTGAAGATGATTCAGCC 1 1 C I CCATCTGGATCT C I IG I I GGAAGAG ■ 

AAAATGAGCTGTCCGCAGGCTTGT(XAAAAGGAAACATCGGAATGACCAC 

TTAACATCTACAACTrCCAGCCCTGGGGTTATTGTCCCAAAAAAAAAAAA 

AAAAAAAAAAGT 

>247.1

CTTGCTTGACTAGATGAGCTGCTATAGTAGCCAATCCTGTTAGACTTGGA 
CCATTGTTTGTCTGAAGAACGGGGATCT GTCGCT cgccctgagcact GT a 
TTTATTCCCCTTr ACTCAGTCCCAGGGACTTCT CCAGTAGCGACAACTCTG 
CGGOCGCCGCCATCTTC 
>248.1

AGAGACGAGGAATTT AATT AGGGTTGT AACAAATGGTT AATT AT AGT AAG 
AAAAACCAAATTGAATAATTTTCT AACTCACTTGGCAGGGGGGGTCTCGC 
AGOCAT AATGAACATCACAT AATGAAGTTAC I C C 1 1 1 C CAGATCTATAAA 
CAGGCTCATGTAACTAACTGATACTCAGTAAAAGGGTCCATAATCCAAAT 
TTATATAACAAATGGGGCTTGCTATAAAATCTCITACATTTTAATACTTA 
CTCTTAATAAATCATCTATTCTT C CC ICC nC I I CTCTAAGGCAGAATTC 
TTACIGI I f ICI AGGGCAGATAI 1 1 1 T T CI ATTGTGAGGTGGGACTGGGT 
CTGTCTGGGCTGGATGGAGATCT G I I I M G GGAGCTGCAGGAATGCTCTG 
TGTTGCCAGATCCCGTAAATGAGGGACTGT 
>249.1 

ACTGTCTCAGATCAAGGAAAAGATGGOCAGAGAGAAGCTGGAAGAAATAG 
attgggtgacatttggggtt at ATTGAAGAAGGTT ACGCGACGGAGTGTG 

aatagtggaaaaagcttcagcatatggaaactgaatgatci i os i g aoct 
gacacaaigiuigicci lb i ici i atttggagaagttgacaaagcgctct 
ggaagacggagcaggggactgtogt agggatcctcaatgocaaocogatg 
aagcccaaggatggttcagaggadggtgtgtttatctatcgatcatcctga 
gaaggtcttaattatgggtgaagctctt gacctgggaaoctgt aaagcga 
agaagaagaatggagagccgtgcacgcagactgtga ai i fgogtgactgt 

GAGT. . 

>250.1

G GGGG CO GGAG T G^TG OCA TCTGC A GI 1 1 1 B r GATCTGCAA TGAT I CTI C 
Cu* ICGAGGTCAGCCCATTAiCI t lAATOCGGACI I II I IUTGGAGAAAC 
TCCGACATGAGAAACCTGAGATTTTCACTGAG I TGGTG G1 CAGCAATATC 
ACAAGGCTCa I CUAT I I ACCTGGAAC I G AGTTGGCTCAGCT GATGGGGGA
AGTGGACCTTAAGTTGCCTGGCGGGGCTGGCCCAGCATCAGGATTCTTCC 
GGTCTCTCATGTCTCTCAAGCGAAAGGAAAAAGGAGTGATATTTGGQTCC 
CCACTGACGGAGGAAGGCATTGCCCAGATATACCAACTGATTGAGTATCT 

acacaaaaacttgcgagtagagggtttgtttagagt 
>251.1

TGGCGGCCGAG GTACCAG CAC AAACCG ggcc a gcct cct aaactgctca t 
ttactgggcgtctacccgggaatccggggtccctgaccgattcagt ggca 

GGAGGG 
>252.1 

GGCCGAGGTACATTTTACTACGCACCCTT ACGCATTCTnTTCTCACCTC 

TGTGTGTGTGTGTGCGTGCACATGCACACACACAAATGGGTGAAACAATT 

CT CACCATaCCAAGAGCCACCGCGCCCTGCCG AGAATTTGCATTTCT aac 

A AGTTC CCAGGTG ATGCTGACACTGCTGGCTCATGGAACCACTGCT GT AG 

TATT TTCCAAATTATCCTGATTCTAAGAACCACCTATGACCTGTGCTGTT 

TTTTCTGTGGTrACTGGCTCATGTCACATAAATrCTTTTAGGATTCAAAC 

ATGTTTGTGATATT ACTCAGTATTTACATCTTGCTTTTACTGCAGCATGA 

TGGA AAAATTAACCACAGGTATATCATAACAAAAAGAACATGAGTTACCA 

TTTTCACAAAGTTCAGATATATTTAAATTAGCCT ATTT AATCT 

>253.1 

CTATTTCAGATTTATTAAAAAACACATAGGTAACGAGTCAGAGCTTTGGC 

taggaatgagttggaaaagaactgaaggcataattccacaggacattcac 
agttgtgtgctagagacagagaggagcaggaaagtgttttagaaggattt 
gcggcggacaatggaaggcccggcttcatcgaattcctgtttgct g atcc 

GGGTGCTAGGGCCGGGATCT(XTTCTGCATTCGGGCGGCAATGCCAGGGT 
>254.1 

AGCTC*£XX5CGGTGGCGGACGAGGTACTCATGGTTGCTGTAAATCTGGCC 
GCX^TTCTGCAGGGTTATGCTTAGCCAGGCTCCTATGAGATCTGGCTATT 
CTGTCTTGTGGATGGTCAGTCCCOGCGTACCT gcccggg 
>255.1 

agctccccgcggtgg<x5gccgaggtacgcgggga7tgtgtgcaaaatcag 

aggggggtgcaagatcctgatitttcaggagttcaagcgacaatggcagc 

ccaatacg^gcagtatgagcttcaaccccagcacaccaggggccagttatg 

ggcctggaaggcaagagcccagaaattgccaattgagaattgtgttagtg 

ggt aaaaccggagcaggaaaaagtgcaacaggaaacagcatccttggccg 

gaaagtgtttcattctggcactgcaggaaaatccattaccaagaagtgtg 

agaaaggcagcagctcatggaaggaaacagaacttgtcgtagttgacaca 

ccaggcattttcgacacaagaggtgoccaatgctgaaacgtccaaggaga 

TTATTCGC 
>256.1

gagagaaatcaactattcaggaccggcccccacctttcct caggagtcat 

TTCTGTrCCGCACAGGCCTGCTGAACTGGGTGCTTTATATAG . 



GCGAGCGTCGTTGGCTGATGGGAGTAGAAGCGAGAGAGAGTCTTCCTCTT 

GGAGGTACAGTCAATTCTGAGGTTTGGGCGTCATAGACTAAACCCAGAAA 

AGAGAACATTGGGAAGTCTT CGGAATATTCTCTATCTTCTT caccaacga 

GTAAGACGTTTTGGAATAATGGGAC 

>258.1 

ACGTGACCGACGGCAACATTGCGGCGCCCAGTTGCGTCCAGCTGCTTGTC 
CGCAGAGGTTCTGATAGAATTTTCTCTTCACCACTCAATCATATCT ACTT 
ACACAAGCAGTCAAGCAQTCAACAAAGAAGAAA 1 1 I C I I I I 1 iC GGAGAC 
AAAGAGATATTTCACACAGTATAGTTTTGCCGGCTGCAGTTTCTTCAGCT 
CATCGGGTTCCTAAGCACATAAAGAAGCCAGACT ATGTGACGACAGGCAT

TGTACCTGCCCGGGCGGCCGGCTCTAGAACTAGTGGATCCC 

>259.1

GGTGGCGGCCGGCGGGAGGCTGACGACAGCCCGGGAGGCGTT AGCGAAGG 

AAGAGAAAAACCGAAGACGAAGCCACTACAGCCCCGCGTACCT 

>260.1 

TGTAAAGCCTGGGTGTGCCCTAATGAGGTGAGCCTAACTTCACATTTAAT 

TGCGTTGCGCTCACTTG 

>260.2 

GGGCGGCTTCTTTCCGCCTTTrCCTTCGGCTTCAACTTGAACTCCGCTTG 

CGCTTCGGGGT 

>261.1 

AGTCATAAAGTGTAAAAGCCCTGGGGTGCCTT AATGTAGTGAGCTAACCT 

CACATTAATTGCGTTG 

>262.1

ATTT ATTT ATT ATGTTGTAGCCG GGGCGGCCGAGGTACCCGATAG AACAT 
GGCATCATCACCAACTGGGACGACATGGAAAAGATCTGGCACCACTCTTT 
CTACAATGAGCTTCGTGTTGCCCCTGAAGAGCATOCGACCCTGCTCACGG 

aggcacccctgaacccgaaggccaaccgggagaaaatgactcaaattatg 

tttgagactttcaatgtcccagccatgtatgtggctatccaggcggtgct 

gtctctctatgcctctggacgcacaactggcatcgtgctggactctggag 

atggtgtcacccacaatgtccccatctattagggctatgccttgccccat 

gccatcatgcgtctggatctggctggccgagatctcactgactacctcat 

gaagatcctgactgagcgtggctattccttcgttact actgctgagcgtg 

agattggtcgggacatcaaggaaaaactgtgttatgtaactctggacttt 

gaaaatgagatg 

>263.1

t^'*' nTmTITTTTTTTTTI IGCAGCCUI 1 1 1 ICI I ACT A6AAGCTA . 

ggcggaaagaggtgttactcagatttcttgaacttgagacgtcaaaggtg 
agacgccagccaaggagaagggatggtcaggg 

>264.1

ggccttt aaagc^jttcgctttggcttcagctttaggaggggcaggagctt 
cc 

>265.1

caaccgggaccccagcttttcagaactgcagggtaacagccatcatgagt 
gaggtcaccaagaattccctggagaaaatccrrtccacagctgaaatgcca 
tttcacctggaacttattcaaggaagagtg 

>266.1 

ttggagctccccgcggtggcggccgaggtac 1 1 1 i cia ggtattggtggg 
caagatccttgttggagtcctccici 1 1 igctgccccactcagaggatag 

'GCAGAGCAGACTGGCAGACACAACAGCACAAGGAATGCAAGATGCATCAT 

tctcactgcccttaccttctttgtctactgggcttctcccc^ 
cccgggcgg 



gagccgccgggcaggtactaccttcaccaactttttcatt^ 
aaagacgagtcttctgatgttctataagcaatatgtttatatgaaagtca 
gaagtttagc«aaaattcggcctaaacagtaataaatgaaaatggaatgg 
aaatcaaagttc 

>267.1

acttt acctcatttcctaccaatcatttraagaguvatttggttgtatttc 
aaagaacaaaacaagac^tttctgtcctgctgtttattttagcgtggtc 
gcggccgaggtacggatacaattccgctgagtt agattcgaaattctaac 

CTCTCC*TCACACGCC«1MSAA^ 

TT GCCA AG caattgactccatcagggtgaccatccagggaagcaaggaat 
ttctcgtgggacctcaaaaggatgtaaagcaggatcatagtttcttggaa 
ctctctgt aagtccaacttggtttcgcggacataattgtccggattccgg
ctcagcatcttcaccttcatctcggttgct cttc 

acattt atatg aaagtcctcactttcagaagcagaaaaggagtaact aga 
tgg gcatt ttct at acca gctaaggctttaaacataacaacgtctactga 

ACTATTTTCT ACTT ACTTTGACTGAATAAGCGAGTGAGATCGTGACTGCA 
AGTGGAAGACCTTCTGGCACTGCGACCACTAAAACTGTAACTCCAATAAT 
GAAG AACTTC ACAAAGTATTGTATATAAATTGGTGTGCACTCAGCAAGCC 
ATGGTu 1 1 1 1 CT GAACCCAGAAGGTGTCAATGACAAAATAT AATACTAGA 
ATGATAACTGTGATGGCAGGCATCAAC>^CCTTTCAGAATAGAAATGAA 
AGAAAAATGTGATTATTAAATTTCCAGACACT AACCCTTG ACAGATATAA 
ATTAAACACTGT AAAG AGTTATAACTTGCTTGATAGTATTGAATTTCTCT 
GAGAAA TTACI ICI I ICI I GCACCTTATAACTTGACATTGTCAGATTTAA 



>269.1 

TTGGAGCTCCACGCGGTGG<^GGCCGAGGTAGGCGGGATAGTGGAGGCACT 

GAAAGACCAGCAGAGGCATAAGGTTCGGGAAGAGGTTGTTACCGTGGGCA 

ACTCrrGTCAACGAAGGCTTGAACCAACCTCGAGCGGCCGCCCGGGCAGGT 

ACAGATGCACAGGAGGCCATAGGGTTTAGGCAAAGGGGAGCACAAAAGTT 

GAAGATGAGGCGCTGCCACCAATGCTGGGACTTCAGGCCAGGGGCAGGAG 

CTGAGGAAGCCACAAGGGAGGAC^TTTTCTGCAGTTG ctgaaccagtagc 

AAAAGCTTTTT^TTCTGCAAAGCTGGGGCaSJ*^^ 

CGCGTACC 

>272.1 

TTGGAGCTCCCCGCGGTGGCGGCGGAGTCCCAC 

CGCAGGCTGCCTCAGGGAAAGGAGCCTGG GTTGATTAACTTGTGTGTCAA 
TGTCCCACCCGTCCCAGGTAACATTTTGCCCCCT GAG GTCCGGGGTAATT 
TAATGGCTGCT GGACAAAACCTCCAAAGTTCTTGAAAGATGAGAAATGAT 
AGCTACCTGGAGTGCAGCTGTACGGGACTTGGCX5TAAAGCCGCTTCCCTC 
AAGAGTAACTACAATCTTCCCATGCACAAGATGATTAATACAGATCTTAG 

cagaatcttgaaaagcccaggagatccaaagagcccttcgagcaccacgc 
aagaagatccatcgcagagtcctaaagaagaacccact gaaaaacttgag 

AATCATGTTGAAGCTAAACt^TATTGCAAAGACXATGCGCCGGAACAM 

ATTCTTCGCCAGGCCAGGAATCACAAGCTCCGGGTGGATAAGGG 

TGCA 

CCCT GTGGT AGGTAA 
>Z74.1 

cagttct agtcagaataa 1 cu g ci catcctccagctccccctgttccac 

aaacgggagctttcgcaccccccattgt acgcgggggaggagcctgagga 
agagggcggcgacggtggtggtgactgagcggagcccggtgacaggatgt 
tggtgttggt attaggagatctgcacatcccacaccggtgcaacagtttg 

CCAGCTAAATTCA 
>274i 

AAAACTCCTGGTGCCAGGAAAAATTCAGCACATTCTCT 
TTTGCACC^AAGAGAGTTATGACTATCTCAAGACT ~~-™-™^ 
>276.1 

CGCGGTGGCGGOCGAGGTAOGTTCT ATTCCTGCTCCTATT AGGTCCTTCT 

TTCI^CCTCCCCCCAOGOGO^ 

GCrCCAAGAGCCTCTOCCGG 
>277.1

GGAGCGGGCCCTACCGTGTGCGCAGAAAGTGGAGGCGCTTGCCTTCAGCT 

TGTGGGAAATCCCGAAGATGGCCAAAGACAACTGAGCTGTTCGGTGCTTC 

CAGGGCCTGCTGATTTTTGGAAATGTGATTATTGK3TTGTTGCGGCATTGC 

CCT ACTGCGGAGGTGCATTCTTCTTTGTATCTTGACCAACACAGGCCTCT 

ACCCACTTGCTTGAAGCCACCGACAACGATGACATCT ATGGGGCTGCCTG 

GATCGGCAT 

>278.1 

TTGGAGCTCCCCGC<5GTGGCGTTCGCCCGGGCAGCTACTTTCATCCATAA 
AGGCCTGCAGCTGTTTCATTGATCCTTGCAGTTCATCCATGACCAACTCC 
ATACAGTCAAAGACrrTTGCTCTGGTTCTGTAATATTTTCTGGTAGTCAGG 
TTT TGTAT TAAG AACTT C ATTCT GAGAAGACGCAAG ATATGTCAT AGGTT 
CCACTTTGACCTCAGTAATTTTGGCCTCAGTTGATCCTCTGGAGAATATC 

TTCTAAAGAAAACTTTGTTATGTATrCCTrCATCTCAGCCACAGATGCTT 

CCAAAGAAAAATCTGATGCTTTTCCATTTGAATCTTCAAAACATTTTTGT 

AGAGTTCCATCAGTTTCCAGTCCGTCTGCA 

>278.2

AATGTTTCAATTCTTGAGAAAGAGAAGATGCTTTGGCTCT AAAACTTTCA 
AG ACTG AAG CCCTT AGTGTCCCTT AG G AAAGGTTCAAGTTT CT GAAT AGA 
GAAC 

>279.1

GCGG TGGCCTCCGAGGTACTACTCTGCACTGTT C 1 1 I C I 1 1 CTAATAAAA 
CTTT CCCT GTCG AACCT ATACT AGTCTTCTGT AAATTCTT CTT ACT ACCC 
T ATGACCCGTGAGCCAACCACTTTCCGATGCCAGGGTTCTG ACACXTT CAC 
CT GGCATAATATAAAGTG I II 1 1 1 I II 1 1 ATACCCTTCCACTTGGAAAGA 
CT ACAGAGGAATCTTGCTCTGCATAGTTCAAACTAAAAAGAGAAGAGTTA 
ATTACCT GAAAAGCAAGAG AAAACAAGAAGGGGTAAATTTTGAACCAAGG 
GAAATCATTTAAGAAGTGTCTGGTATTTTTCAAATTTCTGTCAGTTGTTA 
CATTTGTCATAAGTAAATGTTT AGGAATAAAGGATGGAGACATGCTTATT 
TT ATTT AACT CCCCCAAAATT 
>279.2

AAAAAAAAAAAAAAAAAAAAAAAAAAGTACCTGOCCGGGGGGCCGCTCGA 
>280.1 

CGCGG TGGCGGCCGGAGTGATGCCATCTGCAGTTTTGTGATCTGCAATGA 

MCI I CCCTTOGAGGTCAGCCCATTATCTITAATCCTGA C 1 1 1 1 1 IGTGG 

AGAAACT CCGACATGAGAAACCTGAGATTTTCACTGAGTTGGTGGTCAGC 

AATATCACAAGGCTCATCGATTTACCTGGAACTG^GTTGGCTCAGCTGAT 

GGGGGAAGTGGACCTT AAGTTGCCTGGCGGGGGTGGCCCAGCATCAGGAT 

TCTTCCGGTCTCTCATGTCTCTrCAAGOGAAAGGAAAAAGGAGTGATACTT 

GGGTCCCCACTGACGGAGGAAGGCATTGCCCAt»TATA^X^ 

GTATCT ACACAAAAACTTGCGAGTAGAGG G 1 1 IGI I IAGAGT 

>281.1 



ATCTCCCTGGGAAGGGTGCCCCCGAAGTGAAAGAGATGGCCT GGT GGAAA 

GCC 

>281.2

AGATGGGCCAG GAGTCCAGTTTCTGGAAGGCCAAQAATCGAAGT AGCAAQ 

CTGCAGCCGTTTTCCAGACAAGCATGATGnT^GGGATGCAGAAGAATT 

GACT GGAGGGGCAAACTCXXiATGTGACTGAGGCCCCACT GCCAAATGGCG 

GCATGCTCAGATAGCACCCAAGAATTTGGGGAAAAAACTGGTCCT 

CT 

>282.1

GGGCCCCAGGG^AAAAAACCTTTTGGGCCCCAI II IM I I C CAATTTTCC 

AATTGGGCCTTGGGCCA 

>283.1 
GTACAGCATTGGAAATGGATCTGTCTTTGGTAAAGATCAGCCTATAATTC

TTGTGCTGTTGGATATCACCCOCATGATGGGTGTGCTGGAOGGTQTCCTA 

ATGGAACTGCAAGACTGTGTCCTTCCCCTCCTGAAAGAATGCATTCGCAC 

CAAATAAAGAAGACGTTGCCCTTCAAAAGACCTGGATGTGGCCATTCTTG 

TGGGCTT CCATGCCAAGAAGGGAAGGCATGGAGAGAAAAGATTT ACT GAA 

AGCAAATGTGAAAATCTTCAAAATCCCAGGGTGCATGCCTTAGATAAATA 

CGCCAAGAAGTCAGTTAAGGTTATTGTTGTGGGTTAATCCAGCGCATACC 

AACTGCCTGACTGCTTCCAAGTCAGCTTCATCCATCCCCAAGGAGAACTT 

TAGTTGCTTGACTTCGTTGGATCAC 

>284.1 

TGGCGGCCGCCOGGGCAGGTACGCGGGGGCTCTAAGCTGCAGCAAGAGAA 
ACTGTGTGTGAGGGGAAGAGGCCTGTTTCGCTGTCGGGTCTCTAGTTCTT 

g cacg ctctttaagagtctgcactggaggaactcctgccattaccagcct 
cctttcttgocaaagggagggggaaacat agatttattcatgccagt ctg 
ttgcatgcaggctttatggcttcctaccttgcaacaaaataattgcacca 
actc cttagtgccgattccgcccccagagagacctggag<x>cagagctt 
ttttgctttgcattgtaggagag ggactaagtgctagagactatgtccgc 
tttcctgagctaccgagagcgcccgtgaactggaatcaac i g c 1 1 c 

>285.1 

tggoggccgaggtactaggtcccaaatgtttcaaccgattttaocctatg 

ttttcaagggtattatagaaggggagaggtatcctgtagtgatgtccacg 

tatcttggagtt atgggtcgagttct act acaaaacactagtttttttct 

tcacttactaaatgagatggcccataaatttaatcaggagatggaccagc 

ttttgggaaatatgattgaaatgtgggttgatcgaatggacaacattacc 

cagcctgaaagaagaaaactttcagcttt ggcttt gct ctctcttctgcc 

atctgataatagtgttatccaagataaattctgtgggattataaacattt 

taagtagaaggcctgcatgatgtcatgacggaaagatcctgaaacaggaa 

cttataaagactgtatgttgatggctcatcttgagg a accaaaagtaaca 

gaagatgaagaaccacccacagaac 

gtggoggccgaggtacccgatagaacatggcatcatcaccaactgggacg 
acatggaaaagatctggcaccactctttctacaatgagc r roorro n gcc 
^^gaagagcatcocaoctgctcaggga^ 

ccggggtggoggccgaaaactgatcagactgtctcagatcaaggaaaaga 
tggccagagagaagctggaagaaatagattgggtgacatttggggttata 

TTGAAGAAGGrrTACGCCACAGAGTGTGAATAGTGGAAAAACCTTCAGCAT 
ATGGAAACTGAATGATCTTCGT GACCTGACACAATGTGTGTCC I I U H C I 
TATTTGGAGAAGTTGACATAGCGCTCTGGAAGACGGATt^CGGGACTGTC 
GTATGGATCCTCAATGOCAACGCCATGAAGOCCAAGGATGGTTCAGAGGA 
GGTGTGTTTATCTATCGATCATCCTCAGAAGGTCTT AATTATGGGTGAAG 
C I C 1 1 GACCTGGGAACCTGTAAAGCCAAGAAGAAGAATGGAGAGCCGTGC 
ACGCAGACTGTGAATTTGCGTGACTGTGAGT 



GCCAAACGCTTCCGCAAAGCTCAGTGT CCCATTGTGGAGCGCCTCACT AA 
CTCCATGATGATGCA 



TGGACAG ACTGG CTCAGTGAAGACATTTACTTTGATGGGACGAGATAGAA 
T CCGATAATTTT TCTCATAACCTGAGAGGAGTT ATCCCACGAAQTTTTGA 

ai 1 1 1 mil i rog i aattgatc gtg^ aaaagaaaaggctggagctggaa 

AGAUI I ICCI I rGTAAGTGl ICCI 1 1 ATTGAAATCT AT AACGAGCAGAT A 
TATGATCTACTGGACTCTGCATCGGCTGGACTGTACTTGGC^ 

TGGCGGCCCXXXX5GGCAGGTACGT CGGGGCTCOGTAGGAAGCCTCATCTC 

CCTAACtAGCTCCTTACACAAAG 

>291.1
CTCCGGGTGGCGGCCGCCCGGGCAGGTA C I M I I I I 1 1 I I I HI 1 1 1 1 lb
GGGGAGTTAAATAAAATAAGCATGTCTCCATCCTTTATTCCTAAAGATTT 
ACTTATGACAAATGTAACAACTGACAGAAATTTGAAAAATACCAGACACT 

t cttaaa tgatttcccttgggtcaaaatttaccc c rT C TTsrm ctctt 

GGTTTTCAGGTAATTAACTCT ICICT1 1 IT AGTTTGAACT ATGCAGTGCA 
AG ATT CCTCTGT AGTCTTTTCCAAGT GGAAG GGT AT AAAAAAAAACACTTT 
AT ATT ATGCCAG GTGAGGTGTCAGAACCCTG GCATCGG AAAGTGGTTGGC 
TCACGGGTCATA GGGT AGTAAGAAGAATTTTACAGAAGACAGTCTAGGTT 

cgaaaaagaaagttttatttgaaagaaagaa 

>292.1 

ggccttttggtgacttggtgctcctt ggagtcactggagttctactttga 
atcccactctgacatcaatcgactgccttaattcctggtccagctgcccg 

ACCCTGACTCTCTC<XXKrrCTTrrC<rrCAGGTGGAAGGTTTCCTTTAAGA 

tcacgctgacgtcggacccacggctgccgt 

>293.1

tggc^cggcccgggcaggacgcggggacattc«agtggggattaagaga 
aggaaggctgccttgctggagctgtgtggtcttctccaagtgagagtcgc 
aggcaatagaact actttgcttttggaggaaaaggaggaattc at t t1 i a 
GCAAACACAAGAAAAGCAG I I f II 1 1 icaggtgctgacggccacccacca 
tcatctaaagaagataaacttggcaaatgacatgcacgttcttcaaggca 
gaataattgcagaaaatcttcaaaggaccctatctgcagatgttctgaat 
acctcrrgagaatagagattgattattcaaggaggatacctaattcaagaa 
ctccagaaatcaggagatcgagacattttgtcagttttgcaacattggac 

CAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTTTGGCTGTCCTGGGC 
ACAGAATTGCTGGGAAGCCTCT 

TGGCGGCCGCCCGGGCAGGTACCCGGGAGGCACATT C 1 1 1 I CTACGTGAA 
GAGTTTTGTAAACTGAA U I 1 1 G I 1 1 1 CAGTT CCGGCTCCAG CCATCCTCG 
GGTAGCTTGCCAATAGATGAATCCCACTCGTTTGACCCATGACGCTCCTT 
CTTTGCATTTCTACCTCI I I CCCCACAGCAGTGCATGTCCACCATACCAC 

c tgaga gtc tgtg gaatct aattttctg ttatac ttctttcctt ACACTC 

GCT GCCTTGCTTTCATGAAAACACACTTT CCT ATTGTGATTTATCAGAGG 
CCTTTCCATATCTCAGCCACTATGCTATGACAGATTWATAATTAATAAG 
TGCATTTCAAAGTGAAAACGTTACAAACATGCTTA " 



GTGGCGGCCGGAAGAGCAACCGAGATGA^GTGAAGATGCTGAGCCGGAA 

QAAACT ATGAT C C I GC 1 1 1 ACATCCTTTTGAGGTCCCACGAGAATATATA 
AGAGCTTTAAATG CT ACCAAACT GGAACGAGTATTT GCAAAACCATTCCT 
TGCTTCXSCTGGATGGTCACCtrrGATGGLAGTCAATTGCTTG<XiftAAGCATC 
CAGAGAAGCTGGCTAXXTGTCCTTrCTGGGGCXiTGTGATGGTAGAGGTTAG 
AATTTGGAATCTAACTCAGCGGAATTGTATCCGACT 



G CG G CCGCCOG GGCAGGT ACG CGGG GCTCCCTTGTGAGT AGACT ATGCAA 

agaaaaagtgggccaccatatctggaaactacagtctatgctitgaagcg 

CAAAAGGGAATAAACATTTAAAGACTOCOCCGGQGAOCTGGAjGGATGG^ 

TTTTCCATGGTGGCCGGAGCAGCAGCTTACAATGAATAATCAGAGACTGG 

TGCTCTT G GAGAAAACT AT AGTTGGCAAATTCCCATTAACCACAATGACT 

TCAAAATTTTAAAAAATAATGAGCGTCAGCTGTGTGAAGTCCTCXiAGAAT 

AAGTTTGGCTGT A TCTCT ACCATGGT CTCTCCAGTTCAGG AAGGCAACAG 

CAAATCTCTGCCAGTGTTAACAAAAATGCTGACTCCT 

>297.1 

. OTQQCQQOOQOOOOOQC^GOTAOOCQQGGCGAaQGCrOOGAA QTC TTM 1 1 
TTGGGCGGGAATTGAAACCGCCGCTGAAGCCAACAAGAATTTGAGAACTG 
TAAATACCAAGCCTTGAAAGGGACCATGGTGCGXSCCTGTGAGACATAAGA 
AGCCAGTCCATTACT CACAGTTTGACCACTCTGACAGTGATGATXiATTTT
GTTTCTGCAACTG 

>298.1

TGGCGXKXX5AGGTACTCCCCAGCAAATATTCTTTGTTGGCTTGCTTGACT 

AGATGAGCTGCTATAGTAGTCAATCCTGTTAGACTTGGACCATTGTrTGT 

CTGAAGAACTGGAATCTGTCGCTCGCCCTGAGCACTGTATTTATTCCCC^ 

TACT CAGTCCCAGGGACTTCTCXJAAT AGCGACAACTCTGCGGCCGCCXKX 

ATCTTC 

>299.1

TGGCGGCCGAGGT ACTTCTGTCTTTCCAGTTTTCCACTT CAAACTTCTATC 
TTCTCCAAATTGTTTCATCCTACCACTCCCAATTAATCTTTCCA^ 
TCT GCGTTT AGT AAATGCGTTAACTAGGCTTTAAATGACGCAATTCT CCC 
TGCGTCATGGATTTAAGGTCTTTTAATCACCTTCG^TTTAATCTCTTnT . 
AAAAGATCG TCTTCAAATTATTTT AATCACCTACAAGTTTT AAACTAAAC 
TTTAAGCT GTTTAAGTCACCTTCATTTTAATCTAAAAGCATTGCCCTTCT 
ATTGGTATTAATTCGGGGCTCTGTAGTCCTTTCTCTCA A I I IICI 1 1 1 AA 
AT ACATTTTTT ACTCCATGAAGAA GCTTCATCTCAACCTCCGTCATGTTT 
T AGAA ACCTTTTATC I 1 I ICCI I CCTCATGCTACT CTTTTTAATCTTCAT 
ATTTTCTCTTAAAATCTTAAG 

>300.1

GGCGGCAGGTACTT AAGGTTGACTGGTAATX^AGGGTAACTTCTGATACTT 
ATCACACAAGATGGTGCCTCAGCATTTAAAT AAATGGAGGT AGGGGAGGG . 
CGTGGTQGTAACATACTTTTAAACCAGCGATTGCACAGCAAACCACAATG . . 
CAAGTATTTCTGACTCCCAAGATTGCCGTTTCCT AAAGAGCAATTCTTCT 
GCAGGCAACAGCAAACCTA CC1 I ICC I I G CT AACTGCTTTCAGTAAATTC 
TTGATGGCCTTCGATTCTGCATTCAGAC A T C I C I rCTCACCCI IC I 1 1 1 1 
CATTGTAGCAATGATCTCAACACGTGGACAAAATT^ 

>301.1 

OGCGGTGGCGGCCGGAGTGATGCCATCTGCAGTTTTGTGATC^ 
TTCTTCCCTTCGAGGTCAGCCCAT™ 

AGAAACTCCGACATGAGAAACCTGAGATTTTCACTGAGTTGGTG^ 

ATCACAAGGCTCATCGATTTACCTGGAACTGAGTTGGCT CAGCTGAT 
GGGtXlAAGTGGACCTTAAGTT 

TCTT0CGG1 C I CI CATGTCTCTCAAGCGAAAGQAAAAAGGAGTGATATTT 



GTATCTACACAAAAACrTGCGAGTAGAGG G HIGH 1 A GAGTACCTCGGC 
CGCTCTAGAACT AGGTGGAT< ~ 



ATC 



TTCTCCCGC rGCTGAAATTTCAGTTGCGGGCGCTG 1 C A CCTCAGGAOCCC 
TCCGtXSCGOGT ACGCT GGAT AGCCTCCAGGCCAGAAAGAGAOAGT AGCGC 
G*^CACAGCTAAGGCCACGGAGOG^GACATCTCGGCCCGAATGCTGTCAG 
CTTCAGGAATCCCCGCGT 
>305.1

ACTCAGGTTTT ATCTCTGCACT CCAAGTAGGATGAAAAGT AAAGAGCAAA * 
GGCTCATGTTT GCtAAGTCTGTCCTTTTGTAACAAAAAACCCAGCAGCTT 
TATCAAGCAGAATTCCACCTGT ATTTCTT AACTTGCCAGAGCTGAGTCTC 

^^yGTCCCCCACCCAAAGTGGAGCT ATTGCTAATGAAAAAGATACAATG 
AGAIU I 1 1 ATGAAATTATCTGTAGCTATTAATGTCAGGTTTrTGAAATTT 
ACTGACCTGGAAGAATACTCATAATGCAATGTCAAGT^ - 
AAGAACATTTGCAAT ACAGTTGTATTTATAAA A 1 HIGH I ACACACAA 



CTCAGOCCATGTGAGGTGCCTCCTGCCAA^ 
TCCTGGAGGTTCAAAGAATTGCAGGAGGGTAGAAAAGCACCTGGGTCGGG
TGCAG>CTGCGGAGCGGGCCCTACCGTGTGC<y>GAAAGA^ 
GCCTTCAGCTTGTGGGAAATCCC GAAG ATGGCCAAAGACAACTCAACTGT 
T ^^GCTTCCAGG^KXrTGCTGATTTTTGGAAATGTGATTATTGCTT GTT 
GCGGCATTGCCCTOACTGC«^TGCATCTTCTTr 

AGCCT CTACCCACTGGCTTGAAGCCACCGACACGATGACATCT ATGGGGC 

TGCCTGGATCGCATATTTGTGGGCATCTG 

>307.1 

ATCACCATTATTCCOCTTTAGTCACCTCAGAGGCTTGTTAATGCTTTCT^ 

TGTAATTAGGCTATATCTGGTATCTGTATAATATCTTCAGTTCTTCTTTA 

CCAGGGGTCTTACTCTGTTCTGAAACATGGCACCTCAGGC^ 

GCGCT GGACACAGGAAACTCCTGGGTCCXXXiACTCCGGCT CTCCTCTACC 

SI^IIP 001 ^ AACT CCGCTTGTTTCTCT ACAAAATGGCGCCX^GGTC 

CCCCGCGT 

>309.1 

TTTACAACCACAGCT AATGCAATTTTTTCCATTGTTCXXIATTTTTTTCCA 

AACCTATTGGG 

>309.2

GC AAAGC CCAI Mill lCCATGCATCTAAATGATAGATACAGGCTATGAA 
ATTCTTT ATTCT ATTTGT AGCAGCTTATGCAGGTGCAGCCAAAjCACAAAG 
CTTCAGGACAAATTGTACCTGCCCGGG<X5GCCGCTCT 
>312.1 

ATGCCT ATTTTCACAGGAAAGAGAACACATACCCAGAAAAATG4XXTTGGC 
AATCATCAAAT CACAGTTTTCCAACATCAAT AAAGTGTTTAACT CCTGAT 
TTGAAAGATGGTGTTCCTGGATTGAATATTGAAGAATTAATAGAGAAACT 
TGAGTCTGGAATGGAGGTTATGGATCAGATTTGT GATGTGAGAAT ATCTG 
ACATAATGGATGTAT ATGAAATQAAACTATCCACATTAGCTT CCAAAGAA 
AGCAGGCT ACAAGATCTTTTGGAAACAAAAACT CTAGCCCTT GCACAGGC 
TGATAGACTGATTGCTCAG^TCGCTGTCAAAGAACICAAG 
>313.1

TGGAGCTCCCCGCGGTTGGCGGCTTCCCGGXK^GGCACCTT AGCATTAGAT 
TGAGTT ATGTTGCT AGGAGATGTTTATTCATCAGCTGATCATTAAGCATA 
TGGGGCTT AC TT GGCC COCCTATCAATTTGCGTCAAAATAAATTAATTGT . 
AGACCTGTCTTGTTTT ATGAAAAAGCAATGTGATAGTCTTT AAATTTATC 
TTTCT AAACAAG ACACAAGTTTACACATTACCCAGCACAGTAACCCCTCT 
TGGT ATTGTTTACCT AAAAGGAAGAAG7GTAGGAAAAACTGATATAAGTA 
GAGAGTTT ATTTGGGCCAAGCATGAGgGTTACAACCCAACTGT ATGGAGA 
CAAGTTGTCCTGAACAATACACATTC 
>314.1

•CG6TCGAjGgT*CGCGGGCQGT C C TOQ ^ 

TTAAAGCACCTGGGTCGGGTGCAGACTGOGGAGCG^ GC 

GCACAA^AGGAGGCGCTCAGGAATGCATGAATTGATTAA 

AGAGCTGT AGATGGC 1 1 1 FCTCAAGGTGC1 ICAAGTGCAGAAGGCCAAGT 

GATT GACGC ACACACTTACCl I IGIGI ICCI ICCAGAAAATCCTCAGGGA 

GTGCCTT CAGCTT GT GGGAAATCCXXiAAGATG^KXAAAGACAACTCAACT 

GTT0G1 IGCI I CCAGGGCC I UC I GATTTTTGGAAATGTGATT ATTGGTTG 

TTGCGGCATT GCCCT GACTGCGGAGT GCATC 

>315.1 

GCGGTGGCGGCCTCCCGGGCAjGGACCCTTAGCATTA 
GCTA ^ A ? ,ATGTTT A ^GAGTCAGCTGX4ACTT AAGCATATGGGGCTTAC 
TTGGCCCCCCT ATCAA 1 1 1 GOG I c a aaataaattaattgtagac c t g I C I • 
TGTTTT ATGAAAAAGCAATGTGATAGTCTTT AAATTTATCTTTCT AAACA 

agacacaagttt acacattacocttttaqtaacccct c I iggtattgttt 

ACCTAAAAGGAAGAAGTGTAGGAAAAACTGATATAAGTAGAGAGTTTATT 

tgggccaagcatgagggttacaacccaactgtatggagacaagttgtcct 
QAACAATACACATTCTT*Tr*GCA*C*GTrAT.AAQT*G

AAAGAAGAGGCAGTTCCTAAG 

>316.1

ACAGAGACCTCCTTACTTACCCGCCTTCT CXTTrCGGCTGGAGCTCGG CGA 
GCGAGAGGCXK5CGCTGGCGTTGGAGAGCGACGGOGGCCCCCGCGTAAGO 
GTGGTAACAACGCAGAGTAACGCGGGAATGAAGAATCTTAGGCGGGTGCA 
CCCAGTTTCCACGATGATTAAGGGTCTTTAGGGAATAAAGGATGATGTCT 
TCCTTAGTGTTCCTTGCATTTTGG G ACAGAATGGAATCT CAGACCTTGTG 
AAG GTGAC T CTQACTTCTGAG GAAGAGGCCCGTTTGAAGAAGAGTGCAGA 
TACACTTTGGGGGATCCAAAAGGAGCTGCAATTTTAAAGCCTTCTGATG 
>317.1 

CCCGGGCAGGTACTCTGCAGAAAGTATAGCAGCAAACAATGCCTATAGAC 

AACAG GAAAC AGAACATATACCCAGAAAAATGCCCTGGCAATCATCAAAT 

CACAGi 1 1 lUCAACATCAATAAAGTGTTTAACTCCTCATTTGAAAGATGG 

TGT TCCT G GATTGAATATTGAAG AATTAATAG AG AAACTT CAGTCTGGAA 

TGGTTTTTAAGGATCAGATTTGTGATGTGAGAATATCTGACATAATGGAT 

GTAT ATGAA ATGAAACTATCCACATTAGCTTCCAAAGAAAGCAGGCTAGA 

AGATCTTTTGGAAACAAAAACTCTAGCCC7TGCAGAGGCTGATAGACTGA 

TTGCTCAGCATCGCTGTCAAAGAACTCAAGCTGAAACAGA 

>318.1 

ttggagctccccgcggtggcggccgaggtagtttt attgatgttg aagat 
gagaaatctcctcagactgaaagttgcactgacagtggagcagaaaatga 
aggtagttgtcacagtgatcagatgagcaacgatttctccaatgatgatg 

GTG7TGATGAAGGAATCTGTCTTGAAACCAATAGTGGAACTGAAAAGATC 

TCAAAATCTGGACTTGAAAAGAATTCCTTGATCTATGAACTTTTCTCTGT 

TATGGTTCATTCTGGGAGCGCTGCTGGTGGTCATTATTATGCATGTATAA 

AGTCATTCAGTGATGAGCAGTGGTACGGGTGGGAATAGCACTACACTGTr 

CATCT AGCXJTTGTAGAATAAGTCCCAGTG AACTGATATTCTGCAGAATCT 

TCACT GTTATATA 

>319.1

AU 1 1 1 III 1 1 1 1 1 J | 1 1 tl M 1 1 1 1 1 CAATGTTCAGTTTCCTTT AATGAGC 
CCCATCTCCCTGAAGGGCAGGTGCAGGCAGCT AGGTGATGGCAAGAG ATG 
TTCACTTGAAGATCTT GCCCTGATTGAAGGCTTTGCCCACATGCTGGAAG . 
GCCCCCTOCCAGGAAAAGTACCAGACATCAGCTGCCTCTTCTrCATTTTC 
AGCCAAAGAAAGGGCACGTTCAAATG AGGTCAGAGTCATATCATACT GCT 

gggcatagaagcaacacagccccagattgttaaaaagctggccgtrataa 
atgcccatctgcagcagccgcctgtaaaaccggagagct atttctggctg 

ATCAGAATAGAAGTGGTTGCTTCCAATGCATGCG 
>32W 

TCTTCCACA AAAATACTTTATTTCTGATCT ATACAAATTTTCAGAAGGTT 

Annum atcattgctaaactgatgactt accatgggatggggtccag 

TGCCATGAOCTTGGGGTAC I H I I I I II 1 1 H I IHI I H I IG GAAAGGT 
. CTGCGATAAAC^CTAGCGTGTGC*>ATGGTCACCTG(X^CACTCGCACC 
AGGTTGTCCGTGTAGCCAGCAAACAGAGTCT GGCCATCAGCAGACCAGGC 
CAGGGAGGTGCACTGGGGTTCTrCTGCCTTGCTGCTGGT^^^^ 
>324.1

AU 1 1 1 1 MJIMII 1 1 1 1 mTTTAGG GGGAGTTAAAT AAAATAAGCAT 
GTCTCCATCCTTT attcctaaacattt actt ATGACAAATGTAACAACTG 
acagaaatttgaaaaataccagacacttctt aaatgatttcccttggttc 

AAAATTT AGCCCl 11114,1 1 1 IUCI I GCTTTTCAGGT AATTAACTCTTG 
TCTTTTTAGTTTGAACTATGCAGTGCAAGATTCCTCTGTAGTCTTTCCAA 
GGGGAAGGGTTTAAAAAAAA 
>325.1 



ATGCTTCTGGTAGAGAGCATTTAATCACTCTCAAGTTGAAGGCAAAGTAT 
CC I GGAtaAATCACCAGATTATTTTGTGGATTTTCCTGTTGCATTTTGTGC
CT CCT GG ACACCTCAGGT AAATTCTCCTCAG AGCTCCTTAAT AAG GATTT 
ATAGTCAGI I 1 1 IGGCAGCAATAGAATCACTAAAGGCATTCTGGGATGTT 
ATGGATGAAATCGATGAGAAGACCT gggtacttgcccg 
>326.1 

tgagctc ccgtggtggcggccgcccgggcagga c i rTTTTTTrrrrrm 

TTTTTTTAGGGGGAGTTAAATAAAATAAGCATGTCTCCATCCTTTATTCC 

TAAACATTTACTTATGACAAATGTAACAACTGACAGAAATTTGAAAAATA 

CCAG ACACTTCTTAAATGATTTCCCTTQGTTCAAAATTTACCCCTTCTTG 

TTTTCTCTTGCTTTTCAGGT AATTAACT CTT CTCTTTTTAGTTTGAACTA 

TGCAGTGCAAGATTCCTCTGTAGTCTTTCCAAGTGGAAGGGTATAAAAAA 

AAACACTTTATATTATGCCAGGTGAGGTGTCAGAACCCTGGCATCGGAAA 

GTGGTTGGCTCACGGGTCATTAGGGTAGTAAGAAGAA 

>327.1 

GTGGCGGCCGAGGTACTrAAAACCAAATAAAAAGTGACATTTGAATTTCT 

TTTAAAAGGATTTCCGAGCTCACAGTCAGCTTGCGAGCCATTCTCGCGCG 

TACCAGCACAAACCGGGCCAGCCTCCTAAACTGCTCATTTACTGGGCGTC 

TACCCGGGAATCC 

>328.1 

CCGCGTCCG CCTC T AGTGTCACAGACACTCCTGGGTTTGGAATTTTGTTG 
TTCTCT GTCT CTTT GATTTCCTGGAAG ACGACACCATG ACAATTTCAAAG 
AAAATAGAACAAAATG AAGG AAAAAGAGGCTCT GTCTT AGGAGATTCCTG 
TGAC CAGCCT GCTGTCTGTGGCGTGCCCTCCTGGCCCGGCCTTGGCACAT 
GTTCGY 1 1 1 1 GTGGTTGTTGCCTGGACAGGCAACTCTGCAGGGCTGCTTC 
TCTACGCATCCCTTTGCCTGCCTGCCTGTGCCAGGGGTTGTCAAGGGCTT 
TTGGGT CAGAGTGGGCACCCCTTTCTCCAAGGCTCCCTGCAACAGCTGGC 
CTGTCCCTGGTGGGGCTGACAGCTTTCTTCTT ACCCTGCCAGGCTGGCCA 
AGCCCCAGAGGTGACCTATGAGGCAGAAGAGGGCTTCTT 
>329.1

TACTT T ACAGGATGGCATTTAATAGAGATATTTCGTATTTGCCCCACTGC 

i it 1 1 atttgtacagcatcattaaacactaagctcagttaaggagocatc 

AGCAACACT GAAGAGATCAGT ACT AAG AATTCCATTTTCCCTCATCAGTG 
AAGACACCACAAATTGAAACTCAGAACTATATTTCTAAGCCTGCATTTTC 
ACT GATGCATAATTTTCrrATTAATATTAAGAGACAGTTTTTCTATGGCA 
TCTCCAAAACTGCATGAGATCACTAGTCTrACTTTTGCTT AATTTTATGA 
GAAGGT ATTCTTCATTTTT AATTGCTTTT GGGATTACT CCAGATCTTTTG 
TTTAATTTCTT GACTAATCAGATTTTTAATAGAGTGAAGTTAAATTGTGG 
GTCATAAAAAGCATTGGATTGAGATATGGTTTGCCAGCCTAAGGGTTTAC 
AGGCATTGTCCAAACATTTTTTGAGAACTATATTTATAAG 
>330.1 

CCGCGGTGGCGGCCGAGGTACGCGGGGATAGrrTCACTCACTTTCAAAGCC 
AGCTCAAGGAAAGAGGAAGTGCTAGAGA 

GACTTTT ACGG ACTTGG CTTGTTAGAAGGCTGAAAGATCGAGCGGCCGGC 

CGGGCAGGTACI I I 1 1 III HI 1 1 1 1 1 1 If 1 G G CI 1 10 1 1 IG CTCCTTTC 

TTATGATCAGCCACATTTCTTCGACCTCCTTCTCCTTCATCCTCAGAATC 

TGAGAATTCTTCATCACAAGCTATCCGCTTGTCTGATGCTCGAATAGAAA 

TTCTCTTGTCTGGATCTTCTCX^ATCTTCATCT CCACT GTCTTCATGAACA 

GCATCTT CTGG AATAGCCTGCATCTGGACACCAGGTGCATGAGGTAAGAT 

GCGCAAATTTTGAAACAAACGCTGTTTTATCTTTTC^TATATTTGG^ 

GTTCTGGTTTGTCATGTTTG 

>331.1 

TGAGCTCACCGGGTGGCGGCCGGGTACT AGCAGTTGCCATGAAGGAGGCT 
TTGTTCGATTGTATAACACAGAATCAGAAAGTTTCAGAAAGAAGTGCTTC 
AAAG AATG GAT GGCTCACTG GAATG CCGTCTTTG ACCTGGCCT GGGTTCC 
TG^GAACTTAAACTTGTTACAGCAGCAGGTGATCAAACAGCCAAATTTT 
GGGACGT AAAAGCTGGTGAGCTGATTGGAACATGCAAAGGTCATCAATGC 







AGCCT CAAGTCAGTTGCCTTTTCTAAGTTTGAGAAAGCTGTATTCTGT 

CCGCGGTGGCGGCX^CCCGGGCAGGTACCATCTGACTTGGCAATGTAATG 
ACACACACGTTAGTGTGGGGCACAAACGTGGAATATTAGGAGAGAGCTGG 

CTTTATGCTT AATATTCAA7TCTGCTCCAGTAG AACATGGTACCT 
>334.1 

CCCCGCGGTGGCGCCCGAGTTTGATTTCTTGCAGTCCTGAGCGATGGAGC 

TTTTCAAGAGAACCTTTTTCGAT ATT CATTGCT CCATCGATTG GATCCAG 
TCCTTGTTCAGAAAATTGTTTCAAGGCACTTAAGGCTGCCTGAAAG^TT 

Uf AGAACTAG 
>335.1

CTCCCCAACCTTACCCTACACCCCTC^M^ 

tttccctgctttctcaaaccatgtttgguwxtgcttggaagctccc^^ 
J^^tgtotgtgtggtatcatcagotcaacatctgaagcaaatgtt 

>336.1 

ACTCATGAAGGAGATGGCCCCTTTGGGAGCAACCAGAGAATCACTGAGAT 
CCCAATGGAAACAGGAGGTTCAGCCAGAGGAACCGACTrTTAAGGGATCA 

ACAAAGTCCCCTCTACAGGGGTGCCACCCAGAGGA^ 
CGCTGGCAACAAGGTGTGCCCTGGGGCT ATGAAGAGACCAAGACGCTCCT 

>337.1 

^S^S^IS^T^?^^^^^^**^*^^^ OCACGTG CTG^aAG^OCATTTGTTOCiAGCC 
TTCCCCJGGAGGAGCAGTCT^^ 
* AAA 5TGGAAGGGCCGGGGACCAGCCCTTGAAGAT^ 
CACAGGGCAGCTO^GCTCCT 

JTCAATTACTGGCATCCAGAAATTGTGTATTCAGGGGGCGAra 
TCTGAGGGGCTGGGACACCAGGGTACC WWWW ^ TQQ CCT 

>338.1

GGGGAGGCCAAAAA AAAAGGGGGGGGAAAAAAGCCC^ 
>339.1



I ) i m 1 1 1 1 iTTTAATGCT GAAGATTTAGA7TT ATTTGAAAACACTTAGT 

AAGAGACAjGATAAGTGTGTTTGTCACATTTTX^CAC^^ 
ATTTAATTAAGGGATGATGAAT • ^uwswvnai AT AATTTGAT 

>340.1

GGAGCGjSGa^TACCiG^TGCGCAGAAAGAGG^

CAGGGCCTG CTGA 1 1 F T TGGAAATGTGATTATTGGTTGTTGCGGCATTGC 

CTGATGGGAGTGTATC 

>341.1 

AATACTG CGAGTTTTCCAAG AAATTTTGTAAAGTTGAACATGGGGATCTA 
CTCTTGCCTT AAAACTTTTCTCACCACACCCACCTTCCCACATGCATGAT 

atccaaggtcgacagacctggattagaatcactct AAGCTTT ATGCAGTG 

CGTATTGTATTTTCT GCAT AAGAAAGG G CTGCCTCT AGAACACAGTAAGT 
GTAiTTGCC^GTAGTGAWTTGCCTACATATAGCCAAGTGTTATAGTAT 
ACCAACTTAGTATA 1 1 1 1 1 CAAGGAG AGCTAAACCACCTTTTGTAATGTT 
CGGTTTCTCACTGTTATCTTCCTiTCCTATAATTAATTTATTTTAATCTA 
caaattgacata g G GCT AAAAGCTT caat ATTTT AGAAAATATT AATT AA 
TGTAATTGTTCCCAATTATTAGAAACTTiTTTCCATTTTTCAAAATGTTT 
GCCAACTTCACACAAGTGTGT aaaaatagggctctggattttcaaaagc a 
c^tacatgaataatttattagctattccaggcaagctaagta 

>342.1

gtggc ggccgaggtac aggt ttagtctgaatgcactgtcatgaaatttaa 
actttcattataatactgttttaagaactt acagcatctgctttagaaat 
ggtgttagctacatgtcgac^cagcatctttagccag I IIICI II IGGAA 
gttcatctgatgtoatctggaaactgagtagcacatttgcctgctctgtt 
ggtggcctcacaagcaaggcaaaagcattatggcaatctagggttccaga 
ataaccataaacattaagtgtcactccttggaaaatgacagatgtatgca 
agttt agttgcct c^gagcaatgaaattccaatgaaatgaact atcactt 

CTCCACTTTCCTTGTCCT ATT T TT AAT AAGACAAAGAACATCACCAT ATT 
AAG TTGAAGT 

>343.1

ACA TCAGAG AT GCTC ACACATTCTTTGAGTAGTTTAAAAACTCAiTTTAA 
CCACi 1 1 1 1 ATT CTTTGTATTCAAACCAATCACTGG CAAT AGCTCT AAGT 
AGGTCATCAACT CTCCTCCATGTCTTCTTTCT AATTCTGCCACAGACTCA 
CTTCTTCCGT AAATTAATGGAAGGAAATGAGTGTCTGAGTT CTT AGAATC 
TCAAAAGGCATGAGGATAAAGCTTTCCTGGAGATAATATAAGTGGTGGCA 
GGAAGATTTGGGAGCCAGATGATACTCTTTTCCTCTTAGAGAAACTCTGT 
GGAAGCTCTGCCTATACTGTGGGAAATAAATTCTAGACGCTGGCTTCTTT 
CTGTAGT AAACATGTGGGCCCTTTAAAATGTTGAACCAAAATGTGCTTCA 
AATATAGTTTAGTT ATAAAACATTTATGGGGGAGTATGTATGT GCCAACT 
ACAGAGGCTTCAGAGATGAAGAAACAGTTCTTACCCTAGTGTTGCTTAGA 
ATCTAGT AGT ACT AAGT AATAATT ACT AACATATGCATTT ACT ATATAGG 
CAATACTAGGGTAAAT ATTTT ACATAGATTACCTT ATTTAGT AGCT CTT A 

>344.1

AC U I Mi ill M HI 1 1 1 M III IGTGGGAGTTAAATAAAATAAGCATG 
TCTCCAi IJCTI i ATTCCTAAACATTT ACTTATGACAAATGTAACAACTGA 
CAGAAATTTGAAAAATACCAGACACTTCTTAAATGATTTCCCT^ 
AAATTT ACCCCTTCTIG II I ICTCI I GCI TTTCAGGTAATTAACTCTTCT 
CT 1 1 1 1 AGTTTGAACTATG CAGTGCAAGATTCCTCTGT AGTCTTT CCAAG
TGGAAGGGT AT AAAAAAAAAACACTTT AT ATT ATGCCAGGTGAGGTGTCA 
GAACCCTGGCATCGGAAAGTGGTT GGCTCACGGGTCATAGGGTAGTAAGA 
AGAATTTACAGAAGACAGTATAGGTTCGAAAA 
>345.1

ACACTGCGG CGGGGGCAGAAAAGCTGCAAGGAACAGAACCAGJCAATGCAG 

AAGCTCCTCGAAGGGCCACCATCATCCTGCAAAACACC^^ 

CTCTT AT GCTGT GGCTCTTCTCAAGGATGTCTCAAGGGCTCCGGTGGTGC 

TCTCCTGCTCTATCCGCTGCTGTGGCAAATCCT CT AAAAACAGOGTTTTG 

CACAGCAGAGAGCAAAGTCCGCTTGTT ATTCCACCCGATACGTGAGCTCA 

GTTTGCCAGCTAGTGATCAAGTCCAGCTGTTGGCAAGTTGGTCC^ 
CCTTGTAGACTGACCTGTGGCAGAGAGCTCCCTGGGTCCAGCATCTGTTG
CCCTCACCCTTGACACATGCGGACCCTCCCCAGG 

TTGGAGCTCCACGCGGTGGCGGCOGGGGTACAAGAGATAGAAAGACCAGT 
CCTT GCTGAAAGACAAG TCTGAATG CT CCACTTTTTCAATTCT CTCT CCA 
TTCTTCAGTAAGTCAACTTCAATGTCGGATGGATGAAACCCAGACACATA 
GCAATTCAGGAAATrTGACTrrCCATTCTCTGCTGGATGACGTGAGTAAA 
CCTGAATCn 1 1 GGAGTACCCATTCCCTTGATGTCTACAATATCA CC I I I C 
TTATAGATTCGCATATATGTGGCCAAAGGAACAACTCCATGTTTTCTAAA
AGGCCTAGAGAACATATATCGGGTGCCTCTGCT C I I l UXi l 1 I U TGTTCG 
TCATTTTGGCGAATTACTGGAAGATG 
>347.1 

ATCCCTTAA I MCI 1 1 UCTGGAGCATTTTAAAGCAAATATCAGACATACC 

CTTTCACGCCTCACACTTCAACATGCGGCTTGTTGAAATTCGTGCTCCAC 

TCCAGCAACTGCTTTCAATCGGAGTTCCATGCTCCGCCGCAGTATGCCCT 

AACGCAGCGTTATC 

>348.1

ACTTGACTGCTACAACTTTCAAATTCTTCTACTTACTCCCT 
CTTCACATCTGGGAAAACTGATAGGGAAGCCTAGGTAGGCCT ACCTTTGG 
TGCCAGAGGGAAGCTCAATCCATGCAAGCCCCAGATAATATATGAGAACC 
TCCCCAACCTTACCCTACACCCCTCACCTCCCM 

TTCCCTGCI I IUICAAACCATGTTTGGAjCCTGCTTGGAAGCTCCCTCTGC 
TCTCOCTAGAAAG CTTCATTATGTGAGTGAT ACA I CI 1 1 ICATATCTTGT 
TGGTGTGTGTGTGTGGTATCATCAGCCTCAACATCTGAAGCAAATGTTGG
GTGGGGGGT 
>349.1

CGCGGTGGCGGCCGGAAGGAGGAGAGGTGCTGTGCTGTGTATGAAGAGGC 

AXrrGAAGACTCTGCCAACAGAGGCCATGTGGAAGTCTTACATCACCTTTT 

GCTTGGAAAGATTTACTAAGAAGTCAAATAGTGGGTTCCTTAGAGGGAAG 

AGGTTGGAAAAAACCATGACTGTATTCAGGAAGGCACATGAACTGAAGCT 

TCTGTCAGAATCCCAAT ACAAGCAGTTGAGT GTTTCGTTGCTGT GTTATA 

ACTTCXrrGAGGGAAGCTCrGGAAGTGGCAGTAGCTGGAACTGAATTGTTT 

AGAGACTCTGGGACAATGTGGCAGCTGAAGCTGCAGGTGCT GATCGAGTC 

AAAGAGCCCTGACATAGCCATGCTTTTTGAAGAAG<XTTTGTGCACCTGA 

AACCCCAGGTTTGT CTGCCATTGTGGA 1 I rCCTGGGCAGAGTGGAGT 

>350.1

GTGGCGGCCGCCCGGGCAGGT ACCCGTGCT AAAGACTTTT AGTTCGGCTC 

TCCGAGTG1 1 1 1 1 1 1 1 rCQTCGATTTGCGCACAGAGI M C CIGGTTCACO 

TGGATGTGAGGATCCTTTACTCCAGATCGCCAGCCAG 1 1 1 1 I G I HUM 

TCCTGOGTTGCTGAGAGTCTGGGTTTATTCATCACACCAGGTGGATCTTA 

ATTCC^TATCCCTGAGGCCACTGCAATGAGGCAGAGGAGTGTGCTCCCTC 

ATGAGAAAGGACTGGAGACCGCCCCCAGAAGAGAACGTATCCATGT 

>351.1

CGCGGTGGCGGCCGCCCGTGCTGG I UU 1 1 A TT A 1 1 LHAAAj I IG I I rCTGG 
ATGTGAATGGATTACAATGTA I Mill T AGGGAAATCCTATT ATT ATCAA 
TGTGACTCCACGGGGGAGTCCATGGTGATGATGATGAGGAGGAGGATGAT 
GATGATGAGACACCTCTAAACTTGGAACAAGTTTAAGACTTT ATGAGAGA 
AGAAAAAAAATCACCAACAAGAATTGTTTCAGGAAAAATCATAACT ATOC
TGTGTTC A I I II I I I I I I \ ATAAACAATAAGAAAAAGTTGTTGGATTTTT 
TTTTAATGAI IICI I III 1 GGGGGAGGGAA 1 1 1 m i IG CA G1 1 1 I A TGGT - 
GGAAAATGCAAAAACCAGAGCCAGGTGCATAATCTTGTAATCTGTGGATA 
TCCCTGGAGCAGGACTGATGT 
>352.1

TGGTAACAACGCAGAGTOOCGGGAAGGAGTGGTAACAACGCAGAGTOCCG 
GGAAGCAGTGGTAACAACGCAGAGTCCOGGGAAGCAGTGGTAACAACGCA 
_ _ _ ^ 1TACCCGGGGAAAAAAGGC 
AAATAGAATGAGAACCATATTATGT
>353.1

OG^GTGGCGGOCGAGGTACACCX>GCTTTGTCTCCTG^ 

Ci 1 1 1 CCTT ACTTTGGGCATTAACTGCTGTTGAGGTCTCACAGCCTGATG 

GTCATTATCOCTGAATGGCATAAATCAACAGGCTGT ATGAGCATTGTGTG 

AGATTCTACATGAGGGAGAGCArTTCAAACCCATGACAGATGAGAGAAGT 

TAGTACACT CTCACTGAACTGGGGATGTTTGACTTAAAATGATGGACAAT 

AAGATAGTGAGCAGTAAGTGTGCTCT AGGCT AGGCT ACGAGAGGCCATGA 

GCTtXTCATCTCTTCTCTGTTCTGAGCTCTCTGA^ 

CAGGGGGTGCATTCTCTGTGCCTCTCCTGAGTCrrACTTTCTGCATCATTG 
GTTCTC«XAGCTCACTTC^TAATGTCCTCCTAGGCTGCATTGGAATTGT 
GTGTTGTCT AGACCCATGGCCAACACTGTC^TTGCCTGTGAGGGAG 
>354.1

AG 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 GCCTTTAGAAGGTTAAAATGCCAATA 
TAAAGCT AAAACAGTAATCATCAGAGACAGCTCTAATAAGGCTTT GCT AC 
TGTTTTTACT AT ATAAATCTTTACGTGTT AATGGAAAGAAAATT AATTCA 
TTCTGTTACTCCAI llll! I CTCTCCATATTGTATGCCTGAAGTGAGCTG 
ATGAGGGGCAGAAAGATCATACAGTTAGGAATGAAGACATCAasAATGTTC 
CACT AAACAGATATTTAACTAGATACTATTATACTACTAAGAATAGCAAG 

aatgtctctcaattctgggaatttctcctagctcacacaaatgaaacgca 
gatctccatgaatgctttctaataaatgcttccaggatagtatcataaac 

TCAGCTCACC IMCIICT I ATCAGAATCCACAACTGCTTT 
>355.1 

TGAGCTCACCGGGGGCGGCCCAACXXXX^TCTTCCAX^ 
GACGA AC^CAAAGGGAAG^AGGAGAGGCACCCGATATATGTTCTCTAG<^ 

cttttagaaaacatggagttggtoctttggocacatatatgogaatct AT 

AAGAAAGGTGAT AATGTAGACATCAAGGGAATGGGTACT CCAAAGATTCA

GGTTT ACTCACGCCATCCAGCAGAGAATGGAAAGT(>AATTTCCTGAATT 

GCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGWKCTTACTGAAG 

AATGGAGA GAGA ATTGAAAAAGTGGAGCATTCAGACTTQTCTTTCAGCAA 

GGACTG6TCTTTCTATCTCTTGT 

>356.1

CGCGGTGGOGGCCGAGGTACCT GACTGT GGCTCAGATCTGCGTCGCAGCA 
GCX5AGAGAAGAAATCACTCCATATtX>^TGAGAGGAAGGGTGGCACAGAG 
ATGGTGTCT ACAATT AGAGAGATTTCTGACTCX>CCTT AGCCTAAGCAAA 
CTTT ATGTACTGAQTAACATTTGAAGGTTGT CTTTT AATGGTGGGGGGTG 
1 1 1 1 1 1 CCTT 1 1 1 AAACTACAGTGCTTGCAXXAAGAjGAGGGAGGGACTCAG 
AAAAGGTTAGGGWGGTGAGGGAGACAGTAGATGGCCTGGGATGACTTGA 
GTOATt^TACTATTGCTTGGCAGGTGTCTO 

TTCCATGAGTGACCTACCI I I UCCCAGGAATGGGACTGAGAGGGTAGTCT 

TTGTGCTTTTGTAAG 1 1 IC 1 1 ICICIGCACTTCGACTTACCTTTGAATCA 

GAAAGCAAGCCCAGCAGGTGAATG 

>357.1

ACCATCTGACTTGGCAATGTAAGACACACAOGTTAGTGTGGGGCACAAAC 
GTGGAATATTAGGAGAGAGCrGGTTCCAGCACCAAATCCAGAGTCACTCG 
GGGAAGGAGGTATGGTGGCAACACTTTATGCTT AAT ATTCAATTCT GCTC 
CAGT AGAACATGGTACCACCATTCTTCX1AAGTTCAAAAATTATCTTT GAT 

tc^t tttgttcccx^trcctctaatatgtcaccaattctgct gat acatt 
ctttgt aatctctccatctattttaatctgtrattcacctgagctacaca 
aacattcatctgcacaag gagt attccacgtgctgaaaagacagaggatt 
AAGCCCTOC 1 1 gtggaggcattcacagtctggttttaatacacaaaccaa 
c^ttataatacacagggataaaaaaagtagaggcacttattg^tacct 

GT 

>358.1






acttttctagcagtctgtggccactccatactcagctgaaaacactgttt 

cagccccctctctx>gtgaoctcagocttct ccaggtgtatctcttgatga 

tc^ggagaccagcagccacagctgctgctactcctgcaggaga^ 

ggctgtggtggggggcaggggtgttggaggagaagttgaaaatccgtgtg 

itctctgtccctctgctcctccatcttagcttctggaggagttaaggcax 

caagggca 

>359.1

gtggcggccxkxxx^caggtactggtgttgtgatcggaacgtgtcgatc 

ccctcttctcatcactgctgctccaactggatttattactccgggaatgg 

tagagaataaagatttgtaggaaaggtgctgaactg(xaaggaaggcatt 

tcttgtgccgtgtctggaaccgtgtatcctt act acatcactgaacgaca 

ccaagcaccccatgcacttctgggtccaaccttggcccctgaagaaagac 

actg 

>360.1 

tgcaaaactaaacacgcccgaggaaatttggccagttatccaattgatga 

actagtagatagagccaaacaatcttttcaagagggtgtttgtgagatat 

ggttgaccagtgaagacacgggggcttatggcagagatattggcaccaat 

ctgcccacactcctgtggaaactggttgaagc^ttcctga^ 

gctgaggcttggcatgacaaatccgccctatattttagagcatctggagg 

AAATGGCAGAAATCCTTAATCACCCCAGAGTCTAGG C 1 1 1 1 U I (j CACATA 

CCAGTCCAGTCTGCCTCCGACAGCGT 

>361.1

TGAGCrrcACCGCGGTGGCGGOCGAGGTACTT AAAACCAAATAAAAAGTGA 
CATTTGAAI II C II 1 1 AAAAGGATTTOCGAGCTCACAGTCAGCTT GCGAG 
CCATTCTCCCGCGTACCAGCACAAACCGGGCCAGCCTCCT AAACT GCTCA 
TTTACTGGGCGTCTACCCGGGAATCC 
>362.1 

tgagctcaccgcggtggcggccgaggtacgtatgcacag<^ 
ataaatgtatgtgtcctgaatttcagagctt aataatgaattatggaact 
tgataatgattggatcaggcagacaagaoctxmtcxgtcctaatatcaga 

AAAGAGACAAGTAGACATTATG IGC I I CC I GAGGTGAGGCAGTAGTAAGG 

AAACAACATCACACATGTAGCAGTCTTGGGAAAAAAAATGT AACCTGTAT 

CTCGTAATGAGGAAACAATCAGTAAAAAAGTCTAGATTGTGGGACATTGC 

ACAAACTTGCCTGAACTCrTTAATAATGTCAGTG'TCATGA^ 

*CACACACACACTGCAGATCATACAGAAACACCACOOCAGCACCCACCAC 

TCAGACACACACAAAAGGGCAACTCT aatcaattaaaggaaacaaaagag 

AATGACAACT ACAT AT AACGT AT AATTCTTGATTGGATCCTGGATTTAAA 

AATAAACAGCTATAAAGGATAT 

>363.1

CT O CCCGCGGT GGCGGCCGAGGTACTITAAAACCAAATAAAAAGTGACATT 

TGAAI I ICI IT I AAAAGGATTTCCXaAXXnT^ACAGTCAGCTTGCGAGCCAT 

TCTCCOGCGTACGAGGACAAACCtaGGCCAGGCTGCTAAACTGCrCATTTA 

CTGGGCGTCTAOCCGGGAATCC 

>364.1 

cgggtggcggccgggtcaacgcagagtoccq 

CAGAGTCCCGGGAAGGAGTGGT AACAACGCAGAGTCCCGGGAAGCAGTGG 

taacaacgcagagtcccgggaagcagtggtaacaacgcagaggct^ 
cacagcccagggtgcccgggactgaaaa ciuji ic accagococctccac 



gggaaaaagttggaggatgaaggaattgaaaaaaaaaaaaaaaaaaaagg 

TT 

>365.1

accaagcact gggt aaggcacttttgtggagcatt AGACAGTAACOCTCA 
aggagcta gaga accg gatgggagac atgagoggtaattaactcacttgt 
tccccagagtttct atttgttttgttttctttttctgtgactt attttoc 
tattttctttcctccatgtaattttcact atggcccaactaatat aaaca 
cctggaaattacaaggaaaaaaaatttn roctct aataactttocaaatt
tgtggaatatttatttgtaatagcagttatcagttatgcttatatagcat 
taaaaattctoctcctttgactacagacacaaccacagtgtggritctaat 
catggai^tatcagt aattrttagtaactgaattttgaggacatttct ct 
gtttagcatgtatgcaaactgatatgtaatccggggttccaaag 



tgagctccccgcggtggcggccgaggt actttgcatccttcaacccaatc 
aagctgacactcagtattaaccatcacaaggcgtgaggacagatagctgc 
atccgovaaatagagaaccaagaaatagtcccacaccaaagtcaggatca
aatgattcctggacaagccac 

>366.2

tgaacccccctccgtcaccacacacaaacgttaatttgagatggattgca 

AACATAAAAGCTAAAACC^TTAACACTTCTTQAAGGTAAC^TAGAATATr 

ttgtaatgttatgataggcaaaagtctctt 

>367.1

accgcggtggcggccgaggtacattgagattcaagagaaaagtcacagca 
ggtctcagctcctccagcaggccttatgtaatgctaagattt^ 
gatgaagttgaactgatgaactggctgaatgaagtgcatgacaactgagc 
aagctctcagtccaggattacagcactgaggggctatggaagcagcagtc 
tgaact tcgg gttctgcaagaggacatctt actcaggaaacaaaatgtag 
atcaggcttt agtaaatggrtttagaact actt aaacaaaccaga gat 
gaagtttt aat aattcaagataaattggaa(kxattaaagcaaggt actg 
ccagatacgaattgagcataccacaaaaaagttctcattttgtgtcctcc 
catcccattctcctcactaaccaaaggct aggaatt atct 

>368.1

CAATG TGCCAGGCACCTT ACAAGACACAAATATGCT CTT AT AGGCTGGGG 

AAATAAGAAAATATGAATGAAGCAACCCAGGTCTTGAGCCAAAGAATTAC 

CTGGGGTOCGTTGAGTTCAAATCTGAAAA I I I CI U I CI I I C AAGGTCAGC 

ATGGGCCACAAAC 

>369.1 

ACGCGGGGGI I ! CCGGTTTGGGTOTGGCCGCATGGCGTGCTGGGGTGCAG 
GT GGWXiAAGGGGGOGTTACrrGTTGCGACTGGCATCCXiCATCOGGCAGAT 
GT AGATGGAACCAAAGCCAGAAGTTACGOGT CACCCTTGCT CT ACAGCCA 
AACATGCAGGACTCTAGTAAOCCGCGAAATGATGGGATAGCGTTGCAAAT 
CCTT AAAAGAGTCTTAACGGAGAAGGAAAAATGTT ACATTGTCAAAGTCC 
GAAAGCC 1 1 1 CAGCCTGAAGCCAGGAACAATTGTTCAAA b 1 1 ICIII GGA 
ACATCAAGGAAGGAAATCCAGATTTTACTTT AAGTGCAATGGGGAGTCAT 
TAAGGAI 1 1 IUIGIAGATACAGCAAAAAGACAACAATCTTCAAGCCACAA 
TGGCCCTCACCAGAACCCAGOCATGTGGTCAGCCTGATCT CGGACTTCAC 
AGCCAGCAGAACTGTGAGAATTAAATC 
>370.1 

GTGGCGGCCGAGGTACTTAAAACCAAATAAAAAGTGACATTTGAATTTCT 

TTrAAAAGGATTTCCGAGCTCACAGTT^AGCTTGCGAGCCATTCTCCOGCG 

TACCAGCAGAAACCAGGACAGCCT CC I AAGCTGCTCATTTACTGGGCATC 

TACCCGGGAATCC 

>371.1 

ACGATTATTTTAAACAAGCCTACXrrCCCTGACTAACCGAGTGGAAGGTGT 
GAGTGGGACTACAAATTCACAAAAGAACTGT AGCCTCAGATAATCAAAGG 
AGAGAAGGTCAGATGCAATCACTT^TGCATGCTAGTAATTCTCAAACtnT 
CGTTTT CAGAAACGATTGGATTTTCAGATAGATTTGCAGTAAGAGAATAA 
CAAfiTCTTrATTTT TTTGATO CCAA CI ICI I I d IGCAC AI ITTTCTTC1 
AGC TATATTT AATATCTG1 rCTCOOC^CACACTTCCTAATCTACATTTCA 

caatctttttccacttcaci i ramc rocA 

>371.2 

AGAAATCT ACCTGGACAGAATAGCA I L H 1 1 1 1 1 1 IC 00CC 1 GACCCTTG 
M ' ' " " ATCCTAGGATGGAC 
ACGCGGGGAT6TCTCTTGTCAGCTGTCT7TCAGAAQACCTQGTGGQG CAA
GTCCGTGGGCATCATGTTGACCGAGCTGGAGAAAGCCTT gaactctatca 
TCGACGTCTAC»M>AGTACAAGAGATAGAAAGACCAGTCC7TGCTGAAA 
G ACAAGTCTGAATGCTCCACTTTTTCAATTCTCTCT ccattcttcagtaa 
^JP^^^^^aatgtxxk^tggatgaaacccagacacatagcaattcagga 

AATTTGACTTT CCATTCTCTGCTGG>TGACGTGAGTAAACCTGAATCTTT 

ggagt 

>373.1 

acgcggggagaaggaatggaaagcctggagaaagaggatgaaatgacgga 

TGAAGCAGTTGGAGACTCTGCTGAG AAGCCTCCTT CT ACTTTTGCCTCAC 

ctgagactgctccagaagtggagaccagcagaactccaccagcctgtgaa 

ACCACGAACCCTT CAATCAAGAAAAGACCTTTGATCAGGAGAAGACTTCT 
CX5TCTCATTTCTGGGGACACATTCAGGATTTCTCCAAAGCAGGTGAAGGT 
>374.1 

CTCACGGAGGCACTTGCGAGAGTTAATGGGCAGATGGAAGGAGATGGCAA 

ggaccaatctggggccgagcaggaacaaaagcagcaacgct aacggaaaa 

^ G ^^^^ CTG ^^GCCAGACAAACCAGACATGGTC^CCCCGC 

gtactccttatacttattaaacacaaaattaattgtaaaatagcctcagg 
caggtccttcaggaggtatccagaagaaggcattgtgatcataggagctg 
atggctccgcctgggttactgcccctgtagactrccagtgggacaggata 
tggaggtggaagacagtgacatggatgatccggacc 

>375.1

S^T^^?F*^^ AOTCA ^ G ^^ TCTCTGGA 6CCTAGGAATC 

attttactggaaatgttctcaggaatgaaactgaaacatacagtcagatc 

TCAGGAATGGAAGGCAAACAGTrCTGCTATTA7TGATCACATATTTGCCA 
GTAAAGCAGTGGTGAATGKXX5CAATTCCAGCCTATGACCTAAGAGACCTT 
^I CA ^!^ GCATGC ^ CA ^ TGATCCAA GCAGAAG^ 

GGC J^ G J G <* G i£S^^ 

GATGATTATCTTGAGAATGAAGAGGAATATGAAGATTGTTGTAGAAGATG 

AAAGGAAAATCCTGGCAGAG 
>376.1

CAGGTCACAGGTXn'CGAAAAAGCGGGTGGTGCAATGCTCCATGGGGATGA 
GGGGAGCACG^GTGGAGCCAGCTCGGT 

G £ c IJ c ^ A ?£l AG J^^ T ^^ 

AG^ACACT^GATAT TAACCTA CTACTCAG H U G » G I GGCAAAACAG 

GGACTTAATTCATGCTCTACAATGTTAGTTGAATAGGTGAGTGAGAGAAT 
TTGTTATTCTTAAACCATTACTGTTTGTT AGTGAG 

>377.1 

TGAGCTCACCGCGGTGGCGGGCGGACGGAGGAGAGGTGCTGTGCTGTGTA 

TCACCTTTTGCTTGGAAAGATTTACTAAGAAGTCAAATAGTGGGTTCCT 

AGAGGGAAGAGGTTGGAAAGAAC^TGACTGTATTCAGGAAGGCACATGA 

ACTGAAGCTTCTGTC^T^^ 

TGTGTT ATAACTTCCTGAGGGAAGCTCT GGAAGTGGCAGTAGC 
>378.1 

S?^^ G J! c ^ GG Jii CG !S GGTOGTCAA ^^ 

CCT cctgtattaccgccgagtggacct gctgtaaaccctgtgtgcgctgt 

GCoiun^iii f AGTGGCTCn TAGAGAGAAA C 'I CI I I C I C CCTTTGCAAAA 
ATGGGGCTTAGAATTGAAACAGGAGTATCGCCTTTGTGGGTTTCGATGCA 
ACAAACACGAGCTTTCTl li T I UAC J iCTAACT I i 1 CAAATCAAAAT CAT I
TGGTTGAAACAG ACTGTTGCTT GATTTTAGAAAATACACAAAAACCC ATA 
TTTCTGAAATAATGCTGATTCCTGAGATAAGAAAGTGGATTTGATCCCCA 
GTCT CATTG CTTAGTAGAATAAATCCTGCACCAGCAACAACACTTGT AAA 
TTTGTGAAAATGAATTTT AATTTTT CCTTT AAAAAAGAAATTTTT 
>379.1 

AGCCAGCCAATAAAATATAAACTCCATTTGTCTTAGTTATATAGAACTGT 
GTTTCCAGCTTAGAAAAAGTCAAACCAATGACTTGTAG AAGAATCT ACTC 
TCATTTTTT ATTCAG CCTCTAGAACATG GAAGCTTT AAAAGTGAATTGGC 
TAAATAGGCAAGACCTTCTGAAAGT7AACATCTTAATGATTAAAAACAGT 
AAGTACGCACAACCGAAGCGTAGAGTCACACTTGCAACAAAAGGTTACAA 

tattgtaatgggctctgtccggttctgcttgtccagctggaccatctatt 

TCATCCTCCTCCTCTGAGCT GTCATTTAATTGCTCATAACAGTAGAGATC 
AGTTGTCTCTGGTTGCAAATCTAACATATATTTATGCAATGTAGGGTGTT 
CTCCATGCATGATTACAGCTGGGTTTCTCTACGTGTTCTTGATGATCTGC 

aa caagag atacctcgaccggqccaccggccccttatattatggaatctt 
tgctttttg gccagaggtctttgctttt 

>380.1 

GAGGTACGTTAGCTCATTTTCCCTTAAGCGGGTTGTGACGTCGTTGAAAT 

TGCAACGCTCAAACTTCCAACACTTGGTATACACTTGTAACCCAGCTTTG 

TTAAT6AGACACGCATCAAAATCAGATGAACAATTGACGGCTGTTTTGCA 

GTCAGCAGTTGGGTTAGGACAGTTGTAGCACTGCAGGCTATGTCCTGAAT 

G 

>381.1 

ctcaccgcggtggcgg ccgaggtacaccatgtgaagactggactt aaaca 
gctacacgaccagatgccgagagagaggctggaacatagccttccctttg 

GAGGTAGCCT GGCCCGGTGGGCACTGTGATCTCAGACTTCCAGCCTTCAG 

aactgtgagacaatattttattgtttaagccacttattttttggt 

>382.1 

ACTACATGAGTAATTGCATAAATGGACGATGTCTTTCCTCTACTTTTAAT 

TTCCAATGACT 

>383.1 

CACGGCTCTCCATTC I 1 C I I C I I G GCTTTACAGGTTCCCAGGTCAAGAGC 
TTCACCCAT AATT AAGAGCTTCTGAG GATGATCGATAAATAAACACACCT 
CCTCTT AACCATCCTT G GGCT 
>384.1 

GACTGCAGGAGATGTGGGCCGTGCCAAAGAGATGGATGAGACTGTTGCTG 
AGTTCATCAAGAGGACCATCTTGAAAATCCCCATGAATGAACTGACAACA 
ATCCTGAAGGCCTGGGA 1 1 1 1 1 1 G I (J I GAAAATGAACTGCAGACTGTAAA 
TTTCCGACAGAGAAAGGAATCTGTAGTTCAGCACTT GATCCATCTGTGTG 
AGGAAAAGCGTGCAAGTATCAGTGATGCTGCCCT GTTAGACATCATTTAT 

atgcaatttcatcagcaccagaaagtttgg gatgtttttcagatgagtaa 
aggaccaggtgaagatgttgacctttttgatatgaaacaatttaaaaatt 
cgttcaagaaaattcttcagagagcattaaaaaatgtgacagtcagcttc 

AGAGAAACTGAGGAGAATGC 
>385.1 

GAGGTACTCCGTCTCAGAGGAGGGATGCAAATCTTCGTGAAGACACTCAC 

TGGCAAGACCATCACCCTTGAGGTCGAGCCCAGTGACACTATCGAGAACG 

TCAAA GCAA AGATCCAAGACAAGGAAGGCATTCXJTCCTGACCAGCAGAGG 

TTGATCTTTGCCGGAAAGCAGCTGGAAGATGGGCGCACCCTGTCTGACTA 

CAACATCCAGAAAGAGTCT ACCCTGCACCTGGTGCTCCGTCTCAGAGGTG 

GGATGCAGATCTTTGTGAAGACOCTGACTGGTAAGA 

>386.1

TCGCCCGGGCAGGTACTCCCTGATAAAGGGGAATTTCCATGCCGTCTACA 
GGGATGACCTGAAGAAATTGCTAGAGACCGAGTGTCCTCAGTATATCAGG 
AAAAAGGGTGCAGACGTCTGGTTCAAAGAGTTGGATATCAACACTGATGG 
tgcagttaactt caggagtcctcattctggtg at aaagatgggccgtggc

agcccacaaaaaaagccatgaagaaagccacaaagagtagctgagttact 

gggcccagaggctggg cccctg gacatgtactctcagaatgtttgtcata 

tgcttcttgcaatgcatattitttaatctcaaacgrrrcaataaaaccat 

ttttcagatataaagagaattacttcaaatt 



gagtaattcag aaaaaa ctcaagaatttaagttaaaaagtggtttggact 
tgggaacaggacttttatacctctttt actgtaagaagtagctcggcccg 

CtCTAGAACTAGTG 
>389.1 

GATGCTGTTCCCCATTTCTCCACTAAAACGCCT GCTTTTCTT AACTCCAC 

ACCGAAC CAACCT GAAATATrTTGGCCCAGAATGCCAACAAGAATTGAAG 

AAAAGATGCTTTACAAAAATAACAATATAAAAGCAAATTATATTATCCCT 

TTTATCTCCATTCTT ACATTAAAAAAAAAAAAA 

>391.1 

ccgaggtacgcgggatgggatttctgaccatttgccctgc^cttgcaaa 

ataggtct aatggcaggatggtgtcataattaaggct accaagactgccc 

at tgtt oc aggct g ggca gttcataatgggggcagacaatagtgcaaaaa 

aattttacattttatctttagagtgtcagggtcaaat7gatttccatggt 

tgaggatgtagccaagtgtggaatcaggtggaataggtggagagttgccc 

atagtggtttggaaaagagaagaggactttgaaaagtggagggctcatta 

GGTGACCCAAATTTTACCTGGGGCATCCCCCCTTT agggccccaacttag 

TCT GTCAGACATCTCTGACCTTAGATGGGTGCTGGCACCACTTTGGAATG 

G TT CCCTCCATCACTGAG GACCTGACTT AAAGTTTTTCT ATCTCACTTAA 

AACAACCCTTTAACGCTCTCAACTTAGGCAATAATAAATTCCTTrTCATG 

AATTCCCTTCACCACCATGCACCACACAGACCACATGCCCG 

>392.1

AGCGCGGGGAGAGGCCGGTTTGCAGTATTGGGCGCTCTrCCGCTTTCCTC 

GCTCACTTGACTCGCTGCGCTCGGGTCGTTCGGCCTGCCGGCCGAG 

>392.2

ATTCAGCTTCACTCAAAAGGGCGGTAATTACCGGTTTATTCCACCAGGAA 

TCAAGG 

>392.3

CC 

>393.1 

ACAGGACACAG GCACTCCTTTGTCTGGT AGAGAGGAGGAGGGGAAATGGA 
GC TATrCC AGGATACAAGGGATGGCACTGAGGGATGCATAAGTCCCCTGC 
CT CCCTTGTCTCAACATGTTCTCCTCTGCCAGCCCAGTCAGCTTGGGGAG 
CT AGGTATCAGAAACCTGAAGQATCCAGCCCGCTTTGTCCTACTAGTGTC 
TATAAGTCTCTGTCCTGAGATCCTGGGGCTCCTCCT ATTTCTAGAAGGGA 
TGAGGTGCCATCAAAAATAACTTGGCTGGTGTAAGAGTTTAGAGAAGGAA 
GTCACACCT GTAGCCTGGCTGGCAGGCAGGTGGACATGAGGCTGAGAAGG 
GAAGCCAGATGTCAGAACATACTAGGCT AGCATGCCTGCT 
>394.1 

TGGCGGCCGAGGTACCAGGCTGGCGACAGGTGCT ACCAGGAGTGGGCTGA 
GGGGAGAAAAACTATCTCOCACTCTTTT GGCCCAGGCAATGTCAACGACT 
TCCACATTCCCTGGCCCACTGGCTGA6GAACO0CAGGTTCGGCTCTGTAT 
AAGGACCCTCCCCTCCGAACCCCAACCCAGAGTGCAGTGCAAATCAACCA 
ACAATTT ACT GGTGGAATGGGAATCAAAGGAAACAGTTAAACACCAAACA 
Mi iCi lAAAG CCAAAAAATATTTITCATGGAGTTGAACATrrTTOGAGT 
GTGTTTTTTTCAAGTQTAAAAGCAQTGAC AI I I » G I I CAAACAQAAGCAG 
CATCT AGGAATTCTGGGACTTGGGTTCTAGGGGGTTACAGGTATGCATCA 
TGGATTCTTCTCCCTOGTATTTAAAAAGA 
>395.1

ATCTT AATTTATTTAAAGCCATAGATTCAGTTTAGCTTTAACCT AGACAG 






AAAGTGAAAAGCATTTTACAAGTAGAAGAGGCAATGAGAAATAAGGCAAC 

AGATA ATAC G TCAAA GCTGGAACAAGGGCAGAATCAGAACGTGTCTGGCT 

atcagctttg n I 1 \ GACTACTAAGGCCAACCTTTTTATTCCT CTGGATG 

GTCTGC AGACCA AGTTGAG^ATTTAGGCAAAAGGATTTCCAAATGGATCC 

CTATACAI 1 1 1 CAGAAGATTCAGGTTGAGGAAGAAGCCACAGAGGGCTTG 

TGATGAACCCAAAGGAATCTTTAAAGAAA 

>396.1 

TGGCGGCCGAGTACCGCGGGAGCTTCTCCTTGCCAGTTTTTCCCAGCAGG 

ACCCTCTTCTTGTTTTGAAAGATGGTCX5GCTGCTTTTGGTAGGCACGCTC 

AGTCTGAATGTCCGCCATCTTCC 

>397.1

ACCCTCTTCTTGTTTTGAAAGATGGTCGGCTGCTTTTGGTAGGCACGCTC 

AGTCTGAATGTCCGCCATGTGTCC 

>398.1 

CTCACCGGGCGGCGGCCGAGGTACAAAATTTAGAGGTTTCXXXrrTTATCA 
ACAAGAGACCCAGGTGCCAGCATGTTACTACCAGATCCAGTTCTTCTTAG 
GACAGTGTGGCTCAAAGGGATGAGACCTTCCAGACACTGGTATCTGAGCA 
TCTGGG(XrrGCCCCTGAGTTGTCAAGAAATTTCTTATCTCTGAAGGAGTC 
CAG ACAGGAATGCTTCCACTGCTGGGTG GGTGCTCGCCCCTCTTGCTCCT 
T AAGCGCCCG GCT CACCCCCTTG CT A G CACAG GGTGTCTT AGACAG TTT A 
TGGGACTTTT CT GTGAACTACCTGAGGGCAAGAACCATGTCCCACTCCCT 
GCTTGCTCCTCAAATATTTTAT AG G AAAGCAGTCCAC AGT CT GACACAG A 
GGAAACATGAAGTTTAAGTTCT AGCCCTATG 
>399.1 

GCCGAGTACTCGGGGAGAGAGGAAAAGAACACAGATCT CGCATGGTTCAG 
A 1 1 1 1 1 (J 1 1 1 1 1 AG GTCCAG GAG TAAGATATATCAT ACGAAAATGAAAAT 
TATAATTCTTCTTGGATTCCTGG GAGCCACATTGTCAGCCGCACTT ATCC 
CACAGCGTCTCATGTCTGCCAGC^AATAGCAATGAGTTACTrCTTAATCTT 
AAT AATGGTCAACTTTT GCCACT ACAACTTCAG GGCCCACTT AATTCATG 
GATTCCACCTrrCTCTGGAATTTTACAACAGCAGCAGCAGGCTCAAATTC 
CAGGACTCrrCCCAGTTCTCTTTATCAGCT CT AGACCAGTTTGCTGGACTG 
CTCCCAAATCAGATACCCTrAACAG^AGAGGCCAGTTTTGCCCAAAGGAG 
C 

>400.1 

GTACAGACAGTG CTTGATGTTCATAAAAAAT ACAATGCCCTGGTAATGTC 

TGCATTCAACAATGACGCTGGCTTTGTGGCTGCTCTTGATAAGGCTT GTG 

GTCGCTrCATAAACAACAACGCGGTTACCAAGATGGCCCAATCATCCAGT 

AAATCCOCTGAGTTGCTGGCTOGATACTGTGACTCCnfTGTrGAAGAAAAG 

TTOCAAG AACC CAGAGGAGGCAGAACTAGAAGAGACACTCAATCAAGTGA 

TGGTTGTCTTCAAGTACCTGCCCGGGCGGTCGAGC6GCCGCOCGGGCAGG 

TACGGGGGGGCTAACCAGGCCAGTGACAGAAATGGATTCGAAATAOCAGT 

GTGTGAAGCTGAATGATGGTCACTTCATGCCT GTCCTGGGATTTGGCACC 

TATGCGCCTGCAGAGGTTCCTAAAAGTAAAGCTCTAGAGGCCGTCAAATT 

GGCAATAG 

>401.1 

GCGGCCGGTTGACCTT GTATGTCACGAGCAATTAGGAGAGTCAGAGGATG 
AAATAGATGAACCCGACCATGCAGTTAATCACCAACATCAACTACTAGCC 
AGACGGGATGAAOCAGAGGGTCACACAATACAGTGTTCCTGTTGTAAGTG 
TAACAACACACTGCAGCTGGTAGTAGAAGCCTCACGGGATACTCTGCGAC 

aactacagcagctgtttatggactcactaggatttgtgt gtccgtggtgt 
gcaactgcaaaccagtaaoctgctatggc 

>401.2

TATTGGAAAACCTT ACl III! { CAATGAGCATTTTTGCATCAAGGGGTAA 

- caggqacattagg g i 1 1 1 m, 

ACAC^TATCCTCTGTGGGAAAAACTGCTCTCAGAGTGTGCACTCT 
CAAGCCAGCGCTCAAACTGGAAAAAGTATCTCAATGTCCTGAATGTGGGA
AAACCTTTAGCCGAAGTTCTTATCTTGTTCGGCATCAAAGAATCCACACA 
GGCGAGAAGCCTCACAAGTGCAGTGAGTGCGGGAAGGGCTTT AGTGAGCG 
CTCCAACCTCaCTGGCCACCTACGAACTCACACAGGGGAGAGGCCCT atc 
AGTGTGGGCAATGTGGGAAAAGCTTCAACCAGAGTTCCAGCCTCATTGTC 

caccagaggacccataccggggaaaagccttaccagtgcattgtctgtgg 
aaagagattcaacaacagttcccagttcagtgctcaccggcg 

ttggagctccacgcggtggcggccgaggtaccaaattaagtattaaaatg 
aggattgaactggggcaaacaggttattgtgaaaacagtcaatatgtaag 

CTCCTTCAAG GGAAATCAACTACTGTTCCTCAAGATTAGAAGATGTCCAC 

ACTCrrTGCATTACCTCCCTAAAGGAGGAAACACCCATTAATTTTGGCTT 

ATGGAATCAATATGGAGTG GAAAT ATGAAATGAGGAGATGTTTTAGAAAG 

CAGGACATATCTACCTACCATTACTGGAATTAAAATGTATCCTCTGGGCC 

CACTCCATTGATTCCGATCTGAGGTGAGGAGGACTAAAAGCAGCAGCAGG 

TTACAGAAAGACTGAATAAGATGAAAGTATGCTACGTATGTCTAGCTGGG 

GAAGGGGGGATCTGGAAAAAA 

>404.1 

CCGCCCGGGCAGGTACGGACGCCCAGGGATCCGCGCCGAAGCTAGCACGC 

AGCCTACCGAACAGTCTACAGAG CCGACCAAAGCCCCCGCGT ACCCAGAG 

GAGTCGCTGGTGAGTGGGAGCTCAACCCTGTTCAGTGCTCTGCTCATCAA 

GTGTCTCGAGAAGGAGGTTGCGGCATTGTGCAGATACACACCCCGCAGGA 

ACATCCCTCCTTATTTTGTGGCTTTGGTGCCACAGGAAGAAGAGTTGGAT 

GACCAGAAA 

>405.1 

GCGGTGGCGGCCGAGGTACGCGGGGGGCGGCGGCGGAGAGAGCTGGCT CA 
GGGGGTCCGCTAGGCTCGGACGACCTGCTGAGCCTCCCAAACCGCTTCCA 
TAAGGCTTTGCCTTTCCAACTT CAGCT ACAGTGTT AGCT AAGTTTGGAAA 

GTCGTCGTTGTAGTCTrnTGCCCAAGGCTGTT GTGTTTTT AGAGGTGCT 

ATCTCCAGTTGCrrGCACTCCTGTTAACAAGCACCTCAGOGAGAGCAGCA 

GCAGCGATAGCAGCCGCAGAAGAGCCAGCGGGGTCGOCTAGTGTCATGAC 

CAGGGCGGGAGATCACAACCGCCAGAGAGGATGCTGTGGATCCTTGGCCG 

ACTAGCTGACCTCTGCAAAATTGCTTCTCTACCTTGGTCATTCTCTCTCT 

ACTTGGGGAGATCGGATGTGGCACTTTGCGGTGTCT 



CGOGGTGGCGGOGGAGGTACAGTTCACAGTGCTTGATGATAATAAATGGT 
T ATTTT ACTGGT IXSATGTATTTACT ATATCAT A C 1 1 1 1 I ? IGATTAGAGT 
GTGCTOCTTCT ACTTATGTAAAAAAAAAGTT ACCTCAGGGAGGTOCTTCC 
TQAGGTCTrCCAGCACACGGCATTGTTATCATAGAAAATGAGAGCTCCAT 

GTGTGTTACTGGCCATTACCACCTT ccagtgggaaggatgtggaggtgga 

AAGCAT ACT GATGATTrrGTCCCCGTGGAGG CCT AAGCT AATGT GTGTGT 

TTGTGTCTTAGC 

>407.1

GCGGCCGGTGTGCTCATCGTAGOCTCGGGTIGGGGGGATGCGTCTCCGCTT 
TAGCGCCAAGATAGAACTTCCTCAGACCACCGCCGCCGCCCCCGGGT 



QTACCTOCACTG QCTG AAOTCTCTAGATAGCTCTCAGGAACCTTCGGAAA 
GGCATOGAACTCTTTTACCAAACTTAAAG 1 1 1 1 II I C CGATTCAGTCGCC 
TCATmCAGGAAAACCTTCCTCTTCCTTCATATAGTCATGCTTGTGTTA 

tggtoccagcctaccgccatgttttacagaagcccgggtcgccggggctc 



acc tccacac tctcccggactaggtggtcaacagggccacagggttggtt 
tctgtctttggtggggcaggggagjtgacagggatgagggtccaaggaat 
tagcatgaatgacaagat aacaagggaaagagttaaccctgtcacatagt 
aggttaa ci mm cagggttt 
>409.1

mi ll ATTT TGOI Ml 1 II I CGCGGGAGTTAAAT AAAATAAGCATGTCT 
TCATCC1 1 IATTCCTAAACATTTACTTATGACAAATGTAACX5ACTGACAG 
AAATTTGAAAAATA CCAG ACACTTCTTAAATGATTTCCCTTGGTTCAAAA 
TTTACCCCTTCTTG I I I ICICI T GCTTTTCAGGTAATTAACTCTTCTCTT 
TTTAGTTrGAACTATGCAGTGCAAGATTCCTCTGTAGTCTTTCCAAGTGG 
ACGGGTATTAAAAAAAAACACTTTATATT ATG CCAGGTGAGGTGTCAGAA 
CCCTGGCTTCGGAAAGTGGTTGGCTCACCCCGCG 

>410.1

GGGCAGGTACTGTGCAGTAGTAACCATAATTCTAAATGAGGATTATGGAT
TTTTCTGGAAGATTCTT TTTTC CTGTGGAACATGATGAGAAATGTTTAGG 

agaggggacatagccatttttgtatgaagaccaattc 

>410.2

cacactcacacao^tgcacacatgcacgcacaacttcactctatattt 

ATTCT 
>411.1 

ccttgagcagatgctgtattatggggataagccacacactttctgaactg 

gcccggtcaggggggacataaccatttotgtgccaccccatcagtaccc 

acctattgtgagcgaaggctcctcccctgcttgagtaatggccacagatc 

^ggctcggcactcctaagctgcatgatgaattcctgggacaacaagact 

ggctcgtggttccattctc<^gatccttgggttggcttctgggtgcacta 

ggagatctgaaatgctctcaggccagcaggaaagtactggaagtaaagtc 

tgactctaaagaagatgaaaatctagtaattaatgaagtcataaattctc 

ccaaagggaaaaaacgcaaggtagaacatcagacagcttgtgcttgtagt 

tcctaaccacgcaaggatctgaaaagtgt^ 

gacgaaacgaaacctgtgcctcgagcg 

>412.1 

gccggcaggtact agagttttcaagtatgttctaagcacagaagtttcta 
aatggggccaaaattcagacttgagtatgttctttg 

cttcagtgtggtataaaacgtggtttttaggctatgtttgtgattgctga 
aaagaattct agtttacct caaaatccttctctttccccaaattaagtgc

CTGGOCAGCTGTCATAAATTACAT A I rCCI 1 1 IGGTTTTTTTAA 
>413.1 

GGTACCTAGTCTATATGAGTTTGATGCTTACAGTCAAGGCTATTAGCAAA 
TATTCAGGAAAAGTAAAGCCTAAAGAAGAAAAGAGGGAATGAATAGTTTG 
TCTAGAGATAATAAAAGGAAGGTGAATTTTTAAAAAGAGAAAAATAAGGC 

tagaaaagactgagtggagaaagcctacagaatttcagaaagct AAAGAA 

ATTGGAAATTAGATTGAATATAGATAGAAATGGGAGGACAATGCAGCCAA 

TGAAAGACTGTGGGGACT AATAAAGGGAGAGCCCTGTGGTTTGGAAAGTG 

TCCCTTAATCAGCCTGCAGTGCTGCAAAACAGAAACCCAGAGAGGGTGCT 

TGAGAATATACAAGAACCCTTGCGGTGGTGACTGAACAAAACGCAGCCAG 

GGATTTCATCAGAAGCATAATCCATTCATGGCACCAGTCTGGCAGTGCTG 

GGGAGCTGGTAAGATACACAC 

>414.1 

GGCGGCAGGTACGCGGGATCCAAGATGAAGT GCAGAGAAAATAAAGAATC 

CAAAGTCATAGTCATGAGGACAGAAT 

>415.1 

AAAACCAAGACGAAGCCACTACAGCC«X3GCGTACCTGCXX»GGCGGCCA 
AAGGCCAACAAGGCAGTGGG 

>416.1

TCACCGCGGTGGCGGCCGAGGTACGCGGGGCTGCX3GAGGACCGTGGGCAG 

AACCATTGACTGGGTAGCTTTTGCAGAGATCATACCCCAGAACCAAAAG 
CCATTGCT AGTTCCCTGAAATCCTGGAATGAGACCCTCACCTCCAGGTTG 
GCTGCTTTACCT GAGAATCCACCAGCTATCGACTGGGCTTACTACAAGGC

CAATGTGGCCAAGGCTGGCTTGGTGGATGACTTTGAGAAGAAGTTTAATG 

CGCTGAAGGTTCCCGTGCCAGAGGATAAATATACTGCCCAGGTGGATGCC 

GAAGAAAAAGAAGATGTGAAATCTTGTGCTGAGTGGGTGTCTCTCTCAAA 

GGCCAGGATTGTAGAATATGAGAAAGAGATGGAGAAGATGAAGAACTTAA 

TTCCATTTGATCAGATGA 

>418.1 

tggcggccgaggtacgcgggattttgaatgaattctcaacaaaatgtgct 
agocact ggggaggcaaaacaagtaagatccctgttgcaag aaattcatt 
ttatagtgagggaggttggcatggagactaaaattctcaggaaaatgaga 
tc^t gtta gatagaatcctgatgtgaaatgggaggactcaggaaggagg 
atcgtcttt acctgaggatttctagccagaggtcccagatgcctgggctg 
agaacccagcgataagggggcgttcccaaagcagacacagggataagaac 

AGAGGAGGCAGCAGCATTGCACAGCCCCAGGCACAGTGGCAGTTAGGATG 

GCTGGAGAGTAGGATAGTTCTATGGGTTGCCCAAAAAATGTGATGTGCTT 

CATGT1TTCTCTGACTGATGGATCTGGTAGAGACCATAGACATGATATAG 

ACTA ACTTGC CCATTTTTCACAAGAGGAAACCATGCTT atgacttacctt 

AAAGTTTTTTGTTCT GTTTTGAAAGAAACCATGTGCTTCATGAAACCTAC 

AGTTGA 

>419.1 

CGCGGTGGCGGCCGAGGTACAGTATATTGACCTTAAAAATCAGTAAAGCA 

GTCATGGAAAT AACAGGTCGTGTATTATTCATG GGCACAAACTGACTCAT 

CGCTGGGGAAGAAGCAGCCACCTTAGACCAGATGGACAAGCCAGATACTG 

CAGAGAAGTTTCTGGGCTTTT 

>419.2

GGGAGACTCTAGATTCAATTCTGTAAAGTTATGATGCAGTTTTCTCCTTC 

ctctccici cacctcctctgagcacagcttt CAACAAAAACTTTGCATAC 
GOCGGGTACCTGCCCGGGOGGCCGCTCGAGGT ACTTCTCT GAGCATTGGC 

ctctggctgggattatgcttcaacagtcttgaaatgaggtccctggctcc 

CTCTGTTACAAAGTCAGGGAATGTGAATTCAACCCGTGATATTCTTTTGT 
AGGTCTCTTGGTATGTGTTTGCCTCAAAAGGAGGCTTCCCAACTAAAAAT 
JCATAGCAAAGAACTOCAAGGCTCCAGAGATCCACCTTCTCATCATGCAT 
GCGAOCTTCAATCATTTCAGGGGGCA 

CGAGGT ACG CGGT GGTCGGCGCCATTTTGTCTCGGCAGCGGTGGCCGTAG 
CTOCATCGCATTTTATGTrrCTGGCGAGAAGGGAACGGAGTTTTCATCAG 
GTAGATTGGTTTTTGT 
>421.1

gccgcctgccctgaaagacctcctgct GGAAGACCTCCAGGATGGAGAAG 
yGAGGCJGGGTGGCTCCCTGCGAGGGGCATTCAGCAAC^ 

aaaaa^ men cagagtcagtttcaaaaatggatcccaaagtcagaocga 

CJCGCTACAAGCCAATGACACTTTCAACAAACAGCAGT 

* H*** 1 ^AAQCCAAAGAAACx m 1 I f G1G1GCTGCCCGGGAAGCTGGGGTG 

CTT GWrrCCGAGGGATtXTTTCCTAAATCCCACCACCGGGAGCAGAGAGCT 

ACAGGGAGA aacaaaacttgagcagatggaccaatcggacagtgagtcag 

ACTGTAGTATGGACACGAGTGAGGTCAGCCTCGACTGTGAGCGCATGGAA 
CAGACAGACTCTTTCTGTGGAAACAGCAGGCACGGTG^ 

*CAGAAGCATGTGCACrrCGGGAAGCAflGCCTGCATCTTACCTGTACC 
>422.1 

ACGCGGjUACTGGGGAATTCTGGCCCTACG 

GGTTTGTGTGTATGGTGTCATGAGATOCT CT ACCT CAT AACAAAAGGACA 
GTGGGTAGACTAAGGCAGTAGCTCAAAGXaGCTTrGCAAAATTTAATATAT 
- T AAAAGAAGAGGCATCTGCT AGAAAACATT CT ATTGT AT ACATACTGAAA 
ACCCIATAAGGJCCTGGATA^ . 
TTTATTJTCCAATTGTGTGAAGTTTTTGA^ • 
AACCATCAAAAAAAAAAAAAAAAAAAAAAGT 
>423.1

ATTAGACAGGGGGAAGTAAAA7TATCTTTTTGCAGATGATATGACTTATA 

TGTA 

>424.1

TGGG GGCCG AGGT ACTGCCGT AGCXXKn"CCTCCCGCAGCTGTGCCGCCTC 
CGTCATCCT CTGCTGGTGTGGCTGGCTTCCAAGCTGGTGCCCGTGGGCTA 



G GACAGACTTGCTG GAGGAG GAG ATCACCAAG TTTGAGGAGCACGTGCAG 
AG TGTCG AT ATCG CAG CTTTCAACAAGATCTGAAGCCT GAGTGTG GGT 
>425.1

GGTGGCGGCCGAGGTACT AAGTGGTTTAAGGATGGAAAAGAGCTAACAAG 

TGACAACAAATACAAAATAAGCTTCTTCAACAAAGTATCCGGCCTTAAGA 

TCATCAATGTAGCGCCGAGTGACAGTGGGGTATACAGT7TTGAGGTGCAG 

AACCCTGTTGGCA AAGACA GCTGCACAGCTTCATTGCAGGTTTCAGGTTG 

GTTGATTTCTT GGGCT T T7CC7TCATCATT AT AAT AATGT AGTTCCTGAT 

TTTCATAAATGTATATGGGTTGTTACATCTTCTATAG GATAACATGAGTC 

CGACATCrrrCTGAATCAGCAAATTCAGAGGCAATACCATCT CAAGAAGCC 

ACCATTGAGACCAGAGCCATTAGCTCATCC^TGGTCATCAAGAACTGCCA 

GAGGAG CCATCAAGGCGTCTATTCTCTTAAAATGAGAGGCAGGACTGGCT 

AGGGTGATGCCTAAAGATGATTCCCAGGCTrGACATGCTGGTATTCTTAC 

ATATC 



TGGCCGGCCGCCCGGGCAGGTACTGAATGTGGGAAAGCCTTTTGCCAGAA 
ACCACACCTGACCAACCATGAGCGAACACATACAGGAGAAAAACGCTATG 
AATGTAAGCAATGTGGAAAAACATTCTGTGTGAAGTCAAACCTCACTGAA 
CATCAGAGAACACACACAGGGGAGAAGCCCT ATGAATGT AATGCATGTGG 

gaaatgcttctgc cacag atcagccctcactgtgcatcagagaagacaca 
caggggagaaaccttttggatgtaatgaatgtgggaaaaccttccgtcag 
aagtcggccctaattgttgaccagagaactcatataagacagaaaccct a 
tggatgt aatcaatgt ggaaaatcattctgtgtgaagtcaaaactcattg 
cacatcatagaacacacacaggggagaaaccctatgaatgtaatggttgt 
ggaaaatcattctatgttaagtcaaaactaact 

>427.1 

TGGCGGCCGAG GTAO CTTACTTAGCAGAGCACTTTGCAAACATATTACTT 
ATT AGCAGAGCTTJTTTGT AGACCTTCCACATCT GGCT GTCAGATCTT AAG 
gttgtgaatttaggct CCAGTT ATATTCACTGGAGAGCATAATCCCACAC 
GGGTT ATTTATAAATACAGAGCCTCTGATTGGACGGTCTTX^GCCAAGAA 
CTAGTAATAOCC 1 1 G 1 1 1 1 AAAATCTTCACAAGGTAAAACTTAAAAAGCC 
AAGCAAACAAATTGCTC TOCATTC TACTTTT AATTGGGCCAAACAGCATA 
TGCT ACAGT AGT AAGATGTTTTTCGGAGAGT GT AAAAAAL! 1 CTGT I IACA 
TTTGCCTCCTCOGTGGGTTGATCGAAAATGT AT AAAACTGACTGCTTCTC 
GCCAGCCTCAGACAAGAAGAGTGAGCTGCTGGTACCTGCOCGGGC 



GGCCAAAT GCAGAAACGTCCCACA TGCCCACGAGGAGCAAGCTTCAAAAT 

GTTCAGCTTGCGGGGCA 

>429.1 

TGGOGGOCQAGGTACI rTTTTTTTTTTTTTITTn I G T GATCTCAACTGC 
TTTTAGCAAGTTGTGAAT ATACTTGGGCTTTCTGTC 1 1 1 COCC AAAAGCA 
ATTTGGGATTA 1 1 1 ICCTCC1 II I III I U I GCATTTCATCATAAATACTG 
TCATATTCATACACAGT AGCATCTTCTGCAAGGGCCTTCTGGATTTCCAG 
TTTGGTCTGTTTCATGGO C I.GI I I CI I AGCAGCTTOCCTCTGAAGGCTTT • 

c actcaca gaggtctcatcatcatcatc agaatca ttcccaaacactgat 

. Gui i.i i,iA»CAAA A CAGGX3TQCAACTGCIGIGI 1 1 1 CI TTGGCAAAATAAQ 
CCCATACT
GCCGAGGTACAGACaaaaC J aCaGaC II AGTCTGGTGGACTGGACT
AATT ACTTGAAGGATTTAGAT AGAGTATTTGCACTGCT G AAG AGTCACT A 
TGAGCAAAATAAAACAAATAAGACTCAAACTGCTCAAAGTGACGGGTrCT 
TGGTTGTCTCTGCTGAGCACGCTGTGTCAATGGAGATGGCCTCTGCTGAC 
CCA^TGAAGACCCAAGGCATAAGGTTGGGAAAACACCTCATTTGACCTT 
GOCAGCTGACCTrCAAACCCTGCATTTGAACCGACCAACATTAAGTCCAG 
AGAGTAAACTTGAATGGAATAACGACATTCCAGAAGTTAATCATTTGAAT 
TCTGAACACTGGAGAAAAACCGAAAAATGGACGGGGCATGAAGAGACTAA 

tcatctggaaaccgatttcagtggcgatggcatgacagagct agagctcg 
ggcccagccccaggctgcagcccattcgcaggcacccgaaagaacttccc 
cagtatggtggtcctggaaaggacatttttgaagatcaactatatcttcc 
tgtgcattccgatggaatttcagttcatcagatgt 

>431.1 

gc^gc»agaccaaac^cagccctccaacaatgatgaccagtggaaaaac 

aatggagtcaccaaaacctgggacaggctcatgctccaggacaattgctg 

tggcgtaaatggtccatcagactggcaaaaatacacatctgccttccgga 

ctgagaataatgatgctgactatccctggcctcgtcaatgctgtgttatg 

aacaatcttcgagcggccgcccgggcaggacgcgggagttcaagaagctg 

gtggtcaaggaggaggaggtggaggtggcagtggaggaattgcagaagct 

ggaagtggtcatatgaactacattcaagtaacacctcaggaaaaaaaagc 

tata g aaa ggttaaaggcattaggatttcct gaagg acttgtgatacaag 

cgtattttgcttgtgagaagaatgagaatttggctgccaa 1 1 1 icncTA 

cagc^gaactttgatgaagattgaaagggacttttttatatctcacactt 

cacaccagtgcattacact aacttgttcactggattgtctgggatgactt 

gggctcatatccacaatacttg 

>432.1

ggcggcggaggtaocactgcttcccgggactctgcgttgttaocactgct 

tcccgggactctccgttgttaccactgcttactgcgttcccagcatttct 

tttctcttctcgtttcctgtagattccggctaatggtttcccctggcatt 

tgacttcgtgatgtgtaactgattctcttcctgaagggggaaacgcattc 

c^pj^I^^CQCQCTCAT^^ 

AATCCCGCGT 
>433.1 

g^®^®^®^J®^^^^^^TACAAATCTAOCTCCCCAOCAAATGTOCTrAGA 
>433.2 

gcagtca tgaagct ggcaaatggcagaactg gagctag aaactgctgact 

CCCTTTATCTTTTCCATAGCACCCCAAGCCT AAAACCAGACTGGCACAAA 
>434.1 

GCGGTGGC GGCC GAGGTACTTTTCTAAAAGCTCATCCACTCTATCATTTA 
GATATCCAATTTTCAGAATGTGCTCAaCATTGGCCACTCCATCTGCCATT 
CTTAAGTCTCCTT GGGAGTCTCCCAGAAGAATTATGTTACTATTGTCTTT 
TAGTTGATTGAAATATTCTGTATTCCrrCAAGGCACCATCATGTTTGTTAA 

TAT^TCTTTTGAACAGGCTTTAAAM 

TT TTGGA GCTCTT AATTGATGTCACTATTATTTCATGATATTTGTAAATA 

T^CTAAGAAGGGCTCGGAGAAGTAAATGATTTTTTAGATACTATTTTAA 

ATGGT AAAACAAAAGCCGGGCGCAGGGGCTCACA 

>435.1

ACGCGGGGGTTGCTCAAACCGAGTTCTGGAGAACGCCATCAGCTCGCTGC 

n AAAA IT AAACCACAGG1TC ^^ 
TCATCCTGA
>436.1 

GGCGGCCGAGGTACGCGGGGGAACACCACCCAGTGTGGAGCAGCCCAGCC 



Page 59 of 111 



542 

Table 3 



PCT/US00/33312



GGAATAATAAGTGAGGGGTAGATCTGCCCATAGAGCTCACTTTAGACCGG 
CCTAT ACT CCTACAAAG AATTGTGGT AGGATC 
>437.1 

GGCCGAG G T ACCTTTTTAGAAG AGAAAAGAATCTTG AATTGTAT AT ATTT 
ATTTTGCTTT ACAGAAAAAAATGGTTTCGTAAATAATTTGCCTATTTTGG 
TTAACATAGCACATGGAGATAATCATCTGAAAGTTATAGGGCACTGCCAC 

tgctgaatcaagagcatgcccaatatttgaggtggctctgatttcctggc 
agctgaactcgggtagtccagtggcctagctggtcctgcccg 

>438.1 

aggaggaattagtggatcccctaacaacagtgagagagcaatgcgagcag 

ttggagaaatgtgtaaaggcccgggagcggctagagctctgtgatgagcg 

tgtatcctctcgatcacatacagaagaggattgcagggaggagctctttg 

ac i rc 1 1 gcatgcgagggaccattgcgtggcccacaaactctttaacaac 

ttgaaataaatgtgtggacttaattcaccccagtcttcatcatttgggca 

tcagaatatttccttatggttt 

>439.1 

gtggcggccgaggtactctgtgatttcacctagatttggagaaggtgagg 
gaggaaagg ct gtcct ctttgatcccataccatgcaggggcaaatggctg 



ggcggccgaggtacgcgggatgtctaaatatcttgtaaaaagtgttaaaa 
taaacaaacccagtcaattaaaaattttgactgttattgagaaaactcc 

>441.1 

gcggccgaggtacattgtagctttgaactcagtgtttaaaaattcaatct 

ggttacacactctatcttctagatcccttgagacactgtcttccttgaat 

aagggccaggtgaaatggcatttcagctgtggaaggattttctccaggga 

attcttggtgacctcactcatgacrrgccctctgtgtctctgctgttccga 

aaagctggtgaccaggctgatttgttcttcagaagtcttcctgtctgccc 

ccgcgtactgtnxjrgcaggttaaggcaggactggaactcctccagagct 

tgcacatagttttcagattcaacactaacttctccgagtttaagatgtgc 

CT GGGCAGCATAAAGCTGTGCTTCTTTTGTTTCTTGCCTTTTAAAAATGA 
^GTTTGCTAAATCCAGCATATCCCAGGCAAGCT CT AGGTTCCCAATCTCC 
TCCTCCT CATTTTCTT GAAGAGACTTGTTTTCAAG GACTGAATCATTTGG 
S*!!!? 71 '?^! 071 " ^GATTTTCTTTATC^TCCTCTTCX^GAGCCTTCAG 
TTTCTTCACCCTCrrTTCATCTGGTCTTCT CTTJTCTTGGGGCTCTTCATTA 
GCAGCTA
>442.1 

GGCGGCCGCCCGGGCACGTACTTTTGCTGCT GAGGAATGGAATCAAAAGA 
ACGTAGTCTCCTGGTAACCAOCTCAGATCTCTATTATTAGGCTAGATGTG 
GGGCGGGTGACTCCCCCAGCTTCTTGCTCT<XlA<XXn'GC^ 
CCCTTCTATTAGCAGCCAAGGAAAAGGGAAACXTGAGCTTATCCAGAACG 

?Z GGG ^ GAGTCTCC ^ GGCAATCAAC< ^^ 

ACACT GTATAGCTCATTAT AGGACGTCAG G H I G H GAAAAAAGTGGGCA 
A GACATG ATTAATGAATCAGAATCCTGTrTCATTGGTGACTTGGATAAAft 
ACTTTTT AATTTT AAAAAAAAAT ATTCATGGAATAG GGT 
>443.1 

CCCGCGGTGGCGGCCGAGGTACATGAGAGACACTT^ 
GAATAGAGTGAGTGW^ACTCAGATTGTTTAAGCTATCTXrrGAACCCATT 
CCT ACTGCG TTT AACT ATTTTA1 lUUI 1 1 CTAACT ACTACCACAGACACG 
GATACCTCACAGGTTCCATTATTACTCACAGCGnTGTGGTCCGGGTTCAT 
CGOCATCCTGCTTX^CGCTGTC^ 

CCCTCTTCTATAAG GGACATACACX^GAT(>CCGAAAACTCCTCCTTTCT 

°^JI G I TCC 7^ G ^ GTGGGTGGGGACTCCAA ^ 
CCTACTAGGCCACTCT ACCCCaTT



CCACCGCGGTGGCGGCTOSAGGTACXXAGCCCCACC^ 
GACATGTTTCGTAAGTGAGACAAGCCAGTGCA 
>444.2

GTTATGGCTAAGCA(>TAG^GG(^CAAAAAAGGAGTTTTTCAAACCCAGC 
AAATCAAGTGCTTGGATTCT GAACTGCCAAAAG AAAACTGCACTTCCCCT 
CTTAAGTAAAACGAAATGAGTTTCTTAGGTAAATGTATTCATCAGCCCAG 
ATAAAAAAAAAACCAGTT ATGTGAG CGTTAGTCACTG CTCATTTCCAGGA 

AT AAAGCAAAGAGCCCCG CCCACAAGCCAGCA 
>445.1 

ACTTT ACT AAAATG A CTGCA TTCTTTGG ATTCCTT CAGTCTATGGTTCAA 

?J£^J^ GA ^ CA11 ^ G ^ GAGTCCTTATGA GAAACAGCAGTATGA 

ATCTTGACGGTTTCTGCCCGTCCTAATGGCAGAGCTCTCTGACTTGGGTG 

TATGCTACCAGGCTGGGTTCAAGTGAGAAGTTCTGGTCAGTCTTCTGTGG 

G II G i^ G iJ CAATATCAA " CTG ^ CAAAG CC^ 

A ^ T 9s!3J GCrCGGTATATGCCACC< ^ TGGT ^G T CTGGGACCTAGGTGG 

TGAGCTATCCCATAGTTCATTCTCAACGTCTTT ACTGCACTGTTTAGGGT 

CAG ^J AC ^Z A ™^^ 

TWGGGCTTGGTGTTTTGATTTATAAACAGGAGTTTATAGAACT^ 
GCGGCCGAGGTACGCGGGGAGACACAACTTCCTGGGCTT AGATATTTCAG 
>446.2

TTAGTTTGTTTTCATGCTGCTGATAAAGACATACCTGAAACTGGGAACAG 

AAAGAGGTCTAATTGGACTTACAGTTCCACATGACTGGGGAGCCCTCAAA 

ATCAaSGTGAGAGGTGAAAGGCACTTTTTACATTGGCAACAA^ 

TGAG 

>447.1 

CGGCX^GAGGTACGTTTTGTGACAGGCAATAAAATTTTAAGAATTCTTAAG 

I c ^ A /l GGG t9J TGCTCCTGATC ^ CCTGAAGAT 

GAAAGCAGTG 
>447.2 

CGAAAGGATCTTGAGAGGAACAGAAAG GATAAGGATG CT AAATTCCGTCT 
>447.3

g1 ^^ ggg cttccgatattaataagaccaagctgagtcctccctcccaat 
t gg ^! a I g ^ tga tctacagccttctgcc^ 

TCT^GTGTTCT CAAGGCAATTAATAATGATTGTTTTAACACCAACAA 

GGCGGTGGCGGCCXSAGGTAC 1 I H I 1 1 1 1 1 I U II 1 1 1 1 GTTAGTGTTTT 
CTGATGTCI 1 1 1 CTAACAAATCTTTGCCTGCCCAAAAGTCTCAAAAACAT 
TCTCACGTTTCT A 
>448.2

^Z c ^™ G .9£ c ^ ACTGGTCTGGAACCAOC ^ 

GTGAATTGCGGGTACXXAGCACACCGGCXSOCGTCXJrGGACTGCGCCTTCrrA 
C^JPC^^TGCCTGGAGTGGAGGACT 

atgatttgaacactgatcaagaaaatcttgtt gggacccatgatgcccct 
atcagatgtgttgaatactgtccagaagtgaatatgatggtcactggaag 

^ GGG £ CAG ^ G II AAACT ^ 

GGACCTT CT CTCAGCCTGAAAAGGT ATAT ACCCT CT CAGTGTCTGG 
>449.1 

CGGCCGG^TACAAAAAGCAGGGGCO>GCCCC^^ 
TATTTAGAGGAAGTAAGGTAGCAGGCAGTCCAGCCCTGATGT^ 

A J^ G ^T7^ GG ^ TCAGC ^ CTGGAGGAATGCATGTG ^ 

TTTTTCAGAGAGTGGTGCAGCGCCAGACATTTTGGAC^ 
CCTGCCCGGGCGGCCGTCGATCTOCTTGTGTTCAAGCAACTTCTTGCGGT 
AGTCCTGAAGCG CCTT ATCTCTAG GGTCCGCCATGATGAGAACCCCGCGT 
>450.1 

GGCGGCCGAGGTACTCXXrrACGGCACTAGTCTACAGGGGGAAGGACGCTC 
TGT GCTGG CAGCGGTGGCT CACATG GCCTGT CT GCACTGT AACCACAGGC 
TGGGATGTAGCCAGGACTTGGTCTCCTTCCCGCGTCAAGAGATAGAAAGA 
CCAGTCCTT GTGAAAG ACAAGTCTGAATGCTCCACTTTTTCAATTCTCT C 
TCCATTCTTCAGT AAGTCAACTT CAATGTCGGATGGATGAAACCCAG ACA 
CATAGCAATTCAGGAAATTrGACTTTCCATTCTCTGCTGGATGACGTGAG 
TAAACCTGAATCTTTGGAGT
>451.1

GAGCGGCCGCCCGGCAGGACAAATGAGTTTAGAAATGTTGTATAAGGCTG 

ATCTGGACCCAAACTAAAACAA 

>451.2

GGT 1 1 T m GTAGAAACCATGGTAAAAAGGGAAAAGAAACCTTTGACT GG 

CGGGGGCAGGGGGAATACAAAAAAAAAATCCCTTGATTTTTAAAATATAC 

TTG AATATCAAACTCAGAAAG AGTT A I 1 1 1 1 G I G AAAGAGGCAAAATTGG 

TCTTGAGCTGCTTCAGTCT ATGTCTGAAGGTTTTACTGAAATTATGGTCC 

AGTTTTAGGAGAAAAATTCACAGAAAAGTCAGATTGTAGATTTTGAGAAG 

GAAACTCTGAGGTGGTGATTTTCTCCAAGGTCATGGTTATGAAGCTCAAT 

GAG GGCCTGAATTGCTTCTTCCACAGATCCCAATTG AATGAG CGCCATTT 

TGCGATCTTTCTGAAAGAATTTAAAAGCCTTCACTGAACATCCAGCTTCT 

ATGAAAAGG TTCTTCAGATCATCCACTGT AACAGAAGGGG GAATG TTGGA 

AAGA 

>452.1

GCGGCC<SCTAATGTTAGAAGTTAAGTTGGAACCTATATTGTAGA6GAACA 

AAAGCCAATCAGTGTCCTTTTTGT C 1 1 1 1 1 1 1 ACATAAACTTTT ACT ACA 

AAAATTAATATATGGATTTTGAATTTCCAGTCAAACCAAA7TGTAAAACT 

GTTTCATTTG GTT CT AT ATTATGT AT ACATAATTT AT CT ATT AT AT ATTT 

ACATTAAAATATATGCATATATAATGGATTTAATTTCCTrrnrGGCACCCC 

CATATCTAGAAGTCTCTTCATAAATTAATAAATAATCTAGGGCCAGCATT 

AT GTTTGC TAGACCTGGATTTGGCTCAATACTT AAAGTTAAAAGTTTGTG 

TCTTT TTT CTTGGACTrG AAACTGCCTAGAGCGTCAGTCTCTCTGTTATT 

TTTTTCTAI I I ICi I I J I CCCCCATCAGTCrrTTAGCCACTTGAAGCCAA 

AATTCTrAGTTTCTGTCCTAGTCGATAAGAGTAAAAGGGGAAGGAGGAAA 

A 

>453.1 

TGGGGCCCTT ATCCCG GGTAAGCT ATACG I C Ml GAGGTCCCAACCCCGG 
>454.1 

CGCGGGGACCTTTCACX3GGCGGGGG<5AGCTGAGGCTCCTGCCGACATCT C 

TGATCCTTGCACCCTGGCAGGAAGCTGGTCGCGGGCACT AT AACGGGAGG 

CCTCCaC^TAITCCAGAAAAGAAACCACTCTGCAGTGCCAGaCTGGJMGA 

AGTAACGGTCACTCTGaAAACAGGGGGGGaGAGCTGCCTCCCITTGAACC 

TCTCCCAGGACCAACTCTAACCCAGGXaAGGGGAACTTGGTCG^aTGCAAGC 

GGTGGCTTGGAGACAGAATCATCTAATGGAAAAGATACACTAGAAGGCGC 

TGGGGATACATCAGAGGAGAGGGATACT(^WXXX5GGCTCCGTGGATGAAG 

AGAATGGCCGACAGTTGGGCGAGGT AGAGCTGGAATGTGGG 

>455.1 

ACGCGGGGAGGATCTCTGT CTTTTGTT CCCTCACCTGTCTGCCTGTCTCC 
TCTCCTTTCCTGCCTGGXSGGGACTGTCCAGAAGAC^^ 
CTCTGCATTTGAACAGCTGATCCCCCACCCCTCAATACCGTTT AGAGCAG 
AAGCCAGCAATAACTAAACX^TCAGGGACAGATAGAAACTATTTTCGGCT 
TCATGGGCCACACAGCCTCATTGrrAGCTTCTCAAATCTGCTGTTGTAGCA
AGAAAGAAGCCATATACOCTGTGrTAAACAAATGAATATGGCTGTGTGCCA 
ATAAAACT ATTCACAAACATAAAGAGTGGGCTGX^TATGACTCAGATACT 
GTTGTTTGACAACOXrn^tt^AGAGTAAAAATt^^ 
>456.1 






GCGGCCGAGGTACAACATGACATTTTTAACCAATCCAATCTAAAAATGTG 
CCAGAATCCAOCTGT6GCCCGAATCGTGT7TGGTTCCTCTTTCTACTCCA 
CTGCAGATGACCAAACCTGTCCCGCTGCCACTTTCCT cactgatattggg 
AGGAGGGCAAGGCCCAGCCGAAGTTCCACTAAAAATGCCCCAGGAGAATA 
GGCACCGGCTGGCTTGCCAAAGGGTTTGGGTTTTATTGCTTTCTGTTTTT 
TCl ! I ICCCGACAGCACAAAGAAGTAAGGGCAGTTATTGGACAGGTGTTA 

tttaaagattctattgtaaatgaatgtgttgtttggttctactgcattgt 
ggagcatgcgggggaagagaactgacccaggtaatgaa 

>4 5a 1 

ACAGACAAAACTACAGACTTAGTCTGGTGGACTGGACTAATTACTTGAAG 

GATTTAGATAGAGTATTTGCACTGCTGAAGAGTCACTATGAGCAAAATAA 

AACAAATAAGACTCAAACTGCTCAAAGTGACGGGTTCTTGGTTGTCTCTG 

CTGAGCACGCTGTGTCAAT6GAGATGGCCTCTGCTGACTCAGATGAAGAC 

CCAAGGCATAAGGTTGGGAAAACAOCT CATTTGACCTTGCCAGCTGACCT 

TCAAACCCTGGATTTGAACCGACCAACATTAAGTCCAGAG AGTAAACTT G 

AATGGAATAACGACATTCCAGAAGTTAATCATTTGAATTCTGAACACTGG 

AGA AAAA CCGAAAAATGGAOGGGGCATGAAGAGACTAATCATCTGGAAAC 

CGATTTCAGTGGCGATGGCATGACAGAGCTAGAGCTCGGGCCCAGCOCCA 

AGCTGCAGCCCATTCACAGCACCCGA 

>459.1 

TGG CGGCCGCCCGGGCAGGTACGCGGGTAGTG AGCTGGTTAGTGAAGGCT 

TTGTAGCT GAGCAGTTTCTAAATAACACAGCCACTCAACTGACATACCAT 

GGATT ATGTGAAGT AACTT CAACGGTTCAGGAAGGAGAACTTTGTGTGTT 

CTTTCGGAATAATCATTTTA6CACCATGACCAAATACAAGGGTCAACTGT 

ATTTGTTGGTAACGGACCAGGGGTTTCTTACTGAAGAGAAAGTTGTTTGG 

GAAAGCCTACACAACGTAGATGGTGATGGAAATTTCTGTGACTCAGAATT 

tcatcttogacctccttcagatcctgaaactgtatacaaaggacaacaag 
atcagatagatcaggattatcttatggcattatctct acaacaagaacag 
cagagccaagagatcaattgggaacaaatcccggaaggaatcagtgattt 
ggaactagcaaagaaactccaagaggaagaggacagac«ggcttctaata 
ctatcagg 

>460.1 

tggcggccgaggtacgaatgtgcaaaattaaagcatggtaaact GATATT 

TACATAAATATCAAACCAACAATTAGT7TATACATTGTCAATGACCTTCT 

AAGATATGTCATGAGTGGATCCAAGAATATCTTTCCCCCAATGGAGAAGG 

TATTCAGAGGCTAAATTCCGACACTTTAAAATGACACACATCATAGGCTT 

TACCTGTTTGACCACTGCCTCAAATGTGTGAGATGTGATTTTATGATCCC

GCGT 

>460.2 

TAGACTfCAGCGAAACAACACGTCCTGAAAGAAAGATGATTCCCCTCAAG 
CCACAAAGGATTTTCTCATCAAGTGTTTTCACCTCTGCATTAGATTTGGA 

cacaagaagaggagagcatttactcaggtaaaaatagtt ctctt agtctc 
ttcctct agttactt aatttttaattt aaaaat acaattaact gatct agc 
tgataaaagtcacagacagaaataagctaagttctctcttcctttaggga 
acgctggtggcaattcaccatataaactggatggaagaattctccca 

>461.1 

CTGGAGTrCTAGCAAGTCGGCCAGGATGTCTAAGGCTGAGTrrGAGAAAG 
CTGCAGAGGAGGTT AGGCACCTT AAGACCAAGCCATCGGATGAGGAGATG 
CTGTTCATCTATGGCCACTACAAACAAGCAACTGTGGGCGACATAAAAAC 

aggaacggccogggatgttggacttcac»ggcaaggccaagttggatgcc 

TGGAATGAGCTGAAAGGGACTTCCAAGGAAAGATGCCATGAAAGCTTACA 

tcaacaaagtagaagagctaaagaaaaaatacgggatatgagagactgga 

TTTGGT rACTGTGCCATGTGTTTATCCTAAACTQAGACAATQCCTTGTTT

1 1 1 1 U I MTACCGGGGATGGTGGGAATTCGGGAAAATAACCAGTT AAACC

AGCT actcaaggctgcttaccatacgggtct AACAGATTAGGGGCTAAAA 

ACGATTACTGACTT 



>462.1 

ACCGCGGTGGCGGCCGAGGTAOGCGGGATATTGTTCCTGATTTGOCTGAT 

GTGTGGACGGATCACCAAGCGAGTGACACGAGAGCTCAAGGACAGGCTAC 

AATACAGGTCAGAGACAATGGCTTATAAAGGTTTAGTGTGGTCTCAGGAT 

GTGAGAGGC AGTQC AGCCTGACCTTTCTGCACACTCCAGACAAACTT ccc 

AGACAAGCTCCTTTGTGCCTCTACGTGGAGAGGGCGTGGAAAGTTATCAC 

ATTAAAAGATGGAGGATTTAAAAAATAAAAAAAAAAAAAAAAAAAAAAAG 

>463.1 

CCGACCCTGCCGCTT ACCCGGATACCT gtccgcct ATTCTCCCTTCG GGA 

aagccgtgggcgctttcttcataagcctcaccgctgt agg 

>463.2 



AAGGGATTAAGCAG 



1 1 1 1 I II 1 ITTTTAAACCGCTGCCACCACCATGAAAGAGGGGGCAOCACA 

r r i m attgcatactcaggggaataacttattatacaatgaacactcctc 
cattaggagaccatgcccacttacagaatgcagccgtaaatgcggt aaat 
ct atttacagaggttggggtgcaagatgagagaagtatcaccoccaggaa 

TTTGAAGTGAGAATGATCTACAAATTCTCCTGACAAGGAGCAACCGGGCT 

tgtgctagtgagggctgaaaaaattcctggcaaaacgtagggggagatta 
aatctcggaattgacagcaagtttgggga 

>465.1

gcggccgaacgcagagaaggtag^agatagcaccatgcx^ttostcgaa 

ctgtgaattctacccgg gaaactcctcccaaaagcaagctt gctgaaggg 

gaggaagaaaagccagaaccagacataagttcagaggaatctgtctccac 

tgtagaagaacaagagaatgaaactccacctgctacttcgagtgaggcag 

agcagccaaaggg ggaacctgagaatgaagagaagg aagaaaataagtct 

tctgaggaaaccaaaaaggatgagaaagatcagtctaaaggataaaaatt 

atgataaataaagt 

>466.1 

ggcggccgaggtacgcggggagxstcggtgcgcgcttctccx^gaggtggaa 



>467.1

ttgttttcctgaacactcagctggactgcatttcccagc^^ 

TTAAGTCACAAGTAGCXKrTGTGACTGGGTTCTGCOCGGTAGAAGGTAAGC 

agaagtgatgtgtatcact 

>468.1

tggagctccaccgcggtggcggtcggt gtgctgtgctcatctgtcttcca 

aaggagg^acagatcgggaagtgcatctgacgcgtggccgacaaatgctg 

togaagaaagaaataaaaaccctgaaacatgagcgagagtgatcgaaacg 

tgtggaaatgccttcttaaagtttataaaagtaaaatcaaattacatttt 

tmrrcaaaaaaaataatttaaaact aaatgt acctt aaa 

>469.1

tggagctccacgcggtggcggtcggaaggagaatggtatcactcaggctc 
tcagagtgacactgaagcaagacactcatggggt aggacatgaccctgcc 
aaggagttc*caaaccactggtggaatgagctcttcaacaagac^ 
caacttg gtagt ggaaactgggcaggatggagt accttcaggattggcct 

GTTATCTTCTTT AGAACT AAGTTCAT CTT AAAAATTT AAGAAGGTGGACA 

tttcaacaccatcaagtgc^tttaggtgacatgtttaagttaacttgact 

TCCTTGAATGACCT AGTTAGT AAACT AGTCACT AGT AATT CGGTCACCAA 

GCAAATGAAGCCTGGAAGAAAGGAAGCCAATATTCAAAATGGCATtTTTAG 

CATCTAAACCC

>470.1

t^ggagctgcccgcggtggcggccg^
^gtggacttgctcccagcatccggaccaaaaccatcagtgccacagocac 
gacagaagccgaaccggaagttgacaaccttctggtttcagatgccaccc 
cagacggtttccgtctgtcctggacagctgatgaaggggtcttcgacaat 
tttgttctca>-\atcagagataccaaaaagcagtctgagccactggaaat 
aaccctacttgcccccgaacgt 

>471.1 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTA U 1 1 1 1 I I I I M I I I M I l 1 1 

TGGGAAGACACAAAGATTCAGACCACAGCCTACAGGGAGAGAGGATTTCT 

GAGGATGGTGGTGCACTGTGAGTCCACGCAGGCCTCCTGGGCATAGGATG 

GAGCAATTCTATCTCACCTCAGGCCTAGCACAAAGGGCTTCAGTAAACCA 

CTGGAGTTTCCTTCATTAGGATTCCATCCCAGGATATCCAGAGGACAAGA 

GGCTGGCCAACTGCAGGATTAGCCTATGCTCCCGTGCTGGATATAGGCTA 

CACGCAAGAGAAAGCTTGGGTGGGATCTCCTGATCCCGCGT 

>472.1 

CGGGCAGGTACTATGGGTGTAGTGTTACTATTACAGTTAATTCGTCCTTT 

GTGTGOGCTGATAAATGCAGTGAGGATTGGAGCACTGTCCACTGAGTCTC 

TGTGCAACAACTT ATCGGTGTGGCAGGGGTTTCCGGTGTCTGGCTCTGAT 

CTTGGTCGCTGGATAGTCGTCTGTGT 1 1 T TTCGGTGCCCAAGGCGACGGC 

TTTGGTATGGGTTCGTGGCGGGGTGGTTGGCCAAGTGCTGTCTAATAATT 

TGAGGAGAGGAT ACTTTGTTGCTGCTG caggatcagccatgg t agatt a t 

GGTTTTTGAGAACCAGATGGGGCAGACAATTTCTAGTGTGCCCATTTAAC 

AG 

>473.1 

accgcg gtggc ggacgaggtacaaaataattataatgtattaactcatac 
tgcctgtctttt ataggg gaaaaaaataac c i t ill i attttaaagttat 
aaggtgggttaccttttagttgcttggatgacagggaattagcctaccxxj 
am i ggtctggaacagaagactttcaaatttaatatggcccaagtgtct 
tcctacttaagtgcaagatc 

>473.2

TTCTCATTTTGAAAAGACATGC I unci iaaagcaacaaaggtggtaga 

GGAAATTTCTTAAACTTTCTCAACGAG 

>474.1 

ggcggocgccggggcaggtacgcgggggagctgagccggtgggtgagcgg 
oggg(>cggcatcctgtgctgtgggggctacgaggaaagat€taattatc 
atggacctgcgacagtttcrtatgtgcctgtccct^ 
ct toagcaa acccacagaaaagaaggaocgtgtacttctaaaattgcact 
ttatgttttgt aggcttggagcttcttgatt atgggttttttcgtt acaa 

AATTCAACAACAGAATCAATACTTTGCAT AAACATT ATGGATGCTTTTT C 

TGTTTGT 

>475.1

og oggtg gogacagggttacattggtaagggtgacagttagaaggggaag 
tccttttagtga aataga tgagaggttttagatctgcacaaa w imiic 

a ^GGAAGTCCAAC7TTGCTCCTGGGTAGTTTAAAGGACGTAGTCCCATGT 
>476.1 

GGCTACACGCTAGGAACCTTGCAGCTTACAGTGACAGAGCTOCCATT^ 

gaggccaccactcatctcgatttctggatctctagggaatgagtagagct 

OCAGCTGGATTCCC1 1 1 ICCAGTTTC1 1 ATGTCCACAAGTCACTGTGCAC 
agataagagtgttcgi T CTCAAAACTCACAGGGCTCAGGGTCAGCGTGGA 
attggtcgcc i i cacttxttcaccttccxxkrrcagagggctgtctatctgg 
gttctccaggagaaagatggggattgacagcocatgacacctacatgtca 
acatgactgagtctccaatctgagcagcaatccggggtccaggggagatc 
tcaacagtaaatggtttctc 

>477.1

CGGCGGOCCOCCGGGCAGGTA Lnilllll.ni |i n| | | | |jniM I G 
^^w^^T^T^ AAAATGATTTTTTT AAGTTTGAACTTT A7TGQAAG 
QAGTCCCTCTAATTCACAC TTTGATC CTAGAT^ 

TATGGAATATAAAGCATTGATTTTTTAAAAACCACATAGTAGGACAGTGA 



WO 01/42447 PCT/US00/33312

Table 3



AAGAAATGCAATTCTCCAGGGTCTTAGAGAATTCAAAGGGGCATCTTAGG 

tggtctaagaaaccaattacagtctcatggttttc^ttttggttcaagat 
tagaagagtcaggttacx>ctacctgtttttagaggtagaatatgaactt 

TCTACT AGTCCACAGT7TACTGGTCAGGTGGCOCCAAC 
>478.1 

cgcggtggcggccgaggtacctgcatcagggataagaacccattcccctc 

ccttgttccggtgtgctctcgccattgcaccatccatgagacgcactctt 

gtatagaagtaaaattgccttgctgagaaaaaaaaaaaaaaaaaaaaaaa 

GT 

>479.1 

CGCGGTGGCGGCCGAGGT ACGCGGGGGGTGTGGCCTGCATCTCAGCTGGC 
GGCCATCAGTGT AAATAG AGCTTAAAGTCATGGTTTGGCT GCATAAAAAT 
TTTCT AACTTGGGTTGAATATTTGTAGeGAAGTATCTGTTTTCATTTT 
>480.1 

ACAGATGCAAACGGAGGTGTAGACT gtgcagctgccaaagtggtgacaag 

GAATCCAGAGGACCATGAAAGGATCTT AATGCAAGTCATGAACTT GAATG 

TGCCGATGAGGCCTGGCATTCTTGTCCAGAGACAGAGTAAGGAAGTGTTG 

GCGACACCCTTAGAAAACAGAAGGGACATGGAGGCAGAAAAAAAAAAAAA 

AAAAAAAAAAACGT 

>481.1

AATA AOCCAAATTTGTTGATGl 1 1 1 GTGGCCAAGGTGAGGGCTGCAAGTG 
TTTTCTAAGGGTTGAAACATCAGAATAAAGGTATGGTGGCAAGTOCTCCT 
TCTGCTAGGCTGGCTGGCAAGGCCCT ATGTCTTGACCTAGGTGGT AGTT A 
CAA GGGTA TTTATTTGC CTT A T AATAATTCACT AAACT ATGTATTTGAGT 

aga rrm atgtgtgtgctttaatttacaaaaaaaaaaaaaaaaaaaaa a 

GGTG 
>482.1

CTGA GAGATCCCCTCATAATTTCCCCAAAGCGTAACCATGTGTGAATAAA 
TTTTGAGCT AGTAGGGTTGCAGCCACGAGTAAGTCTTCCCTTGTTATTGT 
GTAGCCAGAATGCCGCAAAACTTCCATGCCTAAGCGAACTGTTGAGAGTA 
CGTTrCGATTTCTGACTGTGTrAGCCTGGAAGTGCTTGTCCCAACCTT GT 
TTCT GAGOATGAACGCCCGCAAGCCAACATGTTAGTTGAAGCATCAGGGC 
GATTAGCAGCATGAT ATCAAAAGGCTCTGAGCTGCTCGTTCXiGCT ATGGC 
GT AGGOCT AGTCCGTAGGCAGGACTTTTCAAGTCTCGGAAGG 1 1 IC1 I C A 

atctgcattcgcttcgaatag^tattaacaagttgtttgggt 

>483.1 

tggcggccgaggtactcttcaaaattgtcaaggtcatgaaagacagcaaa 
aagtgaagaattcttacaaactagaggagacaaagattggag 

>484.1 

gatgtgaacaatgtgtca n uc i ci caagagaaggatgtggatggcctgg 
accgcacagctggtgcaattcgaggcogggcagoccgggtcattcac 

GTCACCTCAGAGATGGACATCGAGCGG<XX3CCCGX5GCAGGTACAGA 

ttattg««^acagcaaogagccacgctggcaaa<>atgaaagtagagtc 
gctcagaaacacgaaagatcatatgtgtgtcatcacagcatcgagaattt 

aaatcatct ggaagttcctgct aaattaaagcatactgtgccagagctcc 
cctct aatcaaaaaacgctgtcctggt gaaaatttgcaatgaggatt aca 
gagagagagatcaaccagtgaggaaatcacagactcttacatgagtttac 

AfiTTAACCCCACTGCACAAAATAATAAATTAGCCATAATTTGGTrTTTTT 
TO 

>485.1

CGTGAGGTATTG 1 1 1 uGATTGTT ACAATGAACTTGC A \ ITCTTl fGTA AT 
GAAGAAAA T AATACAGAGGAAAT AACAACAACT AAACCTTTGGCCTGGAT 

TATCATCGGCTGGAAATTt^TGTTGGATGCAAGTTTTrATTGATAACAAG

TTATTTTTTJGGTI I ATATGCAAAAAATGTTCATTGAATGCCTCCTATTTG

gctggcactgcct aggcactttcacaggtatttcatcct aatcctcacaa 
cagcoctatgaggtaatcattggtoccagtitacagaagccttgggtggg 
AGATTATTGCTTGATATA<rrrCTATTTGt^C^C^TTTTTGTTGG^**«A 

cgttcgtatcggctggtgattcactggtcaagagctctcattggccagga 
gttcctatttgttgct gtaagattcaaat aatcaaaatactag aattttt 
cccccacaagaatgatgg 

>486.1 

TGGCGGCCGCCCGGGCAGGTACGCGGGAGTGTGGATTGAACAGAAAATTG 

^*^^^^*®TC*^GGGCTTCCCTTGGTrCGCCACTCATTTATTTGTAA 

CTTGACTGGGGTGTTTTCTGCTT AAAAATTTCAATTCTCGTGGTAACAAC 

GCAGAGTAGAAGGAGAGGGTGACTTTACCGAACTGACAGCCATTGGGGAG 

GCAGATGCGGGTGTGGAGGTGTGGGCTGAAGGTAGTGACTGTTTGATTTT 

AAAAAGTGTGACTGTCAGTTGTATCTGTTGCTTTTCTCAATGATTCAGGG 

atacaaatgggcttctctcattcattaaaagaaaacgcgacatctttcta 
agattctctgtgggaaaatgactgtcaataaaatgcgggtttctgggcca 

AAAATTATAAATTTATGGAATATATAATACTAATAGAATAATGTT 
>487.1 

ggtacttgttatttgtttctattattactgtttgactt ctccccagggtt 
cagtcctcaaggggocatcctgtcccaccatgcagtgcccctagcttaga 

GGCTCCCTCAATTCCCCCTGGC^ACCACXX^CCCACTCrrGTGCCT 

gaggagtcttgtgtgcattgctgtgaattagctcacttggtgatatgtcc 
T ATATTGGCT ^^gaaacctggaattgtggggcaatctattaatagct 

GCC^AAAGTCAGTAACTT^^ 
TTTTGTATTCAGGGG I ll l l igtgt 
>488.1

" 1 ' I /T? 1 1 1 1 1 1 1 1 ^GGTGCTTrATTTTCAATATTTGTCTTATTA 
ATATTT7TCTTATTTTATAATGCAATTACAACGGTTTAGGAGACAAAACA 
ATATAAAGAAACGAATGTTAAATAG I l l l ll l I A AAAAATAGCTTGTTGC 
^CAAGAAAGTCCATATAATCTTATTO^C^ 

TTTGCACT AAACCAAAATAGCTTATGGAAAATTAGTATTAAATAGCT AAA 

^ C /* GAAAA ^ A ^ CTAT *AATAA^ 

GTGATGCTTAAACAAAGCAAAC 

>489.1 

GGTGGCGGCCGACCGAAACCTGGTGAAGCCCTTTGGGCGATTGGTGATCA 

1 1 lUi tCTGGTAA ATGA GAGCCTTTCGAAAACTTTCTGCCCTCAAGTATT 

CCTCTGCCCTT TTCAAAGATTTTTGGAGAGTCA A I 1 I MCI I 1 1 G I f cat 

ACT I CI 1 1 1 IC 

>490.1

AGTTTTCATCCGAATCCACTGGGGAATGGGACGA I || IGCI I I I G TTTCT 

TGGCCAGGAATCGCTTAATCCTGAAAGTCTTGTGAGAAGACATGGCGAGC 

AQ OSGj^TCAAGAA<^CAa*C^ 

GCGTCCTGCCCG 

>492.1 

tc^caa?at^a^^ 

>492.2

TGGCTTGAATGAAATTACAAAGAGGAGGTGCATTAAAAAATACCAGGAGT 

aaaatctcttgaagaattaaaatgacaggctaaaaataaatataa 



AOGCjGGGGGTGGCGGCGTTGGGTTGAGCG^GGCTTTTTGG^ 

£^n9i G T^ii TGAGcAA ^ TGGAccA ^ 

GCAGAAAAAACAG^AGA^A^
GAAJGCTrrTTTATCTCATATTAGTGCCT^ 

cacgtgcttttgaagatgatgatatcacgcacgttgaaggaagtgtagat 
CCTAI I Cgagatatagaaataatacatgaagagcttcagcttaaagatga 
ggaaatgattgggcccattatagataaactagaaaaggtgcctgtgagag 
gaggagataaaaaactaaaacctgaatatgatataatgtgcaaagtaaaa 
tcctgggttattagatcaaaagaaacctgttcgcttctatcatgattgga 

>494.1 

cggtggcggccgaggtactcatggttgctgtaaattaggcagccgttctg 

cag ggtt ttgctt agcc aggctcctctgag atctggct attctgtcttgt 

ggattttcagtcccggcgtacctgcgcgggcxsgttcgagcggtcgtccgg 

tcaggtacatatacattatgtaattaaaaagcgtgcatgtttatgtatta 

aaaataattggatttaaacaaatattatatatacattataacacctaaac 

gcataggctgttgttattcacaatagttataccaatattattaatga 

>495.1 

GATCTC^AGATCTGGACTTCTGTTGAAAAATTTTCCACGTGAGGTTTACT 

tatgtctgtaaagatgggaaaaaaatacaagaacattgttctactaaaag 

GATTAGAGGTCATCAATGATTATCAT7TTAGAATGGTTAAGTCCTTACTG 

AGCAACGATTTAAAACrrTAATTTAAAAATGAGAGAAGAGTATGACAAAAT 

TCAGATTGCTGACTTGATGGAAGAAAAGTC 

>496.1 

CGGGCAGGTACCGTGAAAAGGCACTTCTCCTTGAGAAGCCTGACAGTGTC 

gttaatgtcctgctggcgcatggtgaaaatttcagggcaacagtaaagca 
ccctctrtaatttcccttctccaagcccaagcttttgcaggtaactggag 

CGCTTCCT CATTTGCAT AATAGGCAGTTTCAATAACT GGGGACTTT 
>497.1 

gagctc^ccgggtggcggccgaggtactgggagcctcataaggctggctg 

TTGAGGTGTATTGACTGTGAAAGCCCGCATGTGAACTCACAAAA 
>498.1 

ACAGGGCCTTCCACTrCAGCTGACTGAATTTAGGCAGTTCTGGCCACTTC 
AGTTTCCGCACCCAGGCCTCCTG ACCCATGGTATCT ACGATGAGATCCAG 
CTGTCCATTATACACCGTCACGTTGATCCCT 
>499.1 

GGGGGC GGCC GAGGTACCTCAATTGATGATTTCTGGTATGACCTAGCAAA 
TACACTGCTTTCACTGAAAT7TCAGTCTTGCAATCTGCTTTGGGTTCCCC 
AATCT AAG ACAGAAACAT ACTCATTTTCCCATCACT GGACTTCCAGGTTG 
JTJTGAATTTTT AACTGTT ACAAACAAGGTGG CAACATTTATCTAGAAAC 
CTCTGGATATAGACGTAGGAAGCTTTTGGTATTTCCACT AGTGAAACTGC 

tgagttgaagggtatgtggatcttcatctttaataaatattaccaacatg 

TGAAAAGCCCGACAATGTCAAGGACTGGCAAG AGTGCCACATGTGATGGG 

tgtggaatggcagctcactgtagcaggtgctggggactcaattggggtct 

TGGAGAAGCACTTAGTT AT AGCAAGAATGTCT CATAA 
>500.1 

G^GGTGCGCTAAATGAGTGAGCGTAACXrrCACATTTAATTTGCXXrrTTG 

CGCCTCAACTTGCGCCCGCTTTTCCAG 

>501.1 

TGTT AAAGCCTGGGGGTGCCTAATGAGTGGAGCTT AAACTTCACAATTAA 

A7TGCCG 

>503.1

GC GGTGGCGG CCGAGGTA U I 1 1 1 1 1 1 1 1 1 1 1 1 H I II 1 1 IATGAA7TAT 
TTAI 1 1 IUI 1 1 CTCAGAAAAGGATGCGCCTCCACTT AGCAAGGCTGGGCA 
GGATGTGG 
>504.1

TGCCTAATGAGGTGAGCTAACTCACATTAATTTGCGTTGCTGCTCACTGC

CCCGCTTTCCAGTCGGGAAAAC 

>506.1 

ACTCGTCTTGGTGAGAGCGTGAGCTGCTGAGATTTGGGAGTt^ 
GGCCGCTTGGAGTTCTGAGCCGATGGAAGAGTTCACTCATGTTTGCACCC 
^^C^TATGAGAAAAT&WAGACTCCAA^^tS^tSS 
>507.1 

cggccgaggtacactcccacgaccacggcatggtctctttcatatggctc 



9?^P^ GA AAG GG ^^ GA( *^f A CGCT AAG ATCAGCAAATTCG CCAf; TTT r!r. 
^^^CCTTTTCCGCCCTTTTCXXX^ 

>514.1 

acctoogaaatcttaccttcagtcttctctgggacgcagtc^ 

CTJCCTGCACTOTCAGTGTCTTC^ 

^tgSSSa^^ 

G^CTCAAATCTGCTTTGGGAC^CAGGGGTCTTTCGAGGCCAGGTGTAT 
>515.1 

T2^^^^^I G ^ GT ^ GGTA ^ CGGG G A CGGCGGAGCTGGC 
J^CTCAGGTGGAGGCGGGTTAATTTCGCCCACCGGAATGATCACCAAGAf^^^^ 
CACAAAGJAGACCTTGGG^ 
CAAAGAACCAGAG^C^TACTCAGT^ 

GGG 5^ AGGGA i^ GAGATG ^ 

ag ; acg Igcgcgggcggcg^ 

^^^ G ^ GGG ^ GWAA ^ CACAGC ^^ G 
^I?f^ C £I G /^ TA ^ GT ^ TT1 ^ T TCCGGGA^ 

gaaaaacaaox^tgagactatctggcam 

>516.1 

agttatcatggtgttgggtaaattagc 

>517.1

acgcgggtgttgatccagttcttgctttt caacgagaaggatttggacgt 
caoagtatgtcagaaaaacgcac^^ 

AGACTAAACTCAATACAGTGGATGACTAGAAAGCAGGTTCTC^ 

gatgtgggtccttccctgggtctgaagaa^ 
ccagaccgcagttgcc^ 

>518.1

2£Sa^^ 
>519.1

ACCTTTGTCAGCAATTTTGACAGTCA7TAATG7TTGTCATAAT77TAAAT 

AAAGTGTC 

>520.1

actatgttgaataaatgtttttttccctttt aatttttctgcttccctag 
tgcatagaa 

>521.1 

CCGGGCAGGACGCGGGCGGCTCTT AGCGGTGGATCACTCGGCTCGTGCGT 

CGATGAAG AACGCAG CTAGCTGCGAG AATTAATGTG AATTGCAGGACACA 

TTGATCATCGACACTTCGAACGCACTTGCGGCCCCGGGTTCCTCCCGGGG 

CTACCGCCTGTCTGAGCCGTCGCTTCCAAAAAAAAAAAA 

>522.1 

ttggagctccaggcggtggcggccgaggtacacctccccaagctctcttc 
ctccggctctagctatataagacgtgcctgcttccccttcgccttccacc 

AAGACTGTAAGTTTCCT g AGGCCTCCCCAGCTTCCTGCATGCTTCCTGTG 
CAGCCTGCAGAACTGTAAGTCAATTAAACCTC I I NCI I I ATAAATTACC 

cagtctcaggtagttcttcacagcaatgtgagaacagactaacaacaatc 
aactcatggctttaa caca aaaaaaataggtaagttcaaaattaacatat 
taccacatccaacttcttt attcttgagaaaacaaaaaagtccaaaatca 
aaggaaagcacccgttttaaaccctgatatctttctcagggctcactgca 

GTCTGGCCATATCTCAAGCAGGTCT 
>523.1

acgcgggggagtgagagggaacgagagtaagagaaagaaagaagtgaggg 
gatgtaaactcgaataaatttcaaagtgcctccgagggatgcaacgggca 
aaaact gaactgttcaggcttcagattgtaactgacgatgtgaggaaaaa 
tgaggtttgtgtgattttgctaaaatgcatcaccaacagcgaatggctgc 
cttagggacggacaaagagctgagtgatttactggatttcagtgcgatgt 
tttcacct cct gtg agcagtgggaaaaatggaccaacttctttggcaagt 
ggacattttactggctcaaatgtagaagagagaagtagctcagggtcctg 
ggggaatggaggacatccaagcccgtcca 

>524.1 

TTGGAGCT CCACGCGGTGGCGGCCGAGGTACGCGGGGCTCTTGAGGAGTG 

AGACTGCAGGAGATGTGGGCCGTGCCAAAGAGATGGATGAGACTGTTGCT 

GAGTTCATCAAGAGGACCATCTTGAAAATCCCCATGAATGAACT6ACAAC 

AATCCTGAAGGCCTGGGATTTTTTGTCTGAAAATCAACTGCAGACTGTAA 

AT7TCCGAGAGAGAAAGGAATCTGTAGTTCAGCACTTGATCCATCTGTGT 

GAGGAAAAGCGTGCAAGTATCAGTGATGCTGCCCT 

>526.1

TAAAACTT AATGTCTTCCI llllll I CTCACTGGGTTTTTCATAGATCGA 
GACATGTAAGCAGCATCATGGAGGTAAGTTTTTGACCTTGAGAAAATGTT 
TTTGTTTCACTGTCCTGAGGACTATTTATAGACAGCTCTAACATGATAAC 
CCTCACT ATGTGGAGAACATTGACAGAGTAACA I llll l l IG GGGAAGAA 
GAATCCT ACAGGGTCATGTTCCCTTCTCCTGTGGAGTGGGGGGGGAAGGT 
GTATGGCCCCAGGGATGGCCATATTACTGACCCTCT ACAGAGAGGGCAAA 
GGAACTGCCA GTAT GG TATTG CAGGATAAAGGCAGGT GGTT ACCCACATT 
ACCTGCAAGGCTTTGATC I I I C II CTGCCATTTCCACATTGG ACATCT CT 
GCTGAGGAGAGAAAATGAACCACTCTTTT 
>527.1 

ACTCACAGTCACGCTCCTCTGAACCATCCTTGGGCTTCATGGGGTTGGCA 
TTGAGGATCCCTACGACAGTCCCCTGCT CCGTCTTCCAGAGCGCTTTGTG 
AACTTCTCCAAATAAGAACAAGGACACACATTGTGT CAGGTCACGAAGAT 
CATTCAGTTT CCAT ATGCTGAAGGTTTTTCCACT ATTCACACTCT6TGGC 
GTAACC IICII CAAT AT AAC CCCAAATGTCACCGAATCT ATTTCTTCCAG 
CTTCTCTGTGGCCATGl 1 1 ItCI I GATCTGAGACAGTCTGATCAGTTT 
>529.1

CTCCCCGCGGTGGTCGGCCGAGGTACATTGTATACTGCAGTGTCGTC7AC 
ATGGCATTGGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACAC 
TTAGATATGATAGTGAATGGCAACGTGACCAATTTTTGGTCTCAAGTTAA 
AATACCAAAAACTATTACAGTGTCTACTGGATTTATGTCT ATATGACAAA 
>530.1 

TTGGAGCTCCACGCGGTGGCGGCCGAGGTACTTGGAAACCCATTTGGATT 

AATTAGAGGTCTGTCTGAAGGAGTTGAAGCTTT ATTCTATGAACCCTTCC 

AGGGTGCTGTTCAAGGCCCTGAAGAATTTGCAGAGGGGTTAGTGATTGGA 

GTGAGAAGCCTCTTTGGACACACAGTAGGTGGTGCAGCAGGAGTTGTATC 

TCGAATCACCGGTTCTGTTGGGAAAGGTTTGGCAGCAATTACAATGGACA 

AGGAATATCAGCAAAAAAAAAAAAAAAAAAAAAAAAAGTACCTGCCC 

>531.1

CTTCCCTGATAATAAATCACTGGAGAACAAAAGCGAATAACAGCAGGTCT 

CTCTT 

>532.1 

T AACAC T CTTAGAAGACTGGTTTGTTCATTTGACATTGGGACGTGCACCA 

ATTTTTATTACAAAAATCAAAAAAGTAAAAATTATTACAATATTTGCAGA 

GTATAACCACTAGTTGCCTAGACAAAAGCTAATTTCTACAAAATCAAAAA 

CTT AATGCAGTTTT ATT AAGAGAGTCAAAATTCT CTCAGTT AACTGGATA 

TACATAGTGGTATATATCTTAAAGCAGAAAACCCCAAAAAACAAAAACAA 

GGAAAAAAGAAAAT ACATGTCAACAG TCAGTTAAAT ATTTTG ACCTGACA 

GTTTCT ACAAATAGTGATTTTCACTACATATAAAGGAATCTGTTACATGT 

GGTAAAACTTOCAGAAA C CAAGTAGGAAGTGTGGAATAAAAACAATAAAT 

TCAAACGCAGOCCCAGGCTGGGCCTG 1 1 1 1 ICATG 

>534.1

TCTGCAGATGAAAAAGCAGCTGAAAGGAGTCGTAAGGCTGGACCAATAAC 

CCT AAAACTGAAGCCTGATT ACTGGAGTGACAAACT ATTGAAAGAAGCAG 

AAGCGTTTGCTTATTATCGCCGGACACACACTGCCAATGAGCGGCGGCGG 

CGTGGT GAAATG AGG GATCT CTTTGAGAAATTAAAGATCACTATTTGGAT 

TACTTCATTCTrCCAAGGTTTCCAAAAGTCTCATTCTT ACTCGAGCCTTC 

AGTGAAATTCAGGGACTAACAGATCAGGCAGACAAATTGATAGGACAGAA 

AAATCT CCTGACTCGAAAACGGAATATTCTGATACGGAAAGT ATCGTCTC 

TTTCAGGTAAGACAGAAGAAGTGGTCCTGAAGAAGCTAGAGTATATTTAT 

GCAAAACAGCAAGCACT 

>538.1

CGGCGGGTTTTGGAAACACTGGACTGGATGGCACATGATCt^GAACTCCG 

CTCOGTTTGGC TrOOGAA GGATCCCACCAACTCATTCTAATCAGCGATCA 

CTGTTTTAATTTCTTTTTT 

>538.2

CACAQATCAGGCCTACCTCATTGGCATATTAAGAAAGTTGTCTCAAGTAT 
ATTTAGTGTTT ATCATTTT ACTATAGTTCTTCAAATGACT GACATTCATC 
TTTTOC CTAOCTCTAA A I I OU I FC1 1 1 1 1 C ACATTATCTTTCTTGATTQ 
CTTTTTAATAGAAAAACA 
>538.3

ACAAAGACATGGATTTACTGTGCATATTAGCAGATCCATACTGGAAAATG 

CATGGAGGTTTCATATACACCACTT AC 

>539.1

AGAAAATCTGAGACACGTGAGGCCAGACAAAGCAAGGOCGGGGCTGATGG 
CCTGGCTGCCTGG TGGTTGATG GTTTTGCTCCCOCTAC C I 1 1 1 1 1 I I IGA 
GTTTATTCT GATTGA 1 1 1 1 1 1 I ICI IGUI I 1 C I UGAT AAACCACCCTCTG ' 
GGGACAGGATAATAAAACATGTAATATTTTTAAGAAGGAAAAAAAAAAAA 
AAAAAAAAAGGGCOCOGGGCCCTCTAAAAATAGAGGGTCOCC 
>540.1

ACTT T ATTTG GTAAAAAAATGCTAATGATATGCAAACCATCAGCT ACTTG 

TAATPrTTTTGCTGGTGG^GGGTTTTGTCTCAATTTTC 

TGATCAGCGTGGTGGTTGCTGAAGGTTGGAGTGGTTGTGG 

>541.1 
caoggoggggggggcctgatgt A C i inr n i iiiiiinn igttaaaa 

GACA CAAGTA GTGATA TATCAACATCTGTTTAACTCGTGAGCG 1 1 1LM I 

TTTTCAACTTC HUM CTTTTCAGTGGTTTCTTCTTCCATTaC C HU G 

CTGATTTCCACTTTCAGTTTCt^TTCGTTCGCTATCTTCTGGTAGCCACA 

GCTCAGCTCCAATCTGCGAAATACGGCACTCTCTTTATTGACTACTGCTT 

CTCTCGGCCCCCGCGCTGGCC 

>542.1

GGGG CCGCCCGGGCGGT ACAAAATGTT AAGACGTTGTTTGTATTTGT AAG 
GCTGG TGTATTCAGAGAGCATATCTCTTATTCCTGACTTTCCACCCCCGT 
ATTTTGTAATGACCATGATCAATGTTTTTACTTTTTGTATAATGGGGTGG 
GGTGG AGTGGGGGCT ATTGACAGTCACCCTGAGGTCTTT AGaGGACCAGC 
TATTGTATCACCTTGGATACTTGAAGTTTAATGCTCAGTTGGGTCGGGTG 
GCATTTG ACTTGGAG GCTGGCATGTTCACCAG agcctg gggccct gtatc 
TGGGC AGCCTTTG AGGATT ACTTATGATATTGAATGACAGTCTT AAG TGG 
CAACT CACGCCCAGCTCATGCXX » IN I IGCCTGGACATGTGCT Al ti ll 
AT 

>543.1 

GACACAGCATGCACGCAAAACAAATTGCAATAATGTGATAAGTTCTTTAA 
AAGAGGTAAGAGCAACGTGCTTTGGGAGCAGAGAAGAGGGAGAAAGCAGC 
ATCTTGCGTGGATGAGCCAGGGGACACAGAAGAGAAGCCCaCTATCTCAT 
TTAATCTTTACAACTCTCTTGCAAGGTTCCCTGGTT GTGAAAATACATGA 
GATGAATCATGAAGGCCACTATCATCCTCCTTCTGCTTGCACAAGTTTCC 
TGGGCTGGACCGTTTCAACAGAGAGGCTT atttgactttatgct AGAAGA 
tgaggci ICIGGG^TAGGCCCAGAAGTTCCTGATGAGCGCGACTTCGAGC 

>544.1

ctgtcaatgccat



t attta atctaaaattttaccaactttgatttgtctggttaggatatttt 
gttttagtggatatgctrraattcggatcaattactgcagtaaatctcat 
occtaagcatgaaatgttgtcaacaaatacccagttccatttagttatca 

ATTAGCCCAAATAAGAGATACAAAGTATAACAGTGACCAACCTTGTAOCT 

GCCCGGGCGGCCGCTCGACCACTGACATAGACTGAAAGCAAGAAGAGTGC 

TGTGTTTGTTGCTATATXXXXTrCCAACACCTAAGGCAATGCATTTCACAT 

CTT gctgagagcagataactcaatacctggaact agaaaattagaatcta 

AAAGACGGAAGG CATCT AAAGAA CAGTT CCCATCATGCCACAGCTGAGAA 
AT 

>545.1

TG TTTCCTGTGTGAAATTTGTTATGCGCTTGAGAAATTCC^CACAACATT 
AC 

>546.1

CCCGGCAGGTACCTGATGCAGGGAATTQAAGCCAGACCCAAAACGGGCAA 

CCCAATAGGATGGCCATCTGCCCCATTAATGCCAGCTTGTCCAAGTGTAA 

TTATTAACAGTGCCCCCTrTCACTCTCCAAAGAGTCCCTGTOCAGACAGG 

TAATTGTGAAAGTCGCCTTCAAAATGACT GGCCGGTAAGGAAAGTGGAGT 

GAGGGAAGCAGGGTAGGTGGAGG7GTGAAAGGGAGAAGGGCCTCATCTCA 

GGGTGGCT GGACCTGCACCAGCATCGGCCTGCATGAATGTGCTCCTACTC 

TTQCCCAGGCTGAGTATCAAGAGAAGCAAG^^TCTAGATAAAAATGCAA 

ATCCAGAAACATCAGCGTTTTGAGGTTAACATGTTGGCAATTATTCAGCT 

TTATGA 

>547.1 

GGGCGCCGAGTCAGGTAAGCCCTGGCTGCCTCCACCCACTCCCAGGGAGA 

CCAAAAGGCTTCATACATCTCAAGTTGGGGGACAAAAAAGGGGGAAGGGG 

GGGCACGAAGGCTCATCATTCAAAATAAAACAAAATAAA 

ggtaccctttgtaatatcctttatataaa<xa 
agttctgtgacctgctctggcaaattaatcaaacccaagaaggg ggttgt 
GGGAACCCCAAmATAGCTATTCAGTCAGAAAAAAACAGGTTAGACAAT 
CTGGGGC1TGCGACTGGCATTGGAAGTGGGGGACAGTTGTGCGGGGCTCA 

gccttcaacctgtgggatctgacgctatctctgggtagatgaagtagaat 

TGAACTGGGGGACACCCAGCTGGTGTCCACTGCAGAATGAATTGCTTGCT 

tgatgtctagggaggccgcagaattatagcaggaggtgaaaagcacttct 
tattagcagtggcaagagaaaatgagaaggagcaaaagctgaaactcctg 
ataaaccaatcagatctcatgaggctcattaactataacaagaatagcat 
gggaaagact 

>549.1 

TGGAAAGGCTGCAACGATTGTCGAGTCGTATAGCGTCT ATGTACATATAG 

CAT 

>549.2

ATAGTCATTGGTGTAGAGATAGAAAATGCTTCGTACATGTCAATGGGAGA 
ATGGGTGGTACCACTACACCGGAACT ATCCCTAAGTCCATCCGCCTGGGG 
OGAAAGGAAGGAAAAAAAGA 

>550.1

CTGTGGAGAACCTACGCACTGCACGCCATGCCTGTTTCCTACTCAAGCCT 

CAAGACTTCTACCTTGATCTGCTTGCCTTCCTTGACCATCT ACCT AGAAC 

TAAGCGAGTCCCAGCTCCCAACCTGGCATGAGCTTGGAGAGGGTGGACCG 

CCACCCTGCCTGAACCATGGAGACAGCC 

>551.1

GGCGGCCGAGGTACATTAGCAAAAACAGTGGACTTTGTGACCTTGAAAAA 
GTCATTTAACATCTCTGAACCCTACTTTCTAAGTCTCTACAAGTAATATA 
TAGTGGGTGAGGTQTTCTl ICI I IGI ICTGTTACTCGGATGTGAAACTCT 
CCTTTT GTAGATGAAACCATTGCGTAAGT AATATAAAGACTTTTCCCTGT 
AGTTATCTT ACAGACTGGAGAGAGTGCTAGTGAATGCTTTTGTCTTCAAT 

gcccatctcttggaaatattgaaggtggagtagcaaccgggcattatatt 
atctcttggaaaaggacctcagcaatggagaatatoxx^tcatcacaac 
tgtcatcactct gccgcacgtgattgtggagaatatccctctccatgtga 
atgcagaatgagattcatttacaaaacgaa 

>552.1

TGGCOGGCCGCCOGGGCAGGTACT ACAATGATTCTGAAGCACAGTGTATT 

cagacagatacagtgaaccaagtgcaatatgtaaggatgaaagaagaaga 

gatgacaaagaaatocaagtaaatgccttgtctttgcaaatgtttttata 

ttaaatcataagggaagggaactactgccttaaatgttatcaaaagagtt 

TTGT AACAAG GTT AAT ACCTTAGTTCTT AAC A 1 1 1 1 1 1 1 IUI 1 7 ATGTGT 

AGTlil 1 1 1 CATGCTACCTTGGTAGGAAACTT ATTTACAAACCAT ATT AAA 

AGGCT AATTTAAATATAAATAATATAAAGTGCTCTGAATAAAGCAGAAAT 

*^ATTACAGTTCATTCCACAGAAAGGCATTCCAAACCACCCXAATGACCA 

AGGCATATATAGT ATTTGGAGGAATCAGGGGTTTGGAAGGAGTACGGAGG 

AAGAATGAAGGAAAATGCAACCAGCATGATTATA 

>553.1

TGQCGGCCGAGGT ACCWTCTCTGCCCATCACCGCT GGAATTTTGATGAC 
CT ATTGCAAAAGATCTGGGACTATCTGAAACTAGTGAGAATTTACAGCAA 
ACCCAAAGGCCAGTT ACCAGATTACACATCCCCAGTGGTGCTTCCTTACT 

TGACGTT ATTCT ACAGGACT ATGAACCTTCATATCCACATTT ACAGTCCG 
QACAGATAAAGGAAA AC AACCCAAATCCAGGAGGCAATATAAAAGGAAGA 
GAACAAAACACACATTJ^T ACACTCACACTTAAAAAT AGGGGAAGACCAA 
CAGGGGAACI ITCGTTCTC1 1 CTGGATGTCT ACTTAAAAATCCCATGT GG 

>554.1

7GGCGGCCGAGGTACTCTTGAGATTGCTTT AAATTTTGTATTGAAACAAC
^^^* ^T^GC ^CrGTA6TAATGGGAGCACTAACrCTTACAACAGTTAG . 
TGAATOTrTTTAAAGAATCAGTTCAGTGTAGA^ 

TCCTGTGCTCTACAATAGCTTAGTGCAATGTGCACTTCTGTTTTACTTGC 
CATTTTCCTGCTCTGTTTTCTCTGTGACATGAAGCAACAGAAACTGAGAT

CAAAGTTAAGATTATATCCTGTTTGTAGTATCAGATATTTTTCTGTGTAC 

ATTTACATTCAAG TTGATAACACTGGTGGTTTCATTTCAATACAAATTAT 

GCTAGAGAACTGACArnTCAGACATGGTGATATATATGCTATTTGAATT 

CCTTTATCTTGATACAGATCTTGATTGTGAATCTCTGATGATAGATGTGC 

AGCTAATTTGTCCCGAAACTCATGAAGAT 

>555.1

TGCAGATGTTGTCGTTGGAAAACTG^TCGT^ 

GACCTTGCTGCTGTCTTGGTTGTCAGCAAGAAGCT GACACACCTGTGCTT 
GGOCAAAAACCCCATTGGGGATACAGGGGTGAAGTTTCTGTGTGAGGGCT 
TGAGTTACCCTGATTGTAAACTGCAGACCTTGGTGTTACAGCAATGCAGC 
AT AACCAAGCTTGGCTGTAG AT ATCTCTCAGAGGCGCTCCAAGAAGCCT G 
CAGCCTCACAAACCTGGACTTG^GTATCAACCAGATAGCTCGTGGATTGT 
GGATrTCTCTGTCAGGCATTAGAGAATCCAAACTGTAACCT AAAACaCCT A 
CGGTTGAAGACCTATGAAACT AATTTGGAAATC 
>556.1 

CT CAAAATAAAGAGGCAACGGCCTTrCTCTTCCTCTCCATCTCTCTATAG 

CACACCTTTTAI I ICI 1 1 ICI ICI I 1 1 1 1 1 AAGCCTCACGAAAGATTTTA 

CTTGTAGATCAACTTTCAAAATGTAGGAAGTCAGAATGGGTGACATCATC 

AGAAAAATATGTGGAGCTGATCACAAGAAGTGAAGAACCCAGAGCACGAA 

AG CGGTTG TGACT CCTGGGCCCAGGGAGTTGACAGCGTCT GGGCTTCAGA 

GGAGCCAGCGCCTCCGAGTTGTCTTGAAGTGAGGCTCT GCTGT AGTCCTG 

TT CTTGTG GCTCT AAQATCTGAAT GTTGTGACCACT AATTTGCTC 1 1 I C C 

TGGAGGGTAACCCCAGTTTGGTCCACAAGGCTTGCTGCCCAATCTTTTGC 

AACAGTTGAACCA 

>557.1 

C TCCG GGTGGCGGCCGAGGTACTGGATGTCAGGTCTGCGAAACTTCTTAG 

ATTTTGACCTCAGT CCAT AAACCACACTATCACCTCGGCCATCATATGT G 

TCTACTGTGGGGMCAACTGGAGTGAAAACTTCGGTTGCTGGCAGGTOCGT 

GGGAAAATCAGTGACCAGTTCATCAGATTCATCAGAATGGTGAGACTCAT 

CAGACTGGTGAGAATCATCAGTGTCATCTACA

>557.2

ACGCGGGGGGAGCGGGCGCTACCGTGTGCGCAGAAAGAGGAGGCGCTITJC 

CTT cagcttgtgggaaatccggaagatggccaaagagaactcaactgtt c 

GTTGCTTCCAGGGCCTGCTGA1 1 1 1 1 GGAAATGTGATTATTGGGTTGGTG 
OQQGATTGCCCTGACTOCGQAGTGCATC I IC I m g gatctgaccaacaca 
GCCTCTACCCACTGCTTGAAGC 



1 1 1 reo ICI I A CAGTAG TCGG CGTAGGCCTTAGGTGGGTTOGTGCGOCT 

tctacctcgcim 1 1uugi 1 1 loctggctcctcggccci 1 1 iiicccug 

ttgcagctgggagcggacgaagcgcgaagctgggattitttactgtctcc 

tgaagaatttaacacaaacatggatatcagaccaa^tcatacaatttata 

tcaacaatatgaatgacaaaattaaaaaggaagaattgaagagatcccta 

TATGCCCTGI 1 1 1 ctcaatttggtcatgtggtggacattgtggctttaaa 

>560.1

ogcggtggcggccgaggtact n n 1 1 1 1 1 1 1 u 1 1 1 1 1 n 1 1 1 i g atog 

gcaagcgacgcttagacaggcgtagccccgggaggaacccgggg^ 

gtgcgttcgaagtgtcgatgatcaatgtgtcctgcaattcacattaattc 

tcgcagctagcttgcgttcttatcgacgc^cgag^ 

taagagtcgccccgggtc 

>561.1

cgggtggcgggcgaggt accatgtgggaagcgctgtgaagagttgttgcc
tttcaagatatacccaaattcccagttccagcccgtgtcattaaaactcc 
TGACGTCCTTAGCCCAGCAGCTGCAaCGACTOGCCC

tccctqaaagggatgccagccttttatttagagatgaagttgcttctttg 

TTATT^GACCCTAAGGAAGCGGCCACAATTGACAGGGACACCGTCTTCGC 

CAjrGGTGAGCCATCTTTTAACTTAGAAAAGCTCTTGGAAGCGTTTGTTT 

TCTGGATGTTACTG 111111111 CCCCCCTGT7TTCTCTTCTGTACCCGT 

GCTCTTCCTTAACAGTTTCTGCATGTTGATGTATA 

>563.1

CGATAAGCTTGATATCCGAATTCCTTGCAGCCCCGGGGGGGGATTCCCAC 
TTAAGTTTTCTT AAGAAGCCGGGCCCCGCCCCCGGGGGCCAAGGGTTACC 
CCCCGGGGGGGGGCCCGGG 
>564.1 

ACCAAGTAGGATAATTACTACTGCCAACACACACATGCACGCATGCACAC 
ACAGACACAGATGTATGCACGCACACACACT CTCACTCCT AGACTGCTAA 
AAGCAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCC 

>565.1

GCCAGA CGCTCC TTATGAAAGTACCAAGAAGTGGGAAGCGGGGTGAGCTG 
CTGAAGAi i i 1 1 GGTATCGACAGGGATGCCATTGCACAAGCTGTG AGGGG 
CCTCATCACCAAGGCCTAGGGCGGGTATGAAGTGTGGGGCGGGGGTCTAT 
ACATTCCT GAGATTCTGGGAAAGGGGCTCAAAG 
>566.1 

TACGCGGGGGGGGACTGGAGGAOCTGTCTGGTTATTATACAGACGCATAA 
CT G GAGGTGGG ATCCACACAG CTCAGAACAGCTGGATCTTG CTCAGTCTC 
TGCCAG GGGAAGATTCCTTGGAGGAG G CCCTGCAGCG ACATGG AGGGAGC 
TGCTTTGCTGAGAGTCTCTGTCCTCTGCATCT GGATGAGTGCACTTTTCC 
TTTGTGTGGGAGTGAGGGCAGAGGAAGCT GGAGCGAGGGTGCAACAAAAC 
GTTCCAAGTG GGACAGATACTGGAGATCCTCAAAGTAAGCCCGTCGGTGA 
^^CJGC^C^CCATGGACCCAGAGA^ 

CCATT AAGTATTTCAAGGAAAAAGTGAGCAGACAGAATCTGCT ACT CCTG 

CTGACTG 

>568.1 

CTTGGCGGGCGCTTTAACCCGAGGACCXXrrCCCCCCGCGTACGCTGGATA 

GCCTTTTTTCCAGAAAGAGAGAGTAGCGCGAGGACAGCTAAGGCCACGGA 

GCG AGACATCTCGGC CCGAATGCTGTCAGCTTGAGGAATGGCGGCGTACC 

TGCCCTTTTCTTTTTT 

>569.1 

GGGAAAAAI I IOI I II 1 1 1 AGATGTCCACAAAAAAGGACATGTAAAGGGG 
AAGGTC^GTTGTTGAGACAGCTACTTTATTCTTGGGATGACTG 



GGAGGTGGTGGAGATGAGCCTTGTTTGCCAGATTTCXX5TTt3GTAGTTCAC 

GAGTCGTTGACCCACAAGGT

>571.1 

CJGTAAG^GGGACCTCTCATGGTTAC^^ 

CTGGAGAACATTCCGAAGCCGTTCTT ATAAGTGTCTCCATCTCTACCTGG 

GCT GAAATGG AATGTGCAAATGTAGCCCAGCCTGGTCCTTGGGTGTTGCC 

AGTTGATTGATGACTGGGAGCCAAAGTGGCA 1 1 1 TCI 1 1 GACCTAAACGG 

GCGATGATGAAATAAATCGAGOGGCCGCCCGGGCAGGTACATCTGTGAAT 

GTGAATGCCAAAGGGAAGGCATCCCTGAAAGTCCCAAGTGTCATGAAGGA 

AATGGGACATTTGAGTGTG«CGCGTGCAGGTGCAATGAAGGGCX5TGTTGG 

TAGACATTGTGAATGCAGCACAGATGAAGTTAACAG 

>572.1 

GCCTCTACTT cctctgataaaaatgttgggaaaacacctgaattaaagga 
agactcatgcaaci I Gill ictggcaatgaaagcagcaaattagaaaatg 
agtccaaact attgtcattaaacactgataaaactttatg 

>573.1

AAAG gaaaacacaaaagcgcaagccgg ACACACACACAGGACAGCGAAG 
GCAAC 
>6/b.i

TTGGAGCTCGCCGCGGTGGCGGCCGAGGTACGCGGGGTAGGAGCCTCTCT 
CCCTACTGCTGCTACACAAGACCCTGAGACTGACCTGCAGGACGAAACCA 
TGAAGAGCCTGATCC MC I I GGCATCC 
>577.1 

ACAGAG ACCTCCTTACTT ACCCCCCTTCTCCTTCGGCT G GAGCTCGGCGA 
GCGAGAGGCGGCGCTGGCGTTGGAGAGCGACGGCGGCCCCCGCGTAAGCA 

gtggtaacaacgcagagtaacgggggaatgaagaatcttaggcgggtgca 
cccagtttgcaccatgattaagggtctttacggaataaaggatgatgtct 
tcctt agtgttccttgcattttgggacagaatggaatctcagaccttgtg 
aaggtgactctgacttctgaggaagaggcccgtttgaagaagagtgcaga 
tacactttgggggatccaaaaggagctgcaattttaaagtcttctgatgt 
catatcatttcactgtct agg ctacaac 

>578.1 

ttggagctccacgcggtggcccgcccgggcaggtacctcacaacgagttc 
agtcagtagcagaaggatcttctctcttgttcctgatgatttcaaggtcc 

TCACAGTCCTG AT AATCTGGTTCTT GCCGAAACTCCCAAATATCTATGGA 

GAGCTGTTCTAGCTTTTGCACAGGGAACCAGTGGACAGAGGTATCATTAA 

ACATGTCCATGTATTGCGAAGTCTGAGGAAACTCAAGCTCCTCCAGTCCT 

TTTAAAATCTTTGCAATGTAGGGATAATTTTTCTGCAGAATCCTTGCCAA 

CAACCTCTCCTCAAGTCCTTTGAAACTGTTCCCAATGATGACCATCTTAG 

aaagggcatctactgacx^ttactccataaaagattgttgtacctcggc 
cgctctag 

>579.1 

GGTGGCGGCCGAGGTACTTTGGACAGTGAGGGTTCGATTCATTTT AGGGG 

tagggttgg gg gtggg agtgggagtgtgggttggcaggaggaagaatgag 
tctactttggagacaattaagtcatggtao i i i i ii h i ii m l i 1 1 i i i 
tttttttggctacatagacatctttctcatgtattgttactagaacaact 

TGTAT AGGGTTTTATGGTTTGG GGAAAACATTTTTAAAAAATGG ACTT AT 
CTCTATTATACAGAGTTATAATATAAAAATGATTTAAAGGCTATATTTTT 

caggatgtaggtagctacactgtaatcctgttgaagaaactttcct attt 
aagcttataggatgaaaatatataattaaagtcttctgatcatagctt 

>580.1 

ACC atccaaatg cttccctggtcttgatgatct cttccagagtcgatctg 

agtggccttttctgcaccctcccctt ctttctctttgaatggaattaaac 

c^aatttggaaacaacattgacccagtcaaaagcttctaatggtttcttt 

ttctrcctccagttttagtttgctttrattaaaaaaagaaaatagtgcat 

ggccatagctgcttcagttctcttattgcagactaaccatcaggatggta 

tcaaagcacaaatactttggaggggaatgcgttgaactggggcaagt 

>581.1 

cacttgacttcgctggcgc 



ttttagagatgagctcacoscggtggcggccx^ggtaccaaattgtaaaa 

tactcgaaggccttcaggaaxxrrgtgaxn-gatttagataaataccagaac 

ctattttggatgaggtaaaagacatgtgctcatctccaattacagtttca 

AGCTGCTGTCGG CCAACCCT ATGAGOGGGGAGGCCACAAAGCATAAG AAT 
TCTTTTGGGATTACACTGACATCAATAATTTTTATC^CTATCTTCCATTA 
CACTATTGTG CACATT AAGCCAATTTTCT GATCATCACA 

>583.1

CAGACAAGCTTTCA 1 1 1 1 1 1 1 1 1 CAAATCCGACATCTACTCCAACTACAT 
GATACACTAAAGTGCTTGCTGTGTGGGCTTCCAGGGGAGATGAAATGGTA

agtcgggctgcagcatctctgttcaaaatatacaggaatttctgtttctc 

AATGGCACTAATCATAACGGCTCGCCCnTTGGGATCCACAGCTAAGAACT 
GGCCAGGAACGA
GAGCT ACCCGCGGTGGCGGCTGAGGGACATTACGTATTGGTTATACAACA
TTTGTTTAATAAATGCAACTAACAAAGCTACACAAGACTTAGATATTGAA 
GCAGAAAAGGTGGTTTTACAGTCCCTG C ATTAACCT CT AATTCTTACTAC 
CCTGGCCAAGAAAGCATTTTCACCTCCTGCGCTTTCCTTGCTGTGTGCTT 

gtggttggttctttcttctcaggctttcttattctgatgctgagatagtt 
ctgttcacttagcaacttgggacagtgacacagggtttgttct gtacaag 
caggttatccaagaggcatccataccctgggttttcctccaaccataagg 
aaaattgatgc^ctgtttctgacaaggaaaagaagaaaacatacttctt 
tgcagcggacaaatactg 

>585.1 

tagtacctgg gccac caaacacagctggactcaatatatggggaaggtaa 

GTGTCCTCAGi I I r iGGAGAGAGATTACCCTCTTCCAAAAGAGTGCTTGA 

ttctggtagtccaagctgtctccgtctggtggcacccx:aatttcccctgc 
ctagacccacctcctttcctcagcccccttcgcctgccgctgaaaagtga 
gagcgggctcttgcgtcccccgcgt 

>587.1 

ttggagctccc^cxsgtggcgcgttcgggtacagctttaaagcatcataa 
tgactaattataggtgaataattttacagacagtctatattctaggaggc 
agct gtaggcgttttaattggaaataagcattctgagataatgataatag 

GTCCTTCTGGGATTTTTATTTTAATCATCT cctccacagag aacaagcag 
>587.2

i n I n I I M I I I I I i ittggggtttattttatgcacaaagagccatcgt 

GGn 1 1 1 1 ATT AG GTAGATGCCCTGGAT AATCCTTTCAAGGAAGATCACT 

tagtccaacttaatgaaaccaatatccttcgcatact 

>588.1 

TGGACAGGGAAGAGACCAACGACTGGATCCTTCCCTCAGACTATGATCAT 
GCAGAGGCAGAAGCCAGGCACCTGGTCTATGAATCAGACCAAATCAAGGT 
i i i i i I GCTGTCCAAGGAGGAGATCGCTGACAAGT atgacttatttgttg 
GCAGCCAGGCCGCAGATTTTG GGGAGGCCTT AGT 
>589.1 

CCAGGGTCTCT GGTCTACCGATGTCAAAGCAAATGAGCACAGCATCCGAA 

TCAGGGTAAGAGAGGGGGCGGACATTGTCATAGT AAGGAGAATCCG AATT 

TTTCCACAGGCTCAACTCTATTCTTTGTGTGTCGATTTCAAAACTGGCCG 

TGT AATTCTCAAACACTGTAGGAACGTAATTCTCGGGGAAGCAGTCCTTG 

GGGAAGACATGGAGCAGCGCAGTTrrTCCACACTGACTGTCTCCCACCAC 

AACTATCTTGCAT 

>591.1 

TTGGAGCT CCACGCGGTGGCGGTGGCXXX3GCAGGTACTCAGGTT7TATCT 
CTGCACTCCAAGTAGGATGAATAGATAAGAGCAAAGGCT CATGTTTGCCA 
AGTCTGTCCTTTT GT AACAAAAAACCCAGCAGCTTT ATCAAGCAGAATTC 
GACGTGTATTTCTT AACTTGCCAGAGCTGAGTCTCATGGCXJkCCCTTAGC 
AGGAGTTGGGGAGGTATTTTTAACAAGG CACATT ATCATCTCCCCCACCC 
AAAGTGGAGCTATTGCTAATGAAAAAGATACAATGAGATGTTTATGAAAT 
TATCTGTAGCT ATTAATGTCAGGTTTTTGA^ATTTACTGACCTGGAAGAA 
TACT CAT AATGCAATGTCAAGTGAGAAGCAGGACAAAGAACATTTGCAAT 
ACAG 
>592.1 

CCCGATTTAAAATTGGTGAGAAGTT<XnTCGGCTGG<KrrGAGGAC^ 

GTCATGGGTGGATCTCATGGAGAGAGGGCG^GGACAGGGGACCGGTCTCC 
CAAAGGAGT 

>593.1

CCGCCjGTGGCGGOCGCCCGGGCAGGTACATAACT cccgcaggatctcagg 
GOCT GCCGCCCCATTATGATGATGTCXiAGGTTTTCATCCTGCAG CTGGAG 
GGAGAGAAAC*CTCGCGCCTCT/^^ 
GTACCT 
>594.1

GTG CG ATTCTG G ATGACAAAG AAGATGCTTACTTCACAG AAATTCGAAAT 

TTCATTGGGAACAGCAACCATGGCAGCCAATCTCCCAGGAATGTGGAGGA 

GAGAATGAATGGCAGTCATTTTAAAGATGAAAAGGCTTTGTCGAGCGGCC 

GCCCGGGCAGGTACTTT 

>594.2 

II I II I I II I 1 1 1 1 I 1 1 I 1 1 I AAGG AGCTTTTATTGTTTTAGT AATCTTA 

ACATAACTTAAAATAAGAGAGGGGAAATGACATCTGGAGATCTAGGTATG 

TGGCCCATTGCAATTGAGCACATTTCTTGGGTCTGTTTCTCTATCTCTAA 

GGGCAGTCTCAAAACCCCAGCTCAAAATACGACACTAACATGATGAACAT 

GCATGAGCTTTGAAAAGTGCTCTGTAGTCTTATGATGATCTAGAAGAGCA 

CTGTCCAATAGAACTTTCTGTGATGATG AAAAG ATTCTACTTTT GA 

>595.1 

TCACX5GGTGGCGGCCGCCX:GGGCAGGACATGGCCACCAAGTAAGAATGGT 

TGGTGACAACGACAGAAGGCTAAAACAGGAAGGTAATCTTGTGCACCTGA 

CAAATAGAAAGAATAAAGGATCAAAATTGAAGGCA 

>596.1 

GGTGGC^GCCGCCC^GGCAGGT ACT ATTT AAGAAAAGAACAAGGTTAACT 
AACTAAAAGCAGAACTCACTTATTTTTTGCTCCCTAGCCAATTAAAAATA 
AGTTCATT AAAAG CiACTTGAAATTAT ATATTTAACCTG AAAAAAAGTTGC 
TAAAATTCCAATATAAATGTAAATATCTTTAACTTTGCTTAACCCAGCT AT 
CCCCAAAACAGTGTAGTGGGGCAAAATGTTCAAAAGAAAAATCATCCAGT 
GCACGT AG ATGG G CACCAAG AAGCTAAGCTTCCCTGGCGCCTAC 
>597.1

CACI 1 1 M I 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 I GAGTTACTCTGATGTTTATTTTA 

ATGCATCTTAGTCCACACAGTTGGTATAAAATCAGAAAATGCAAAGCAAA 

AACAAAAGGTCTGGAGTCTTAGCATCAGAAGGGCACCATATATACATCTA 

CAGTTGGTGGCCAATACAAGTCATTGCCAGACAGTCCTTGGAGGCACAGA 

ACAGCCCAG ACCCAGCCAAGCTCTAG GAACTCACGGGTCCCAGGGAGTTC 

TAAACCCTTGTTCTGATGCTO>ACXX3TAAAA>^ 

AGGCTTTATGATTTACTCATTATCCCGCGT 

>598.1 

AAAGGCGCTGGGTGTTCAAAATAGGCTCTCCTGGCXX^CGGCTGACTGTC 

TTCCTTGTGTCTCT ACAGTGGACGTGACTCTGGACCCAGACACGGCCTAC 

CCCAGCCTGATCCTCTCTG^TAATCTGCGGCAAGTGCGGTACAGTTACCT 

CCAACAG G ACCTG CCTGACAACCCOGAGAGGTTCAATCT GTTTCCCTGTG 

TCTT GGGCTTCTCCATGCrrTCATCGCCGGGAGACATTATTGGGAGGTAGAG 

GTGGGAGATAAAGCCAAGTGGACCATAGGTGTCTGTGAAGACTCAGTGTG 

CAGAAAAGGTGGAGTAACCTCAGGCCC<XXWiAATGGATTCTGGGCAGTGT 

CTTTGTGGTATGGGAAAGAATATTGG 

>599.1 

GTCCGGTAGAAAATAGAGGTTCTGACTCCTCAGGAGCAAAAAACATAACC 

TGAAGAGGGAGGAAGTGGATTTGGGGTTCACX>TTTCTTGGGGCACACTT 

G ATT GAAAACTGAGACTTCTGAAGAGAAGGCCAGAAGATACAAAGACAGA 

CCAT GCCAGTTG^ATGCTGTCTTCCAAGAACAGAAGAAAATGATCCAGGC 

CCAGGAATCCATAACACTGGAGGATGTGGCTGTGGACTTCACTTGGGAGG 

AGTGGCAACTCXTTGGGCXXrrGCTC>GAAGGACCTGTACCGGGACGTGATG 

TTGGAGAACTAC^CAACCTGGTGGCAGTGGGGTATCA^GCCAGCAAACC 



atggaatccacagtggagcctgt 

>600.1 

tggcccaggtgaccaatggcc6caggctcca7 ggcggctggcttcttcca 
gcccttcatgtc^cxxxxjcttcccagggggcccx^ 



tgccgagtcagcctcccgcagg 
ggcgc^tggagccctccccacgag 

cccaatgcagagggtgacgcctcxrrcgtggcatggccagc^ggggcccc 
AGAGCTATGGAGGTGGCATGCGACCCCCACCCAACTCCCTCGCCGGCCCA

GGCCTGCCTGCCATGAACATGGGCCCAGGAGTTCGTGG CCCGTGGGCCAG 

CCOCAGTGGAAACTTCGATCCCCTACTGCTTCTCATCCCCCGGCAGCT AC 

ACCGGACCOCCAGGAGGGAGGTGGGCCOOCTGGAACACCCATCATGCCTA 

GCOCTGGAGATTC 

>601.1 

aatct gaaagtcagaggtgattattgat agtacttttgtattttg at atg 
gacagtttattcatttgcatacagttattgactttttcccagctgattaa 
aagatagtcaagaaattctgcaatatagctgccaaaatagacagctacat 
ttttatgatattgtcatcttttctg 

>601.2

i m 1 1 111 ic1 1 1 1 1 1 1 ictttagctattttacttaagcataatagccac 
aataggacatataaaagattataaatacw^gctttattatcttgacgtc 
ttgggt c i 1 1 iaagtatatacttttctgaaaggtatccai 1 1 i g t 

>602.1

catctgcgtgggcggccaagatcggagcagcgacgctgcgggct accccc 

atgccacccatgacctgtagggaccacctctagatgcctactcgactcaa 

ggacaacacaccat gtctc cgctcgatctggccaagctgaaccaggtggc 

aagacaacagtctcactttgccatgacgcacggggggaccggattcgccg 

gaattgactcxagctctocagaggtgaaaggctattgggcaagttttgga 

tgcatctactcaaaccacccatgaactc^ccatttccaaaaaacttaatt 

ggctgcataaatogggcgccaag 

>603.1 

CGT00GGGAAAAATTACCTGTCTTGACTGCCAT6TGTTGATCATCTT AAG 

TATTGT AAGCTGCTATGTAT GGATTT AAACCGTAATCAT ATCTTTTT CCT 

ATCTATCTGAGGCACTGGTGGAATAAAAAACCTGTATATTTTACTTTGTT 

GCAGATAGTCTTGCCGCATCTTGGCAAGTTGCAGAGATGGTGGAGCTAGA 

AAA A AAAAAAAAAAAAGCGCTTTTCAGTTTGTGCACTGTGTATGGTCCGT 

GTAGATTGATGCAGAATTTTCTGAAATGAAATGTTTGTTTAGACCGAGAA 

TCATACCGGGTAAAGCAGGAAATGACAAAGCTTGCTTTTTCTGGTATGTT 

TCTAGGATGTATTGTGACTTTT AACTGTTATATTA 

>603.2

ATTGCCAATATTAAGTAAATATAGGATTATAATATTGTATAGGGGTTTTC 

ACAAAGCTTTAGACCCTTTTA 

>604.1 

GCGTCCGAGACAATACAAAGTTACATTTTTGGACCATATTAAAACTGCAA 

GAAGACAGGGGTCTTACTGAAGATCTTTTAGAAAACTTAAATCCTGTCAC 

AGGATATTTAGACATGTGTAGAATGTAGCTCAATTTTTTAAAAAGTAACT 

GACCTAGAG GGTGAAAGTTGAAACTGACACATTTTCAAATTAAGATTATG 

CTTTATTTTGTAACAGAAAACAATGTTTAAACACAAGCAGATCT GTTGT a 

TGTAAT AAGTAACACAQ^ TTTTTT 

AACAGCAGTTTTCACACCAGAATTAGCAGTGCCCT 

>605.1

AGCTCCCCGCGGTGGCGGCCGAGGTACCCAAATACCACrrTXiAGGAAATCT 
GGCCAGATCACCTGAATCCAAATGTTCT ATTAATTCAATACAOGTTATCA 
AG TCAAATCCAAGCAAACGAGA GTC T CTC T CCACAACGGAGCCATGATAC 
AATGTGATGGTCAAATTCAGATCXXX3AGGTTTCAGAAAATCCCCCAGGAA 
AGGAGCT AAGGAATCCCCTCTCCATCGT AATTT ATCCTCATTAATATCT A 
CTCCAACAAGCAATTCAATGCATGGATTGACTTTTAGCAGCCTT AAGAGT 
GAAGTATCACCACATCCCAGGTCTGCAACCTT CTTAGGCT CATGTTGATC 
CACT AAATTTTTAACGAACTGGT

>606.1

GAGCTACCGCGGTGGCGGCCGGGTACTrAAAATAATTACTGGCAGTAGGT 
T AT AA I rGGTGGI 1 1 AAAAAT AACATTGGAATACAGGACtTGTTGCCAAT 
TGGGTAATTTTCATTAGTTGTTrTGTTrGTTTTGATTTGAAACCTGG AAA 
tacagtaaaatttgactgtttaaaatgttggccaaaaaaaaaaaaaaaaa

AAAAAAGGTCCGCGGGGGCGGAGGTCAGGGACAAGATGGTGCCACCGGTG 

CAGGTCTCTCCGCTCATCAAG 

>607.1

GCCGATGAGAAGAAGAAGGGGCCCAAAGTCACCGTCAAGGTGTATTTTGA 

cctacgaattggagatgaagatgtaggccgggtgatctttggtctcttcg 
gaaagactgttccaaaaacagtggataattttgtggccttagctacagga 
gagaaaggatttggctacaaaaacagcaaattctatcgtgtaatcaagga 
ctttatgatccagggcggagacttcaccaggggagatggcacaggaggaa 

>608.1 

agctcaccggggtggcggccgaggtatgcgggagctgagagaacagacac 

agacctgtcggaaggtcctctgcaggtccoccttccgctctgccgatcga 

cttccgcctcgggcagtcaacatactgccaaggaaatctgatgtggaaag 

gaaaatagaaatagtgcagtttgctagccggacacgccaactcttcgttc 

gatt att agcttt ag tgaaatg ggct aat aat g ctg gcaaag tgg aaaaa 

tgtgcgatgatttcaagcttttt agatcagcaagccatcctgtttgtgga 

cactgctgatcgcxttggcctcgttagctagagatgctctggtccatgcac 

gcctgcctagttttgccatcccatatgccattgatgt 

>609.1

gtggcggccgcccggcaggtacttccgccttgccgttagcttgtggagaa 

CGTGCTTC1 1 attcctggcaggcttcaagaacagctgcacatgtgccgct 

aactgaccgcgttgcxattggcgacctggactrctgaactcaggtttattc 

taaacccagtgagaggtgagggggagtgatgaaaggggatcagctgtatt 

tgtgtgtgtgtgtgtgtgagcaccrrgacaaatctatgaaaccgagtgaaa 

ggagaaat gtta gattctttattattttattatatttatatggaaagctc 

gactctccctttggtaagtccgaagcatgttgtctgttcgtccgtgactg 

tcttotcaggtctgtggcctgtgatttccagtcacccttgtagttactg 

acaggaaattgactggact 

>610.1 

TTTTTTCCT AT AT AAAGTGAT ACTGAAATATG CTAATTAATATATTAATT 
TTAGTTAAATGCTGCT AATATGCATACCTCTrACTrGAAGGTTTTTAATA 
TGTTTTGATAACTTrAATAACTTCAGGTGATGTCTGTATAATTTTTAAAG 
TGCAGCTCTCTCTAACAAATGTGCCCTACAACTCCTGATTAAACGGCGTC 

ttgaaggttcaaaaaaaaaaaaaaaaaaaaaaaggt 

>612.1 

gt ggcgg xxxsccogggcaggtaccaaagaagatggagttcaaaatactgg 

cagttttccaagaaattttgtaaagttgaacatggccatctactcttgcc 

ttaaaacttttctcaccacacgcaccitcccacatgcatgatatccaagg 

tcga cagac ctggattagaatccactctcaagcttctcatgcagtgggta 

ttgtattttctgcataagaaagggctg cctctagaacacagtaagtgtat 

ttgcccagtagtgacattgcctacatatagccaagtgttatagtatacca 

acttagtatatttttcaaggagagct aaaccaccttttgt aatggtttgg 

tttctcact gtt atcttcctttcctataattaatttattttaatct acaa 

attgacatagggctaaaagcttcaatattttacaaaatattaattaatgt 

aattgttcccaattattagaaacttttttccatttt 

>613.1 

ggggaaaggaaagctaactc^cgtctgttccaaaggcctctgctggtat 
tt actttacgagaggcccaccttatccaaagagctatatgccctgggggg 
ccttgatgggcttcacacagtacctgcc 

>614.1

AAGAAACATreTrrATAAATTTTGATCCCTCCGTTCAAGCCAATACCATA

atttaattacagatggatattatatggtaacgggtatttacagaaggaag 
ggtott attacggaaaaaflctaacqgcacgacgttt ai 1 1 1 1 ooqccaca 
atctttcaTacaggaact aacaaattg aacttt g Caaaagcact aaaacat 1 
cacatgtaaacccagctaacagaaaaatacattcacaagcgttgttggtg 
gtggtgtgtatgtgtgtgctatgggtcaatgtgctgaagaaacagaaggg

agactttggcacggctcatttttttcagtctatagttacatgaagtttac 

aattaggtt 

>615.1

1 tttttttaattttccatgtattggccttaatcaaact ATA 

AGCTGTGGAGTGGCCAATATACT(X^TTGTGATTAT ACACTGATTTCCAT 

cacctgcctttgtactatcaactcttattagattaaaagga 

>616.1 

GAGGTACTGTGCCCTL 1 1 ICI 1 CACT AGGTGACCAGAGTGGTTTTGACTC 

CTGTGGTGCJTGAAGTC>TTCTCAGGGGTCTCTATGACCTTXrCCCTCCT 

GCAG TTCACTCT AGTTTCTTCT ATTTCATCAT CCGCACT G CTCTT AGCAT 

CGAAGTCACTGTCTGCATCTGGTTCTCT ACTTT CACATGAGTTTGAAGAA 

TGCATTTC 

>617.1 

ACTAGCCT AAAAGCCCGTGACACTTGCAGCAGGTGCTTGCCACGCTTGCA 
CCCCTCCgA^ 

>617.2

GTTCCGCGTGCCGG<XAATTCAAGCAAGGTGGCAACCX»GG 

>618.1

TCACCGCGGTGGCGGCCGAGGTACTGGGACAGTTGGGTGCGTTATGGATC 
ATAACCTGAGGAGCCGGGGGAAGCTGGCCTTGGG 
TATATCCA CACAAG T GC 1 1 CI C 1 1 UACATTTCTCGAAAAT GGGAGAAGAA 
GAATAAAATTGTTTATCCTCCACAACTGCCT GGAGAACCTGAGACCAGCA 
GAAATCT ACCACTGTCGAAGACAAAT AAAAT AT AGCAAAGACAAG ATGTG 
GTAJTTGGCAAAATTGATACGAGGAATGTCTATTGAOCAG^ 

agttggaattcaatgacaaaaaaggggcgaaaa t aatt aaagaggtt ctt 

TTAGAAGCACAAGATATGGCAGTGAGAGACCATAACGTGGAATTCAGGTC 

tgccattattttgtgaagttggtggaagggccccca 

>619.1

aocgcggtggcggacgaggtacctactatgtgtcagccatgggg^ 
aagatct ataaggcacaagaccctcagtcttgtagtcgcctgacagccag 
ccagct acaacat aatgtggaaaggacaatggtgggaaatggactcaggt 
cttcctaatgcacagagtatgctcaggctgtgacataggaa 

>619.2

ggacttggagggaccttcaaaaaacatgt^ 

taaaag ICI I gatitaaaaaaaagaaaacactttctgtggataaagatag 

gctgcaggaaatgtaacct atgaaattttctcaaattagctttcagacac 

ACACAAAAAATTG^TTTGTTrGAGGAGCAGAAT^ 

aataaactactatttagtatctgagtgaagt 
>620.1

GGGCAGCTACATTCTAATTTTTATGAGACATAGATATCTArTTAT 

CAGAAGOTTAflAAACACCAAAATTAACAGCCAGT 1 1 1^1 I C ATTTTfrrr 
CrTGAAGAAGAGATTGGTGTTGACTATGGT GAGAT ATACTATGGCfTITriA 

^^^tf iCTGAJaA ^IP CCCGJ G^ACTTACTTTGACAATGTTGG 

^C^^TCTCAAA G r iC I 11 1 IG I I IG CAAA CCCI 1 1 1 1 1 IAGTAACCT ■ , 

GGCATGGTCCp^AATATAAACTAT^ . 
CACCGATGGTCCCCATGGCATITiAf^CAAAGAGATGAGA 
CGCGGTGGCGGCCGAGGTTAACGACGCCTGCCCATGACAGAGCCTAGGAA

ATCGCGATGACAGTTTACAGCAGGTAAAATCCGGTGGAGACCAGCAGCAT 

CCCCGAGAAGCCGTGCGATTGTTTGGGCGTATGTAACTCGCTGGTACTCT 

TGCGCCAG 

>622.1 

gagctcaccgcggtggcggccgaggtacatttatttcagataaggacaat 
aagtttactttgtatctgaactcaaaacaaagtagttgtatattttaaca 
ttcaaaattgggatttcccaatgtgacacjvtcatgaatgcaaacccctcc 
agcccatcagacgccaggctgcct actggtaatctgtgt atagtatataa 
acatgtaaaaat aggttgtattttactctatgt atgatgct aatcaatga 
acactttatttattttacagagaaaacttatctgtgaactttactatata 

TCT 
>623.1 

CGCGGGGCGGCCGCCCGGGCAGGTACAGCCATTGCTCTTTGAGTTTGTCT 
GGCT AGCAAAAAGCTGGCTGTGTTATGTAAATAAAGCCCCT AT AGT AATT 
AAAATTTAAAAAAAGTT7TTTAAGCTTGGCTGTTTTCCT ACCACTT CAGAG 
TC^TTGACCCCGTAATTTAGGTCXX^TTCAGATITGACAGACAGAAACAA 
ACAACAAAACAGTTAAGCAAAACT AACAATGGTCACACAAATTATAGAAT 
TTCTGAGTGCTCTAAGTGCATTGGAAGAAAGCTGAAACTCCATAAAAACA 
TCACCTGCCTTCCATCATCATGAAAGCAGGAAAACTrGCCTTCCTTGTTG 
>624.1

GGTGGCGGCCGAGGTACGGCGGGGAGCCGCCTGGATACOGCAGCTAGGAA 

TAATGGAATAGGAOCGCGGTTCTATTTTGTTGGTTTTCGGAACTGAGGCC 

ATGATTAAGA 

>625.1

CAGGTACAAACTGATCTTCATGAATGTGTGGTCCACTG CT I I I CTGTTTC 

TGTCACAGTAGCT ATAAACAGCTGTTTAAGGATATCCTT ATCT AAATTTC 

TGCCAATGAGGAjCCAATCGATTTGTTCTCTCAGTGTCATCCTTCCAGCTC 

ACTGAGTCTCTCACATAGAGCTCATCCCGCGT 

>626.1 

GATGAGTCCTAGCAGGCCCTGGCTCTTTGGCCGCTCGG 

TGCTGCTGCTGCTGCTGCTGGTGGCCCCTTTGCAGATGT attgct gtcct 

TGAATATTAGCCCATTTGAAAACGCCTGGGAAGTTCAGCCATOU3TATGT 

CAGTA 

>627.1

ACTTTI ICI I CAGAAAAATTCTCCTTGAGGAAAAATGTCCAAGATAAGAT 
GAATCACTTAATACCGTATCTTCT AAATTTGAAATATAATTC I G TT1 GTG 
AOCTGTTTTAAATGAACCA AACC AAATCATA C I I 1 1 IC I I IG AATTTAGC 
AACCTAGAAACACACAI I ICI 1 I GAATTT AGGTGAT ACCTAAATCCTT CT 
TATGTTTCTAAA 1 1 1 1 G I GATTCTATAAAACACATCATCAAT AAAAT AGT 
GGCAAAAAAAAAAAAAAAAAA 
>628.1

GGAAG^GGAGGCGGGTTCT^A^^^ 

CATACTCGTCCAGAACOGGCCTCAGCCTCTCCGCGCAGAAGTGC^ 
CATGGCGGTAOCT 

TACAGACGACGTCACCGTATATC1 IC I 1 1 ICGG 
CCAGTGGAGGAT ATCACCGAAGAGGACTT AGAAAA TGTTGCCAT AACT GT 
TCGAGAT AAAAT CTATGATAAAGTTCTGGGTAACACGT GCCATCAGTGTC 



GCGAGGACAG II C I GT GGACCATGCCTGCGGAACCGCTATGGGGaGGATG 
TCAGATCGGCA7TGCTGGACCCGGATTGGGT G 7 Gl CCCCCC I G I CGTGGG 
ATCTGCAATTGCaGCT ACTGTCGGAAGC

>630.1

ACA T AGTG T OGOGAACTCAAATCGGCATTT AGAT AGATCCAG TGGTTT AA 
ACGGCACG mi i GC7T A I AAAAA aaG I GCAAAAAAGATGTGGTTTACAA
GTTAAAGCTACAGAATCCCTTTTTGCTGTAATTGCACCAGTTTTAAAGCC 
TCTGGAGAGAGCAGATCGTT TAAAACT TTGTTTTTCTTAAAAGCTTACAG 

tgtttggctaattctcctcccci f tttacaagacgggggccggagggtgg 
acactggtggcaggttaagggatactgtcacttt aagaagcctgcagatt 

G AAGTGT AAACATG G AGAAATTAG GGGCTGATTTTTT AAACTGTGTGAG A 
TATTAACCAGCC^CCCTGTTATAAAATCAGGAAATCCAAACAGCGATTTA 
CACCGATTAACACCCCCTTrATATATTTTTT ACAAAAATACACT GAGAAA 
ataatcaaacgttttcatctctcttgtc I I | n I IGTTTTTTAAAAGTGT 
caaaagtctacatttaaatat 

>631.1

catcagcttgcctcaagtctggaaagaaattggcttgggctcatcaagtt 

gaagggaccaccaaaagagctaagattgcttgtaatactcatgtggcccc 

taggatgcaccgactggtagtgatgagccaggtttacaagcagacactgg 

ctaaga gctcag acactctggcgggggcacatgtaaagattcatcgttgc 

AACGAATC 1 1 I iatatatctgctctctcccttacgatctgtgacaattga 

gaagtgcaggaatagcatctttgtcttgggccctgtagggactacacttc 

acctccacagttgtgacaatgttaaagtcattgctgtttgccatcgtttg 

tccatctcttctacaacaggttgcatcttt 

>632.1 

cggccgaggtaccacactcagggcagtttccagctcctctcacaaacagt 
aaatct acacaactttcacagagagtgtgtccgcacacattcaccatcag 
cttcaaggaggggttccxiatatttggtggtcttacaccgagggcaaccct 
gatogtccatggcggtttccctcctacagactctcgcaggcgcctgtttc 

AGCCAGAGCCACCTAGAAGCCCCCTOCCOGCGTACCAGCACACTGTCCCA 
AATTACCTCTrCATTACCCAAATCAAAGAATCTTTCTGTTTTCCCAATCC 
TCAAAAGGAATGAAGAAAAACCAAAGAGCAAACTCAAAAGATGATTTTTA 
CCATAAACCTCAAATGTGGCTTAACAAGTACCTGCCCGGGCGGC 
>633.1 

ATTGCTGTTTGTTrGTTTGCTTGAAGACCAAGACGGAGTTGGGCCTCTTG 
ATTCCCAGTGGCTGCAAGAACT GGGATTCCCTCTCCTTCTCTCTCTTCCC 
CTCTCCCCCGCGT 
>634.1 

TTGGAGCTCCACCGCGTGTGGCGGCCGCCCGGGCAGGTACTGAAAACCAC 
TTCCAGAGTCTAAAGCAGCTCAGATGTTATCTCTGGGGGAATTAGTGTTC 

cg<ttcattt agcaacctccataccacaaggtctct gtct gtagtt actg G 

GAT TATCC AGATACACTATCAATGATACAAATTCATAGGAGTATTAATGC 
ATTTCTTT AAACACAACTTGATTAAGAAGCAAATATGTTAAGCAGTTTTC 
TTTTTCT GCTGCTAAATTACAGTTAGACACTTCAGT ATCTTCTCTTTACA 
tgtgt atat AAATTAGTAAGAACCTGGATCCAAAGCAATGTAGTGTGTGT 
ATGTATCTATATATATrTATTCTAACTCAGCACTTCAGAAGCCTTTTTQA 
GTTACAACAATATTTTAGTTTGCCTC^TCTGTAGAGGTAAAATTrCTATA 
TrACCAAGCTCCAGAGGAATATGATATTTTACAGGCACAATTTTCTGGCT 
GTAGTCCCTGGGGCATTTATTTGCTTGCtnXX^TGGGATGCTGTTAGAAC 

aa^^gttagccggcaagagaagaaaggctaccaggaca 

££ G i£ GTGGCt5GCCGATCTACA ^ T ^ 

TCTCTT AGACAGAACTCACATAAACAGACAAATACAAGAGGTTATTTTCA 
AGACAC^CACTTGCAAGTAATCTTTCTATAGAAATGGCCACAGCATTATA 
ATATTCAAAATATGGAAGATTGACAGTCTGAGGATTTCTAGGAAAAAAAA 
ATCAAAGGACTTGCCAAAAGGATAACTACATAACAGATATGACAATCTAC 
AGGACAAAAAGACAACATGTCACCAAATATTQTTCATACAACAGCGTTAA
TGGAAAACAGTAAAAGACCTTTTAGCAGTGTGCATGTT AAGTC7TTTAGT 

AACCACCACAGAAOCAGGAATAGCACCCATCACTGCTGC 1 1 1 g f cactcc 
AGAAAGCTGAAAGTCAACCGAACAATGAAAAAAAGTCAAAGAAGCATTTC 
CCTTT G AATTCAGTCCTAAAAATATGAATGCCTT AT AATT AATTTCAAAA
TAAGTATCTT ACAAGTGTTTCATGAAACATTGTTTTCCTAAAAGGCAAAT 
TCAACATT ATG AAAATATATATTTTG CCCGGTAGTTACTG AGAAATGTC 
>636.1 

CGGTGGCG GCOG AGGTACT AAAGGGCAAGGTTCACCACT ACAAAAAGG AA 

gttgtct aaaagcaagaattcaattaacgctgggt aagaaaagtcaaaac 
act aatgagttgtccatgaagcc aactg ct aagaacg cgctc aactat ac 
gcgacatgaagacactacgcacgaagccttacttggcgagtctgaatttc 
tattaactaag ggcag agtgagggagaac aaagagct actt ccgtaacat 

TTTAGTATCCAGATAGT 

>637.1 

AGCTCACCGCGGTGGCGGCCGAGGTACAGGAAAGGGAAGCACAGTTTGGA 

ACAACAGCAGAGATATATGCCTATCGAGAAGAACAGGATTTTGGAATTGA 

GATAGTGAAAGTGAAAGCAATTGGAAGACAAAGGTTCAAAGTCCTTGAGC 

taagaacacagtcagatggaatccagcaagctaaagtgcaaattcttccc 
gaatgtgtgttgccttcaaccatgtctgcagttc^ttagaatcjcctcaa 
taagtgccagatatttccrrcaaaacctgtctcaagagaagaccaatgtt 

CATATAAATGGTGGCAGAAATACCAGAAGAGAAAGTTTCATTGTGCAAAT 

CTAACTTCATGG CCTCGCTGGCTGTATTCCTTATATG ATGCT GAGACCTT 

AATGGACAGAATCAAGAAACAGCTACGTGAATGGGATGAAAATCTAAAAG 

ATGATTCTCTrCCTTGAAATCCAATAGATTTTTCTACAGAGTAGCTGCTT 

GTCTTCTATTGATGATGTATTGAGAATT 

>638.1 

GGTGGCGGCCGCCCGGGCAGGTACGCGGGAGAAAACTAAACCTTCATTTA 

CTGTGAACATCrTCTGACTGTGGCTTCCAGATGCTAGTTTACAGAACAAC 

CACACAGCAAGACC^GCTTATGCTGAGTTGACGGAACAATGAGTAAACA 

TAAGGATATTACTGTGACTTTGAAATTCTGAAATTGTTCrriTCTT AACTT 

TTGCATTAAAATCAC^TTTATTTTATAAAATAATG 

>639.1

CGCGGTGGCGGCCGCCCCGCACAG<3TCCTGGCGCTTAATCCCATCAGATT 
TGTA GATC TTAACCAGGCAGTCACCG^GGCCTCGGAAGTCttCTTTCAGCT 

ccagctttacccacatcagctgct agacgggt 

>640.1

ct gg^gcggg^m^^^rs^^^cgga^tgc^tco^^c^^ c 
ccgccaccatggagt 

>641.1

acgc gggt cttcagaaaccaggctgctttcaggaacattgctgtggattc 
ccagctttcagacaacac^tgactaagacagaatgagaccactctagttg 
cctcatg^gaaactcgggaaaagactgcaaaaacaacattgtttctccct 
ttggaattct ggagttataaggcagaggtcccccatctt cccgaactg gc 
ctattccgctagaagcaagatggctgaactcaatactcatgtgaatgtca 
aggaaaagatct atgcagttagatcagttgttcccaacaaaagcaataat 
gaaatagtcctggtgctccaacagtttgattttaatgtggataaagccgt 
gcaagcctttgtggatggcagtgcaattcaagttct aaaagaatggaata 
tgac 

>642.1

CGGGGGGGGCCX^GACTTGGAGAATATTTCCAGAATAGCCGATGACTTGT 

TCTT gttgacaagagaaagttcttt GGCTGTT ACCCTCAATGATAGTG AG 

GTCCATTGCCGTCT ATTAAATGGAGATGATTCCATCTT GTCT ACAGACAC 

TGAAATACCTGGCTAAAA«C«3CCTTTCCTCTGCGCTGCTAOCAGCCCTG 

TCACAGGTCCCGGCGCrCTACCTCCCCGCGT 

>643.1 

TGAGCTCCGGCGGTGGCGGCCGAGGCACGAGAAGCTCACTGGCTGTGCTA

aaccaaatgaatggaaagcgccaaaagtgattttataccaagggtccatc 

CAT ACAAAT AAAC AAAATCCT ATCCTCTT CTTTCTATATTGTGTTTCTTA 
CATTTCTT ATACAAATAACAG AATGCTTCATTrTATTCACTTCAATAGGA

CAAAGTCCT TAAAGA AAGACTGAAAAGAGCTGATAATCAAAATCCCAAAT 

TTTATGCTT ATTTTTGGTTTAGGG CTATCAATTTTCT GACAT ATT AACAT 

AGGCAGGAAAACATTCTCAGTAAATTGAGCATTTGAGTCTACAAATGTCT 

TGAAG CACTC TGGCAAGTTACATGTATCCCATGTTGCTTTTGGTTTCCCA 

TCTCTT CTTT GCTTCAAACCCCCATGCAAGTTTCTT CTTTTTTCGGGCAG 

GCTG TGAA TATTC AACCT CCTTTTT6GCTTTTACAAAGGTGTGGCAGGCA 

ACTGCTTTGGCA A I ITTTA C 

>644.1 

CG CGGTGGCG GCCGAGGTACACCCTCTGGCCTCT CCAAGCAAGCAGTG AG 
GTGTGCATTGTTAGAGGTGCACCGGGAAGGGAGCTTGGTTTCGGACCCCA 
GGACATCCTGTCCGCAAGCAGCTGCTACTTCTTGGGCTTCTCTAGAATAT 

tgagg aatttcccccgtgtcatctctctggactcatccag ccccag ctg a 
taggctaggttctgtaggcctcgaaccttct ccatcaaattagccgtggt 
gagacrrcgccagttctttcaacatgtcgatgtcatcacgttct atctcag 
ccatccatttgggtggagaactagtaataggacttttgaaggaagctgca 
aattca gcaac acctggtaattgttctggccaaagatctggtgaggcacg 

G J C * AG " rr ^ CA * A >*^ A GCAAAGATGOT 

I gggag ^ cgccatc ^ c<:aacgcatgt ^ < ^ t 

GOGGCCGC 
>645.1 

GAGCTCACCGGGTGGCCGGCCGCCCGGGCAGGT ACTTCAGGGAG GCCT at 
ATATTGGCACCCAAGGAATGCCAGGACTG CCACCTGCTGCTCCAG CGTTA 

GCCTCACTCGTGTGCTT actcactttgact gcctttttgtctatttctgg 

GAGGTTGGTAGAATGAAAGGGATCCTCCAAGGCAAGCAGATGGCCTGTCC 

acctcctatatattgacagtgccaatgagtgtagagtcttgctacaagaa 

ACAAAGTC^TGAGA AATCC CAGGCTTCXrrGTTACACCCAAAGACTGCTGG 
CCCT CCTACTCTATCCTTTAGACCAGAAAC nTTTCTTCI a agcacttgc 

ct accgggaaggttgaggagtc i rom i accgt 



gtggcgggcgaggtaccggccaagcctggtccc c i rci mi i g ggcactg 
tgtatgggcggagaaaatccagcttgttcttgctgatgacgcaaaggtca 
a ^gttgcttccggagcccaggtcxntgaagatgccagctgcx3atggcttc 
gctcaccagattctaggui ICC I ictcctccatgtctggcctaaacttat 

CTTCAAATACAGACCATTGCTGCCAAGGAGACCAGJVACCGATGGTGACAT 

aaggcaacttatcagtttgatccatgaggatagatgctgtagaggtgagg 
tctcagttacatct acttccccctaaaactagggctgcaccaatgtaacc 
ttgatacctgaaaagcatctgcttcagcatccgattggctgtcacaactc 
tgggaagacggccagtggagagggagtggagctccaggttggaagaaatg 

AGCTGGGTTGTCATGTCT 
>647.1 

TGTCAAAATATTAAAGC 1 1 1 1 1 1 1 1 AAATTGGGAACACT CAGGATATTGG 

GATAATTAATTAGGCAATGATTCAAAGATGTTTGGTTTTAAAATTCAAAA 

CCCTCCAAAGG 

>648.1

CXaGGGGAAACCCCCXrTTTTGGGGTTTTTCCCCCACGGG^ 
^ACCCCGGGGACCCCX5GGX5GGGGGG^3GCCCACAACCCCAAGGGTGTTAAA 

Cl 1 1 1 1 CCCGGGGGGAGAAAAAAGGGGGCCCCCCCCCCC 
>649.1

? GG 7^?^ AG 9 AGGGGATG ^ CTTTCACTTCGGGGGCA WTTTTCtXiAGGGA 

GATGAAGAGACACAGGTTGGCCTCTGCTGGGACTCCA^ 

GT 

>650.1

GAGCT CCCCGCG GTGGCG GCCGAGGTACT GAGTGGGGAAGAAGGTAAGAA 
ACAGTTGATTAACACCCT GTGTTCTGGCAGGTGGGATCAGCAATATGTAA 
TCCAACTCACCTCCATGTTCAAGGATGTCCCTCTGACTGCAGAAGAGGTG

GAATTTGTGGTGGAAAAAGCATTGAGCATGTTCTCCAAGATGAATCTTCA 

AG AAAT ACCACCTTTGGTCTATCAGCTTCT GGTTCT CTCCTCCAAGGGAA 

GCAGAAAGAGTGTTTTGGAAGGAATCATAGCCTTCTTCAGTGCACTAGAT 

AAGCAGCACAATGAGGAACAGAGTGGTGACGAGCTATTGGATGTTGTCAC 

TGTGCCATCAGGTGAACTTCGTCATGTGGAAGGCACCATTATTCTACACA 

TTGTGTTTGCCATCAAATTGGACTATGAACTAGGCAGAGAACTCGTGAAA 

CACTTAAAGGTAGGACAGCAAGGAGATTCCAATAATAACTTAAGTCCCTT 

C AGCATTGCTCTTCTTCTGTCT GT AAC 

>651.1

TGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCGTTATGCAGAGGTGTCCA 

GCCCCCTTCCTCTTCCTGGAAATTAACATTGGCTCCACCTTCCAGCAATT 

GCTGGACCAGGTCAACATCTrCGTTTTGAACAGCTTTAATCAGCAAGTGA 

ttgtcttcgactgcagcccttctaccgctggaggacgtgggtccctcctg 
g gggttgttatg atccctgctctccatgacg gtaaatgccacctgct acc 
actttt agc ci 1 1 1 ccttgagaaaatgcaaatttatctcctagcacttaa 
tcaaagaagctttgagtgtaaattgggattctctggcaacagagcagcag 

TATGAAGAAGGAACAATGTTCTCAGTCTTCTGACATTCCACCTGCTCAAC 

tcagacgtctcaattattcctttggcagccgcaaagcctggaagactgct 
tgcagcccgagcagtttcctcct gctgcctccgcgtaccagtgaggaagg 
aaagagcattctcctttagggcagcaatcac 

>652.1 

acgcggggagggccaggtctcagggctcctggagctgcaggcggcgggag 
gggct acawtgcttgactcagtgatgwgaacctttcagagttagctgg 

AAG CCACAG CCCTGCCTCTTGATG CAGCCTG GATCCAG CCGGTGTGAAG A 

ggagaccccttccctcttgtgg^gtttggatcctgtgtttctagcctttg 

CAAAACT CT ACATC AGGGATATCCTGGACATGAAGGAGTCCCGCCAGGTG 

CCAGGTGTAI 1 1 TTGT 

>653.1

ACCTGTG AACTGAGG AATT AT A G AT AAACCTT AG G TCAAATCATTTCGCA 
ATTGCATT GGTGG TATTGAAAAATGATGAGATTTCTCTGACAGAGAGCTT 
TGTCCTAGI 1 1 1 I GTTCTTCAT AGGTCAAAACTGGCAAT ATTCT CTTGT C 
TGCAAGATAAAGTGTTTGTGCTTCTATCACCATATGCATGAACATGTAAG 
AATCAGATACAATTTCT GCTTCATCAGTTTCACATGTTCATGTTGTCACT 
GAAAAAATGGATCTACTGTTTATAGCT CCCAAG GAGACCCC AAA 
>654.1 

ACCGGGGGCGGCGAGGTACCTGTTACCACTITAAAAGTAAGTTCTCC^ 
CCATAAAGCCATTTAAATTCATTAGAAAAATGTCCTrACCTCTTAAAATG 
TGAATTCATCTGTTAAGCTAGGGGTGACACACGTCATTGTGCTATATGTA 
TGTG^CTTCCCT CCCCCTGCCAGAATACTGCTTGGTCAATTGTAGGTATT 
crm 1 GGTTTAA TTTTT GCCAATGTAATTAAAAAATGGTATGTCATTTT 
T AAAATTTGT ATTTCTTT CATTACAAATAAGATTGTTATGTCAGT attgt 

tatt ggci 1 1 i cxsta ttcctcttaac^tgaaccctctgttcattgttttt 
acctgttttcgttttagcaagtagt 

>655.1

GTGGCGGCC6AGGTACGCGGGGGAAGTCGGCCATGGACTGGAAAGAAGTT 
C^C^TCGGCGCCTAGCGAGGCCCAACACCTGTOCAAACAAAAAAAAAAG 
TGAACAAGAATTAAAAGATGAAGAAATGGATTTATTTACAAAAT ATTACH 
CCGMTGGAAAGGAGGTAGAAAAAACACAAATGAATTCTATAAGACCATT 
CCCCGGTTTTATTATAGGCTGCCTGCTGAAGATGAAGTCTTACTACAGAA 

attaagagaggaatcaagagctgtctttct acaaagaaaaagcagagaac 

TGTTAGATAATGAAGAATTACAGAACTTATGGTTTTTGCTGGACAAACAC 
CAGACACCACCTATGATTGGAGAGGAAGCGATGATCAATTACGAAAACTT 
TTTG

>656.1

TGGCGGCCGCCCGGGCTGGTACGCCACAAGGCATTTAATGCCCACAGTAA 
cagggctgtttg acagtggcagaag aggacg ggact aaagttacttt gtg
ctgagaggggg aaagaagcacaaagtttggtct gttgcgt aattgaattt 

TT AACACTCTTATCCACAACAAACACTTTTrCGTGTOCTGCTGTGT aaaa 

gacatgagatatattacagattttcaaacaggtgagcatccttttacgag 

ctgggcaggtggggagtggcgtggttttgatggagtgaggagatttggat 

gaatgaacgctaag>tggccagacgcacctcttggatcgtaactctgcag 

gctgggattccagagctgcaaacaaccactgaattcgatctgtaaacctg 

ttgtcatttgacgtttgcaggcaggcatcaacatttacattgagattcaa 

tagacgctactactacaaaggagctttattgttgcagcttaaaatggttg 

ctgcgggaacactgaagggtgaaactgacttttt 

>657.1 

CTCCCCGGGXGGCGGCGGGTACATTGCAATGAAGAATTTCTTCATTCTGA 

TCTCCT agaagacagcaaat accgaaaaatctact ccttt actcttaag 

>658.1

ttgttatacactttcacaaaagattttatctttgatctcttggcgatctt 
C1 1 CI I gcccatggcagctgtcactttgcgggggtagcggtcaattccag 

CCACCA 

cttcccaggtttc^tgaacttgoccatttcggcagcaaccacccoggggc 

cgataa ccggtcggagcctogaagggggggggccccgggttaccccaagc 
nun 

>659.1 

ggtggcggccgcccgggctggtg<XjCcacaaggcatttaatgcccacagt 
aacagggcrrgtttgacagtggcagaagaggacgggactaaagttactttg 
tgctgagagggggaaagaagcacaaagtttggtct gttgcgtaattg aat 
ttttaacactcttatccacaacaaacacttttrcgtgtcctgctgtgtaa 
aagacatcagatat attacagattttca aacag gtgagcatcctttt acg 

ttgaatgaacgct aagatggccagacxk^acctgttcgatctc^actctgc 

agcctgg gattc ca gagc tgcaaacaaccactgaattcgatctgtaaacc 

tgttgtcatttgacgttttcaggcaggcatgaacatttacattgtaattc 

aatagacgctactactacaaaggagctttattgttccagcttaatatggt 

tgctgcggcaacac 

>660.1 

gtggcggccgcccgggcaggtactatgacctgaagaggcagaggccatca 
ctgttggrrccggtctccacctggggaaactgaggttgcacagtgtctctg 
tggtgacgagcagggcttcatccagtgcctctgtccccaccgagg ggact 
atgggagacatgg agggtgt gtgagcaacaggtgagactggagccagctg 
aaaactgggagaccgacccagccaacaaacaatgtcggtctctgtcttgg 
cacctgcaggaaacaaggtcctacttccagaaaaagtgctcctgggactc 

CAGGATACCAGGCATCTGGGTAAGCTACAATGCTTAACCACTT AACACAA 
TCAGGAAGCAACAGCCATGCATTCGGGAAAGGAACTTCAGTGTTGTGTGG 
CTCAGTCTCCAGACCTAACTTTCCTTTTGGT ACCT C 
>661.1 

GCTCCCCGCGGTGGOGGCGGAGGTACGCGGGAGAGA C I 111! 1 CTCACCA 
TGAATGTCACCCCAGAGGTCAAGAGTCGTGGGATGAAGTTTGCTGAGGAG 
CAGCTGC7 AAAGCATGGATGGACTCAAGGCAAAGG 
>662.1 

TTTTTAA TTTAT 1TAGGGGGAAGGAGGGGTGTCTTTGGATATACCACAGC 

GAGGGCTTTTGGGGGGTATCCTGGGGCATGGGGGTCAGGGGTTGGGGTCT 

TGGGGAGGGTTTTAGGGGCCCGGGTC 

>663.1 

ACCGCGGTGGCGGCCGAGGTACTTGTGGAAGGTAGTGACCAGCACAGCCA 
GCGCCTGCTCCAGAGAACrGCACATCATGGATCTGTGGCAGACCAGGTGG
CAGAGACAGACCCAGGAAGGAGAGCAAGGCCCCCGCGT 
>664.1

GC»GCGCX»GCAGGTACGCGGGGGCGGTATCTGTATCGGGCCTTACTGGC 

ttcaagagccx5aattcccttccaagcacccaccagg GGGAGCCCAATTAA 

GGGTTTGGGACCCACTA 1 1 1 1 1 I AATAACGCCAGCACCTTAAAATGCCTG 

GGAAGATGGTCCTGATCCTTGGAGCCTCAAATATACTTTGGATAATGTTT 

GCAGCTTCTCAAGCTTTTAAAAT(JGAGACCACCCCAGAATCT AGATATCT 

TG CTCAG ATTGGTGACTCCGTCTCATTGACTTGCAGCACCACAGGCTGTG 

AGTCCCCATTTTTCTCTTGGAGAACCCAGATAGATAGTC 

>665.1

GAGCTCCCGCGGTGGC«GCCGAGGCTAACAAGGAAAGCCCCTGGAGCTCC 

tgtaataagaatgtggttggaagatgcaaactgtggatgatcatcacctc 

CATTTTCCTAGGTGTC^TTACAGTGATCATCATAGGCTTATGTCTTGCTG 
CAGTAACTTATGTTGATGAAGATGAAAATGAAATACTTGAATTATCATCA 

aacaaaacattcttcatcatgctgaagattccagaggagtgtgttgctga 
agaggaattgcctcacctgctcaccgaaaggctcacagatgtgt 

gtggcx»gccgoccgggcaggtttaatctc^^gtctccctatagagttctc 
agcctcagcacctaaccctcacacaacactccagtattggatgcagtcaa 
tcttgtataacattttttgaatgtccaatgtgcaaagcacgatgttggaa 
at7atacagaggtgaataagagaaaaactcttgctctcaaagatgtcagt 
ctttttcn i gcaaggataacaca 

>667.1

GCGGCCGAGGTACTGGAGAGTCGGC1TTGACCATGGCCTCAGCTCAGCTC 
CAGGTTTGGAGCGGAAT AAAA GAG GAGCTAGCAAGATGTCTCATCTGAGC 
TT^CCCAGTGCCCAACTTATCTrG^GGCXrrGGGGCTGAAG CCAGCGCTGACG 

>668.1

GTGGCGGCCGCGCGGGCAGGTA C I I I 1 1 1 1 1 1 1 I I I 1 1 1 M rCTGGTCGA 

AAATTTTTGTTGGAATTTTAAAGAAAA 

>668.2

gaaaggcaaagtagcactcagatggcci i iii i i gtaaagtgaagtcaac 
ctaatactctggtg cttaictttgcaaatcttttccataagtcaagtatta 
gtgttaacaatacacttaagaagtaaggataaacccatcaaggtccacag 
ctaaataaccagcagattcccagaaactttatgtatttgggaaaagtaaa 
atatacaacagacatatccctgccctgattaagagggtagataaaaacaa 
aacataaaacaatttt actt gagat agt aat aagtt atttgaaaaaaat a 
caacagaatatagggagagagagcaactacagaaagaagacagaaggggt 
tct gcttt gaat ag t aaggcttgggaat ag ctgaat 

>669.1 

ggcggccgaggtac^gagtaggatgaataaaatctg i g i g l| acagcggc 
agactgaaggacgggtgcctgttttcagccatgaggt agtccctgaccat
ctgagaaccaagcctgaccctgaagtggaagaacaggagaagcaact gac 
gacagatgctgcccxscattggtgcag^tgcagcccaggttggactgagtc 
actgoc r ro ci gccccatcc^catcccatcatgagaagctaggcattacc 

ATTCCTGTCTAGTAGGGATACATAGTTGGTTGCGOCTAAGjTTG C i rCTGG 

cagaacccaaggaataaatttctccatatcgtttcctagttaccctaatc 
tctgcacaaatttgtgtgttacagaagcagatccagagcttgaataaaat 
gtgttcaaaccttct ggagaaaatcagca 

>670.1 

TGGCGOCCGCCCCGGCAGGACATT U 1 1 1 1 II 1 1 1 1 1 1 1 IAAA C I I T 1 A GG 
GTCTTGGCCTATTGCATACT AAAGGGCAAAAGGCTT AGAGATATCAAAGG 
GGCT AATTTTTT ATTGACAGACCATGGCGATGTAAAATTAGCT GACTTT G 
GTGTGGCTGCAAAAATAACAGCT ACCATTGCAAAACGAAAATCTTTCATT 
GGCACCCC7TACTGGATGGCCCCAGAAGTTGCAGCAGTAGAGAAGAATGG 
YGGCTACAACCAACTCTGTGATATTTnSGGCAGTAGGAATAA^ 
AACTTGGAGAACTTCAGCCACCTATGTTTGGATCTCCACCCAATGAGGGC 
TCI CI I CI I aatgtcaaaaagtaattttcagcctccaaaactaaaggaca
aaacaaaatgggtcatcaacattccat aatttttgtcaaaataagcact a 
atcaaaaaaaaaaaaaaa 
>671.1

gagctaccgcggtggcggccgaggtacgcggggtcttctcatgctc 

atgcatgaggctctgca(jaaccactacacgcagaagagcctctccctgtc 

tocgggtaaatgagtgcga 

tcgagctcaccgcggtggccgccgaggtactcttctgcactgttctitct 
ttctaataaaac i rTcrm icgaacctatactgtcttctgtaaattctt 
cttact accct atgacccgtgagccaaccactttocgatgccagggttct 

GACACCTCACCTGGCATAATATAAAGTG I I I I I II 1 1 lATACCCTTCCAC 

TTGGAAAGACTACAGAGGAATCTTGCACTGCATAGTTCAAACTAAAAAGA 

GAAGAGTTAATTACCTGAAAAGCAAGAGAAAACAAGAAGGGGTAAATTTT 

GAACCAAGGGAAATCATTTAAGAAGTGTCrrGGTATTTTTCAAATTTCTGT 

CAGTTG TTACA TTTGTCATAAGTAAATGTTTAGGAATAAAGGATGGAGAC 

ATGCTTATTTTATTTAACTCCCCC 

>673.1

GGATTGAGCTCCCCGCXJGTGGvCG^5CCGTCCCTCTT AATCATGGCCTCAGT 

tcxx»aaaaccaacaaaatacgaaccgc<^tcctattccattattcctagc 
tgcggtatccaggcggctcgggcct gctttgaacactct aattttttcaa 
agt aaacccttcgg<sccccccgggacactcagct ccgogtacct 

>674.1

GGGGCGG^X3AGACTGAGCCACACAGTGTCCGG>.TGGAAGTCrrG<> 
GGTTGCTC AGTGT CCCGGTCATTCATTTACAC^TrTTAACTTG CATT AAA 
GAGCTGTTCTTTTCTGTGGCCTAGACT CTTTTGACT GATCTCAAAAT 
>674.2

ctaaaattac atgga gttactgn^tattcttttitxsccttrtgcagcaac 
ttaca cagcatttttaacac c i 1 1 1 1 1 1 ictagi 1 1 1 1 1 igttcggtttt 
gttt1 ccatcaggaatttgagttctctct aacccagcttactgtgggaca 
taggaaaactcagtagaaatacctttggtgatcttgttgagtttaagtct 
6atcttgatcttaaactcagtaagccactatctgcaattttgt 

>675.1

AGCTCAC<X»CGGTGGCGGCCC<AGGT ACGCGGGGCTGTAGTGGCTTCGTCT 

TCGffl 1 1 1 fCTCI % OC1 iwjCTAAOGCCTCCOGsQCTCTCQTCAQ 

>676.1

^CTCACCGCGGTGGCGGCCGCGAGAGCACAGATGACCACGCCATCGTCC 
AGTATGAGTGGGCACTGCTGCAGGGGGACCCGTCAGT ggacatgaaggta 
aogcatgttgtc*ctgctggcagctaggtctgct 
gtgagggagggafa^ccagcatgcggtgct 

>677.1 

' ACTATOTAGCGTGTTTCTTAGGTAAAG IOIC I 1 1 1 1 GCTACTGAAAGGGA 
AATGGTCTC1 AAACACTGGTCACTGTAGCAGGTAAACACTACTCTAACGT 
GGAGAAATGAGCTTCATGCTGAGGTAGTGGTTGCCTTAAAGCTGTTTTT^ 
AT GCTGT AAAAACCAAAA TGGGTTTGGT 
>678.1

ctccccgcggtggcggccgaggt ACTTGT GGGAQACG TCGATGATOGAGT 
TCAAGGCTGT CTOCAGCT CGGCCAACATGATGCCCACGGACTCGCCCCAC 
CAGG1 C 1 1 GTGAAAGACAGCTGACACGAGACATCCCCGCGT 

ATCCCCTCTTCTCATCACTGCTGCTCCAACT GGATTTATT ACTCCGGGAA

i*. 1 1 «XXXrrCCCAATTGATGGTTCAAACACT GCCACCGCT GACTGCCCTG 
CATCTGTGGGTCTGTAGAACAGAAAGGCAGAACAACTT ATTTTTTAGGAT
TTAACGACAACCGGTTGAAAAAAACGGTAGGGTGTCATGCTCACAGAGAA 
TAAAGATTTGTAGAAAAGGTGCTGAACTGCCAAGGAAGGCATTTCTTGTG 
CCGTGTCTGGAACCGTG T ATCCTT ACT ACATCACT GAACGACACCAAGCA 
CCCCATGCACT 

>680.1

AGCT«XX»CGGTGGCGGCCGGGTACAAGGGGAGGTAAG^TGGGAGCTCC 

ACTCCTTGGACCACCAGCTGGTTCTGGACCGTAT(XXXATGAATCTGTTT 

GAACGTAAGGAGGAAGTCAAAAAAGTTCTT ATTTAGGG I Mill I GAGAT 

GTGGGGCX}ACTTC<^TTCCCACCCGGCACAGGTAGGCACGGGCATACACC 

GACACT AGTG GGTCTCCGATCCCT CTGATCATGCATGTCAACOGGGGCAG 

GCACTCT GAAATTCtXGTTTTGGAGAGGJWTTTGTTACATTTCAGGATGG 

ATGCCT CCACGT AAAATCTTGGAATGAGTTCCCT GATGGAGGGAATCTTG 

AAAAACCAATTTAGGCATGTTTCCTTGGCCGTGTCATTTGCATTCTCTGG 

AGAAAAGTGATCTGGT AAGACGCTGCGGCT ATCCACACAGATGGAAAAGA 

TGCGCTCGTACCTGCCCGGGCG 

>681.1

TGGCGGCCGAGGTACCCT AATGT AGT AGT AAATTT AAGGCCTGTCGAGGA 
AATTTT AAGACTT CCAACAGGTG ACT AT A TCAGGAAGGAGAAAACCAAGT 
GCTTCCT GCTT CACCTTCTGCTGCTTTTGGGACTTTTTATGAGCT AGTT A 
GCT AAGGACAAG ACCC TGAACCCAI 1 1 1 1 1 cactgggagaggaaaaccac 
caggcttctcagctttggcttggcaactctggagttcctatggcttgcat 
cagggctccaggaccctgataagtggcctcaggcaggaggagatcgggag 
cggatgggagag^agtcaggaaggtggaatagggaccatccccaaacac 
gttgg cgta tgatgatttgaggaactggacgtagttctgcatgctgcggt 
tggagcttt cxsgactgctccaggcg^tctttcaggtcttgcaaccggctt 
tggtagcgg 

>682.1 

ttg^gctcaccgcggtggcggoccaggtactctcgtttcagctg 

TATGGCCAACCGCTCGGCTTGCGCCCGCCGGGTTTCC<XaAGATATGTTGT 

at tcgg ctgggtcgagggtctcaggcagagtgcgcaggctcgacggctta 

tactttgggaacgacatcttggcgaaccagggcacaattgcgcct 

attctgaggccctttgtctccccgcgt 



gcggtg^cggcxxkxxxsggccgjstac^cgggatggcacatgcagcggaag 
t agggtctacaaggacgct acttcccct atcatagaagagcttatcacct 
ttcatgatcac 



gccctcataatcattttccttatctgcttcct agtcctgt atggcctttt 
cct aacact<^caacaaaactaactaatactaacatctcag>ogctcagg 
aaatagaaacccgt 



tggactatoctggcc«5ccttatoctaggc<xtaatgggcc^ 

AC 

>684.1

tggctct acaatcct cagcatgttaatcgaagcc 1 1 G 1 1 GAGCTTCACAA 
aggttccattgaagatttgac 

_ _ jtcactccattg 
atacctctgattctgatgacaaacgccaatttgggttctgcaggtacgag 
GACAI 1 1 1 gccccgccgcxigttggggtctcctttacccatgttgacaga 
tccgcgtccacocgagggtattggagggtatt ct t gcctggtgcgagctt 
ttcctcagagtcccgcagagcgggcgctctagaactag .... 

ctttatt aaaaaaggccct aaggggcc t c i rattaaaaaaggtataaaaa * 
ccccataaattc«gggcccctgggctgggcagg^ 
GGGAGGAAGGGGGGGGATGGGGGGTGAACCGGGGACTGGGGAAGAGGACC
AGGGGGCACATTGTTCCTCGGGTTTGGGTTCAAAGATGGAGCGGGGGGGG 

GGCGACAGTTTTTTCCCAAAAT7CTGGGGGAAGGGG 
>687.1

^?f^^^^ GG G C1 ^ TA ^TGGCAACAAGTATGGCGGCTGCTAGTGG 
I^^I^^^ T( ^ MG AGT ATCGMGAG CGG/ ^G AAC AG ACCCG GA 
JI?^^ G ^^ TC T^ GCGCCAGGCTA AAGCCAATTTTGAAAAAGAA 
G *^ GGGG ^ AAG ^ G ^^GCGACTTCGGGGTGAGGATACATGGATGCT 
2 T ^ TGAGAGM ^ GAACAG ^ CTCACAGGAA <^"^GTGA 

agaaaaagaagaaaaaagacaagcttctatttaa 

>688.1 

^^ g Z?P/^ g ^^ ggtacactcgccagcggt ^ gg cacaagagtat 

o C 5™$^ G ^^ CAGGCTCA ^ ATAATGTGAGGG <3GCCACCCTCCT 

TCCCGACTGGCTAGCACTTAGAACTrnTAAAAGAGGCAAA 
>689.1 

ACAAACTGGGCACTGGATAGGTAGTTCCTTTGGTGGTCAAGGTGGCTCTA 

CCTGTCCTTGAGCTCTCGTGTCACTCGCTTGGTGATCCGTCCACACATCA 

GG , CGA ^ C A GG ^ CA ^^ 

T AG J TC ™ AGA J GGAGA ^ 

*I9^9J?^^^^ GGAGG ^ AGAA ^ Gag ^ gaagggaa TGTGATTCTCTGAGAG 
^^^ CTGGTGGCAGTGCT AGGTCTAACTTATCCCTCTCAGTTCCT 
AGTTTAATTTATGTC 1 1 1 ICI I 1 1 GGAGAGGGAGGGCAGGAGATAAGAAA 
A £ TC £ AGAG J G A G ^ 

>690.1

C ^™I G ^ CGGCCQAG ^ GAmC ^ GCAGTG CT GAGGG ATGG^ 

IS^^P^Ph'JJ 'IpGATATTCATTGCTCCATCGATTGGATCCAGTCC 

TJCS3 G » AG -^^^^^^^^ AAGGGAG ^ AAGGG ^ GGG TGAAAGCCTrGAA 

TCCTTGCTAAATATTCCAGTTGTTTTGAAGGTTGT 

>691.1

GCTCACCGCGGTGGCGGCCGAG GTACTACAGGAAGAAACTAGAGGAAACG 
GG ^JTI9 ATCCATQTCCTQTGTAmGG TGGCAACAGGTCAGAA<»GCC 

^^I^TZ AT I2 CCTGC ^ GCTGCCTAGGGTGCT 

^^^^^^^^^GTTAJGACACAGCAACCAGTTACrCAG 

AGTCTGATGAGAAAAACAGATTTrAGGTTTGGGAAATGGGATTACTGTAA 

TTTAC/^ATOCA/^TGCAAACTGGAGCXCT 

AGAACTTGATGCTAACCCACAGGT 

>692.1 

^5 CA ^I GG I G ^ TCCTTTCACGAA I ATA ^ 

TGT Cr^GGAAAGGACTAGAAGAATTTTTrGATGAGCCAAAAAACTGGGGG 
CAAGAAAAAGTAAAATCTGGAGCAGCATGGACCTGTCAGCAACTAAGGAA 

St^ G Z^ TG ^ TTTACACAAA ^ 

GAAAG ^y G ^]3 G y AAGGG ^* GAGG AGGAGGCCAAGOGGCAGAGATTGCCA 

ATGGCAAG J«^!^ 

AGATAAAGGTTGTCCAGGAAAGAGAAGATGCCCTAAG 



ACCTC^GGGACATTTAAGAGTTGGAGGTGCAAATATATTCCAAAAGGGTG 
GAAGAG >£ A C^GTATCC©Cra^ 

TTCGi . 1 1 1 CTTGATCATAGCTATTTTGTGCTTGAGCTTTATTGGCTA 

. CCGCGGTGGCGG^OGGGTAC^AAACAGGGGCTAGAGCTTCXrrACTAGGTC 
^GTTGGGGCTCCCCTTTCCT 

CTTGGGGGTTTTG7TGTTGTTTTGTTGGCATTTTTTCTGTGCCTGTTGGT 
GATTCCAGCACAGACCAGAGTGCCGCGT
>695.1 

GCGGGGGGGCGCCGGC GGTCT GTATAATGGAGGCTGACCAGAGCAGTTTA 
GGAGATTGTAAAGGGAGGTTTTGTGAAGTTCTAAAAGGTTCT AGTTTGAA 
GGTCGGCCTTGTAGATTAAAACG AAGGTTACCT AAATAGAATCT AAGTGG 

catttaaaacagtaaagttgtagagaatagtttgaaaaaaaaaaaaaaaa 

AAAAAAAAAAGT 

>696.1 

G TGGCGGGCG GCCGAGGTACAGCAGGGTGCCTCAGCAAG AGAGGACTGAG 
TGGATTTTCCTTAG GG ATATTT ATG AACCTTAAAGCAGGAGCTTAAAGGG 
AATTTGGGCCATATTAACCACTTAGGTCATGATAAATGATTACATTTTTG 
GACATTTTGGTGTCTTAATGTCAGCAAGGGTTGCACGATAAGTrTTGACA 
TGCATGCATGGGAGACATGTAGAAATTC7AGTTAC7TACAAGTTTTTGGG 
GAAGAAGCCTGG ACCCAGATG CCAGCTTT AAATAACAG GG GAGTCT AATT 
ACTTCTAAATTCCTCACATAGG GAGTTTTG 
>697.1

GGCGGCCGCCCGGCAGGACGCGGAGAGACAGCGTCAGGCGCTTGATTTCC 
CTGAGTCCCGGTGCCTCAGCTGCCCAGTGCCCAGGTTCGTAAGAAGGCAA 
CAAGTTCTTCTCCTCTACAGAAGGATTTTGCAAACAATTCGGCAAGTTCC 
AAATGATTCTGATCGCAAATACCTGGAAGATTGGGCAAGAGAAGAATTCA 
GAAGAAACAAACG TG CCACCG AAGAGGAT ACAATCCGGATGATGATT ACT 
CAAGGCAATATGCAGCTCATG GAGTTAGAAAAAACACTTGCTTTAGCAAA 
ATCTT AACTATAGCATTATTCTGAAGGATTTTCAAAGTCTCCATGTGTCT 
TTGCTGCATTTAGGATTAACAATG 

CGCGGTGGCGGCCGAGGTACACGGCCCGCTCGTAGGGATCGTGTTTGTTC 
CTGACGACCCTACGGTAATGCAGCCGGAGCTTGTTTTCCGTAGCTGGGGA 
CAATCTTCT GTCCTTGCTG 



AG ^ CA Z G ^ TT II C ^ TTTATAACATGGCAGAGTG ^ 

ACTGAGTGATGCTTGTCAATGGCCACTATCAGGAATTTAAAACAAGATTT
GGAATTATGACATCTGG ACAAACCACATATGCAAAACCT ACCTTTTGGTC 
CATTCTCCAGGGCTT CTTCTGCAGCTTCT GGTTCCAGTTCTTTTTCGGAG 
CT GTCAGT GTGTG ! 1 IT 
>699.1

GGCGGTGGCGGCCGAGGTAL. I N II I I 1 1 1 M I ( 1 1 1 1 1 GTTAGTGTTTT 
CTGATGTCTTTT CTAACAAATCTTTGCCTGCCCAAAAGTCTCAAAAACAT 
TCTCACGTTTCTA 



AGTCCATGTTGCCCAAACTGGTCTG^SAACCACCACACOCAGCTAATTT^ 
G1rt5AATTGCGGGTACCAGCACACCGGCGCCGTCCTGGACTGCGCCTTCTA 

cgatccaacgcatgcctggagtggaggactagatcatcaattgX(m 

atgatttgaacactgatcaagaaaatcttgttgggacccatgatgcccct 

atcagatgtgttgaatactgtccagaagtgaatatgatggtcactggaag 

ttgggatcagacagttaaactgtgggatcccagaactccttgtaatgctg 

ggaccttctctcagcctgaaaaggtatataccctctcagtgtctgg 

>700.1 

ctgacctccgcaccggaagcccccgcgtacct 

>701.1
>701.2

c £ ct t gg ap'cctgggagc^^ 

tctcatgtctgcagcaatagcaatgagttacttcttaatcttaataatgg 
TCAACTTTTGCCACTACAACTTCAGGCCCCACTrAATrCATGGATTOCAC

ctttctctggaattttacaacagcagcagcaggctcaaattccaggactc 
tcccagttctcntatcagctctagaccagtttgct g gactgctcocaaa 
tcaagatacccttaacaggagaggccagttttgcccaaggagcccaggca 

GGCCAAGGTTGATCCCTTACAGCTTCAAACACCGGCTT 
>701.3 

aaacacaaccaggccccagtcacggggatgccctatgtattctccttcaa 
aatgcctt aagagcaagggccagatggtttcaatacctat 

>701.4 

caggtttacatgggcccgcggtggcggccgcccgggcaggtactgcaagc 
aac agttactg cgacgtg ag atcatcaagaacacgtag ag aaaccc agct 
gtaatcatgcatggagatacacctacattgcatgaatatatgttagattt 
gcaaccagagacaactgatctctactgttatgagcaattaaatgacagct 

>701.5 

aggaggaggatgaaatagatggtccagctggacaagcagaaccggacaga 

GGGGA 7^ AGAA TATTGTAACCTTTTGTTGC^AGTGTGACTCT ACGCTTCG 

GTTGTGCGTACCT 

>702.1 

AAAAGGATGCGCCTCCACTTAGCAAGGCTGGGCAGGATGTGGTTCTGCAT 

CTGCCCACAGACGGGGTGGTTCTAGA 

>703.1

G GCGGCGCCG GGCAGGTCAGACCTGAACG CCCAAACACTTCCTGCAGATG 

TTGTCGTTGGAAAACTGTCGTCTT ACAGAAGCCAGTTGCAAGGACCTTGC 

TGCTGTCTTGGTTGTCAGCAAGAAGCTGACACACCTGTGCTT^ 

GAACTCTCATTTGGGGATACAGGGGTGAAGTTTCTGTGTGAGGGCTTGAG 

JTACCCTGATTGTAAACTGCAGACCTTGGTGTTACAGCAATGCAGCATAA 

CCAAGCTTGGCTGTAGATATCTCTCAGAGGCGCTCCAAGAAGCCTGCAGC 

CTCACAAACCTGGACTTGAGTATCAACCAGATAG CTCGTG GATTGTGGAT 

TCTCTGTCAGGCATTAGAGAATCCAAACTGTAACCTAAAACaCCTACGGT 

TGjKAGACCTATGAAACTAATTTGGAAATCAAGAAGCTGTTGGAGGAAGTG 

>704.1 

GTGGCG GTCTGCCCAGATCCATGATGTGCAGTTCTCTGGAGCAGGCGCTG 

GCTGTGCTGGTCACTACCTrCCACAAGTACACGGGTCTATTTGGCCGTGA 

CCTTGCTCT GGAGACGATGATATCCCTTCAGCCTGAGGGAATTGATGTTG 

ATGAACCCGGAGGCATCAGTTGGCtCATAATCACCCTGCACGTTCATGCT 

CAC<>GCTCCTCATTGTTCAGAGACAGTGGGGACT 

>705.1

CCGCGGTGGCGGCCGAGGTCCGACGCAG caggctccgaagatcatacaga 
CGCGATTACCACTCTTGGCTCCCAGAAACCT CT GCGCCCCGCGTACCTGC 
COG 
>706.1 

ACGAGT AAATTTTCATTAtXnTTAATTAGGCAATGTTTCTTAGATAACCA 
^ AAAAG ^ GGAAA * GG ^TJ~'^AAAAATGATAAATAGGACTT catcaaaa 

agt aaacgcttcaaaagatactactgagaaagtc*cagaataggagaaaa 

ATCTGATG 
>707.1 

ACC ^I ATCCAAGGC ™^ GCAAC1 ^ 

AGAGTCCAGAATCACrrGGTGAGCATrCCAGTAGGACGGTGGCATTTTAG 

GTCTGGATCGCCTGTGGTAGGGGTGTAATCCCAGGAAGGCATTAAATATA 
TTTGA ATTAAT GT ATATTTTGAGAATAAAAGGCTATTTCTAGAAAATATT 
ACACACTTGTCTTATGTTAAATAAAAATTTGCTATT^ . 
TACCCACCUI ICI I CCCAATGAAGATCTT ATGCAT ACCTTCACTGGAAGG 
TTTAAGATGTGACAATCTTAATAGATCTTTGTGAGACCAGCCATTTCTCT 
G1 ^ A TATTTTGGAA(X^CCAGAGCAAGGGCCATGCCACCTTTCTCATTG

>708.1

ACATCCTTTTG CATGCTCAAG AGCCCATTCTTTT CATCATTCGG AAGCAA 
CAGCGGCAGTCCCCTGCCCAAGTTATCCCACTAG CTGATTGCTATATCAT 
TGCTGG AGTGATCTATCAGG CACCAG ACTTGGGATCAGTTATAAACTCT A 
GAGTGGTAAGTGTCTTCACATTCTTTAAGCACT AAAGAAAACTTTTAATT 
AGCT ACCTTGCTTCCAGTAATCAAACTAGAGCTCCTCTGCCTTGTGTAAG 
TTGCTATAAAGTATTGACTATTAGAATGTCTTGAACTTTGGTTACTGTGA 
GCCAAGTCGGTGCTCAAAGTATATTTCATAGTCTCAATTATATAGTAATT 
TAGG TTCTG AAAAATAGGTTCTGTCTTTGCATATGTAATATTTTGTGAGT 
ATTTACJTTGGAAAGTTTGGTCGACCTAATGATAAATTTAGAGTTTATTT 
TCCTTTT ACAAGCTTACTGCATTG CATGGTATTCAGTC AGCTTTT G ATGA 
AGCTATGTCATACTGGTCGATATCAT 
>709.1 

ACAAGCATGGTCCATACCACTGTTTACTTTTCTAG AAAGTTGTT AGACTA 
Ai i i 1 1 CAACAAAAATTCTTTATTGTCTTGGTAACAAAAGAAGCATACT A 
AAAATTCTCAATAAGGCACAGTGTCTCTAGAAGCTTGAGCATTCAACATA 
AACTTCTAATT AACACGAA CTTGTG CT CTTATTTCA GCCATTG CT GT GTG 
GGCTrGGAGCCAGGAGAAGATGCAGAGGAATTTTACAATGAATTACTTCC 
ATCAGCTG CAGAAAATTTTCTAGTTTTGGGGAGACAATTACAAACATGTT 
TTA 
>710.1 

ACGCGGGCTAATCCC^STTATGAGGGCTCTGCCCATGACCTCATC^VCTTC 
CCAGAGGCCTT ACCATCT AAT ACCAATACATTG G G TTT AGAATTTCAGCA 
TGAGAATTTGGGGGAGACAGTCAGACTGTAGCGATGATTCTGGAGTATTC 
ATCATTTAA GAGAC ACTTAAAAATGATCAGAAAGGAGAGGATGAAGGCTA 
G AACTAAG ACTTTAGCGTTGAACATG GAAAGGAAGTGATGACTGCAGAT A 
TCTCCAGT 

>711.1

AO M I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 GATAGCCATATACCAAATAA 
ATGTTCTGTGACT AGGGGTTATGGCACAATGGGTATTGAGACACTAAAAA 
CTCTGCTTCAGGCTTCCATCCTCTT AATTTTAGAATATCTCTGATTTCCT 
AATTTTCTGATTGACATCTmGGTAGATTATCtSTGTTTTTACTTTATGT 
TATTG ACTGATCCTTTAGAATGA I I H C 1 1 H I U TTCTGGGAAAAAAAAT 
GCATTCT AAATCAGATTCACTAATACTTTGATTGACTTCCAAGGA 
>712.1 

ACTTACAAAAAI I I 1 TAACATTAGGAG GTAATTATAAGTAGATTCTGTGA 
TTAGGACTTCATTC^TGTATCTTTTGCTACATAAACCTTTGTTAGATTAA 
ATGGAAGACACCTGCTAGGTGATACTTTTTATAAAAC^TATGAGTAAGTC 
AT ATATCTTTGTTAAATTTCT GTATGTTCTTTTTTGT ATAAAGATGGAGA 
GAAAGGATGGAGTGATACT AAGGAOCCTAATAACATCTCTGTTCAAATTA 
ATTACT AAGTGATAGAAGTATTCATATGCCATTAAAGATTTGCCAATTCT 
AT7TG 
>713.1 

actgacacaaggactccaggcx^cacatat^ 
tgtttgaaaaaaagatcgtttgtatttgatagagcaaaagaaggccacaa 
aatgaattgtcttctrtgtgggctgtgtttcagaacggcoggtttgtgggc 
gatgctgacctrgaaagacagaaattttcagatttgaaactcaaccgacc 
ccaggtaattctttggctcaagacctg g gttg CTT cattcatattttctt 

ATTTCCCCAGCCT ATAAGAGCATATTTGTGTCTT GTAAGGTGCCTGG 
>714.1 

agatatgcactatttagaatatgacattaatcaaccactagaattaaaat 

CAGGTTATAAATCCTCAAAATCACCXGAGTATAA^ 

CAGACCACAGAACAAAAACAGAAATACCAAAAAATAATCACAAAATATTA 

aaaacagtatataaacacagtgacagaattaggactaaacatatctgtaa 
AACAATAAATGTAAGGGTAATCTCACCAATTATGAAAAAGACCTTCAGAT

CATATTTTAAAACAAATTTAAAAACTCAACTGTATGTTT 

>715.1

ATTGGTGTAAAATTTGAAAACCATGAAAAAATAAAACAATAAAGGATCTA 

GATGCTAATAATGTGGTTAGTTAACATGTTGACCATTTCAAAGCAAAATA 

AGTCTTTGATGTTTTATACTATTCATAGCAAGATATAAGT AT7TAATCT G 

CAAAGACGTGGATTrGAAAATTCAGCTGCCAAATGTAAAGAACAGATTCC 

TAGATTATTATTAATAATATCTCTATAAATA7TATATTTATCAATAATGG 

GT 

>716.1

TCTAAGTGAAAACCTGG AT A I HI Ml I C 1 CCAAAGTTATTTCTTAGTTC 

TACCTATGACATGAGGGTGATCTTTATAA 1 1 1 1 1 1 1 1 1 GTTTTCACTGAA 

GAAATAAAACATTGCTTAAGGGAGAGTTGGGGGAGTGCATAAGGATCTGC 

AGTTGGGACTGGATTTTTCGGGTTTGTTTTACC 

>717.1 

ACTAATCTAAATGCTAGACAGTTCAAGTGTAGCTTTGGAGACTTACAGAT 
AGCCAGCTAGAGAACTACCAATGATGATATCCATCACGAGGAGTTTGGTG 
GCCAGCCTCCAAGATGGTCCTCAATGATCTTTGCATCTT CATATTTCCAC 
CCTGTGT AGTCCCCTCTCTCAGGGGATTAGGGTTGGTCTGT ATGATCACC 
AGATGGCTGCAGTAATGGTATGTCACTTCTGAACTTAGGTTATAAAAGAC 
TATGAGTCTCATCTrGGGTGTCCACTCTCTGTCTCTCTGATCTTACACT C 
TAGTGGAAGCTGCCATATTGTGAACCTCATGGAAGGCCCACAGGGTGAAA 
AACTGAAGCATCTAATCAACAGTTAGCAAGAAACTGAGGCCTGCCAACAA 



>718.1

GTTTOGGGTATTTGGGGCGGGATAAACATGGOGAOGTCTCTGCATGAGGG 

ACCCACGAACCAGCTGGATCTGCTCATCOGGGCCGG 

>718.2

TCTTTT ATTTTCT ACTTCT CTT CAGATTTGT CTT ATGCATTTTCCAACT A 
TGTATGGATCACAGCTATTCTTTTTCTGAGTTATAGCT ACAGTTTTCCT A 
CTGTTGTCrrCATGCCATTTCATTTCACATGGT 
>719.1 

AACCCAAAACATTAAGAAAAATATAGGAACAGTAAGTAGATTACATTTTG 

TAAACAGACAAGCrrACAAGTTTTCTCAAATATGAAAGTCATACTAAACT 

GGGAGACTGTTAACTTCTTGATGGGGTTAATCTCTAATATGAAGCCACAG 

TCATAQCTAACT ACAAATTACATAT ACAATGGCAAAAATATTCAAAAAT A 

ACATTTTTTGCACCTTAATGAT 

>720.1 

ACTTGAAGAACATG GT AAAAAT ATGTTCACAAT AATATTTTATCTT AGAA 

ATGTATTCAGTAAAAAATCTCTTTA 

>720.2

ttcaact atcctcttgattcaggggaaaaaaggatt agcatgggagataa 
cagaataggaag tttag qaqataatgagacttctgttttagtaaagtaaa 

T AAGCTTT AATAGTTTTTTGGTCATGT ATTCAGTTTACCAGOCTTGAAGA 

TATTTGTAGGAAATTTTAAAAGTTrcTCTATTTCATOOCCCATGATAAAA 

ATTATATAQAATAAAAGCTGAATTGAA C 1 1 1 C I ICACAGCACACTGAAAA 

ATATCTTCTATAGCATTAATCAGATCACA 

>721.1 

CATAATTGAOCCrrTGAGCGGCCGCCCGGGCATGTAGGGGQGGTTAACTAT 
GTTTTCn rAACAOAAAGTTCTGTI 1 1 rGTGATOCI 1 1 IAAAAATAAAGC 
TTCAGGGAAGGATGAGAATAGTATTTTTGAACTTTAAATTTCTCATTACC 
AGAAGACCATGTGGTAATTCTCTGTATACAGTTAGAACAGCAOSGAAACT 
TGAAGGCCTAAAAAA TTAGCTG ACCrrGTTAAAAATGTTGGCGTGAGCAG 
TATATTATTACCTATCTTTTTTTATTGTGTG ?JU fGTGTGTGTGTGTTTTA 
A ACT AA TTGGCTGAAATATCTGCC I ii 1 1 > COCTCtTTAC A 1 1 I I I C I I U I 
TTCTTTCCTrAmATCTTTGTCCATCTTGUVGATCTACTGT 
TTTTT AATGAAAACAAGTCCAAGTTTT ACTCT CAGTGGGTTTGGG ACATC

agatgtaattgagaggccaacagggtaagtcttcatgtc 

>722.1 

acatgaacctattaataaaccattcatgcttcccagtttggcagatgtga 

gcaaact atgtatagg aattccaaaggtaactttttccttt cattacttt 

acagaaatactgtcaagtccaatagagagcacagacttgggaggcggatt 

gggtgggtttgaatct ctgc tctgccacttttattaatcatgtgagttga 

gtatgtgactt aatctcttttagctcaatttccccatctgtaaaatagga 

ataataaaaatactgacttcagagaggtttgtgaggatcaattagacagt 

catgttaagtctgtaaattgtttctgtaatgggcaagatagcaaatattt 

tagattttgtggaccatgcagtctttatcataactgcttaactgccatta 

tagtgagaaagcag ccaca gacaatatgtaaatgaaaaagtgtgtctctg 

ttccaataaaactttattttcaaaaaccagctggcttgtcacatctggcc 

TAT 

>723.1 

actta ctttgt tgctct i it tctaagttttaaagatggatgccaatctca 

ggui rem iCGTGTGTGTATGTGCGTATGTCCATAAATTCTCTTCTAAT 
TACAGTG TAAGCCA CATCCCACAAGTTTTGATAGTCACAGAACTGTATCG 
TCACACT ATTT 1 1 1 AATTTCAGTAAGTTCTTCACTGATCCCTGTGTAATT 
TAGAAATGTTrCATAATTTCCCTACATTGGAGGGGAAGATAGTTTTGTTT 
>724.1 

ACTCCTCAGCTT GTGCTGCCCTTCTCGAATGACTCGCGTTTCCTGCTTTC 
ATCACTACACCTCCCAGCGCTCrrCCATCACCTGCTCTGCTCTT ATAAGGA 
TCCAGAGAAATGGAATAATCTTATTGCTGATCTATGTAAACAAGTTGAAG 
AATCGTCTGAAAGAAAATACAGTGTGTCTAAACTGGAAAAGTCCTGTAAT 
AGTTTGTTCATGAGCATTTGCACAGTGGAGTTACTGTTCATCATGGGGGT 
>725.1 

ACT AACT ATTCCTAAATATTAAACACTGGTCAACT AAAATGCACAAATTC 

ATGAAT TGGA TTTGCACTCAAACAAAAAAAAATACCATAGGCAGTATCAT 

TTCT ACCTTTGTAAGAGGCAGGAATATTCATTAGACTCTATGCTTGACTT 

TTCATATGTATTTTAACACTGTAGTAGGCT ATCGGGTCT AGTTTAAGCTT 

CATTTCTAAACTACTCAACAGGTCAGAACTGACAAAGATCACAAGAAATC 

AACT ATT AACCTCTT GCCTGAAGACACAAATGAAATATTCCCT ATTTT AC 

AAAGCAAATTAGATTCXIAAGATTTTCCAAAGCCATACTCXrrGCAGTTCAC 

TTGGGTTCAAACTTAAAATCATAATAGTAATATACACATATTTACATTAT 

AACCCATTACACA1TATTTTCAACTCAATGCAAGT 

>726.1 

ACTCACTT AAATAAAT AATTGGTAAGATGATTTT ATCTGACAATT AAAAA 

AAGGTATATGTGAAAAACCTTAAAAAAAATCTATTTCATTACATGTTGAA 

ATGTTCTGTGCTTAATCCAATACATCATTTAAATTCTTTTCACATTT 

CAACAGAAAAACrrGAAATCTATGGATTCCAAGCTGCAAAGTATTTTATCT 

AAAT 

>726.2

GCAAATCAAAAAACATCTATAACAT C iTGTI GGGGATACAAAGTTCTCCT 

GGCTGATTCTCATGCTACAGAAAG 

>727.1 

ACATTCT ATTGTT ATCT CTATTTTTTGGATGAAAAAAjCAGCAGCACAAAG 

aaqttcagtaactggcctaaggccacagagc rtgtcti cctqaagactgg
acccaaaoccaggcagtcatagaacatgctggtcgctatrgggccgcttg 
ctctatgggggacggtgctccaggaacacagcaatgcggtttaggattcc 
aggacctggggcagctgctgcml1 1 ici lagttctcgacagaccactga 
gtgcagtttrrctaaatcttttccccactttgatatgt ggtccat aaaac 
tgcttccacacgtataacogactgtgaagtttaaaatgatttcatgtttg
ggcaaattcctactgaatgttaagct agataggaaacaagttctgact aa 

CACAAATG 
>728.1
TAAAAGAATCTTCCAAAG GAGG AC AGCAG aaatgaaaataaagtaagttc

AAACT AGAATCCTTGACACAACTGGTTTTATTCCCAATGCCTCTTAAAAA 

GAATCGTTCCATGGGTGGCAGGAGGGGTGTTTr<>TGGTGTGATGCACCG 

TGACTTGTTATTCAAGATGTAGT 

>729.1 

ACTT ATCAGGATGAAATCAGAATCACAGTTGGCCTTTTGCCATAAGGGAA 
GGGTATTTGGAGAAGAGTCAACCACCACTCATGCCTCTCCCCTGCCCAGC 
AGGACCTT GGATTTTCCT GGCTTT ATGCCT CCT GTTTCCCCTGGCTGAGT 
AACTGCAGGCATTAGGTTCCTCT ACACACGATATATTACAGGGAAATGGC 
AGOGATGGTCTGGAAQGGCAACACTGGC L. I I C I I I CCTCCTGAGCACTAA 
AATCCTAAACATGCAACTTAAAAAAAAATTCT AAATGTGAACACCACCTT 
TCAATAATTTATA 
>730.1 

ACTTCACTT AAAT AAAT AATTGGTAAGATGATTTT ATCT GACAATT AAAAA 
AAGGTATATGTGAAAAACCTTAAAAAAAATCTATTTCATTACATGTTGAA 
ATGTTC I <i i GCTTAATCCAATACATCATTTAAATTCTTTTCACATTTGGA 
CAACAGAAAAACTGAAAT CT ATG GATTCCAAGCTG CAAAGT ATTTT ATCT 
AAATTGCAAATCAAAAAACATCTAT AACATCTTGTTGGGGAT ACAAAGTT 
CTCCTGGCTGATTCTCATGCTACAGAAAGOOCGAGTTrCTGTTCTC 
TTGGGACAAGTGCCCGCG 
>731.1

ACTTTTCTGAAGAATACATCTTCGTTCAATGTGGTCGT attctt aatttt 
TTCT ATAATATTGCTTGTAATCTTTAGAGTTATGGTTTCATTTTTTGACT 

attaaatttgaaattgttqacatcagcagttgactc rrcTGTQi agatca 

Tj^TTTTTT'AATTAAGAAGACACTCTCAAGTGTTGAACTATAATTGTAGA 
GT AAATTCTAAGTGGAGGATATOGTAAATT U I IM F I GTCTTGGTATTGA 

catgt aaatgttaacatattjtgaataattcagtccacgattgtcacaggt 
tctatgtctttacctcctttcaaaata c III C 1 1 1 aacaaatactttgac 
aaatttattaacatttataagacaagacttaccaagttgtg 1 1 QG I H AT 
gattctttaaatgttttccaatacttagatacatc^aaat^ 
ctcaattccatcctattgtta 

>732.1 

ctcaocctgaattctrrcctgcn^agato mm ig aggt 

ctggatogggaccxx^ttcctgtaacaogactgtat^^ 

t atgaatctgcacccccgcttggtct ccaat atccagggatg 

>733.1

acaaaactatgtgagaacgtatactacttctcggccacaact 

tagatattcataaaataaccttjtgattgtgttttcacattgac 

gttctg tccaatcttataattctgattaaatgttctgggcctcaaaa 

atttttaaaaggccactaactocaaatctaggaaca 

actactgt aacttgtataaaattaacttgaaaaattcajctcactccaata 

aaact atgatttatgtagctcataagagggtgaattttgaatatttactc 

TATGAA AAAGCCTAAGCAATTCAATAAAAACTTGATAACTGCAOGTTTAG 

TTTGCAGCATCTTGTACCTGCCCGGGC 

>734.1 

AGTTGTGAGGAATCTGCAGAATTCCCC I 1 ICXilGIGGAOGGCCTTTCAGG 
TAw 1 1 1 u ii; i GAATTTCATTAGCT ACATTAAAAAAAGAAAAGATCAAATG 
CAATAGATAGCACTGTAATAGATTTTGCTACATTAAAAAAAATCCATTTG 
AAT ACACAGTGAACTAAAACACCAGAGTGGCT AAAAAGT CCCTTCATGCA 

CCAAAGCAAATCCAGCTACTTCTCT agctgagagggtggaatgactccaa 
AATAI IUI 1 1 CAAGCTCAAAAAGCCTAAAACAACTOCACATAAAA 
>734.2

aacaaaaatctatctaattggacatttacct^^ . 
agtgggaaaaaaaaaaaaaaaaaaaaaaagt 

>735.1 



g tcac agacacaggctgggaatttcc 
caaatctt acaagttctcgtcccctttcgctt aacaactctttcggagta 
T CTCC GTCCi 1 tcacacttt attgtaagcgaggagagcagccaggctgca 
ccttt aacatttcattcacaggatctcagctcagccaagtcctcagccat 
tttgt aatgaggatcaci ii ci i ccggttccccgtgacctgtccctcgcc 

TCCTCT AAGCCT CAGCAGAAAGGCCTTCAACATCCACTTTTCCACAAGAT 

TCTGTCTATGATACCTGCATTCTCTGAGATGCTAGJVAGCTTTCTCTCCAG 

CTCTCC CCI I I CC I CTCTGAGCCTTCACCCGAGTCCCCATTGATGTCCGT 

Ami I ACCAACAAGCTCrrCACCGCTATGGAGGCTTTCTCCAGCAGGTC 

CCTGAAAACGTCTGCAGCA 

>736.1 

ACTTGTCTGCT TCAA TAAAATTTGTCTTTGATTTCACTGGTGGAAGGGTG 
CT TGATC CAGCTTTTGCTTCTCCATGAGGAGGACTrCT GTTTTTCAGTTTC 
CGci 1 1 I ATTTCCTCTGAGGGGAAAAAAAAGAAGCATACATTATAAAACT 
GGACAGCAGAAAGACTGAGTAATTTCTTAAGTTCT ATAAACTCATTTGGA 
ACTTCT ACAAAAAGTTGGAAAGAATGCAAATTTAATAAAAATTAGATGCT 
AAAATTGTTTCATCTAAATTTrrrAATTCACACAAATAACATAAAACTAT 
ATGAATAGGT 
>737.1 

I 1 1 1 M 1 1 1 1 1 II 1 1 I M 1 1 1 GTTTTGAAAACCCTTTATTCGGTTTCTCA 
S^AACAGT^TGCATTAAGAAATT^ 

cgat cccagt cctcacctcattgtgtggt agcocagcagcagagaagata 

ggaattttctgcoocctagcaatactgtrcatcccatogatggccgaaat 

gccagtctgaatcatttcctctgggtagattccacatt^ 

gctgacct aatgt atttccaaaaaggaaaatttcaacaagttgccgcatt 

attcatgaatgaaattagatatc>tatcaaattaaaagaaaagaaaaagc 

acc agaag accagaactacataaagcatctctttactagaaaaaaaatca 

gttatttttcaaat atgaaacttgaaataattgtttccttt actcttttg 

gagactcacaaaacattgggtaatagaattcaagtt 

>738.1

ACT ATCTGCTCTGAAlTAAAATTrAGAACAAAAATCACCTGCCGTGCCAC 
T ACACATGGACATAATCAACTGCTAAATTATGA I iTGTTI IC I ICCAGTT 
ACTTTTCCAATTATTTTTACATATACAAATATTTTCTTGGT AGAAGAACA 
AAAGT GGCACT ATTCATTGTGTAta I M II 1 1 G I AACTTATATTTTACCCT 
AAGCATTTTCTCGTT GTCTT AAATTATT AATTGAAAATTATTCATGGCT a 
AATAATGCCTAGGCTGCCATGAGTC 1 1 1 ICIOUI I C TATAAACOGTGTCA 

gcattcttttatatatatctttcagcacatctgcaatga i men fg gaa 

TAAAT TTCT AAAGTT C«CTGGATCGAAAAGATTCAGGGATTTTT AGTGTT 

i rCAATTTGGCAAAGTATTITTCAGAAACAAGCCCATTTCAG 
ATAAACAAAI ICI I M 1 1 ATGTTGCATTTAAAATCTACC 

>739.1

ACACAGI I ICCI ICI I OGAAACAATCCAGAAGT AGGCT AGCAATGGTCAC 
CCCT ACATACTTCOGCACACATCTTT AAGAACAGGACACCATT ACCACAC 
CCAAGAAAACCAGCATTTAATGAATTTATTCAAGAGTATCATCCAACATA 
GTCA AATAT CCACAGCTGTTCCGAAAGTATCCTTCAATTCTGGATCCATT 
GATGGTrCACAGgTTGTATTTGjaC TGT^ 

TCAG AGTAAAACTGGU; I GOODCTCI 1 1 CI MCTTT ACAATATTGACTCC 
TTJGAGGAACCGGGGCT^ATJST^^ 

TtX^TGTGACCAGATTtXiGGTCACAAATTTCT GGCAAGAACCCTTCACAG 
ATGACCATGTATTGGTT ATTAGGTAACAATAGATT ACTCAAGT AGAGAAC 
TGGGAAATT 
>740.1

" ^CAJTGTCTGCATT^GAGATTTTCCTATTATCrrTCT 
TGTTTAATTATACTGTGATCTACAAGCA 

>741.1

ACTTCAGGTTAGAGATGACTT CAAT ATATGTCGCAGACCTCCCAAGGTGA 
vj oATCACAC AGCACTTA I OA I AA I CAOGAACiCAGCTCCACAGAGGCTAAG
ATGAAAACAAAAATCTCAGGAAATTTATGTTTATAAAAATGATACTTGCA 

aaaaaatgaatggaacgatctccattgctt atttagagtgttgactcact 
gaataagattttaaattagtcaatagtattggatgcctctatatctgcat 

ATCAATAGGCTCATAAACAAGGTrGCTCAAAGAACTGCCCATCAAOCACT 
TGGTTTCATCTCTGGACACCACACTGTTATCTTOCTTTGGCCTCTGTCCA 
TAACGGGTCCAGGCTACGTGCACCAAAGGAAAAGAATTGGGT 

>742.1 

acaggtttcccttgcctcaacttctcatcctgggtgatgagactgttact 

ttcci i c n gtataaagagggcaactttcatgtagaaattttacctccta 

cttttaagaaaaaggaaaatcagagtgctttaaaggaaaatcagagtgct 

tttcttgcatctgctatttttcaagtgtctttaactcaaaaaaatcaata 

tgccaaagtggcatgtttgggggtatctggttctgaattccttcaggaaa 

gatagaaagcaaaagcaaaataatagg 

>743.1 

acttcctccttggcagcatcaatcaggcagggctcagcccacacccggctc 
ctaaagacaagagagcagagaaagcagaatggtgtttagagaccatcgca 
gtgacctgatcctgaaagcacctgtaggaaattggcctccgccaagtgaa 
tgtgacaatgcagtcagccacagtgacggagtgcaagatcggatcaccac 
acagatccaagagaccggtcaccacacctgagaaacaagaacccaagaca 
gc^catggaggtggaagcgtgctacgcagttatggcttcactactgaat 
gcgatcttgc 

>744.1 

acgcgggtg 1 1 1 1 1 1 1 1 igggtaai 1 1 ici i gagtt agaaatgtagttag 
aactgtgactaagggcattgcctggaatgtgctacaaacacgattagata 
ttgattt atcttcctcgt att agactgcttg t atagag act cagtgttta 
gacattcatttctcttccttgtataag actcctt gtataagactcggtgt 

TCATTTATCTTTTTAAATTAAACCACAACAAATATATGAGTTTTTAACCA 

TTGCAATGTGCAATAAATAAATATATCTGAAGTAGCATTAGCCTTCTAGT 

TTTAAATAATAA 

>745.1

ACCI 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 CGTCAAAGTCACTATTTGGGCCCTAA 

CATAAT 

>745.2 

CCTGCTCAGAGCGAC«GAAAAAAGGCAAGCCTmCAAACATAACTCTCT 
CT ACAAGCCAGCTATTATGGCAAGGGAAAAAAGAAAGCATCTAGATAAAT 
A TCTATCA AAATTAACTTTAAGAGAAATACTCT C ITTCCn A AAAGCCCT 
TAI 1 1 II I AAGAGACTAGAAAATAAGTTACTATAAAAAGTGGTGGTCTGG 
GGGCTAAAAACAAAACAAAAAAAATCCTCTTTTCTACATTTTTTAGTTTT 
CTG 

>746.1

Alii I I I 1H I III 1 1 1 1 1 rTTTTAGTTAAAATGCTTTACCTCAATGGTTG 
AGATATTTTGAATGGATTTTTCAAGGGGGGGAAATGCTTATTATAATAAT 
AAACCAAAATACTTAACAGAAAATTGTCAGCT ATTCTGACAAAAATAAAC 
ATTTTGAGAGACTTT ATTTCTriTGTCCGTTTCTGTGGTATCACTCATTG 
TCGTT AAGTAAGT AAAGCTTTTTAT ATTT AGGT AAGAACT GATTTT ATTT 
TTTA AATTATATTTTATATTTATTAGCACAGAAGAATAATGAGAGCCACA 
TTTTAGTTCAACTT 
>747.1

ACTCI I I I la 1 1 1 AGGTATTTCCCTCCT GCTGTGTCCAGGATTGCTGTGTG 
GTGGTGATGAGTGCTGGGAGGTGAAAAATTAAAATAAGCCATTTACCAGT 
CAGCATCC CAAT TAAATATTTGATGT AACTGTGATCTTTG AGCCAGGCTT 
ATATATTCATTTTCAAGCAGAGGAGTTCCCCATTTTAAATAGAGGCATTG
TCTGATGTGTTTATGGTTAACTGCATCTGGCTrGGGTCTTTCTGTTTTCr 

men igctgaattagaaggggttactctgaAgagtccaggtcttacag 

TGTGGTTT 
>748.1

aattttacacttattctttgaaagaaaaattct atggaattttcttcttc 

taattaaattccaaaatacattctctcaaccctatgccctcatactagta 

acttgatggttagcgggtaagtaggtagtagtaaaagagcaaaaggggaa 

atttggggagcaaaaaagggagaaaaagaaaaaaagggacccttctagtt 

tcctaatagaaaagctagagaattccattcctgaaaattaaaga 

>749.1 

ACCACTCACTACATTACAAAATAGrrCTCTAACATAAAATTGCCTTAATAA 
CTATACTATTATAGAATCTGATAAACCTTACATTATTAAATTGATTATAA 
AATCTTCTTGGAAAAACTTTGGTATGTATCTTCAGAAGG TTTTTTAAAAA 
TAATATTTrAAGGGCCTGTAAACATTCCATTCTATTAAAGCACAGCAGAA 

taagtaatggatattcaactgcatacagaatatagaatcaaaaaaacaat 

TTATTATGTATTTGTAG AAAATCATTACCAG AGT AAGCAAAAAA 

>750.1 

ACATTTGATTGTGGCATATTCAACTATGATTTTAGACAAGATGTGTGTGT 

gtgtgtgtgtgtgtgtagacaaaataaaattcagaaagagaaaatctatt 

CTACAATGAAATTCAATCTCrrACTTAGCTATTTTGAAATTGTGTCCCAA 

taccacattaacagagccaaaatgaaatttaaaattatggttatactatt 
attcacactaggtagggtcaggtttttttgtctgaattaaatggctcctt 

TACGCT AG CTACrrTAGGAACCACTTCCCATACCCTCAAGCTAGAGTAATA 
>751.1 

ACATTTGATTGTGGCATATTCAACTATGATTTTAGACAAGATGTGTGTGT 
G TGTGTGTG TGTGTGT AG ACAAAAT AAAATTGAGAAAGAGAAAATCT ATT 
CT ACAATGAAATTCAATCTCTTACTTAGCTATTTTGAAATTGTGTCCCAA 
TACCACATTAACAGAGCCAAAATGAAATTTAAAATTATGGTTATACTATT 
ATTCACACTAGGTAGGGTCAGGTTTTTTTGTCTGAATTAAATGGCTCCTT 

tacgctagctacttaggaaccacttcccataccctcaagctagagtaata 

GATACCTGACCC 
>752.1 

A l l I I 1 1 1 1 i I M II I I I I I I I 1 1 i I GGGAGCCATGGCAATU I I I I ACA 
CTTGATTrTAGCCAAAAGGCCAAGAAGCAATGAAAGCCATGATAATCTTT 
TT ATGCAAT GTT ATCAGGT AAAAAAATG GCT AAAGT ATATT AGCATTTAC 
CCGAGTGGTATTCTTTTATAGAACTCAGCTACTAAAACCAGGGAGAGTAC 
TTGGTGTA7TTCTGAAACACTCTGCGAAGTTGTGGAT AGCTT CT GGTGGT 
AAGGATGGTAtTGAACACGTTTACGTCTGTCCCC 1 1 I CIL X il 1 1 CTCCTG 
CTTCATACAAGG 

>753.1

AC I 1 1 M 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 M 1 1 GGTATTATATAAAATAATAATGC 
A TCTTAC AGGGGAAGTCATAAATCCAATGAAATAAAGTATTTAOCT GACA 
TATTTTTCCCATCTTCTTATTTCAACCATTTGACTG^ 
AATTGTTGGACTTTTTTAAACAATTCACACTGACTGGCAGTC^ 
TAAATAGTTGAGTTCCA I UCC 1 1 1 AAAATCATTTAAAAACATGATTTTTA 
AATTT ATCT CCATTACCTT A 1 III G 1 b II 1 ACTTTTTT ACTTTT ATTTAT 
TTCCT 
>754.1

A CI 1 1 I I I I II 1 1 1 I III 1 1 G GTGGGGAGCTGTATTTATTTCCCAGGGCT 
GTCAAAAC AAATATCCATAAATTG G GTGGATT AG AACAACAAAAATTT AT 
TCTCTCTAGAGAAGAAC G 1 1 1 ICM GCCATTCCCTGGCTGCTGGTCATTG 
CTGGCAGTCCTrGTCCTTCCCTGACTAGTAGCTACATC^TrCTCATTrCT 
GCC1 CTGTCn CATATGGCTGTCATTTCACTGTGTGCT I G I CTCI GGGTC 
TTCAAGTGGCXnTTTTATAAGGACACTGGTCATTGGATGTAGGGCCTACC 
CCAAT 
>755.1

ACATGTTGGAAGGGTTTTTAAAATGTTTIGAAACTGTGCACAG^CCAAAC 
CGAACTTTCAGGACATGGGTTTTCAACTTCTGGATGGTATGATGGGGTGA 
TAGTAGGGTATAAAAGTATCCTGAGAAGTTGAAAGCAGTGTGTGAATGGG
GTGTTCTTTTCTCCCCACAAT«rmcC^ 

CACCTCACAGATGCTTG G G CCTG GAAAT GAAGCC ATGAAAATG AAGCCCT 

CAGCCTTCTTGGAG^TCAGAGCCATGGTCCTCACCCACAGCACATGGG 
>756.1

ACACAAAAT ATT AAATAGGAT ATTT ATTTCTAAGCCAAATTTCAGAAAAC 
AATTTACAAACJ Mill 1 AAAGTATAAACATAGTGTATGCTTACTATAAA 
AGGAAAAGTATAAAACATTACTCAAGTATATATAGAAAATGAGTGGGCTG 
CTGATCCCCCTCTATATrATCTATTGCTGTGTGACAGTATTACCACAAAT 
ACAGTAGCTGAAACAACACATTTGTmCTCACAGTTTCTGTGGGTGAGG 

agttcaagcatagcttggtcctctgcaagcttacaatccaag g gttgg 

>757.1 

ACTTCi Ml H I 1 1 1 1 1 AAATGAGTAGGAAGAGATGGTATCA 

CAAACACAAAGCACAGGTTACrTGTCTTTAAAAATTTGCCTTCTTCTATTC 
TCCAATGGAAGTGGGAACAAAGAGAAAACCCCTGTGTGTCCTAGCACAAT 

atgggcatttgtgtggatttaat aaatg gg catttggattgttg ggaaaa 
tgtgatcaatcagca ggctata gaaac^cagtttgatacgatggtgaaaa 
crrtgtctacaatgatgttttttcagaaatgttggtgtgattagaacaagt 
cagcaatgatgatgacaaaatatttacat aatgttatagatgtg gcttgc 

TAATGGAAAT ACCT ATCT GAGGCTGTTTAGG AATACACAAATTGAGAACC 
GTTT AGTTCAGTTT GCTTTAAAACAGTGGTTTTCTGAACCCTrTTTATGT 
TCGTGATCCTATGATTAGT AACATCTT ACCATTTTAGAATCACT 
>758.1 

ACTTGTTTTAAAACAATGTTGGAAATGAGGAAAATGAGCAATATCAACAT 

atagttcagtcttagggatgaagggatttatgtct ctcctccctcaggta 
cgcggggactacactggtgtctga c i 1 1 1 1 icctagagatttctccctga 

AAAATACAAGGGCTGTTGGTGAGAGCAGACTTGAGGTGATAATAGTTGGC 

CTCTGGTCT ACAAAG ATTTCATAACTCCTTGGAAAGCTTCT 

>759.1 

ACTCCGATTGCCT CTrCCCATGCTTCTCTGCTTTCCAAAGAAAAAACTGAC 

CTrGTATAGATOCTGTCAGCTGATTGCAGTGCTGTTAACTTCTCCAT^ 

GAGTTGTTCAGTCrGAGGAGTTAGGTATAAACCCAGAGTGGTATTCTCTr 

TTCTGrrGTGTrrGGTTTTGCTTACATATTCAGGAGCTGCTCTTTACaX 

CAGAACATCCGT ATATATG MUM I CTGTTTCT AGATTT AAAAATATTC 

C*GAAGCCTGGCCTCAAGATAGATAATATTTTACTTTTA 

>760.1 

AAAATTCTCCTTTA 1,1111111 mAAA A^™TCCTTAATTAGGTA 

>760.2

TCAAGCGGGTAATTGGGTTATTTAATGGGGGG 1 1 1 1 I II 1 AAAGTTTAAG 

GGA 

>761.1 

AAOCGAATGAAAAGCACTGGCTTATTAGACAAGAjGTTTCCC^ 

TGCT AAA ACAGT AAGAGCGAGCTTCCAAATTAATGTTG CL. 1 1 1 1 1 1 1 1 1 1 

I 1 1 I •CCAAACTGAAAGGAGGGTGGGGAAAACAAACGCATCATATGTAAA 

GCACTGAGTCCAGCCTG 

>762.1

ACGCGGGTATGGTTTTACGAACAAATTTTTAAGGAAAAAAATTATCATGG 

TTCTAATCTTACATGTTAACATTTCCTTGTT ATGTAGGGATCAGACTTGT 

TATAACATAATTCXJACTTTATAATTCAATGAAGAAGAAAGTTTTGTCTGA 

TTCTGAGGT ATGTAATATTTCATTATTATTACCATATTGATATTCT CT AT 

ATAAAAAAATTT ACAT ATTGTAGTTTT CAGGTAAAAGCT GTT GTGAACAT . 

TAi i f 1 1 1 GTCTAGTGTAGTT AATTT AAAAAAAAAAAACAACTGA 

>763.1

ACGCCT AAGGGAGAGCTGGGAACTCATCAAAGAGACAAAAAGATGCTTTT 
ttgctctgaaggcatcgctgtggtg

>764.1 

ggccagtgtgatgggatatctgcagaaattcgccctt agcgggcccgccc 
gggcaggtaccgcgggattcattrgagtgggaatctcaaagcagttgagt 

AGGCAAAAAAAA 
>764.2 

TTCATTAAGGGATTAAAAATGTATAAGG(XAGCACCGTGTAAACCTTCGA 

CTTTCAAAGAATTTTCCTGGAAA 

>764.3 

TTTTAACC limn I AAATAAAGGGGG AAGGCC AAG I 1 1 1 1 1 1 1 ICAAA 

AACTTCCCCTTAAAAAAAATGG 

>765.1 

ACAGAAGCAATG I P 1 1 1 1 GAAAGTTTTCT ATCTGAGGATTGTTGAATCCA 
CAGATGCAGAACrrCATGGAAACAGTGCCCACTGTATGTCACAATTTCAGA 
AAATCAGTATTTCATACAATCAGCTAATAGCCTAATTTGTTGAGCACAGA 
AAAATACACTGAACCAATTCTGATTATTGCAGAGAAATGATTGGCAGGAT 
ATTGG G AAAT AG AATGAAGG G CGG AAAG AATTTCACAT G G ATTCAGT AT A 
CTCTCCGTCAGG AA I 1 1 1 1GTTCCCTTGATCTTTTTGTGTTTATTGCCTT 
ATTTATTGGGGCeCT 
>766.1 

ACAGAAGCAATG 1 1 III 1G AAAGTTTTCTATCTGTGGTTTGTTGAATCCA 

CAGATGCAGAACTCATGGAAACAGTGCCCACTGTATGTC^CAATTTCAGA 

AAATCAGTATTrCATACAATCAGCTAATAGCCTAATTTGTTGAGCACAGA 

AAAAT ACACTGAACCAATTCTGATTATTG CAGAGAAATGATTGGCAGGAT 

ATTGGGAAATAGAATGAAGGGCGGAAAGAATTTCACATG GATTCAGTAT A 

CTCTCCGTCAGGAATTrrGTTCCCTTGATC 

>767.1 

acaatcaaaggagtctaatggaaccaagtagcaatgttcccgaaaacaaa 
ca aacaa aaaacc ccaaa cattttgctg 1 1 i ci 1 1 cccctctgtatttgc 
taacttt atcatgactttattcttaaagcctatcactggtctgcttttat 
taatagattagtggaaattncacctggcctattagcaccttataaagaa 
atagattaagagtaggaaatatatagatgaagatgtactgtatagaagtt 
gtgtaaaatcagtatgaaagttcvutgttgctgttcttgctcagtgattt 
taaagaaattgagtagttcctatgtg a I I 1 1 1 1 1 II I IC I II I C TAAACT 
>768.1

acat atacarratgtaatgaaaaagcgtgcatgggqatgaaaaaaaattt 
ttttgtttatatgcggatacaatatatacaataaaacacctaaa 

>769.1 

acttatttttttact aaggh rim 1 1 1 ggagacttgti i gaaataaagt 
gatcctcattcaggatttagaaacaaaagttatactccacatgctaggga 
ttaggaag gct aatgt gaactgatcaaaagtatgaattatggaatgcctt 
tagaataatcaacttttaggt aatttgatactgctataatttcaagctt a 
gagaaaagttgtaagaatggcataaggaactcct atatatcctttatcta 
gattcactaaatgttcattttgtgccatttgtgttattcttt gt ctcatc . 
ctag«x^tcagcctaacaccaccagggataaaocagtagtctgat 

>770.1

ACCT CT CATTTGT C^CITTTCAACACTTCXrrGGCAGGCAGGCAGCATAAC 
TGGTGCTGCTGGGGACCAACACACTGTGCAACT L. 1 1 ICI I CTGAGCCAGG 

CTCCCCT ACT gtcttttcatttatgtcaaggcaggggaagacctcaaagg 
gctcttgcatcccagtctcactt(xx^wiagaggcac 
tgtggggacaggaactttgcggcaagccggggctgtccaga 
ggagggctaaatagtagaaaggagagtcttattggtgatatgtttgcaaa 

CTGGGAAAA GAT AGCCTCCAGTGTGGAG CAAAG ATG CT CCTTCTTGAAAG 
AGGGCAAGGGCAGCTTGGATTTTGTGCCTTACAGGGTCGGTATTATATAA 
TAGAGTCATGCAT ATTCAGT AGGTTTGGGGGA^^M^X^^^ATATATTT^T 
GAGGGGaGCCAACTACATGGGCAATGGATAAACATACATGT AACACATCC 
CATGTTCAC
>771.1

gaaaaggaaggcaaacaacttgttgacatttgggaaattgggatatcctt 
tggggaaatgtagtaatcagtatattctgggaaaaacattatagaagaat 
gaataaataaaattccattgaatttggaat atgttgtccattctt ccctg 
taactaatgctatcaagataaagttagaaataccacatttcagaaacagc 
tggaagtagacagggtcttcatagggctagcttgggaaacct aaatagct 
attaaataaatgaa 

>772.1 

accaccaataatgatgcccacatttgtatcctaaaaaaaagtgatttctt 
gttcctttgcctacaagaacat 

>773.1 

ACT atcatcccccaaggccttttacagtctgaaatatcaaaattgaaagc 

AAAAATAGGATGACCAAAGGAACT ACTATTTT ACCTTCTTTT C 
>774.1 

AGATAT ACATTATGT AATTAAAAAG CGTGCATGTGT ATGT ATT AAAAATA 
ATGGT AT AT AAACAAATAGAAT AT ATACAATAAAACACCT AAACGTCAGA 
GGGCTGCATG7TATTCCACAATAGGTAATAACCAAATAGTATTTAATGAA 

>775.1

ACI 1 1 I i N M 1 1 U> ITTTTTGAGAGGGGTCATCCTCCAATCATTAACT 
ACT TCTAATCTTCA CTGCTAGACAGAAGTTTCCAATATTTTAGCAACAGA 
TG GCTTT GC 1 1 1 1 ACCTTATAGATGAGGCCAAAGCACCAGGTAGGTGGAA 
GGTTCTTGTATCGGTTCGAACCCCGACAGCGOGCCAACAGACAACACGAG 

gcagtggggagcaagacgctgttttaacgagcgcctgggtgcaggcgtgc 
ttgagctgaaaatggcattcagccccaagtgaggacaggg 

>776.1

AGAGCI I IGI 1 1 agaggctgttgtagtaatccaggtaaaggcttttaatc 
atgtcctgaacaatgatcagcaatggcaatggagatgacagaacagaatt 
ta agaa ggaataaaaaaggcttgctgactacttggatgtgggtgatgcta 
tcctttgacacaaaggatttaa 

>777.1 

actgcaagccaaatgcaatgaacaaaccaaggttattgataattttacat 
cacagctcaaggctactgaagaaaagctcttggatcttgatgcacttcgg 
aaagccagtttccgaaggtaaatcggaaatgaaagaaactttagacagcc 
agcttgaggcagcttgagaaacagaattaaaca 

>778.1 

aaaaactccttattaaggaaattttttaacataccaaaaaatagtaagaa 
t agt atcatgagttcctgtgtgtattcxxxx^aacttcaat AATTATCA 

AT AGTCCACCATTCTT ATTTTACTT ATACTTCCCCTCCCCAAGACCTTAC 
Tui i i I GG CGGGGGCTGAAATTATTTTAAACTAAATCCCAAACATATCAT 

tcacctttaaatacttcaatgtatatctctaacagataaagact 

>779.1 

actacgaagctgcagatcattacgctgatatgaatgactgcttgaaagaa 
CAATGACT CTGG CACAGCC AL. I G C 1 1 1 1 caoccaggaaagcagtttttca 
cagaatggctttgatttatactttgcacaccattgagagaataaaaagaa 
aatct aaaagtt agtcttaqagcatacaaacattctat atactatttcat . 
caacttt atgtt^taatgatatataatttatatatactgaaattattttc 
agatccacttactgtgcttaaaccgaaagtgaatgataaagagcaatgaa 
ttatctaatgtatctttataattaagaaatca 
>780.1

acag acagtgtg atggatgatgctgctggttgtaaatttcatcgtgtgtg 
tctaai Hi m i cctgttgaatgggtaaaaagaaaacaaaac i nrm 
agaagatgaattttgct gtcatgtttttgtggaaatgagggatccgttga 
gcttcact atccaccttggaagtttgaGtttgaagccatgaaa 

>781.1 
tggcggatgagtcttttaatagaaaa
acacacgtgcaacagtatcaacacacattttttggcaatcctgacagcgc 
tgaacttcagttcttcaccttggggggtggcctgtacatatcaaaatcta 
tcaaattggaccctcaactatgcarrtttctgtgtgcaagttatatctca 

ATTACAAACAAACAAAAACACAAAAGCCTATG GTT AACCCAAAACCT AAA 
CTATCACCAAGAAATATCAATTGGGGTTATGGCATGACCATCCTCCCCAA 
GAAAAT AAAATGCTTGACAGATTCTG AGCGGGACAAATTTCACTGATCAT
ATCCCA 
>782.1 

ACAAATAAATGAGTTTGCAGTGAATTGGGCCTTCAAATTACCTCAAGTGA 
CAGATAGT AAGAAAAG CTTCTTG AGCAGGTGGAG GTCACTGAATCCCCTA 
CTATGCACTTATCAAGATTTTACTTACTTTAATTTACTGGAAATTGATTT 

tt taaaa aatga ctacactg taacaagggaagggatctgggtttttttgt 
tgttt tattct tg I I 1 1 H 1 1 aagtagttcaaattctgaaactgtgattt 
aaaaattttttacagtcaagcattctgattttgaacataactccctto^ 
tttct gtgtaacaaaggtcrrctctgttatctcttaaattttgtt aca 

>783.1 

ACTCTTCACTGTCTTTGCCATGAAACTTTATAACATGGCTCTCCAGGTGT 
TGAATCTGGTGCCCTGTCACCCTGTGCTCAGGGAACACATGGCGGCAATC 
AGCATGTGAGGCGCAGAGGGAGGGGAAGCT CCCCTT GTGATATTTGAGGT 
ATCAGCTGACTCAAGTCTCTCTCCCTTCTCTCCTTATTCTCATGCT ACCT 
CTCCCAACCATTGTCTTAACTTCCCTGGCCAGGATGCCTGCCATATTAGA 
TGGAGAGGAGGCAGTTTCTAAATGGCTTGACTTTGGTGAAGTCTCAACTC 
AAGAAGCT CT GAAATTAATCCAGCCAACAGAGAACATTACCTTCCATG 
>784.1 

ACTACTCGATTGTCAACGTCAAGGAGTCGCAGGTCGCCTGGTTCTAGGAA 

TAATGGGGGAAGTATGTAGGAGTTGAAGATTAGTCCGCCGTATTCGGTGT 

ACC CCTGG GAGGTGCCAGTCATTGAATAGATAAGGCTGTGCCT ACAGGAC 

TTCTCTTT AGTCAGGGCATGCTTTATTAGTGAGGAGAAAACAATTCCTTA 

GAAGTCTTAAATATATTGT 

>785.1 

acaagaggatatgtgtgcattacatgcaaccact acaccatttaatatct 

GGGGTGTGAGTATCCGTGGGTTTTGGGTATCCGT GGGGGTCCTGGAACCA 

ATTTCTCCTGGATACTGAGGGATGACTGGATTACTGTGTGTTTGTGTGCT 

TGTTTTTAAGCTTCAAAAGATTATGTGATCTAGGAGTTGTTAGATTTTAT 

TATTGGT CTT AAAAG AT AAGCTT AGATGTGTT AC 1 111(1 GGAGTTTTAG 

TTT ACAG TGATTCATGAATCGGGCAGCTTCAGACCACAGGAGACATGAAG 

CAGGTAGAAGTTrAAGAAAGCTTGACAAGCAAAATATTTGATTTGGT 

>786.1 

ACTAAAACT AAAACTGAGCAGTTrAAAACATTCATTTAAAGGGATATCTA 
ATGTGTTT ATT ATTAACAT AAAT AATGTTTT ATGAAAAATGT AACCTT AG 
TTTTCCAAAACAAAAATGTTTAGGGCAAGAGT AACATTATTTTACATTAT 
TGCATCTCAGTGAAAAATAAATGGCAACAAAATTCTTATATCTGCTTCTG 
CAGTT AATTCT GTTCA TTTT GTTTTGGTTGAAATATATGAAGGAAATCTG 
TCCTCACACAGTTGTGTAGTGGAAAAAGGGGGACT ATTGTAACAGGCTG 
>787.1 

ACGCGGGATTCCTGGTTAAGCAGGCATTGCTTTGCCCTGGAG<>GCTATT 

TTAAGCCATCTCAGATTCTGTCTAAAGGGGTTTTTTGGGAAGA^ 

TTTATCGCCCCTGAGAAGATCTACCCCAGGGAGAA 

>788.1

ACCTGCAGGCCTCCTACACCTACCT C T C I CI GGGCTTCTATTTCGAOCQC 
GATGATGTGGCTCTGGAAGGCGGGAGCCAC i 1 10 1 1 C CGTGAACTGGCCG 
AGGAGTAATCGCGAGGGCTACGAAGCGTTTTCCTGAAGATGC 
>789.1

ACTTT AATTTCTTT ATAATTTGTTCAGCTATTTAAAAAGATAATCCACAA " 
TCTCCT ACCGCCATTAGAGCACAGGAAAAAAAAATTCAAAAATAAAGGAA 
aaacatggctcatatatctacagaagtcacaaaaatact atagggcacat
atacccaggcctcagcggtgggaagaaaacatacaaccaccgggcaaaat 
gtttgaacactgaagacgggaattttttagggccat ^ 

tcaagaccatgttgaaggtaactgggaaagtcctggatagaaatagatta 

>790.1 

actcaagtcgcccttatggagcccttgattcaggcttcaatagtgtggac 
agjgctgataagagatggtagggaatgaagtaagtgttttttatgttcxx5 
tgtgttataacacctgattaagagaaaacagaatgatgaaaatgaaaagc 
gtcttaactggattcagtttctxactacataaaatacagaaaagtcaagg 
tggaggcaagattc^xaccctctccag cagaattggcattctgcgtcctt 
ACCGGC 1 1 TCTGI cacgtggatttccgcct gtttcctcattgcctcatgg 
aaatagtttcatatcatagaaaggcaaacaggagctgagccagttgaaac 
tgaagcctacaatctgaggtggggggtaatctcg 

>791.1

ACTAAi iui 1 1 1 CCTCTTTCXTT agaccgattctagtttgttgccttcxct 
^cctcggaaaccccaagtttgggatgctgcagacactctgtgcccccct 
gcatgctgggtgcctggccagctgccagggcataa^ 
ggcctt tgtcc tt aagaatgaggtttgaaagcctcagttcttccatgtta 
ggtgai 1 1 ci igcagctcttggtatctgcagaattagtgtgaatgcttaa 
aaaatattaacagctttatatcatc^aagttttaacagt 

>792.1

SXEJU 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III! gaagctgaaggccacagt 
^^^^^^^OCCACACCACTGAACACTAAAACTTAACCTTT ACTGGC 

tactttgtagataacattcacagctcaccatgaatgcagctgcagtcaac 
taacagatatgaagttaccactgtatt acatggtt atatt agggactgct 

gtttatatacaggctt aatataaaagaaaaccct agaatgaactcaacac 
aattatg 

>793.1

^^^^^^^^^^^■^^^^CTGAGTCTTCAT^CXCCTCAGCOCCXATCT GTTCA 



ITGGCAGCCA 
AAGGGAACC 



tggtct G TTGCAGAGAGAAGAGGA'I _ 

Tcc Gi nroen i c tggaaccactctcttaatg 

>794.1 

^CGAACTTAAATTTATGATGAATATCTrTG^ 
ATTTTACTTTCAATTTrATTTTAATTTG^ 

TAGAG T AATTCTACTGTTGGATTTTAATTTTTAATCATATTAAAfiTTT^ 
??3S^?7ftJT ^^*Q^ CT AAAATATTTAGGACT AAAT AAAATTTTATT 

CAAAAAAGCGAACAACAGAGGCTTC>T(^^ 

AAGTGTC 1 1 u 1 GT AAT ACTGATAGTGAAGAAACT GTTTTT ACATCCGAGA 

GACATGCTAGAAGAGGAGCTTCTTAATG 

ACCCT AGGTGATCTTTGGCTTCCTCAAGTTm 

TTCATATACCACCTTTGGCAAACATGCC^^ 

AGCTCTCCCAAGCTCT^TTGATTAATTTATTAa 

GATTACAIUI I lATCI 1 1 1 IGI I ACAGAAGAAACTTTGAAT AGCAGTTGA 

TTTGAAGGAATAAAATGGAATT ATTTATAAAGTTTTCATTTGT ATTAGAG 

- AG . . 

>796.1

ACACT ATCTGACCT AATOTU>ACACAAACTAAGGCAGGAGACACAGGGC 
TGCAAGGACArrTGCTGCCATCCAATTTGTGCCAGCCTGTTTTATCAATC
TGAACCTATATTATTTTTAAGAOCTCACGGCATCACTGAAAGATGAGTAT 
TATTAGTTGGAATTrTAGGGATGAGAAAACTGACCCTCAGGGAGAATAAC 
TGACTTGCCCCGGCTCCAACAGTAAGTGGCCCTGCTGGGATTTGAACCCA 

ggtgtgtctgaocccgaagcctgatctgacctctgacagtcgtgataaaa 

ATAA 

>797.1 

accgaaaaatgattttgttatatatatttaccacaataaaaaagttttaa 
atttattat aggtgacactgtttgctcactgtaggtcaggt a i 1 1 n IGG 

n I H 1 1 1 rCTCTrrATTTTATTTTTGACCAATGGATTCACGTCAOCAGG 

TGAI I 1 1 r 

>797.2

ttctgcacctcttatgacaat aaatgttagaat attttcatcact caaaa 

agaaaccagtatccattagcaat 

>798.1

acaatttttatgtttacagctgtaacccctgagttatcaagagatggaac 
attagatatgatttattcctatttaagataataggacattgcttgattac 
attt tcaga agat atttatccaaag AAA I H I I t 1 1 1 1 i aatctaaagga 
aaggttttgattcttatgagaaaagaatgaga rTTcm aactggaaaat 

TGATTT ATGTCCT ACAG TCCATTGTGT AGTGATGTTGGATGAATCAGGT A 
TCGCT AGGGTGTCTGTAGAAGTATCT ATATATTGCTTTTTAAGTTCTT at 
>799.1 

accatgtagctctacttttccatatacagagttgtttcctagctttctgc 

T AATCT AACTGGATTCC I CI I O CC CA TTTCCTCATTTACT AGATTATAAT 
GOVCATCACATAAT AAAAGCTTAAAAATGGGCTTTCACAGTTACT GTTTT 
CTTTTT AAAT AATTGTGAGAGAGCTTTTGCATCATTT ATT ATCT AATCAT 
GATTCAAGTGACT AGGCTGT AGCACCCAAGAACCTTGCCTT AAAACAGTT 
T ATTTT ACXXJAAT AAT ACT ACTTTGCCTTCTT ACTT AAAAATG TCCCGTG . 
CTTAACCCTTTTGCTCTTTATTTIGATTTAA^ 

>800.1

ACTCTCTATTTTTAACAAGGCTCCCTCAAGATATTAAT 
CATAGCCaGCTGTAAGATATCTTTCAAATGCGCAAGTAACCT AACAGATT 
TGTGCATGTCAGCCAGTAATTTCAACATACATTATAAATATGGCCAATTT 
TCCCAAATTCTAAATGAATGGAGAT AAAATGCT ATATAATAAAT ATGTT A 
GAGCACC 1 1 1 C 1 1 GAGAAACTrCTAAAAGGAAAAAATAAAAGACATAATT . 
AT ACT cacaccaccagtaaaacctctg GTCACCTGTTTTGGGTTGTGGAA 
TGCCCCCAGCAGCCGAGAGACCTATATTAATATCAACAGAGAAATATCAC 
ACACAGAATTAAACCACATACAGTAAACAAGAGCGAGGAAG^ 
ATGGT AATGCTGCAACTTG 

>801.1

ACTGATTATTCTCCTGCTTAGGGAGAAGCGGAAGAAGGCCCTT 
TGAGTTTTGCATTCCAACTTGCTAATTCAACATAGATCCTAATTCCTT aa 
ATOCTTBXAATTAfiAAATTCTCGTQAACXGTATTG G ' 1 ■ 1 ■ ^ T CAftfl C AA 
TCTGTTTGGGGAACTTGAGCAACTGGGGCACT GCTGGCT AGGGTGAAGTT 
TATTTAAI I IGI II 1 1 ATGACATTCTTCATCTTGGAAATGGGGTTTTCAA 
ATA ITGCTTI (XX^GGCATCATTACTTATTTGCTGGTTTTTATTTCAAGA 
TTGGGAC 
>802.1

ACGATAGGCATGCAATTAAAGAAGACCTGCCTCAAACA1 1 1 rCTGTGTGA 
CCTGAGG CAAG IULI 1 1 I ATAGCTATAAACT AGGGACAATATTTGCTGTC 
ATTTTTTCT ACAAATGTCACAAAGAACAAATTTGAGCCTG1 OGCTGTGAA 
AGAACTTAGCAAATGAAAGCATCCTAGGGAGTGTTTTAGATATCGATATT 
TTT ATCCAATT AACTTTTCAAAATGAAGTT ATTTGCT CACT GAAACTGAA 


>803.1

CCCTTTCGaGGGGCCGCCCGGGCAGGT ACGCGGGGGGTTCAGCTGTCTCT 
I ACrTTTAACCAGTGAAATTGACCTGCCCGTGAAGAGGCGGGCATGACAC

AGCAAGAC 

>803.2

AGAAGACCCTATGGAGCTTTAATTTATTAATGCAAACAGTACGCTTGGGA 

GTCCTCAGCAGGGGGATCATTCACAGTGAGGACAGACACAGGTGAACCTA 

TGGGTCGTGGAACAAAAGTTATCCTACACCTGAAAGAAGACCA 

>804.1

accttgcacagtgccttttaaattcattttgctggacagttggcaggctc 
tttgacttgagaggctatatcttaacgatttagaatggagagtttggctc 
aagctccctgtgtgtggt ctgtgc tttctatacttttattctt g gt attc 
c agagtctgg ag g cttctcttttt aaaaattgct agg ct cct g ccaaatg 

TTATAATTTGGGGATGTGAGTTCACTAAGAAATCAACTGACAAGAGGCAG 
ATTAATAGGAGAAATGACATCGAAATTTATTAGCATGCAGGGGGAAAAAA 

ttgattaccaaatatcccagtagggtagagatgcttatataccccacctc 
ttaagagagagggaagtggatgattttaggggaatagtaaatacttt 

tatgggaactcactgggcttgaagaatataacaaaagcctgggacaaagt 
ctgttgggcccacagaacagacagtggtttatgacaaaagtc 

>805.1

ccgggcaggtactattactaggttcattgtttccagaggggtgaaacggg 

gctttggagaggttaaataacttgcccagggtcacacagctattaagtgg 

taaagctgggatttacatgagcccagacaaagaacccaagaagctaagct 

attctcttgtaatacctccaacataggaggcaagaagtgaggtattatac 

aggttgaggagataaaggggagagaggcctgcagtgctaacaggaggagc 

tgggattcatcctggcttgttctgataggtcagttagtcttagagatacc 

catgaggtcacctactcaaaatggggctcagagtagccttgtcccattct 

tgtccagtgggcgcagctacagtcttcctggcctggagtgactggaggct 

gtccccacgtcccacttcagtgag gcattc^tgtgcacccaacacacttt 

ctagctttatttgcctggaggggaagattctccagaaccttgttaagatg 

gacagtgtggtcctcggactggcagtgtggcctcggcagtccctgggagc 

ttgttaggaatgcagaatctcaagctcctccctactg 

>806.1

CCCTT AGCGTGG7XX5CGGCCGAGGTACAGATATATACACACATATATAGA 
TATATACACOCACATATATATTTGCTGACA7TTTAATGTGAAGTT7TAGT 

ctgggatataaaatggaatgtatgacatcctcaaatgtctgaatactgtt 
cactcct atg llll acatttaat t ttccaaagcaaaacatttcagttgag 

GATTTTATTAGAAAATAAATAATCATTrAGCCATATCTAGAAACCAGAAT 

AAACAATGCCATAAAGCCTATAGGAAAATGCAGGTCAGATTCATAAATAT 

TCATGTGTTT ACTTTCAGTACAG GGAG GAATTTGAAGTAGATAGAAACCG 

ACCT GGATTACT CCGGTCTGAACTCAGATCACGTAGGGACTTT AATCGTT 

GAACAAACGAACCTTTAATAGCGGCTGCACCATCGGGATGTCCTGATCCA 

ACATCGAGGGTCGTAAACCCT ATTGGT 

>807.1

AATTCCCATGATGTGAGACCACTGGAGTTTCCAGG GGCAACACOCCATAA 
CCGTCCCG CTGCAGAAGAGGATCAGACGTTCAGT AAGAATGCAAAGGGT A 

GGGCT CTCrAGGCCCTTTCAQGCTAGATCTTGACGAGAGAAOAQTvSSLG^'r 
TCTTTCTGAGGTTGGTGCAACTGAAGAAACGAAAGTTTCGGCCTCTGCTG 
TCAGATCT ATGAAAGGAAAGAACTGTGAACTTGTCCC C I lllblll ICI I 

tgacttaaaagaaaagaaaatcactggaacaaagtctt aaagtaataaca 

GAAATGTCAGAAAAGTTGAACATCTTATGGGCACATGCGGTGAGTTACGC 

TAACTTATAGCATCCACTGAGATTAGCCGCATAGGATTCTTCCCATGTTA 

GAGCTAAAAGGACCTACTGTCCGCCAGCTGCATTGGAGT 

>808.1

ACT ATCCCCTACCTATAAGGCATTTATAAfGTGCTC 

TTTCAT AT ATTATCT CATG AAATCCTCAC 
>808.2

AATAATTCT G AAGGGT AGCTGGTA T I T ! I ATCTCCACTTTACAATTCTG A 
GGCTTACAGAAGTTAATTCAGTGGCCCAGGGTCACACAGTTTACAAGTGC 
CACATTGGTGAATATAAAGTAGCAACTTCT AAGTTTCACTCTCCCACTTC 
CCTAGTTATTTTCCTAAGGCATGAATGTCTGGGAAATAGCATGCATCAGA 
T 

>808.3 

TAGGGTGTGACTATTCATAGATACCTTTGAGCTAATCTTCTGGGAGCCAA 
TGTAACCGCAATGCACACTGCAAAACAATGCACGCTTTCTCTGTAAATTA 

aaaatgccaaccgagcttgggaaaagcccatcttttgatatgaaccaata 
g gg cagtttagttttag aaataaag aaagtccact gttcctgcttttctt 

TTTTaCAC 

>809.1

All I I II ICI 1 1 ICI I I U I I IU II I I IGGAAGAATATTGCATACCTAT 
TAGAAAAGTCTTTTAACAATTAAAATTGAAAATGACTGACAAACTTACAC 

tatttgatttaaataaataaataaatggtavcatgataacaatctcctga 
ttgatatgctttatttaaccaggttctgaaaccattggatgtgaaaacca 
aattttacaatgcagaggtaagtgttgagtgtttaatgggatttcatarr 
AAAC attaagatcgtatttgactaaaaatctcttatatacatttct aata 

CTGAAGCAAATCGCCAACGTGACTGTAAATTATTTGAAAAAATCACAAAT 
TTCAGTT AAAATTGAAT AATTTT ATT ATAGGTCTCATAATCTTTTTCAGC 
TTACATGGAATCAATGTGTCTTGATTTTTATTCTCGTTAATTTTATAAGG 
CCTrCATCTCCTTTCGGTAAATG ATTG CCCTCTCATTCCATTTAATGGTG 
GTTGTTACACTAGCAATCTGTGGAAT1TTACATGTGGTTCGGGATTTTAC 
AAAAATTGGA 

>810.1

TACT CAAGAAACAGGTTCTCCAG AAACT AAGCT AAAC 1 1 G I I I G AAATGT 
AAATTCTCAGGTATT CTCAG TATAGACCTATAGATTCACTT AGCTGGTGG 
GGTCCACCCAACI ICI I I I AACAAGTCCTCCAGTGGATTCT GATGCAATG 
CTAACATTrGTGAACACTGTCAAAATCAAAATGGAGTCACTTGTGTTTAA 
AAATCCTGACAAATAAAGCCAGGGACAGCTATGAAGAGAGGGTTCTCATG 
CATCAATGCCTGATTAACA 

>810.2

AAACCACAATCCTGCACAAAGGTCATCACAACCTTACACAAAAAATATCT 

TCAGAAGGACATC 

>811.1 

ACAATCATTAAAACTATGTTGTAATACTGTTTGTCTTTGTATCCATTCTG 
GCGTGT CTCCATACACTTGACTAATATTTGATATACCTGTTTTATACCAA 
TATAATGC TGCT6CT GTACGTAGAAGCTGTAGTCACCATATCCTCT ATTT 
GTTCAATTATTTTTTCATCTTCTGGCAGACT AGGATCTATAACAATGACA 
ATATCTT CAAAGCCATTATTATTCAGCTT AATGAAGGAAGTATTTGACTG 
GTGCAGCAGGCACAGAACTAAGAGGAAAACAAAACCT CTGAATAACCCCA 



>812.1 

ACCT AAGAGTTATTAATACTATTTCAGTAAAAAAAAAAATTTAATAAACC 
CTGTGTGATCCGATTGTAACAGAAAGGCTGATGTTTTCTGTTGTGAAATA 
GAAATGCAAG GAAAAAATCA I 1 I CI I I G I 1 1 C AAAGGATGC A 1 1 ICI I C C 
AT AAAGAATAATTTGT ATTT ATTTTTAAGG GTTT ATTTTAACTTAT ACAT 
CAGCCT AT AT AAAAT ACATTTCAAAATGATCTGTGCT CTTT AAATTACCA 
AAAGCAAAT GTTAA I I I I ) ) 1 1 1 CCCTCT AACAGAT AACAAGTTTTACT C 
CT ATG CTGATTTTTCTG GTG CCACTGAAGTT ATTTTGGA 
>813.1 

acatgtgcataagagggaatgcttccctacattactccagaatacaaagc 

TTCTTT CTGCXjrTTCTCATCCAGATAATGGAAGAGACTTCTTGGGTGAAA 
TACTCCACAGTT ATTTCAGTTCTCACTG GTGAGTCTG AATATAAGCTCTA 
TGAGAGCAGGGACCTTGTCAGTCTT ATTCACAATATCCCCAGCCTCTAGA 
ACAAGGCTGGCACATAGTAGATGCACAAAAGGTGTTTGCTGAATGAATGG
ATGACTGAGTCTGTGTGGGGTAATGATAGGGCTAAGGATGGGACTCTAAA 
CTCAGGTTTCCTCTGTGGGTTTCACAGTTTACTGGTCTT AAGAGGAG AGT 
TTCCT AAACTTGCCTTATGATAAAAACCACCTTCAGCATTTGGTAAAAAT 
TAC CCATTCC TGTAGATTCTGAGTCAGTGAGCT G AAGTGGAGCTGATG AA 
TCTGi 1 1 1 1 r GTGATACTGCTGCTGCTGCGGTTTTTAACACATGCTTCAG 
GTGGTTCTAAGCTTAGGAAACCTTGCCCAAGGATACCATCCTGTCTCTTG 
GGA 
>814.1 

TTATCAAAAACCAACAACCAAACAAAAAAAATATTACAACAAACAGAGAA 

ACGAATCAAACCAAAAACCAAAAATACTTTCTGGAATTCAAATGATACAT 

TATATATACCTATCAAGACAACAAACTACTAACTACCTAAACTACAAATT 

ATCATAAAAATGACn-CCTGTCCT AT ATCAATAAAAAACTGCT ATT AAAAT 

TGAGTATTATAACACA 

>815.1

ACAAGTATTATGTATCCATAAAAATTAAAAAATCTTT AAAAATGCATATG 

GG GGTCAGTAGGT AAAAGAAAAGAGAACC AAG AGAGCTGCAGCGGGGAGC 

AC AGCTTGCTTTAAACATGAGATCCAGCT CAGTGATCATG CGGGGGAAAA 

GG CCCG GCATTGCTGGAACTCCTAATATTTAAAAAGATGATGGAAACTTG 

AAATTTT ATATTTAATCTrCTCATTTTTAAGTGTTGGCAATGTATTGAAG 

ACTTTGAAGCCTCTGTGCTGGTOAAACAAGATGTATCTGTAGGCTGGATT 

TAGTCCACAGC 

>816.1

ACAACTGTAATAGCTATTGGTCTTCAAGTGGGTTTAGATTTGGTGACATC 
AGTTTGATATTCT CTTAAAGGAAATAAATATTCAAGAACTGATTATGTTC 
TAACATGATTAT ATTCATGGTGTTACATAGGCXrrCAATTTTTTCACAGAA 
AGAI I I J I GGAACAGGACTGTGAAGTGAGGCTTTTT AAAAAATTATTTTA 
TAAGC AGAG AACACAGCCTGATAACTTAGTCAAGGATAT ACTGTCT GTCT 
CACTACTTTGGACrrrATATGGCTTCAGATTAAGTCATCCAAGAAACATAC 
ATA 
>817.1 

A CATGT AATAGACACTATGCTACAGCAAAAGU I H I CI I ATTGTCTTTAA 

AATTTTCCTGGGTGCATAAAACT ATGT 

>817.2 

GGTAACT CTTT CXXXAATTTTTAACTTTTAjCATTACAAGTCATTTTCAGAG 
TAAAAAGTCATTT AACAAAGGCAGATAGAAAGGCCT CAAATCCCTGAGGA 
CCAAAAATCCCAAGAGATTTTCAAAAGGGAGAAAA I I IC I I IAAACTTCA 
T6GGAAA AGT ATr rTTAACATAATAGAGAGGCTTT ATGCAGTCTTTGACA 
AGATGATACTTTTGGAATAGAACAA6AAGAGGAAAATATTTCATATTATA 
AT ATT ACT GTGTCAAATTCT ATGTTGTCAAAAGAGTGACT ATCT CTGATG 



AGGGAACAAAATTAAATTTTTCAATTCTAATl . . . 

TGTATTCC I 1 1 I AGTGGAAACAAAGGAAAAAATAA C7 T IT7T C T CCAAAT 

AGTCGGCCTGGAAAAA 

>819.1

ACAACTGTAATAGCT ATTGGTCTTCAAGTGGGTTTAGATTTGGTGACATC 
AGTTTGATATTCTCTTAAAGGAAATAAATATTCAAGAACT GATTATGTTC 
TAA C^TGA TTATATTCATGGTGTTACATAGGCCTCAATTTTTTCACAGAA 

agai 1 1 1 1 ggaacaggactgtgaagtgaggcttttt aaaaaattatttta 
taagcagagaacacagcctgataacttagtcaaggatatact gtctgtct 
cact actttggactt atatggcttcagattaagtcatccaagaaacatac 
ATACATTCT aaatggtatatattgggaatatatgccccttt aaaagaatc 

. AGGTCAGAAATGCAATAACAATTAGACTAGACTGTT GCCCGT GTT AGGAG 

aatgtg tggt catcctagttactaattactctcactcaagatg GAGATGT 

TGTCCAGTTT AACATAGTCTTAAG MUCH AAAOCCAAATAATTT ATGA 
GTAGCTTATTACATCTGCAGAGCT ACCTTATTATAATAGTACCTGCCCGG
GC 

>820.1 

A CTAGAATT AGTTCCAACTACTGCTGGTGATAAACTCACX^TCTACCTTC 

AC I IGI 1 1 ictcttaattctccaagaagtaatcaggtgaataaagaatca 

TCATCAGATAATATTCTCCAAG ATTCTTT AAGAAATTAATTTTTATCT AC 

tcttaaatgattgcacaattataggatagaaattactatcttgtgctcta 
attcaaat tgct cttaatgatcctagagagaaatgaattactagagataa 

AAGATAAATTTTGCTGTGGTTrGCGATCTTTGTTTCTITCCTTAAAACTT 

AAC 

>821.1

ACTGGAAACCAGACCTTACTTAAGCCCACCAAAGGCAAGGT1TGGGCCTG 
CCACAGCGGATTTCAAAAAGACAAAGCAATGCAAGCCACGTGTTCAAAAT 
GCCCTAAGTGGCTATTCAGGTAATATATAAAAGTAAGACCAGGCTAATTA 
GTATACAATGGGGTAAACCAGAGAGCAGAAAGCCCTTCTTTAAAATGAGC 
CT ACCACTG CTTGG CCT CAGT G TGAATTTAGACCCCATCTTCT GAT ATTT 
CAGGAGAAAGTAAAAATCT AGATTTTT ATCTAAAATCTTTTTAATTTTTA 
AACAGTCACCT GATTTT 
>822.1 

ACAGAGCATCTTAAGGTTGGAAGGACTCTT AGAGACCATAGTCCAGCCTC 
CCACTTGATACTGAAACACGTTTGTGAATTCATGGCCGATGTCTAACTTC 
CCTCA CCACCTTrCCGATATGGACAGTTCTCATGCCCAGAAGCAAAACCT 
TCTTTATTGTGCCTGTCCTCCCTTGACTGTCATGCATATAATCAGCATCT 



TAAACTCACGAGGGGCTGCATTCACACCATCAGCAGGGAGATTACACTTG 

TGTCATTTG 

>823.1

ACCAAGACTTTAGAGGGCAAAGAACAGAGGATTCTTGAGAAAGGGGACTT 

GAAGGTGAAGAGATAAAGGCTGGTGCTTCCAGGAGCGTGGGTCTCCTACG 

TTTGTGTTCCTGGGAAGAATCTTGGACTCAGGCGTGGGCAGCTGGATGCC 

TGGGTTCCTTAGG CTTCCTCCAGGCAATGTAGTTGCCTCTTT CTCTCCCC 

GCGTACATAGTAAGTGTATGATAGATGTTrGATTTGTAAATTACAAATAT 

AAATTATCACCCCCATTTCCATTTA I M ICI I GATATATCAAAATGTGTT 

>824.1 

ACCCCCATTATAGTAGGGAGACT GAATCTTCAAAGTTACAGGGTGAATCA 
ATGATAATGATCTTTGCAGCTTTCT GGAGTT AAAAAGCATCAAAATTGGG 
AGATATTAGATGATGACATCTAAGTATT AAAAT AAGGAGATATTAAATGA 
TGACTCCT AGAAATGAACCTGAAT AAGGACT ACCGCAATGTGTGT GGTGT 
GGGAAAGGACAGTTCTTTT AATGG CTG G CTG ACCCAGCCT CAATTTTCTT 
GCAGCTTCGCCGACACGAGGTGACCATCTGCAATTACGAAGCATCTGCCA 
ACCCAGCAGACCATAG 



ACCTCTCATGGCTTTTTGGTTCCAGCAGTGAGGGCATTGGTGAGATCAGT 
GGTA AACTG T GCAAGC II I C I M 1 1 ATCATT AG GAAATGTGAAAOGTTGG 
ACAAA Ull GAGTTTTAACAAGGACAAAAAGTTGAAAGAAAAGGCACAGT 
TAACAAAAAAGGGTGGCTAGATTTATCTT GGGTGATGGAGGAAATGAGAG 
AGGAATGCTCTT GAAAGGTGGTCT GTGGATCTGTCTGAATAGAAAGAGCA 
CAGT AAGT ATG CATTGCCGGAGAAAACGT CCTTGAAG CTGCTTGT CT CAT 
GTGTATGATGTG 
>826.1 

ACT CAACAAGCAGCT GACTTT ATGTTTTATTGGACATTGTG^T ACAGGAAC 
TGTTTCCAGAGCTCAAT AAGGT ACGCG GGAAAGTCAACTCAGTT ACCT CT 
GTTTGGTGT GTGT ATCACTTGCAGATGCTGTCT ACCACCTTTT CAGT GAC 
. ATCCTAQA AGCTTC TCTATTA CCACAGTA ACTGGCT AACT AQATATGATC 
TTTCCCTAATTTTCATGAGCA I U I llll I C I GAT AT AAACCAGGGAGGGA 
AAATAACAAAGTTGCTTCACTCTGAAGGAGT ATTCTCCT CT AGT ACCTG 
>827.1

ACATATATGAAAAGCCAACATTCTAAAGTAGAGGTTCACTTAA I 1 1 1 II 1 
TTTTrrCAAGAGAGGCTTCTTGGTAGTTTCATCACACAGT GGTTTT ATTA 

ggggatgtaaggattacagaaacatcgtattttttaacatatagtatttt 

ttgaatatgatttgaattaatatagaaaagtgcattttttccagtttttt 

tagggaaaaggagatacttcaccaggaggataaaaaggaacaagagggga 

aggggaaataaaaattccagaaagatgaaaaattgttgatgtaagatgga 

ggcaca 

>828.1

ACAAACAAGCTTTGTTAAACTAACCCTTGCCATCCTGGCTACTTTACCCA 
ATTAAOCACCCTAGCCCAGGACGTrrGCTTTATCACATGTrCACAGTTTG 
CTATTCTTT GTTCAATCTTGTAACTGACTGCAACTGCTTCTGTGGGTCTC 

TGTTTCTI I atgaagtttcccaggccatagaaaacttgtgttagcctatc 
ttct gtcag tttaattgtgqaactcagccaggcccttaagaggatggagg 
agagtttttcccacagcagttctgaatgggatgaagtgaaaaataaaatc . 
tocc cattgccacta caccacctc ctgatgagtcttgcagcagaaatacc 

GTTTAACTGTTTCTGCTTTTAI 1 1 1 1 1 ictgattatcatccagttttata 
TATTCATATCTGGGTGCTTTGATAATTATAT ATACATA C 1 1 1 1 1 IGATAT 
TATTTACTTATTCTTTAC 
>829.1

ACTCACAAGCAATAACAGATTCATAGATCAGTTGACATTGGCTGGTCTCC 
AGGACAGGAATGTGGCCAAAAG GTGC TTTGTATAGACGCGGGGCACTGAA 
TCT GTGT CTCCOCTGTTACCT ACTTTTGCCAGTGAAATTTAAGTTTTAAA 
ATACTTTCAGAATGTATTTT7ACTACTGCAAGTTTTTGGTCTTTAAAATG 
TCAAGTAGCATGTCTCTCTTTCTCT CTGTCTCTTTCTGTTTCT CTCTCCA 
Gl II 1 1 1 1 1 1 1 1 1 II 1 1 AATTTCCAT ATG GGCT AAAGAATCCAAATATTT 
TAA AAATC TGGC rCTCTTTTCn CTCTCATAAAGTGAATTATTCCTCTTT 
nron 1 1 A TGTAAGTGTATATATTCTT A O I UlIU iuaaatcattgta 
NNNNNNNNCCACCTCCTGCGTTTAAGCAATTCTOCGCCTCAGCCTCCC^
GGG AJJACAGGCGCXXACCACCATGCCCGGCTA AI llllllll I GT ATTTTT AGT AGAGATGG 
GGJin^ACCATCTTGGCCAAGCTGATCTTG^ 

CTCCCAAAGTGCTGGGATT ACAGGCAT AAG CCACTGCGCCCCGCCTCT AATAATAAATATTT 

AATGAGCTCTTCCATTAAAAAACAGTGATAAGATTTATGAGGTTTACAAGAAAGA 

TGGTAGATGATGTGAATGAGCATATACCCTAATTCCTTGAGAAAACAAAATAGAAATACACTA 

AAAGGAACATCACAAGAAGATGCTATTAGTTGTAAATGATTAAATATTTTGTGACATACAGTAT 

TTTATTCATTCATATTTATATCTTCTCACATTTCACAAAAGCGATTTAGTGGTTAGGTAAAGAG 

TATAAATAGTAAAAGCTCAGAGAGGTTGAAGAGATCATTATGGCTGGGGAAAATTCATAAAG 

AAGGTGAGAACTTAACAGAACCTCAGAGGCTGAGTGGGATCAGATAGGCTAAGAGGGTGGC 

AGAAGACACTCCACA7GAAGAAAAAACATGAACAAACAATTCAAGATGCTTTAGGACATAGG 

ATAGAGGGTGATTGGGCTTGGAT ACTCCGATTAAAGAATGGTAG GGAAAAAACT AGAAAG AA 

ATACAGAGGCCTAATGGAAGTCTCAAATCCCAAACTAAATATTTAAATTTTCATGTATAGAAAA 

ATGGACCTCGAiui 1 1 ATTCTACCATAACATATAATTCCAAATTCTCTCAGTATGTCCAAAAAA 

aaaaaaaaaaaaaaagtacctcggccgc 

>2

„ ^cctgtgggaggacgtccgggtgggcggaactcctagcggacacctcgtgga 

gtccggccggaagagcaaccxi^gatgaaggtgaagatgctgagcc^gaatccggacaatt 

atgtccgcgaaaccaagttggacttacagagagttccaagaa 

ccttttgaggtcccacgagaatatataagagctttaaatgct 

gcaaaaccattccttgcttggctggatggtcaccgtgatggagtcaattgcttggcaaagc^ 
tccag agaagctggct actgtcctttctgg g gcgtgtgatg g agaggttagaattrgg aat 
ctaact cagcgga a ^ qtatcg <^^ 

I^^CTm^GjGGACTrCI 1 1 1 1 icactgttggtgatgacaaaactgtgaagcagtggaa 

AATGGATGGGCCAGGCTATGGAGACGAGGAAGAGCCATTACATACAATATTAGGAAAGACA 

GTGTATACTGGGATTGATCATCACTGGAAAGyVAGCTGTTTTTGCCACATGTGGAC^ 

AGAGATTTGGGATGAACAAAGAACTAATCCTATATGTTCAATGACCTGGGGATTTGACAGTAT 

AAGTAGTGTT AAATTTAACCCAATTGAGACATTTCTCTTGGGAAGTTGTGCATCTGACAGGAA 

TATAGTACTG7ACGATATGAGGCAAGCTACTCCTTTGAAAAAGGTTATCTT AGATATGAGAAC 

AAA T ACAA T G I G 7 TGG ^^ 

S£I A T A T A £T^GATATGCG^ 

TGCAGTGCTTGATGTGGATTACTCTCCCACTGGGAAGGAGTTTGTGTCT GCTAGTTTCGATA 
AATCT ATTCGAATCnTTCCTGTAGACAAAAGTCGAAGCAGGGAGGTATATCATACAAAGAGAA 
TGCAACATGTT ATCTGTGTAAAATGGACTTCTGACAGCAAGTATATTATGTGTGGATCTGATG 
A *^ GAA ^!Jf^ CC raGGAAAGCT 
GAAAAAG ^ CC £ A £ G ^^ 

A ^ A ^I A T AG ^? G I£*^ 

CWTCA7GAAAGAAGCTOTCGACGAAAGGA 

CTGTGCCAU I IUI G I CAGAGAAGAAGAAACACCTAGTGGCAGTITGTAAAATAATTGGTATTC 

?i aacaatcci ^ tqtataa ^^^ 

GGGACT AGATT AATTG CAAACATTTTAGTTATATGTGTAGAGCTTT ATTGTTAC 1 L XJI III AGO 

T ACCCTGAAA t A £ T ^ 

Tl- .' V" !5^TT5^ ^^ TACAOTA1TTOCAACTC AI rTTTTCTTorm t a ttagagatatac 

^ACTT TCTCT TTGAT CT ATTATTGT AGAGACT AT ACATTCAAATTGACATTT AAGAOCAAACAT 
STST 7 ATGTTATCTTTAATATTACTT TGAATAA TGATTGCAATGAT G 1 1 1 Ul I U CTGTGATTCCA 
CA J AAGA J™ AGAA ^^ 

TTI 0 ^ ' ' ' ■ 1 G ^£L lJ 1 1 1 ' AAAACAATCAGCCTGCATTTATATAAjCTTTTATAAATAATAATAT 

AATTTGGGTCAAGTTAAGATATTAAAAGTTCCTTTT>G^ 

NNN 

>3 

>4 

>5

GACCATAGCGGCCTGCCAGCCAGGGGGGAGCGCCT AGGAGAGGGCCTCACGTGATGGTCA 
CCACT AAGAGGAGGGACAACGCCTGTTCCCGGAGAAAGCAGGCGCCCCAAAACGCTTCAGA 
CAGAACCATAGCCAGTAGACAGAGCTCTACTTGACTTGAGGATAGAGAGAAAGATGAAAAGT
CCAGAGCCGGACTGCATTGACGGAGACt^TCACCCCAGAGACACTAAGACCGGAGTTAAG^ 
TTCT AATGACTTCATTAAGTCTGGCCTTGAAAATGAGAGGTCT GGATGGCTTCTGATTTTCAG 

^ g .9 a £I a ^ tccc ^^^ gcctaccaag 

AGCATTQTGGGAGTGCCCTCTGTTAGAACAArTCrrGATC^^GCAGGGAGTAGTTTGCTfV-. 
CCAAGTCTTCAGC I 1 1 J CCATGTCCACACACAGACAATCCATGAACTATGTTTCCT AAAGCCA 
GGGCAAAACCTGAAGTCTGCTGGCTATTCTCCACTAACAGTCGAAGCTTGTTCATAACATCC 
TCAGCCTCTGTAGCCTCAATAATTCC^GCACTGAACGCTGATGTACATACACAGCTAAGTGT 
GT AGGCAAGGACCTCCT GAAACGTTCTGCTCTGGCTCCCGCT GTCATCTACGTGCGCAGAC 
AGCTTCCGGAGCAATGCTGCTACGTGAACGCGGGACTGCATTTGGCTGCTGTGGCTCATGA 
GGGACAGAACAAGTCCAACTCXCAATATACAGCCCG TGTTGTATTCAAGACT AGTGTCAAAG 
CAGCAATTTTCCAGGGCATCCAACGACTTCATC^GAAGAAGGTTCATCTCTTTK^ 

TTTGCACGGCTTGAGACrCATCAGCACTTGGTTTCCCAGGTAACCTG^ 
AGGATTTCCTCAACCTTCTCTTTGCAAGAGATAATGAAAACTGGCACAAGG 

cgtgggggcagcagaacgggcaatggcactagctgtgttttcacca^aataggacttataa 
taaaaccaggagagaagttgccctctgggttggt aatggct atccagaatgaccaagagtgt 

cgtcagagtctgaggagaggctggcttcatgtctagatacgacgacagcaaggctgcttag 
cagtcagcatgtctct aacccagagccaggctgtgc i 1 1 1 1 1 1 g tactgcaixtcctrarttrt 

I-™" 1 • CCATGTTTTAACTG CAACTCCAGCTCTCCTAGTCTTCCCTGT AAAATGGCATGAT 
AAGCT CGATTCATGTATGCAAGCCAGGCCTGTG GAAGAAAAATTGCACGGTGCCACTCTGAA 
AGCTGGATCAGACN 
*« 

NCGTCCGTCACGCGTCCGCCCACGCGTCCGGCGGATGGATCGCrrGAGGTGAGGC 
TGGTCTCAAACTCCTGACCTCAAGTGATCTGCCCGCCTCGGCCTCC^ 
TAGCTGTGAGCCACTGCACCCTCCCGGAAATGCTT7TT AACGGTCACCTCCTAGGGGAGAT 
GAAGAGAGTTACCGCATAATCAGTTTTAACTGT ATTTGTAATG I rTSTTTCTI I A CCTGGGAG 
GATATGTTGTAGATACGTGAGATAGAATACCT AAGTGACACAAT6GTGT GCAGGCT ATGGCT 
GCAAAGTAGAAAGTTTCTAAATAGCATGCTTGGATTTTTATTTGA^ 

TOCTAQAGAAATTCACAAAATGTTCCT AGTTTCAGATAAAAGGTTATtlATTTAArjkTrrTTTf^frT 
ATGACCATCTT ACATT ATTTTT ATGGGTGGGGGGCATTGACTGTGGAATGTGGGCAGTAACT 
AGGTCTCCACCTgAO^ 

GGGCACTGGCTGTCGATpGCTTT^ 1 1 ■ I i Ll T mr n :i rGGCCAq _ f f f l^ tViAAAi'lGTCA 
2SSSiSS^I^3i22£*^I£!Jl 1 U ' 1 GGGACTCAATGCCTTGGGCTrGGCATTGGGTAA 
AGjCCAACJGGCCAu 1 1 1 ^TTCTGACGAGCTXrrATAGTAGTCCGGTGTGGACCTCTGCOCTC 
CCTGCTCTGCGGAAGCTT^ 

GGTACCCAGCGTCTCAGGGTTTTGGCTGCCCACCTCT 

^^^IS^^^JS^^^^^^^^^^ AGTCACTGCA 1 1 II 1 1 1 1 1 M I IAGTAGAGACC 
AAGTCTTGCTGTGT I G I IX2AGOCTGGATTCAAACTCCTGAGCTGAAGGAATOCN 

W ^^!5 G S? GTGGGAGGACGTO0GGCT 

^ooggccggaagagcaagcgagatgaaggtg^ 
atgtccg ^gaaaccaagttxxxactta^ 

ccttttgaggtcccacgag^atatataagagcttt aaatgctaccaaactggaaogagtattt 
gcaaaaocattccrtgcttcgctggatggt^ 

TCCAGAGAAGCTGGCT ACTGT CCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAAT 
CTAACTCAGCGG^TT CTATOCCTACW 

TACTOGCTTTTGTGGGACI III 1 1 1 1 lU^CTGTTGGTGATGACAAAACTGTGAAGCAGTGGAA 
AATJGGATGGGCCAGGCTATCGAGACGAGXi^^ 

G I G I A Z A £I!^r^raTc^ 
agaca it33 ggatgaac ^^ 

AAGTACTGTTAAATTTAAOCCAATTGAGACATTTCTCTTGGGAA^ 
I A I^Z^ G J A ^ TAT ^ GG ^ GCTACTCCT7 I^

AAATACA ATCTGT TGGAACCCT ATG GAAGCTTT CATTTTTACAGCAGCAAATG AAGATTATAA 
CTTATATACTTTTGATATGCGTGCACTGGACACTCCTGTAATGGTCCATATGGATCATGTATC 
TGCAGTGCTTGA TGTG GATTACT CTCCCACTGGGAAGGAGTTTGTGTCTGCTAGTTTCGATA 
AA J C ™™^ G ^ T CTTTCCT^ 

TOCAACATGJTATCTGTGTAAAATGGACTTCTGAC^GCAAGTATATTATGTGTGGATCTG^ 
A ^T^^T7^ C 9J GTGaAAAG CT AATG<rrTCTG^AAAATTG G GT GTG CTT ACATCACXiA 

g^aaaagcagccaagga™ 

AAAG ^ J A ^ AGG ^ GG ^ G ^T G GAGATCTAGCAAAATCT ATCTATAG CCAGATTCAGGAACAG OG 



CTGTGCCACTTGTGTCAGAGAAGAAGAAACACGTAGTGGCAGTTGTAAAATAATTGGTA 
CTAACAATCCTGATGTATAATTATTTGTTACTTTTGATTTGAGAACTCT ACAAATAAAAGTGCT 

6 £ GA Si T ^J7 AA JT G £^ 

TACCfTTGAAAAATGATCCTT AAAG GTG G CCT AG TT GGTAAGACTGTTTT AT CCTT AATCTGCA 
T r C ^ CT TTCATTGT ^g^ATACAGTATTTGCAACTCAl I UI \CH GTTTTTATTACAGATATAC 
n AC T TTCTCT TTGAT CT ATTATTGTAGACACT ATACATTCAAATTGACATTTAAGACCAAACAT 
CTCTTATGTTATCrrrAATATTACTT rGA^ 

CA J£ A 5 A ni AGAA i 

TTTGGCnTTTGACTCTTTT AAAACAATCAGCCTGCATTTATATAACTTTT ATAAATAATAATAT 
AATTTGGGTCAAGTTAAGATATTAAAAGTI CC I II C*GC*MHHNHHNKHNHHNHHNHNNNNM 
>6 



GTOCGGOCGGAAGAG<^ACCGAGATGA 
ATGTCOGCGAAACCAAGTTGGACTTACAGAO 

GCAAAAGGATrCtnTGCTTCGCTG^TGGTCACCGTGATGG 

TCCAGAGAAGCTGGCT ACTG rCCTTI CTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAAT 

AAC 7 CAG g GG ^^ G ^AT(XGTACAATAC^^ 
TA ^ C 5 5C J^ G J GGG AC' ICI 1 1 1 1 1 CACTGTTGGTGATGACAAAACT GT GAAGCAGTGGAA 
AATGGATT^GCCAGGCTATGGAGACGAGGAAGAGCCATTACATACAATATTAGGA 

G I G T A I A 5I GGGA I TGAT ^ 
AQA ^IH G j^£ A ^ 

AAGTAGTGTT AAATTT AACCCAATTGAGACATTTCTCTT GGGAAGTTGTGCATCTGACAGGAA 

T A T AG I A ^ G I A £ GATATGAG <sc^^ 

^TIJ^JJ^J. ' ^^T^TGCGTGCACTGGACACTCCTGT AATGGTCCATATGGATCATGTATC 

aa j^ a ™^tct3^^ 

AA ^I aAA ^T3S Ga ^l? I GGAAAGCTAATGCTTCTGAAAAATTGGGTGT GC I f ACATCACGA 

AAACGTATAGCTDGTCATOiACATCTACCAAAATCT 
CWTCATGAAAGAAGC^ 

? GaA ^^J7 AA J! GC Ji A ^T nAfi ^ ATA1 ^^ 

3Z C !IgIII G ^ GTAG ^ TAC ^^ A 1 1 1 1 M U II G TTTTTATTACAGATATAC 

TTACTrTCTCTrTGATCTATTATTGTAGA 

CTCTTATGTTATCTTT AATATTACTTTGAATAATGATTGCAATGATG ITTCIT OC TS I G ATTCCA 
CATAACATTTAGAAT AATGA TGTCAATTrTTTACA^ 

ITI GG ^^ C ^ C T!77 AAAACA ^ 

. ^fJ/TSGGTCAAGTTAAGATArr 
AAATAW^AGAAAAACAATTTTCTTATAAAA
GCmAAMTTCAGAAATATGCCCTAAAACCATAAGQAAAAAG^ 
ATCACAGCAATTAQ GCCGTTC TATTCAATTTTGCCATGAGCTAAAAATCA^ 
GTAAAT X ACGC 5 CTGI 1 1 1 1 ' ATTCTTAAGCACTAGGGTTAGGATTGTGATCTGAGCTTTACT 
AAATCGG AAAAGAAAATCT CAATTATAGAACATTTAGTTT ATTTATACCTTAATGCCCGGAGA 

I^I°I G J TCTGATCC ^ GAA ^ TGT(MG< 3C 

G ^ AG ^ G ^ GAAAAGAA ^ GAG ^ 

AGT AAGAG AATTAGTGAAACAGT AGGAAAAAGGTG ATCT GGGGTTCGAAAAG CTAGGGCAA 
NNNNNN 

>11

>12
>13 

AACAGTAGAATCGT ACCTCGGCGGCCACCCGGGGGCGCACAACG Wfl^NNN ^^^^^^^ 
r*~„ * *^9£AGCCTG^ 

CG ^ AGGCC i^ c TI^^ 
GATGATGCGA TTI GGT ^ 

T^^5^^ GOTCCCTAG ATGACCT GGTGAAGGTCT GGAAATGGCGTGATGAGAGGC7GGACCT 
ACAGTGGAGTCTGGAGGGACATCAGCTGGGAGTGGTGTCTGTGGACATCAGC(^CAGCCT 
GCCCATTGCTGCATCCAGCTCTCTTGATGCTCATAT^ 

7G ?? A ^ A ^ AG £ C ^^ 
G /J A i A 5 T 7 CT £ CAT j^? CCTGG ^ 

GACTCCCAGCTCCTTGTCACTGCTTCAGATGATGGCTACATCAAGATCTATOAT 
G^TTTGGCTGGCACGCTGAGCGGCCATGCCT 

II G ^ G ^ G i^?^ TTGATGGGAAAA ^ CCG ^ 

TGAAATTGGGGTATAAATGTGGGGGTGTTTCAATATANN wuuwvwuwv* 
>15 

™ * * - J!* AGA £ AA1 II GGAAC ^^ 

TG ^ G £ AAGGAC 5 TAGAAGA ^^ 

CTGTCGCCTAGGAAAAATAAAAAACCTGGGCAAAACAGGGGACCA^ 

GTC ™^S^ ATAAAGAAGG ^ 

G ^^J5X^^^^^^^^^ GACGAGTGG A * ATGACT GTTTCCAAGTTCATGGCAATTCAGAAG 
AGATGTGATGGAAGAGAGGCTGAGGTCAGCACGAGAGACAGCCAAGCGGCCTGTGGGTGG 
>16

A ^ GAA -!^ GGTGAAGGTCTG ^^ 

>17 NNNNNNNNN 
>18 

AACAGTAGAATCGTACCTCGGCCGCCACCCGGGGGCGGACAACGWNNNNNN 

AC jl' ' ' 1 " 1 1 I I M I M I » A I 1 II I HI H I U 1 1 1 1 Hi H 1 1 1 CCCCCGGGAGAGGA 
^[^G^AAGAGCAAATTGCTGCTG AAAATTTCT ACATTGATCOVGACAAi^A>^TTAGAG(^ 
GGCTGAAAAAGAACCCTTGGTGTTTTTACTGCTGTTCAACCAGATCAAC 
^ CC ^ T J AG ^ CTGTGCTCTGTGG ^ 

cc T™ c ^ gg / :c I gaaaggggagctcatmtgact g 

>20 

*^^^ CCGAGGCACCACAA " mT TT^ 

^^^Z^^^^^^^^^^^ GAGA ^ GG ^ G ^^TAAGAATTTG GTTCACCAGGACACAGCTn 
GCAGCAGGCTTGGGGAATGCTTCCAAAAGGCTATGCCATTCAAGGmCAfi^^ 

A^TAQAATCGTACCTCGGCCGCCACCCGGGgSg^^ 

>23

CGCGGTGGCGGCCGAGGTACAGAGTAGAGAGAGTTCTGCAGGGATGAAGTGGGAG 
ACG IX™^ G ^ ACC ^ GACCAGACCAGGCC ^

ccaagtgcaacaggtaactgctggagggaattcagcaagagagtgacaggagctgattgac 

>24

„ NNNNJ^NNNNNCGCGCCCX^GTTGCG 

G * GGA ^ AGGGG C^GCGAGCCAGGGGGGAGCGCCTACGAGAGGGCCTCACGTGATGGTCA 

CCACT aag^ggaggcacaacgcctgttcccgcagaaagcaggcgccccaaaacgcttcaga 
cagaaccatagccagtagacagagctctacttgacttgaggatagagagaaagatgaaaagt 

ttgtaatgacttcattaagtctggccttga*^ 
agcattgtgggagjgccctctgttagaacaattctgatcx^ 

CCAAGTCTTCAGCTT7T CX^TGTCCAGACACAGACAATCCATGAACTATGTTTCCTAAAGCCA 
GGGC^WVACCTGAAGTCTGCTGGCTATTCTCX^CTAACAGTCGAAGCTTGTTCATAACATCC 

I^5 C ^ TCTGTAGCCTCAATAA ^ CCAGCACTGAACGC 

^Z^^^Sii^^^ G *^V : ^^ C ^^ < ^ AAAG;G ttctgctt cttg gctoccgct g tcatct acg tgcggagvac 

AGCTTCC»GAGC^TGCrrGCTACGTGAACGCGGGACTGCATTTGGCTGCTGTGGCTCATGA 

G . GG ^ G £ ACA ^Ti^. c ^ 

ATCAC J G ^III CTCTT CACACAAGGGAG ^G AGAAACATCCCTAAAGCAAGGCCCATGTGGA 
TTTGCACGGCTTGAGACT CATCAGCACTTGGTTTCCCAGGTAACCTGGCAGTCAGGATGTTC 
A S GA Il T i5 ;TCAACCTT CTC ^ GC AAGAGATAATGAAAACTGGCACAAGGAGAGACAAAGC 
CGTGGCGGCAGCAGAACGGGCAATGGCACTAGCTGTGTTTTCACCAGAATAGGACTTATA^ 
TAAAACCAG<^G^GAAGTTGCCCTCTGGGTTGGTAATGGCTATGCACAATGACCAAGAGTGT 

CGTCAGAGTCTGAGGAGAGGCTGGGTTCATGTCTAGATACG^ 

AGC I AAC ^9 GCA JI GCC ^ CACTAC ^ 

CAGTiCAGaATGTCTCTAACCCAGAGCCAGGCTGTGC I ITTTl I GTACTGCACCTCCTCAGGT 
TCTTCTTTTCCATGTTTT AACT GCAACTCCAGCTCTCCTAGTCTTCCCTGTAAAATGGCATGAT 
J^C^TTCATCTATGCAAGCCA^ 

>25 

TP^^^ ^^P^y^^ G GCAGUSTACGCGGGAGG<^CXTTCTTTTCT ACGTQAAGAGTT 
TTCTAAACTGAACTTTGTTTTCAGTTCCGGCTCCAGCCATC^ 

G £T CGCAC 7 CG ™ GACC ™ T ^^ 

TG ^7 G y ccAccATAccAccTGAGAGT ^ 

G^CATTrrCCTGTCTTrATTATGATAGTCTAAC 1 1 1 1 1 C > C CTCAAAGGTATAGCTGCCTTG 

C jn CATG ^ AA 4 A ^^ 

G ^ ATGACAGAT ™™ TTW ^ 

07 A I AC 5^ C / GACCAC TC^^TGTAATATrTTGGCAAGGTG^ 

AAATCTCAGCATAAGCCAGGCATGGTGGTACACAGCTGTATTCTCAGCT 

GC^GAGTCCAGGAAATCAAGACC^GCCTGGGTAACATA^ 

NCGCCTGTGGGAGGACGTCCGGGTGGGCGGAACTCCTAGCGG^ 
GTCCGGCCG^GAAGAGCAACCGAGATGAAGGT GAAGATGCT^QAG^^GGAATCOGGAC>UaT 
ATGTCCGOiAA^ 

CCTTTTGAGGTCOCACGAGAATATATAAGAGCTTT AAATGCT ACCAAACTGGAAGGJU5TATTT 
GCAAAACCATTCCTTGCTTTXKrrGGATGGTCAC^ 

TCCAGAGAAGCTGGCT ACTGTCCTTT CTGGGGCGTGTGATGGAGAGGTT AGAATTTGGAAT 
TACTCGCI 1 1 IGIGGGACI 1 01 J I I I I CACTGTTGGTGATGAGAAAACTGTGAAGCAGTGGAA 
A ^ TG ^I GGG i^ QGTATGGAGAC ^ 

GTGTATACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTTGCCA 
TACCCreAAAAATGATCCTTAAAGGTGGCCTAGTTGGTAAGACTGTTTTATCCT^
SS^tt3Xt«IJ^^^^ AA ^ A ^ AG ^ A ^^ G ^^^^^^^^GTTGTTTTTATTACA^TATAC 



r T rv^? CG ^^ G ^ GG ^ TCCGGGTGGGCGGAACTOT 
ATGTCCGGGAAACj^G^^ 

C J AACTCAGC g^? G T A y GGG TACA^ 

I A 5J CG ^^ G J G J G ^ AGT Ji' ' 1 1 " I ^ACTGTTGGTGATGACAAAACTGTGAAGCAGTGGAA 
? TG ^ ATAC ^^ G ^ T 1 GATCATCACTGGAAAGAA ^ 

AA ^ TAGTG JZ A ^^I TT AACCCAA ^ GAG ^^^CTCTTGGGAAG7TGTGCATCTGACy«3GAA 

A 4 A I ACAATC Z G J^ GGAACCCTATGGA ^ 

TTCTTCTTrcAT^AG^TACAGTATTTGC^ 
JTACmCTCrnGATCTATTATTCTAGA^ 
: CTCTrATGTTATCHTTAATATTACTTTGAATAATGATTGCAA^TG^C^ 

TTTGGCTTTTreACTCTTTrAAAACAATC^^ 
AATTTGGGTCAAGTTAAGATATTAAAAGTTCX7rTTCAGCANNNNNNNNNNNNNNNNNNNNNN

NNN 

>28 

>29 

>30 

>31 

>32 



gcag ttcatatcatgatgttactttgattctctgaccaaactggcctgtgagcaccctgggc 

CTTTCTTCCTCTGTCAAAGGOCTTAAGACAGGTTTACCCTGTAGCCAG GCTCTGGAAGAC AG 
AGCTGGGTTAAAGCTGGGTGGGAGAAGTGAAAAAGGTCAGGTTTACATTCCT ACGCGG AAA 
AGGATGTAACACGGGGCCACATCCTATGCCCAATCCCAAGGCAGGGAGGCAGGGAAGTGG 
CTGCCAAACCTGTTGTAGGAGAGTAATAAATGACTTGAGAGTAAGCCTAAGCAAACTCAAGT 
GGGAAGGGGAGTGGGCTGTAAAATAGTTTAAGAGACTCTCTCAGGAAGTCAGCGTAATTGA 
TGTGTAGAAAGGTAACAGTCAACAGTTCTCCTAACAAGACAGCTTCAAAGCAGCAGCTATAG 
TGGAGCATTCCTGAGGCCTGCTGCAGATCAAAGCATGAATGTGCAGACTGGTCCTCTTGCC 

cagcg 1 1 1 ci 1 1 caaatcttt g gacatgtt at atttt agag gcaag ttcagttct agaggagc 
tggo:tgccc^cagactcaccgctcagtcxxaggctgaccttggtgcccagaactcagga 
GTTTQTI I ACCCT aactttgtttataaagtccagaatggggccgggaatggtggctcacacc 
tgcagtcccagcactttggggaagct 

>33 

cgcggtggcggcggaggtaggtatgcacttgcttgccatctaagcagggacaatg 
gcagttca tatcatgatgttactttgattctctgaccaaactggccrrgtgagcaccctgggc 

CTTTCTTCCTCTGTCAAAGGCCTT AAGACAGGTTTACCCTGTAGCCAGGCTCTGGAAGACAG 
AGCTGGGTT AAAGCTGGGTGGGAGAAGTGAAAAAGGTCAGGTTTACATTCCTACGCGGAAA 
AGGATGTAACACGGGGCCACATCCTATGCCCAATCCCAAGGCAGGGAGGCAGGGAAGTGG 
CTGCCAAACCTGTTGTAGGAGAGTAATAAATGACTTX1AGAGTAAGCCTAAGCAAACTCAAGT 
GGGAAGGGGAGTGGGCTGTAAAATAGTTTAAGAGACTCTCTCAGGAAGTCAGCGTAATTGA 
TGTGTAGAAAGGTAAGAGTCAACAGTTCTCCTAACAAG^CAGCTTCAA^ 
TGGAGCATTCCTGAGGOCTGCTGCAGATCAAAG CAT GAATGT GCAGACTG GTCCTCTTGCC 
CAGCG I MCI I \ CAAATCTTTGCACATGTT AT ATTTT AGAGGCAAGTTCAGTTCT AGAGGAGC 
TGGCCTGCCCCACAGACTGACCCCTCAGTCCCAGGCTGACCT^ 

GM Itil I IACCCTAAU I IGI I 1 ATAAAGTCCAGAATGGGGCCGGGAATGGTGGCTCACACC 

TGCAGTCCCAGCACTTTGGGGAAGCT 

>34 

ACCAGTT AAAGTCTTCT AGCCTGTATCCCCA C ' I CC 1 1 1 1 1 UCCACTTGCAAATTCGGT 
AGCCCAGTTACCCAGAGGGAGGCATAGGAGGGAAAACGAAGACTGA^ m. 

?JTII53H!£F AC ^^ An ^^ T ^ 

ACATTCTATCCAAATAGGATGCCCTTCTGTGGAACTGCATATTTGGAAAC^ 
TAACTTATGCACTCCACTGGGAACTTAC^ 

GTTACAACCGCTGGGTGTGTATTGGTTCAAAAGGACCTACCT ACAAGGTTATATCAATCCATT 

GTCCAATTTGAGAGAI 1 1 1 1 1 CTGAATCCAGTTAAAAT AATTTTTGGCTACACCTGGGGACAC 

TTCOCAGG^CAACAATGAXrTTGTAGTCTAGTGCCCAA 

TT^TTGOCAOCAGATOCC^^ 

ACTCACAGTGTGTCTGCATTTTTCCTTCATAATTAT^ 

ATCCTGATGTTAGAGA F 1 1 ICt I T CCAAGGCAAATAGAACTTTCACCNNNNN 

ncgcctgtgggaggacgtccgggtgggcggaactcct AGCGGACACCTCGTGGA 

GTCCGGCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGAGCCGGA^ 

atgtccgcgaaaccaagttggacttacagagagttccaag^aaact 

CCTTTrGAGGTCCCACG^^TA^ 

TCCAGAGAAGCTGGCTACTGTCXrrTTCrrG<iGGCGTGTGATGGAG 
CTAACTC^GCGGAATTGTATCCGT^ 

TACICGC1 1 1 I G 1GGGACI rCTTTTI I CACTGTTGGTG ATGACAAAACTGTGA^GCA^GGAA 
AATGGATGGGCCAGGCTATGGAGACGAGGAAGAGCCATTACATACAATATTAGGAAAGACA
GTGTAT ACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTTGCCACATGT GGACAGCAAGT 
AGACATTTG GGATGAACAAAGAACTAATCCTAT ATGTTCAATGACCT G GGGATTTGACAGT at 
AAGTAGTGTTAAATTTAACCCAATTGAGACATTTCTCrrGGGAAGTTGTGCATCrr 
TATAGTACTGTACGATATGAGGCAAGCTACTCCTTTGAAAAAGGTTATCTTAGATATGAGAAC 
AAATACAATCTGTTGGAACCCTATGGAAGCTTT<>TTTTTA 

CTT AT AT Au J 1 1 JGATATGCGTGCACTGGACACTCCTGTAATGGTCCATATGGATCATGTATC 

tgcagtgcttgatgtggattactctcccactgggaaggagtttgtgtctgctagtttcgata 
aatct attcgaatcttto^gtagacaaaagtcgaagc^gggaggtatatcataqaaagag^a 
tgcaa(utgttatctgtgtaaaatggacttctgacagcaagtatattatgtgtggatctgatg 
aaatgaacattcgcctgtgg aaagctaatgcttctgaaaaattgggtgtgctt acat c acg a 

GAAAAAGCAGCCAAGGATT AT AACCAGAAATTGAAG GAGAAATTTCAGCATTATCCTCAT ATA 
AAACGTATAGCTCGTCATCGAC^TCTACCAAAATCTATCTATAGCCAG^TTCAGGA^ 



ctgtgccacttgtgtcagagaagaagaaacacgtagtggcagttgtaaaataattggtattc 

CTAACAATCCT GATGTATAATTATTTGTTACTTTTGATTTGAGAACTCT ACAAATAAAAGTGCT 
GGGACT AGATTAATTGCAAACATTTTAGTTATATGTGTAGAGCTTT ATTGTTACTCCTTTTAGC 
taccctgaaaaatgatccttaaaggtggcct AGTTGGTAAGACTGTTTTATCCTT AATCTGCA 
ttc tt ct ttcattgtagaatacagtatttgcaactcattttttcttg tttttattacagatatac 

TTACTTTCTCTTTGATCTATrATTCTAGACA^ 

CTTCTT ATGTT ATCTTT AAT ATT ACTTTGAAT AATGATTG CAATGATGTTTCTT CCTGTGATTCCA 
CAT AACATTTAGAATAAT^TGTCAATTrTTTACAACTGA^ 

TTTGGCTTTTTGACT CTTTTAAAACAATCAGCCTGGATTTATATAACTTTT ATAAATAATAATAT 

AATTTGGGTCAAGTTAAGATATTAAAAGTTCCTTTCA^ 

NNN 

>36 

GGTACATTTGTGTTTTATTGTGAAGGGTCCTCAACTGTGTGGCT 
CCCACTGCAATGTATGGAGAGGAGAGAAAGGGATGAAAGTGAAGGCAGGGGGGGGGATGT 
TTGTTTCACGGGGTGAACTTCTGCCTGAGCAAG 

iui i I CAATACA 1 1 1 1 1ATTTAGCACTTATTCTGTGTCTGCTGCCCTGGGATACCAGAGTG 
AATAAACAGA T AAAAA GTCCCTGACCTTCTGGAGCATACAGTCT CT GGAGAGAAAATGGAT A 
GATAAGCAGACTTTTATATATGAATAT 
>37 

, , GTG GCGGCCGCCCG^CAG^^ 

CTTCGTGCGACCCG CQTCC GAGCTCAGACGAGCTCCCTGGAGACCCCT C 1 I CACAAGAAGA 

CCGATTATGATGATCTTGAGCCTGAATGGCTGGACAGTGTGCAGAAAAA 

T GTCAGGTTCAGTGAAAATGAGATTATCATTGAAGAT GACT ACCAAGAAAGAAAAGAGT ATGA 
ACCCAAACT CAAGCAGTTTACGAAAATTTTAACAAGGAAAAGACTTTT ACCCAAGCNN 

>38 

COGAGGT ACTTA AG 1 1 UICI I CAGTTACAGCT ACCACTGTGAAAATAATTCTCTGCT 

TAT CAAGTT TACAACTTTAGAATTTCCTGTTTTAAAGTTTT CT catttacttatcacacaqtcat 
crrrcTTTTT GCCAAACGCT AT AGT AG cacattaaaaggagact gatgtg gaatcaactctgtg 

CAAAAAGTArrGGGTGCTTTGGTAGAAGTCTATACAGAAGACACTGGA 

ttttgtcc^ggtgagttgatgtc^gaaaaggcttaataatggagatgaggccgggcatggt 
ggttcacacctgt AATCCCACCTGI 1 1 GGGAGGCTGAGGCAGGTAGATCACTTGAGACCAG 

JJ*** GAOCAfi OCCAGCC^ 
AAANNNNNNN 
>39 

AGCAGAGCCCCGGGAATTCAACCTTTAAGTGGATAACTTGGCTT^ 
AGG£CATCAAAGAG^ 

J^IS2I r ? AAA ™ TCCACAAOTA ^^ 

GCATCGTCACTGACCAATCAGAAGAGATGCCAGTAGTTGGGCGCAGTGGCAGCACT^ 
AGGCTGAGTGGACAGATCACCTGGGGTCAGGAGTTCGAGACCAGCCTGGCCTACATGGTG

aaaccccatctctactaaaaatacaaaaatgagccaggcatggggggcacctgtaatcccag 

CTACTTGACAGAGTGAGCCT ctgtctcaaaaaaaaaaaaaaaaaaaaaann 

>40 

nncacgcgtccggctaatgaatctt ggggccggtgtcgggccggggcggcttgat 
cg^caactaggaaaccccaggcgcmgaggccaggagcgagggcagcgaggatcag 
c^^ccttcccggctgcxx^cgctcctcggaggtcagggcagatgaggaacato 
ccccttcggaggaggaaggaagtcccgctgocaccttatctctgctcct 
tgttcccagagctttttctctagagaagattttgaaggcggcrrtttgt^ 

ACCATC^TCTAAAG^AGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGC^GAATAATTG 
CAGAAAATCTTCAAAGGACCCT ATCTGCAGATGTTCTXaAAT ACCTCT GAGAAT AGAGATTGAT 



TTTTGCAAC^TTGGACCAAATACAATG AAGTATTCTTGCTGTGCT CTGGTTTTGGCTGTCCTG 
GGCACAGAATTGCT GGGAAGCCTCTGTTCGACT GTCAGATCCCCGAGGTTCAGAGGACGGA 
TACAGCAGGAACCAAAAAAGATCCGACCCAAGATTATTCTTGTGCTTACCGATGATCAAGAT 
GTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGG 

ccaccttcatcaatgcctttgtgactacacccatgtgctgcccgtcacggtcctc 
accgggaagtatgtgcacaatcacaatgtctacaccaacaacgagaactgctcttccc 
gtggcaggccatgcatgagcctcggacrrttgctgtatatctt aacaacactggct acagaa 

CAGCCTTTTTTGGAAAAT ACCT CAATGAAT AT aatggcagctacatcccccctgggtggcga 
GAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACT GTTTGTCG CAATGGCATC 

AAAGAAAA j^ TGGA ^ GA ™ T G<^^G^ 

^^^^^^^^^^GTCTAAGAGAATQTAJCCCCAT^GGOCOGTTATGATQGTGATCAGC 
CACGCTGCGCCCCACGGCCCCGAGGACTCAGCC<X^ 

CTTCCCAACACATAACTCCTAGTT ATAACT ATGCACCAAATATGGATAAACACTGGATTATGC 
AGJACACAGGACCAATGCTGCCCATCCACATGGA^ 

CTCCAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGAC 

S*^ ga <^ g < 5a< ^ ta ctta^ 
Il G £^ GG IS AA 5 3GGGA ^^ 

GTGCTCCAAGTCTAGAACCAGGATCAATAGTCCCACAGATCGT^ 

CCCACGATCCT GGATATTGCTGGGCTrcGACACACCTCCTGATGTGGACGGCAAGTCTGTCC 
JCAAA£TTCTG<iACCCAGAAAAGCCAGGTAACAGGT^ 

TTJG^XrTGATACATTCCTAGTGGAAAGAGGCAAATTTCT ACGT AAGAAGGAAGAATCCAGCAA 
GA^TATCXiAACAGTCAAATCACTTGCCCAAATATGAACGGGTCAAA 
CCAOTAOCAGACAGCCTGTGAACAACCGGGG^ 
TGGCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGACXn^GCT 

AjCGCGGXACCTCT ACGCTCG<^GCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGT 

CTGGTTACCGTGCCAGCAGAAGCCAAAG^ 

GACTCCAAAGTACAAGCCCAGATTTGTCCATACT 

TTGAAGGTGAAATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGT^^ 

AGAAAGATTGCTAAGCGTCAT6ATGAAGGCCACAAGGGGCCAAGAGATCTCCA 

GJGGJGGCAACAGGGGCAGG^^ 

CTGTCCGAGTGACACACAAGTGTTTTATI C 1 1 CCCAATGACTCTATCCATTGTCAGAGAGAAC 
TGTACCAATOGGCCAGAGCtrTGGAAGGACCATAAGGCATAC^ 
TCTGCAAGATAAAATTAAGAATTTAAGAGAAGTGAGAGGACATCTGAAGAGAA 
AGGAATCrTAGCTGCAGTAAACAAAGCTATTACAA 

AAATT A AAGAGCC^TCTrCAOOCATTCAAGGAGGCTGCTCAGGAAGT AGATAGCAAACTGCA 

ACTTTTCAAGGAGAACAACCGTAGGAGGAAGAAGGAG^ 

GAAGGGGGAAGAGTGCAGOT 

CAGACAGCCCCGTTCTGGAACCTGGGATC iTTCTGTGCI I GCACGAGTTCT AACAAT AACAC 
CTACTGGTGTTTGCGTACAGTTAATGAGAOGCATA A I II i c ittt ctgtgagtttgctactgg 

AGAACGAGGCATTTTGAATCAGCT ACACGT acaact aatggagctcagaagctgtcaaggat - 

v AT m AA f* CAfi TBCAACCCAA^ 
GACCTACACAGAGGACAGTT ATGG<^TGGATGGGAAGGTT 
GACATCAACTGGCAAGGCCTAGAGGAG<TACACAGTGTGAATGAAAACAT^ 
ACAAAACTACAGACrTAGTCTGGTGGACTGGACTAATTACTTGAAGGATTTAGATAG^
TGCACTGCTGAAGAGTCACTATGAGCAAAAT AAAACAAATAAGACT CAAACTGCTCAAAGTG 
ACGGGTTCrTTGGTTGTCTCTGCTGAGCACGCTGT^^ 

AGATGAAGACCCAAGGCATAAGGTTGGGAAAACACCTCATTTGACCTT GCCAGCTGACCTTC 
AAACCCTGCATTTGAACCGACCAACATTAAGTCCAGAGAGTAAACTTG^ 
ATTCCAGAAGTTAATCATTTGAATTCTGAACACTGGAGAAAAACCGAAAAATC 
TGAAGAGACTAATCATCTGGAAACCGATTTCAGTGGCGATGG CATGACAG AGCTAGAGCTC 



GGTCCTGGAAAGGACATTTTTGAAGATCAACTATATCTTCCT GTGCATTCCGATGGAATTTCA 
GTTCATCAGATGTTCACCATGGCCACCGCAGAACACCXiAAGTAATTCCAG^ 
AGATGTT^ACCAAGGTGGAGAAGAATCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAG 
CGOCTCCTCTTCACTCTCCTCTG^TTAGAT 

i n i lAA CTi ii m ATTTGTAAACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCC 

tgggtacctttgtgcagtagaagctagtgagc>tgtgagcaagcxk;tgtgcacacggagac 
tca tcgtta t aattta ct atctgccaagagtaqaaagaaaggctg-gggatatttgggttggc 

TTGi* i i ■ i^Ai i i i i i GCTTGTTTGTTTGTTTTGTACTAAAACAGT ATTATCrrrTTGAATATCGT 
AGGGACATAAGT ATATACATGTTATCCAATCAAGATGGCT ATAATGGGCTTTCTCAGAGATAA 
AACTTGACCCCCG TGTCAAATT GACATCACACTCTG CAT G TCT GCGT AAT GAAGGT ACGAT G 
CAACTAT AACCAGTGCAAT ATGACACTGACACT AT ATT AAATTCAAT AAT ACNN 
>41 

ACACGTGCACATTGTGCAGGTTAGTT ACAT ATG TAT ACATGAGCCATGCTGGTGCGC 
TG CACCATGGCACATGCATATCT ATGTAACAAACTT GCATGTTCTGCACATGTATCACAGAAC 
TT AAAG TGT AA T AAAAAAAi^AAAGAAAAACAG CATGCAATT CAG CCCAGACAAAAAAAAGAAG 
TCAAAGAC AGOGAGAATTCTT AAAACAGCAAT AAAAAGT ATAAAGTCACTCTAAAGGAATCCC 
CGTTAGATTAACAACACATTTCTTAAGAGAAATCTAACAGGCCAGGAGAGAATGGGATGACA 
T ATTCAAAGTGTTAAAGGGGGGGAAAAAACT CCACTCAAGACTACACCCAGAAAAGCTATCT 
' TTCAGAAATGGAGATAAAAAC^TCTTTCCCAGACAAAGAAAAACT AAGAGAATTTACTACCAC 
TCACCAGCCTTACCAAAAAATGCCCAAGGGAG TCCT ACATCT AAAGCAAAACGACAATCATC 
ACGAAAACATGCAAAAGCATAAAACT AACTTGT 
>42 

NCGCCTGTGGGAGGACGTCCGGGTGGGCGGAACTCCT AGCGGACACCTCGTGGA 
GTCCGGCCG<iAAGAGGAACCGAGATGAAGGTGAAGATGC^ 

ATGTCC COGAAACCAAGTTGGACTT ACAGAGAGTTCCAAGAAACT ATGATC C I UC 1 1 1 A CAT 
Com I GAGGTCCCACCAGAATATATAAGAGCTTTAAATGCT ACCAAACTGGAACGAGTATTT 
GCAAAACGATTCC 1 1 GCTTCGCTGGATGGTCACCGTGATGGACT 

TCCAGAGAAGCTGGCTACTGTCCT 1 1 CT GGGGCGTGTGATGGAGAGGTTAGAATTTGGAAT 
CT AACT CAGGO GAATTGTAT CCGTACA ATACAAGCACATGAAGGCTTTGT ACGAGGAATATG 
TACTCGC » F 1 1 GTGGGACTTC IT III I CACTGTTGGTGATGACAAAACTGTGAAGCAGTGGAA 
AATGGATGGGCCAGGCTATGGAGAOGAGGAAGAGCCATTACATACAATAT^ 
GTGT ATACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTT GCCACATGTGGACAGCAAGT 
AGACATTTGGGATGAACAAAGAACTAATCCT AT ATGTT CAATGAGCT GGGGATTTGAGAGTAT 
AAGT AGTGTT AAATTTAACCCAATTGAGACA 1 1 ICTC1 I UGGAAGTTGTGCATCTGACAGGAA 
TAT AGT ACTGTACGATATGAGGCAAGCT ACTCCTTTGAAAAAGGTT ATCTTAG^ 
AAATACAATCTGTTGGAACCCTATGGAAGCTTTCATTTT^ 

TGCAGTGCTTGATGTT^ATTACTCTCCCACTGGGAAGGAG 1 1 n S TGTC l GCTAGTTTOGATA 

aatct attcgaatc rttcctgl agacaaaagtcgaagcagggaggtatatcatacaaagagaa 
tgcaacatgttatctgtgtaaaatggacttctgacagcaagt atattatgtgtggatctgatg 
aaatgaacattcgcci g i ggaaagctaatgcttctgaaaaattggg tgt gc 1 1 acatcacga 
gaaaaagcagccaag^ttataaccacaaat^^ 

^£^I^T5SS5IS5^ 5A 3F ACC ^ 

CATCATGAAAGAAGC » CG I CGACGAAAGGAAGTGAATCGTATTAAACACAGCAAGCCTGGAT * 

g^ c c A <^ GTG ? CA £^ ^ 

ctaacaatcctg^tgtatmttatttgttact^^ . 

GGGACT AGATT AATT^CAAACATTTTAGTT ATATGTGTAGAGCTTT ATTGTTACTCCTTTTAGC 
TACCCTGAAAAATGATCCTTAAAGGTGGCCTAGTTGGTAAGACTGTTT^ 
~^II£III^ GTAGAATACA ^
£ AG ITI C ^TI3 G ^ 

°J STT^T5II A I CTrT ^T^ACTT TGAATAA TGATTGCAATGATGTTrCTTCCTGTGATTCX^ 
CATAACATTTAGWA^TGATGTCA>VTTTTTTAC^ 

AA^GGGTCA^GTTA^GATATTAAAAGTTCCTTT 
>43 

G ^ c S G JS c ^ GAAGAGCAACraAGATGAAGGT ^ GAT ^^ 

A J G J^f*^£^GTOGAC™^ 

C T AAC I CAG ^^^ GTA T ^ T ^CA ATACAAGCACATGAAGGC^ 

AA I G ^7 GG i^ GGCTATGGAG ^^ 

GTGTATACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTrGf^AnATfftTf;nArA/a^AA^ T 

agaca Z^ g ^ gatgaacaaagaactaatcctatatg ^ caat g 

a ^ g ^ ag J^^ aaa J^^ aa gccaattgagacatttct cttggg aagttgtgcatctgacagg aa 

I A Ii 6 J^ G Z AC ^ TATGAGGCAAGCT ^ 
™ A ? AA ISI G J TGGAACCCTATGGAAGC ^ CA1 ^^ 

T GC ^7 G £ir GATGTGGA ^ ACTCTCCCACTG GGAAGGAG^^ 

GG / G ^^J7 AA J TGC j A ^ CAT ^ AG ^ ATATGTGTAG ^^ 

3T C TT C T ^ CATO TAGAATACAGTA ' mGCAACT C A> llllia I G TTTTT ATTATlAfi ATATAp 

CATAACATTTAGAATAATGATGTCAATTTTTTACAACTGAATTTAT^ 

NNN 
>44 
>45 
>46 
>47 

CTACAG ATTTAGGTGG AAGAGC AG CCATGTTTGAAGGGCACATGTG^A^GJ^TGG^^GGCAG 



Page 12 of 355



PCT/US00/33312



Table 4 



CCA 5 G J A 2 CAGACAGCCT ^ 

TZ G ^ G ^Z G ^ TATATGACATAAATCTGGAAGAA ^^ 

CAGACAGCCG(XrrrCTGGAACCTGGGATCTTTOG 

CTACTGGTGTTTGCGTACAGTTAATGAGAC^CATAA 

CTTTTTGGAGTATmGATATGAATAS^ 

AGAACGAGGCATTTTGAATCAGCrrACACG^ 

ATAAGCAGTGCAACCOWGACCTAAGAAT^^ 
gaca^c^actggca^ggccTa
acaaaactacagacttagtutggtggactggact 

I G ^?S C ^ GAGTCACTATGAGCAAAATAAAA CA^ 

AGA I™ GA ? C 5^ GGCATAAGG ^ 

GGGCCCAGCCCCAGGCTGCAGCCCATTCGCAGGC^VCCCG^^ 
GGTCCrGGAAAGGACATTTTTGAAGATCAAcI^ 

AG ^l™ G ^ CCAAGGTGGAGAAGAATCACGAAA ^ 

CG ^! CG I C II CACTCTCCTCTG ^^ 

J^^PIT^^GTAAACTAATA^GGTW^^ 

T GGG ^ C ^ G J^ CA J? TAGAAGCTAGTGAGCATG 

I CA I CG ™T AA IQ ACTATG TGCCAAGAGTAGAAAGA^ 

A j^^A C ^ AAAC ^ G ^ GGAC ^ ACAG ^ 

CCTTTT G^GTCCCACG AGA^TATATA^CUGCTTTAAATGCTAC^^ 

GCAAA ^ C ^XI^ GC ^ CGCTGGATGGTCACGGT 
Z° CAG ^ GAAG 5 rGGCTACTGTCC ^ CTGGG GCGTGTGATG 

C T# C I CAG ^ G ^^ 

TACTCGCTTTTGTGGGA C I I I I M I I I CACTGTTGGTGATGACAAAACTGTG^cSf^ftAA 

AATG^TGGGCCAGGCTATGGAGACGAG<^GAGCCAT^CAT^ 

?o G J A I ACTGGGA ™ TCATCACTGGAAAGAAG ^^ 

AGAC ^7T7 G ^7 G ^^ 

AAG 7 A ?7 G ^ AAA I™9 G ^ 

AAATG^TTCGC^GTGGAAAGCTAATGCT^T^^ 

gaaaaagcagccaaggattataaccagaaattgaaggagaaatttcag 

AMCGTATAGCTCGTCATCGACATCTACCAAAATrTA^^ 
CATC*TGAAAGAAGCTCGTCGACGAAAGGAAGTGAATCGTAT^^ 

G Z AAC i^l CCTGATGTATAA ^ A JT[ G ^ AC ^^ 
gggactagattaattgoaaacattttagttatatgtgtaSSc^ 

TACCCTGAA^TGATCCTTAAAGGTGGC^ 

2S55S C ^ G J AG ^ TACAGTA ^ GCAAC ^ 

TTACTTTCTCTTTGATCTATTATTGTAGACACTATACAT^ 

CA J AAC Jvni AGAA i A ^[ GA 

A^G<MTCAAGTTAAGATATTAAAAGTTCCT7TCAGCA^ 
T ~^ / ^^ GGGGGGA 9GTACCTCAGCATATATT

G ^ AGAGAG ^Z CA ^ C II! CCATCGACTC ^ 

^^^i^^^^f^^^^^^^ ^ GAAA ^ GGA ^^~^~^ ATTGTGCTGAAAGCTGCTTGTTGT&AT 

G ^I C Z C ^ AC ZII GTAG ^ G ^ CTATCTG <> GCACTGAC ^ 

G^TCTTTTCTCTATAGAAGGCTAACAATCTCTTCGTGTGTTmC^ 
GGGAGpTTTCCCTGGAGTACTTCTTACACAGACTGTCT ATGCT^A7SAAGVA*A^^0CTCG^^rA 

^^CG^^Sg^G^ 6 ^* 5 ^^^^^ 

>52

>53 

rTrv^^ C 5 C ^^55 AGGAC ^ TCCGGGTGGGCGGAA ^ 
TCCAGAG^^GGCTACTGTCCTr^^ 

c T^ c I c 4 GCG i^II GTATG 5GJ^^ 
I AC 7 G /! C II3T G J GQGA 5^ CTT ^^ 

G 7 G J A I AC JS G ^JT GATCATCACTGGAAAGAAGCTG: ^^ 
AGAC ^IJ^ GA 7 GAACAA ^^ 
AAG 7 AG 7 G II AAA J™ AAC ^^ 
A A I^i2^ c ^^I G ^ GGCAAGCTA ^^ 

AAATACWCTGTTGGAACCCTATGGAAGCTTTC^ 
I GCAG A G r?^A G AT G r^ G ^ A ^^ 

Ia? aa aa^a77 a Z c 7 g J3I aa ^ 

aaa t gaac a™ cgcc ^ gg *^ 

gaaaaagcagccaaggattataaccagaaattgaaggagaaam 

c ^i ca 7 gaaag ^ gc i c ?^ 

gg 5 ac j^j7 aa j rggaaacat ^ ag ^ 
ta^ctg>»j^^tccttaaaggtggcct 

tt c tt c 7^ ca ^ gtag ^ taca ^^ 

CTcStEStIJS^ 

St^ttTa^^ 

A^GGGTCAAGTTAAG^TATTAAAAGTTCCmC^GCA^ 
>54 

CCTGCOG^GgJ^^ 

S^^^Ta^t^ 
ggtttgaata^gtctcagatottg^t^^ 
tccatt aaatgct gctcgttgccaagatgaaaaggcacacctt gcaaccatgaaatcctttg
^tactcacaggagagtgaoccacaaaccaaatctgttgggttctaaatggtttataaaaatat 
taaagaggcatttctcatctgtatcaacggaaacatttgttccaaaacaagacttcccagagg 
tgaagagaccactaaaagcatocaggagcagacagccatccaggaccaacctt ccagttct 
gtctgtgaacgaggacctaatgcactgcacagcatttgcaacggcagatgagtatcatxtrg 

GGAAATCTGT CTCAA GATCTGGCCTCCCACGGATATGTTGAAGTAACAAGCTTGCCTAGAGA 
TGCAGCAAATATTTTGGTGATGGGTGTGGAAAATTCTGGAAAAGAAGGTGATCCTGGAACAA 
TATTCTTCJ ICAGGGAAGGAGCTGCTGTGTTTTGGAATGTGAAAGACAAAACTATGAAGCAT 

gtgatgaaagttctagaaaaacatgaaattcagccctatgaaatogcactggtacactggga 
aaatgaagaacttaactacataaaaatagagggacagtcaaaacttcacaggggggaaatca 
agtraaattcagagctggattragatgatgccattctagagaagtttgctttctccaatgctc 
tatgcctttctgtaaaactggcaatttgggaagcatcactggataaatttattgaatctattca 
gtcaattgctgaggct ttaa aagctgggaagaaagtgaaactatctcatgaagaagttatgc 

AGAAAATCGGTGAACTCTTT GCT CT AAGGCACCGTATAAACTTG AGTTCAGACTTCCTGA7T A 
CTCCTGATTTCT ACTGGGACAGAGAAAACCTGGAAGGAGTTTACGaTAAAACGTGTCAATTC 
CTT AGGATTGGCCGAAGAGTTAAGGTCATGAATGAAAAACTTCAGCACTGCATGGAACTAAC 
AGATCTAATGCGGAATCACCTGAATGAGAAGAGGGCACTCCGCTTGGAGTGGATGATTGTC 
ATCCTCATTACCATAGAGGTAATGTTTGAGCTGGGACGAGTATTTTTCTGATCAAGTGATAAC 
CAAAGTGTCACTGCAAGAGATATTCAAGTTCTACAATCAAAAATTAAATGTTCGGCX5CGGCG 
CGGTGCCTCATGCCTGTAATCCCAGCACTTTOGGAGGCCAAGAAGGGTGGCTTGAGATGAG 
ATCAGGAGCTCAAGACAAGCCTGGCGAACATGGTGAAACCCCATCTCT ACT AAAAATACGAA 
AATTAGCCAGGTGTGTTGGCACACGCCCGTCATCTCAGCT actcag gaggctgag gcagga 
gaatctcttgaacttgggaggcggaggttgcagtgagctaagatcacaccactgcactcca 
gccagggcaacagtgagactcagtctcaaaaataaacaataaaataaataaataaatgaatgt 

NHNHHNHNHNNNHHNH 



^KXXX^^GTGGGAGGACGTCCGGGTGGGCGGAACTCCTAG^^ACACCTCGTGGA 
GTCCGGCCGGAAGAGCAACCGAGATGAAGGTGWW^TGCTGAGCX»GAATCCGGACAATT 
ATGTCCGOGAAACCAAGTTGGACTTAC^ 

CCTTTTGAGGT CCCAGGAGAATATAT AAGAGCTTTAAATGCTACCAAACT GGAACGAGTATTT 
GCAAAACCATTOCTTGCTTCG CTGGAT GGTCACCGTGATGGAGTCAATTC 
TCGAGAGAAGCTCGCTACTGTCCTTTCTGGGGCGT GTGATGG AGAG GTT AGAATTTG GAAT 
CT AACTCAGCGGAATTGTATGCGTACAATACAAGCACATGAAGGCTTT GTACGAGGAATATG 
TACTCGC 1 1 1 1 GTGGGAC 1 1 Ci 1 1 1 1 T CAGTGTTGGTGATGACAAAACTGTGAAGCAGTGGAA 
^GGATGGGCCAGGCT ATGGAGACGAGGAAGAGCCATTACATACAATATTAGGAAAGACA 
GTGTATACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTTGC 
A^SACATTTGGGATGAAGAAAGAACTAATCCTATATGTTCAATGACCTGGGGA 
AAGT AGTGTJAAATTTAACCCAATTGAGACATTTCTCT^ 

T A T AG T AC T£I A ^ TATGAGG ^^ 

AAATACA ATXrreTT GGAAOCCTATGGAAGCTTTCATTTTT 
SH^TlT^iJLM f^^TGCGTGCACTGGACACTCXrrGTAATGGTCC^ 

y GGA P7P^^^^^^ t "^^ TTAG ^ GTCCCAGTGGGAAGGAGTTTGTGTCTGCTAGTTTCGATA 

TGCAACATGTT ATCTGTGTAAAATGGACTTCTGACAGCAAGT ATATTATGTGTGGATCTGATG 
AAATGAACATTCGGCTGTGGAAAGCTAATGCTTCTGAAAAATTGGGTGTGCT 
GAAAAAGCAGCCAAGGATTATAACCAGAAATTGAAGGAGAAATTTGAGCATTATCC^ 
AAAGGTATAGGTCGTCATCGACATCTACCAAAATCTATCTATAGCCAGATTCAGGAACAGCG 

CTGTGCCACTCTGTO^GA^ 

^ A ^^ 1 ^I GATGT ATAATT ATTTG TTACTTTTGATTTGAGAACTGTACAAATAAAAGTGCT 
^^^Cf^GMTAATf^CMMjAJTrTMS^AJ^QTQ^AB^GCTTT ATTOTT ACTCCTTTTAGC 
JACCCTGAA^TGATCCTTAAA^ 

^ OT gTrCATTG TA<^TACAflTATT7GCAACTC AI 1 1 1 l id > GTTTTTATTACARATATAf! 
TTA CTI I CI C1 I ITQATCTATTATTGTAGACACT ATACATTCAAATTGACATTTAAGACOCAACATr 
CTCTTATGTTAICI 1 1 AATATTACTTTGAATAATGATTGCAATGATGTTTCTTGCTGTG ATTCCA 
CATAA CATTTA GAATAATGATGTCAATTTTITAGAACTGAATTTATTTCTAGTGCTTTACTTATA
TTTGGCTTTTTGACTCTTTT AAAACAATCAG CC TGCATTTAT AT AACTTTT AT AAAT AAT AAT AT 
A^TTTGGGTCAAGTTAAGATATTAAAAGTTC^rrrTCAGCANNN^ 

>57
>S6 

NNNNKNhlNNNAGGCGGCCGCCCGGGCAGGTACGXXSGGCTATTGTGATTCCCAGTG 
ACCCATAGAACAGGATTTCACTAGTCCTATGACATGTGACTGGGCTTGGGAAGTTCGGGTGT 
CAGGTCCAAAAATCCTAAGGTGGGATCTTCG CTTTGTGAAG CAAATTAATTACACAACCAAAT 
ATTGCCAC ATTCTTGAGGTCTATTGACACAATGGGAACTTCAACCCCTACTTAGCTTAGCATT 
"Hill t G AAAGA GTGAAAAGTGGTOCACGTAGAGCACAATATAATTTAAGTAAAGG AAGAT 
TAAAACATATTTTTATCCA7TTCTTATGGTGGGAAATTAACATGTTTTAGATTTGAGGTC 

tct caggaaaccctttcaacttcgt attattcactoctgagtagtatggggtagaaaatgagt 
ggaaatcagtttggccactatttccgagtcttttgcactgcaatactttcatcaatatttac 

TATTTCAGTCCTGTTTACAGATGGGGATCACATCAGGCNN 
>59

NrWNNCCGAGGGACGCGGGAAAGATCAGTTGTTTTACCTTGGCATTC^ 

gtttgactcccatggttctcaaagcgtgatcctggtccaccaccatcagcatgggggggaa 

cgtgttagcactgcaaattctcattcctccctaattttctga^ 

gcccagc aatct gttttaac caaac ttccacataattctaattaatttat^ 

TGATCT AGTTTGTCCCTCT CATTTTGGAGGCAAAGAATTGAATTCTAGAGAGGTTAATTGACT 
TGTCCAGTCATACAGATAGGTTCTGTTTTCTArrArrTATTTATTTA 

CTTT ACCCCCCAGGATTCATAGI ITTCTl I CTAATACTCCATATTTGACTTGACTTTTTT ACAA 
GTTGT AATTACAAATAAGTCTAAGATGGGAAAGTTGTGGAAAACTTT ATAGAGAACATGAGAT 

ttgactgaacagtaaacattaagtagagaggaaagaaaggggtgttct aagcagtagggacc 

acagtgaataaaggtagagataggtaggtttaaaaaaaaaaajuagtcctggggacagoctg 

ggtgacagagcgagactccgtcnnn 

>60

NCGCCTGTGGGAGGACGTCCGGGTGGGOGGAACTCCT agcggacacctcgtgga 

gtccggccggaagaggaaccgagatgaa6gtgaagatgctgagccggaatccggagaatt 
atgtccgcgaaagcaagttggactt acagagagttccaagaaactatgatcctgctttacat 
ccttttgaggtcccacgagaat at at aagagcttt aaatgctagcaaactggaacgagtattt 

GCAAAACCATTCCl I GCTTCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCA 
TCCAGAGAAGCTGGCTACTG I CC 1 1 I CTGGG GCGTGTGATGGAGAGGTT AGAATTTGGAAT 
CT AACT CAGCG GAATTGTAT CCGTACA AT acjvagcacatgaaggctttgt acgaggaat ATG 
tactcgc 1 1 1 igtgggacttcit 1 1 ? fcactgttggtgatgacaaaactgtgaagcagtggaa 
aatggatgggocaggctatggagacgaggaagago^ttacatagaatattaggaaag^ 
gtqta tact gggattgatcatcactggaaaqaag c i <i 1 1 1 1 ! gccacatqtggacaggaagt 

AGACATT7GGGATGAACAAAGAACTAATCCTATATGTTCAATGACCTG 
AAGT AGTCTTAAATTTAACCCAATTGAGACATTTCTCrTTGGGAAGTTGTGCA 
TATAGTAC TGTACG ATATGAGGCAAGCTACTCCT TTGAAA AAGGTTATCTTAGATATGAGAAC 
AAATACA ATCTGT TGGAACCCT ATGGAAGCTTTC^TTTTTACAGCAGCAAATGAAGATTATAA 
CTT ATAT ACI I i I GATATGCGTGCACTGGACACTCCTGTAATGGTCC^TATGGATCATGTATC 
TQCAQTGC1 I GA TGTGG ATT ACT C T C CCACTGGGAAGGA G 1 1 I G I G I C 1 GCTAGTTTCGATA 
AATCT ATTCGAAT CTTTCCT GT AGACAAAAGTCGAAGCAGGGAGGT ATATCAT ACAAAGAGAA 
TGCAACATGTTATCTGTGTAAAATGGACTTCT GACAGCAAGT ATATTATGT GTGGATCTGATG 
AAATGAACATTCCCCTGTGGAAAGCT AATGCTT CTGAAAAATTGGGTGTGCTTACATCAOGA 
GAAAAAGCAGCCAAGGATTATAACCAGAAATTGAAGGAGAAATTTCAGG^ 
AAACGTATAGCTCGTCATCGACATCTAiCCAAAATCTATCT ATAGCCAGATTCAGGAACAGCG 
CAT CATGAAAGAAGCT CGT CGACGAAAGGAAGTGAATCGTA7T AAACACAGCAAGCCTGGAT 
CTGTGCCAC ITGTGn CAGAGAAGAAGAAACACGTAGT GGCAGTTGT AAAAT AATTGGTATTC 
CT AACAATCCTGATGTATAATTATTTGTTACTTTTGATTTGAGAACTCT ACAAATAAAAGTGCT 
GGGACT AGATTAATTGCAAACATTTTAGTT ATATGTGTAXaAGCTTT ATTGTT ACTCCTTTTAGC 
TACC CTGAAA AATGATCOTAAAGGTGGCCTAGTT^ 

I IC 'I ICt I rCATTGTAGAATACAGTATTTGCAACTCA I I I I HCHG l 1 1 1 1 A TTACAGATATAC 
TTACTTTCTCTTTGATCT ATTATTGTAGACACTATAC^TTCAAATTGACATTT AAG^CCAAACAT 
CTCTT ATGTTATCTTTAATATTAGTT TGAATAA TGATTGCAATGATGTTTCT
CAT AACATTTAGAATAATGATGTCAATTTTTT ACAA 

TTTGGCI 1 1 1 1 GACTCTITTAAAACAATCAGCCTGCATTTATAT AACTTTT ATAAATAATAATAT 

AATTTGGGTCAAGTTAAGATATTAAAAGTTCCTTTCAGCANNW^ 

NNN 

>61 

_^^CGCGGTGGCGGCCGAGGT ACACG7T ACTGTTCCGTCGTATTTTGTAGT CT CTGTTCT 
GCCCT FTGGA ACATCTCTTCGGTGTTCCTGTGGGATCTCTCTACTGCATTCT ACTTT ATGTAA 
TAATCTGI I CAATAAATAATTrTrAAAAGGAGACAACAACGCCGCAGGTGATCTGGAGGCTC 
CTG GAGGACCTCAG CGACTCAGGT CCAGTCCAAG GAG GGCCG CAGAT CAG GCTGAAGGAT 
GGATCCACATGTTTAGAGGAGATCGAGAAATGCAGAAGAGAGATGCAGCAGAGAAATGCCA 
CAGAAAGGGGAGCTGGAGAGAATCAAAGCATGAGAGGAATTCAACCTGCTGTCACTGGAAG 
G GGTCCAGATGGAACGCTTGAGAAGAAACGTGTG T AGCATCTAGGAG TAAAGACTCGCCCT 
GGCTGACAGCT AGTAAGGAAATGGGAACCTCAG TGCTGCAGCCTCAAAG AATTGACTTT AAC 
CCACAGCCTGTGTGCACTTAGAAGCGGATGCATTCACAAATCTTOCAGAAG 
CCCATTGACACCTTGATTTTAACCTTCTGAGAT^ 

9J GA 5^AATACATAGGTATTGTTTTAAGra 

TTAAAAACTGATACAAGCCATAACCCAATCCTGTAGTCTTCCTCCTT ACANNNNN 

>62 

>63 

>64 

>65

' ' L' ' I ■ VJ " " nTIGGAGGAGATGGACAGTGTCAGTCTCCTGATAAGGGGGTGA 
TGGGT AGGTAATTTAAAAGCTTCT ATTATAAAATCT AGTCTCTCTGACACTGCCCTGTCCACT 
GCAGTCACATCTCCCAAT ACTGAAGGATCCT GAGAATACGAGCGGGCATGACACTT ACT CAC 
GTCATTCACCATNCTCGNNMNN 
>66 

CCCATCCCATATTTAGGTTAGATAGAGAAGTGTGTATGTGTGTGTGTGTGTGTGTTGCTCGC 
A CAGTG ATGAACTGT AAACATAAATGAAGATATGGAAAAATACATGAATTAGGACAACATGAC 

A ^r?£ A IL AG iH c i^ 

CAGAkui rGACTGl 1 1 1 C I GGGGATAAAGCAAGGCTTTACAATTTAGCAGTCTGTAGCTAGCT 
^*^J^*^TAA\AACA\ACAACAGCAGAGCGTT AAGTGT ATTTTTGTGACCT aaaacatgaactc 
AGGG r 1 1 OCAAATTCCTAACAATGAATAGT 
>67

m m ^TTGGAGCTCCCCGCGGTGGJ^ 
TCAAATTACCCACAAGTTAAATGCCCATGTTCCAGAC^ 

^T^3^f^99 Cn ^ AT WT ^^ C ^ C » ' G I GC 1 1 AGACCAAAGGAAACCACCACAGG 
GATTTCACAGGC . . 



. , J^Og^gSCTXntXXrrCGCTC ACTCCTCAG TCT 
CAAGCTCAGCCTGCAGCTGCCGTGGG C 1 1 I G I G T G GACTGGACGCAGAGCTTGGGAGACG 
GGGGA GGGCTA TTACTCCAATTCACTGTCA^TGGAATTAOtfSCT ATAGCGGCAGTGTATATA 
GGATTGC I ! 1 1 1 CTCGTCTTCCTGGGTTCTGAAGT AACGGAAGCT ACCTT GT ATAAAGACCTC 
AACACTGCTGACCATGATCAGOiCAGCCT 

CTGTTOg iGAAGTAGCACCTCTATCAGTTATGGCT AAATCCT GT CCATCTGTGTGTOGCTKC: 
GATGCGGGTTTCATrTACTGTAATGATCGCTT^ 

GATGCTACAACTCTCT AOCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGATTTG 
AAAA^CTTGCTGAAAGTAGAAAG>ATATACCTATACCACAACAG 

AACCT CfcCAAACT ATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTAT 
GCTCAvi 9 I t^AAAAATTCCCTATCTGGAAGAATT ACATTT AGATGACAAC I C I G I C I C I GCA . 
GTT AGCATAGAAGAGGGAGCATTCCGAGACAGCAACTATCTCCGAC I GC \ 1 1 rCCTGTCCCG 

ATAATOGCATATOACTATJTCAT^^ 

TTCT AGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAG iTTTCl I C AACCTAGTTA 
ATTTGACAGAGCTGTCCCT GGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCTTCCAGG 

CAC^CCJGAGGAAGCTTTATCTTCAAGATAACX^CA 

I^H^IEL^GCAGCTCTATrc^ 

GGGTATCITrGATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGT^ 

<X^TGCAAGATGAAATGGGTACGTGACTGGTTACAAT<^CTACCTGTGAAGGTCAACGTG 

CGTGGGCTCATGTGCCAAGCCCCAGAAAAGGTTCGTGGGATGGCTATTAAGGATCTCAATG 

CAGyUCTGTTTGATTGTAAGGACAGTGGGATTGTAAGCA^ 

CCCAACACAGTGTATCCTGCCCAAGGACAGTGGCC^ 

TTAAGAACCCCAAGCTCACT AAGGATCACCAAACCACAGGGAGTCCCTCAAGAAAAACAATT 
ACAATTA CTGT GAAGTCT GTCACCTCTGATACCATTCAT ATCTCTTGGAAACTT GCTCT ACCT 
A ^ G ^ G ^ZI9 AG ^ CT ^^^ GGC ^ AAACTG ^ GGAT ^CCCGGCATTTGGATCTATAAC 
AGAAACAATTGTAACAGGGGAACGCAGTGAGTACTTGGTCACAGCJCCTGGAGCCTGATTCA 
COCT AT AAAGT ATGCATGGTTCCCATGGAAACCAGCAACCTCTACCT ATTTGATGAAAGTCCT 
^^^^^^^G^'ACTGAAACTGCACCCCTTCGAATGTACAACCCT ACAACCACCCTCAATCG 
AGAGCAAGAGAAAGAACCTT ACAAAAACCCXiAATTTACCTTT GGCTGCCATCATTGGTGGGG 

CGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTATGCAGA 
AGCTGGCACT AAGAAGGACAACTCTATCCTGGAAATCAGGGAAACI ICM i I CAGATGTTAC 

GAATGAATCTGTACAAAAAC^TCACAGTGAAAGCAGTAGTAACCG^AGCTACAGA^ 
GGT ATTCCAGACTCAGATCACTCACACTC^TGATGCTGA^ 
TTGGGTTTTTTAAACCTAAGGGA^ 

AAAAGAG ACTGA AAAAAAGCAATGTACTGTACATTTGCCATATAATTrATATTTAAGAA 
TATTAA AAGTT T CA AATTTCA GGTTACTGCTGCX^TTGATGTAGTGGAGATGCCTGAACACAA 

™™ A 5*T rG Ji rrrCT a _9I GAG ^ AAGA ^ GA ^^ G ^ Ga ^^ GAaa ^^ g ^^ 1 ^^GTT GCT 
STi^OGAACAATCAGGACTGCATTCATATGAX^TCCTTGT AGT AT AAGCACAGGOCATTTTTC 
*CTTTGGTATTAATAAAATGT AAAAAAAAAAAACTGGCT GAAT GGCTGAAT GAGAT AAAATTT A 
ATTTTAAAAAAATGGTTATGAAATAATGTTCCAATTATTAAATTTGTATTA 

AATAAATCAAAATGT GTGAAGTAATGGGCAATATCAAACTT CCT GCATATCTCCATTTTTGCTC 
TAGGCAAATT AATTATCCTT AAAAAAGTTAAGCAT ATCTTCTGAACT GAATACATCAGCTGGC 
ATAAAAGGAGCATGAAGTCTGTTAAAGCCATTGTCA^ 

CAi 1 1 1 U3GT AATGTAAATGTGCTT CX^AGTTGGGGGGAAAATGTGTACTTAGGAAAACATG 

GAAACTTAGACTTGTATAG 

>70 

^,~rff?£??!fffl^ I C AGTAAT 
ATAGTTCATtrrGGATTCGCTCCAATCTAGTTTTAAACT AGCTTTGGTCTTCTCATCTTTTCGGT 
ACAACACATTTCAACT ATT ATACTTCTAC 1 11 1 1 1 rCTTGTACACTCAGATCCATGGCACGTCA 

I3337. A f^ GG ^ A ^^ TCAATATA1 ^ c ^^ 

^SI^ A ^f^^ G 5I G 5g CT ^CTGAATA I IC1GGI I rCTGAATATTCTGG ITCTTTTC1 
. i i ■ lA^nATAIICI 1 CTAGrrCGA^G^MOCACTrGGTOCAGTACrGGGAACAjGAGCTCGCGAT 

?^I G ^Z^^ G IS5? CAG1n ^ TOCAT ^^ 

AACCTGAAAGCTGT AGTT GGT ATCATGGTTCATGGCACCCATGTATGCCACAACTTGCAGTG 
GGTTGTCAAATGT ACTTTGAATAAAAGGCTTT GGTTTCTCTGATGTCTTCCAATtiAATCACAC 

TQAACAGCACTTTCAAGAGCTCGCACTCX^ACTGA 
ATGTAACC^ACTTGAG^ 

TTTCAAAGGAACTGAAGGATC^CTTGCATTTGGTTTA^ I fCTCTCI GGATTGAAGATAGG 
f^tlSit^V^^SB^E^SISiiy^^ ***^!^^pTITGGCAGCTTATGCTTGCTCACGG 
- GTCCACAGAGCAAAGCAICI I CC I CCACCGATCC(X3GGGTCTGGGCGACGCCTCTGGATGA 

AAAACTTT GAACACCTGAGCTGCCT GCAAATGGTCTGAAATAACTTCATCTTt^TTCAGATTT 
CACACTTCGCTTAACTCTCAAGGCGCGAACAACCTCTCGCGACCCGN 
>71 

~. . . ^TTiL 0 ^ 07 CC^^^GTGGCGGCCGAGGTACTTGAAGGATAAGAAATTACTGTG 

GA^CACAGGC AT ^^^ACTCAGTGTGCTTAGACCAAAGGAAACCACCACAGG 

>72 

>73 

>74 

ArTAr?^^Iw^?^ GG ^ G ^^ TGATM ^ CT ^ GATf ^ GAG CA TA GT"AGAGAA 
^ a^^^ZT A t AA « A ^ AAAAAAAAG CAC ATT AC AACTCT ATGT CAGTGTCT GT CCCAG G TCCT 
l^V^^Sl G CCTGGAAGCATG CAAGT AAAATTTCCTTG ATGGCATTTCT AAGGTTCAA 

ACA i A n c iJ c ^ AAAAATGcA ™ cAA ^ 

A ^ AAA ^!7 G I^ C / AAAG ^^ 

gagccactggtgggccctcagacacgagcaatcctt CT GGCCACTGCCAGTGTCCTAATAA 
GGG^TGAACAAGCGCCTATTGGTGAGGGAAGGGGAGT^^ 

^ttaaa^g^taaggtgtggt^ 

>75 

AG J^^ GAAAC I AATCTGAAAGGGAACGATA ^ 

>76 

CCGCCTGCCTC^ 

II£ CC X G i^^ GAG ^^ 
AG ^^ T J!9 AAGAAACTGTGG ^^ 

GG 7S G 4 AG 3 A ^ AGA £7^ 

A^TCCCTGCCAGAACCACCACTGCAAACACGAf^ 

I CGACAGTACGGGA ^ C ™ 

J^t A J^? 1 ^ g ^ g17 ^ ggg ^ GGAGAGaagg ^ ggt ^^^^^^tm^^j^tm^t^ac 

GGTGCMAAAATOA^TTCT^ 

?S?Z7 CCACACGGA ^ AATAG ^^' ' 1 1 » CIC I I GCCATTTGT AGCTTTGCCCATTGTCT 

I A T^£^ G ^ G ^ ACACGGAT ^ 

IH CAG ^ TC ^ GGCTGCCTGTGGGC ^^ 
TAACAGACACACGATT3TTO 

JC^ggAGAAOCCACCAACCAGAA^ 

2?£ Q ^ S J^ A J QAA "^ 

CTGTCAGTTCTCTGGGAAGTGGTCAGCGCATCCTGCAGGGCrrTCTCCTCC^ 

JfAAC^CGWTOTCTCAXSGGGCTCT 

AAAA TCAAG^TGj*GATGTA^ 

ataagcatacatgtctacacatgactagacaaatgcaaaoctItgcaaacSc^ : 
GTA^Sic^G^AG^ 

I C TS C I GGGACAAC ^ 



«TAACTTT(nBTGACC«GACTGCACAAGACAMGCT^ 



;£x A i^2 AA £ AG ^ 

TCCCTCTTGCCTGG^TCTCrCAOCTTCATAAATCTAXaTTTACTC C 1 1 FN 



* 4T ^ T ^9 G H GGCGGCGGGTAC ^ ATA ^ ACA ^ A TGCTAAAATG 

TGTAGGTGCCGTG<^CAfiGAAAGCTAACTTTAGCTGAAAGCATCT 
A^GGOCCTCAAAGGAAAG<^TGAG^ 
TCTTGJTTGTCAAGAACAACA^TCtX^TTTGA^^ 
JSCTrAGGTTCTGGCGTGCA 



, ?^ N ^ NAGAGC ^<^CCCCCTCCTCX^GTGCTGTCT 

raS^™£° AG * AAAAA I^ 

GGCCTGG^CCTGGCCCTGGTAAAAGCTGGTGGATTTGAAATGCA^ 
£?g^5£^^ 

>87 

>88

>89

ACCGCTCAGCCTGCTTGGTTGCATCCTCCGCATGGCGAGTCAGCTCTGAGATCTGA 

g £J gg 5 a ,7 g cg^^ 

CTGTTGCTTGGAAATCTGAGATACTTGATGCTGTAAA^ 

AAGTTGTGT 

>90 

GGAAA i! AC Z G ^ ACG ^I CT a ^ gaaaaaa gtgacttcattcaattggttcagccacccgtatc 

^ AA J c Z c 7 c ^ TmGCG ^^ 

CCTCCCAGTAGGAAGAACAGTCACAAGGCACTGTTATATC^T^ 

G ^TI A I*T AA J AGTATAACAGC ^ 

G ^^^3 X A J^E^ A ^^ G ^ G * G ^^ GGGGG TGAACAAACT ACAGCCATTGATGAGAATGTAA 

gcaggcatccttgagcttcttctaggaacaaatacagatgtgcacaaaa 

GTAA^TTTGCATGACTTXSTAGCAGTAACAGTTT^A^ 

AGG ™! CAG JI A 7I GGTAG 

ctoctgaatgaatattgaaaagcatctagtgacaccaaggcctt^ atagttaga>C\aggcatc 

>91 

^n A J^ A fJ AG X ATAACAGCAGTGAAT 

gc ^ gc i g i a j a i^ aa i^ 

GGAGGCATCCTTGAGCTTCTT CT AG G AACAAAT ACAGAT GTGCACAAAATTTTCAT^^TTCA 

gtaagaatttgcatgacttgtagcagtaacagtttcacattttc^ 

r^^f^nl^^TT t^ G ? TAGGGT ATAAAGTGT GTTAGACT GAT^TTAAAAT^C^"S^nnrCTG 
CTOCnTjAAJGAAT ATTGAAAAGCATCT AG TGACACCAAGG CCTrTTATAGTT AGAAAAGGCATC 
mCATCTNNNNNNNNNNNNNNIWNN^NNNNNNNKN AUAAAAQGCATC 
>92 



C*GTcStWa1^ 

G GCGTCAAACTCAACTAAGTCTTTCCTGGCAGATGCCGGCT^ 

gc aa ^^ g 5S2 a tgg^ 

GCATTCCTA^GTTAGCTGATGTTTTCA 

AAAGTTACCCAACAA^GAGA^ 

AGAATTrrrTCTGTGArrCATA^ 

ag £Ia^™ a ^ g X ag ^II g ^ 

G z c ^ GGA ini Gc i GTAGGAAmGTAAcAA ^ 

ACCAGT^ACTTGAAGCTAAAATT^ 
GGOTOCAGTCCTC^ 

ATTCAAGATCTGAAATTACTTGCTAATAAAACACCAC ATACTT GGAGTAGGGAGAATATTCAG 
GCACTTTGTG^TGTGCCCTCO^ 

CCTTTC^CACTATCAGGGACCATCGCCATCAAACATTACTTCAGTAT 

5^ SSPj^ GAGG ^ GA ^ GGAGT ^ A GAGTCCTAACT AATATAACT GTTTCTTGT^CAAGAAAA 

G ^I£^ G *? CA ^ QAACA ^ 
G J°^ GATGATAGTCCAGGTGCTCAGGG ^ 

IT G ^? > 5^^ G f^ G 1 G P GGGA ^ C ^ AG ^ A GT GAG T AGTT G AGACCTTG TTGACT CAATTGCA 
CAGTGCTCAAGACGCTGCCCGGATTTTGATGTGCCATTGCCTGGCAGCCATTGCCATGCAA 
G T GG ^ G J[ GG ^ GGG TGATGGG ATG ^^G^GACCTCATGGAGCTGTACAAG^^^n^^AC 
GA ™ GCCACA6ACAAGCAAGA ^^ 

AG T CAG ^ GGCA T rGTCTGTGGAAAGTM ^ 
^! GG ^I GG ff TGTATACCGTA ^ GCCAG ^ 
cgaaag ^ c Z™I ga gagtttgctga^ 

a i ag it! g ^ ag ^ g jj^ cac ^^^ 
CAGCAGC J AG J AC i£^ G ^^ 

ACCTTTTACAAGCCTTCTCTCAACTTATCTGTACTTGTAATAG^ 

SI G ^^J TGCCACAACAA ^ GCCATGACC ^ AGGAA ^ 

TCCAATCAGATGAAAC^GTCX^TGGAAGAATTTCGAAGCCTTGCTTCTCGAT 

GG ^ T AT GG £35J AC ^^ 

TG CACACAG CATGCCT CTGCAATG CCATC ATTGCTTTG CTGAAAGTTCfSfX : 1 1 lt: i I im>fti 
9^™ " ' " 1 ^9^ ACT ACAG TCTACCAGCATOAAGCriTGCTCTGTCACCATCGCCCCGG 

AA J C ^ G ^ GAGCCG ^ 
^?^ A C J GC ^ AG J^ 

J^I G J SAGGAAAGGGTrG * A 

GC I A T CG ^ GGA ^ GAC ^^ 

T GG ^ AGA ^ GGT A CCGAGAACT 

AA IT CGC J3 A i^ CA ^ 

TACACACGGTTTTA ACCAT GGAATGAATGCACTGCAG 

CAGAA ^ G VX G iT 1T ^ 

X G ^TIIEI? TAA I G I AATA ^ CTGGAAA>AA 

GCGTGGTTGCTCACGCCTGT AATCCCAGCACTTTGGGAGGCTG^kGGTGGGCGGCTCACGA 

GGAGATCAAGACCATCCTGGCTAACACAGTGAAACCCCCTCT 

A 7 4 AGCCAA ^T GG I G « G ^ 

G ^I GG ^1£^ ACC X G <^ 
CGACAG ^ C J GAGACTCCGTCTCAAAAAAA 

AAAAAAAAAAAAA 

>95 

~ . ~~ . ACC I GTATGATAACA ^GCAGTCAAACATATCT^ 

G ^ GGAG j^ AG ^S c ^ AG JI^^ 

GTAGGGGGATGAGCAGCTCXiACACCCTGCTCCT GTGTGAGCTGTGCGCTCCCGACTGGGA 

>96 
>97

^ GAG ^ 1 CGCCGC<^TGGCGGCCGAGGT ACCTTCCOCTGAGGAGCCCCCTTCAGA 
GGGGTGAAGAGCAGTATCTTCAGAGGC^TCCAAGTTTTAG^ 
AATGCAfiAGAAGAGGCTGGTCATAGACAAGTTTCATBt^^ 

AGAGTTTAAAGAGTTTGGGATCGAAAGAAATCAAGAATT'^^^^^ ^p*G AGG 'V A 
AGCTCCAN 
nnctccccgcggtggcgg ccgaggtaocagcagagatggctt caagatgatttag 
gacttgggtcagtagcactt actgatgtagtggtttgatacacactgattacxrn' ct7gcttt 
t7tattctctggcatttctcct atataactagccactttt aaacaatatttgtcggctcttttct 
tct gci rs tci gtaaatattagggttcctgagtcctt agct ag a 1 i i 1c t f ctcttctt actcc 
tggoctttccttgggagagttcataattcacctactccatctagatatttgtgatgtocaaac 
acatctccacgttaggcttctatttgtagcatcagacccacactttcaactgtccactagata 
gcctcacttggatgctctgcaggcct AAATAACCTTTGCGGACAGATTAACAGGGAAAAAAT 
ATTAATAGGAAAAAATATAGAJ 1 1 1 1 ATCTGATGTTAATATTTCTATGTGGCATGGAGGACTTC 
ACAGAAAAAAGTGAAAACTCTAAAGCAGTTAGATTTGAGAATTATATACCATTTTAACAAAGAA 
CAATACATTGTGGAGACATGACAAAGGAAAAAGGGGTTTGGGCT GGAAGGGGGATGGGAAA 
GTGACTAGGAAACATATCGGGGAAACTGATGGAAGAGAAGGGCAATTTTAATAAGGTTTGT 
>99 
>100 

AGCTCCCCGCGGTGGCGGCCGAGGTA L I I 1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 IGCAGT 
TTTCCAAATATGTTTTAATATGCATATCATCCAGGCAGCATAATGTTATATTTCAAAGACAGAT 
TTATCCATTGAATTATTG7TTTTAAAAGTTGGGATTCTCTACATAGAACATA7TTTCTGAAAT7T 
CAAGAATATTTTCAGGTAAATTAAGAATTAATTTCTTCT AAGACTATCCAATGTGTCTCAATCT 
ATTCCATAATATAATCAATGATAAAGATTCACATGTATCAOCAAA7TCGAGGCAGCTTAGTTG 
AAAAAATTT GAAAC AGCTT ACTGAATTCCATTTGCTGATTCTGTGGTGGCTTCCTCGATGGCA 
TGTGTGCTCCI 1 1 GGATGCCTGCAGGGGTGGTCACT GCAAAGTCGTCATCTGTGCCACTGG 
GAGTTGGGAGGCGGCCTGCTGGGGTTCCCTGGGTGGCAGGATTTACACCTGCTCCTCCTG 
CTGGAAGGCTT CCATCCTGGACATCTGGA7T AGCCCCTGCCTGACTGGTGGGCAGGATGCC 
TCCCGGGAACAAGGAATGGATGATGAGGCTCGT GAAGATTTGTGGCAATTCCTCTGAGCTTA 

crcTTG^ACCTra 

>101 

tfrGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCCCCTGAGGAGCCCCCTTCAGA 
GGGGTGAAGAGCAGTATCTrCAGAGGCCATCCAAGTTTTAGCATAACAAGGAGGGAAAGAG 
AATGCAGAGAAGAGGCTGGTGATAGACAAGTTTCATGTTCACAACTTGAATTGCAGAGGTCA 
J^TrTAAAGACTTTGGGATG^ 

>102 

ACCAT AAT AATGCAATT AACAAAATCCAGGATTT AAG GATTTCT ATAAGATT AAAAAAA 
AAATGAGGTGGTGTCGAGTGGGGAGAGAAAAAAGCAGG AAACAAAACT GGTGAGAGGAAAT 
GAGCCOCTGATGAAAGATCTTAAACACCAGGCTGAAGATTTTAGATTTCT ACCTATTAGAAAT 
GAATATTCACTGAGGTTTGATGAAGAGTCACTGAAGTGTCACAAAGAAAACAAGATTTGAGA 
AAGATTCTTGAGAACTCGTGCATAGGAATGAACTGCAATAAGGGCAGATTAGAGAAGAACTA 
GGCCATGAGGGCCTAGT ATCCAGAATGAGGCAGAGGGAGGGACGCTGGATGTGAGCAGAG 
AGGATGATGACGAATT AGAGATC^CGCCAAGG AGACAG AGAAGTCAAG AT GATGTT GAAAT 
GTGCTGOCTG GGTAACTGGGAGAGAAAATQGTACTGTCTGGACCGAAAATAAGAATTAGTKA 
CTTCTTTTCCAGAGGGTACT AAQGCAGGTCCTGGTTTAGAGCCATATGGAATAATTAAAAAAT 
CCTGTAAGATAACCCACGATCAAGACACACTTCAAATAAAAACCACTrTAGCAGCAGAXXXiAT 
C1G I i GGGCCACTTGGCCCCGCTAGACCTTGCTGATAGAGAACACCAACTCATCTCTTCA 

ctcaacaaagattcactaaaacct aaacaatgtagcatcct aagct aggtagggaggacccc 

^^^^^J^^*^^^^^^^^^^^^^GCTTAATTCTACCTTCTTTCOCT ATCAATG 

AGCTGTr ACCT atgaacaggtagctaaagtgcttgcttcagagatatgagtgcctgatacca 

GMCC1 AACTGCCACGT ATGGCAGATGAACACCTTCAAGACTTCAGTCTATOCTTGGCAGAG 
>103 

Q^'^' ^atg gtcgtgggagtcctctagcaatg u i c ci re r rw rroooGTOtACTcrawgr 

CarTGACTTCAAACTCAGCTTACCCAAAGC^ATGTCCGGCGGCCGAGG 1 1 ICUti l 

GCCCTCACCT7TGCAGGAGACTCTCAATTTCTCAGTGCACATCAGCTCTGAGACCACCAA 
CAAGGGTTATTTTTCT AAAAGACATTGGTTCCCATCGCTCCTCTGACT AAAGGTCCT act atg 

GCACATTTGCtXnTGGCACTCAAGGACCTTGCAATCAGGCTGAGAACCT CATGTTCT CAAAC 

TCAAGACCANN 

>104 

>105 

ACTTTCT AGGTAT ATCATGTGCCCT AATGTGCT CCTAAT ATCAT AAATGTTT ACTTTCC 
GAAAAGTATTTCrrGAAAGGGAGCATATTTTGGAAAGTGCATAGGCTTGTAATCATACTTGTTT 
T CAAGTTTCAACTTTGCT ATTCAACT AGAAT AATCTTGTGCAAAACCTGAGCTGATTTTCT CAT 
CT ATAAAATGGAAACA ATAC TTT CTGTGATAATGG GTGC AAAACACAAGGTATACT GGTTTCT 
TT GCT CTGGATT CAAG mTCTTCTl AGTTTCAAAATTTTAAAGGGAAACCAAAAATGTTTCAT ' 
GGGAC 
>106 

GTCCGGTAGTGGGCAGCGATCAGGGCTGGGGCTCTTTCCTGAGTTGTGTCAGGTG 

agagattgtgagaacttggcttgcagggtttgggcatcagctgcccattgaggggccgttc 
attgtcrcaaagtgaatgtggggtggtttgatctgcatgtgtcatttgtatccacacaagtta 

ATTA TTCTG CI 1 1 1 G F IGTAGTACCTTGGTTGTGAAGCAGAAGCTACGAGGCGTCTATGTGCA 
GCGATCTTATCGCTCTG CATTAAGTAAGATGAGGATTCACTCTTAATTTATGGGCACAATTTA 

GTTTCTI ccacacaaatttaggccttaactctttt attttttcctacagtgggggtttggagta 

ATATTCATACGGCATGGACTTTACCAAGATGGGGTATTTAAGTTTACAGNN 
>107 

ataattgcagagaaagcttgccaacggtgataagtaggtttgtct agcagcactgat 
g cgtggtg gaagttgatggtcatg aacat acagtgtgat aacct atctgccctcttgacctt 
ttctagtagtgctatgtcattttggtactaaggtaggtgaattttccaagtgttcttggaaat 

AAGGAAACATCAAGAATAATGTAAAAGCCTCAT AT AGAAT AATGAAT AAT AAAGAATAATGTG . 

aaggcttcattcaaggttggggtttgccagatacattgcaacaaaatgacagagcagccaag 
gtatttaggatagtggccaaaggattgtaatgatggctt atggaagtgtcagctggataaag 
agtgaaaatgaataaaaactaann 

>108 

nnggattct acatcaggtgtctgtgcctcgctgctgaaggataacccagagtgcaa 
ggtcatctttgttgctgaacagggctggacctgtcgcactt aaggacacttaaaggattct A 

TTCTT CATTCAGGTCCCCCAGAGAAATTGGCTCCTTA I MUCH I A CCT ATTCCTAGACTTCC 
TTTTGTCT AGAGCX^GTTTTGCAAAGGGCACTTTTATCCATCTCAG^ 

AGAATGAGTAAAC^TATGGGGCAAATAGCATATATGAGCTAAACCAGTTAACTGTTAACCAA 

GGCACATGGTGAATGCCTT AGTA I II I 1 1 1 III 1 1 

>109 

CCACGCGT CCGAGACACTTCTCTGACTAACCATAGACT ATGT GGAAAATGGTAGCTG 
GATTGCCTTTGGGTGGAGTCCTTGCCCTG TGGCAT AGGAAACAAAGGAAAGGAGAGaGATG 

ccctttgagattaatgaaaatgctctcagccaaataaaatct aaaaatagcctccttgtgata 
cgaacgcgtggcccct aagggtc ctaaa gagagagct agoggaqgttcagctggccacag 

AGATGCTAAAGGTCAGGAGCAGACTTTTAGGG 1 I IUCIUI 1 1 1 ATAGGTTTAAAGACCAGGT 
CT GT GTTTTGATAACrrGAACTTGCTAATAGCTGGCCACTTGAGrT GC 1 1 C I ICCAGCTCTTTG - 

tgggagtg^ 

>110 

gtgctgcctgcactgtgactaagactttctggagt atcatcatgtttaggagttgat 
gagattatagtttcatgtaagtgtatcattagatgacaactctacatcttt aggcatggaaac 

AAACATTTTTCCTGXsAAGAAAAAAAAGTGAACATCCAACCTC 

TCTTT GCTATTAAGAAACTCGGTGO fCTTTCl tAXiACTCTATTATATTGTCAAAATACATCTGG 
AGACACT ATATAAACTTTTT CTCCTTTAAATTACCTGGTTTATATATTATCTCCT GTAGCCTGC 
ATATAGATAAAGGTTAAACATAGAGGATTTAGGTTGTTGGTAATTTAATAAANN 
>111 

cgcgtcagtcagtctgacggtcagtggatcggtgggtttatctcaagggctgagta 
gccggt aagaaacgagggttcccgggattggaccgacgcagocatg gt aggtccagat cc 

ccgcggtaagcagtggtctgatcaaaagaaagct aactgctagatctgatcgagttaagagt 

GTGGATCTGCATCCTACAGAGCCATGGATGTTGGCAAGTCTTTACAATGGCAGTGTGTGTGT 
CTGGAAAGTTTOT 

AGTGTTCAATTACAATACTCTGGAG AGAGTTCATATGTTTGAAGCACACT cag ACT ACATTCG 

^j^n^GTTCATccAAcc^ 

GCTCTGGGACTGGGATAAAAAATGGTCTTGCTCACAAGTGTTTGAAGGACACACCCAT^ 

NNNNNNN 

>112 

NNNNCCGAGCGGTTTGCATCGCCAGCTCGCGGAAGGCCATGAG^ 
GAAGAACGCATCX^TGGCGGCAC<^GCCTGTTCCGGCACGT 

ACC CAGT CAGGCAAGGTCT CGCGCACTTGCACGCCC5GTGATGGCGCGCACCTGGTCGTTG 

AGGCTrrCCAGATGCTGGACGTTGACTGTGGTGAATACGTTGATGCCGX^ 

CCTG^TGTCTTGCCAGCGCTTTTC<STGGCXK^^ 

CGTCX^CCAGCAC^GTTGGGCTTGGOMCGAGCANNNN 

>113

'GAGGGGGCGACTGCCCCTGACAGGTGTTGGATTCCAGCTGCTGTG 
TGGTGGGAAGAACGGGAGAATGAAGCX^GCCTTGGGAGAGGTAGGA«3CGAGCC^ 
CAGCi lk, 1 1 CCAGCATCTGGATCCAGGCTCAGCTGAAGCCAGCCACCTTCTG 

>114 

NNAT AGTGATGAAGCTGGAT AGTT AAAACAGAAACTT GAAAAGCAAGATGTGGAACT 

' ACGCXKjGAAGCAACTGTCAGCTAGTGAGATTACT gtgtatggccaatccagataaataaga 
CGATCAAGTCTTT ATGAAAAGGAAAGAAAAATTTGGAATGCAGATCTCTGTCr^r^^Trr 

^I A ^^ TGGAGC< ^ TC1TCACA ^ 

gCTCGTGGTGGCTCATGCCTGTAATCC^^ 

TCfCCTTroGTCTCAGGGCTTTGAGACCA^ 



i^'/^™™—' '11 ' 11* ' 1 AATTTTTACTCti II I L I II ATTAATATAGAAAAGGAGCCC 
AQGGCAGCT GGACCAGTAGTACAAAGCACCAGGAGTTAATAfa^ATTrrrre/rrn m. Arsnrzn a to 

TOCOCAGAACaX^GAAACGGCTCCCT 

GACCTGGAAAC££TGGGGA^ 

GTOS^TAAGG^GGGAACrr^^ 

AGAACCCAGAC^GGCCCTGG^ 
CTTTGCTCTATACTACATATGGtt 

CCAGTTGGGAGGAAGATGGGGCAAG<JAAGGAAGCACTT GAGTGTCCCTAGCTTAGGCAGC 

CGGGGATGAGACACAG4X>GGACAACAGCACCCC^^ 

AGGGACGGGCTTATCCTCGGCCAGT^ 

CAGCCTTAGCAAATCAAGTGTGCAACAAGCAC^^ 

J^CAAGCJCTGCCAGAACTTG^ 

CCTTGGDCTbl 1 I U, I CC T C I GTCAATGAGGGCTTTGAATAAGAGCTCCTAGGTCATGAAAA 
GGACTCJGCAGGTGAAGGGAACTCX^ 

GAACTGCTATGGGATGGGGTGAAGTGGGGGATGGAGGAGGAAG^ 

ct atccccacaaacactacagagtgaggaaaggccactggaag<xc i i d i re cccAaSsG 

^gAAAGGOCAO^AGAGAGTGAATGTGGC^ 
gOgffCACTTCGCAACO^ 

ITCTTTTCACTGTCGITCCTGITCTGGGAACAGC^ 
gt tggt gaggaaaggacagagtgggtgagtaagcagacagggaggt aagagtgactctct 
ggcttt ctcxttcrrrcctctt^gagagcaggtcaatcaactgagtcgcagatct 
acagcattgaagatgtgtggctggtcagcatagacatggtgggagg(>ctxrrtaatctcca 

TGTCTCGGACATAGGAATCCGGCOGCTGCATCTTCA CC 1 1 1 1 1 1 c ccgtactggtatctatc 

caggtgtcggacccgtagat(^tagtgata6gcacatcttttcgaatcaagtgaattw3ctc 
cagcatagggcgccgggcccagccaaaggactccatc^tggcti^^ 

. CTGGGATTCTGTGCGTTGCAGTGGTAAATATACrrCTGATATGGTATC^TCTTCAAAGAAGTCT 
GCAAACTTGCGTTTGAAGTCCG<5CCGGAATCGCTGCACCAGACCAGGCCCCCAGGGOCCA 
GCTACT(XiAAGAACAGCCAATGGATTGGAACGTCCT AGGACAGATGGCACGGCTTTGACCC 
AGGCTGGGGGTGCACGGATCTCACTGGGGTTAGTTGGTXX5GAGGGGAAAGCCCCATGGGT 
CCACCAGGATGAGGTGTTTAACTCTATCAGGGTACTrGATTGAGTAAGAAGTGGCCAGGAAT 
CCTCCCAAACTGTGCXXX^GAGGATCATGCTGGGGATCXXX^ 

CT ATCGATGTCACAAACT C^TCCTCAGCCC2XTr(XX3GGTCCXrrTGGGAATGCTGGCCTTGA 
GCTTCGCCCGAAGCCAAGCAGATCGAAGGTGTGCAGTGTGGGGCGGGCACTCAGTGAGTC 
CATGTT GAGG ATCGAGAGACCCACX3CCGCCCCXAAAACCATGGACCA 
CGGTCGTTTTGCTCG^iGGCTCACAGTCACCGTCCAGATCT^ 

CATATCTGGCCAGGAACTT ATTCTGGAGACACTGGAGGATCCT GGCTTCCACATTCTTCAGC 

TGAGACATGGAAGTGGGGCGCCACGTGGGCAGCCAGCTACTCAGCCAGCCTTGAGACTGC 

TGCTCCAGATCATO^CGATAGTAAACAAGCCTTGCCAGGTCCTGG^ 

CCCTT CTGTCGTCTT CTT CGCAGCCGT AGTCGACGCGGCCGCGAANN 

>116

C^CGGTG GCGGCCG GT AGCGCCGGT AGGCGGTGTGGACX^GGGGCTCGTCGGT 
GGCGGCCAGCGAATTGGTGACGACGCTGATCTTCACGTTGCGCCC^ 
CACGTCX^AGCCCCXSTGGCACCCGGAATCAGGTAGGGCG^ 
CGCGCGGCGCATCTGCTCGACXACGTTGTAGCGCACGCTX5TT^5ACATC^ 
GCCGCCGTACXiACGCGGTCTTGCCGATCACGCGGTCAGGCGAATa 
GGTGGTCt^GATCAGGCCGAGCTTGCCGGCGTTTGAGGTCTTCGACCATCXi 
CGAN 
>117 

TGAGCTCACXXiCGGTGGCGGCCGAGGTACTCTAATGGAGCCACTC^ 
AAAAAGACAAAAATACCTCCT ACAGTTGTTATCATCAA<X5TOkGTTGCTGGCTTTTCCTAAAT 
TTGTCTTCTACCTCAGATCTAAACCATTTGATAACATTAGGGCA^ 

CCAGTAAAACCATAGCAAATG UNCI CCCTAGGACACTATCTGTTTTCACAGGAAAATTTTT 
CT CAT AGAAAAACT G T AGGAAAAGCCAT GGA T GAGCTGAGAAGACCAAACCT AT CTCTTGG A 
AAACAACAGTAGGGAGCGTGGATT AGAAT GTCrTGGGTGCGT GAAACAGGCAGACAATCCT 
GAAACATCTTTTCT GGGGACG T AAGGCATGAAAAATTTCT ATAGACTT AGGAGGGCTTCTAG 
GAAACAGGAAACGANN 
>118

ACCCCTAGCAGAAACAGAGTTTCGCC A T G n GG I CAGGCTGGTCTCGAACTTCTGT 
CCTCAACTGTTCX^CCTGCCTCAGCCTCCGAAAGT^^ 

GXXX^AGCATGCCGCT AACCGAGGCAGCAGCGGAOGTGAGCGATAATGGCGGATATGGAGG 
ATCTC1 [CGGCAGCGACGOCGACAJGCGAAGCTGAGCGT AAAGATTCTGATTCTGGATCTGA 
CTCAGATTCTGAT(^AAGAGAATGCTGCCTCrrGGCAGT AATGCCTCrGGAAGTGAAAGTGATC 
AGGATGAAAGAGGTGATTCAGGACAACCAAGTAATAAGGAACTTjrT^ 

GGACGAGGGAGCTT CACATCATAGTGGTAGTGATAATCACTCTGAAAGATCAGACAATAGAT 
CAGAAGCTT CTGAGCGTTCTGACCAT GAGGACAATGACCCCTCAGATGT AGATCAGCACAGT 
GGATCAGAAGCOCCT AATGATGATGAAGA<XJAAGGTCATAGATXX5GATGGAGGGAGCCATC 

attcagaagoagaaggttctgaaaaagcacattcagatgatgaaaaatggggc^ 
aaaagtgaccagtcagatgatgaaaagatacaaaattctgatgatgaggagagggcagaag^ 
atct gatgaagat a agct g cagaattctgacgatgatgagaaaatgcagaacacagatgatg 
AGGAGAGGCCTCAGC I I I ccgatgatgagagacaacagctatctgaggagg>aaaggctaa 

TT CTGAT GATGAACGGCCGGT AGCTT CT GAT AATGGATGATGAGAACACAGAATTCT GGAT G 

aatgaagaacaacacaagctgttctggatgaagagaaaatcgcaaattt^^ 
gccaagggctcaaaattgagaacacagggatcaaatattacaa 

GACCTTXXAAAAGACGCGACAAACACAGACGAGarrCC^^ ■ 
GCTATGGAAAAAAGAGCAAATGACAAGCGAGGGTT^AAAAAAATAGG^AGGACGG^ 
AGGAA<^QGAGAr^M.GA*_*CCACGAGGGGTGGCGAGAAACAA^ • r • GGA""* 

A ?£ AG f G ^7 GCAAGCACAAAC ^ 

AGCAAATGAGACAAACACACCTGACAACGAACAATGATAAGAAGCAAAA^^ 

-^.^^ ^*'^l^'f^** , *' s '**'* , *'*^*OG G AAGXCTTCTAGAATTAATTAACG00C5GGT AGCGGAOTACGC 

G ^ AGGAGGA J^ GAGA ^ 
i? G ^ G ^ G J GGCGAAGGCGGG ™^^ 

GG ™f Jf^^C^^CCTCAAGCTCAATGACT ACX^GCAGA^GAAACC^^CCA^CT 

>120 
>121 

rr^^Ir??? TG J^ TGCAG J G ^ CTCACGOT 

2££?i A £I AAG ^^ 

9^ G **«?TTTATGTATTTGGGAAAAGTAAAATA^ 

^ C 5^T* CA /£ C ^ C J G ^ 
GC ^2 AAC 7 A ^ AC n AAAGTCTC ^^ 

GGGAACTGTTAAAGOCTTACACCAAGGAAAACATAAACTTGTAC^^ 
CA I AA ^ TAG J* GGAAGCTATA ^ 

g t aa ss aa ^^ 

CAGCAG ^ G ^ ATGGACT ^CTAGCX«GGCAGGTACGCGGGTGTGCAACTGC^ 

^SS^^T G ^ c ^ TTGTGAAGAGATGGGAGTCTC cc^ 

pT^S^^^rT^^^^^ 7 ^'^ CCCG CCCCAC TT CGCGAAG CCCT AG G ATT A CG G G AGTG 

Al^ISi2^ mA( ^ GT ^ nAaACTCCt ^OTTArrrAA*GCTGG«TCTG<^CCAG 
JfJTOTCOXAWA^TAAGTAACTAGAA^^ 

^g^g^Sg1S^m CTOT ^^^ GCTG ^^ 

>124 
>125

AAG ?J AACATG ™ GCTGAGA ^ 
AGAA ^J AACA TCCCAGA^^ 

CAA J GAAACG ^ AAAAATAG 
S C ™E£ ACGTGACACTGA ^^ 

JTGAGTTTTTAAAATCTC^TGCTGACTAAAATTCAAAAAACTAGAAG^^ 
f AAAC ^^ G ^ GCTATAGCAATA ^ CCAGCT 

S2?^c£5SSa2£^ t ^^^^^ 

>126 

^ " CG ^^ GTGGGAGGACGTCCGGGTGGG <^G 

?J CCG i^ GAAG ^ CAACC ^ 

ATGTGCGCG^ACGAAGTTGGAC^ 

G ,S AAAA ^ TTCCT Jj GGrTOCTG ^ 
J C ^ GAGAAGGTGG H ACTGTa ^ CTGGGGCGT 

C T AAC I C ^ G ^ GAA ^ GTATCCG I ACAATAG ^ 

t ^ ogc i!7 tgtggga ci ic " 1 ^ n cactgttggtgatgagaaaactgvgaagcagtggaa 
g J g Z^I ac ISS ga II g ^^ tcactggaaaGa ^ 

AGACA ^^ GA 7 G ^ ACAAAGAACTW - 

AAGTAGTGTTAAATTTAACCCAATTGAGACATTTCTCTTGGGAAGTTGTGCATCT 
Il 4 Tt G J A ^ G J ACGATArG ^^ 
AAA I ACAA I C J G J^ GGAAC ^ 

CTTATATACTTTTGATATGCGTG CACTGGACACTCCTGTAATGGTCCAT>^GGAT^TGT*TC 

I^7£2J ga J g j g ^ 
AA I C I A ^ CG ^I C I^ CCTGTAGACAA ^ 

TGCAAGATGTTATCTGTGTAAAATGGACTTCTGACAGCAAGTATATTATfTT^^ 



zi c ii c 3n CA ir GTAGAATAcAGTA ^ GcA ^ T cATTT^ 

ctcttTtgttTtct^ 



C^A^^TCAAGACACATGAAGGAGCATACCTTTTTCTG^ 

£ AG l AAACA ^ AG J G ^ 

CG Z AAAAA T G ^ A !T A J^ ACAA ^ 

^Z AAAAACAAAATGTATCAAAAA ™^ 

TOCAGTAGAAAAAATCAATAGACTTAGAAAAA^ 



J^ GG J AT J G 5TI5* GG ^^^ 

GCTTAATAAAAAATTGTTrGAGGCCGGGCGTGGTGGCTCACGGCTATAATC 



•TAATAGCCATTCTTAACGTrTTATAATTAGAAGCTGin^ 
GGAGAAAC^^GACTGTGCTGC^ 
CTGTAGACCCTTGGGAGTTGTAGXX^GTTAA^ 
GATACAACCCAAGGGCAACXXXrrGGCCTTT^ 
COTGTGAAGG^TTGGCTCACCCCAA^ 
>132 

S GACG ^ GGG I C ^ GG ^ GCCCT ^^ 

r ft^A^^«^Z^^ G J?J? GGGGAA7G ^^ GG A7 A ^ GG ^ CAGGCGTTTTrCGTCCTTGTGCAG 

CGG^GTT^GCG^ 
>133 

*T**xA^ G ^ C ^ GCC J GAGGTACGATAA ^ CA I G ° C ^ 
TGAAT^^^AT^^ 

i CAA J AG ^I AA IT[ AACTG< ^ G ^CTCTTC^ 

AG 7III A ^!I CA L CTCT ^ 

toa^^g^^ 

>134 

rrr*A T A J ?£ AG ^J GAAG S^ 

AA i AA i5J G ^ s ^£^^ 



^ w J^5fJ? AAC J CAGACAGGC ^ 

A J A JS CA ^ G ^ C JP AG JJ CTC ^ 
JJ25^ AGGCA ^ A A^^ 

C ^ GAAA TCACGAATCCC^^ 
AGAACTGAAAGTTpGAGTT^^ 

TTACGCCTCACCTGAAATACTATGTGGCTATGGACTTO^AGCCAAGTTAGGT^ 



Table 4 






G AAGGG TTTTATAAAAGC^CATACAGAACTCTTGGTGGTGAAACAAGAATTCTTGCAGTAACA 
GATTTTGAGCCAACCCAGGCACG CATGGCTTTCCCTTGCTTTGATGAACCGTTGTTCAAAGC 
CAAui i 1 1 CAATCAAGATACGAAGAGAGAGCAGGCATATTGCACTATCCAACATGCCAAAGG 
TTAAGACAATTGAACrrGAAGGAGGTCTTTTGGAAGATCACTrTGAAACT ACTGTAAAAATGA 
GTACATACCTTGTAGCCTACATAGTTTGTGATTTCCACTCTCTGAGTGGCTTCACTTCATCAG 

Q^^^caaggtgtccatcta tgcatc cccagacaaacggaatcaaacacattatgctttgcag 
g catcactgaagct actt gatttttatgaaaagtactttgatatctactatccactctccaaac 

TGGATTT AATTGCT ATTCCT GACTTT GCACCTGGAGCCATGGAAAATTGGGGCCTCATTACAT 
ATAGGGAGACGTCACTGCTTTTTGACCCCAAGACCTCTTCTGCTTCCGATAAACTGTGGGTC 
ACCAGAGTCATAGCCCATGAACTGGCGCACCAGTGGTTTGGCAACCTGGTCACAATGGAAT 
GGTGGAATGATATTTGGCTTAAGGAGGGTTTTGCAAAATACATGGAACTTATCGCTGTTAATG 
CT ACAT ATCCAGAGCT G CAATTT GATGACT ATTTTTTGAATGTGTGTTTTGAAGT AATT ACAAA 
AGATTCATTGAATTCATCXUXSCCCTATCTCCAAACCAGCGGAAACCCCGACrrGAAATACAGG 
AAATGTTTGATGAAGTT7CCTATAACAAGGGAGCTTGTATTTTGAATATGCTCAAGGATTTTCT 
GGGTGAGGAGAAATTCCAGAAAGGAATAATTCAGTACTT AAAGAAGTTCAGCTATAGAAATG 
CT AAGAATGATGACTTGTGGAGCAGTCTGTCAAATAGTTG7TTAGAAAGTGATTTTACATCTG 
GTGGAGTTTGTCATTCGGATCCCAAGATGACAAGTAACATGCTCGCCTTTCTGGGGGAAAAT 
GCAGAGGTCAAAGAGATGATGACTACATGGACTCT ccagaaaggaatccccctgctggtgg 
ttaaacaagacgggtgt7gactccgactgcaacaggagcgcttcctccagggggttttcca 
ggaagaccctgaatggagggccctgcaggagaggtacctgtggcatatcccattgacctac 
tccacgagttcttct aa7gtgatccacagacacattctaaaatcaaagacagatactctggat 
ct acctgaaaagaccagttgggtgaaalttaatgtggact caaatggtt act acat cgttcac 
tatgagggtwtggatgggaccaactcattacacagct^^ 

ACCT AAGGACAGAGTAGGTCTGATTCATGATGTGTTTCAGCT AGTTGGTGCAGGGAGACTGA 

CCCTAGACAAAGCT CTTGACATGACTTACTACCTCCAACATGAAACAAGCAGCCXXXH^ 

CTCGAAGGTCT GAGTTACTTGGAATCGTTTTACC>ACATGATGGAC>GAAGGAATATTTCAGAT 

ATCTCTGAAAACCTCAAGCGTTACC 1 1 C 1 1 CAGTATTTTAAGCCAGTGATTGACAGGCAAAGC 

TGGAGTGACAAGGGCTCAGTCTGGGACAGGATGCTCCGCTCGGCTCTCT^ 

GTGACCTGAACCATGCTCCrTGGATCCAGAAAGCTGCTGAACTC^ 

TCCAGTGGAAAATTAAATATACCAACAGATGTTTTAAAGATTGTGTATTCTGTGGGTGCT 

ACAACAGCAGGATGGAATTACCTTTTAGAGCAATATGAACTGTCAA^ 

AAACAAAATTCTGTATGCTTTGTCAACGAGCAAGCATCAGGAAAAGTTAC^ 

ACT AGGAATGGAAGGAAAGGTTATCAAGACACAGAACTTGGCAGCTCT CCTTCATGCGATTG 

CCAGA(»TCCAAAGGGGCAGCAACTAG(>TGGGATTTTGTAAGA 

CTGAAAAAATTTGACTTGGGCTCATATGACATAAGGATGATCA 

C 1 1 1 1 C II CCAAG GATAAGT TGCAAGAGGTGAAACTATTTTTTGAATCTCTTGA^ 

ATCACA TCTG GATAt 1 I i I CAAACTGTTCTGGAAACGATAACCAAAAATATAAAATGGCTGGA 

GAAGAATtTTTCCGACTCTGAGGACTTGGCT AATGGTTAATACTTAACTCGAGGCATGGTCAA 

TAGAAAAAGT AGGCTGGGCGCGGTGGCTGACGCCT GTAATCCCAGCACTTTGGGAGGCTGA 

GAAGGGCGGATCACGAGGTCAGGAGATGGAGACCATCXTTGGCT AACACGGTGAGACCCCG 

TCTCCGCTAAAAATACAAAAAATTAGCCGCGCATGGTGGCAGGT^^ 

TCGGCAGGCTGOGCAGGAAAATG GCATAAACCCGGGAGGTGGAGCTT GCAGTG AGCCGA 
GATTGCACCACTG CATTCX2AGCCTGGGTGACTGAGCGAGACTCTGTCT CAAAAAAAAAAAAA 
AAAAAANfMNNNNNNNNMWNNNNN ^wu^mn 
>136

CGCGGTGGCGGCCGAGGT ACTTAAAAGT ATATCAGGGCAGTTTCATGCCAGGGAG 
CCAGGGAAGGCACCCAAGGAAGTGATGGAAGAGTAGAAGTTCACCAGG^ 
AAGGGCTCAGCAAATTTCTCTGTAACAGGATGCAX5ACCCCGCGT 
>137 

GGAGTCGACX5TGTGTTTGTGGGTGAAATGGCT GCGCAGGTCGGAGCGGTGCGCGT 
AGTACGGGCGGTGGCGGCGCAGGAGGAGCCX3GACAAAGAGGG 

CTGGGGT CTCCCCGCGGGGAG TT AAACG G CAGCXiCCGATCT AGCAGTGGGGGGTCTCAGG 
AGAAGCGGGCGCGGCX^AGCCAGGAGCCCCCT C 1 UGCTCCCCCTCACCG^CXSGCGTCGC 
AGCC GC CAAiCATtiCTG^ GOGCTCCCGCCAACQAATO 
QTTGAGCCICI tC»UCTGCCGCCTC<X»CXX>CCACCTTCGCTtS^^ 
GTCCCTGCCCCTCTCCCGGCCCCAAG~ * ~™ — " " ~ — - 
TGA£CGC<^CCGGGCCCAAGCATAAGCGCCACAAGGAGCGGCACAAGCACCATCAC4^CC 
GCGGCCCCGATGGTGATCCCAGCTCCTXSCGGAACCGATCTCAAGCACAAGG^ 
AAAACGGCGAGAGGACTGGAGGGGTGCCTCTGATCAAAGCCCCCAAGAGAGAAACACCAG 
ATGAAAATGGTAAAACCCAGAGAGCCGATGATTTTGTCTTGA^ 

AAAAAGAAACACCGAGAAGACATG CGAGGAAG ACG CCTT AAAATGT ACAAT AAGGAAG T ACA 
AACCGTCT GTGCTGGGCTGACCCGCATCAGT AAAGAAATTCTCACCCAAGGACAAATAAATA 

gcacttcaggacttaataaggagtccttcaggtatctgaaagatgaacagctgtgccgatta 
^^^g^^^atgcaagaatatcgggtaccccagggagtacaaacaccttttatgactcacca 

GGAACATTCT ATTCGTAGAAATTTCTTAAAAACAGGTACTAAATTTAGCAACTTT ATTCATGaG 
GAACACCAGTCCAATGGTGGTGCTCTTGTCCTTCATGCTTACATGGATGA^ 
TCTCCAATGGAGATGGAGAGATTTTCTGAGGAGTTTCTTGCTTTGACAT^ 
AAAAATGCT GCTT A CTATG CTTTAGCAATAGTGCATGGAGCGGCTGCTTATCTCCCAG ACTT 
CTTGGACT ACTTTGCT1TTAATTTCCCCAACACTCCAGTGAAAATGGAAATTCTGGGCAAGAA 
AG ATATTGAAACAACCACCATTTC^AATTTTCACACTCAGGTCAAC AGGACAT ACT GCT GTGG 
CACCTACCGAGCAGGTCCTATG CGGCAGATAAGTCT CGTT GGAGCAGTAGATGAAGAAGTT 
GGTGATTATTTCCCAGA6TTCCrrGATATGTTAGAAGAATCw^CCATn"CT GAAAATGACTTTG 
CCCTGGGGTACACTTTCTAGCCTCCGACTCCAGTGTAGGTCCCAGAGTGATG^ 
TAATGTGGGTAAGGCCAGGAGAACAGATGATCXXTTACAGCAGATATGCOOVAGTCACCCTTC 
AAAAGACG ACGATCAA TGAATGAAATAAAAAATCTCCAOTACCT ACCTCGGaCCAGTGAACC 
CCGCGAAGI rCTCI 1 1 GAAGATAGGACTAGAGCTCATGCTGATCATGTCGGTCAGGGGTTTG 
ACTGGCAGAGTACGGCTGCTGTTGGAGTTTTGAAAGCTC^ 
(>ACCTC<SCATAACCAAAGATGTGATTTGTTTTCATGCTGAGG 

AGACTTCAGTTAGATCTTCATGAACCTCCAGTTTOCCAGTGCGTACAGTGGGT AGATGAAGC 
TAAACT AAACCAAATGAGGCGGGAAGGCATTCGTTATGCT AGAATTCAGCTTTGCGACAATG 
ATA TCT ACTTCATCCCT AGAAATG TCATTCATCAG TTCAAAACAGTTTCGG CGG TGTGCAGCT 
TAGCCTGGCATATAAGGCTTAAACAGT ACCACCCTGTTGTGGAAGCCACTCAAAACACAGAA 
AGCAATTIjrAACATGGACTGTGGTTTAACTGGAAAGCGAGAATTAGAAGTTGA^ 
TGTGA GGATA AAAACT GAATCTGAAGAAGCATGCACAGAGATTCAGCTGrr AACAACTGCTT 
CATCATCTTTCCCACCTGCATCAGAACTTAATCTAC^ 

CAGTTTTAAAAGTGGAAAGT AGACTG GACTCTGACCAGCAACACAATCTGC^AGAACATTCA 
ACCACTJCTGJGTGATATGTACATATTC*^ 

GTTATAGTTTT ATAACTGGCTTAAGTTAAGTTTTATTGGAGAAATCTT GCCT ATAAT7CTATAA 
AGAQA AATGA CATTCCACAAATGTCAAGCAT A 1 CTTTTT1 ACACAGATTATGCAAAGTTAAGA 

gttgtatcttatcccgttagtacagtatgtaatagtgggtctgctgct actttctgttttaag 

gtgtgagctaacaattc^atct^ 

cttgtitggtaaatratrcacttcocttaattt^ 

TrTTATGGATAAATAAGATTTATGGTTTAGTACTTGTGTCTTTATGGCAC^ A 

CAGTGAAAATGGCATAGGTTGCAGGAAAGAATTTCTGCATCTGGTACATGGGCAAGT 

ATTTAGTTCTAACXACAAATAACAAGTAGTTTCCAAGGAA^ 

AACAAAAGAGAA AGCA CAGCA CI 1 1 C I U ATCCAAACTGT C 1 1 1 1 HI 1 G 1 ATCTGTTATTTAAA 
GCCXAGnTJGATATTTT^AATTACAAAAAATTT AGGATGAATGGCOCT GGGAATAAT AACGGGG 
CACTAACT GTT AGAAAT ATGCT ACGTT5GAAAATTTGGGCAAX1ACAT ATGGG CGGACCAGGTG 
ACGCGCTCCAATGTGGGGATTGCT AAGGCAAATATACGAAAAATGTGGGGAATGGCACAAA 
AATCGGGACT AATAGTACAAGTGGTTCACGAGAAAACTTGGTACTGGCT AATCT ATANNNNN 

>138 

NNGAGAGCGACGCGAC^TGTGGAGGGTCGAGAGGTTCAAGATGTTGGATGTGGC^ 
CCCCCXXXXSCATTGCTGTTGAAGCATTGCCGGCXj 
CTGACCAGGCATACTGAAGCCGGGAGTTGAGACCGCCTG^ 

GTCTGT AAGATATACAAAAATTAGCCAGGCATGGTGGCGCAAGACAGTAGTCCCA^ 
GGAGGCTGAGACAGGACAATTGCTTGAACCTAGGAG^ 

GTGCTACTGCACT CCAGCCTGGGTGACAGAGT AAGACTCCATCTCAAAAAAAAAAGAAAAAA 
ATTGACTTTGGAACCTCAGATTACATATCAGTTTGC^TAC^ 

AAAATTCAGTTACTAAAAATJACTGAT ^ATCTCCATGATTAGAACX>CACT<3TG<mGTGTGTG 
TAGTCAAAGGAGGAGAATTTTTAATGCTATATAAGCATAACTGATAACTGCT ATT ACAAAT AAA 
TATTCCACAAATTTGGAAAGTTATTAGAGGAAGAA 1 1 f 1 1 1 1 1 CCTTGTAATTTCCAGGTGTTT 
ATATTAGTTGGGCCATAGTGAAAATTACATGGAGGAAAGAAAATAGGAAAATAAGTCACAGA 

AAAAGAAAATCAAAACAAATAG AAACT CTTGGGGAACAAGT GAGTT AATT ACCGCT CATGTCTC 

CCATCCGG 1 1 U i v I AGCTCCTTGAGGGTTACTGTCTAATGCTCCACAAAAGTGCCTTACCCA 

GTGCTTGGTACAGAGAAGGCACTGAATAAATTCACAAAGGCCGATTGGTTCACCC^ 

T AGAGACAACAGACA CGCA ATTCTGACGAQGACTCCTGTT ACT AAAAG ACACAGOCTCTGAT 

ACAAGAGAGATATCCCTTTGACT AAAGCATT ACXiAGGGTCCCCAG G GCCCCCTCCCACTGG 

GGCGGTAACACT ACGGGTCTCCCCACCATATATTCCATGTCAAAGTATCT ACACAAATACAG 

AGGAAATTAAGCAAGTAAATACGGTATGTAATTGTTATCATTTGTATTTCm 

ATAAATATTTTAAAGTAAACAATATGAGTGAGTGCCTTTCATTAGCTATGATCT 

TATATTTTGACTGATCTGAATAAGCAGGTTACTGTGGAAGCATATA^ 

tgattccagtgggtacaaccaagtgtcagtacttgatacataactct ATCCCATCATTCGCAA 
ttacaggctgctgtgtggagtattaaacatgcatcntagtttttatttgtacanaatggtfy.a 
aattttcacttacatataactttccaactgtgtaagtgtt^ 
gattttttggtgtgtcattttcttctataaaatc i c » rtcttttcti \ gttttttgtttttttgtt 
tttacatgggatacaataaatcggacgcnnn 

>139 

mtgagctccccgcggtggcggccgaggtaccttcccctga 
ggggtgaagagcagtatcttcagaggccatccaagtt^ 
aatgcagagaagaggctggtgatagacaagt^ 
agagtttaaagagtttgggatggaaagaaatcaagaattgg 

>140 
>141 

ggggtgaagagcagtatcttcagaggccatccaagttttagcataacaag 

aatgcagagaagaggctggtgatagacaagtttt^tgttcacaac^ 

agagtttaaagagtttgggatggaaagaaatcaag^ 

AGCTCCAN 
>142 
>143 
>144 

j- ■ . WP^TAAGCTAGGGGCGTCX^TTCCAGAGCCT 

CCGTTGCCCTTACATCTCTCATTTGG 

AAAGTCATCAAAJA^ 

CCCAui FC1 1 1 ACTTCTGCGGTCTCL I rTCTTI IATTCCCCCGCGT 

>145

>146

^^WWACCGCGGTGGCGGCCGAGGTACGC^ 
CAAGCCC TACTGOCT G GTGAGCAAGAGAGATT(XAAAGACTTTACTTTGAAAAf^lATCrrooc 
AGCTTCTACI 1 1 1 1 1 1 IAAGG AAAAGTAGAI IHtll I Ifal I 1 1 1GI 1 1 lAAGCAAGAAC 

AGAATCTAATGACTTTTTTCATGCCATCGCTT^ \ IULI I ili I ii.h. i im ; 

CI CI 1 OCI GGCAAAGTATATACTGGATTTTTAl < GCC 1 1 C I fG G ffl 1 1 1 1 1 1 OCCTA CQTGTAT 
CGGCCGTTATGCTTAGCCAGTTTATTCTTT A7TTTTTTACTGGAGTCATTGCCAGTQATGGAA 
ACGGTCTTTGCTTCTCrrTCACT 

TTTATATTATGAti I Id ACATTCATCTTfCCAGCACTCTGAAGTT ATCAGCAAGTTCTCAGTCA 
GTTCAAGGCATTGGATTCTGCTTGAI M CIM 1 I AATTCAI Hi t 1 1 T I U AC CCC I I I G AGAGTT 
^^ T *GAGAGGAGTCTGGAAG6CAGAGATI^ 
AAGGACTTGCACTGGTCCCCAA^ 

TTGCATTTOGGCAAAATOTATGAACGGCA<^AGAQ I GACATCTCCAA 

GGTGGACAGGACT6TGGACCT 

GAGCCATGCCCTGATGCAGCATCTC1 1 CI UGCTTCCAAGCAGAGCCCAGAATGTGAGAACT ■ 
^^J^^^»J^[^^CT^^^ACAGAt^GTACCTCMAACAAJSGTGT ACTT AAAAGAGAATCT 
GGGAGTGATTCrGACCTCTTTCA^^ 
GCCAGAGGAAGAGCATTTGiiCCTGTG^ 

CCTTCACraGACCGACATTCTTCTCATGGCAGTGA l G l G TC TC TC1 CCCAQATTTTAAWQCC 
AAACAGGTCAAGAGATCGGCAAAGCCTTGATGGATTCT 
ggtcgagaaagtgagagtgagcctgctgacccaggcgacgtggaggaggaggagatggac 

AGTATCACTGAAGTGCCTGCAAACTGCTCTGTCCTAAGGAGCT CCATGCG C rCTC I 1 1 CTCC 

CTTCCG GAGGC ACAGCTGQGGGCCTGGGAAAAATGCAGCCAGCGATGCAGAAATG 

COGGAGTTriCAGTCTAGAAGGCTTGACAGGAGGAGCTGGTGT CGGAAAC AAG CCATCCTC 

ATCTCTA GAAGTAA GCTCTGCAAATGCCGAAGAGCTCAGACACCCATTCAGTGCT 

GGGTTGACTCTTTGGTGTt^CTTrCAGAAGAGGATCTGGAGTC^ 

GATGTTTGATCAGCAGATATGTTCACAGATCTAAGC^VGCAGGGATTTAATT 

CATTTCCTCTCCATTGACAAAATCCATCTCATTAATGACAATCAGCC^TCCT GG ATTGGACAA 

TTCACGGCCCTTCCACAGT ACCTTCCACAATACCAGTGCTAATCTGACT GAGAGTAT AACAG 

AAGAGAACTATAATTTCCTGCCACATAGCCCCTCCAAGAAAGATTCTGAATGGAAGAGTGGA 

ACAAAAGTCAGTCGTACATTCAGCTACATCAAGAATAAAATGTCTAGCAGCAAGAAGAGAAA 

ACAAAGAGG GGCGTCG GGG GAAGGGGAAGGAGAANN 

>147 

AC CCAAGGTG^GCATTTTTTTAAAAAACCCATGGAAATAAATGCT 
Gl IGI 1 1 GAAAATAAACAAAGAAAATGCAAACAAAACAAAAACCATGGTCCA1TCAAGCTCAA 
GAGT ATTTAACCAATGCTCTGTTGCCTCTTAAAGGATTGGTAGCT ATTTCCCCATCT ACAAAT 
ACATGACAATTAACT AAGC«CAATTCTTTAAAACTATCTGGAATTAGGTCAAAATTATCTAATT 
1 1 Ml CTGATTTAATTATGGATTACGTAATCCAATAGTTGCCAACATTAT AAAACCCT AACTTT 
ACCTCATIGTTTGGCT ATACCAGGTCTCATGACTCTGGACATAACCACCATCCTTCCTCCCAA 
CACCCCGCGT ACT CAGAGT AAAAC<XX5GAGCTTCATGATAACCATGAG GCCCGCAGCTTCT 
GCCTCCAAGGCTTCTCTGGCCTCyvCCTCCCX^GCT 
>148 

GTGGCGGCCGAGGTACCTATGTGCGCGGTGGTAGAAAAGCACCTGGGTCGGGTGC 
AGACTGCGGAGCGGGCCCTACCGTGTGCGCAGAAAGAGGAGGCG^ 



CCAAGGAAGTGTCTGTTC riGI GCTGAGTTTCAGGTGCCTTCAGCTTGTGGGAAATCCCGAA 
< ^ T GGCCAAAGACAACTC^ACTGTTCGTTGCTTCC^GGGOCT GCT GATTTTTGGAAATGTGA 
TrATTGGTTGTTGCGGCATTGCCCTGACTGCGGAGTGCAT C I I CM I GTATCTGACCAACAC 
AGCC1 C I ACCCACTGCTT GAAGCCACCGACAACGATGACATCTATGGGGCTGCXrTGGATCG 
GCATATTTGTGGGCATCTGCCTCTTCTGCCTGTCT GTTCTAGGCATTG T AGGCATCATGAAG 
TCCAGCAGGAAAATTCTrCTGGCGTATTTCATTCTGATGTT^ 
GCATCTTGT ATCACAGCACCAACACAACGAGACTTTTTCACACCCAAC^ 
GAT GCT AGAGAGGT ACCTGCCCTGGCGGCCGCCACCGCG 
>149 

NT GAGCTCCCCGCGGTGGCGGCCGAGGTACCrrCCCCTGAG 1 1 CAGA 

GGGGTGAAGAGCA6TATCTTCAGAGGCCATCCAAGTTTTA 

aatgcagagaagaggctggtgatagacaagtttcatgttcax^aactt gaattgcagaggtca 

AGAGTTT AAAGAGTTTGGGATGGAAAGAAATCAAGAATTGGGCTCXSGCCGC^^ 
AGCTCCAN 

>150

>151 

^./^^LUtlil>'AVJiiIir riTT GI rrTTGTTTTn ICIG I C CCCTCTGAGCCATGGAAG 
AT ^i 5Q *^ AACAAA ^ 7 ^ ATAA * CT ^ 

ci i iCCCTCCTCTCl 1 1 i GAATTCTT ATTCCTGAAAGAATGGAT AATGAATCAGGAGATGAGC 
AAAAAC GTATCTT TT ACAAAGCTCTAGTCTTOC^AAAGCCTCT AAACTCAAACGAAACCTTTTT 
AAAGT AGTTTTG lAAAAGCTCAAGGTATGCCATTTCCAGAAAGTTGCAGATG 
GCATTAGCCAAATl CTG1 CACACATTGAGCAATGAAATTCAGGAATTGGACAATGACCTCTTG 
GCATATGAAAGAATT AAAAGAGGGCT AGGGCTT AGGGCAGGGATCT AATCGGGGAGGATGT 

TCTGGTTT^TTGAGTGGCTAAC^ 

GGA ill W/ 1 cACCTCAGATCT AAGXlATCTCXTrGTCCATGC^GTACCTGCCCN>*<NK><WN 
>152

mnw ' n ' <t * <w * > *caggaaatcatcaatcgcagt agcagtgaagcaaatcaggtggtt 
£X» I CCCAAaAC I I CAAlil AAATGG1 UJ GC I CCTGGTTCAGCTCCACAGTTAACTACAGCCAT

tttggaaattaaagaaagcatattgtctttgctaattaaacttcaccacaaactctcaggaaaa 
caaaactcctactatcctccttggcttgatgacatagaaattttaatccaaccagaaattcct 
aaatacagtcatggagatggtataactgccgtggaaagaattttactaaaagctgcatcgca 

AAGTAGAATGAACAAACGCATCATTGAAGAGATATGTAGAAAAGTGACCCCTCCTGTACCAC 

tagagagaggcagcagaaattgcttgcggagtttgcttcaogacagaaaggctttatggaaa 

ctgcaatggatgttgattctcctgagaatgatarrcctatggagatcaccacggcagaacca 

caggtttccgagggagtatatgactgtgttatttgtggacagagtggcccctcctctgaaga 

tcgacctactggatttagtcgtcctgttacaa6catctcaagttaggggcgtggcgtgacaa 

tgtgacccaaaaagtggccaccaggaaaagaccaattacctgggatacgtgcgcgtcagaa 

gaggttacaatacgggtattaggaagcagcccgggaaatggcggagggggttgtacgggc 

cctcttaaagcaacccagccacagcgactccggtccgaaggcccgcagcagtccccacag 

acagctcacaacaagccgccaccacatccataactcactgcccnnnn 

>153 

ggtggcggccgaggtacacctgcaactgtgcgaatggtcctgttgcctcctgcatt 

TTGG GCTCTGTTCT ATAAAGG AAG AGTAAAG ATGG AGCT CCTCCTGCCTCCATCACGAAAGC 
ACATATCATCTGTCCCTTTGGATTTTACTTCCAGGACGCGTGTOGTCCCCAGCGTGTGTTGC 
CTTATGGTGCCGGC AGAGC CTCAGCTATCTGCCTGGGAAGTCGGATGTCCTTGGAGAGAAT 
TTGGAATGCAGATAAI 1 1 I Id IATTTCTTGAGAGCTTACTTTAATCAGCATGACACTACCTAA 

acactg aagatg gccttat attagtaagatttgcacaaaattaagtatacctatg caaactatt 
actttggtttttaggagtttggtcagatgaagaagtaatggtatcacatatatatgt aagaag 
acaaccatcattatttttgtaagtgtttt at aaaaacaaactg att aacttgtg aannnnnnn n 

NNNNNNNNNNNNNNNNNNNNN 
>154

tccaccgcggtggcgtccggcccccgc ci 1 1 i ctgcggctttcagcgcgcgtttc 
aggtcgtcaatgaggtcgtcggcatcttcgagaccgatggacaggcggatcgtgccctgg 
ctgatgcctgcgcccgccagcxscttcgtcgctcatgcggaaatgcgtggtgctggccggg 
tggatcaccaggctgcggcaatcgcccacgttggccaggtggctgaagaccttgagggttt 
caatgaacrrtcttgccctgctcgcggttgcccttgaggtcaaagct 

>155

r*^NGTGGATACATAAAAATGTGTAGT7TTTATTAGTTTATTATACTTTTATTAGTTC 
AGTCTCAATAAAAGGTTGAAAGGC^TTACGATAAATTGTTCTGTCTGTTTATATCATTTCTTCT 
GGCTGCTCACTGAGTCTTAATCAGTGTACTAGACT GCAAACT aaag aataagattgggggga 

ggttataaaaagaacatgtataaagcttagcaaacccttttt aatgttctgaagtcagtctttg 
taagtgaaatcgctggagactagaaagtatgaaatggcagtct acctgggcaacct acaaaa 

aattgaaccagaactggaaaattatttaggaaagtttgtt aact actctttgttgatctcatgg 
agtggggaatgggaattgaaccagaactggaaaattatttgggaaagtttattaactactc^ 

TCTGCTGAGTAAATTTAAATGTGTTCTGGAGATTGTTGAGGTCT agaattggtctatacaatg 

cccctgtacaagct acattgtgtcattgtcaaaatcaggagtttattggattagatatacttt 
gtgcttaattttagtatgattacttccgactggtgattatacg^ 

gttgggggggacctaagaatctggggagggacc^^ggtttggcgat^^ 
gcgtcaaacaannmwnnnhwnnnnnnnnnnwnnnnnn 

>156

caaaatagaaccacctgaagtaagacaaggaaaggacagtaaaaaaaggcaaggaa 
ga aqagct gagaaagagctgcaagaaaaaagatgtgagaaactgaggagaaaaccaaatgt 
gacttttcgaccagagaattccataattaataatgatgattcagaaatcattaaga 
ggaacaacaagaacattgtggttcccatcctttggatgacttcgacgttccttttgaa^ 
acaaqatgataatacttcagctggattocatttcatggcctct^ 

AAGCC^AGATGCAAGCACCAGAGCGGGCGACCGCCAGCCCGGN 
>157

>158

TCTTCTCTGI 1 IGI 1 1 1 1 1 1 1 1 1 1 1 AATTTTTACTCGTTTCTTTATTAATATAGAAAAGGAGCCC 
A GGGC AGCT g g accagtagt acaaagcaccag GAGTTAAT accattctggtg aagggg atg
gttftacaaaagtgaaggagcaggcaggagccaccaggttctgaggccaggcccagcctac 
tgcccagaacccctgaaacggctccctgggaaaaagctgacagatgggtcaggggtggatt 
gagctggaaaccatggggacagatggcagggatagagggtcatgcagtgggaaccaccca 
gtggctgataaggacagggaacttgtggctggaggctccccattgggccatgggcagggg 
cttgcagatggcctcagctctgggggcaggtagagaaactgcagagactgatgggcatgg 

AG AACCCAGACATG GCCCTGGG G CTGAAG GGCCTTTCCACCCTCTCTTCACC AGGAGCCAC 
CTTTGCTCTATACTACATATGGGGCTTCAGGGCCCAAGGCACAGGGGAGGCTCAGAGGCCT 

cggggatgagacacaggcaggacaacagcacccccgcatagtggg^ct^^ 

agggacgggcttatcctcggccagtgactaggaccagccccatggcaatggtgcctgtctc 

cagccttagcaaatcaagtgtgcaacaagcacagggtgtcggcagacctgggctctagcct 

t acaagct ctg cc^g aacttgaatcacacagagctgtatcaccatg gtccagccatgtcgtg 

ccttggcctgtttcctcctctgtcaatgagggctttgaataagacctcctaggtcatgaaaa 

ggactctgcaggtgaagggaactccaggacgtgctcttggcagttaagcagaccctggatg 

g^ctggtatgggatggggtgaagtgggggatggaggaggaaggtgttcttgcatggaac 

ctatccccacaaacactacagagtgaggaaaggccactggaagcccttctttgcccagagg 

aagaaaggccacagagagagtgaatgtggcttggtgcaatcccttcacatcctccaccatct 

gggtaacacttggcaaggagtggatgggtggcattgtgcaaccctttatgttcttccctgg 

gg gtgg cact caaggcctcttgtg gcttctgccttcagccttcagtgtagggtcaag AQTG 

TTui i i iCACTGTCGTCCTGTTCTGGGAACAGCACAAGACTCTGCCTGGCTGGCACATGTTA 

GT T GG I G ^ GG * AAGGACAGAGTGGGT <^^ 

9« hi JL 1 ^ G /^ G ^ G ^CTrCAG^GAGCAGCTCAATCAACTGAGTCGCAGATCTCCTCCACC 
ACAG^TTG^G^TGTGTGGCTGGTC^GCATAGACATGGTGGGAGGCACCCTTAATCTCCA 
TGTCTCGGACATAGGAATCCGGCCGCTGCATCTTCACCTTTTrrCCCGTACTGGTATCTATC 
C^GGTGTCGGACCCGTAGATCATAGTGATAGGCACATCTTTTCGAATCAAGTGAATTCGCTC 
CAGCATAGGGCGCCGGGOXAGCCAAAGGACTCCATCATGGCTTTGAATGCTGTCTCACCA 
CTGGGATTCTGTGCGTTGCAGTGGTAAATATACTCTGATATGGTATCATCTTCAAAGAAGTCT 
GCAAAC I TGCGmGAA G T CCGGCCGGAATCGCT^ 

GCT ACTCGAAGAACAGCCAATGGATTGGAACGTCCTAGGACAGATGCCACX3GCTTTGACCC 
AGGCTGGGGGTGCACGGATCTCACTGGGGTTAGTTGGTCGG^GGGGAAAGCCCCATGGGT 

CCTCoS^CTGTG^^ 

CTATCGATGTCACAAACTCATCCTCAGCCCCCTCCGGGTCCCrTGGGU^ATGCTGGCCTTGA 
GCTTCGCCCGAAGCCAAGCAGATCGAAGGTGTGCAGTGTGCGGCGGGCACTCAGTGAGTC 
P^^QT rGAGG ATCCAQAGACCCAGGCCGCCCGCAAAAGCATGCACCATCAGGAAGGGGGTG 
CGGTCGTTTTGCTCGGGGCTGA^^GTCACCGTCCAGATCTrATTCTGGTTTGGGAGGGATA 

TGAGACATGGAAGTGGGGCGCCACGTGGGCAGCCAGCTACTCAGCCAGCCTTGAGACTGC 
TGCT CCAGATCATTXSGCCATAGTAAACAAGCCTTGCCAGGTCXrrGGCGAGCCCGCCCAGCC 
CCCTTCT GTCGTCTTCTGGCAGOCGTAGTCG ACGOGGCCGCGAANN 
>159 

NNCACGCGTCCGGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGG<XjGCTTGAT 

CGGCAACT AGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGC 
CAGGCC 1 1 CCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGGAGATGAGGAACATGACTCTC 
CCCCTTCGGAGGAGGAAGGAAGTCCCGCTGGCACCTT ATCTCTGCTCCTCTGCCTCCTCCC 
TGTTCCCAGAGCTTTTT CTCTAGAGAAGATTTTGAAGGCGGCTmGTGCTG ACGGCCACCC 
ACCATCATCT AAAG AAGATAAACTTGGCAAATGACATGCAGGTTCTT CAAGGCAGAATAATTG 

CAG aaaatcttcaaaggaccctatctgcagatgttctgaatacctct g AGAAT AGAGATTGAT 
TATTCAACCAGGATACCT AATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTGAG 
TTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCT 

GGCACAGAATTGCTGGGAAGCCTCTGTTC»ACTGTCAGATCCCCGAGGTTCAGAGGACGGA 

I^ gcag j g ^ cgaaaaaa ^ 

G J GG * G £T GGGGTCOC l GGA ^ T CATGAACAAAACGAGAAA 
CC^CCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCC<» 

AGCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCCCCCTC 
GTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTT AACAACACTGGCT ACAG AA

gaatggcttggattaatcaagaattctcgcttctataattacactgtttgtcgcaatggcatc 
aaagaaaagcatgg atttgattatgcaaaggact acttcacagactt aatcactaacg ag agc 
attaattacttcaaaatgtctaagagaatgtatc^ccataggcccgttatgatggtgatcagc 

CACGCTGCGCCCCACGGCCtXX^GGACrrCAGCCCCAC AGTTTTCT AAACTGTACCCCAATG 
CTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGC 
AGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGG 
CTa^ACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGAC 
GGGGGAGCTGGAGAAT ACTT ACATCATTTACACCG CCGACCATGGTT ACCATATTGGGCAGT 

ttggactggtcaaggggaaatccatgccatatgactttgatattcgtgtgc g i i i 1 11 iattc 
gtggtcgaagtgtagaaccaggatcaatagtcccacagatcgttctcaacattgacttggcc 
cccacgatcctggatattgctg ggctcgacacacct cct gatgtgg acggcaagtctgtcc 

TCAAAci I u f GGACCCAGAAAAGCCAGGTAACAGGTTTCG AACAAACAAG AAGGCCAAAATT 

tggcxstgatacattcctagtggaaagaggcaaatttctacgtaagaaggaagaatccagcaa 

Gj^TATCX^VACAGTC^AATCACTTGGCCAAATATGAACGGGTC 

ccaggtaccagacagcctgtgaacaaccggggcagaagtggcaatgcattgaggatacatc 
tggcaagcttcgaattc^caagtgtaaaggacccagtgacctgctcacagtccggcagagc 
AGGGGGA *2ST£E* CGCTOC ^^ 

^T^IPjZT^^^^ GCCAG CAGAAG CCAAAGAAA GAG TCAACG G CAATTCTT G AGAAACCAGG G 
GACTCCAAAGTACAAGCCCAGATTTGTCCATACTCGGCAGACACGTTCCTTGTOCGTCGAAT 
TTGAAGGTGAAATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGTTGCAACCA 
AGAAACATTGCT AAGCGTCATGATGAAGGCCACAAGGGGCCAAGAGATCTCCAGGCTTCCA 
GTGGTGGCAACAGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGGGCCCACCTACCA 

I£I* C ^ TCGGCCAGAGCGTGG ^ GACCATA >^ G ATACATTG^ 

TCT gcaagataaaattaagaatttaagagaagtgagaggacatctgaagagaaggaagcctg 

AGGAATGTAGCTGCAGTAAACAAAGCTATT AGAAT AAAGAGAAAGGT GTAAAAAAGCAAGAG 

aaattaaagagccatcttcagcgattcaaggaggctgctgaggaagt AGATAGCAAACTGCA 

ACTTTTCAAGGAGAACAACCGTAGGAGGAAGAAGGAGAGGAAGGAGAAGAGACGGCAGAG 
GAAGGGGGAAG^T^^GCCTGCCTGGCCTCACTTGCrTGACGCATGACAACAACCACTGG 
CAGACAGCCCOSTTC^^ 

CT ACT G GTGTT TGCGT ACAGTT AATGAGACGCATAATTTTCTTTT CTGTGAGTTTGCTACTGG 

V' ' ' '' ggag tajtttgatatgaatacagatccttatcagct 

agaacgaggcattttgaatcagctagacgtagaactaatggagctcagaagctgtcaaggat 
ataagcagt GCAACCCAAG ACCTAAGAATCTT gatgttggaaataaagatggaggaagctat 
gacct acagagaggacagttatgggatggatgggaaggttaatcagocccgtctcactgca 
gacatcaact ggcaaggcctagaggagctacacagtgtgaatgaaaacatctatgagtacag 
acaaaactacagactt agtctggtggactggactaattacttgaaggatttagatagagtatt 

ACG^GTTCTTGGTTCn^ 

AGATGAAGACXXAAGGCATAAGGTTGGGAAAACaS^ 

AAACCCT GCATTTGAACCGACCAACATTAAGTCCAGAGAGTAAACTTGAATGGAATAACGAC 
ATTCCAGAAGTTAATCATTTG^TTCTGAACACTGGAGAAAAACC^^ 

TGAAGAGACTAATCATCTGGAAACCGATTTCAGTGGCGATGGCATGACAGAGCTAGAGCTC 

gttcatcagatgttcaccatggccaccgcagaacaccgaagtaattocagcatagcgggga 
cgcct c ctcttca ctctcctctgattagatgaaactgtt accttaccctaaacacagtatttc 

1 1 i i iAACi 1 1 1 1 1 A TTTGTAAACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCft 

ISS^ISA^lll^Jt '¥lll Gi 1 ' iuiactaaaacagtattatcttttgaatatcgt 

AGGGACATAAGT ATAT ACATGTT ATCCAATCAAGATGGCTATAATGGGCTTTGTCAGAGAT AA 
aacttgaodcccgtgtcaaattgacvvtcacactctgcatgtctgcgtaatgaaggtacgatg
caactataaccagtgcaatatgacactgacactatattaaattcaataatacnn 

>160 

nncacgcgtccggctaatgaatcttggggccggtgtcgggccggggcggcttgat 
cggcaactaggaaacxx^aggcgcagaggccaggagcgagggcagcgaggatcagaggc 
caggccttcccggctgccggcgctcctcggaggtcagggcagatgaggaacatgactctc 
ccccttcgg ag g agga ag gaagtcccgctgccaccttatctctgctcctctg cctcctccc 
tgttcccagagcttttt ctctagagaagatttrgaaggcggcttttgtgctgacggccaccc 
acxjatcatctaaagaagataaacttggcaaatgacatgcaggttcttcaaggcagaataattg 

CAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGAT 

tattcaaccaggatacct aattcaagaactccagaaatcaggagacggagacattttgtcag 
ttttgcaacattggaccaaatacaatgaagtattcttgctgtgct ctg gttttggctgtcctg 
ggcacagaattgctgggaagcctctgttcx5actgtcagatccccgaggttcagaggacgga 
tac^gcaggaacgaaaaaacatccgacccaacattattcttgtgcttacx^tgatcaagat 
gtggagctggggtcc ctgc aagtcatgaacaaaacgagaaagattatggaacatggggggg 
ccaccttcatcaatgcctttgtgactacacccatgtgctgcccgtcacggtcctccatgctc 

ACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTrCCCCCTC 
Q^^^ ^AQQCC ATGCATGAGCCTGGGACTTTTGCTGTATATCTTAACAAC^CTGGCTACAGAA 
CAGCCi 1 1 I 1 1 GGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGA 
GAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACACTGTTTGTCGCAATGGCATC 
AAAGAAAAGCATGGATTTG ATTATGCAAAGGACTACTTCACAGACTT AATCACT AACGAGAGC 

attaattacttcaaaatgtctaagagaatgtatccccataggcccgttatgatggtgatcagc 
cacgctgcgccccacggccccgaggactcagccccacagtttrctaaactgtaccccaatg 
cttcccaacacataactcctagttataactatgcaccaaatatggataaacactggattatgc 

AGTACAC AGG A CCAATGCT GCCCATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGG 
CTCCAGACTTTGATGTCAGTGGATGATTCT^GTGGAGAGGCTGTATAACATGCTCGTGGAGAC 
GGGGGAGCTGGAGAAT ACTT ACATCATrTACACCGCCGACCATGGTTA(XATATTGGGCAGT 

ttggactggtcaaggggaaatccatgccat atgactttgatattcgt gtgcctttttttattc 

GTGGTCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCC 
GCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCC 
^CAAACTrCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAG^CCAAAATT 
TGGGGTGATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAA 
GAATATCX^AACAGTCAAAT(^CTTGCCCAAATATGAACGGGTCAAAGAACT ATGCCAGCAGG 
^G^ACCAGACAGCereTG^AC^ 

ACGCG GAACCT CT ACGCTCGCGGCTTCX^TGACAAAGACAAAGAGTGCAGTTGTAGGGAGT 
CTGGTT ACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGG 
GACTCCAAAGTACAAGGCCAGATTTGTCCATACTCGGCAGACACGTTCCTTGTCC^ 
TTGAAGGTGAAATATATGACATAAATCTGGAAGAAGAAGAAGAATTGG^GTGTTGCAACCA 
AGAAACATTGCT AAGCX5TCATGATGAAGGCCACAAGGGGCXIAAGAGATCTCCAGGCTTCCA 
GTGGTGGCAAGAGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGGGCCC^ ACCA 
CTGTCCGAGTGACAC^CAAGTGTTTTATTCTTCCCAATGACTCT 

TGTACCAATCGGCCAGAGCX3TGGAAGGACCATAAGGCATACATTGACAAAGAGATTGAAGC 
TCT GCAAGATAAAATTAAGAATTTAAG AGAAGTGAGAGGACATCTGAAGAGAAGGAAGCCTG 
AGGAATGTAGCTGCAGTAAACAAAGCT attacaataaagagaaaggtgtaaaaaagcaagag 
A AATTA AAGAGCCATCTTCACCCATTCAAGGAGGCTGCTCAGGAAGT AGATAGCAAACTGCA 
AC I II I CAAGGAGAACAACCXaTAGGAGGAAGAAGGAGAGGAAGGAGAAGAGACGGCAGAG 
GAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCACTTGCTTCAX 

CAGACAGCCCCG 1 1 CTGGAACCTGGGATC7TTCTGTGCTTGCACGAGTTCTAACAATAACAC 
CTACTG GTGTT TGCGT ACAGTTAATGAGACGCATA A f IIICII1 1 CTGTGAGTTTGCTACTGG 
CTTTTTGGAGT ATTTT GATATGAATACAGATCCTTATCAGCTCACAAATAC^GTGGACACGGT 
AGAACGAGGCATTTTGAATCAGCTACACGTACAACT AATGGAGCrrCAGAAGCTGTCAAGGAT 
ATAAGCAGT GCAACCCAAGACCTAAGAATCTT GATGTTGGAAATAAAGATGGAGGAAGCTAT 
GACCT ACACAGAGGACAGTTATGGGATGGATGGGAAGGTTAATCAGCCCCGTCTCACT
GACATCAACTGGCAAGGCCTAGAGGAGCTACACAGTGTGAATGAAAACATCTA ' 
ACAAAACT ACAGACTT AGTCTGGTGGACTGGACTAATTACTTGAAGGATTTAGATAGAGTATT 
tgcactgctgaagagtcactatgagcaaaataaaacaaataagactcaaactgctcaaagtg
agg6gttcttggttgtctctgctgagcacgctgtgtcaatggagatggcctctgctgactc 
agatgaagacccaaggcataaggttgggaaaacacctcatttgaccttgccagctgacctt c 
aaaccctgcatttgaaocgacow>ttaagtccagagagtaaacttgaatggaataacgac 
attccagaagttaatcatttgaattctgaacactggagaaaaaccgaaaaatggacggggc^ 
tgaagagactaatcatctggaaaccgatttcagtggcgatggcatgacagagctagagct c 

GGTCCTGGAAAGGACATTTTTG AAGATCAACT ATATCTTCCTGTGCATTCOGATGGAATTTCA 



agatgttgacx^ggtggagaagaatcacgaaaaggagaagtcacagcacctagaaggcag 
cgcct c ctcttca ctctcctctgattagatgaaactgttaccttaccctaaacacagtatttc 
tt7ttaact 1 1 1 1 1 atttgtaaactaat aaaggtaatcacagcc accaacattccaagct accc 
tgggtacct7tgtgcagtagaagctagtgagcatgtgagcaagcggtgtgcacacggagac 
tcat cgtt at aattta ctatctgccaagagtagaaagaaaggctggggatatttgggttggc 
ttggttttgai i t 1 1 1 gcttgtttgtttgttttgtact aaaacagtattatcttttgaatatcgt 
a gggac ataagtatatacatgttatggaatcaagatggctataatgggctttctcagagataa 
aacttgacccgcgtgtcaaattgacatcacactctgcatgtctgcgtaatgaaggtacgatg 
caact ataaccagtgcaatatgacactgacactatattaaattcaataatacnn 

>161 

^M^I^WNNNNIWNNNNNMN^4NNNN^W^^NNGGGGTGTGGGGTGGGGTGGTGGGG 
GGGGGGGG GGGGT GGTGTCTGGCAGTCTTTCAGGGTGCAATTTTGTTTATATACACTGTAT 
GTATATAI I ICI 1 1 1 AGATTTGGCTGT AGTGGACTGGCCATGGTTCAAGTGGGACTATAGCAG 
TACATGGGTCAGGGACAGTCATTTTGGCTATGTACACATTCATAGTCGGTXXATGGCTTCCA 
A CTAGTA GOGCT ATTTCXXSAAGGTCTAATACACAAACTGAACCATCTGATG CACTGGCTCCA 
Axji i FOl CTCCTGGTGCATTCCAGCAAACTTCAAATATTCCAGCTGTTCCCCTATAGCTGTGA 
ACT AGAGCACCT GTCT GCGTGTTCCAGATGTGTACACATTTGTGAAAAGAACCACTTGCCAG 
ATACCTGCCATCAGGACTGAAAGCTACAGTGTACACAGGCTCTTGGTGTTTTGTCAAGGTAT 
GGATGCATATCCCTCGGTCT ACATCCCAT AACCT AACAGTAGAATCAAAGGATGCACTTGCT 
AAGATAAGGTTGGCATTTGGATTATTAGTCCGTGGTCXrrGTTGGACT CCATTTGATAGTATAA 
ATTTCl 1 1 ATTATGTGCTTGCAAATCATGGACACAATTGT CTTGTTTCAT ACTCCATATCTTTAA 
AGTCATGTGGTCAGAACAGGAGG CCAA GAGATTGCCAGTTGGGT CCCATTTGATAGCATTTA 
CTTCATTCGTATGTCCTTGGAATGTTTTAATAGGTCTGT CTTGTCCTAATTTACAGACATGAAT 
GCACATATCTGTACTACAAGAAGCAAAGGTGTTGTTGCTCTGCCAATCAACATCCAATGCTG 
GTGCTGAATGAAAAGGAAACTGTTGCTTGGCTTCACCAGTATGTGCGTCCCAAATAATTC 
GTCTTGTCTACT CCAGCACTT AGGATGAAA I I fCCI I IC I I A TTCCATTTTAATGCAAATATAG 
GGCCTTT ATGCTGCCCT AAGGTGCTAGCAAGGTTACCATCTTT AGTCCAT ATTCTGGCAAAC 
CCATC ATAGG AACCAGTTGCTAGAAGTGTACCTTCACT ATTCCAATCTAGAGATGTGACATCC 
TTG I reCTl UGAACATCTTGGCU FOCI ICI CGTATAGAATGTCTAAGTACTAACTGTGTAGAG 
CCACTGGTGCTGT1 CI CACTAAGATTCCATATTCTTGCT GTrGAGTCTCCAGACCCTGATGCT 
AGGAGATCACT AACAGGGTTCCAGGCACAGATAAAAACTTCAGATTCATGGC5CCCGCAACAC 

TGCTATAGT ATGTGCTCCATTCTCCTCCCCATTTGC 1 Q I G I 1 1 1 C I C CATTTTTTGCAGATCCT 
16 M GGCTGGCT GCAGCTGCGGCAGCTGCAGCAGCTGCTGCXrrGTTGCTGTGCAAGGTTAT 
CTCTATAAGC1TGTTGTCTTGTTTGT ACT ACATCAGGCATT ACGGCATCT ATCAGGGACAGAG 
ACTCT ATTGGTCGACCATCAAACAAGGTACCATCCTCATTAATACT AACTT CTGCTTCT ACAT 
ACTGTAGACC 1 1 ICI GGATQATAGAAATCAATGCAGCGGGTGGGACGAGGGCACCATTTATA 
TTGGACTGACTGATATGGCTTTCTATACCAAAGGTAAATGCT GAATGAGAAAATCCTGACTCT 
TGCAAGTATCTATATACCAAGAAGTTGACCTCATGACT GCTT ATAGTCATCXTTATTCCCACTT 
AAACCATGAGGTCACAACACAGGATATAACCCATTGGCAGTGCATTGATGTGGGGATGT 
ACTGAATATCCGGTCACCGCGAATCACAAGTTGCTG T I G t IGATGCTGGAAACGGTGGCCTC 
CAACGCCGCTCCCCCCTCCCXKSGAATGGAGGCACAAG 
GATCCN
>162 

CGGCAACTAGGAAACCCCAGGCGCAGAG^SCCA^
CAGGCCTTCC^XSGCTGCCGGCGCTCCTCGG^GGTCAGGGCAGATGAGGAACATGACT 
CCCCTTCGGAGG AGGAAG GAAGTCCCGCT GCCACCTT ATCTCTGCTCCTCTGCCTCCTCCC

TGTTCCCAGAGCI 1 1 1 1 C I CT AGAGAAGATTTTGAAG GCGGCTTTT GTGCT GACGGCCACCC 

ACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTG 

CAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGAT 

TATTCAACCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGACGGAGACATTTTGTC^ 

TTTTGCAACATTGGACX^AAATACAATGAAGTATTCTT GCTGTGCrrCTGGTTTTGGCTGTCCTG 

GGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGA 

TACaGCAGGAACGAAAAAACAT(XGACCCAACATTATTCTTGTGCTTACCGATGATCAAGAT 

GTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGG 

CCACCTTCATGAATGCCTTTGTGACT ACACCCATGTGCTGCX^CGTCACGGTCCTCCATGCTC 

AC<^GGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTrCCCCCTC 

GTGG<^GGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAA 

CAGCCI 1 1 1 1 IGGAAAATACCTCAATGAATA7AATGGCAGCTACATCCCCCCTGGGTGGCGA 

GAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTW^CTGTTTGTC^CAATGGCATC 

AAAGAAAAGCATGGATTTGATT ATGCAAAG GACTACTTCACAGACTTAATCACTAACGAGAGC 

ATTAATTACTTCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGC 

CACX3CTG(^CXXXX^CGGCCCCGAGGACTCAGCCCCACAGTTTTCT AAACTGTACCCCAATG 

CTTCCCAACACATAACTCCTAGTTATAACTATGCACCAAATATGGATAAACACTGGATTATGC 

AGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAAC^TTCTACAGCG 

CTCCAGACTTT GATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGAC 

GGGGGAGCTGGAGAAT ACTT ACATCATTTACACCGCCGACCATGG TT ACC^AT ATTGG GGA G T 

ttggactggtcaaggggaaatccatgccatatgactttgatattcgtgtgc c i lllll i attc 
gtggtccaagtgtagaaccaggatcaatagtcccacagatcgttct caacattgacttggcc 
cccacgatcctggatattgctgggctcgacacacctcctgatgtggacggcaagtctgtcc 
tcaaacttcrrggaccc^gaaaagccaggtaacaggtttcgaacaaacaagaaggcc^ 
tggcgtgatacattcctactggaaagaggcaaatttctacgtaagaaggaagaatccagcaa 

GAATATCCAACAGTCAAATCACTTGCCCAAAT atgaacgggtcaaagaact atgccagcag g 
CCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATC 
TGGCAAGCTTCGAATTCACAAGTGTAAAGGACCCAGTGACCTGCTCACAGTCCGGCAGAGC 
ACGCGGAACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGT 
CTGGTTACCGTGCCAG CAGAAG CCAAAGAAAGAGTCAACGGCAATTCTTGAG AAACCAGGG 
GACTCCAAAGTACAAGCCCAGATTTGTCCATACTCG GCAGACACGTTCCTTGTCCGTCGAAT 
TTGAAGGTGAAATATATGA£^TAAATCTGGAAGAAGAAGAAGAATTGGAAGTGTTGCAACCA 
AGAAACATTGCTAAGCGTCATGATGAAGGCCACAA 

CTGTCCGAGTGACACACAAGTGTTTTATTCrrTCKXAATGACTCTAT^ 

TGTAGCAATCGGCCAGAGCGTGGAAGGACCATAAGGCATACATTGACAAAGAGATTGAAGC 
TCTGCAAGAT AAAATT AAGAATTT AAGAGAAGTGAGAGGACATCTGAAGAGAAGGA G 
AGGAATGTAGCTGCAGTAAACAAAGCTATTACAATAAAGAGAAAGGTGTAAAAAAGCAAGAG 
AAATTAAAGAGCCATCTTGACCCATTGAAGGAGGCTGCTCAGGAAGTAGATAGCA 
ACTTTTCAAGGAGAACAACCGrTAGGAGGAAGAAGGAGAGGAAGGAGAAGA 
GAAGGGGGAAGAG TGCAGOCT GCCT GGCCT CACTTGCTT CACGCATGACAACAACCACTGG 
CAGACAGCCCCGTTCTGGiAACCTGGGATCrrTTCTG TGCTTGCACGAGTTCT AACAAT AACAC 
CTACTG GTGTT TGCGT ACAGTT AATGAGACGCATA A 1 1 1 I C I 1 1 1 C TGTGAGTTTGCTACTGG 
Willi I GGAGT ATTTT GATATQAATACAGATCCTTATGAGCTCACAAATACAGTGCACAOGGT 
AGAACGAGGCATTTTGAATCAGCTACACGTAGAACTAATGGAGCTCAGAAGCTGTCAAGGAT 
ATAAGCAGTGCAACCCAAGACCTAAGAATCTTGATGTTGGAAATAAAGATGGAGGAAGCTAT 
GACCTAGACAGAGGACAGTTATGGGATGGATGGGAAGGTTAATCAGCCCCGTCTCACT 
GACATCAACTGGCAAGGCCT AGAGGAGCT ACACAGTGTGAATGAAAACATCT ATGAGTACAG 
ACAAAACT ACAGACTT AGTCTGGTGGACTGGACT AATT ACTTGAAGGATTTAGAT AGAGT ATT 
TGCACTGCTGAAGAGTCACTATGAGGAAAATAAAAGAAATAAGACTCAAACTGCTCAAAGTG 
ACGGGTTCTTGGTTGTCTCTGCTGAGCACGCTGTGTCM 

AGATGAAGACCXI^AGGCATAAGGTTGGGAAAACACCT CATTTGACCTT GCCAGCTGACCTTC 
AAACCCTGCATTTGAAOCGACCAACATTAAGTCCAGAGAGTAAAC^ .. 
ATTCCAGAAGTTAATCATTTGAATTCrrGAACACTGGAGAAAAACCG^ 

TGAAGAGACTAATCATCTGGAAACCGATTTCAGTGGCGATGGCATGACAGAGCTAGAGCTC 
GGGCCCAGCCCCAGGCTGCAGCC^TTCGCAGGCACCCGAAAGAACTTCCGCAGTATGGT
GG TCCTG GAAAG GAGATTTTTGAAGATCAACT AT ATCTT CCT GTG CATTCCGATGGAATTTCA 
GTTCATCAGATGTTGACCATGGCCACCGCAGAACACCGAAGTAATTCCAGGATAGCG GGGA 
AGATGT TGACC AAGGTGGAGAAGAATCACGAA^GGAGAAGTCACAGCACCTAGAAnGGAn 
CGCCTCCTCTTCACTCTCCTCTGATTAGATGAAACTGTTACCTT AGCCTAAACArJVfiTATTTf; 
TTTTTAACTTTTTT ATTTGTAAACTAATAAAGGTAATCACAGCCACCAACATTCCAAGCTACCC 
TGGGTACCTTTGTGCAGT AGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGAC 
TCA TC GTTAT AATTTA C T ATC TGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGC 
TTGUH f iGAl 1 1 1 1 1 GCTTGTTTGTTTGTTTTGTACT AAAACAGTATTATCTTTTGAATATCGT 
AGGGACATAAGTATAT ACATGTT ATCCAATCAAGATGGCTATAATGGGCTTT CTCAGAGATAA 
**CTTGACCCCCGTGTCAAATTGACATCACACTCTGCATGTCTGCGTAATGAAGGTACGATG 
CAACT ATAACCAGTGGAATATGACACTGACACTATATTAAATTCAATAATACNN 
>163 



CACTAin 1 1 1 1 1 1 1 1 1 1 1 gagatggagtctcgctctgtcgcccaggctggagtgca 

ATGGCACAATXnCAGCTCACCGCAACCTCCACCTCC^^ 

GGGGAGACATCCCTCAGTGCTTAGACATATTCTGAGCCT ACAGCAGAGGlAACCTCCAGTCT 
CAGGACCATGAATCAAACTGCCATTCTGATTTGCTGCCTT ATCTTTCTGACTCTAAGTGGCAT 
TCAAGGAGTA CCTCT CTCTAGAACTGTACGGTGTACCTrGCATCAGCATTAGTAATCAACCTGT 
TAATCCAAGGTCTTT AGAAAAACTTGAAATTATTCXnXK^AAGCCAATTTTGT(XIACGTGTTGA 
GATCATTGCTACAATGAAAAAGAAGGGTGAGAAGAGATGTCTGAATCCA^ 
TCAAGAATTTACrrGAAAGCAGTTAGCAAGGAAAGGTCT AAAAGATCTCCTTAAAACCAGAGG 
GGAGCAAAATCX^TGCAGTGCTTCCAAGGATGGACCACACAGAGG CTGCCTCTCCci»TCAC 
TTX«*ACATGGAGTATATGTC*AGCCAT^^ 

^AGCTATTCAGTAATAACTCT ACCCTGGCACTATAATGTAAGCTCT ACTGAGGTGCTATGTTC 
TTAGTGGATGTTCTGACCCTGCTTCAAATAT^ 

AGGAATCTTTC TGCTTT GGGGTTT ATCAGAATTCTCAGAATCT CAAAT AACT AAAAGGT ATGC 
AATCAAATCTG Ct I IT T AAAGAATGCTCT TT ACTTCATGGACTT CCACTGCCATCCTOCCAAG 
«5GOCCAAATTCTrrCA^^ 

tTSI*™™' ' "-^'ATATTGI 1 1 1 CAGTGTACATGGAATAACATGTAATTAAGTACTATGTATC 
AATGAGT AACAGG AAAA TTTTAAAAATACAG>TAGATATATGCTCTGCATGTTACATAAGATAA 

CTAAATGTTGAAAGACCAAAAGGrTTAAT AAAGT AATTATAACTAAGAAAAAAAAAAANNNNN 

>166

>167 

>168 

>169

^^J#O»O»IGGA0CCGGCrGCGG^ 

CATCAGGCACGATAAAAGCCATGTTGAKaTGGCCCAGGTCAGTTTGCTGAGA 
TGAGGTCAATTTTAGAGAGATACCTTCACATGTGCT ATCGAAAGTATGGATGTATTTTACGTA 
CAAGGTTC^AC^AACAGCT^^ 

CACTGGAACTGCTGATGGCTGCGAACTTCTTAGATTGTTAAAT^ 

G TTAACT CI 1 1 1 CAGTATTTAATACCTGT AGTTCATTTTGGGGJkTTTCAAAi^AX •_ I n il fC CCA 
CTGAAGGTCCAAAATGCTCCCAATGAGCAI I ICLI I IU AGGCACGAGGTCAG G T C CI Ca aaa 

AATGTGCAGGGCACTGAAATAGAGTGAAAAAGACACT^ 
cagttgaggggagataggcaatcaaatatacaattataatacagaatacattcaataatagtg

GAAGTACCTGCCCGGGCGGCN 
>170 

AT AT ATT ATCTT AT AGAAAT AACT AAGGGAAGTTAGTGCCTTGTGACCACATCTATGT 

GACTTTT AGGCAGTAAGAAACTATAAGGAAAGGAGCTAACAGTCATGCTGTAAGTAGCTACA 
GGGAATTGGCTrAAAGGGCAAGCTGGTTAGTACTT AGCTGTGTTTTTATTCAAAGTCTACATT 

ttatgtagtggttaatgtttgctgttcattaggatggtttcac^gttaccatag^ 
agcaacaggtcx^aaaaagtaggggatgattttcrrccatgtaatccagggagaaaa 
tgaccattgttggttgggagactgaaggtgattgaaggttcaccatcatcctcaccaacttt 
t gggccataa ttcacccaaccctttggtggagcct GAAAAAAATCTGGGCAGAATGTAG GAC 
TTCTTT ATTTTGTTrAAAGGGGTAACACAGAGTGCCCTTATG AAGGAGTTGGAGATCCTGCAA 
GGAAGAG AAGGA G TGAAGGAGAGATCAAGAGAG AGAAACAATGAGGAACATTTCATTTGAC 
CCAACATCXnTTAGGAGCATAAATGTTGACACT AAGTTATCGCTTTTGTGCT AAAATGGACAG 
TATTGGCAAAATGATACCACAACTTCTTATTCTCTGG^ 

CATCAAATGGAGTTAAATACATATTTGAAATTTAGGTTAGGAAATATTGGTGAGGAGGCCrrCA 
AAAAGGGGGAAACATCr T T TGTCTGGGAGGATATTTTCCATTTTGTGGATTTCCCTGATCTTT 
TTC T ACCACC CT GAGG GGTGTTGGGGGAATTATC^TTrrGCTAC^TTTTCAGAGGTCATCCA 
GGA 1 1 1 1 iGAAACTTT ACATTCTTTACGGTTAAGCAAGATGTACAGCTC^TCAAAGACACT 
AATTCCTTCTTAGAAAAAAT AGT GCTAAGGAGGNNN 
>171 

NNNCACAGTGGGCACTTCTGCCATGAGCAGAGAACT GATG<2GACATGTTTTGCTGC 
TTGGT AGCACTTT AAAAAATTTTTTGATTAATGAAGAAAGTAAAACCATAAACATTTGCCAAAA 
ATTCATGCCCCAGTATTAGCAATGAATTAGTTGCATTGGTTTGAGAAAGGO 
GGAAATCTTGGTGT AACTT AAATATTTGAAAATTACCTTT AATGCAATGCATATCTGTTTATTC 
TGGGAAATGTTTTAATGCCAGGGCXrrGCTGAGTTGC I ICI ICI IGTGGAG AI 1 1 1 1 1 1 1 1 AAT 
CT CCT GAGTTGTATAAAAGTTGTACCTGCCCGGGCGGCCGCT CGACGGCCGAGGT ACAGTC 
ATAATCTC 1 1 1 1 CAAGCCGGCCTAGCCCC 1 1 GCCGGAACCTCGGCTCCCOCGCAAOGAAACT 
ACTGCT AAGCCAACT GGACT ACACTTCGCAGACTG CTTGGAGCCTCTCT CT CGGCAGAACCT 
CGTCTTCCGCGAGCI 1 I 1 CCTGGAGGTTCT AGGAGGGATGCCCCTCAATG<XiACGACGCCA 
TTTCCT ACT ACGACTTCCATCATGCTCCX5CGCGGCT CGGGGCGGNNWN 
>172 

NCGGGAAGTTCCAACTGTGGCAGAAATGAAGATGGAACTTCTCAAGAATAAA 
TTGGCOGGGTGTGGTGGCTCACG<XrrGAAATCCCAGCACTTTGGAAGG 
GATCACATTJAGGCCAGGAGTITCGAGACCAGCCTGGCCAATAT 

T AAAAAT ATAAAAATGAGACGGGCATGGTGGCACACACCTGTAATCCT AGCT ACTCAGGAGG 
CTGAGGCACGAGAGTCGCTTGAAOCX^AAAGGCAGAGTrrG 

CCTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTC^ A 
GCACTCATCCCCTAGATATCCCCCAAGTGTCTCT C 1 1 i XXXTTTGGTCAAGGTTATAACTGG 
GATAATGATTATGACAGACTGGCTGGGTGTCAGAGGTACAGATTTAAGGTTGATGGACT 
GGT AAG GATAGCT AC AG CTGT GTGGGGCTGAAGXSTCTGTGGCACTGAGCT ACTTGGGGAAG 
GAGGGCiuiui I I iCATTGTGACACACT GAGTTAAT AAAGCACTT ACT GAGGGAGCGAGAGC 
CCAAACTCTAAATGTGC I G I AGAAAAAGGGCCAAGTCATTGACTGCACCAC 1 CCI IGAGCCA 
GAGGT AGAAAGGATTTACT CT T CAGCCATCTGGTAGAGCCCCAAGAACAAGTTACATGTGGA 
CAAAGGGAGGGAGAGGT ATCATGGTGATT AATAAATTCAAACAAAG^TTGAATGATAAGAOCC 
CAGGATGGAATACAGTCTGAGAAAGGCCTGGGCAAAGGGAGGCAGAGG^ 
CAGGTCAAG GAAGATACACC CATGGTAGCCT AGGAAATTGGCAGT AGTGTTGGTGGGTGGG 
ATGGTGTAOCACTTGCGCTGGGCATAGGTGAACA 



CCCAGGAACGATCATCAACACCAATGCAGCT ATCCCGGGACATGTCC^CATCTGCCACTCCT 
ATCCGGAACTGCTGGGAGCG^nTCACT GTCACTTCCCAGT A G ' I G 1 CI GCCACTGGTGACGG 
CTGTGTCN 
>173 

OCACGCGTCCGAT AGGTGAAAAAAACACTGCCA TTGACAAGTCAAGG AACCCAGGG
CCAGCTGGAAGTGTGGAGCACACATGCTGTGGAGCACACATG 

TCTGAGGTTTGTGT AGT AGTGGAAGATTTTAGGTATGT AGAGCAAGTTGAAAATGGATTGAG
ACTGCATGGTGGCATAAATGAGAAATTGCCTGTAGC>TCTAGTCT ACTTGAAGGAAGTGGAG 
ACAT AAGGAGAGACAAAAACAGGTTTGTGCCATAAAGT A I 1 1 1 1 1 C AAAGACACCAAGATGTG 
GTAAATGAAAAi iamaui I CaC I TuCC'I UUTGCCATG AAACTTT GCCTT AAGAAG GTGCTGG

ATTCCAAGGTTrGTAAAGGCATCTCGGTAAAGACTGCTTTTTGAATGCATATGATTTTGCATC 

AGCT AGACT GAGTTGATTCTGACCAG ACTTGATGGTTTT AAGT CG GAACCGAT AAATTTT AAA 

AAGGAG AAAAAAT AATTTGACCT AGT AGT ATAAAACATGAGGCTTTAATGGT ACTTT GCT ATG 

AAAAGAAAACACTGTATTCCTTATGCAAAACACATGTATCTTTCATTATTTATAAGTGGGCCTC 

TCTTAGGCTCAGTTACTCAaTCAtACGTAGTATnrataAAATAAtrrATATc 

>174 

>175 

AG GGAGTCCG CCCACG CGTCCGCCGG GTTTTAGTTCCTCGGG GAGCCCCTGGTGC 
CCCGGATACGGCTGATTTGTCGTGTGGGACCTGTTCTGGCTGCTCCAGCCCCAGGAAGGAC 
CCAGGACACCCGGAAGCCGG AAATGGACTC AGTGGCCTTTGAGG ATGT GGCTGTG AACTTC 
ACCCAGGAGGAGTGGGCTTTGCTGAGTCCTTCCCAGAAGAATCTCTACAGAGATGTGACGC 
TGGAAACCTTCAGGAACCTGGCCTCGGAATCCAATGGAAAGACCAGGACATTGAGAATCTG 

taccaaaacctggggattaagctaagaagtctggtggagagactctgtggacgtaaagaag 
ggaatgaacacagagaaactttcagccagattcctgattgtcacctgaac^agaaaagtcaa 
act ggagtgaaaccatgcaaatgcagcgtgtgtgggaaagtcttcctccgtcattcattcct 
ggacaggcacatgagagctcatcctggacacaaacgatctgagtgtggtggggaatggag 
agagacgccccgtaaacagaaacaacatgggaaagcctccatttcccccagtagtggtgca 



tttaattctccccattaatctcacatccatcaaaggatctcacactggaaagaggtcctataa 
tgtaggaatgtgagagctcccagtttcaggttcttcgagacgggaactgcttctgggaaaa 
ggttgatgtagtcgggggaaaccttgattcccggttttcattggtagactccggaaaac 
atgtacagggaagctttcgagacggcaaaatcccnn 



>176 



CATACAGTGTGCaacAGGGGTGGTACAGTTGGATOgGCATCAGCGTTGCATTTACTTGTGATG 
CATCAGAGACACaacCTGTTGAAACATGAGAAAGCATTTCGGCACCCTCAAATGTACTGGGAA 

tgccagcagtttccaaagcctgtttaataatttcactacctt cttgact gacactggtattaa 

AGTTAGTCACACCAGATC^^GGAAATGTATTTGGTAAAAATGCTGGCX^GCGATTTTCAGCA 

agaagctgtagaaatgatggatctttaaaacatgcttctggattgaaggtagattgttgtgg 
ctgctgcaaatttactgcatttgtggagagaatcatgctggcaagaagagaaggctgtccat 

TACT CT GTAGAAGTTCttgAGCAAT11r^GCTCt^TCCAGTGGTTtacAGTAAGATCGCT»ggATAAG 
TGCCCACCCAGTGATCTGGGATTAGTAAAAGCCCTGT AACACCT GCT ACAGATAAATTTCCC 

atctctgataattgcaggccatttagcccttttgttgtgcttagg i men iccttcccattt 

GGGCCCCGCCCACGGTC 1 1 1 1 1 1 AACI 1 1 1 1 (JAGGTGCAGACTGTTGGGCATTGGTGCTACT 
GAAGTTATTTGCAACATTCACTTGTGAAGGAAAAACTGC^^ 
AAAGGAGGAATTAGTGTTTCCTTCAATCTGTGAGGAATAATGATTTGTATTArrr^ 
GGAAAAAACAGAATTAGTCCCACTATCTGCTGGTGAAGGGAAGAGAGACATTGAACT ACTTT 
CT GCAGGTAAAG GAAGG AAAGGATCTGAGCCACTGTCAATATTCAAAGGCAGAACTGTTTTG 
GTTAGATCTrCCATTTGTGCTGGC*AGG 

ATACCACCTT GTTCATTTrrATCCTGAGAAGTTATATTTGGATTTATGACACTTTCCATTGA 
ACAACAATGCAGTTGACACACTTGCTAAATGAGCTGGAAAAATC 

CAACTGGGCCGAACAAGCAGGATTCCCATTGGCTTTGGTCTGT GATGTAAAAAATGGATTTG 
AGCTGTrCACCGGTGATGGCA^ 

JA GCT t iCTGACCI I I II I ACTTTTGCGCTGCATTTTAAATGCAGCATATTGGTCAGGGTGTG 
CTGI CI I CATGTGTTTCCCAATACTCTGTGAAGAATTATAGGTT CGAGTACATCCCTCGACCT 
GGCAGCT GAATTTCTGAACT GGAG CTTTTGGAGGCACTGATGGAGTTCCT AATGAATTACTT 
AGGTCGGACTGCGGTGGAACAAATGCAATACTTTCTAATGTCTTAAGAG^ 
TTQGATGA TGTT TTAACATTATCTCCACATTCAAATTTGGAAGTTGGTAAATTGTTTTG 
CTGCCTQATTTTGCACAGAAAAGGTCATCAAGTTGTTCACTTG^ 

TAAGGTCACCT ATTTTGAGAGTTTCTGAATTTACTAAAGAATTCTG AGTTAACATGGGCATCAT 
TAAAGCAATCTTGTTCTrTTTCTCrroCTGGAAAGCT
AACTGGCGTATGTAAGTCTGTGAiXAGTGCTTCACCACT 
GCTTTCCAAAGTTATCCTC^ATCCCCTGATTGAGAGACACCrrTAATGGACA 
TGGATTGAAGTAGAGGAGT AGT AGCAGGGTTTTCXiAAACX^TTTGACAATGGTCX^TT AGCT 
TGCu I GAGC i CAGAGGCAGAAATCTGGTCTTGTTTGGTCACAGCTGATTCTGCTTTGCTTTT

ATCCCAAGCATCACTTCTATCTGCTAGTAAGCTGttctcaacgTTATTTTCTGAAGG^ 

TCTTTCTTT CT CAAT ATTCTCTGCTGTG TTCTG ATTCACTTCAG AAG GCTGAAT GGAATCT CCA 

GATGAATTAAGTTGCGCTTCAGGAGGCAGCACTTTTTCAGGAGGAGT 

>177

>178 

TGGCGGCCGCCCGGGCAGGTACCAAACCATTTTCACTAGTTCAG<5ATAGGAATATT 

catcagattgtctctgtaaaagtgaatcacaaaaattccacctgtgtaggtgtgggactgga 

CAGCT GAGTGACAGGGCCCTGGG AAGAACAGAAAGCACTTTTCCTCTTTCCT CT GAAATATC 
AGAAGTTAAAAATCTACTCTGAGTTATATGTGCATCAATTTTAGACATATTGCTGATTTTATTA 
TGAAAATGAAGTGCTAAAGACAAAGGATATTTCCATTCCTCTGGACAGGCAGCCACAGACCA 
G CACTGCTTG ACCCATGTGTAT AC ACATGTGTGCTTTGT ACC AG GCG AAGGC AGCT CTTCCC 
TCCCTTC^GTTC^ATTTACAC^CTCATCTTCTTTAAAGI 1 IC 1 1 I GGATTTCATGATCT GGCA 
ATGA AACTCA I MIC 11 1 ATTCTTCAGTCTCTGTCCTGGTAAATTTCCACGTATACAGATTTTG 

ccag 1 1 i 1 1 1 caactagtttatcrtttggggctttattatttgtttctg acaacact gcagatg 
agtctgacttctggctaactttacacaaaaaatgtactgtaaattcatgctgagatctgggca 
ggcagaggtatgcannn 

>179 

nnatgtggttacgacccactgtattgaggtgacgcgatccataggctgtggtgttt 
gttttcgctgatcx^cac^aacgttggggcactgtctattcatgtgttcaagctgaaggctc 
gttctcggttgtcattttacagtgtttrtctacggggttacattacaggaatgttgtagcgac 

GTTT AGCCCGT G GAGT ATCAACG T CTTGAG ACTCCGTGT GAGACTCCCTG G TTTGTTCCACA 

ACAGTGTG lllll AGATTCCGTACCTTTGATTAAGGAACACATC^TGCCGTGAAGCCAATTTA 

TATTCTGCAATT CCGTA GTGCATGTAATGTATTCTGCCGTCTCGTAGTGTGAAGCCATGCTTG 

GCACATCCAG7TCTTTGATGTCTGGCTGCCTTCTGCGGGCCAACTGTCTT GT GGAATTCGTT 

GCTCXX^GAGATAGCTTGAACTGCAGATCCCGCACAGC^TTGCACTGAGCTGTCGTTGTATC 

TGAGCCTGGAC^TGGCGGCCGAGGTACTCACAGTC^CGCAAATTCACAGTCTGCGTGCAC 

GCTCTCCATTCI ICIICI IGGCTTTACAGGTTCCCAGGTCAAGAGCTT(>CCCATAATTAAGA 

CCTTCTGAG G ATGATCG ATAGATAAACACACCTCCTCTGAACCATCCTrGGGCTTCATGGGG 

TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTCCA6AGCGCTTTGTG 

TCCAA ATAAG AACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCT 

GAAGGTTmCX^CTATTCACACTCTGTGGCGTAACCTTCTTCA^ 

CCAATCT A 1 1 f CI I CCAGCTTCTCTCTGGCCATCTTTTCCTTGATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTCATGTGTCTTCGTTCAT ATTCTCTGGAG GAT ACT CGAGCCCGCCTCGAGC 
CGCAGACCAGGAG AAGGCTTCCACACAGATGGCGATTGAGTCGTTTCCT CACAGAACTTTC 
ACT CGGGGTCCACCACATATTTGACCTCTAGTT ATCCCACTAGGTTTGTTCCGAGAAATCGT 
CTGTA GGGG TTGGGAGGGTGCACTTGTCATGCTrGAAGATGAGCTTTTGGGATCTGGAGGT 
GAAGCCTTTGGTGTTCGAGCCACCCTC H G G 1 1 C 1 1 GGTAGCGCAGGGACATCAAGCTCCG 
CAGAAAAGC^TGTTGACTCCTGAATTCTCTGAACTCTCCTCTCCTT AAQAGGTGGCCGGGGA 
GACTTCTCAGGGGATTTTTGCAGACGGGCTGGGCTTGCTGTCTGTTTAATTGTTG^ 
AGCrGCTCTTGTAAGGCCTTCATTTGCTCTTGCAAATTCCTTAAT^ 
TCGTTTTCTCTCGCCTGGGGGCAGGAGCAGGGAGGA 

GGGAAC ^CT TCTTCATC TCT 1 1 ICC I I 

GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATTTTCCCG 



GATTGTCTTCCTCCTC^TCCATGCTGTCAAGAGGACAGTrGTGACAGGAAA 

GCT (^GATGCCCAGATGAGCAATGTAGAATCTT GGCCGATGCCGCIGAGGCCTTCTTCGTTC 

ACAGCTGAACCCGCCAAAATCGGACNN 

>180 

NNATGTGGTTACGACCCACT GTATTGAGGTGACGCGATCCATAGGCTGTGGTGTTT 
GTTTTCGCTGATCCACACAAACGTTGGGGCACTGTCT ATTCATGTGTTCAAGCTGAAGGCTC 
GTTCTCGGTTGTCATTTTACAGTGTTTTTCTACGGGGTTACATTACAGGA^ 
GTTTAG^CCGTGGACTATCAACGTCTTGAGACTCXXSTGTGAQACTC^ I GG I I TG 1 I C CACA 
ACAGTGTGTrrTTAGATrCCGTACCTTTGAtTAAGGAAGACATCA 

TATTCTGCAATTCCGTAGTGCATGTAATGTATTCTG CCGTCTCGTAGTGTGAAGCCATGCTTG 
GCACATCCAGTTCTTTGATGTCTGGCTGCCTTCTGCGGGCCAACT

G CTCCCAGAGATAG CTT GAACTGCAGAT CCCGCACAGC ATTGGACTGAGCT GTCGTT G T ATC 
TGAGCCTGGACATGGCGGCCGAGGTACTCACAGTCACGCAAATTCACAGTCTGCGTGCACG 
GCTCTCCATTCTTCTTCTTGGCTTTACAGGTTC^ 

CCTTCTGAGGATGATCGATAGATAAACACACCTCCTCTGAACCATCCTTGG 
TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCOT^ 

TCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCT 
GAAGGI 1 1 1 ICCACTATTCACACTCTGTGGCGTAACCTTCTTCAATATAACXCCAAATGTCAC 
CCAATCTATTTCTTCCAGCTTCTCTCTGGCCATCTTTTCCTT GATCTGAGACAGTCT GATCAG 
TTTTCX3GCCGGTCATGTGTCTTCGTT<^TATTCTCTGGAGGATACTCGAG(XXX5CCTCGAGC 
CGCAGACCAGGAG AAGGCTTCCACACAGATGGCGATTGAGTCX5TTTCCT CACAGAACTTTC 
ACTCGGGGTCCACCACATATTTGACCTCTAGTTATCCCACT AGGTTTGTTCCGAGAAATCGT 
CTGTA GGGG TTGGGAGGGTGCACTTGTCATCCTTGAAGATGAGCTTTTGGGATCTGGAGGT 
GAAGCCTTTGGTGTTCU^GCCACCCTCTTG GTTCTTGGT AG CG CAGGGACATCAAG CTCCG 
CAGAAAAGCATGTTGACT CCTGAATTCTCTGAACTCTCCTCT CCTT AAGAGGTGGCCGGGG A 
GACTTCTCAGGGGaTTTTTGCAG^CGGGCTGGGCTTGCTGTCT GTTTAATTGTTGTT ACTTTT 
AGCTGCTCTTGTAAGGCCTTCATTTGCTCTTGCAAATTCCTT AATTCCTCTTGCAACTCTTCAT 
TCGTTTTCTCTCG<XTrGGGG^CAGGAGCAGGGAGGACCCTATTTTCAGTTGACrrGTGATGC 
GGGAACTTCI I CI I CATCTGTTAAGTCCTCCATATCTCCAAAGAGAGTGGCCAGATTTrCCTT 
TTCGTCTCTT GTCT CTCCTGTTTCTCCATCATCAGCCT CTTCTGT AT AAGATTCACCGTCGCC 
GTCGGCATCIAAAGAGCTCATCAAATGCGTCGGGCTCGCCATTTTCCCGCGTCAAGA^ 
TTTCr rCTGAA TTACAATCCAAGGCTGACTCA 1 1 1 I CI I CCAGGAGTGCGGTCAGCAGAGACA 
GATTGTCTTCCTCCTCATCCATG CTGTCAAG AGGACAGTTGTGACAGGAAACTTCGAAGGAG 

gctcagatgcccagatgaggaatgtagaatcttggccgatgccgggacgcc^ 

ACAGCTGAACCCGCCAAAATCGGACNN 

>181

WIATGTGXTTTACG ACCA^CTGTATrQAGGTGACGCGATCCAT AGGCT GTGGTGTTT 
GTTCT<XK3TTGTCATTTTACAGTGTTTTTCTA 

GTTTAGC XXXrTGG AGTATCAACGTCTTGAGACTCCGTGTGAGACTCCCTGQm 

ACAGTGTG 1 1 1 1 1 AGATTCXX3TACCTTTGATTAAGGAACACATCATGCCGTGAAGCCAATTTA 

T ATTCTGCAATTCCGTAGTGCATGTAATGT ATTCTGCCGTCTCGT AGTGTGAAGCCATGCTTG 

GCACATCCAGTTCTTrGATGTCTGGCTGCCTTCTGCGGGCCAACT 

GCTCCCAGAGATAGCTTGAACTGCAGATCCCGCACAGCATTGCACTGAGCT 

TGAGCCTGGACATGGCGGCCGAGGTACT CACAGTCAOGCAAATTCACAGTCTGCGTGCACG 

GCTCTCCATTCI ICI I CI I6GCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 

CCTTCTGAGGATGATCGATAGATAAACACACCTCCTCTGAACCATCCTTGGGCTTC^ 

TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTC^ 

TCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGTTTC^ 

GAAGGTTTTTCCACTATTCACACTCTGTGGCGTAACCTTCT^ 

CCAATCTAI | ICI I CCAGCTTCTCTCTGGCCATC 1 1 1 1 CCI I GATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTCATGT GTCTT CGTTCATATTCTCTGGAGGATACTCGAGCCCGCCTCGAGC 
CGCAGACCAGGAGAAGGCTTCCACACAGATGGCGATTGAGTCGTTTCXrrC^ 
ACTCX3GGGTCCACCACATATTTGACCTCTAGTTATCCCACTAGGTTTGTTCC^ 
CTGTAGGGGTT GGGAGGGTGCACTrGTCATCCTrGAAGATGAGCTTTTGGGATCTGGAGGT 
GAAGCCI 1 1 GGTGTTCGAGCCACCCTCTTGG I ICI I GGTAGCGCAGGGACATCAAGCTCCG 
CAGAAAAGCATGTTGACTCXJTGAATTCTCTGAACTCTCCTCTCCTTAAGAG 
GACTTCTCAGGGGATTTTTGCAGACGGGCTGGGCTTGCTGTCTGTTT AATTGTTGTT ACTTTT 
AG CTGCT CTTGTAAGGCCTTCATTTGCTCTTGCAAATTCCTT AATTCCTCTTGCAACTCTTCAT 
TCGTTTTCTCTCG^XrTGGGGGCAGGAGCAGGGAGGACCCT 

6GGAAC I 1 C 1 1 C I I CATCTGTTAAGTCCTCCATATCTCCAAAGAGAGTGGCCAGATTTTCCnT 
TTCUICICI I GTCTCTCCTG 1 1 ICICCATCATCAGCCICI ICIU TAT AAGATTCACCGTCGCC 
GTTCGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATT^^ AT 
TTTCTTCTGAATTACAATCCAAGGCTGACTCA 1 1 1 IC I ICCAGCAGTGCGGTCAGCAGAGACA 
GaTTGTCTTCCTCXJTCATCX^TGCTGTCAAGAGGACAGT^ 
GCTCAGATGCCCAGATGAGCAATGTAGAATCTTGGCCGATGCCGGGACGCCTTCrrTCGTTC

ACAGCTGAACCCGCCAAAATCGGACNN 

>182 

NACAGCT GGCGAGTGTACCCGTTTCTGCGAG AGAAGCT AAAGAATGCCCTTGCGTG 
AGGAAATTAAATTTATGTTCAAGCTAAATGTCX3TTTATGGCACTGGGAACACAAGCATTGTCG 
TAACAAGTTCTGGTGGAAATCCGTACATGGATACGTTCTCTTCTGGGGGCGGTCTCCAGTCC 
TTTCTCATGAGGGAGCACACTCCTCTGCCTCATTGCAGTGGCCTCAGGGATATGGAATTAAG 

atccacxtggtgtgatgaataaacccagactctcagcaacgc^ggaaaaaaaaacaaaaact 

ggctggcgatctggagtaaaggatcctcac^tccacgtgaaccaggaaactctgtgcccaa 

atcgacgaaaaaaaaacactgggagagccgaactaaaagtcttttagcacgggtaccagcc 

ctagaatttccagtagaccagcagacagccgggaaaccagatcctcatcaaaagacagaaa 

gaaaaaaattcgaagccagcctgagaagggcxxrratttcaatgctgttaaccactgagactg 

catcctcaacagtgagacaggatggaccaacatactcaagctcaaggaaacatctaacaccg 

gtcggaaactcatgggatcaccggtacctaggattaggccatataaagtttaccgttacaga 

accgcctactctcatattacgtctcac^atggacacttatccctacctatccggacaagccc 

ctgggcatagggacctagctgac^acacgcagtacjcaagggccttaaa^ 

cccgtctcggcacaggtgccacatcacgagacgctatggctaaaccacagtatagacgggc 

GCCO*AATGAGAGGACACCGGCCGCCTTGGTTGTCACGGCACA^ 

ggtagggcatagaacatgactactacgcggagggcagcacaacgcaacacaatatatacag 

AAAGAATGAAACGTACAG AAACTTCAGCAAGTGATCT GCTTTCTTGTGTTGTCTT CTCCTGAC 

CNNN 

>183 

tatAGGGAGTCGACCCACGCGTCCGGcCAGAGTAAAGCAAAGAGAAAGGAAGCAGGC 
CCGTTGGAAGTGGTTGTGACAACCCCAGCAATGTGGAGAAGCCTGGGGCTTGCCCT 
TCTGTCTCCTCCCATCGGGAGGAACAGAGAGCCAGGACCAAAGCTCCTTATGTAAGCAACC 
CCCAGCCTGGAGCATAAGAGATCAAGATCCAATGCTAAACTCCAATGGTTCAGTGACTGTGG 
TTGCTTjrTCTTCAAGCCAGCTGATAiCCTGTGCATACTGCAGGCATCTAAATTAG 
CG AGT AAAACT GAAG AAAGAAGG AT ATTCTAATATTTCTT AT ATTGTTGTTAATCATCAAGGAA 
TCTCTTCTCGATTAAAATACACACATCTT AAGAATAAGGTTTCAGAGCATATTCCTGTTTATCA 
ACAAGAAGAAAACCAAACAGATGTCTGGACTCTTTTAAATGGAAGCAAAGATGACTT CCT CAT 
ATATGATAGATGTGGCCGTCTTGT ATATCATCTTGGTTTGCCTTTTTCCTTCCT AACTTTCCCA 
TATGTAGAAGAAGCCATTAAGATTGCTTACTGTGAAAAGAAATGTGGAAACTGCTCTCrCACG 
ACT CTC^AAGATGAAG ACTTTTGT AAACGTGTATCTTTGGCT ACTGTGGATAAAACAG^ 
ACTCCAT CGCCT CATTAGCATCATGAGCATCATCAC^ 

CAGTGAGCTTTC^GAGAATCAGCAACCAGGAGCACCAAATGCTCCT ACTCATCCTGCTCCTC 
CAGGCCrrTCATCACCACCATAAGCACAAGGGTCAGCATAGGCAGGGTCACCCAGAGAACCG 
AGATATGCCAGCAAGTGAAGATTTACAAGATTTACAAAAGAAGCTCT GTCGAAAGAGATGTAT 
AAATCAATTACTCTGTAAATTGCCCACAGATTCAGAGTTGGCTCCTAGGAGCT 
ATTGTCGACATCTGATATTTGAAAAAACAGGGTCT GCAATCACCTGACAGTGTAAAGAAAACC 
TCCCATCTTT ATGTAGCTGACAGGGACTTCGGGCAGAGGAGAACATAACTGAATCTTGTCAG 
TGACGTTTGCCTCCAGCTGCCTGACAAATAAGTCAGCAGCTT ATACCCACAGAAGCCAGTGC 
CAGTTG ACGCTGAAAGAATCAGGCAAAAAAGTGAGAATGACCTTCAAACT AAAT ATTTAAAAT 

aggacatactccccaatttagtctagacacaatttcatttccagcattt^ accaaa 

ttagtgaaccaaaaatagaaattagatttgtgcaaacatggagaaatctactgaattggct^ 

cagattttaaattttatgtcatagaaatattgactcaaaccatattt^ 

GAAAGGTGATTGCAGCTTTTGGTTAATATGTC I rTTTTTTTCTTTl 1 CCAGTGTTCTATTTGCTT 
T AATGAGAATAGAAACGTAAACTATGACCT aggggtttct gtt ggat aattagcagtttagaa 
TGGAGGAAGAACAACAAAGACATGCTTTCCA I l l ll l IC I I ia cttatctctcaaaacaatatt 
acttt gtcttttcaatcttct acttttaactaataaaataagtggattttgtatttt AAGATCCA 

GAAAT ACTT AACACGTGAATATTTTGCT AAAAAAGCATAT ATAACTATTTT AAAT ATCCATTTAT 
Win » GTATATCT AAGACTCATCCTGATTTTTACTATCACACAT6AATAAAGCCTTTGTATCT 

CI I ICI I ATC 1 1 ICI II C I C I aatgttgtatcatactcttct aaaacttgagtggctgtcttaaa 

AGATA TAAG GGGAAAGATAATATTGTCTGTCTCTATATTGCTT AGT AAGTATTTCCATAGTCAA 

TGATGGTTT AATAGGT AAACCAAACCCTATAAACCT GACCTCCTTT ATGGTT AATACTATTTAA . 

GCAAGAATGCAGJACAGAATTGGATACAGTACGGATTTC^ 

TAAAGCT GACTTCGTTTGTTATGTAGX5CTGTATGCATATATTGAAAACAGAAG 
GATTGCTTTT AAATAAAT ACCAACT AATGAATTT ACTG CT AAGCTCAAAACCGtCT ACG CTT AA
GG TGAGAATCTTTGCCAACAGAAAGGCG TTG CTCT AAATCGCGAATGGT ATT ACGAAGAATT 
GCAATCTOCTTG 1 mTTTCTTCI I GAAAATGCTGAAGTTTOCTTCTGTAGAT A C 1 I I G TGGCA 
AAACAGT AGAAT CCT GAT ATG ATTTTGATTT AGTCTGAACTCTTGCT AG TTTG GCATCAAATTC 
TAGTTGTAGCTTGATTATTTCA II I I I I I I II 1 1 1 1 1 TGACTTCTTAGCTTT GCATTTAACTCTGA 
AATT rCCATCTCCl 1 1 1 ICTCTATTAG1 rCTTTGTGCl I TICT TCATTTAATTCAACI 1 1 rCTCl 
GCATGTCCTGAT AAAGTTTGCTTA 1 1 ILT 1 1 C 1 1 ATATCCCTCCTCTTTGATTTTCATTTCACTT 
ACAAGTTTGGCAATATTATTCTTATATTCCTGTTCATGAGACTTT AACTTCTCTTTT ATAAGATC 
AGCACTT AI I f I f AGTTGTTCATTTTGACCTTTCAAATTCrrGTTGATATTCAATGGTCTGTTGTA 
GCCCI 1 1 \ ATTATATTATGAAGAGTAGCACATTC 1 1 CI 1 I AATGGAGAGCTCC 1 MGCT IGCAT 
TAG 1 1 1 1 AATAGGCAATTAU 1 1 1 II I CCAACT GAAT ATCCAGT ATATTGTTCTTTCCATT G GTTT 
CTACCA1 I M C II F ICTATCTGTGAGAAGTCATTTATAAGTTTGTCAAGATTCACACTGCTAAT 
CI 1 1 1 1 CATACATCCTTCACTGCTTTGGTCCATGCCTGCTGTTGCTCCTAGTTCCCCaCAGTC 
TCCCTTTCCGCTCCCCGCGTACCTCGGCCGCCac 
>184

NACAGCTGGCGAGTGTAGCCGTTTCTGCGAGAGAAGCTAAAGAATGCCCTTGCGTG 

TAACAAGTTCTGGTGGAAATCCGTACATGGATACGTTCTCTTCTGGGGGCGGTCTCCAGTCC 
TTTCTCATGAGGGAGCACACTCCTCT GCCTCATTGCAGTGGCCTCAGGGATATGGAATTAAG 
ATCCACCTGGTGTGATGAAT AAACCCAGACTCTCAGCAACGCAGGAAAAAAAAACAAAAACT 

ggctggcgatctggagtaaaggatcx;tcacatccacgtgaaccaggaaactctgtgcccaa 

ATCGAOGAAAAAAAAACACTGGGAGAGCCGAACTAAAAGTCTTTTAGCACGGGTACCAGCC 



GAAAAAAATTCGAAGCCAGCCTGAGAAGGG OCCT ATTTCAATGCTGTT AACCACTGAGACTG 
GTCGGAAACTCATGGGATCACCGGTACCTAGGATTAGGCCATATAAAGTTTACGGTTACAGA 

ACCGCCT actctcatattacgtctcacaatggacactt ATCCCT ACCT ATCCGGAGAAGCCC 

CCCGTCTCGGCACAGGTGCCAC^TCACGAGACGCTATGGCT AAAOCACAGTATAGACGGGC 
gcccaaatgagaggagaccggccgcctt GGTTGTCAGGGCACAAGAGCATCGCCATCAGA 

ggtagggcatagaacatgactactacgcggagggcagcacaacgcaacacaatatatacag 
aaagaatgaaacgtacagaaacttcagcaagtgatctg - 



>185



GTCCGGCGATGGGCACGGGCA 1 1 » CI IOG1 1 I ATAGCI G ICIUI 1 1 gcattctgattgggaa 
cactgggatgattttcatcatgccgacagtggtggtaatggatgta 1 CCC 1 1 1 CCATGACCC 
GACCr GTGTC1 attgaggggtcogaggaatacc^gogaagcagtaagtaatatggatgatta 
tgacaaaacctgcttggagtctgcattagttggtgtttgcaata 

gtcactgcgacaitccctagcgactcaaaatcaacgaagtg 

CTTTTCCTTTCOCATCT AAGTTATATATCATGTGCATGGCGAATTTGggaGGAGCTCCAGAGC 
ACCGAI ICC 1 J GGAATGCCTTGAAOGTCTCATAGATTTAAACAATOTToAAGGGCAGATrTTTA 
CTATTGATGgooooctGTGCTTGAAGAATGTACAGTClatGnTGGAAAACT GATAGATTTGGCAT A 
TAr; 
>186 

G UM LX 3CTGATCCAGACAAACGTTGGGGCACTGTCT ATTCATGTGTTCAAGCT GAAGGCTC 
GTTCTCGGTTGTCATTTTACAGTGTTTTTCT ACGGGGTrACATTACAGGAATGTTGTAGCGAC 
GTTTAGCOCGTGGAGTATCAACGTCTrGAGACTCCGTGTGAGACTCCCTGG 1 I I GI I C CACA 
ACAGTGTGTTTTTAGATTCCGTACCTTTGATT AAGGAACACATCATGCCGTGAAGCCAATTT A 
TATTCTGCAATTGCGTAGTGCATGTAATGTA7TCTGG03TCTXDGTAGT 
GCACATCCAGTTCTTTGATI 



GCTCCCAGAGATAGCTTGAACTGCAGATCCCGCACAGCA7TGCACTGAGCTGTTO 
TGAGCCTGGACATGGCGGCCGAGGTACTCA^ ' 
GCTCTCCATTCI I CI f CTT GGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 
CCTTCTGAGGATGATCGATAGATAAACACACCTCCTCTGAACCATCCTTGGGCTTCATGGGG

TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCCGTCTTCCAGAGCGCTTTGTG^ 

TCCAA ATAAG AACAAGGACACAGATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCT 

GAAGGI 1 1 1 1 CCACTATTCACACTCTGTGGCGTAACCTTCTTCAATATAACCCCAAATGTCAC 

CCAATCTATTTCTTCCAGCTTCTCTCTGGCCATCTTTTCGTTGATCTGAGACAGTCrrGATC^ 

TmCGGCCGGTCATGTGTCTTCGTrCATATTCTCTGGAGGATACTCGAGCCTC 

CGCAGACCAGGAGAAGGCTTCCACACAGATGGCGATTGAGTCGTTTCCTCACAGAACTTTC 

ACTCGGGGTCCACCACATATTTGACCTCTAGTTATCCCACTAGGTTTGTTCCGAGAAATCGT 

CTGT AGGGGTTGGGAG GGTGCACTTGTCATCCTTGAAGATGAGCTnT GGGATCTGGAGGT 

GAAGCCTTTGGTGTTCGAGCCAGCCrrCTTGGTTCTTGGTAGCGCAGGGACATCAAGCT 

CAGAAAAGCATGTT GACTC CTGAATTCTCTGAACTCTCCTCTCCTT AAGAGGTGGCCGGGGA 

GACTTCTCAGGGGATTTTTGCAGACGGGCTGGGCTTGCTGTCTGTTTAATTGTTGTTACTTTT 

AG CTOCT CTTGTAAGOCCTTC^TTrGCTCTT GCAAATTCCTTAATTCCTCTTGCAACTCTTCAT 

TCGTTTTCTCTCGCCTGGGGGCAGGAGCAGGGAGGACCCTATTTTCAGTTGACTGTGATGC 

G GGAACTTCTTCTTCATCTGTTAAGTCCTCCAT ATCTCCAAAGAG AGTGG CCAGATTTTCCTT 

TTCGTC I C 1 1 GTCTCTCCTGTTTCTCCATCATCAGCXrrCTTCTGTATAAGATTCAOCGTCGCC 

GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATTTTCCOGCGTCAAGAAGTTA 



GATTGTCTTCCTCCTCATCCATGCTGTCAAGAGGACAGTTGTGACAGGAAACTTCGAAGGAG 
G CT CAGATGCCCAGATGAG CAATGT AG AATCTT GG CCG ATGCCGGGACGCCTT CTT CGTTC 
ACAGCTGAACCGGCCAAAATCGGACNN 
>187 

NNNNWJCTAAACAGCCTGACACTGAGGGGAGGCAGTGAGACTGTAAGCAGTCTGG 
GTTGGGCAGAAGGCAGAAAAC<^AGCAGAGTCACAGAGGAGATGGCXiAACTGCCAAATAGC 
^^^^^^^^^^^^^^CAOM^GT(^TCTTT<XAATTTCCCfi>ACT CCTTTGCTTCAGTGC 
CCTGJVTCTCTGAACTAAGAAACCAGAAAGAAGTGGCAGGATGGACTT ATCATTACAGCACAA 
^GCATACTCATGGAATATTTCCCGTAAATAGTGCCAGAATCXSCTACACAGACTT AGTGGCC 
ATCCAGAATAAAAATGAAATTG ATTACCTCAAT AAGGTCCTACCCT ACT ACAGCT CCT ACT AC 
TGGATTGGGATCCGAAAGAAGAAT AAGAGAT G GACATGG GTGGGAACCAAAAAGGCT CT CA 
CCAACGAGGCTGAGAACTGGGCTGAT AATGAACCTAACAACAAAAGGAACAACGAGGACT G 
CGTGGAGATATACATCAAGAGTCCGTCAGCCCCTGGCAAGTGGAATGATGAGCACTGCTTG 
AAGAAAAAGCACM5CATTGTGTTACACAGCCTCCTGCCAGGACATGTCCT 
GAGAGTGCCTCGAGACCATCGGGAACTACACXn'GCTCCTGTTACCCTGGATTCT 
AGAATGTGAATACGTGAGAGAGT GTGGAGAACTTGAGCTCCXn'CAACACGTGCTCATGAACT 
GCAGCCACCCTCTGGGAAAC I ICICI 1 1 1 AACTCGCAGTGCAGCTTCCACTGCACTGACGG 
GTACCAAGTA AATG GGCCCAGCAAGCTGGAATGCTTGGCTTCTGGAATCTGGACAAA 
CCTCCACAGTGTTT AGCTGCCCAGTG<XXiAC<XXrrGAAGATTCCTGAACGAGGAAACATGAT 
CTGCCTTC^TTCTGCAAAAGCATTCCAGCATCAGTCT AGCTGCAGCTTCAGTTGTGAAGAGG 
GATTTGCATTAGTT GGACCGGAAGTGGTGCAATGCACAGCCTCGGGGGTATGGACAGCCCC 
AGCCCCAGTGTGT AAAGCTGTGCAGTGTCAGCACCTGGAAG<XXXXiAGTGAAGGAACCATG 
GACTGTGTTCATCCGCTGAC I GC 1 1 i I GCCTATGGCTCCAGCTGCAAATTTGAGTGOCAGOC 
CGGCTACAGAGTGAGGK3GCTTG*XACATGCTCCGCTG<^TTGACTCT GGACACTGGTCTGCA 
CCCTTGCCAACCTGTGAGGCTATTrC^ 
TGG^TTGCTCTCCATCCTTCAGAGC<5TTTCACT 

GAAGGTTTCATGCTGAGAGGAGC<X^TATAGTTCGGTGTGATAACTTGGGACAGTGGACAG 

CACCAGCCCCAGTCTGTCAAGCTTTGCAGTGCCAGGATCTCCC^ 

GGTGAACTGCTCCCACCXXTTTCGGTGCXrrrT^ 

AATGAAGGCTTGCTCCrrGGTGGGAGCAAGTGTGCTACAGTGCTTGGCT ACT GGAAACT GGA 
ATTCTGTTCCTCCAGAATGCCAAGGC^TTOCCTGCACACCTTTGCT AAGCCCTCAGAATGGA 
ACAATGACCTGTG TTCAA QCTCTTGQAAGTTCCAGTTATAAATCCACATCT 
GAOGAGGGATA) rCTTTGTCl GGAOCAGAAAGATTGGATTGTACTCGATCGGGACGCTGGA 
CAGACTCCCCACCAATGTGTGAAGCCATCAAGTGCOCAGAACrrCTT^ 
CAGCCTGGATTGTTCTGACACTCGTGGAGAATTCAATGTTGGCT 1 1 I CIC I I 

GTAACAACGGCTTT AAGCTGOAGGGGCCCAATAATGTGGAATT^ACAACTTCT 
GTCAGCTACT CCACCAACCTGCAAAGGCATAGCATCACTTCCTACTCCAGGGGTGCAATGTC 
CAGCC42TCACCACTCCTGGGCAGGGAACCATGTACTGTAGGCATCA 
TTTTAATACCACTTGTTACTTTGGCTGCAACGCTC^

CAGCTGCAGACCTTCAGGACAATGGACAGCAG T AA CTCCAGCATGCAGAGCTG TGAAATGC 
TCAGAACT ACATGTTAAT AAGCCAATAGCGATGAACT GCTCCAACCT CT GGGGAAACTTCAG 
TTATGGATCAATCTGCTCTTTCX^TTGTCT AGAGGGCCAGTTACTTAATGGCT CTGCACAAAC 
AGCATGCCAAGAGAATGGCCACTGGTCAACTACCGTGCCAACCTGCCAAG 
ACTATCCAGGAAGCCCTGACTTACTTTGGTGGAGCQGTGGC^ 
GGGTGGGACGCTOrrGGCrrrrGCTAAGAAAGCGTTTCAGACAAA 

CCCTTGAATCCTCACAGCCACCT AGGAACATATGGAGTTTTTACAAACGCT GCATTTGACCC 
GAGTCCTT AAGGTTTCCATAAACACCCATGAATCAAAGACATGGAATTACCTT AGATTAGCTC 
TGGAOCAGCCTG1 1 GGACCCGCTCTGGACCAACCCTGTITCCTGAGT1TGGGATTGTGGT A 
CAATCTCAAATTCTCAAOCTACCACCCCTTCCTGTCCCACCTCT^ 

CCACAGAAGCCAGGAGCAAATGTrTCTGCAGTAGTCTCTGTGCTTT GACTCACCTGTTACTT 
GAAATACCAGTGAACCAAAGAGACTGGAGCATCTGACT CACAAGAAGACCAGACTGTGGAG 
AAAT AAAAAT ACCTCTTTATTTTTTGATTGAAGGAAGGTTTTCT CCACTTT GTTGGAAAGCAG G 
TGGCATCTCTAATTGGAAGAAATTCCTGTAGCATCTTCTGGAGTCTCGAGTGGTT^CTGTTG 
ATGAGGCCTCTT GGACCT CTGCTCTGAGGCTTCCAGAGAGTCCT CT GGATGGCACCAGAGG 
. ^GCAGAAGGCCAAGAATCAAGCTAGAAGGCCACATGTCACCGTGGACC I I C CIUCCACCA 
GTCACTGTCCCTCAAATGACCCAAAGACCAATATTCAAATGCGTAATTAA 
AAAAAAAAAAAAAAAAAAAGATCT7TAATTAAGCGGCCGCAAGCTTN 
>188

TGTAACTACAGGTGTCAGATGCATCACAAAAGCAGAA^ 

CAI ICCI I U I CAATTTCATGCTGCCTACAGCAACAGCAT AAT ACTGCAAACAGCCATGATGTC 
ACTCXSAAGTGCTCTGTGATTGACAGAGAGGGACAGTCGT AGTCAGAGGTGGCTCCTCAGAG 
AATTCAGAACTCACTCGCTGTCCTrCCAGGGt^ 

ATTCrCTGCATGAGAGAGCAGGGATGGGAAGTGATATAGGT ATGT AAGGATGG TCAGTTACT 
CT AAATGTAGTT AGACAGGACAGCCAGAATACCGAGGTCTT GGTT AGGTCCTCTGTAACAAG 
CCGT AGAGGCOCAGAAATGTGGTGACAGCGAOACACATTCTT AACT CTT ACACTT GTTGAAA 
TGAGT AGAAGGTGACA \ I I GI 1 1 GGAAATCCCTCCCCAGCC I rGT NOCCAMN 
>189

N?4NNNfB01AAGGAAAGCAGCTGCAAACTTCtXJkTCTGCAG I G 1 1 IGI I IG I CIC GG 
CTCCGGCCATCACTGCCACGATTACCtXTGGATGAATTGCTCAGTGGAAAT^ - 
"fCA GCGG ACCTGCAGCGAGGrTGATTAMAAGCTTTATTGCTCAEACMMGiCt^ F BI I IGGTGG 
TCTCTTCACATGGACGCGCGCGACATTTGGTGCCCTGACTT GGATCAGGGGACCT CCCTTG 
GGAGAJCAATCCCCTGTCCTCCTGCTCTTTGCTC^ 
GTOCTCAOAOCAACCAGCCCAAGGAACATCTC^ 

AAGACT AAGATATCAGAGAAATTATTAGTGCACATTATTAGAAGAGAGCTTCAGATGAAAATA 
AAGATCAAGAAAAGACTC I IGC1 1 1 GAGAAGACACAAAGAAATGACATCATCTTATTGGGATT 



>190 

_ Jjcegg^TCCGGAcJOggaGCTgtetpAAAa 

CTGGCCACTGQTTTCCOGGAGTTAGCOGGAACGACCTT GCAGCCrTGGACACTAGOCAGGC 

ocroocTCrrcregc^^ 

TggOCTOGCreGGTCTCGGGGAAGCAGAGGACTGTTC ^ 
T CGGCOCTAGACACOCACGACTCGCAGQGTCCAT 

GGGCTGACAGGTGCCAGGGCCCATGCTGCGGGAG C Cr G TGl GC1 CAGOC 1 1 CM U CGGAC 
gCTAAAGAAGCTAAGTGGAAGAGT G TI roCTOCTCTGGCCGTAAAGCAGGTACTCTCTGCA 
GCflCCAGC TOTC OOO G CQCTACTOCGGAOCGOro 

TCAGC0GAATCCCAGOO0C1 1 CC C I I GGGCCTQCTGT GG T G CTGGACATCAGTGACAGACO 

TGwrw**^ 1 1 I GIC T lAAATGQAATCAAGACTGTGGAflgac 

f^O^^^^^JCCCCTGCTfiA^jCACCCACCGQiMCfCJ ACCATCGCCGTCCACATTGCTCA 

CAGGGACTGCGAAGGCGATQCCTGGCGGGAGCTGC^^ 

CTGCTCAGATrCAGGCCTIGC^ 

ACTCGTTGAC^TTGGGGAAACTTTXXAATGCCCC^ 

AACT AGAAAATCAAGCTGTACCCTGATGCTACAGACGAGGACATCACCTCACACATG^ 
CGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCC^GGACCTGAACGCGCCTTCT

GATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGAjCCAGAGTGCTG 

AAACCCACAGCCACAAGCAG TCCAG ATT AT AT AAGCGGAAAGCCAATGATGAGAGCAATGA 

GCATTCCGATGTGATTGATAGTCAGGAACTTTCCAAAGTGAGCCGTGAATTCCACAGOCATG 

AATTTCACAGCCATGAAGATATGCTGGTTGT AGACCOCAAAAGT AAGGAAG AAGATAAACAC 

CTGAAATTTCGTATTTCTCATGAATTAGATAGTGW 

AAATACAATTTCTCACTTTGCATTTAGTCAAAAGAAAAAAA 

AACATGAAATGCI ICII I CTCAGTTTATTGGTTGAATGTGTATCT ATTTGAGTCTGGAAATAAC 

TAATGTGTTTGATAATTAGTTTAGTTTGTGGCTTCATGGAAACTCCCTGTAAACTAA^ 

AGGGTTATGTCT ATGTTCATTCT ATAGAAGAAATGCAAACTTATC^CTGTATTTTAATATTTGTT 

A^GTCTrCATGAATAGAAATTTATGT AGAAGCAAACAAAATACTTTT ACCCCCTT AAAAAGAGA 

ATATA ACATTTTATGTCACTATAATCTTTTGTTTTTTAAGTTAGTGTATATTTTGTTGTGATTATC 

TTTTTGTGGTGTGAAT AAAT CTTTTATCTTGAATGT AAAAAAAAAAAAAAAAGT 

>191 

GT ACTCCCTGGAAAGTCCAGCTGAGAAAGCGATOCTGCCCTCTGCTCCTCGCAGGG 

TTACCCTCCTGTAAGTCTTCTGCTT AGTGTTCaGAATTGGGGGATGCTGGGACTGGGCAAGG 
ACXTGTAGGCAACACCCCATAGCCTGCTCATG(XTGTTGGGTTGCCT ATGGATCATTOCCTG 
CTGGGCTCACTCACCGGCTTCG T AT AAGGTCCrrTTTTGAGGTTTATTATTTCCTT GTCCAT AT 
ACTTGATGCTCTTCATTGGCTTGTCTGGGACCTGCCTTAGGT^ 

CGGACAGCCCCCG AGTTG GGGGAACTCTGAAGCTT CTT GGTGGCTGGAACCTTGGTCATCT 

TAAAAATCCTTCAGGTTTTAGCCTGTGCCCOCAAGACAAGGATTTTTCCA 

CAGTAGTTACTGGTATGAGAAGTTTCGGCAACTTCTCCCTGATCCCCAAGTOC^ 

GAACTCCAAAGCGG 1 1 IC CI I C ICCCGCGN 

>192

.^.^,ISSP GGCCG £g CGGGCAGGTA U1 " I 1 1 1 1 1 1 II II I I N I I I | Ml 1 1 CTGGCTTGAA 

ATACAGCTGAAATAACTGAATTTTCTACTTGAAACGTCT 

GGCCCTGCAAATGTAAAGGGCCAATCTTTGTTACAGAGGGGTTCATTG 

GTTCTGCAAAGACAAACAGGTCTCACAGATAGTTGOCCCCGCGT 

>193 

>194

>195 

NNGAGGATCCATTATCTTCTGTTT«rTTTGCTC I ICCIGT 1 1 1 IG G T G CCIGI ICCAG 
GTCATGGAGGAATCAT AAACACMT ACAGAAATATT ATTGCAGAG TCAGAGGCGGCCGGTGT 
GCTGTGCTCAGCTGCCTTCCAAAGGAGGAACAGATCGGCAAGTGCTCt^ 
A^TGCTGCCGAAGAAAGAAATAAAJWCC^^ 

AATGCCI I CI I AAAGTTT ATAAAAGT AAAATCAAATT AC A 1 1 1 1 1 I I | C CAAAAAAAAAAAA A A A 

AAGTAAAAGT 

>196 



CCATGGTGA1 1 u i ^TTCATCATCCCTT AATT AAATATCAAATT ATA 1 1 rGTGI GAAAATGTGA 
CAAACACACTTATCT GTCTCT TCTACAAI rSTOGTI I ATTGAATGTG A 1 1 1 1 rCTGCACTAATA 
TAAATTAGACT AAGTGTTTTCAAATAAATCTAAATCrTCAGCATGATGTGTTGTGT ATAATTGG 
ACT AGATATTAATTAAGTCACCTGTATAA I G 1 1 1 1 G I AATTTTGCAAAACATATCTTGAGTTGT 
TT AAACAGT CAAAATGTTTGATATTTTATACCAGCTTATGAGCTCAAAGTACCTC^^ 
>197 

>198 

NAC TTGGTGI 1 1 G I IXXXCCCGGOGAACACATTTTGG^TrGCGCACACTGGCOCTG 
GTGGCGCGCMCM I rTAGGGGGCCTGTGGCCCACAGACAATGTTCTAAOCGGGTAGTGGT 
GCTCTCTT *^f^y^"^^r^<?A T AAC^^ ; i c i l n G CCOGCT G TA 



GCACAOCOGGT ACCCT AAAGCTTGACCCGTT AAAAGCTGTACTTGT AACTGGGTACCTOGTG 
AACT G T CAAGGAG<5CCAGAGCAGGAAAGGGAAAGGAATAACCCCC>^^ 
GAGAGGCACAAATTAGAGGGCTGGGCACAGGCTGTAGCCCT 

TTGACAG TTGC1 CI GT GG I CI C I tXSGATATAATTCTGCOCAAGGCTAGAACOU^AGAGAAGA 
GTTTGCACTClTAAGTCCAGXiAAG^SGGACT 

TTAGCACACTAAACACATGGCCAGGACCCTAGGGACACAAGGCAGCTGGAGAGTGGGA 
CTTGTT AAATGGCATGGT AGGCAG^TT AGAGTCCTGGCTAT AATCCCT AGGGCCCCAATCCT 
a7taccctgaggcttctgggagaacaagagccctgagg
mgaagcagtaagaccaggcatgagaaaacccagaaagccagctcagttcccaagaaggct 
ggcacatggggcctgagaattcttaaatggccattgtcactggtacttgctcagcctttcca 

TCCATCCCATTGGATGGAGAAGAAAGGTGGACAGCCTGTTCGTCTCTCATGTCAGCCTAGG 

gctgggaacagtttgtgaggacttatctgttgtacctgccaaaagttaattagtaactcacc 
gtcgagagtgaattaacaggacaaacgtaatccaacatgccagtgtgggtaggacacagtt 
ccctaatcagcccttggcccccagatgcaggctctccxxtcccctctgagacctctctggg 
aatagcagacaagagaatgtcagggcagaaacctgctggact aggctctcagcagcccagc 

TCCTCCCTGGGGGAATCCCCCAGAATTCCTCACTGTGTGACACAGTTTTCTCCCATGTCCTG 

ggcat atctg tctgacatggtggtccttaagtcctcaatgtcacgacgcagctgttgaarfrr 

cttctagtttcctcttgatcacatctggcttctgcaaatctagctgagtctctgggtgctgtg 

agtgaatgccgaggagcagggagagattggggtcatggccctgggccctctgggtcacaa 

tcctac^gacagcctgcagatcttgaaggcaactggccaactcctggtgcagctcaagtgc 

cagctgggocgtgtctggtgaaggtggagaccctggttgggcctggcgcaggtgttcctg 

gagtgt cagattcttctcaatcaagtcctggttttg cactgaaa g ctcctttcacggctgtg 

cggagctgctgcagggctgaatcccttcgctgggcctcctctctcagggcct 

cttcttctrgagccacttcct^crctaggctctgtacttgtgagcgcaactgatccatctrct 

GCTGCTGCTTCTCCACAATCTTTTGGAG CGAATCG T ATCG CTTCTG CC AG gactcc atttcc 
GGAGCAGAATTCTGCTTTCTCTCTGCXTCAGGCTTTCCAGTTGAATCTC 

AAGGGTGCT CCTCGCAGATGGAACAAN 
>199

._ T _ ^ r N^CTGGTGTTTGTC^ 

^ GCG ^^~ f " ' 1 ^gggggcctgtggcccacagacaatgttctaaccgggtagtggt 

GCTCTCTrACCCOGGTGGTCAGATAACCAGGGGGCCAGGGTCC C I 1CI » I TGGCCCCTGTA 
GCACACCCGGTACCCTAAAGCTTGAGCCGTTAAAAGCT gtacttgtaactgggtacctcgtg 

^actgtcaaggaggccagagcaggaaagggaaaggaataaccoccaocacccccaacacaa 
gagaggcacaaattagagggctgggcacaggctgtagccctgggtgagggggtaagcagc 
ttgacagttgctctgtggtctctgggatataattct gcccaaggct agaaccacagagaaga 
gtttgcactcttaagtccaggaaggggu^acctggaaggcctgagaacaaaggagaaag^ 
ttagg^ctaaacacatggccaggaccctagggacacaaggcagctggagagtgggatct 

^ g ti aaa i ggcatggtaggc * ga ^ 

AG I A ?^ AC £T G ^ AA 5* iAACA ^^ 

GGCCCCTCTGATGAGCTCTCT AATCAGCAGGACCAAGGTGTGAAGTGGGAATGAACATGGA 
^^^^■T^^^l^TI^SiiZSS^^**^^*^^ 13 < ^^^^^ GC CT^TT n CCaTCTCrr CATGT CAGCCTAGG 

^3C25^S^?^S^!5I3^^^^^^^^^^^^*^^^'^^"' n ^^^^^^"' T ^^^ i ^'^'* C3TTGOQTT" ^GGACACAC3TT 

CCCTAATCAGCCCTTGGCOCCCAGATG^ 

AATAGCAGACAAGA^^ 

GGCAT AI CIGI CTGACA TGGTGGTCCTTAAGTCCTCAATGTCACGACGCAGCTRTTGAAfyrT 
UllCTAGi 1 1 CGI O I IGATCACATCTGGCTTCTGCAAATCTAGCTGAGTCTCTGGGTGCTGTG 
AGTGAATG<XXAGGAGCAGGGAGAGATTGGGGTC>TGGCCCTGGG^^ 
TGCTACAGACAGCCTGCAGATCTTGAAGGCAACTGGCCAACTCG^ 

CG . C * GC1 ^ G 5 AGGGC 7^^ 

CTjrcrTCTrcAGCCACTrcCTCCTCTAGGCT 

GC I G i* G £ rT J£ TCC i^ 

ACTTGTGGAGCTTCTCCACTGGTCTCCTCCACAGACTGTTT 

GGAGGAGAATTCTGCTTCTCTCTGCCTCAGGCTTTCCAGTTGAATCTCCTTCTCTCTGC 
TTGCCTGGGTCTCCTCC AGCAAACTCTCCAGCTCAGTCACTTTCTCCT GCAGCT CTGTGCTC

TrCAACTCAGAATCCTrAAGCTATATTTAATTTCCTCAGCAAGGCAATTTTTACTTTCTGCAGA 

AAGGGTGCTCCTCGCAGATGGAACAAN 

>200 

nnatgtggttacgacccactgtattgaggtgacgcgatccataggctgtggtgttt 
gttttcgctgatcc acac aaacgttggggcactgtctattcatgtgttcaagctgaaggctc 
gttctcggttgtcattttacagtgtttttctacggggttacattacaggaatgttgtagcgac 
gtttagcc cgtgg agtatcaaggtcttgagactccgtgtgagactccctggtttgttccaca 
acagtgtgtttttagattccgtacctttgattaaggaacacatcatgccgtgaagccaattta 
tattcrgcaattccgtagtgcatgtaatgtattctgccgtctcgtagtgtgaagccatgcttg 
gcacatccagttctttgatgtctggctgccttctgcgggccaactgtcttgtggaattcgtt 
gctcccagagatagcttgaactgcagatcccgcacagcattgcactgagctgtcgttgtatc 
I gaggctggacatggcggc <^<^tac^ 

GCTCTCCATTCTT CTT C I I GGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 

ccttctgaggatgatcgatagataaacacacctcctctgaaccatccttgggcttcatgggg 
ttggcattgaggatccctacgacagtcccctgctccgtcttccagagcgctttgtgaacttc 
tccaaat^agaacaaggacacacattgtgtcaggtcacgaagatcattc^ 

GAAGGTTrTTCCACTATTCAGACTCTGTGGCGTAACCTTCTTCAATATAACCCCAAATGTCAC 
CCAATCT ATTTCTTCCAGCTTCTCTCTGGCCATCTTTTCCTTGATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTGATGTGTCTTCGTTCATATTCTCTGGAGGATACTCGAGCGCGCCTCGAGC 
CGCAGACCAGGAGAAGGCTTCCAGACAGATGGCGATTGAGTCGTTTGCTCACAGAACTTTC 
ACTCGGGGTCCAGCACATATTTGACCTCTAGTTATCCCACTAGGTTTGTrCCG^GAAATCGT 
CTGTAGGGGTTGGGAGGGTGCACTTGTCATCCTTGAAGATGAGCTTTTGGGATCTGGAGGT 

gaagcctnggtgttcgagccaccctcttggttcttggtagcgcagggacatcaagctccg 
cagaaaagcatgttgactcctgaattctctgaactctcctctccttaagaggtggccgggga 

G ACTTCTCAGGGGATTTTTGCAG ACG GGCTGGGCTTGCTGT CTGTTT AATTGTTGTTACTTTT 
AGCTGCTCTrGTAAGGCCTTCATTTGCTCTTGGAAATTCCTTAATTGCT CTTGdAAfrTfrrmAT 
TCGTTTTCTCTCGCCTG<iGGGCAGGAGCAGGGAGGACCCTATTTTCAG7TGACTGTGATGC 
SSS^L 1 IC> tCI 1 CATCTGTT AAGTCCTGCATATCTCCAAAGAGAGTGGCCAGA 1 1 MUCH 
TTCGTCTCTTGTCT CT CCTGTTTCTCCATCATCAGCCTCTTCTGTATAAGATTCACCGTCGCC 
GTCGGCATCAAAG^GCTCATCAAATGCGTCGGGCTCGCCATTTTCCCGCGTCAAGAAGTTAT 

ii i^i 1 ctg^^acaatgcaaggctgactcattttcttccagcagtgcggtcagcagagaca 

GATTGTCTTCCTCXrrGATCCATGCTGTCAAGAGGACAGTTGTGACAGGAAACT^ 

GCTC ^ TGCCC * GATGAGG AATGTAGAAT^ 

ACAGCTGAACCCGCCAAAATCGGACNN 

>201

>202

^^gAgTCGACCC^^ 

CAAAGGTCAATATCCAACAGAGCCAAGCTACCCTGTGCAGCCT CCTGGGAATCCAGTATACC 
CTCAGACCTTGCAT CTTC CT GAGGCTCX3AGGCTATAGCGATGCTCCACCTGCCTACTCAGAG 
$2E[ ATCG X CC <M£ C ^ 

TTCCTGGAGCCTCTCTGTATCTTCCCATGGCCCAGTCTGTGGCTGT7GGGCCTTTAGGTTCC 
ACAATCCCCATGGCTTATTATtt^GTCGGTCCCATCTATCCACCTGGCTCCACAGTGCTGGT 
GGAAGGAGGGTATGATGCAGGTGCCAGAT7TGGAGCTGGGGCTACTGCTGGGAACATTCCT 
CCTCCAarrcCTGGATGKXCTGCCAATGCTGCTCAGCTrGCACT 

TCCTCGTAACTCAGCGGAAGGGGAACTrCTTCATGGGTGGTTCAGATGGTGGCTACACCAT 
C^TG^GAACCAAGGCCACCTTTGTGC^ 

cacaatgtaagtgctttaotcatattaacctga 
G J c rn£[GG'^^ 

cagtacctccctaaagcattttgaggtag g g gag gtatccattcataaaatgaatgtgggtg 

AAGCCGCCCTAAGGATTTTCCTTTAATTTCTCT GGAGT AATACTGTACCATACTGGTCTTTGC 
TTTTAGTAAT AAAACATCAAATTAGGTTTGG AG G GAACTTTGATCTT CCTAAGAATTAAAGTTG 
CCAAATTATTCTGATtGG^ 

GAACGCATT AGTTGTCTGCCTTTTCC 1 "I I CCATCCCTTGCCCCACCCATCCCATCTCCAACCC 
TAGTCTTCCATTTCCTCCCGCCAGTCTGCATTGAATCAATG GTGCAGGACAGAAAGCCAGTC 
CTCAGGCJj^TGGAGGGTGCCTTATGGGCCCTCCTCATAGGTTGTCTCTG^
GTttttgtcocaaggggw 



NWTCTGGTTACGACCCACTGTATTGAGGTGACfiraAT^^ 

G JT^CTGATCCACACAAACGT^ 

G iT CTCGGT J G J CAT ^^ 

ACAGTGTOTTmAGATTCCGTACCTTTGATTAA^ 

G ^ GG J37332E AC J A1 I C ^ CT ^ 
cc ^^ A J37iU? CCAG i^^ 

TTTTaSGCCeGTCATGTGTCTTCG^ 
CGGAGACCAGGAGAAGGCTT^^ 

a h c ^ g h° caccacata ^ 
SSSS^tg^^ 

GAAGCCT jl GG J G jr aiAG ^ 
c ^ aaagca j g j tg ^ g i cct 

^J^ZX^^^i^^ G ^^^ T ~ I ~'~ r ^ Gi ^ A ^ A ^ ; ^^ < ^ CTG GVGCXTG CT GT CrrGTTTTXA^TT^TT^GTTn^CCTT^T 
ACCTGCTCTroTAAGGCCTT^ 

GGGAACTTCTTCTTCATCTGTTAAGTGCTCCATATCTCCAAAGAGAG^ 
TTTCTTCTGAAT^ 

G |p£Gc^^ 

ACAGTGTGTTT7TAGATTCCG7 ACCTTT GATT AAGGAACACATCATG^^STGAAn^nAATT^ 
TATTCTGCAATTOCGTAGTGCATGT AATGTATTCTG^^GTCTCGTAGTraTGAAG^^STGSTR 
GCACATCCAGTTCTTTGATGTCTG^^ 
GCTCGCAGAGATAGCTT<iAACTCCAGATC^ 
tgagcctggacatggcggccgaggtactcacagtcacgcaaattcacagtctgcgtgcacg
gctctccattcttct7cttg g cttt acag gtt cccagg tcaag agcttcaccc at aattaag a 
ccttctgaggatgatcgat agat aaacacvvcctcctctgaaccatcctt gggcttcatgggg 

TTGGCATTGAGGATCCCTACGACAGTCCCCT GCT CCGTCTTCCAG AGCGCTTTGTGAACTTC 

tccaa ataag aacaaggacacacattgtgtcaggtcacgaagatgattcagtttccatatgct 
gaagg 1 1 1 1 1 ccactattgacactctgtggcgtaaccttcttcaatataaccccaaatgtcac 
ccaatct atttcttcca g cttct ct ct ggccatcttttgcttgatctgagacagtctgatcag 
ttttcggccggtcatgtgtcttcgttcatattctctggaggatactcgagcccgcctcgagc 
cgcagaccaggagaaggcttccacacagatggcgattgagtcgtttcctcacagaactttc 
actcggggtccaccacatattrgacctctagttatcccactaggtttgttccgagaaatcgt 
ctgtaggggttgggagggtgcacttgtcatccttgaagatgagcttttgggatctggaggt 
gaagccttt ggtgttcgagccaccctcttggttcttggtagcgcagggacatgaagctcgg 

CAGAAAAGCATGTT GACTC CTGAATTCTCTGAACTCTCCTCTCCTTAAGAGGTGGCCGGGGA 
GACTTCTCAGGGGAI 1 1 1 I GCAGACGGGCTGGGCTTGCT GTCT GTTT AATTGTTGTTACTTTT 
AGCTGCTCTTGTAAGGCCTTCATTTGCTCTTGCAAATTCCTT AATTCCTCTTGCAACT CTT CAT 
TCGTTrrCTCTCGCCTGGGGGCAGGAGCAGGGAGGACCCTATTTTCAGTTGACTG 
GGGAACTTC 11 C H CATCTTGTTAAGTCX^CX^TATCTCX^^GAGAGTGGCCAGATTTTCCTT 
TTCGTCTui i u i CTCTCCTGTTTCTCCATCATCAGCGTCTTCTGTAT AAGATTCACCGTCGCC 
GTCGGCATCAAAGAGCTGATCAAATGCGTCGGGCTCGCCATTTTCCCGCGTCAAGAAGTTAT 
TTTCTT CTGAA TTACAATCCAAGGCTQACTCATTTTCT 

^TrGTCTTCCTCCTCATCCATGCTGTCAAGAGGACAGTTGTGACAGGAAACTTCGAAGGM 

GCTCAGATGCCCAGATGAGCAATGTAGAATCrrrGGCCGATGCCGGGACGCCTTCTTCGTTC 

ACAGCTGAACCCGCCAAAATCGGACNN 

>206 

NNATGTGGTTACG^CCCACTGTATTGAGGTGACGCGATCCATAGGCTGTGGTGTTT 

gttttcgct^tccacacaaatottggggcactgtctat^ 
gttctcggttgtcattttacagtgtttttctacggggttac^ 

gtttagcccgtggagtatcaacgtcttgagactccgt gtgagactccctggtttgttccaca 

acagtgtg 1 1 1 1 « agattcxx3ta<xnttgattaaggaacacatc^tgccx3tgaagccaattta 

tattctgcaattocgtagtgc^tgtaatgtattctgccgtctcgtagtgtgaag 

gcacatccagttctttgatgtctggctg^^tctgcgggccaact 

gctcccagagatagcttgaactgcagatcccg^acagc^ 

tgagcctggacatggcxsgc cgagg tactcacagtcato 

gctctccattc mci i ci i ggcttracaggttcgcaggtcaagagcttcacccataattaaga 
ccttctgaggatgatcgat agat aaacacacct cxttctgaaccatccttgggcttcatgggg 
ttggcattgaggatccctacg^cagtcccctgctccgtctt ccagagcgcttt gtgaacttc 
tccaaataagaacaaggacacjacattgtgtcaggtcacgaagatcattcagtt^ 
gaaggtrtttccact attcac^ctctgtggcgtaaccttcttcaatataaccccaaatgt(^ac 

CCAATCTA MICH CCAGC 1 1 U I C I CTGGCCATCTTTTCCTTGATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTCATGT GT CTT CXTnT^ATATTCTCrGGAGGATACTCXSAGCCCGCCTCGAGC 
CGCAGACCAGGAGAAGGCTTCCACACAGATGGCGATTGAGTCGTTTCCTCAC 
ACTCGGGGTCCACCACATATTTGACCTCTAGTTATCCCACT AGGTTTGTTCCGAGAAATCGT 
CTGTAGGQGTTGGGAG^TGCACTTGTCATCCTTGAAGATG 
GAAGCCTTTGGTGTTCGAGCC^CCCrCTTGGTTCTTGG^ 

CAGAAAAGCATGTTGACTCCTGAATTCTCTGAACTCTCCTCrrCCTT AAGAGGTGGCCGGGGA 
GAG I \ C I UAGGGGATTTTTGCAGACGGGCTGGGCTT GCT GT CT GTTT AATTGTTGTTACTTTT 
AGCTGCTCTTGTAAGGCCTTCATTTGCTCTTGC CAT 
TCGTTTTCrrCTCGCCTG^SGGGCAGGAGCAGXaGAGGACCCT ATTTTCAGTT GACTGTGATGC 
GGGAACTTCTTCTTCATCTGTTAAGTCCT 
TTCGTCTCTTGTCTCTCCTGTTTCTCCATCATCAGCCTCT^ 

GTCGGGATCAAAGAGCT CATCAAATGCGTCGGGCTCGCCATTTTCXXXSCGTCAAGAAGTT at 
"1U1 CTGAATTACAATCCAAGC CTGACTC A 1 1 1 1 C M CCAGCAGTGCGGTCAGCAGAGACA 
GATrGXCTTCCTCCTCATCCATGCTCT 

GCTCAGATGCCCAGATGAGCAATGTAGAATCTTGGCCGATGCCCGGACGC^ 
ACAGCTGAACCCGCCAAAATCGGACNN 
CGCG GTGGCG 6CC GCCOG GCAGGTACATGGTTCTTCCTCAGAAAGTGGTTCTTCCT
TAAT GTGIIICIIinACCC C I MIC II CHC ll C riCACAGAT bM ICH C I ICM C m CCACT 

1 1 mci rci i cctcttcttcaactgaatagggtaagtgtaaaggcacaacaaat7aacactgta 
tcagatctcattccttccaaaaacgtttgagtcct agtttttttct gt cattctcatcaact ac 
ccaatgtttgttttgttt atttt ataattgggaag g ttctccaag gcct accact aactttaac 
gaatgatatagatagagctcagagcaatcttctcacgatcatgaagtcatgtataaaaatcag 
gattaaaacaaaggtcatctgatctccaatcattattgggaagaaagtcaattatattagaaat 

GGTTAAGAGCTTGCACTCTGAAGTCAGACGGCCTGGGTTT AATCTACCT GCT GCAACOCTGA 
AAAATTGTATTTACCCTTGGTGAAGCTCCCT atct ATAAAACTT aagaatgtcttatcttactg 

gactgttactgatttaaaaagatgatgcatagaaagcactaagtataatgcttagcacacatt 
acaataaggactcaacacatagctatcattagacattcagtgaccagctgggtgcagtgggt 
cacxscctgtaatctcagcactttgggaggctgaggcgggaggatcacrrtgaggtcaggata 
tcaagaccagcctggccaacatggtgaaatcctgtatctaataaaaatacaaaaaagttagct 
aggcatggtggtgcatgcctgtaatcccagttattcaggaggctgaagcacgagaattgct 
tgaaccc^ggaggcggaggctgcagtcagccaagatcacg^xactgcacttcagcctggg 
cgagailagggagactctgtctcannnnnnfnnnnnnnnnnnnnnnnnnnnnnfn 

>208 

oscggtggcxsgccgcccggcaggtacatggttcttc^cagaaa 
taat gtgi i ici 1 h iaccc c i i mci i ciic i ic i ic acagatgi i jci ici icm ctgccact 
mrci rcTTCCTci i ci icaactgaatagggtaagt gtaaa ggcacaacaaattaacactgta 

TCAGATCTCATTCCTT CCAAAAACGTTTGAGTCCT AG III III I C 1 GTCATTCTCATCAACTAC 

ccaatg morm gtttattttataattgggaaggttctccaaggcctaccactaactttaac 
gaatgatatagatagagctcagagcaatcttctc>cgatcatgaagtcatgtataaaaatcag 
gattaaaacaaaggtcatctgatctccaatcattattgggaagaaagtc^ttatattagaaat 
ggttaagagcttgcactctgaagtcagacggcctgggtttaatctacctgctgcaaccctga 
aaaattgtatttaoccttggtgaagctccctatctataaaactt aagaatgtcttatcttactg 

GACTGTTACT GATTTAAAAAGATGATGCATAGAAAGCACT AAGTATAATGCTTAGCACACATT 
ACAATAAGGACTCAACACATAGCTATCATTAGACATTCAGTGACCAGCTGG 

cacgcctgtaatct cagcactttgggaggctgaggcgggaggatcacttgaggtcaggata 

TCAAGACX^GCCTGGCCAACATGGTGAAATCCTGTATCTAATAAAAATACAAAAAAGTTAG 

AGGCATGGTGGTGCATGCCTGTAATCCCAGTTATTC^ 

TGAACCCAGGAGGCGGAGGCTGCAGTCAGCCAAGATCACGCCACTGCA^ 

CGACAGAGGGAGACT CTGTCT CAN WNNNN NNNN WN UNHHHHNUHHHHtiHHN 

>209

N>«TCGCCATTGTGACCCGAGAATTTAAT AT ACTTGTG CCCCGCGCG TGTTTTGTT A 
ACCGTCCGCCCGAGTGTCCCCTCAATTOT 

CCATCT GGT ATTCTCCCCT AGAGCACt^GATGACCATCTGAAGCTGGACAAGTGTCTCTAAC 
AAT AAAACATT ACTGTTT ACAAAACAAAAAGCACAAACAT AATT ATGGAAT AAAT AAAAAACAA 
GGGACAAACAGCCAACTGACTCTACCCACTTGGTO ATTTTTTT 
AATQCTTCTGAAAG ITTCTI GG COCACAQAQGACT AGGGTGCAATCATTCCCTQTGTTAGTQ 
AGTTGGGTTT AATGCAGCTTCAAAATTAGGGTAAAGGGAC 1 1 GG I UAAATGTTTACATTAAAT 
ATTTCACTCCT ACCCATTCTTC ^GGAAAAAAGGTGAGCTCAGCAAGGCTGGATGCCATTAAG 
AGAT ATTTACTG 1 1 1 TCI 1 1 1 1 C I ATAGCTAAAAAAG^AACTTTACACGAAGAAGCTCTTGAT 
TAAGGAAATTTCAATAGATTCATATTTATAAAATTTTAAACATTTGGCACA 
AAAATGGGGGAGAAAAAATAGGTCTQGTTGTI G l CXaXXTTTTTOA^TGCrOTGGACAG 
TGATGAGATGCTCAC^GAAGAAAAAGGCCTGGCTTTGTACCAGGCT GGCGACAGGTGCTAC 
CAfiGAGTGGGCTGAGCGGAGAAAAACTATCTCCCACTCTrTTGG 

ACI I CCACATTGCCTGGOCCAC 1 1 CC I UAGCAACCCCAGGTTCGGCTCTGTATAAGGACCCT - 

CCCCTCCCAACCCCAACCCCAGAGTGCAGTGCAAATCAACCAAC^^ 

GCAATCA AAGGAA ACAGTTA AACAO^ ACAATTTCTTAAAG 

GTTGAACATTTTTCGAGT G T G IT ri 1 1 1 CAAGTGTAAAAGCAGTGACA I 1 1 T U fT C AAACAGAA 
GCAGCATCT AG<1AATTCTCGCACTTGGGTTCTAGGGGGTT ACAGGT ATGCATCATGGATTCT 
TCTCCCTCGTATTTAAAAAGGCCTC»TGTTT^ 
TCTCCCCTCTAGCGGACAGTGGGTGGCCAGCCAGCCTCCCT 
AGCAGACATCCCTCATTGACCTGCTGXSGCTTGCTTTCT^
GAGAAAGAAACTTACAAAAGCACAACCiACCAAAGGCAGCCT
ACTGAGTTGCTGGAGACTACCCTCTC^GTCCC^ 

aatttgaatcaaacatgggggag gacacctgccaagcaatagtatgatggactcaagtcatc 

CCAGGCCATCT ACTGTCTCCCTCACCTGtXXTr AACCTTTTCTGAGTCOCT CCCTCTCTTGTG 

CAAGCACTGTAGTTTAAAAAGGAAAAAACACXXXXX>CCATTAA^ 

AC IE AGTATATAAAG ^S<^ A 6GCTAAGGT^ 

A I C 7. C Z G I GCGAGCC ^^ 
CA ^TCT G AGCCAC^TC^^ 

AG ° T 5 G £?I G , CTGC " C 1 ' J C I CT CTGCCCCT AGTCCAGGCTCCTTTGCTTCACGTAAG ATT 

aa cactttcccattcct ctgaagttgctggaaggaca7ttcccaggaagaaacaattcctca 
ggca^tggg^tggcajSaatc 

G< ^9^ G ^J^^^^ T( ^^^^GCC^TCTG7GJC nC T TI C CACCT C I ICC1 1 C AGTCTCAA 
^^ACTTGTArcCCCAGCTACCTCC^ 

agc ^ ggc i7i[ ggg t ac ^ 
aac II agcca I g T c tcaa^ 

tatggtgacag acccgtatcatcttaaagtatgttcatagtt aaggcttgacttaagaaaata 

AGAGAACCAGA f*^^ 
c I caca g tc ac^^ 

AACAAGAGTTT ACAACTCCT CA7GGC7TCTT AATAGGTGAAGTAGGTGAAAAGTCTGAGAAG 

g CACAGGAG g G J7T G i*^^ 

CCAGCCCCACT ATAATTGGC^GTATGTTTGTTCATGTTTCCTGAAAACA I HI C I 1 1A AAAAG 

GAAA ^ GA ^ AAA ^ TG <^ 

IgCTTCCCTGAMTAATTA^ ^ 

CTGCT CAACCTOCCAACCATGCCG 1 1 1 ICCCI 1 1 T ATGTG TATCT CTTGGAGT AAAAT AAATT 
CA .T7 AA J GGCTTT ^^C^TACAAATACAATAGAAAAGAAAGAAGTCTGGAACCTGACTATCAT. 

GG f* GC ^ AAAAG 7 A JOT 

GAAATAG AGGCT CCTTCTCCTTTGAG7TCAATACCCTCCCT GTGTCCTACTCACCAGGAAAAT 
>210 

™™^ nACGACCC *^ 

G IIII CGC I GATCCACACaa ^ 

G JTI AG £2^I GGAGTATC ^^ 
^SJS5S A ^SS G J A ^ 

GCACATCCAGTTCTrTGATGTCTGGC I U X! I I C I GCGGGCCAACTGTCTTGTGGAATTCGTT 

GCTGOCAgAGATAGCTTGAACTGC^^ 

TG A GCCTOGACATGGCG^ 

GCTCTOGATTC t ICI ICI I GGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 

TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTO 

TCCAAATAAGAACAAG<^CACACATTGTGTCAGGTCACa 

GAAGGTTTTTC^A^ 

S£ AA I C T A JLV^! '^GCTTCTCTCrrGGCCAlGI 1 1 I CC I I GATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTCATGTG7OTCGTTO 

gJG^GGGTrGGGAGGGTGCACTTC 
CAGAAAA^GTTGA^ 

AGCTGCTCTTGTAAGGCCTTCATI IGC I C I IGCAAATTCCTTAAl ICCIC I I G CAACTCTTCAT 









TCoi i i iCTTJTCG<X^TGGGGGCAGGAGCAGGGAGGACCCT attttcagttgactgtgatgc 
GGGAACTTCTTCTTCATCTGTTAAGTCCTCCATATCTCCAAAGAGACT 
rrCGTCTCT I GTCTCTCXrrGTTTCTCCATCATCAGCCTCTTCTGTATAAGATTCACCG 
GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCA 1 1 rrG CCGCGTCAAGAAQTTAT 
TTTCTTCTGAATTACAATCCAAGGCTGACTCA I 1 1 ICI I CCAGCAGTGCGGTCAGCAGAGACA 
GATTGT CTTCCTCCTCATCCATGCTGTCAAGAGGACAGTTGTGACAGGAAAC^ 
GCTCAGATGCCCAGATGAGCAATGTAGAATCTTGGCCGATGCCGGGAC^^ 

acagctgaacccgccaaaatcggacnn 

>211

NNATGTGGTT ACGACCCACTGTATTGAGGTGACGCGATCCATAGGCTrGTGGTGTTT 

gttttcgct gatccacacaaacgttggggcactgtctattcatgtgttc^agctgaaggctc 
g ii c t cgg ti gtca1 ™ ac ^ 

gtttagcccgtggagtatcaacgtcrttgagactcxxjtgtgaqactccct 

acag i gtgi ' ' ' ' a gattccgta(xrrttgattaaggaacacatcatgcc<n'gaagccaattta 

tattctgcaattccgtagtgcatgtaatgtattctgccgtctcgtagtgntgaagcca 

gca£atccagttctttgatgtctggct 

GCTCCCAGAGATAGCTTGAACTGCAGATC«X5CA^ 
I GAGC H GGACATGGCGGCCGAGGTACT 

GCTCTCCATTCI I (J J 1 CT1 GGCTTTACAG^TTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 
CCTTCTGAG^TGATCGATAGATAAACACAOT 

J3S GCA 27 GAGGATCGCTA ^^ 

TCCAAAT^AGAACA^GACACACAT^ 
GAAGGTTTTTCCACTATrCACACTCTCT 

CCAATC TAI I ICI ICCAGCI fCTCTCTQQOCAT Cl 1 1 ICCI IG ATCTGAGACAQT'CTGATCAG 

TITTOGGCCGGTCATGTG I C I [CGI I CATATTCTCTGGAGGATACTGGAGCGCGGCTCGAGC 

CGCAGACCAGGAGAAG<^^ 

ACTCGGGGTCCACCACATATTTGACCTCTAGnTA 

CTGTAGGGGTTGGGAGGGTGCACTTCT 

GAAGCCTTTGGTGTTCXjAGCCACCCTCTTGGITTCTrGGT AGCGCAGGGACATCAAGCTCCG 
^^^^^^^GTTGAJ^TC^GAAJJCrrCTGAACrr CTCCTCTCCTT AAGAGGTGGCCGGGGA 
GACTTCTCAGGGGATTTTTGCAGACGGGCT GGGCTTGCTGTCTGTTTAATTG ITGTI ACTTTT • 
A P CTG ?^ G T A AGGCCTTCAI ' 1 GC I C M GGAAATTCCTTAATT C C1 €1 1 1 GCAACTCTTCAT 
TCG i n iu . ^TCGCC TGGGGGCAGGAG<>G<KaAGGACCCT ATTTTCAGTTGACTTrraATfir 
GGGAA£nCTJCITCATCTGTrAA^ II 1 1 CC 1 1 

JTCGTCTw 1 1 u i CTC I CC I U 1 1 1 CTCCATCATCAGCCTCTTCTGTATAAGATTCACCGTCGCC ' 
G JS G ^I CAAA ^ AG ^£ ATCAAATGCGTO 

TTT^CTGAATTACAATCCAAGGCTGACTCA 1 1 1 IC I fCC AGCAGTGCSGTCAGCAGAGACA 
GJCTTGTCTTCCTCCTCAT^ 

GCTCAGATGCCCAGATGAGGAATGT AGAATCITGGCCOATGCCGGGACGCC 1 1 C II CG 1 1 c 

*CAGCTGAACCCGCCAAAATCGGACNN 

>212 

GCACCACACCTGGCTACTTTTTAAAI 1 1 1 1 1 I H I I C I G T A GAGACGAG G I CI I I C I ATGCTGT 
^CAGGCTGAACjrCATGGGTTT CC I TG 

A I A ^ CAG ^ AAG C0CTCA G^ ^ rTTTCCTTGTCI IBAGATTGCTCTGGA 

ATGGAAATT AGGC 1 1 1 1 MGAAGGTGTGACCCI 1 1 1 Ital ICATTTCTTrj^r^rri^Afri-ri-rr^ 
TTTTTT^TGTTTGACACAC^ 

TTGCTCT GTT AAGTGTGACACTGTCCCTTTGAGAATCTGGCGACAGCTATGTATCC^ 

CACACACCCCAAAAAAAAAATTTATGTCTGG 1 1 CCAGGAGTTACCTTTTATGAGAAGTCCATC 

pTGAAGACCCTGGATGTTCAGAGAACT^^ 

GGGCCOGGTGGCTCATGCTTGTGG^ 

CT<5AGGTCAGGAGTrCCAGACCAGCCT^^ 

>213 

>214

CGCQGTGGGGGGGGMjGTACATGCCTAGAX&TAGJCGGAGCT ACTCGGGAGGCTG
^™ GAG ^ CGmGAACCCAAGAG ^^

S^PP^PPT^tgacagagagag^ 

JIa^I^S^ cag i a i aa ^ 

AAGAAACCCAGTAAATGTTTGTTGATTGAAAGATATTAATACTCTTGC^ 

GAAAAAGGTATTAGTATTGGCTTTTCACAACGCCTTGT 

>215



AGTGr?S G ^^?2S A i CA ji A 5 A ^^ 

^ T9^^^*^^^^^^ GG ^^^ GGG ^^^^ A GAG T AAG AG TTTT AGCTGAATGAGTGGC 

(^CTCATAGAGAG^TTGCATTTCTGGCTTCX^CTTGCAGCCATAG^ 

TCTAACC^GGGGGGGTCCAATCTTTTGGCTTCCCTGG^^ 

G I c JT??ii ( ^ A ^ TAAAA 



CCAAAGGGAAGTCTGTGCTCTCATCAGATGTTC<rrTCAGGAACAGA^ 

G ^ CG ^ CA Z G ^ GACA ^ A I CA ^^ 

pCAGGATGTGCGAAGGCTrC^^ 

c SS^ G ^f AG J^ AC ^ c 3 A i ST ^ 

S^C^T^AT^ 

i^7 AG ^^ A ^I A 5IS eAA ^ TCGA 

T CA J AA I9^ G J G ^SF reGOTGAAG ^ 

A II G I!3^ CAA 7 A ^ G I CCAAGC ^ 

! c Z GGG 7I rGAA !5^ CACC ^ 

AGG ^ C ^ T S A A A J?^ C 4 AG ^^ 

ATGGATATGTCTATTACTCGGCTTCTTAGCA C I C ! I ACCCCCAACGTCC 

ACAGAGCT GGATGTTOCTCACAATGTCCAAGTGGCTG^GT^ 

GGT^GAAATGGAATACTG^^ 
AG ^^ AA ^ A ^ AG II^^ 

SSS^i^SE^^3SS A T°^^ 

AA CCCCAGACACATGGAGAAGTTCTGW M J IUI 1 1 1 G G0GGATAGTO0CAGATGAGGTTCCAQ 

C^AGCTGGGATCAC^ 

GCCAOCACTG^CraTA^ 

CTTCAGAATTCAGAAATAAAATAATTCATCT^ 



>219 
>220 

«^GSGCr^^^TG^«C^^ 

AGGAGTCAGGTTGAAAATGGAAAGTATTTATTGJ^ 

AACCATGTTGAAGTCAAAACTGCAGCTGCTACCATGGTGTO^ 

TTCOiAGGAGGCATCTCCAGGAAGATGATGG^ . 
CACWTCAGATCACIAA^^
ACC iGCittll I GGCCAATTTAl CC I ACAGGTC7TGGACGGTGGGACCTCTTCAG AGAAGATC
TGGT AAG G TGAGCAGCACAGTG GCCAT G G AAAAAGAAAAACT CT ACA G CAT ATTTCCGAGG A 

tcaaggacaagtccagaacgagatcctctcattcttctgtctcggaaaaacccaaaac^ 

GATGCAGAAT ACACCAAAAACCAG GCCTGGAAATCTT ATGAAAGATACCTTAGGAAAGCCAGC 

TGCTAAGGATGTCCATCrrTGTGGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGT 

AGCTGCAAGTTTCCGGTTTAAACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTG^ 

TGAGTGG CT AGAATTCTTCT ATCCACAGCTGAAG CCATGGGTTCACTAT ATCCCAGTCAAAA 

CAGATCTCTCCAATGTCCAAGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAG 

AGATTGCTGAAAGGGGAAGCCAGTTTATTAGGAACCATTTGCAGATGGATGACATGACCTGT 

TACTGGGAGAACCTCTTGAGTGAATACTCTAAATTCCTGTCTTATAATGT AACGAGAAGGAAA 

GGTTATGATCAAATTATTCXX^AAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCAT 

AGTCCTCTTTGTGGCAACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGC 

ACXn"ATACCTrGAATATCTGCTATCAAGCCAAATACCTGG7TrTCCTTATCATGCTGCACCCA 

GAGCAACTCTTGAGAAAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTT 

tggatgaataaggaccagaaatcgtgagatgtggattttgaacccaactctacctttcattt^ 

CTTAAGACCAATCACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAA 

ttgactgtgtccatgtgatgatgccctttgtcccattatttggagcagaaaattcgtcatttg 
gaagtagtacaactcattgctggaattgtgaaattattcaaggcgtgatctctgtcvxctttat 
tttaatgtaggaaaccctatggggtttatgaaaaatacttggggatcattctctgaatggtct 

AAGGAAGCGGTAGCCATGCCATGCAATGATGTAGGAGTTCTCrTTTGTAAAACCATAAACTC 

TGTTACTCAGGAGGTTTCTATAATGCCAC^TAGAAAGAGGCCAATTGCATGAQTAATTATTGC 

AATTGGATTTCAGGTTCGCTTTTTGTGCCTTCATGCCCTACTTCTT AATGCCTCTCTAAAG CC 

ACTAAAAAATAAAATAAGACTCGAGGGGGGGCCCGGTACCCGAATTTGCCCT ATTATGGGG 

CGGATTACAAATTAACTGGCCCGGCGTTTTTACAANNNN 

>221

AGATGCATCCAGGGGTCCTGGCTGCCTTCCTCTTCTTGAGCTGGACTCATTGTCGGGCCCT 

gccccttcccagtggtggtg atgaagatgatttgtctgaggaagacct ccagtttgcagag 
cgctacctgagatcatactaccatcctacaaatctcgcgggaatcctgaaggagaatgcagc 
aagctccatgactgagaggctccgagaaatgcagtctttcttcggctt agaggtgactggc 
aaacttgacgataacaccttagatgtcatgaaaaagccaagatgcggggttcctgatgtggg 
tgaatacaatgtrrrccctcgaactcttaaatggtccaaaatgaatttaacct acagaattgtg 

AATTACACCCCTGATATGACTCATTCTGAAGTCXiAAAAGGCATTCAAAAAAG 
T GGTCC GATGTAACTCCTCTGAATrTTACCAGACTTCACGATGGCATTGCTGAGATCATGATC 
TCTTT TGGAA TTAAGGAGCATGGCGACTTCT ACCCATTTGATGGGCOCTCTGQCCTQCTGGC 
TCATGCI 1 1 1 CC I CCTGGGCCAAATTATGGAGGAGATGCCCATTTTGATGATGATGAAACCT 
GGACAAGTAGTTGCAAAGGCTACAAC I rGTTTCTI GTTGCTGCGCATGAGTTCGGCCACTCC 
TTAGGTCTTGACCACTCCAAGGACCCTGGAGCACTCATGTTTCCTATCT^ 
CAAAAGCCACTTTATGCTrcCTGATGACGATGTACAAGGGATCCAGTCTCTCT ATGGTCCAG 
GAGATGAAGACCCCAAGGCT AAACATCCAAAAAGGCCAG ACAAATGTG CCTTATCC 

CCTGCATCCTCAGCAGGTTGATGCGGAGCTGTTTTT AACGAAATCATTTTGGCCAGAAGTTC 
CCAACXXSTATTGATGCTGC^TATGAGCACCCTT CT CATGACCTCATCTTCATCTTCAGAGGT A 
GAAAATTTTGGGCTCTITAATGGTTATGACATTCTGGAAGGTTATCCCAAAAAAATAT^ 
TGGGTCTTCCAAAAGAAGTTAAGAAGATAAGTGCAGCTGTT CACTTT GAGGATACAGGCAAG 
ACT CT CCTGTTCTCAGGAAACCAGGTCTGGAGATATGATGATACTAACCATATTATGGATAAA 
GACTATCXX3AGACTAATAGAAGAAGACTT(XX^AGGAATTGGTGATAAAGTAGATGCT GTCTA 
TGAGAAAAATGGTTATATCTATTTTTTCAACGGACCCATACAGT^ 
TAACCGTATTGTTCGCGTCATGCCAGCAAATTCCATTTTGTGGT^^ 

ATTGTTATTTAAATCCTGAAGAX3CATTTGGGK3T AATACFTCX^GAAGTGCGGGGT AGGGGAA 
GAAGAGCTATCAGGAGAAAGCTTGGTTCTGTGAACAAGCTT CAGTAAGTTATCTTTGAATATG 
TAGTATCTATATGACTATGCGTXMCTGG AACCACATTGAAGAATGTTAGAGT AATGAAATGGA 
GGATCTCTAAAGAGCATXTGATTCTTGTrGCTGTACAA 

CACX^ACAAATGGGACACATGGTCTI GT CAAT GAGAGCAT AATTTAAAAAT ATATTT AT AAGGAA 
ATrTTACAAGGGCATAAAGTAAATACATGCATATAATGAATAAATCATTCTTACTAAAAAGTAT 
AAAATAGTATGAAAATGGAAAT7TG GGAGAGCCATACATAAAAGAAATAAACCAAAGG AAAAT
GTCTGTAATAATAGACTGTAACTTCX^AATAAATAATTTrCATTTTGCACTGAGGATATTCAGA

TGTATGTGCCCTTCTTCACACAGACACTAACGAAATATCAAAGTCATTAAAGACAGGAGACAA 

AAGAGCAGTGGTAAGAATAGTAGATGTGGCCTTTGAATTCTGTTTAATTTTCACTrrTGGCAA 

TGACTCAAAGTCTGCTCTCATATAAGACAAATATTOCTTTGC^TATTATAAAGGATAAAGAAG 

GATGATGTCTTTTTATTAAAATATTTCAGGTTCTTCAGAAGTCACACATTACAAAGTTAAA^ 

GTTATCAAAATAGTCTAAGGCCATGGCATCCCTTTTTCATAAATTATTTGA^ 

AAAGTTGCATTTTAACCCTATTTTACCTAGCTAATTATTTAATTGTCC^ 

ATAGGCTATTTTCTAAAGACTTGTATAGCATGAAATAAAATATATCTTATAAAGTGGAAGTATG 
TATATTAAAAAAGAGACATGCAA A I 1 1 1 1 1 1 1 1 AAAGCAGTCTACT AGATTGTGATCCCTTGAG 
ATATGGAAGGATGC C I 1 1 1 I 1 1 CTCTGCATTTAAAAAAATCCCCCAGCACTTCCCACAGTGCC 
TATTGATACTTGGGGAGGGTGCTTGGCACTTATTGAATATATGATCGGCCATCAAGGGAAGA 
ACTA1TGTGCTCAGAGACACTGTTGATAAAAAC1X1AGGCAAAGAAAATGAAATGCATATTTGC 
AAAGTGTATTAGGAAGTGTTTAT G I I G 1 1 1 ATAATAAAAATATATTTTCAACAGANNNNNNNNN 



>222



NAGGGAGTCXX^CCCACGTGTCJCGCGCAAGACGCXrr(^TAGGGAGTGTAACT ATG 



GAATGGGACATTTT AGAGACT GAAGAGCATTAT AAGAGCCGATGGAGATCT ATT AGGATTTT 
ATATCTTACTATGTTTCTCAGCAGTGTAGGGTTTTCTGTAGTGATGATGTCCATATGGCCATA 
TCTCCAAAAGATTGATCCGACAGCTGATACAAGTTITTTGGGCTGGGTTATTGCTTCATATAG 
TCTTGGCCAAATGGTAGCTTCACCTATATTTGGTTTATGGTCTAATTATAGACCAAGAAAAGA 
GCCTCTTATTGTCTCCATCTTGATTTCCGTGGCAGCCAACTGCCTCTATGCATATCTCCACAT 

cccagcttctcat aat aaatact acatgctg g ttgctcgtg gattgttgggaattggag cag 

gaaatgt agcagttgtt ag atcat at act g ctg gtg ctt acttcccttcaggaaagaacaagc 

acttttaactggtctcatc^xxrttttaaactcatatttcac^ 

tgtaaacaaat ai unc i i a tcattttgcctttgctcatg atgttctcattacct atagtttct 

cttctgttctcatcattcttcatccttcaagatn^ttraaac 

ctgtaataccagctactcgggagactgaggtagcagaatctcttgagcccaggagttcgag 

GGCAGNCTGGGTAACATAGGGAGACTCCATCTTAAAAAAAACATCACTGGXSCGK^ 
AATCTCGTGACTGGGGAAACXXX^GGGTACCAAATTAAC^^ 

GCAGAGGGGGAAAGGAAAGGCGGCCAAGTGCCCCCCCAAAGGGCCCCCAGAGAAGAGAC 
AGAGGGACCATGTCAGGCCCCCCCCTAGGGGCCCCCTAACACCACACGGGGGCGCCCCCT 
CCAAAACCCCGCGCTCGCGCCCAGAC 
CACCNNNNNNNNNNNNNNNNNNNNN 
>223 

acgcgggg gggacgt aaggt ggg g cggtg aaagaagtttg ctg acgaagatggc 
gactgaggcacagagtgaaggggaggtgccagcccgcgaatccggccggagtgatgccat 
ctgcagttttgtgatctgcaatgattcrrcccttcxiaggtcagocca 
cttitrrgtggagaaactccgacatgagaaacctgagattitcactga 

ATATCACAAGGCTCATCGATTTACCTGGAACTGAGTTGGCTCAGCTG^ 

CCTT AAGTTGCCTGGCGGGGCTGGCCCAGCATCAGG A I J C I I CCGGT CTCTCATGT CT CTC 

AAGCGAAAGGAAAAAG GAGTGATATTTGGGTCCCCACT GAOGGAGGAAGGCATTGCCCAGA 

TATACCAACTGATTGAGTATCTACACAAAAACTTGCGAGTAGAGG G I I 1 0. 1 1 1 AGAGTACCG 

GGTAATAGTGTCCGACAGCAGATTTTAAGGGATGCTCTrCAATAATGGAA 

GGAATCAGGGGAATTTCACTCAAATGATGTTGCCACTTTGCT GAAGATGTTTCT AGGAGAGT 

TGCCGGAGCCTCTGCTGACACATAAACACTTCAATGCAjCACCTCAA 

CAGTTTGATGATAAAGGAAACAAGACCAATATACCAGACAAGGACCGGCAAATTGAGGCTCT 
CCAGTTGCTCTTCCTCATTCTCCCTCCTCCTrAATC^ 

CCTATACCAGACAGCAAAGAAAGAAG ACAAGAACAAGATGTCAGCCTATAACCTTGCGCTT A 
TGTTTGCA(XXXACGTCCTGTGG<XXAAAAAATGTCACTGCAAATGACCTTCAGGA 
ACAAAGTTAAACAGTGGGATGGCTTTT ATGATTAAA(2ACTCCCAGAAACTTTTT AAGGCTCCT 
GCTT ATATTCGGGAGTGTGCGAGATTGCACT ATTTGGGATCCAGAACTCAGGCATCAAAGGA 
TGACCTTGACCTCATAGCTTCATGTCATACT AAGTCCTTTCAGCTGGCAAAGTCTCAGAAACG 
GAACCGGGTAGATTCXrrGC«rTC^CCAGGAGGAGACCCAGC^ 
GAGAGAGCTGTITCAACACGTTCATGATATGCX^GAGTCAGCAAAGAA 
GAGAGTTTAATAAGCAATCATTGACCCAGACACCAGGGCGAGAACCTTCT ACTTCCCAGGTA 
CAAAAGAGGGCnTXmCGCGCTCCTTCAGTGGGCTTATTAAGCGGAAGGTCCT
AG ATGATGTCAGAAAAGAAAAAGAAGAACCCT ACTCCAG AATCTGTGGCCATTGGTGAATTG
AAGGGAACCAGCAAAGAAAATAGGAACTTATTATTTTCTGGCTCTCCAGCT GTCACGATGAC 
ACCAACAAGATTGAAGTGGTCTGAAGGGAAGAAAGAGGGGAAAAAAGGA rTTCTCI GAAGG 
ATCCAGAGTTGTCTCCTATGGTCCATGCAGAATTTTCTGTTTAGTGGGCAGGTGTTATTCCTG 



TAC<XnGAGCCCACTGATACCTCTAAO CCCACTC ACTG 

ACCAGCTTATACAGCACACT AAGGTGGGCTTCAGTGCT CACTCAATGTGTTTAGGCAGATTC 
CACTTTTGAAAAAAAAT ATGAAATGTGTG CTCAACTGCCAG T AATTTTTT AAAAAG CACT GTCC 
CAGTGGA TTGATG TTG I 1 1 I < AATGGATATTTTGGGTTTTTCTCTGTTTTGATAGTATTGGGTA 
TTTGGTT GI I I I IGT TTGTTT A I I I C I 1 1 GTTTTAAAAGCCATG I I H \ GGTTGGGCTCT AAGCT 
AGATATCTTTCCCTCI I 1 1 1 CACTTTGAGCTTTGGGAAAACTCTTTATCTTATGAGGCTGTATT 
CCTCAAT ACCT AATTTGT GT CCAAAGAATTT AT AG CTCTTCTrGG ACATTTTTT ATTATTTCTT G 

ggtgtgacatcagagtatttgacctgcagtattgaaaaaggagaattcagaatgatacagta 

TTTTAACAAATCTTAATTATTAAACTCTTTTCCTTCCTTCCATTTCT C 
TCTCTCT CrrTTCCCTTTCCTCAG TG AT GTG AAAATAATTGT GTTTTGCTGAACTT GTTATCTT C 
ATTCAATTTCCTCTTGACT AAAAC^TCTCTGGTGCCAACGTAATACTTCTGAACCACATCACT 
GTGACTCAAGGAAAGTCACTGAC^GCATAAGAGAAGTTTGCTAAAATATTTGTATGTGGGGG 
AAGCTCTGGAGTGTGCCT AGG AG GGGGCTGGCTGCCTTTATGTCCCAGGATGACTCTTTAT 
GGGTGGGATTACATTGCACCXrrCTGAGGGTGCAGGCTAGACCGTCTCCTGAGAGGAAGTTA 
GGATCAGAAAGAAGAAGCAAGC^GCAGCCTCTGCAGGGCTGACAGGATTTAAAGGAGAGAA 
TGTTCTT ATTTGGAAGCAGCTGTGGCTTGTCACCAATGTTCAAGGAGTGTTACTGTTCCGC^ 
CTCTCTTTGTCAGAAGGGACACAGGTGGT AATTTG GAGATGGGGCCAGAGCTTCTGGC 1 1 1 1 
GGATrrGGTGTGTTCACTTGTGTTGGATAGAGCAGTGGCATGGCTTrGACCTAGTATGAACT 



GCAGTTGCCCCACAAAATGTGGTTTGCTGACCTATTTCTAAACTGTTGAATATGCTGC^ 
TGCTGAAATGAAAGATG ACT CTGGGGGAGCAGAGCTTGGCCTT GTGCCCAGCTGGCAGCCC 
CCTCTGCCAG CCTTTCrGCTGCTTTTGCT GCTGTAACAGCAATAGTGGAGAAAAATGT AAAAT 
TTGG TCTTO CAGCTTAATG CAGT GTGAACAATAGATGGTTAGGAAAACAAAACTGCTTAGAA 
GCCCCTTTCTCTAGAGCAGTTTTATGrrCATITGTAAAAACACATAT^ 

T AGGTTTCT ATTCAAT ATTTGACTTTTTTTTTCTTATT AA GAAAATGAAATCCCTTACACCAGAT 

ATCAGTTAATTCAAACAGAAAACCCTTTGGGTATCANNNNN 

>224 

agtCQactcacgcgtCCGGGAAGCGGGGTGGGAAo&BgGgAGGGGATGGG GTTGACCGAG 
CTGGAGAAAGC<nTGAACTCTATCATCGACGTCTACCACaAGTACTCCCT GATAAAGGGGAA 
TTTCCATGCCGTCTACAGGGATGACCTGAAGAAATTGCTAGaGACC^ 

TCAGGAA^uAAGGGTGCAGACGTCTGGTTCAAAGAGTTGGATATCAACACTGATGCTGCACTT 
AACTTCCT GGAGTTCXTTC^TTCTGGTGATAAAGATGGGCGTGGCAGCCCACAAAAAAAGCCA 

tgaagaaagcc^caaagagtagctgagttactgggcccagaggctgggcccctggacatgt 
acagactctcattttatgacgtatcctactgcatcaggacatttgtgt 
aggggaaatgaaagtgatgagacgatgagaggagtgaaataccaaggacgccatactagga 
aacccagg tctat ttgttattcagagtaaggatcaagccagatagcctgttatgtaatttctcc 
gataaaagattttgaaagcaggtgctgtg ggcat ctgtatggg gaatcgcactcat agaatt 
attttcatttgtaaatatttggtatcaggccaagcaagggaaagaagctttact qtattaccatc 

TTTccTggaaaAGATTGAI 1 1 1 1 CT CTCTCCCttaggGGATATGAggTATGATACCTGCaaocAAAAT 
AAGCT GgqQOaaGTGCTCTCTCCTTAACTATTGTCCAAGCAATGTACATCACT 
GAGTGACCATA C 1 1 1 1 MC I I IGCTGCTTGG1 1 1 1 1 C I ATCACT AAAAAGCAAATTTAGGTGGA 
AGATGGATGGGTAAGTCCTrTGTCCTTGTCAAAAGAATTTAGAAAGGGTAAAGGTGGT GGGA 
TTTGAATTCTTTTAAACGGTTTATTCATTGGAAAGG<>AGTGAGTGGC^ 
CrrTGGGAATTTTTGAAGGAATTTAAGAGCTGT ACTT ATTTTAAATAATGAGGAAACAATCAGA 
GAGACTT AACCAG GGTGGCAACTATGGAAGAGTGGGGGTAGAATCCAAGTCTACAGTrCTA 
CGGCACTGCTTTGTTTAI III 1 1 UAATCAAGCACACATGAGGGATTGCTTAATAAATTCAAGA 
GTTTTCT ATATAGTTTGGGATTACT ACAGACACAGGTACTCT AGCCATGTGACT ATTTCAGAT 
TAGTGTTCCCATTGCTCTCCATCCTATrTTTCATATCTAAATGTC 
TTGTAAAGAAAACCAAAAAGA6CCATGCCCAAAAGAATGAGCATGAAATGAAGC
TATTTAGAAI 1 1 CM CI I CI 1 1 II 1 1 I I I I 1 1 I 1 1 CCAAAATTCTGAGTTAAGAAAGTTCATTGCC

AACAGTTGAGGAT AAAT AG CTCTATTG ATTT AGGCAAATGAATGA6AAG ATTT AAACATCATC 

AAGACTCCCAAAGTTGGTGTAATAAAAGTTGAGATGTTTTGCT AGATGCTGAGAGAAAAGGC 

ATAAATAGGCCATrTGTGCTCCTGTAAACAAATGAAATTTTGTAATGATCAAGAGACAGCATG 

ATATTATAAAACACACTTTGGACTCATCGAGACXJTGGCTCAGAGTTGTAC^ 

CAGCATTTGGTTAGACCTTAGAGAAATTTTATAACCTCTCTAAGCCCA I I I I C I 1 1 A TCTGTAA 

ATAAAGGAAAAAGATCTGTTrGGATTTGGGACAGGAGAGrrGTCTGTTrTATAGGGTAATGTT 

TATGAAAATAATTTATCACCTAGGTACTTAGTAAATACCAGTTTCT GTCTCT GTTTTTCTT caa 

AGCTTTGTTGACAATGTAGGATATTGCCAACTAATCCTTTGACTTGAGCGC^ 

CAATCCAGGCAGCATGAAGAATGATCAAGAAGGGGACACCTAGTGGt^GTrTGACCTTTGAG 

GATCTATCTAGGATCAGGTTACTGATTTGGGACCTCTGAATCAGTACATCATAACATCGAGA 

GTGATCTCTGCCAAAATTAAAATTCTGTCAGAACAGTATAAGCAGAAGGAAGACATTAGGTCT 

TTCTGAAATATGGAGCTGATAACTTCTATCAGTTCTrGAATTCrrGTACTGTCCCA 

CATCTATATTGTrGCATGTGCTTAGCTTTGTGCCTTGAATTGCTATTTC C I I 1 1 ! ATCTTGCAC 

TCAGATGGTATTTTTAGAGATGTTrCTTCAATCAAAAGAAGATAAAGTTAAAAAAAAAAAAAA 

>225 

NNATGTGGTTACXSACXX^CTGTATTGAG^TGACGCGATCCATAGGCTGTGGTGTTT 
GTTTTCGCTGATCCACACAAACGTTGGGGC^CTGTCTATrCATGTGTTCAAGCTGAAG GCTC 
G7TCTCGGTTGTCATTTTACAGTGTTTTTCTACGGGGTTACATTACAGGAATGTTGTA 
GTTTAGCCCGTGGAGTATCAACGTCTT GAGACTCCGTGTGAGACTCCCTGGTTTGTTCCACA 
ACAGTGTGI 1 1 1 F AGATTCC^TACCTTTGATTAAGGAACACATCATGCCGTGAAGC<3AATTTA 
T ATTCTGCAATTCCGTAGTGCATGT AATGTATTCTGCCGTCTCGT AGTGTGAAG CCATGCTT G 
GCACATCCAGI I CI 1 1 GATGTCTGGCTGCCTTCTGCGGGCCAACTGTCTTGTGGAATTCGTT 
GCTCCCAGAGATAGCTTGAACT GC^GATCXXXiCACAGCATTGCACTGAGCTGTCGTTGTATC 
TGAGCCTGGACATGGCGGCCGAGGTACTCACAGTCACGCAAATTCACAGTCTGCX3TGCACG 
GCTCTCCATTCI I CI I CI J GGCTTTACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA 
CCTTCTGAGGATGATCGATAGATAAACACACCTCCTCTGAACCATCCTTGGGCTTCATG 
TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCC<3TCTTCCAGAGCGCTTTG1TGA 
TCCAAATAAGAACAAGGACACACATTGTGTCAGGTC^OGAAGATCATTCAGTTTCCATATGCT 
GAAGGTTTTTCCACTrATTC^ACACTCTGTGGCGTAACCTTCTTCAATAT 

CCAATCTATTTCTTCCAGCTTCTCTCT GGCCATCTTTTCCTTGATCTGAGACAGTCTGATCAG 
TTTTCGGCCGGTCATGTGTCTTCGTTCATATTCTCTGGAGGAT ACT CGAGC 
CG CAGACCAG GAGAAG GCTTCCACACAGATGGCGATTGAGTCGTTTCXrrCACAGAACTTTC 
ACTCGGGGTCCACCACATATTTGACCTCTAGTTA^ 

CTGTA GGGG TTGGGAGGGTGCACTTGTC^TCCTTGAAGATGAGCTTTTQGGATCT 
GAAGCCTTTGGTGTTCGAGCCACCCTCTTGGTTCTTGGTAGCGCAG 

CAGAAAAGCATGTTGACTCCTGAATTCTCTGAACrrCTCCTCTCCTT AAGAGGTGGCCGGGGA 
GACTTCTCAGGGGATTTTTGCAGACGGGCTGGGCTTGCT GTCTGTTTAATTGTTGTT AC 1 1 1 1 
AG ^TGCT CTTGTAAGGCCTTCATTrGCTCTTGCAAATTCCTT AATTCCTCTTGCAACTCTTCAT 
TCGTTTTCTCTCGCCTGGGGGCAGGAGCAGGGAGGACCCT ATTTTCAGTTGACTGTGATGC 
GGGAACI rCTTCl I CATCTGTTAAGTCCTCCAT ATCTCCAAAGAGAGTGGCCAG A 1 1 1 ICC 1 1 
TTCGTCTCTTGTCTCTCCT GTTT CTCCATCATCAGCC I C 1 1 C I GT ATAAGATTCACCGTCGCC 
GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCC^TTTrCCCGCXa 

tttcttct gaattacaatccaaggctgactca i t t i ctt ccagcagtgcggtcagcagagaca 
gattgtcttccttxrrcatocatgctgtcaagaggacagttgt gacaggaaacttogaaggag 
gctcagatgcccagatgagcaatgtagaatcrn"ggccgatgccgggacgcxj^ 
acagctgaacccgccaaaatcggacnn 

>226 

acgcgggatggatagccgcttgcaggagatcogggagcggcagaagttacggcga 
cagctcctcgcgcagcagttgggagctgaaagtgccgacagcattggtgccgtgttaaa 
gcaaagatgagcagagagaaattgctgaaacaagagaaacttgcagq 

TCTGCTCCAAATCCAAAACGTAAGTATCTGGATGAAGGAGAGACAGAT^ 

AGAATATAAG gatgaactagaaatgcaacaggatgaagctt atcatcaattcattgtataaaa 

ATAAAGAGATTTTCCTGAGAGAACTGATTTCAAATGCrrClGA 

taatatcactgactgatgaaaatgctctttctggaaatgaggaactaacagtcaaaat^ 

GTGATAAGGAGAAGAACCTGCTGCATGTCACAGACACCGGTGT^ 






gttggttaaaaaccttggtaccataqccaaatctgggacaagcgagttttttaaacaaaatga 
ctgaagcacaggaagatggccagtcaacttctgaattgattggccagtttggtgtcggtttc 
tattccguci i c ci i gtagcagataaggttattgtcacttcaaaagacaacaaggatactcag 

CACATCTGGGAGTCTGACTCCAATGAATTTTCTGTT^^ 

ACGGGGAACGACAATTACCCTTGTCTrAAAAGAAGAAGCATCTGATTACCTTGAATTGGATA 

CAATTAAAAATCTOGTCAAAAAATATTCACAGTTCATAAACTTTCCT ATTTATGT ATGGAGCAG 

CAAGACTGAAACTGTTGAGGAGCCCATGGAGGAAGAAGAAGCAGCCAAAGAAGAXaAAAGAA 

GAATCTGAT GATGAAG CTGCAG T AGAGGAAGAAGAAGAAGAAAAGAAACGAAAGACGAAAA 

AAGTTGAAAAAACTGTTCTGGGACTGGGAACTTATGAATGATATCAAACCAATAT 

>227 

NNCGCTAGCTTGGACGGTTTGGCTGCCAGAGCCTGCTGCCAAGAGACAGGCTTGC 
AGOCCCGCCCTTTTGCTCCAGCCCAGCCCTGGTTTAAGTCCCCGCCCTCOCATCCCCACTC 
TTCAGCTGCCCTYsGACCCACCCCAAATGACAGAAAGGGCTCACnSA^GGCCTTGM^CTTC 

GAAATTTGCT AATATGCAG cagttgatccgcaggctactcaaggacgcagatagtaaactct 

GAACCCTGCTTCAGTTGCGAGATTTTTTAAAACGGTCGAATAAATCTTCAAGAAATAGAAGCT 
CTAAATCTTCCAAACAGCCTAGATTGTAAAGGCGGCCGACACTACAACT 
GGAACCGGAOCTTCACTTGGAGTTGGCGCTCTGCCOGCTGGGAACTGTAGTCCTTGTTGGT 
TGCGGTCCCACCTAACGCCGAGGTCCTCTTCATTGTTCAAGCTGGTCTTTCGGGGCACGCC 
TGCGGAGTGGGGAAGCTGGAAAAGCCTGTGGGTGAGAACCTCGCACAACC CC I I 1 1 GATTG 
TCTGTGTGTGGOGCCTGCGCAGTAAGGGATCGTTGGAACTGGGCAGGAGGGGATAGGGTC 
CGGCT CCTGGCCCTTTCCCGAGGGGCCTGCGCACT AGGCAGTCGGTCTTTGCCGTT ACCGC 
TATGTGTGGGGCGTGTGTGGAATAACGTTATTGCCCAGCGGAGCTGAGGGCCCCGGAGCT 
CGACCGCAGCGGC^GCGACGACAACAGCGGCGACGACGACGACGACGAGGTGGGGGGAG 
GACGGCGTGCGA 

TCTCGAOGGTAAGGGGATGACGTAGCTTTGCCAAAGACTTAj 
CTTAACATCCTG G Mill G GTGAGCAGTGGAGGCACTO 
ATTTTTGTTGGAAGAGATGACTGTGAGCTCATGTTGCAGTCTCGTAGT GTGGATAAGCAACA 
CGCTGTGATCAACTATGATGCGTCT ACGGATGAGCATTT AGTGAAGGATTTGGGCAGCCTCA 
ATGGGACTTTTGTGAATGATGTAAGGATrCCGGAACAGACTT ATATCACCTTGAAACTTGAAG 
ATAAGCTGAGATTTGGATATGATACAAATCT7TTCACTGTAGTACAAGGAGAAATGAGGGTCC 
CTGAAGAAGCTCTTAAGCATGAGAAGTTTACCATTCAGCTTCAGTTGTDCCAAAAATCTTCAG 
AATCAGAATTATCCAAATCTGCAAGTGCCAAAAGCATAGATTCAAAGGTAGCAGACGCTGCT 
ACTGAAGTGCAGCACAAAACTACTGAAGCACTGAAATCCGAGGAAAAAGCCATGGATATTTC 
TGCrATGCCCCGTGGTACTCCATTATATGGGGAGCCGTCATGGTGGGGGGATGATGAGGTG 
GATGAAAAAAGAGCTTTCAAGACAAATGGCAAACCTGAAGAAAAAAACCATGAAGCTGGAAC 
ATCAGGGTGCAGCATAGATGCCAAGCAAGTTGAGGAACAATCTGCAGCTGCAAATGAAGAA 
GTACI fTTTOCTl I CTGTAGGGAACCAAGTTATTTTGAAATCCCT ACAAAAGAATTCCAGGAA 
C^TCACAAATAACAGAAAG4WTATTCATGAAATCGCA 

ATAACAGGTGCAGGGCATGCTT CATTTACCATTGAATTTGATGACAGTACCCCAGGGAAGGT 

AACTATTAGAGACCATGTGACAAAGTTTACTTCTGATCAGCGCGACAAGTCCAAGAAG 

TCCTGGAACTCAAGACTTGCTGGGGATTCAAACAGGAATGATGGCACCCGAAAA 



AAAAGCATTAAAAGTGATGTTCCAGTGTACTTGAAAAGGTTGAAAGGAAATAAACATGATGAT 
GGTACX3CAAAGTGATTCAGAGAACGCTGGGGCTCACAGGCGCTGTAGCAAACGTGCAACTC 
TTGAGGAACACTT AAGACGCCACCATTCAGAACACAAAAAGCT ACAGAAGGTCCAGGCTACT 
GAAAAGCATCAAGACCAAGCTGTrACTAGCTCTGCXaCATCACAGAGGGGGGCATGX^GTTC 
CACATGGGAAATTGTrAAAACAGAAATCAGAXXSAGOCATCGGTGTCAATACCCTTCCTACAA 

GTT AAAAAATCAAGCAACTTCTGCT ACTTCTGAAAAGGATAATGATGATGAGCAAAGTGACAA 
GGGTACTTATACCATTGAGTTAGAGAATCCCAACAGTGAGGAAGTGGAAGCAAGAAAAATGA 
TTGACAAGGTGTTTGGAGTAGATGACAATCAGGATTATAATAGGCCTGT^ 
AT AAAGATCT AATAAAAGATTGGGCTCTCAGTTCTGCTGCAGCAGT AATC 

CCACTGACTACATCTGGATTTCACCACTCAGAGGAAGGC/ 

TTGGGTTTCACAGTGGGCT AGTTTGGCT GCCAAT CA T ACAAG G CATGATI 
T AAT GGAATTTTCTGCACCTCTT CCTTT AGAGAATGAGACAGAGAT CAGTGAGTCTGGCATG 
ACAGTGAGAAGTACTGGCTCTG CAACTTCCnrTGGCTAGCCAGGGAGAGAGAAGGAGAOGAA
ctcttccccagcttccaaatgaagaaaagtctcttgagagccacagagcaaaggttgtaaca

cagaggtcagagataggagaaaaacaagacacagaacttcaggagaaagaaacacctacac 

aggtataccagaaagataaac^agatgctgacagacccttgagtaaaatgaacagggcagta 

aatggagagactctcaaaactggtggagataataaaaccctacttcacttaggcagctctgc 

tcctggaaaagagaaaagtgaaactgataaggaaacttctttggtaaagcaaacattagcaaa 

acttcaacaacaagaacaaag ggagg ag gctgag tgga cacct act aaattgtctt ccaaaa 

atgtrrcaggtcagacagataaatgtagggaggaaacttttaaacaagaatcacaacctcca 

gaaaaaaattxaggacattctacaagcaaaggagacagagtggcacaaagtgagagcaaga 

gaagaaaagctgaggaaattctgaaaagtcagactccaaagggaggagacaagaaggaatc 

ctocaagtcattagtgcgagaagggagcttcactatagaaaaaccgagcccaaacataccca 

TAGAACTTATTCCCCATATAAAT AAACAGACTTCCTCT ACTCCTTCTTCTTTAGCATTAACATC 
TGCAAGTAGAATACGAGAAAGAAGTGAGTCTTTGGATCCTGATTCTAGTATGGACACAACCC 
TTATTCTAAAAGACACAGAAGCAGTAATGGCTTrr CTAGAAGCTAAACTACG TGAAGAT AATA 
AAACTGATGAAGG ACCAGATACTCCGAGTT ATAAT AGAGACAATTCTATTTCACCAGAATCT G 
ATGTAGATACAGCTAGTACAATCAGTCTGGTTACTGGAGAAACTGAAAGAAAGTCAACCCAA 
AAGCGAAAGAGTTTCACTAGCCTCTATAAAGATAGGTGTTGCACAGG T TC fCC I ICCAAAGAT 

gttacaaaatcatcatcttcaggtgctagggaaaaaatggaaaagaaaacaaaaagtcgttcc 
acagatgtgggttcaagagcagatggtcgtaaatttgttcagtccagtgggagaataagaca 
gcoctcagtagacttaacagatgatgaccaaaoctctagtgtaoctcattctgccatctctg 
atattatgtcatctgatcaagaaacttactcttgtaaaoctcatggacggactccacttacct 

CAC^GATGAGCATGTAGATTO>AACTGGAAGGAAGTAAAGTAACGAAATCTAAGACTTCT 

coggtggtatctggttcatctagtaaatgaaccacccttocaaggccaogaoct accaggac 
ttccct ctrgcgcagagcacgacttggtgaagcttcagacagtgaacttgctgatgctgaga 
aagcatctgttgcttctgaagtatccacaacaagttct agatcaaaacctcgcacaggaagg 

CGTAACATCTCTCGGATTGATTTATTGGCTCAGCCTCGTAGAACACGACTT GGCTCACTGTC 
AGCTCGTAGTGACTCTGA^CAACAATTTCTAG^GTAGTGCXn"CTTCGAGGACXXiCAGAAG 
CCATCATTAQAAGTGGAGOCAGACTAGTAOCATCAGATAA A I I I fCl OC I AGAATTAGAGCTA 
ACAGTATCTCTCGACTCTCAGACTCCAAGGTCAAAAGTATGACCTGAGCTCATGGCTCTGCT 
TCAGTAAATTCAAGATGGAGGCGCTTTtJCT ACTGATT ATGCTTCCACCT CAGAAGATGAATTT 
GGATCAAACCGTAATTCCOCTAAACATACCC G T C 1 ACGT ACTTCT CCAGGCCTGAAAAGCAC 
TCGCTTGCAGAGCGCTGGATCAGCAATGCCTACTAGTTCTTCATTCAAACACCGGATTAAAG 
AGCAGGAAGACT ACATCCGAGATTGGACTGCTGATCGAGAAGAGATAGCCAGGATCAGCCA 
AGATCTTGCTCTCATTGCTCGGXiAGATCAACGATGTAGCAGGAGAGATAGATTG 
CATCAGGGACTGCGCCTAGT ACCAGAGTAAGCACTGCTGCCAOCAOCCCTGGCTCTGOCAT 
AGACACT AGAGA AQAGTTGGTTGATCGTGTTTTTGATGAAAGCCTC 

ctocattagticattcgaaaacaocagaaggaaacaacggtcgatctggtgatccaagacct' 

caagcaggagagcgtoccgatcacttaacaattacaaggxx5gagaacctggag 

aagt gatgggagataatctgctgctgtcatccgtgtttcagttctctaagaagataagacaat 

ctat agat aagacagctggaaagatcagaat att atttaaagacaaagatoggaattgggatg 

acatagaaagcaaattaagagccgaaagtgaagtgcctattgtgaaaacctcaagcatggag 

atttcttctatcttacaggaactgaaaagagtagaaaagcagct ggaagcaatcaatgctatg 

ATTGATCCTGATGGAACTTTGGAGGCTCTGAACAAC^^ 

ATCTOCACCGAAAGAGAAGTtX^GCCCTGTGAATAACCACCACAGCCCGGGTCAGAOlCCA 
ACACTTGGCCAACCAGAAGCTAGGGCTCTTCATCCTGCT GCTGTTTCAGOCGCAGCTGAATT 
TGAGAATGCTGAATCTGAGGCTGATTTCAGTATACATTTCAATAGATTCAACCCCGATGGGG 
AAGAGGAAGATGTTACAGTACAAGAATGAC f I fCTCI I GATTGTTGAAAAATCATTACCTGTG 
GAATGGCT AGGAAT ATTGGAAGCAGCATAGTGTTGATGTACGCAAAACAAGACAGCTTGGTC 
AQCTACAATCTTQGAATOOCTGTCI I CI I AATTTTATTT ATTT ATTTTTGAGGTATAATGTAGT A 
TATCAATtX i MUAAA CTAT TTAG ATAACCACTTGATGCACAAATAGGAAAAAGCAGATTGTG 
GCAGTGTCGCCTTTTCTGGTTTTATGATTTTCAAATT 

CATAGATCT M I H( JI I rTTTITAAGCCATGCTGTGACCTACAAGGAAACT AAATAGCCAACAT 
TTCTGAACCCCT AAGJCl CCTGTGCGAAGCTGCTCCCTGAAATGGAC MUTT CATCTGTACA 
GATTTGTT AAACCATTCTATTTGCTTCTTAAT AATAGGATTTATATT AGTACT CATT ACCATTGG • 
ACACAATGACATAAGTACTCTCX^CAGTAAAGC .. 
AAAATTrrCCAACATAGATGTGATTTATATAA C I 1 I G 1 1 UATACGT AAATTGTCTTGGGGTTT A 

cggaaattaactattatgtttgcactaagatttgctgggagtggtaggtggacatatctatat
ATCAATAAGGACTAACCGTCTTTTTTGTACATAGGAGATTGATAATACT

ccacagtgttttactcca ctttc aaaaagatcaatttggca c i rrm i c a i nun ia atgga 

AATAAGATTTGGTC TCTCA TTTT AGGTTAAATGATAACTAGAAAGATT AAACT AGACAGATAGT 
TTAGGTGGAGTATATTTTTAAAACTCAGAACATGTATATTG GTCCTGTGTTACCAAGTTTATAT 
GTGACAGTTGAAAAAG AAATTCGGTTGAAATGATCATGAGGTT AAAA 1 1 1 1 C 1 1 C ATTAGGGG 
ACTTGGAGAACCAGT AGTCGT AAGATT AGTTGATAGTTTCACT CCCAAGCAATGAATTGCTTC 
TGTGTGTTTCCCTGTAGGACTCAATAGTAAATGCTGTCTGTCTT ACACATTTATAAGGACCCT 
GCAAGACGACGACAAAGGCGTTTGGCCTGTGCTACT AAACAAGAAGCCTATGAAAAATTTCT 
TCH TTAAAC T TGl 1 1 1 [ I CTCTTTCCAGTAAGTTCACATTTGGATAATTTTAAAAAGAAAAGTAA 
TTACCTTTGTGTTTCCAGAACACT ATAATTGGGGTGTATCTTAATTCAGTTAAAT ATT attagt 

agacctggattttcccccttgaccccatcagtctataaaggttaaact 
tggtctttaatatttccacaataatcctgtgctatatttg 

acact tcaa gactttagaggattttttaaatcctgtattgttggatcaattaataaagatgcaa 

AAAAACTTTATAGAGATGTAAAAACAAAACTATAATGGATCTCCTA I ITTTCT1 I A AATACAAAA 

aaaaaaaggtaaatgaactattctccttgtaaag ctaaattccccattctgtctaataaaggaa 
gactgaaaaaaggttttaaagaacataaatggaaagatacaaatgctttgaaggaataaacga 
aatgtt aaaacagggtcaatccatttgaagaaaaagttggacaaaataatcaqcattgctccc 
tcttttatttaatatttgggtactgattatatccagatggaagt agaaggtaaggagtttagg 
aaaatactagaatctactctgcttacattcttgtttaagtgtttacacagtttggttca^ 
aaacatttggcctttractatgttatttttattttgtatgaatacattatoct atttatttt 
tgttac attta ttactt atgttattttgtt atgcatttacaactccatttttaaataaagtgttt 
gtggaactttaaaaaaaaaaaaaaacggcacgagg gggg gcccggtacccaattcccgcta 
t^^gagtcgt attacgcgcgctcactggccgccgttttacacacgtcgtgact gggaaaa 

CCCT GGCGTTCCCAACTTAATCGCCTTGCANNN 

>228

>229 

^ NWWNMWN^ 

GGTCAGGCTGGTCTTGAACTCCCGACCTCAGGT ATCTGCCCGCCCGGGCCTCCCAAAGTGC 

tgggatgacaggtgtgagccaccacccccagcctgttgtaaaaacttcttaaactt^ 

aaaataacccatgaaaatggcttaataaaataacattccaataaaactcttctataatactt^ 

acaagtaaaaactgaccaatgtaacaggttoccaaa a aaaaaaaaaaaaaaaaaaaaaaaaat 

aaaaaaaaaaaaaaaaaaaaaaaaaaagtacctgccogggcggccgctcgaccgcccgggc 

aggtaoggcctgtccgaatgatgcgttgcagttgatttgtcttcac^ 

cccgtttcatacccocagaggtttagacattcrci ctxx^actocatggccctgoctgtgacc 

cagaggaagagaaagccatcatcctgtagtaoctcggccttgagcctgcgcatctcatcat 

ctgtgagggtcccatagggcagttcgatgtgaatatcncagggtgggtcagccatcacaac 

tgcaaactt gcccaag>tactgacgtccaggtagc^atatcacaacagatccactggataa 

AGTGG T ATrrGG TCATCTTOGCTACTTTAACTCTGAGATTCATQGOCCGA y 1 C C C rCTTTTC C 
ATTTGAi 1 1 1 1 OCTATTOCCT TAAA AGCTTATCT ATCCCCCTCCATTCCCAATAAGCAAGACAA 
GATTGCTGAATT ATGCTCTGCrTTTCTT AACGTGT ATGGCT GCCCT ATGAAACATAAGTCGGTA 

agttgagacgagtttt ctgagcagacaggtacctgax^tgggaagagtcx^tctggactgg 
aatcacctcogacactctgtgt AAGAGCAAGCT cctggcttggcgtgtggtctt 
ggggcctcagaatccatgcaagcatcaatttcatagtgaacatactcggacgc 

>230 
>231 

nagtcgacccacgcgtccgtggcattatttctctctctacaaggagc^ 

TACGGGGAGCTCGCAAATA CIUCI 1 1 rGGTTTATTCTTAOCACCI IG CI fCTGTGTTO CTTGG 

™I9S G E^ GTG 5^ TATGCATt;,uu,c,c, ' ' 1 ' ggagctacagtggacaggcatttgtga 

CAGCACTATGGGACTGAGTAACATTCTCTTT GTGATGGCCTTCCTGCTCTCTGGTGCTGCTC 
CTCTGAAGATTCAAGCTTATTTt>ATGAGACTGCAGACCTGCC^ 

AAAAGCAAAGCCT GAGT GAGCT AGT AGT ATTTTGGCAGGACCAGGAAAACTTGGTTCTGAAT 
GAGGT AT ACTTAGGCAAAGAGAAATTTGACAGTGTTCATTGCAA^ 
TTTTGATTCGGACAGTTGGACCCTGAGA^ 

AT E AA 1 6T ^CATCCATCACAAAAAGCCCACAGGAATGATTCCCATXX^COSGAT 

AACTGT CAGTGCTT GCT AACTT CAGTCAACCTGAAAT AGTACCAATTTCTAATATAAGAGAAA ' 

ATGTGT ACATAAATTTGAGCTGCTCATCT ATACAOGGTTACCCAGAACCT AAGAAGATGAGTG 
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TTTTGCTAAGAACGAAGAATTCAACT ATCGAGTATGATG GTATTATGCAGAAATCTCAAGAT a , 

atgtcacagaactgtacgacgtttccatgagcttgtgtgtttcattccctgatgttag 
atatgaccatcttctgtattctggaaactgacaagacgcggcttttatcttcacctttct 
tagagcttgaggaccctcagc ctcc cccagaccacattccttggattacagctgtacttcca 
acagttattatatgtgtgatggi 1 1 fctgt ctaattctatggaaatggaagaagaagaagcgg 



STGTTTTTAAAAGTT 

CGAAGACATCTTCATGCGAGAAAAGTGAT ACATGTTTTTAA tt AAAGAG T AAAGCCCAT ACAA 
GT ATTCATTTTTTCT ACCCTTTCCTTTGT AAGTTCCTGG GCAACCTTTTTGATTTCTT CCAGAA 
GGCAAAAAGACATTACCATGAGTAATAAGGGGGCTCCAGGACTCCCTCTAAGTGGAATAGC 
CTCCCTGT AACT CCAGCTCTGCTCCGT ATG CCAAGAGGAGACTTT AATTCTCTT ACTGCTTCT 
TTTCACTTCAGAGCACACTT ATGGGCCAAGCCCAGCTT AATGGCTCATGACCTGGAAATAAA 
ATTTAGGACCAAT ACCTCCTCCAGATCAGA I IU IC I C I 1 AATTTCATAGA I reTBTTTTTTTTI 
AAATAGACCTCTCAATTrcrGGAAAACTGCCTTTTATCTGCCCAGAATTCT AAGCTGGTGCCC 
CACTGAATCTTGTGTAGCTGTGACTAAACAACTACCTCCTCAGTCTGGGTGGGACTT ATGTAT 
TTATGACCTTATAGTGTTAATATCTTGAAACATAGAGATCrrATGTACTGTAATAGTGTGATTAC 
TATGCTCTAGAGAAAAGTCTAGCCCTGCTAAGGAGTTCTCATCCtrrCTGTCAGGGTCAGTAA 



AGGAGAGGCAGAGATAGAACCTGGAGCCACTTCTATCTGGGCTGTTGCT AATATTGAGGAG 
GCTT GCCCCACOCAACACGCCATAG TGGA GAGAACTGAAT AAA CAGGAAAATGCCAGAGCT 
TGTGAACCCTGCTTGTCTTGAAGAACTGACTAGTGAGATGGCCT GGGGAAGCTGTGAAAGA 
ACCAAAAGAGATCACAATACTCAAACGAGAGAGAGAGaGACAAAAGAGAGATCTTGATCCAC 
AGAAATAGATGAAATGTCTGGTCTGTTGACCCCATGAACAAGT CTTGAAACAAGCAACAGAT 
GGATAGT CTGTGCAAATGGACATAAAACAGACAGCAGCTTCCCTGGTGGTCAGGGAGGGGC 

tttgcgagataccgaagctattgggan 

>232 

cacagcaa accagaga gaataatacaaatgaatgaaaactagaaccaaaacaaacag 
gaaccatctctaca iii mn actcaatcgaocx^gaatgctacagcattatccagggccgc 

AGAAAACAGACAT AAT ACTTT ATTCCTGAT AAACCATTCGATGT CCTT AAG TOCGCCCATATC 

AGOGT acattctgtgccatcaagaaatttactcagctgggcacagtggcgggatctcagctc 

ACTGCAACCTCCGCCTCAGAGGTTCAAGTGATTCTCCTGCCTCAGCCTCTGGAATAGCTGG 
GACT ACAGG CACCT GCCACCACACCXAGCT AACTTTTTGT ATTTTT AG AGAAGATGGG GTTT 
CATCATGTTGGCCCAGCTGGTCTCAAAGTCCTGATCTCAGTGATCCATCTGCCTCA^ 
CCAAGTTATAAGAI 1 1 III PCCT CTGG1 1 1 1 I A GTAAAT GI I 1 1 1 II I GAGATTGCTT AGCACC 
AGAATGATTTGCAAATTTGAAAATAGGAACTCCACTAGGAATGCCGGATAGAAGAGTGCTTC 
ACATTTGTAGAGGGAGACAAGAACTAAATATCACGACG I C I I I CI G AGOCTTTTGGTTTGGTA 
ACGTGCCCCAAATTCTT ATTCCAAACGGT ATAAGAT AATT ATGT GTAAATGAATAOCAGCTGT 
ACTT AGTTTT ATTTCAT ATTTGTOGATCTGAATATATTAAAATA I I J 1 1 II I I I H 1 1 1 I G ATGOG 
GAGTCTTGCT CT GTT GTCCAGCCTGGAGTGCAGTGGCATGATCT CGGCT CACTGCAACCTC 
TGOCTCOCAGGTrCAACCGATTCTCC I GCCTCAGOCTOCTAAGT AGCTGGT ATTACAGGAGT 
GTGCCATTAGCCTGGCT 
>233 

gGO GGGC GOCCGGGCA GGTACG OGGGGGCCA GT I C I U I I O GGGGACT AACTGCAA 
CGGAGAGACTCAAGATGATTCCC 1 1 1 1 1 A CCCAIU IT T T CI CTACTATTGCTGC 1 1 ATTGTTAA 
C<XT ATAAACGOCAACAATGATTATGACAAGATCTTGGCTCATAGT CGTATCAGGGGTGGGG 
ACCAAGGCCCAAATGTCT GTGCXXnTCAACAGATTTTGGGGACCAAAAAGAAATACTTCAGC 
ACTT GT AAGAACTGGT AT AAAAAGTCCATCTGTGGACAGAAAACGAC TG T G 1 1 AT ATGAATGT 
T TjCCC TGGTTATATGAGAATGGAAGGAATGAAAGGCTGCCCAGCAGTTTTGCGCATTGA 
TGTTT ATGGCACTCTGGGCATCGTGGGAGCCACCACAAOGCA<iOGCT ATTCT GACGCCT CA 
AAACT GAGGGAGGAGATCGAGGGAAAGGGATOCTTCACTT ACTTTGCAOCGAGT AATGAGG 
CTTGGGACA^TTGGATTCTGATATCOGT AGAGGTTTGGAGAGCAACGT GAATGTTGAATTA 
CT GAATGCTTT ACATAGTCACATGATT AATAAGAGAATGTTGACCAAGGACTT AAAAAATG^C " 
ATGATTAI IC CI I CAA TGT AT AACAATTTGGGGCTTTTCATTAACCATTATCCT AATGGGGTTG 
TCACTGTrAAITGIGCIlXSAATCATOCATGOGAAOCAGATTGCAACAAAIGGIGI IU IUG AT . 
GTC ATTG AC CGTGTG CTT ACACAAATTGGTACCTCAATTCAAGACTT CATTGAAGCAGAAGAT 
GACCTTTCATCI III AGAGCAGCTGCCATCACATCGGACATATTGGAGGCCCTTGGAAGAGA 
CGGTCACTTC^CACTCTTTGCTCCCACCAA

TAGAAAGGATCATGGGAGACAAAGTGGCTTCCGAAGCT cttatgaagtaccacatcttaaat 

ACTCT ccagtgttctgagtctattatgggaggaggagtctttgagacgctggaaggaaatac 

aattgagataggatgtgacggtgacagtataacagtaaatggaatcaaaatggtgaacaaaa 

aggatattgtgacaaataatggtgtgatccatttgattgatcaggtcctaattcctgattctg 

ccaaac^gttattgagctggctggaaaacagcaaaccaccttcac«gatcttgtggcccaa 

ttaggcttg gcatctgctctgag gccag atggagaat acactttgctggcacct gtgaat aa 

tgcattttctgatgatactcn"cagcatggatcagcgcctccttaaattaattctgcagaatca 

catattgaaagtaaaagttggccttaatgagctttacaacgggcaaatactggaaaccatcg 

gaggcaaacagctcagagtcttcgtatatcgtacagctgtctgcattgaaaattcatgcatg 

GAGAAAGGGAGTAAGCAAGGGAGAAACGGTGCGATTCACATATTCCGCGAGATCATCAAGC 
CAGCAGAGAAATCCCTCCATGAAAAGTTAAAACAAGATAAGCGCTTTAGCACCTTCCTCAGC 
CTACTTGAAGCTGCAGACTTGAAAGAGCTCCTGACACAACCTGGAGACTGGACATTATTTGT 
GCCAACCAATGATGCTTTTAAGGGAATGACTAGTGAAGAAAAAGAAATTCTGATACGGGACA 

aaaatgctcttcaaaacatcattctttatcacctgacaccaggagttttcattggaa^ 
ttgaacctggtgttact aacattttaaagaccacacaaggaagcaaaatctttctgaaagaag 
taaatgatacactt ct ggtgaatgaattgaaatcaaaag aatctgacatcatgacaacaaatg 
gtgtaattcatgttgtagataaactccrrctatccagcagacacacctgttggaaatgatcaac 
tgctggaaatacttaataaa7taatcaaatacatccaaattaagtttgttcgtggtagcacctt 

C^AAGAAATCCCCGTGACTGTCTATACAACTAAAATTATAACCAAAGTTGTGGAACCAAAAAT 
T AAAGTGATTGAAGGCAGTCTTCAGCCT ATT ATCAAAACTGAAGGACCCACACT AACAAAAGT 
CAAAATTGAAGGTGAACCTGAATTCAGACTGATTAAAGAAGGTGAAACAATAACTGAAGTGAT 
CCATGGAGAGCCAATTATTAAAAAATACACCAAAATCATTGATGGAGTGCCTGTGGAAATAAC 
TGAAAAAGAGACACGAGAAGAACGAATGATTACAGGTCCTGAAATAAAATACACTAGGATTT 

GTCACCAAA7TCATTGAAG 6TG GTGAT G GTCATTT ATTTGAAGATGAAGAAATT AAAAGACT G 

CTTCAGGGAGACACACCCGTGAGGAAGTTGCAAGCCAACAAAAAAGTTCAAGGATCTAGAA 

GACGATTAAGGGAAGGTCGTTCTCAGTGAAAATCCAAAAACCAGAAAAAAATGTTTATACAAC 

CCTAAGTCAATAACCTGACCTTAGAAAATTGTGAGAGCCAAGTTGACTTCAGGAACTGAAAC 

ATCAGCACAAAGAAGCAATCATCAAATAATTCTGAACACAAATTTAATA ) I 1 1 1 1 i I I CTGAAT 

G AGAAACATGAGGG AAATTGTGGAGTT AGCCT CCTGTGGTAAAGGAATTGAAGAAAAT AT AA 

CACCTTACACGCTTTTTCATCTTGACATTAAAAGTTCTGGCTAACTTTGGAATCCAT^ 

AAATCCTTGTGACCAGATTCATTACAATTCAAATCGAAGAGTTGTGAAGTGTTATCCGATTGA 

AAAGACCGAGCCTTGTATGTATGTTATGGATAjCATAAAATGCAOGCAAGCCATTATCTCTCGA

TGGGAAGCTAAGTTATAAAAATAGGTGCTTGGTGTACAAAACTTTTT ATATCAAAAGGCTTTG 

CACATTTCTATATGAGTGGGTTTACTGGTAAATTATGTTATTTTTT 

CAGAATGTTTGTCATATGCTTCTTGCAATGCATATTTTTTAATCTCAAACGTTTC 

AI 1 1 1 IC^GATATAAAGAGAATTACTTCAAATTGAGTAATTCAGAAAAACTCAAGATTTAAGTT 

AAAAAGTGGTTTGGACTTGGGAACAGGACTTTATACCTCTTTTACTGTAACAAGTAGAA^ 

GTQTAGCAGGTTAAAAACACCTCAGGGATTTCAGAGGAAAATTGACCTATATATATTTTTAATT 

CATCATATATCTGTCCTCnrTTCrTCATTTTTCTCAATTTCCAGAC 

CAATAAAGT 

>234 

. NNNNWN^CCGCCCGGGCAGGTACAGTATAGGTTGG 1 1 1 re CC TSl 1 1 1GACGCTT 
TATATATACGTAGACACACATACACATGTATATATACACACACACATTTTACATATATATATGA 
AACTGTATAATGTGTTCGCTTCAGTGTCTGGCTGCTTTTACTG^ 

GTTATCGTATATGGTATGAAATTTGTTTGCCTAG 1 1 1 I IGCC I I C I UATTGCTTCTGAATTGGG 

GCAGCTTTGCCCCTCAAGGGAAATTTAGCAATGTCTGGAGACATTTTTTATTT^ 

GAGGGACATGGGGGAGTTGTGCTAGAGAACTTAGTAGGTAGAGGACAGGGTTAGTGCTGAA 

CGTCCCACAGT 

>235 

>236 

TATAGGGAGTCGACCXw^CGCGTCCGCTTAAAGAGGAAAAGAGGGAAGAAGATGAG 
GAGAATGACAATGATAATGAAAGTGACCATGATGAAGCTGACTCCTAAACCAAAAGTGCTTT 
AAAAAOC AGCCT GGCGAGGAGAGCCCTGGACGGACTCCACTGTCTCT AAGTAAACACAGCA 
~ ~ ~ ' ' TAGCGTCTTCACTTCTTCACAGAGTTCCAGTGCGTGGTATTCnrrTCGAGGT 
ATTCTTTCCAGGCCGAGATTGAGCACCTCATGTACCT ACGCCAC AGACAGCC AGAGGGAAA
GCGACCCAG ACAGCAGCCCCTCCTGGACAGGCCCACCCTGCAG ctcaggcaccaag aaaa 
cagccgatactggcagccattgcagctccaaact GCAGAGGCAAGGCCAATTTTAACTTTTC 
aatttacagtcx^ttttgaagagcttctacatatcggttatgtaaattcatatatgta i n t i g 
gaatcagttcttataaacagctcgattcagttttagctaaatttatagtttaggtagtatgtta 
catttgaattmgtcttaagaaaagttgactgttcagatattttrct 

CTTTTCTATTAAAGATCTGTACATATTTTTACAGTAAAATGCTTTATGGAACTAGTTT^ 
CCTC TATGG CTTTAAG GCCTTGCTTAGTGCTGCAAATTTTGAG AAATTTAAAAATAAGCATTCT 
AACACI 1 1 1 ATTCCCAC AGAAAAATTCCAAG TCAATT ATCAAA TCAAACAC AAAAATAAGTCTT 
ACCT CTT GTAT AAGCATTGTGT ACT AAAAAAAAATTTTQ AAACATTTTGT AT ATTGGAGATCTC 
TCT CATCTT AACGTCTTTGCTT AAAATCCTGGGCCTCTTTTAACGG GATTAGG AGAAAACT AT 
TTAG^CCA^TTTT^ 

>237 

ACGCGGGGGGGACGTAAGGTGGGGCGGTGAAAGAAGTTTGCTGACGAAGATGGC 

gactgaggcacagagtgaaggggaggtgccagcccgcgaatccggccggagtgatgccat 
ctgcagttttgtgatctgcaatgattctt cccttcgaggtcagcccattatcttt aatcctga 
cttttttgtggagaaactccgacatgagaaacct gagattttcactgagttggtggtcagca 
atatcacaaggctcatcgatttacctggaactgagttggctcagctgatgggggaagtgga 

CCTT AAGTTGCCTGGCGGGGCTGGCCCAGCATCAGGATTCTTCCGGTCT ctcatgtctctc 

AAGCGAAAGGAAAAAG GAG TGAT ATTTGGG TCCCCACTGACGGAGGAAGG CATTGCCCAGA 

T ATACGAACTGATTGAGT ATCTACACAAAAACTTGCGAGT AGAGG G TTTGTTT AGAG T ACCG 

GGTAATAGTGTCCGACAGCAGATTTTAAGGGATGCTCTCAATAATGGAACTGACATTGACTT 

GGAATCAGGGGAATTTCACTGAAATGATGTTGCCACTTT GCT GAAGATGTTTCT AGGAGAGT 

TGCCGGAGCCTCTGCTGACACAT AAACACTTCAATG CACACCTCAAAATCGCTG ATTTGATG 

CAG7TTGATGATAAAGGAAACAAGACCAATATACCAGACAAGGACCGGCAAATTGAGGCTCT 

CCAGTTGCTCTTCCTCATTCTCCCTCCTCCTAATCGT AATTTGCTGAAGTTATTGC^ GATCT 

CCT AT ACCAG ACAG CAAAGAAACAAG ACAAGAACAAGATGT CAG CCT AT AACCTT GCCCTT A 

TGTTTGCACCCCACGTCCTGTGGCCAAAAAATGTCACTGCA>ATGACCTTCA^GAGiAATATC 

ACAAAGTTAAACAGTGGGATGGCTTTTATGATTAAACACTCCCAGAAACTTTTTAAGGCTCCT 

GCTT ATA1TCGGGAGTGTGCGAGATTGCACTATTTGGGATCCAGAACTCAGGCATCAAAGGA 

TGACCTTGACCTCATAGCTTCATGTCATACTAAGTCCTTTCAGCTGGCAAAGTCTCAG^ 

GAACCGGGTAGATTCCTGCCCTCACCAGGAGGAGACCCAGCAGCATACGGAAGAGGCACT 

GAGAGAGCTGTTTCAACACGTTCATGATATGCCAGAGTCAGCAAAGAAGAAACAACTTATTA 

GAGAGTTTAATAAGCAATCATTGACCCAGACAOCAGGGCGAGAACCTTCTACTTCCCAGGTA 

CAAAAGAGGGCTCGTrCGCGCTGCTTCAGTGGGCTTATTAAGCGGAAGGTCCTGGGAAATC 

AGATGATGTCAGAAAAGAAAAAGAAGAACCCTACTCCAGAATCTGTGGCCATTGGTGAATTG 

AAGGGAACCAGCAAAGAAAATAGGAACTT ATTATTTTCTGGCT CTCCAGCTGTCACGATGAC 

ACCAACAAGATTGAAGTGGTCTGAAGGGAAGAAAGAGGGGAAAAAAGGA I 1 1 C I C I G AAGG 

ATCCAGAGTTGTCTCCTATGGTCCATGCAGAATTTTCTGTTTAGTGGGCAGGTGTTATTCCTG 

CCGACAGCAAAGCTTGGACTTGCAGCTTGCTTGCTG 

T ACCGTGACCCGACTGAT ACCTCTAACCCCACTCACT GGATGATGTTTGCAAGCT GTGCCTT 
CTGAGAGAGTGCTTAGGCCCTGTCTCTC I II 1 1 1 AATATTATGGGGAAACCACT AACT ATCCA 
ACCAQCTTAT ACAGCACACTAAQGT GGGCTTCAGTGCTCACTCAATGTQTTTAQGCAGATTC 
CAGTTTTGAAAAAAAATATGAAATGTGTGCTCAACT GCCAGTAATTTTTTAAAAAGCACTGTCC 
CAGTGGATTGATGTTGTTTTTAATGGATATTTTGGGTTTTTCTCTGTTT^ 

TTTGGT TG M M! G TTTGTTT A I HC1 T T GTTTTAAAAGCCATGTTTTTGGTT GGGCTCT AAGCT 
AGATATCTTTCCCTC1 1 1 1 1 CACTTTGAGCTTTGGGAAAACTCTrr ATCTTATGAGGCTGTATT 
CCTCAATACCTAATTTGTGTCGftAAGAATTTATAGCTCTT^ 
GGTG TGACATCAGAGTATTrGA CCTGCAGTATTG 

TXTrCTCTCTTTCCCTTrCCTCAGTGATGTGAAAATAATTGTGTTTTC 
ATTCAATTTCCTCTTGACTAAAACATCTCTGGTGCCAACGTAATACT^ 

G TGACTCAAGGAAAGTCACTGACAGCAT AAG AGAAGTTTG CTAAAAT ATTTGT ATG T GG G GG , 
AAGCTCTGGAGTGTGCCT AGGAGGGGGCTGGCTGCCTTT ATGTCCCAGGATGACTCTTT AT 
GGGTGGGATTACA1TGCACCCTCTGAGGGTGGAGGCTAGACCGTCTCCTGAGAGGAAGTTA 



WO 01/42467 PCT/US00/33312

Table 4 



GGATCAGAAAG AAGAAG CAAGCAG CAG CCTCTGCAGG GCTG ACAGGATTT AAAG GAGAGAA 
TGTTCTTATlTGGAAGCAGCTGTGGCTTGTCACCAATGTTCAAGGAGTGrrTACTGTTCCGCC 
CTCTCTTTGTCAGAAGG GACAC AGGTGGT AATTTGGAGATGGGGCCAGAGCTTCTGG C 1 1 1 1 
GGATTTGGTGTGTTCACTTGTGfTTGGATAGAGCAGTGGCATGGCTTTGACCTAGTATGAACT 
GGTGTCTGCCCAGAGAGCAGCATGTAGCAGGGGGGAATGCTCAGGTTTGTGCCTGGCTCT 
GTGGAGCTGTACAAC^CTTCTCACCCTGTGGGTTGGAGCCGAGTCAGGCCACTATGGGGAA 
GCAGTTGCCCCACAAAATGTGGTTTGCTGACCTATTrCTAAACTGTTGAATATGCTGCACCAT 
TGCTGAAATGAAAGATGACTCTGGGGGAGCAGAGCTTGGCCTTGTGCCCAGCTGGCAGCCC 

cctctgccagcctttctgctgctntgctgctgtaacagcaatagtggagaaaaatgtaaaat 

ttggtcttccagcttaatgcagtgtgaacaatagatggttaggaaaacaaaactgcttagaa 

gcccctttctctagagcagttt tatgtcatt tgtaaaaacacatattagcaaattcgttcg 

TAGGTTTCTATTCAATATTTGACI I II I I II I CI iattaagaaaatgaaatcccttacaccagat 
atcagttaattcaaacag aaaaccctttgggt atcannnnn 

>238 

nnctcagaggattacggacccacgcgtccgcaacatcctggcttagtattgtgtgc 
aaaatc>.gagaggggigcaagatcx:tgatttttcaggagttcaagcgac^tggcagccca 
atacggcagtatg ag cttcaaccccag cacaccaggggccagttatgggcctggaaggcaa 
gagcxx>gaaattcccaattgagaattgtgttagtgggtaaaaccggagcaggaaaaagtg 

CAACAG GAAACAG CAT CCTTGG CCGG AAAGTGTTTCATTCTG G CACTGCAGCAAAATCGATT 
AO^GAAGTGTGAGAAACXiCAGCAGCrCATGGAAGGAAACAGAACTTGTCGTAGTTGACA 
CACCAGGCATTTTCGACACAGAGGTGCCCAATGCTGAAACGTCCAAGGAGATTATTCGCTG 
CATTCTTCTGACCTCCCCAG GGCCTGATGCT CTGCTTCTGGTGGTTCCACT GGGCCGTTACA 
CTGAGGAAGAGCACAAAGCCACAGAGAAGATCCT GAAAATGTTTGGAGAGAGGGCTAGAAG 

tttcatgattctcatattgacccggaaagatgacttaggtgacaccaatttgcatgagtact^ 
aagggaagctccagaagacattcaagacttgatggacattttoggtgaccgctactgtgcgt 
taaaca^caaggcaacaggcgctgagcaggaggcccagagggcacagttgctgggcctga 
tccagcgcgtggtgagggagaacaaggaaggctgctacactaataggatgtaccaaagggc 
ggaggaggagatccagaagcaaacacaagcaatgcaagaactccacagagtggagctgga 
gagagagaaagcgcggataagagaggagtatgaagagaaaatcagaaagctggaagataaa 
gtggagcaggaaaagagaaagaagcaaatggagaagaaactagcagaacaggaggctcact 
atgctgtaaggcagcaaagggcaagaacggaagtggagagtaaggatgggatacttgaatt 

AATCATGACAGGGTTACAGATTGCTTCCTrTATTTTGTT ACGTCT GTTGGCGGAAGATT AAAC 
TT AATGAAAATCT GTTTGTATTTTCTGGATATTGTCTGGCAACCrrTGCGCCATACTTACTT ATT 
TAGCATAGTCGAGTGCTCTAGTTTCTGTCTCTXIAGGCACTCGTAACT aaggaccaccattgg 
CCATTGGTAGATGTTTGATTGACTT AACAAGAGAGGGACAAATTTTCAATTTGTGAAACTCCA 
AAGCAGAAAGTATTGGTGCTTGCTACCTT GTGAATTCTTCXrrTAGACATGCAGAGAAAATGTA 
TGCAAGAGACCAAAAAGATGGCTCCAAGCTATGTCATGTTACCTGTAATAAAAT C I I I I CI IC 
TAGATTCTTTCTATGTTGGCAGATAATCTCCCCTTGTAGCTTCCA 

agagtcacaatgatcatcttacccatgtggtttttgagaaagaaagatcaattctttg 

AGTAGGTAATCTTAGAGATGGAGATGATTGTAGAATTATTCCTAGATGAGTGTCAATTTATTT 
AATTCCATTGTCATATAAGGAGTCAAATTU 1 1 1 CI I ATCATTTGTTCATTGAAGAACAGAGACC 
TGTCTGGAAAATCGATCTCTACAAATTCAATTAAATAATGATCCCCAAATGCTGAAAAAGTGA 
AAT ACAGCAATTCAACAGAT AATAGAGCAATGTTTAGT AT ATTCAGCT GTATCTGT AGAAACT 
CTITTGACGAACCTCAATTTAACCAATTTGATGAATACCCAGTTCTC 1 1 CI II FCTAGAGAAAGA 
T AGTTGCAACCTCACCTCCCT CACTCAACACTTTGAATACTT ATTGTTTGGCAGGTCATCCAC 

ACACTTCTGCCCCCACTG cattgaattttttgctt atgttgtttat aat aaaactttt caatt at 

CTCAAAAAAAAAAAAAAAAANNNNNNNNNNNN 

>239 

NNNNNNNNACTTAATTTTTAATCTACCT^ • 
CCGAAAACCCGI Mill I AAACCATTATATTCAAAAACTCAAAGGTGATCGTATTAACAGATCA 
ATCTCAAAAATCTTTCCCACCAACAGGTACCAGTTAAGTGAAGAGCT CGTCT AGGTCT GCTTT 
TGTAACACCCAAATACAATTAGCACTTCTCTGCTGGTA^ 

GAGGCCAGGAGGCAAAGCCT AGCACGT AACAAAGT ATGTGCTTTG TAACTGCTGATT AATTC 
AGTTTCTT AACTAGGCAGAGCAGGTCATCAGTGTATCT AATTCACACTATTAAT ACACTGT CT 

TGCTGAAGAGTCTGACCT gcccagaaccccgttatggctagcccagggaagcagt aaactg 
caaagcagagaaaaggggcagctaagatgaggctagtgctggctgagtcgcagttaggtct 
gttactgttctgttccaactataaatccaggatgactgttactcagattcagtgctatgtaga

AAATAGAATGCACAGCCAAAAACATAATTTGGGGATGACTGGCAGCAC C I I I II I IC CCTTTC 
TTAAGAGGCTAACTGAAAGTTGATTAGGATTCTTGAGAGAACGTAACTAAAAGTTGAATCAAA 

tctatattcactgctatagatttaataaaaggaaagaccactgtaaagatgcaactacaacct 

gatgcgtctgctgaggagaggactgggcagctt gttgaaggctcgtcaaaagtagcgctta 

gttatcacaaggcctgca^tcaacgtcaaaagcgtaaggtttaggccaggcacggtagctc 

atgcctgtaatcccagcactttgtgaggcagaggcgggtggatcacctgaggtcaggagtt 

ccgagac^^cgtgaccaacatggcaaaaccctgtctctacrraaaaatacaaaaattagctg 

ggagtggtggcgcacacctgtagtcccagct attggggaagctgaggc aggagaatcactt 

gaacctgggaggcagaggttgcagtgagccgagatcacatcactgcactccagcctgggc 

aacagagcaagactccgtctcaaaaaaaaaaagcataaggtctgcct acctcatgggcaccc 

aaacagttcagtgccaaactagtacgagtaaa6cgatgcctg(xn'ctttcaccaacagtcca 

gagtatatttcaaaaatatctttattaaccaaggaaggaattactgcaaacatcaggacaaag 

g agattctaatct aggacttgagttaaatag ag gtttacagctaatcacaag aagagaggga 

AAGC^ATTACGTGTGCTGAAACCAAGTTTCATCAATGAGAAATATTTTAAAATGACTTTTAAAA 

TACTTAACATTGGCCAGGCACGGTGGCTCCCGCCT GTAATCCCAGCACTTT GGGAGGCCCA 

GGCGGGCAGATCACrn"GAGGTCAGGAGTTCAA6ACCAGCCTGGCCAACATGGTGAAATCCT 

GTCTCTACTAAAAGTACAAAAATCGGTGGGGCGTGGTGGCACACACCTGTAATCCCAGCTAC 

TTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGGCAA 

GA1TGCGCCATTGCACTCCAGCCTGAGGGACAGTGAGACCGG ACGCGTG GGTGACNNN 

>240 

NGGCGGCCGAGGTA C I I J I G CGGCCGAGGTA C I I I II I I I M 1 1 1 I I I I I A TGATTTT 
GATGGCTAGTTGTCTTTTTATTAGCT ATCAAGTTCATTTAACAGACAAAAAATTCAGTTCAATG 
TCATTCATTAAAAT AG GAAGAA7TAACAAT AGTTCATT AATCAATCTTTCAGCTG TTCCT ATTTT 
ATCACAATAACTTTrCCTATAATTGAGAGATTCATGAGGAAGTCTTGAAAAGAACGTATGTTT 
CTTTGAATTCCATAAAACATTCAGTCAAAATAATAAAAGAGGCGCTATTACTTTGTTTTGGGT 
AATGATATGCAGGCTAGGCTTTGCTGTAGTACCTCGGOCGCCACCGCGGTGNN 
>241 

GTCATGGAGGAATCATAAACACATTACAGAAATATTATTGCAGAGTCAGAGGCGGCCGGTGT 

GCTGTGCTCAGCTGCCTTCCAAAGGAGGAACAGATCGGGAAGTGCTCGACGCGTGGCCG^ 

AAATGCTGCCGAAGAAAGAAATAAAAACCCTGAAACATGACGAGAGTGTTGTAAAGTGTGGA 

AATGCCTTCTT AAAGTTTATAAAAGTAAAATGAAATTACA 1 1 1 1 I HI CCAAAAAAAAAAAAAAA 

AAGTAAAAGT 

>242 

nnatgtggttacgacccactgt attgaggtgacgcgatccataggctgtggtgttt 
gttttcgcrrgatccacacaaacgttggggcactgtct attcatgtgttcaagctgaaggctc 
gttctcggttgtcattttacagt g i ii m ctacggggttacattacaggaatgttgtagcgac 
gtttagcccgtggagtatcaacgtctt gagactccgt gtgagact ccctggtttgttccaca 
agagtgtgtttttagattocgtacctttgattaaggaacacatcatgccgtgaagcc 
tattctgcaattccgtagtgc^tgtaatgtattctgccgtctcgtagtgtgaagc^ 
gcacatccagttctttgatgtctggctg ccttct gcgggccaact gtcttgtggaattcgtt 
gctcccagagatagcttgaactgcagatcccgcacagcattgcactgagctgtto 
tgagotggac*tggcggccgaggtactcacagtca 

GCTCTCCATTC N C I I CI I G GCTTTACAGGTTCCCAGGTCAAGAGCTrCACCCATAATTAAGA 
CCTTCT GAGGATGATCG^TAGATAAACAGACCTCCTCTGAACCATCCTTGGGCTTCATGGGG 
TTGGCATTGAGGATGCCT ACGACAGTCCCCTGCTCCGTCTT CCAGAGCGC 1 1 I GIU AACTTC 
TCCAAATAAGAACAAGGACACACATTGTGTCAGGTCACGAAGATCATTCAGTTTCCATATGCT 
GAAGGTTTTTCCACTATTCACACTCTGTGGCGT AACCTTCTTCAAT ATAACCCCAAATGTCAC 
CCAATCTAI I I C I I CCAGCTTCTCTCTGGCCATC 1 I T I CCTT GATCTGAGACAGTCTGATCAG 
TTTTCGG CCGGTCATGTGTCTTCGTTCATATTtrrCTGGAGGATACTCGAGCCCGCCT 
CGCAGACCAGGAG AAGGCTTCCACACAGATGGCGATTGAGT CG 1 1 ICCICACAGAACTTTC 
ACTCGGGGTCCACCACATATTTGACCTCT AGTT ATCCCACT AGGTTTGTTCCGAGAAATCGT 
CTGTAGGGGTTGGGAGGGTGCACTTGTCATCCTTGAAGATGAG C 1 1 1 IGGGATCTGGAGGT 
XSAAGOCTTrGGTGTTCGAGCCAOCCTCI TGGt IC I IG CTAGCGCAGGGACATCAAGCTCCG 
CAGAAAAGCATGTTGACTCCTGAATTCTCTGAACTCTCCTCTCCTTAAGAGGTGGCCGG 
GACTTCTCAG GGGATTTTTGCAGACGGGCTGGGCTTGCTGTCTGTTTAATTGTTGTTACTTTT

AGCTGCTCrrrGT aaggccttcatttgctcttgcaaattccttaattcct cttgcaactcttcat 

TCGTTTTtrrCTCGCCTGGGGGCAGGAGCAGGGAGGACCCT ATTTTCAGTTGACT GTGATGC 
GGGAAC I I C II C 1 I (^TCTGTTAAGTCCrCCATATCTCCAAAGAGAGTGGCCAGATTTTCCTT 
TTCGTCTCTTGTCT CTOCTGTTTCTOCATCATCAGCCTCTTCTGTATAAGATTCAOCGTCGCC 
GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATTTTCCCXsCGTCAAGAAGTTAT 
TTTCTTCTGAATTACAATCCAAGGCTGACTC A 1 1 1 rCTTCCAGCAGTGCGGTCAGCAGAGACA 
GATTGTCTrCCTCCrCATCCATGCTGTCAAGAGGACAGTTGTGACAGGAAACTTCGAAGGAG 
G CTCAGATG CCCAGATGAGCAATGT AGAAT CTTGGCCG ATG CCGGGACGCCTTCTTCGTT C 

>243 

MNCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTGCTGGGATTACAGGCACGAG 
CCAGTGCGCCCAGCTGCCTCTG 1 1 1 C 1 1 1 1 ATTAGCTGTTCTGGACTGTGGGGCTCCTTGGG 
CAGAT6CTGTATTATGGCGATAAGCCACACA CI I » I » GAACTGGCCCGGTCAGGGGGGACA 
TAACCAT7TOCTGTGCCACCCCATCAATCCCCACCTATTCTGAGTGTAGGCTCCTOCCCTGC 
TTGAGTAATGGCCACAGATXnTGGCTCX^CACTCCTAAGCTGCATGTTGAATTCCTGGGACA 

acaagactggcttgtggttocattctccagat CCTTGGG tt GGCTT ct G GGTG CACT AGG AG 

ATCTGAAATGCTCTCAGGCCACCAGGAAAGTACTGGAAGT AAAGTCTGACTCT AAAGAAGAT 

gaaaatctagt aatt aatgaagt aat aaattctcccaaagg gaaaaaacgcaaggt agaacat 
cagacagcttgtgcttgtagttctcaatgcacgcaaggatct gaaaagtgtcctcagaagac 
tact agaagagacgaaacgaaacctgtgcctgtaacttctgaggtgaaaagatcaaaaatgg 
ctacttcagtggtgccgaaaaagaatgagatgaagaagtcggttcatacacaagtgaatact 
aagacaacactcccaaaaagtccacagccatcagtgcctgaacaaagtgataatgagctgga 
gcaagcaggaaagagcaaacgaggtagtattctccagctctgtgaagaaattgctggtgaa 
attgagtcagataatgtagaggtaaaaaaggaatcttcacaaatggaaagtgtaaaggaaga 
aaagcccacagaaataaaattggaagagaccagtgttgaaagacaaatactt catcagaagg 
aaacaaatcaggatgtgcaatgtaatcgt7ttttcccaagtagaaaaacaaagcctgtgaaat 
gtatactaaatggaataaacagctcagccaagaagaactccaactggactaaaattaaactct 

CAAAATTTAA CTCTGT GCAGCACAATAAGTTGGACTCTCAAGTTTCCCCT aaattaggcttat 
TAGGAACCAGI 1 1 1 IC^CCACCAGCTTTAGAAATGGATCATCCAGTGACTCAAAGTACATTTT 
TAGGGACAAAGCT acatgat agaaat at AACTTGGCAGCAGGAAAAAATGAAAGAAATTAATT 
CT GAAGAAGTGAAAATTAATGATATT ACAGT AGAAATT AAT AAAACCACAGAAAGGGCTCCTG 
AAAATTGTCATTTGGCCAATGAGATAAAACC MCI GACCCACCATTGGATAATCAGATGAAAC 
ATTCTTTrGATTCAGCATCAAATAAGAATTTCAGCCAATGTTTGG 

GTCCAGTGGAAAATGTTACTGCl GC 1 1 CGACTCTG CTCAGTCAAGCAAAAATTGAT ACAGGA 
GAGAATAAATTTCCAGG7TCAGCTCCCCAACAGCATAGTATTGTC/W3 

AGCAGTGATAACAGGGAGACACCACGAAATCATTCTTTGCCT AAGTGTAATTCCCATTTGGA 

gataacaattccaaaggacttgaa^taaaagaagcagagaaaactgatgaaaaacagttgat 
tatagatgcaggacaaaaaagatttggagcagtttcttgtaatgttt gtggaatgctgtatac 

AGCTTCAAATtX^GAAGATGAAACACAGCATC FOCI 1 1 1 CCACAACCAGTTTATAAGTGCTGT 

TAAATATGTGGGCTGGAAGAAAGAAAGAATTCTGGCTGAATACCCTGATGGCA^ 

tggi ICI I cc I GA AGACCC AAAGTATGCCCTGAAAAAGGTTGACGAGATTAGAGAG^TGGTT 

GACAATGATTTAGGTTTTCAACAGGCTCCACrrAATG 

TTCATTTCCAATGACAAAAAAGT AGTTGGCTGCCT AATTGCG GAACAT ATCCAATGGGGCT A 
CAGAGTTATAGAAGAGAAACTTCCAGTTATCAGGTCAGAAGAAGAAAAAGTCAGATT^ 
GGCAAAAAGCCTGGTGCTGCTCAACATTACCAGAGCCTGCAATCTG CGGGATCAGTCGAAT 
ATGGGTATTCAGCATGATGCGTCGGAAGAAAATTGCTT CT CGCATGATTGAATGCCTAAGGA 
GT AACTTT ATAT ATGGCTCATATTTGAGCAAAGAAGAAATTGCTTT CT CAGATCCCACTCCTG 
ATGGAAAGCTGTTTGCAACACAGTACTGTGGCACTGGTCAATTTCT GGT AT ATAATTTTATTA 
ATGGACAGAATAGCACGTAAAACAAATTCTTGCCTACACCACT AGAAGACATCTATTGAAGA 
GAATGGAI I GG'I I GC I GACTTT AACCAGGAACT AGGGCCATTTTT ATT ACAATGAACT CAGGA 
C TGGCAACAACC ATATGGTTGTTCCATTTTCATAAAATTGGAAA 

Ttil 1 1 IUI I II I I AAAGAAGATATTTTATTATCTTTTACAGAAATTT ATGATTGATGTATTTTATC 
TATAGTTATTTAGACATGTTTACATGCAG<^GAT AATTGTTCATAGTGGACT GAAAACTAATG , 
CAAGGACTATGGT CTCAGTG ATAAGT ATATTTTGAAGTTCTTAATATGGAAATATACCAGTGT 
AGCTTGGT ACT GTATTTTTTTATATTGATCT GCTGAT ACCAGTGATAGG CTT AAAGATTGT ATT 
TTCACAGAGTGGAAACCAATTTTTTTAGTTATTGTTCAAGGAGGGTGCAATATTAA
GGAATTTGAAGCT AATTTTTAAAAGGCCTGAACTATACTTTGAAGAAACCCCT ATAGAAAAGG 
AAAGCT CCAGCT AAAT AGGAAGAATT AGAAT ATTGAGl 1 I I I 1 1 1 I CCTG A I 1 1 1 1C 1 C I I ICC 
T ATCTTTGATGGAAGGAGGAAGT AGAAAGTG GT AAAGAATTGAGGCTTTCCTTCTTGGAGAG 
CTGTAA ATGACAAGCATTAGGAAAGGTACCCTCCTAGA 
ACTTTTAAAATAAATG^CAACTTGGCACACCTAGGCTGTTAACAA^ 

AAAACGT AT AGAAT ACTTGGAAGCAAAGTATGGATGACTCGGTATCTGCTTTGTTATTCCTCA 

GAAATACTGCAC TGAGT ATATGCCCTCATTACTGGACTTCATTTTGATACTTGTCTATCCTT 

TAGTGCCCT CT ACTTTT AAAGGGTTTATATGTTGAAAAACTGCTGTGGCCTTTTATGACCTGT 

ATAT AATGTAGAATAAAAAT AAT AAAATACTTGATAG U 1 1 1 1 1 C T AAGTGACCAATGTACT ACT 

GAGGTGTAANNNNNMNNNNNNNN 

>244 

TTTTAGTAGGGATAAGGTTTCACCATGTTGGCCAGCTGGTCTTTAACTCCTGAOCT 
GAGGGATCCACCCA CCTCGG CCCTCCAGTGTGCTGGGATTACAGGCATGAGCC^ 
CCG GCCCTG GTTTGCTTTCTGAACCATGTGAATACAGTACCACCACAGTTGCT ATCTCTTGA 
ACATCf 1 1 CATTAAAA CA TCAC GGTCTAGTTTGAGAAT ACTTTT AAGCCTGCTGGCCT CCTTT 
GGGG CATTC I MU I CTCTTTT CAGCACGCATC 1 1 ICI 1 1 1 CCACTTACTCCGTAAGCTTTTAG 
CCATGTTTTACCTTGAGGGCCGAAGTTAACTrCAGCGGGAGTGAACGACAG GGGTGGGCTC 
^CTTTATCCAGTGGACTCGGAAGCCGG^AGGGCOCCCACCAAAAAGAGCAAGGGGAACC^ 
CGCCCTCAACAAGGCCTGCAT(rr(XX>GACTGGAGCTCAAGTATAG 
>245

NNATGGGGCAACGTGGAAGAAATGGCCAGAATTTf4AACTGGTCATCAATGCGAN^ 
CTTTTT ACAAAAGNCCAGAGCCAtSAATAAGCCTTTCATTAAATUTTCCCACCA 
GGCXXrrCAGTTCTGTACCTGGAATGCACATCCAT AGTCACTGT GT ACAATTGCTTGAGTGAG 
TTCATGGTCCGTAGGAGGATGACCACTAGCCCACCAGCTTCCACTGTTTCT 
CAGCAAGTTTGGAGTTAAGGCTTCAAAATtxn"GCAGCACACACATGCCGAAGGTATTGC^
GGATCTT GTGGGTCT CGTTGTAGTAGCAGTAGCGAATGTTTGTGGCTGCTATGAAQAGTTCA 
AAGGGGTCGTCCTGCTTT ATGTTCAGTGTTCCATTCTTT A 1 1 1 ICI I C I GCAGCTGTCGCATT 
CTTT] CTTCCGGTGACT GCT AAACCCCAGC ICI I ICI I ATAACACCACAGCACTGAAGGCCG 
AGCC 1 1 t^CAGTTGCTTTGGAT AACATGTGA7GAAGTATTACCACCT GATCTTTTCCTCGATC 
CCCAACT ACAAGAAAG A GAGATCTTTG CCGCTCAGCT ACTCCATTCTCAATGAGAATCCGGA 
TTCGGTTATCCACXrTTTTTCCGATGCATGGTGAAAAATTATTACTGCG 

GATCCCAGCCAGTGGAGAAGGGGAAGCACGTGGGAGCACGGAACAACTCCGAAAGAGAAG 
GCACAGCTGACTGCCtXTAGCGTATGCCTGGTGTCCGGGTCAOGTAGTAAGCCT GGCTCCG 
CGGCACGCGTTXXTCTCAGGAACTGGTAGACAGCACGTGCGCA^ 
GAGATGGCGAGCGAGAAGGGACCTTCCTCAAGGCTGTCGCTGAGTAATGACGT 
AGAAGCGGAGCTGAGGCCT AGATGTCCACTGCGGTTGCGCATTTTAGT CCCACT GCT CGCG 
GT AGCTGGGTGTTGGCTGCGGTTTGCCTCTT CACTCTTCATATCCCAGCT AAAGT AOCT CGG 
CTAAGAAGGGCAAAATAGATGTCCGCACCCCAGGGCCTGCCCACGT AGGTCCAGGGTTTTC 
AGCGTCAGiXrrGGTTACGCGAAACGCQACGGATGCAACGA 1 1 IGI U 1 I CAATTT AACCGATC 
CCCGTTAAGTTGAGGAAACTCGGCCAACAGACCTATTTTAT^ 
CCCCACAGTACTGATCTTACAGAT^^ 

GGGAGTCXiACCACGCGTtXX^OCOCTCGTCTGCGTCAGTTGGT 
CGGAGCGGGCGAGCGGAGTTAGCAGGGCTTTACTGCAGAGCGCGC^ 
ACCGTGGGGATCAGCGTAGGTGAGCTGTGGCCTTTTGCGAGGTGCTGCAG^ 
GTGCGTTCGCT ACGAGGATTGAGCGTCTCCACOCATCTTCTGTGCTTCACC^ 
GAATCCGAGTATGAAGCAGAAACAAGAAGAAATCAAAGAGAATATAAAGAATAGTT 
C^AAGAAGAACTCTGAAGATGATTCAGCCTTCTGCATCTGGATCTCTTC 

GAGCTGTCCGCAGGCTTGTCCAAAAGGAAACATCGGAATGACCACTT AACATCTACAACTT C 
CAGCCCl GGGGTT ATTGTCCCAGAATCT AGTGAAAATAAAAATCTTGGAGGAGTCACCCAGG 
AGTCATTTGATTTtTATGATTAAAGAAAATCCATCCTCTCAGTATTGGAAGGAAGTGGCAGAAA 
AACGGAGAAAGGCGCTGTATGAAGCACTTAAGGAAAATGAGAAACTTCA 
CAAAAGGACAAT<iAAATTGCCCGCCTGAAAAAGGA 

AACATGTACAGTATATGGCAGAGCTAATAGAGAGACTGAATGGTGAACCCTCT GGATAATTT 
TGAATC^CTGGATTGATCAGGAATTTGATTTCTGGAGAAGAACCTGCT GGACGATTCTCTAGT 
GGGAAGACTCGGAAATTGGCACGTGTTGCTTGAAGGAACTGGTTCTCTCTrTTGCGGGGGCC
6COGC1 R51 1 AAAGClTAATTTTGCGGTGGGAATrAAGGCGAAGTTTACTCCCATTTTTT 
GGAfiTATATACAACTTATGGCCACGCCTGTGTCAC^TTCCCGTCTGGAGCTGGGACCATTG 
GTTAGTAX^ATCCTTCTTCGATACAAAACAG^GGGGCkSGCAAAATCCTAGGGGCGGAAAAT 
TACGCATTGTAAGAGGCGAAGAGTATAAAGAAGCACGGTAACCGGACAGAC^GTCTTAGCA 
TGTTACT GAGCGCAAGCGCAT AGTTCATGACCTAGTTGCCGTCTAGCACACAGTCGCACNN 

NN^^CGCGGCCGCGTNCGACAGAAGACGACAGAAGGGTACGGCTGCGAGAAGAC 
GACAGAAGGGTACGGCTCGAGGTAGGCG^CGAGGCGCGGCGGCCGGAAGATGGCGGCG 
G(XX>CAGAGTTGTCGCTACTGGAGAAGTCCCTGGGACTGAGTAAGGGGAATAAATACAGTG 
CTCAGGG^GAGCGACAGATTCCAGTTCTTCAGACAAACAATGGTCCAAGTCTA>U^GGATTG 
ACT ACT ATAG&MSCTCATCTAGTCAAGCAAGCCAACAAAGAATATTTGCTGGGGAGTACTGC 
AGAAGAAAAAGCAATC«TTCAGCAGTGGTTAGAATACAGG<5TCACTCAAGTAGATGGGCACT 
CC^AAAAATGACATCCACACACTGTTGAAGGATCTTAATTCATATCT^ 
ACCTT ACAGGGTATAACTTTACATTAGCAGATATACTATTGTACTATGGACTTCATCGCTTTAT 
AGTTGACCTGACAGTTCAAGAAAAGGAGAAATATCTTAATGTGTCT CGCTGGTTTTGTCACAT 
TCAGCATTATOCAGGCATCAGG^ACATCTGTCTAGTGTTGTCTTC^ 

tact ^ttc^^actagaagctgtccatgcx^tacagaagatctattaaaaatgttttaaatgg 
aaaatgtactct ag aocacaggact aatgtaaattaatatacagtcattcattatttgttgaag 

TTGATAGAAi i 1 1 i GAAGTGTAAACTTGTGTCTGAATGTTTTATTTGTTCTTT AGCTGAAfiTTTT 
GCAATTrjTATGTCAAAATTCAATTGCTATTAAAC^ 

GTTTTTAGT AGATGACATTTATTTCAATAAAAGTTGCAAATCGGGCTTAATCTTAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAAAAAAAAAAAAGAAAAA^ 

AAG^GGGGGGGGGTAAAAAATTTCCCCGGGGGGGGCCAAAGTTAATGACGTGA 

GTTTTGTGGTGTAAAAGGGGCGCCAAAATAGAGGGGGAGTAATATAAGAGAGAGAGCAGAC 

GGGGOGGTGTTATAAACAGTTGTTGTr£^ 

t77%Z~9£9ZlJ^ ' • ^^^G^GAATTGTGGACACACCT AGaGAGAATTTAAGCCAGGAAAT 

TAACTTAGGGTGTATGGTGGTAACGCGATATTGTGGAAAATTTGTCAGATTGTTGAAATANN 

>248

NNWKWN NHKNWCCGAGGTACTTNN I 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 I I U 1 1 1 1 1 1 I III I f I 
■ i it 1 1 1 1 III I ACAGAG ACGAGG AATTTAATTAGGGTTGTAACAAATGGTTAATTATAGTAAG 

AAAAACCAAATTGAATAA I f TT CT aactcacttggcagggggggtctogcagccataatg AA 

E^ AAA £5 G /»I5£^^ 

TTTTAATACTT ACTCTTAATAAATCATCTATTCTTCCCTCCI ICI ICTCTAAfifirjl^AATTfrrrA 

CTu 1 1 1 1 ctagggc agatattttttctattgtgaggtcggactgggtctgtct 

GG^ATCTui f 1 1 1 GGGAGCTGCAGGAATGCTCTGTGTTGCCAGAT<XXX3TAAATGAGGGA 

gg] 1 1 it^CTGAGCTGAAACAAAAGTGAAGCAGAGGCCAGCTCLX AJt I rG CGCCAGCTGCT 

CJGTGTCAAGOCGAfiG^^ 

^CCGJTOCCTOTTTGCAGCACT^^ 

CJAACG<^TTGGATAAGGAATGTT^^ 

AGGGCTCGT FCCTGGOGOGC1 rOCOOCAGCAAAinYSGOCMACAGGCAAAAAOCGAOGGQC 
>249 

^^*«ATGTGGTTACGACCCACTGTATTGAGGT 
GJJJTCGCTCAra^CACAAA 

GTTCTCGGUG I CATTTTACAGTGTTTTTCTACGGGGTTACATTACAGGAATGTTGTAGCGAC 
GJTTAGCCOGTGGAGT ATCA^XrrCTTGAGACT CCGTGTGAGACTCCCTG O HIGH CCACA 
fJAgngGT™ 

™ 1 ^ G £ AAT J5S^ AGTGCA ^^ 

GCACATCCAG nC 1 1 } GATGTCTGGCTGCC 1 1 CIGO GGGCCAAC I G ICI T G I G GAATTCGTT 
GCTCCCAGJ^TAGCT 

TGAGCCT GGACATGGCGG CCGAGGTACTCACAGTCACG^ 

GCTCJOCAI 1CJ ICI ICI ! GGCTTTACAGGTTCCCAGGTCAAGAGCTTCAiXXATAATT AAGA
CCTTCTQAGt^TGATCQATAGATAAACACACCrCC I C I GAACCATCCTTGGGCTTCATGGGG 
TTG<^TTGAGGATDCCTAaSACAGTCCCCTGCT 



JrI A I!7 CTT CCAGCTT CT CT CTGGCX^TCTTTTCCTTGATCTGAGACAGTCTGATCAG 
TTTTCGG CCGGTCATGTGTCTT CGTTC^TATTCTCTGGAGGATACTCGAGCCCGCCTCGAGC 
? G £^££ AGGA 5^ G ^ 

C7G I* GG ^"^ GGGAGGGT GC^^ 
GAAGCCTTTGGTGTTCGAGCOVCCCTCTTGG 

CAG AAAAGGATGTTGACTCCTGAATTCTCTGAACT CT CCT CT CCTTT AAGAGGTGGCK^GGGA 

GAC ™^ GGG *JI7JIG^^ 

GTCGGCATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATTTTCCCGCXS^ 
TTTCTTCTGAATTAGAATCCAAGGCTGACrCATTTTCTT CCAGCAGTGCG GTCAGCAGj^GAGA 
GATTGTCTTCCTCCrrCATCCATGCTGTCAAGAGGAGAGTT^ 
JCTCAGAJTGCCCA^^ 

>250

A J^^ c t c ^ G ^3^I CGA JTI AC ^ 

AAGOG^AAGGAAAAA^^^ 

TATACCAACTGATTGAGTATCTACACAAAAACTTGCGAGTAGAGGG I UGH IAGAGTACCG 
GGTAATAGTGTCCGACAGCAGATTTTAAGGGATGCTN^^ 

x^ c ^s A f^ c Hl CTG ^ GACACATAAACA ^ 
^ A f77^™™ c ^^ CTCCCTCCTCCTAATCGTAATT ^ 

CCTATACCAI^AfiCA^ 

TG1 ^ GCA «>CCACGTCCT G I GGCCAAAAAATGTC^CTGCAAATGACCTTGAGGAAAATATr 
^^^^C^AAACAGTTjGGATGGCITTT ATGATT AAACACT COCAG>GjAcVTTTTTAAGGCTCCTr 
GCTTATATTCGGGAGTGTGCGAGATTGCACT
TGACCTTGACCTCATAGCTTCATGTCATACT AAGTCCTTT CAGCTGGCAA^^^T^CAGAAACG 
GAA^G^AGATTOraDC^^ 
GAGAGAGCTGTTTCAACACGTTCATGA^ 

GACAGTTTAATAAGCAATCATTGACCCAGACACCAGGGCGA^ 
^ AAAGA GGOJTC«TTC^^ 

t? A I^^ CAGAAAAGAAAAAGAAGAACOT 

AAGGGAACCAGCAAAGAAAATAGGAACTTATTATTTTCT 

ACCAACAAG^GAAGTGGTCTGAA^ 

£&*?^™CTttATGGTCCATGCAGAA 1 1 1 I CIG I 1 1 AGTGGGCAGGTGTTATTOCTG 

CCCACAGCAAAG^TTGGA^^ 

TA OCGTGAOC^JCTttAT^^ 

ACCAfiCTrATACAGX^CACTAAGGTGGGCTTC^ 
CACTTTTGAA^ 

CAGTGG ATTG^TGTTGTTTTTAATGGATATTT^ ^ ITITTCTCTGTn I GATAGTATTGGGTA 

IT^yT?. 1 ' ' 1 1 S I ? lA tl 1 Cl 1 * rrrT aaaagccatgtttttggttgggctctaagct 

^^TCTTTtAX ICI 1 1 1 rCAgrrrGAGCTTTCGGAAAACTCm 

ggtgtgacatcagagt atttgacctgcagtattgaaaaaggagaattcagaatgatacagta 
1 1 1 tAACAAAi Cl IAAI 1 Al I AAACiCi I ITCCTI CC ITCCATTTCT CCCTCCCTT6TCCATCTC
TCTCTCTCTTTCCXrrTTCCTCAGTGATGTGAAAATAATTGTGTTTTGCTGAACTTGT^ 
ATTCAATTTCCTCTTGACTAAAACATCTCTGGTGCCAACGTAATACTTCTGAACCAGATCACT 
gtgactcaaggaaagtcact GACAGCAT AAG AG AAGTTTGCTAAAAT ATTTGT ATGTGGGGG 
AAGCTCTGGAGTGTGCCTAGGAGGGGGCTGGCTGCCrrTATGTCCCAGGATGACTCTrTAT 

gggtgggattacattgcaccctctgagggtgcaggctagaccgtctcctgagaggaagtta 

ggatcagaaagaagaagcaagcagcagcctctgcagggctgacaggatttaaaggagagaa 

tgttcttatttggaagcagctgtggcttgtcaccaatgttcaaggagtgttactgttccgcc 

ctctctttgtcagaagggacacaggtggt aatttggagatggggccagagcttctggctttt 

ggatttggtgtgttcacttgtgttggatagagc^gtggcatggctttgacctagtatgaact 

ggtgtctgcccagagagcagcatgtagcaggggggaatgctcaggtttgtgcctggctct 

gtggagctgtacaacccttctcaccctgtgggttggagccgagtcaggcgactatggggaa 

gcagttgccccacaa^^g^g^3^ 

cctctgccagcctttct gct gcttttgctgctgtaacagcaatagtggagaaaaatgt aaaat 

TTGGTCTTCCAGCTT AATGCAGTGTGAACAATAGATGGTTAGGAAAACAAAACTGCTTAGAA 

GGCCCTTTCTCTAGAGCAGTTTTATGTCATTTGTAAAAACACATATTAGCAAATTCGTTCGCG 

TAGGTTTCTATTCAATATTTGAC I rTTTTTTTCl I ATTAAGAAAATGAAATCCCTTAGACCAGAT 

ATCAGTTAATTCAAACAGAAAACCCTTTGGGTATCANNNNN 

>251 

>252 

GGC GatTpgagctCCCCGCGGTGGCGGCCGAGGTACATTTTACT ACGCACCCTTACGC 

ATTui II I 1 ctcacctctgtgtgtgtgtgtgcgtgcacatgcacacacacaaatgggtgaaa 

CAATTCTC^CCATACCAAGAGCCACGGCGCCtrrGCCGAGAATTTGCAT^ 

CAG GTGATGCTGACACTrGCTGGCTCATGGAACCACTGCTGTAGTATTTTCCAAATTATCCT G 

AT TCTAA GAACCACCT ATGAGCTGTG CTGTTTnTCT GTG GTT ACT G GCTCATGTCACAT AAA 

TTCTTTTAGGATTCAAACATGTTTGTGATATTACTCAGTATTTAGATCTTGC^ 

ATGATGGAAAAATTAACCACAGGTATATCATAACAAAAAGAACATGAGTTACGATTTTCACAA 

AGTTCAGATATATTTAAATTAGCCTATTTAATL 1 1 1 1 1 1 I GGTTGTTGTTGAGATGGAGTCTCA 

CTCTGTCTCTCAGGCTGGAGTACGTGCTGTTCTATTGTCGAAGGCGGGGTCTGCACAGAAC 

AGCTTTGGAAGGGGCTrGGGC«GTGTCTGTGGTGTCrrGCAG«TCGTCGTCGTTTGGCCGCT 

GCTTGTGGTGGTCGGTTATC<rrTGTCTGTCTGTATGTCAGCAGTrrCAGTAGTGGATATTTG 

ATTCACT GTGCCGATAAGTAGTTG CTGTATGGTTCGTAGGTCACTTGCTGTCCCCGGGT 



ACGCGGGGACTCGCXaTCGGTTGGCGACT CCCGGACGTAGGTAGTTTGTTGGGCCG 
GGTTCTGAGGCCTTGCTTCTCTTTACTTTT CCACTCTAGGCCACGATGCCGCAGTACGCGGG 
GGGGTGAAGAAGGGGCCGiGCCTTCAAGCAACAGCGACGGAAGATGGCAGCCACCACGGG 

GGAGTAGGAGATGGCACAGTTAGCTGG^TCTAGAAGATGAC»AAGACATGACAC^ 
GATGGACAGGG*TGATAATTGGGCCTCC^^ 

AAAATAGAATGTGGACCTAAATACCGAGAAGCACGCCCCTTTGTAAGATT^ 
AATATGAATGGAGTAAATAGTTCTAATGGAGTGGTGGACCGAAGAGCCATATCAGTGCTAGC 

aaaatgggagaattcatatagcatcaaagttgtcctgcaagagcttcggcggctaatgatgt 

CTAAAGAAAATATGAAACTCtXn"CAGCCGCCCX3AAGGAC^GTGTTACAGGAATTAATCAAAA 

agaaaaaccacaggcccttccccttcccccgaattcgatttaatcact 

TAGT AAATTTTCTAGATACGTXrrTGTAGACCTCAAAGTACGGGK^GAAAATCCACAAGAG^ 

CAGCAACCATGAGTACAAATGGTG^TGATCATCAGGTCAAGGATAGTCTGGAGCAATTGAGA 
TGTCACTTT ACATGGGAGTT ATCCATTGATGACGATGAAATGCCT GATTT AGAAAACAGAGTC 
TTGGATCAG^TTGAATTOCTAQACAOCAAATACAGrTGTGGGA^ 

GTGAAACACCTG AAAGGCCAGAATGAGG AAGCCCT GAAGAGCTT AAAAGAAGCTGAAAACT 

taatgcaggaagaacatgacaagcaaggaaatgtgagg^gtctggtgacctggggc^

GAGAACATTTGCAAG AAGCTTTCAAATCCCTTCCGCT AT AGAATGG AGTGTCCAGAAAT AG A 
CT GTGA GGAAGGATGGGCCTTGCTGAAGTGTGGAGGAAAAAATTATGAACGGGCCAAGGCC
TGCTTTGAAAAGGTGCTTGAAGTGGACCCTGAAAACCCTGAATCCAGCGCTGGGTATGCGA 
tctctgcct ATCG CCTGGATGGCTTTAAATT AGCCACaAAAAATCACAAGCCATTTTCTTTGC 
TTCCCCTAAGGCAGGCT GTCCGCTTAAATCCAGACAATGGATATATTAAGGTTCTCCTTGCC 
ctgaagcttcagg^tgaagg ACAGGAAGCTGAAGGAGAAAAGTACATTGAAGAAGCTCT AG 

ccaacatgtcctcacagacctatgtctttcgatatgcagccaagttttaccgaagaaaaggc 
tctgtggataaagctcttgagttattaaaaaaggccttgcaggaaacaccgacttctgtctta 



agggc agcctagagggcaaaacagagaaaagctagacaaaatgataagatcagccatatttc 

A^^GAATCTGCAGTGGAAAAAAAGCGCAC ATTTG AGGTGGCTCATCT AGACCT GGCAAG A 

ATGTATATAGAAGCAGGCAATCACAGAAAAGCTGAAGAGAATTTTCAAAAATTGTTATGCATG 

AAACCAGTGGTAGAAGAAACAATGCAAGACATAGATTTCCACTATGGTCGGTTTCAGGAATTT 

CAAAAGAAATCTGACGTGAATGCAATTATCCATTATTTAAAAGCTATAAAAATAGAACAGGCA 

TCATTAACAAGGGATAAAAGTATCAATTCTTTGAAGAAATTGGTTTTAAGGAAACTTCGGAGA 

AAGGCATTAGATCTGGAAAGCTTGAGCGTCCTTGGGTTCGTCTACAAATTGGAAGGAAATAT 

GAATGAAGCCCTGGAGTACTATGAGC^GGCCCTG^GACTGGCTGCTGACTTTGAGAACTCT 

GTGAGACAAGGTCCTT AGGC AGtXAGATATCV^GC^CTTTCACATTTCATTTC^ 

AACATTTACTAATCATCTTTTCTGCTTACTGTTTTCAGAAACATTATAATTCACTGTAATGATCT 

AATTCTTGAATAATAAATCTGACAAAATATTAGTTGTGTTCAACAATTAGTGAAACAGAATGTG 

TGT ATGCA TGTAAGAAAGAGAAATCATTTGTAT6AGTGCTATQTAGTAGAGAAAAAATGTTAG 

TTAACTTT GTAGGAaATAAAaCATTGGACtTACACG AAAtgm«»tir »nr.*ttttowgtipa m»« ttet a n f a 

fccttaoc t rx tc caccaa rrtpairea gactttl ^ 

f^Qa9 a * e » w ">cacctca«atattooc tnr^^ 

>255 

nnctc^gaggattaccgacccaggcgtcxxacaacatcctggcttagtattgtgtgc 
aaaatcag ag aggggtgcaag atcct gatttttcagg agttcaagcgacaatggcagccca 
atacggcagtatgagcttc^cxx^gcacaccaggggccagttatgggcctggaaggcaa 
gagcgcagaaattcccaattgagaattgtgttagtgggtaaaaccggagcaggaaaaagtg 
caacaggaaacagcatccttggccgglaaagtgtttcattctggcactgu^gcaaaatccatt 
accaagaagtgtgagaaacggagcagctcatggaaggaaacagaacttgtcgtagttgaca 
caccaggcattttcgacacagag^tgcccaatgctgaaacgtcgaaggagattattcgctg 
cattcttctgacctccccagggcctcatgctctgcttctggtggttccactgggccgttaca 
ctgaggaagagcacaaagccacagagaag atcctgaaaatgtttggagagagggct agaag 
tttcatg^ttctcatattcacocggaaagatgacrraggtgacaccaatttgc^ 
aagggaagctccagaagacattcaagacttgatggacattttcggtgaccgctactgtgcx^ 
taaacaacaagggaacaggcgctgagcaggaggcccagagggcacagttgctgggcctga 
tccagcgcgtggtgagggagaacaaggaaggctgct acactaataggatgt accaaagggc 
ggaggaggagatcx>gaagcaaacacaagcaatgcaagaactccacagagtggagctgga 
gagagagaaagcgcggataagagaggagtatgaagagaaaatcagaaagctggaagataaa 
gtggagcaggaaaagagaaagaagcaaatggagaagaaact agcagaacaggaggctcact 
atgctgtaaggcagcaaagggcaagaacggaagtggagagtaaggatgggatacttgaatt 
aatcatgacagcgttacagattgcttcctttattttgttatc
ttaatgaaaa rcTGTn gtattttctgcatattctctggcaaccttgccccatacttacttatt 

TAGCATAGTCGAGTGCTCTAGTTTCTGTCTCTCAGGCACTCGTAACTAAGGACCACCATTGG 
CCATTGGT AGATGTTTGATTGACTTAACAAGAGAGGGACAAATTTTCAATTTGTGAAACTCCA 

aagcagaaagtattggtgcttgctaccttgtgaattcttccttagacatgcagagaaaatgta 
tgcaaqagaccaaaaagatggctccaagct ATGTCATGTTACCTGTAATAAAAT C 1 1 1 icn c 
tagattctttctatgttggc^gataatctcoccttgtagct^ 

AG agtcacaatgatcatcttacccatgtggtttttgagaaagaaagatcaa I Id IIG II 1 GC 

AGTAGGT AATCTT AGAGATGGAGATGATTGTAGAATTATTCCTAGATGAGTGTCAATTT ATTT 
AATTCCATTGTCATATAAGGAGTCAAATTGTTTCTTATCATTTGTTCATTGAAGAAC 
TGTCTGGAAAATC^^TCTCTACAAATTCAATTAAATAATGATGCGCAAATGCTGAAAAAGTG^ 
AATACAGCAATTCAACAGAT AAT AG AGCAATGTTTAQTATATTCiAGCTGTATCT GT AGAAACT 
CTTTGACGAAGCTCAATTT AACCAATTTGATGAAT ACCCAGTTCT C ) ICI 1 1 1C TAGAGAAAGA - 
TAGtTGCAACCTCACGTCCCTCAtrrCAACACTT^ 



Table 4 



PCT/US00/33312



ACACTTCTGCCCCCACTGC^TTGAATTTTTTGCTTATGTTGTTTATAATAAA^ 
CTCAAAAAAAAAAAAAAAAAN NNNNNNNNNNN 

>256
>257 

CGCGGTGGCGGCCG^GGTAGTCTGACTTGCAGGGCGACAAGACCGGCCTTGCGAG 
CTGAGGTTTGGGCGTCATAGACTAAACCCAGAAAACAGAACATTGGGAAGTCTTCGGAATAT 

TCT CT ATCTTCTTCACCAACG agtaag acgttttggaataatgggactctacaaaggccttga 
agc^aactgctgcagac^gtcctcactgctgctctcatgttccttgtttatgagaaactga 
cagctgccaccttcacagttatggggctgaagcgtgcacaccaacactgagacgccttccc 

ATGAAAAATTCCG AAGATGCTCAAG AG GGAG GTTTCCTCCTGAGTG AAGAG AAG TGATTCTC 
CCTrGACTCTGGCTCCTGCCACGACAAATGTTACCCTCATTGGCTTGAAAAGCATCCAAGGG 
TGCACAGGGAGTATGGCAACTGGACCTGTrGTCACCTTAATTGTCATGCTGGCTGGTTGGAT 
TTGGGGTGGCAGTTGGACTAATGTGAAAAAAACATTGCTGAAAACCTAAAAATGACAGTTGT 
GAG TGTTT ATTGGI I I ICI \ AAGAGAAATGGACTATTTGCTCTCATGTGTAATGTTTTCTATTT 
AAATCTTTCTTAAAT ATACGAGCTGTTCTCTTTCCCT G AACT CTCCCCCA G GTTCTAG ACAATT 
TAATAACATGTTATTCTTCTCAAATACTTTTGTCATGTCTCAAGTGTCGGTGTTTTTACTGCAC 
TTAAAAAGTAACATTTATGGGCTNTGGACGGGGCT GAACTTCATCTCGGTGGGCCTTTTATA 
CCGGCT ACGGGAAGGGGGCCTCATGGGCTTTGCACCA I Mill I GTGGGAATGAAC ACTGG 
TGCCCAATGGATTCTrTA>VACATTGGCAGGTACATTTCACAGGGGAGAAAACTCCGATAGCT 
TGGAAACACCCGTATCACAACTGCG AGTG CGCTCACT CAACCGGTGAGGCAAATGGGCAAC 
AAAAAAAAAAGTGGCGGGGCGGCACATAN 

>258

NNNNNNNNNNNAGGGAAAGCCTGCATGGGAAGTGGGAAGTTA^ 
CAAGGGGGACCCGGGTCCATGTTGGCGGGGGTAANCCCt^CCCACCACCCNGCCCGGNG 
GTTTAAATGCl^CCGXaTTACAGGGGNGNGGTCCCCATTTCGCCCATTKCAGGGC 
AAC^ITGTTTTGGAAAAGGGCCNATTCGGGTGCGGGNCC C I 1 I H I OGGCTT ATTTTACGCCA 
AGCTGGGCGAAAAAGGGGGAATGTGCCTGCCAAGGG CGATTTAAGGTTGGG GTAACCGCC 
CAGGGl 1 1 1 1 IACCaGTCAAAAACXjTTTGTAAAACGNGGCCAGTGAAATTGTAATACGACGA 

WWNNNNNNNNNNNNNNNNNNNNG 

CGACGTCGCCG ^CCG CCAaCATGGCOGCGCCCAGTGGCGTCGACCTGCTCGTCX^ 
GTTCTCATAGAATTTTCTCTTCACCACTCAATCATATCTACTT ACACAAGCAGTCAAGCAGTC 
AACAAAGAAGAAAI I I CM I II I CGGAGACAAAGAGATATTTCACACAGTATAGTTTTGCCGG 
CTGCAGTTTCTrCAGCTCATCCGGTTCCTAAGCACATAAAGAAGCCAGACT ATGTGACGACA 
GGCATTGTACCAGACTGGGGAGACAGCATAGAAGTTAAGAATGAAGATCAGATTC^GGGC " 
TTCATCAGGCTTGTCAGCTGGCCCGCGACGTCCTCCTCTTGGCTGGGAAGAGT^ 
GACATGACAACTGAAGAGAT AGATGCTCTTGTTCATCG GGAAATCATGAGTCATAATGCCTAT 
CCCT CACCTTJTAGGCTATGGAGGTTTTC<iAAAATCTGTTTGT ACCTCT GT AAACAACGTGCT C 
TGTCATGGTATTCCTGAC^GTCGAGCTCTTCAGGATGGAGATATTATCAAGATTGATGTCACA 
GTCTATTACAATGGCTACCATGGAGACACCTCT 

atgtggtaaaaagttagtggaggttgccaggaggtgtagagatgaagcaattgcagcttgc 
agag caggggct cccttc7 ctgtaattggaaacacaatcagccacat aact cat cag aatgg 
ttttcaagtctgtccacattttgtgggacatggaataggatctt actttcatggacatccaga 
aatttggcatcatgcaaacgac*gtgatctacccatggaggag 
agccaatcatt^cggagggatcccctgaatttaaagtcct ggaggatgcatggactgtggt 

CTCXX^AGACAAT(>AAGGTGTCGGCGC^GTTCGAGCACACGGT^ 

G CG CG CAGATCCT GACCAACT ACCCAT GAG CTGAGGAGCC<iCCGAAGCCCTCCAAGG CCA 
GTCCAGAGGTGAAGTTTGAGGCCCTCCCOCCACCCACOOCACA 
GACCGTTTGCTGCACT AGGAATTCGAGCTTTGGGCCCCACTCGCCCAGACT^ 
CATCGGGACCCGCGCTT GGACGCT GCAGGAGCCGCAAACGTCAGCT G TT CT GGAAACCGA 
GAGGTCCGAGAXiAGAGGAGATACGGGCGCATTTGAGAGC^AGGGCCTACTTGGCCX5GGAC 
TGAAGCTTGCGAGACT AAGAGCCAGCTCATCTrTTGTAACATTCATAATACXaGGAAACTGAGG 
ACt>GGTGGCTCGGAAAAGAG>TGAGTTCt>GCTTT^ . 
CCCCC^ACCXXTCCAGCTCGGC 1 1 C 1 1 1 G I G I CCAGGGTTGTAGATTTTTGGATAGAGGTGT 
TTCT GATTCTACT GAACTGGC<X>AAAACCTCACGCa<^GCCACTGGGATTCCAGCCAAGAGT 
GGCTGCXSGCTAAC^CX^CC^GACGTCCTGGTCCTGAGGTGACTCX^GTAGGCTCCATGAG

gaatwc^gaccctcaggacaaatggg 

>259 
>260 
>261
>262 
>263 
>264 



>267 

naagagcaaccgagatgaaggtgaagatgctgagccggaatccggacaattatgtc 
cgcgaaacx^aagttggacttacagagagttccaagaaactatgatcx^gct 
g^ggtcccacgagaatatataagagctttaaatgctaccaaactggaacgagtatttgcaaa 
acc^ttccttgcttcgctggatggtcaccgtgatgg^gtcaattgcttggcaaa 
agaagctggctactgtcctttctggggcgtgtgatggagaggttagaatttggaatctaact 

CAG CGGAATTGTATCCGTACCTCGGCCGCGA(X^CGCT AAAAT AAA CAGCAGGACAGAAAT 

tgtgttgttttgttctttgaaatacaaccaaattctcttaaaatgattggtagga 
cctggccgggcgaccgccaccgcggtgagcnn 

>268 

aaaaaattaaatctgacaatgtcaagttataaggtggaagaaagaagtaatttctcag 
agaaattcaatact atcaagcaagttataactctttacagtgtttaatttatatctgtcaaggg 
ttagtgtctggaaatttaataatcaca I mien icatttctattctgaaaggtctgttgatg 

CCTGCCATCACAGTTATCATTCTAGTATTATATTTTGTCATTGACACCTTCT GTT CAGAAAA 

GACCATGGCTrGCTGAGTGCAC^C^^TTTATATACAATACTTTGTGAAGTTCTT 

GAGTTACAGTTTTAGTGGTCGCAGTGCCAGAAGGTCT^ 

GCTTATTCAGTCAAAGTAAGTAGAAAATAGTTC^GTAGACGTTGTTATGTTTAA

TGGTATAGAAAATGCCCATCTAGTTACTCCTTTTCTGCTTCrrGAAAGTGAGGA 

AATGT 

>269 

cccacgcgtccggtgagacttgcctgctcctct ggcccctggtcctgtcctcrrc i 
ccagcatggtgtgtcrrgaagctccctggaggct cu^gcttggcagcgttgacagtgacact 

cttaagtctgagtgtcatttcttcaatgggacgg^gcgggtgcg 
ccataacc^aggaggagttcgtgggcttcgacagcgacgtgggggmgtaccgggcggtgac 
ggagctggggcggcctgtcgccgagtcctggaacagccagaaggacctc^ 
gcggggccaggtggacaattact gcagacacaactacggggttgtggagagcttcacagt 
gcagcggcgagtccatcctcaogtgactgtgtatcctgcaaagaccgag^ 

ACAAGctgclO^TGTTCTGTGAGTGGTTTCTATCCAGGCAGCA 

AATG GCCAGGAAGAGAAGACTGGGGTGGTGTCCACAG G CCTGATCCACAATG GAGACT GG 

AGCTTCCAGACCCTGGTGATGCTGGAAACAGTTCCTCX5GAGT GGAGAGGTTTACACCTGCC 

AAGTGGAGCAOCCAAGCGTGACAAGC^XTTCTCACAGTGGAATGG 

TGCACAGAGCAAGATGCTGAGTGGAGTCGGGGGCTTTGTGCTGGGCCTGCT 

GCCGGGCTGTrcATCTACTTCAGGAATCAGAAAGGACACT 

CCTGAGCTGAAGTGCAGATGAGACATTCAAAGAAGAACTTTCT 

AAAAGCI 1 1 CCC I CCTGGCTGTTATTCTrCCACAAGAGAGGGCTTTCT CAGGACCTGGTTGC 

TACTGGTTCAGCAACTGCAGAAAATGTCCTGCCTTGTGGCTTCCTCAGCT 

CTGAAGTCCCAGCATTGGTGGCAGCGCCTCATCTTC^ 

CCCT ATGGCCTCCTGT GCATCTGTACCTGCCC%)goQgccGCTC<^ 

GTTGACAGAGTTGCCCACGGTAACAACCTCTTC^^ 

GTGCCTCCACTAlCCCGCGTACCXaSGC^ 

>270 

>271 

>272 

>273

>274 
>275
>276

CGCGGTGGCGGCCGAGGTACGTTCT ATTCCTGCTCCTATTAGGTCCTTCTCACCGC 

acoggccctcggtcgattacgcctctccagttctgctggggacgttctagcctcgccccac 

CCGCGTCGATCrrTATGTTATACCGTCACTCCCAGTGCCCTAATGGAACTATCCCTCCACTC 

ACTGCCCCTGGTTCTACCCG&CTCGAGAGGCTCTCCCGGCCCACTA^ 

TAGGCCCGGCCCX>TCAGC«rrCCCCGCGTACCTGCXXX 

>277 

NNGGAGCTCCCCGCGGTGGCGGCTCGAGTACGCGGGGAGGCAGCGGAAAGCTCA 
GCCCATGTGAGGTGCCTCCTGOCAATCACAG ACT ACCCTTCCCTGGTCCTGGAGGTTCAAA 
GAATTGCAG GAG GGTAGAAAAGCACCTGGGTCGGGTGCAGACTGCG GAGCGGGCCCT ACC 
G T GTGCG CAGAAAGAG G AGGCGCTTGCCTTCAGCTT GTG GGAAAT CCCGAAGATG GCC AAA 
GACAACTCAACTGTTCGTTGCTTCCAG GGCCTGCTGA 1 1 1 1 I GGAAATGTGATTATTGGTTGT 
TGCGGCATTGCCCTGACTGCGGAGTGCATCTTCTTTGTATCTG ACCAACACAGCCTCT ACCC 
ACTGCTTGAAGCCACCGACAACGATGACATCTATGGGGCTGCCT GGATCGGCATATTTGTG 
GGCATCTGCCTCTTCTX5CCTGTCTGTTCTAGGCATTGTAGGCATCATGAAGTCCAGCAGGAA 
AATTCI I C I GGCGTATTTCATTCTGATGTTTATAGTATATGCCTTTGAAGTGGCATCTTGTATC 
ACAGCAGGAACACAACGAGACTTTTTCACACCCAACCTCTTCXTrGAAGCAGATGCT AGAGAG 
GTACCTCGCCCGGCGGCCG<rrCGACCGCCCGGGCAGGTACGGGGGGATAGTTCACTCACT 
TTCAAAGCCAGCTGAAGGAAAGAGGAAGTGCTAGAGAGAGCCCCCTTCAGTGTGCTTCTGA 
CTTTTACGGACTTGGCTT GTTAGAAGGCTGAAAGATGATG GCAGGAATGAAAATCCAGCTTG 
T ATGCATGCT ACTCCTG GCTTTCAGCT CCTGGAG TCTGT GCT CAGATTCAGAAGAGGAAATG 
AAAGCATTAGAAGCAGATTTCTTGACCAATATGCATAGATCAAAGATTAGT AAAGCACATGTT 
CCCTCTTGGAAGATGACTCTGXrTAAATGTTTGCAGTCTTGTAAATAATTTGAACAGCCCAG 
GAGGAAACAGGAGAAGTTCATGAAGAGGAGCTTGTTGCAAGAAGGAAACTTCCTACTGCTTT 
AGATGGCTTTAGCTTGGAAGCAATGTTGACAAT ATACCAGCTCCACAAAATCT GTCACAGCA 

gggcttttcaacactgggagtt aatccaggaagatat7cttgatactggaaatgacaaaaat 
ggaaaggaagaagtcataaagagaaaaattccttatattctgaaacggcagctgtatgagaa 
taaacccagaagacxxttacatactcaaaagagattcttactattactgagagaataaatgatt 
tatttacatgtgattgtgattcatgatcccttaattaaatatcaaat^ 
tgacaaacacacttatctgtctcttctac^ttgtggtttattgaatgtgatttttctg 
atataaattagactaagtgttttcaaataaatctaaatcttcagcatgatgtgttgtgtataat 
tggagtagatattaattaagtcacctgt ataatgttttgt aattttgcaaaacatatxtttgagt 
tgttt aaacagtcaaaatgttt gat atttt at accagctt atgagct caaagtacctoggccg 
ccaccgcggtgagctccaann nn 

>278

^fOWGTCGACCACGCGTCCGGCAAGGCGTCCAGGAGTGACCT GGGGCT GTGGA 
GAGCGACCCGTGGCCTTGTGTTTCAGTTCTCT ATTCAGAAACTTGAACCTTTCCTAAGGGAC 
ACTAAGGGCTTCAGTCTTGAAAGTTTTAGAGCCAAAGCATC rTCTC TT TC l GAAGAATTGAAA 
CATTTTGCAGACGGACTGGAAACTGATGGAACTCTA^ 

AAAGCATCAGA I I 1 1 1 C 1 1 1 UGAAGCATCTGTGGCT GAGATGAAGGAATACAT AACAAAGTTT 
TCTTTAGAACGTCAGACTTGGGATCAGCTXnTGCCTCACTACCAGC^ AAAGAGAT 
ATTGTCCAGAGGATCAACTGAGGCCAAAATTACTGAGGTCAAAGTGGAACCT ATGACATATC 
TTGGGTCTTCTCAGAATGAAGTTCTTAATACAAAACCTGACT ACCAGAAAATATTACAGAACC 
AG AGCAAAG T CTTTGACTGT ATGGAGTTGG T G ATGGATGAACT GCAAG GATCAAT GAAACAG 
CTGC>^3GCCTTTATGGATGAAAGTAGCTGCCCGGGCGAACGCCACCGCGGGGAGCT CGAN 
>279

N^l*CACATTGTAATATTATATGATGTATAGTTGTACGCAGCTCTGTGCATAACTGTGG 
TAACTTTGTGGG1 OGCI CCTGTGGGTCCTGAAACAATGCVU3TTCTCCCCGCX3TATGCGACAT 
CCGGCTGATGGTCATGGAGATCCGCAATGCTTATGCTGTGTTATATGACATC 
ACTTCGAGAAQCTCAAGAAGCCCAGGGGAGAAACAAAGGGAATGAT^ 
TCTCCCATTCTGTGATGAGTACTCn CTGCACT G I IUI I ICIT 1CTAATAAAA C 1 1 ' IC f ITTTC 
GAACCTATACTQTCTTCTGTAAATTCTTC^ 

COATGGCAOOQTTCTQACACCTCACCTGOCATAATATAAAOTO 1 1 1 1 J M J 1 1 ATACCCTTCC 
ACTTGGAAAGACTACAGAGGAATCTTGCACTGCATAGTTCAAACTA 

TACCT GAAAAGCAAGAGAAAACAAGAAG GGGTAAATTTTGAACCAAGGGAAATGATTTAAGA 
AGTGTCTGGTATTTTTCAAATTTCTGTCAGTTGTT ACATTTGTCATAAG TAAA TGTTT AGGAAT
AAAGGATGGAGACATGCTTATTTTATTTAACTCCCCCAAAAAAAAAAAAAAAAAAAAGTACCT 

GCCCGGGCGGCCGCCCCGGGCGGCCGCTCN 
>280 

ACGCGGGGGGGACGTAAGGTGGGGCGGTGAAAGAAGTTTGCTGACGAAGATGGC 
GACTGAGGCACAGAGTGAAGGGGAGGTGCCAGCCCGCGAATCCGGCCGGAGTGATGCCAT 

ctgcagttttgtgatctgcaatgattcttcccttcgaggtcagcccattatcttt AATCCT GA 
CI I ( 1 1 1 GTGG agaaactccgacatgagaaacctgagattttcact GAGTTGGTGGTCAGCA 

ATATCACAAGGCTCATCGATTTACCTGGAACTGAGTTGGCTCAGCTGATGGGGGAAGTGGA 
CCTTAAGTTGCCTGGCGGGG CTGGCCCAGCATCAG G ATTCTT CCGGT CT CT CATGTCTCTC 



TATACCAACTGATTGAGTATCTACACAAAAACTTGCGAGTAGAGGGTTTGTTTAGAGTACCG 
GGTAAT AGTGTCCGACAGCAGATTTT AAGGGATGCTCTCAAT AATGGAACT GACATTGACTT 
GGAATC^GGGGAATTTCACTCAAATGATGTTGCCACTrTGCTGAAGATGTTTCTAGGAGAGT 
TGCCGGAGCCTCTGCTGACACATAAACACTTCAATGCACACCTCAAAATCGCTGATTTGATG 
CAGTTTGATGAT AAAGGAAACAAGACCAAT AT ACCAG ACAAGG A CCGG CAAA TT GA G G CT CT 
CCAGTTGCTCTTCCTCATTCTC«rrCCTOCTAATCGTAATTTGCTGAAG 

CCTATACCAGACAGCAAAGAAACAAGACAAGAAGAAGATGTCAGCCT ATAACCTTGCCCTTA 
To J 1 1 tiGACCCCACGTCCTGTGGCCAAAAAATGTCACTGCAAATGACCrrTCAGGAGAATATC 
ACAAAGTTAAAGAGTGGGATGGCTTTTATGATTAAACACTCCCAGAAACTTTTT AAGGCTCCT 
GCTTATATrCGGGAGTGTGCGAGATTGCACTATTTGGGATCCAGAACTCAGGCATCAAAGGA 
TGACCTT GACCTCATAGCTTGATGTCATACTAAGTCCTTTCAGCTGGCAAAGTCTCAGAAACG 
GAAGCGGGTAGATTCCTGCCCTGACCAGGAGGAGACCCAGCAC^ATACGGAAGAGG cact 

gagagagctgtttcaacacgttcatgatatgccagagtcagcaaagaagaaacaactt ATT a 

GACAGTTTAATAAGCAATGATTGACCCAGACAGCAGGGCGAGAACCTTCTACTTCCCAGGTA 
CAAAAGAGGGCTCGTTCGCGCTCCTTCAGTGGGCTT ATTAAGCGGAAGGTCCTGGGAAATC 
AGATGATGTCAGAAAAGAAAAAGAAGAACCCTACTCCAGAATCTGTGGCCATTGGTGAATTG 

AAGGGAACCAG caaagaaaat aggaactt ATT attttct ggct ctccagctgtcacgat gac 
accaacaagattgaagtggtctgaagggaag aaag aggggaaaaaaggatttctct g AAGG 

ATCCAGAGTTGTCT CCT ATGGTC^TGCAGAATTTTCTGTTTAGTGGGCAGGTGTTATTCCTG 
CCCACAGCAAAGCTTGG^CTTGCAGCTTGCTTGCTGCATTT^ 

TACC<STGACCC<l^CTGAT^XrrCTAACCCCACTCACTGGATGATGTTT GCAAGCTGTGCCTT 

CTGAGAGAGTGCTT AGGCCCTGTCTCTUF I Ml I AATATTATGGGGAAAGCACTAACT ATCGA 

ACCAGCTTATACAG^ACACTAAGGTGGGCTTCAGTGCTCACT CAATGTGTTTAGGCAGATTC 

CACTTTTGAAAAAAAA TATGAA ATGTGTGCTCAACTGCCAGT AATTTTTT AAAAAGGACT GTCC 

CAGTGGATTGATGTTGTTTTTAATGGATATTTTGGGTTTTTCTCTGT^ 

TTTGG i mi i n I GTTTGTTTATTTCTTTGTTTTAAAAGCCATGTTTTTGGTTGGGCT 

AGAT ATCTTTCCCTC 1 1 1 1 1 CACTTT GAGCTTTGGGAAAACTCTTT ATCTT ATGAGGCT GTATT 

CCTCMTACCTAATTTGTGTCCAAAGAATTTATAGCTCTTCT GGACATTTTTTATTATTTCTTG 

GGTG T GAGATCAGAG TATTTGACCTGCAGTATTGAAAAAGQAGAATTCAGAATGATACAGTA 

T TTTAACAAATC T TAATT ATTAAAC rCTTTTOCTTOCTl CXJATTTCTCCCTCCCTTGTCCATCTC 

TC TCTCTC 1 1 1 CCCTTTCCTCAGTGATGTGAAAATAA tibial 1 1 1 G C I GAACTTGTT ATCTTC 

ATTCAAI rrccTci I gactaaaacatctctggtgccaacgtaatactt ctgaaccacatcact 

GTGACT CAAGGAAAGTCACTGACAGCATAAGAGAAGTTTGCT AAAATATTTGTATGTGGGGG 
AAGCTCTGGAGTGTGCCTAGGAGGGGGCTGGCTGCCTTTATGTCCCAGGATGACT cttt AT 
GGGTGGGATTACATTGCACCCTCTGAGGGTGCAGGCTAGACCGTCTCCTGAGAGGAAGTT^ 
GGATCAGAAAGAAGAAGCAAGCAGCAGCCTCTGCAGGGCT GACAGGATTTAAAGGAGAGAA 
TGTTCTT A TTTGGAAGCAGCT GTGGCTTGTGACCAATGTTCAAGGAGTGTTACTGTTCCGCC 
^ioi^i » t groAGAAQ QgACA CAGGTQGT MTTTGGAGATGQQGOCAGAGCTItnGGCTnT 
G^j^TTTGGTGTGTrCACTTGTGTTCGATAGAGCA^ 

GGTGTC1 GGCCAGAGAGCA^K^TGTAGCAGGGGGGAATGCTCAGGTTTTrrGCCTGGCTCT 
GTGGAGCTGT ACAACCCTTCTCACCCTGTGGGTTGGAGCCGAGTCAGGCCACTATG<5GGAA 
GCAGTTGCCOGACAAAATGTGG 1 1 1 UCI GACCTATTTCTAAACTGTTGAATATGCTGCAGCAT 
TGCT GAAATGAAA GA TGACT CTGQGGGAGCAGAGCTTGGCGTTGTGCXX^GCTGGC^GCCC I 
CCTCTGCCAGCCTTTCTGCTGCTTTrGGTGCTGTAACAGCAATAG 

TTGGTCTTCCAGCrrTAATGCAGTGTGAACAATAGATGGTTAGGAAAACAAAACTGCTTAGAA 
GCCCCI I T C I C I AGAGCAGTTTT ATGTCATTTGT AAAAACACATATT AGCAAATTCGTTCGCG
TAGGTTTCT ATTCAAT ATTTGAC 1 1 1 1 M 1 1 I CI I ATTAAGAAAATGAAATCCCTTACACCAGAT 
ATCAGTTAATTCAAACAGAAAACCCTTTGGGTATCANNNNN 
>281

NNWNW5GCGG«XSAGGTACGCGGGGGGykGA^ 
AACCT GT GTCTCTTCATCT CCCTG GGAAGGGTGCCCCCGAAGTGAAAG AGATG GCCTGGTG 
GAAAGCCT GG G AGAATGAAT AAACAG ACT AGGTTGAATCCAT ACAATGG AATGGTAGCAGAC 
AATAAAAAGAAAATGAACTATTGATGCCCCCTACTGCACAGCAGAAGCTCTGAATCGTGTTC 
CTGAATGAAAGAAGTCAGAGATGAAAAGATGGGCCAGGAGTCCAGTTTCTGGAAGGCCAAG 
AATCGAAGTAGC^AGCTGCAGCCGTTTTCCAGAC^ 

TCAGGACTGGAGGGGCAAACTCCGATGTGAGTGAGGCCCt^CTGCCAAATGGCGGCATGC 
TCAGATAGCACCCAAGAATTTGGGGAAAAAACTGGTGCTCACAGCN



cAAAAACTGGAGACTTTTAATGTATl T CI F T AATTTCGGACGCGTCCGATTTTGGAGT 
TGTTGAAATTGTCOCCGCAGTTTTCAATCATGTrCTGAACCA^ 

GCAGCTGGTCAAATTGCATATTCACTGCT GTACAGTATTGGAAATGGATCTGTCTTTGGTAAA 
GATCAGCCTATAATTCTTGTTGCTGTTGGATATCAC<XXX1ATGATGGGTGTCCTGGACGGTGT 
CCTAATGGAACTGGAAGACTGTGCCCTTCCCCTCCTGAAAGATGTCATtXSCAACA^ 
AAGACGTTGOCI I CAAAGACCTGGATGTGGCCATTCTTGTGGGCTCCATGCCAAGAAGGGA 
AGGCATGGAGAGAAAAGATTTACTGAAAGCAAATGTGAAAATCTT CAAATGCCAGGGTGCAG 
CCTT AGATAAAT ACGCCAAGAAGTCAGTTAAGGTTATTGTT GTGGGTAATCCAGCCAATACCA 
ACJ^CTGACTCCTTCCAAGTCAGCTCCATCCATCCCC^ 

CGTTT GGAT(^CAACCGAGCT AAAGCTCAAATTGCTCTT AAACTT GGTGTGACTGCT AATGAT 
GT AAAGAATGTCATTATCTGGGGAAACGATTOCTCGACTCAGT ATCCAGATGTCAACCATGC 
CAAGGTGAAATTGCAAGGAAAGGAAGTTGGTGTTT ATGAAGCT CT GAAAGATGACAGCTGGC 
TCAAGGGAGAATTTGTCACGACTGTGCAGCAGCGTGGCGCTGCTrGTT^TCAAGGCT 
ACT ATCCAGTGGCATGTCTGCTGCAAAAGCCATCTGTGACCACGTGAGGGACATCTGGT^ 
GAACCCCAGAG45GAGAGTTTGTGTCCATGGGTGTTATC^ 
OCTGATGATCTGCTCTACTCATTCXXTGTTGTAATCAAGAAT 

OGTC1 OCCT attaatgatttctcacgtgagaagatggatcttact ggaaaggaactgacaga 

AGAAAAAGAAAGTGCTrTrGAATTTCTTTGCTCTGCCTGACT AGACAATGATGTT ACTAAATG 
CTTCAAAGCTGAAGAATCTAAATCT 

TTAAATTACTTGTGAAAAAGAACACATTTT AAAGATT AOGTGCTTtrTTGGT ACAGGTTTGT GAA 
TGACAGTrrATCGTt^TGCTGTTAGTGT GCATTCT AAATAAAT ATATATTCAAATGAAAAAAAA 
AAAAAAAAAnar — 




HNfrWO GCGGCCGOGTCGAO'lGGCT^^ 
^C^M^^C^TCCTAMAAAMAA A AAMAAAAAAMAAAAAMGCMSCGGCl^C G I Q l ICGC 
GQCCQCGTOGACXCGGCTa^^ 

AACTCTGTGn^fifiGXSAAGAGGLiL I G I 1 1 CGCTGTCGGGTCTCTAGTTCTTGCAOGCTCTTT 
AAGAGTCTGCACTG<SAGGAACTCCTGCt>TTACC>GCT I TCI I GCACAAGGGAGGGGG 
AAACATACAirTATTCATGCCAGTCTXTTTGW 

mi roci 1 1 gcattgtaggagagggactaagtgctagagact atgtcgc i i icc i u agcta 

COGAGAOC6CTCOTC 
CTJGCCTGGGA|GGCCGCGAGAAACT^ 
' TCCTCTCT ACTTTAGCACGGTCTCCAflACTCAGCCGAGAGACAOC^ 
ACACACCGAGAGAGAGG^AGAGAGAGACT CTCCAGCCTGGGAACTAT AACTCCTCTGCGA 
^eeCGGA GAACr CCTrCC C C AAATCT Tr^^ 



TGGCCC GGGGAACCGG<iAGGGCCCX^X^ 
OGCGCGTGGGCGCCCXSCCGAGCCGAGGCCATGGTGCAGCAAACX^CAATGCCGAGA^

CGGAAGCGCT GCT GGCCGGCGAGAGCTCGGACTCGGGCGCC»GCCTCGAGCTGGGAATC 

GCCTCCTCCCCCACGCC<^CTCCACCGCCTCCACGGGC^^ 

CTGGTGCAAGACCOCCAGTGGGCACATCAAG^ACCCATG^ACGCC^ 

CAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCACAAC^CXXaAGATCTCC 

AAGCGGXrTGGGCAAACGCTGGAAGCTGCrrCAAAGACAGCGACAAGATCCCTTTC^ 

AGGCGG^GCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACT ACAAGTACCGGCCCA 

GGAAGAAGGTGAAGTCCGGCAACGCX^AACTCCAGCTCCTCGGCCGCCGCCT CCT CCAAGC 

CGGGGGAGAAGGGAGACAAGGTCGGTGGGAGTGGCGGGGGCGGCCATGGGGGCGGCGG 

CXWCGGCGGGAGGAGCAACGGGGGGGGAGGAGGCGG<^GTGCGAGTGGCGGCt^CGCC 

AACTCCAAACCGGCGCAGAAAAAGAGCTG(^GCTCCAAAGTGGCGGGCGGCGCGGGCGG 

TGGGGTTAGCAAACCGCACGCCAAGCTCATCCTGGCAGGCGGCGGCGGCX>GCGGGAAAG 



CCCCTGGGCGCOGCCGCCGACCACCACTCGCTGTACAAGGCGCGXaACTCCCAGCGCCT 
GCXTTCCGCCTCXrrC«GGAGCX:Ta5GCCT(X»CAG 

GGCCGAGAAGAAGGTGAA^GCGTCTACCTGTrTCGGCGGCCTGGGCACG^ 
CXTTGGGCGGCGTGGGCGCXKKiAGCCGACCCCAGCGACCCCCTGGGCXn"GT 

TCGCCCGCCXXCGGCCXZCTXXZCCCQCCGJK^^ 

CTCGCCCGCXXXX3TCCAGCGC<3«XTCCCACGCGT(XTC^ 

CTCTTCCTCCTCCTCG TOCTC CT(»TCCTCCGAC^ 

CT GAACCCCAGCT CAAACTTTGAGAGCATGTCCCTGGGCAGCTTCAGTTCGTCGT CGGCGC 
tcga*ocgggacctggattttaactt CGAGCCCGGCTCCGGCTCGCACTTCGAG TTCCCG GA 
CT ACTGCA CGOC CGAGGTGAGCGAGATGATCTCGGGAGACT GGCT CGAGTOCAGCATCTC 

caaoctggttttc1acctactgaagggcgcgcaggcagggagaagggccggggggggtag 
gagaggagaaaaaaaaagtgaaaaaaagaaawlaaaaggacagacgaagagtt^ 
aagggaaaaaagaaagaa aaag taagcagggctggcttcgxxcgcgttctcgtcct 
caag*^cgcggccgc<sttttggacccgc<^ccc^^ 

CCCACTCJGCCCAGC CGG^ 

GATT GTTGTT ATTGATGTTGTT G T7 GATGGCAAAAAAAAAAAAGCGACTTCGAG Tl 1 GCTCCC 
CTTTGCTTGAAGAGAC CCCC TCCCCCTTCCAA^ 

CA AGAAGQCGAGTT AG TTTTCTAGAGACTT GAAGGAGTCT CCOCCTTCCTGGATCACCACCT 
TGGTTTTGJTTTATTTTGCTTCTTGtrr 

CCCCTTTTTTT AAACGCGT GATGAAGACAGAAGGCTCCGGGGTGACGAATTTGGCCGATGG 
C AGAI UMM G GGGGAACC^^GGGACTGAGAGACTCCACGCAGGCGAATTGCCGTTTGGf^ 
GCTTTTTn ICCTCCCTCI 1 1 1 (AXXTITGCCCCCTCTGCAGCCGGAGGAGGAGATGTTGAGG 
GG^GGAGGCCAGCCAGTGTG^CCGGO^AGGAAATGA 
CAfiCAGCG6GAGCTAGGGGC<K»GGG<XX5AjGGAGGAC 

TCAAACTGAAATGGATTTGCACGTrTGGCGAGCTGGCGGCGGCGGCT GGTGGGCCTCCGCf: 
TTm 1 1 1 w i AC GTGAAATCAGTGAGGTGAGACTTCOCAGACCCCGGAGGCGTGGAGGAGAfl 
^^^^GTTTGATGTGGTACAGGGC^^GTCAGT GGAGGGCGAGTGGTTTOGAAAAAAAAAA 
AAGAAAAACCGAAAGGAATGAAAAAAAAAAGAI 1 1 1 1 1 let rCTCTTAATCGGAATOGTGATG 
GT GTTGGATT ATTTCAATGGTGGGGTT AATATAGCATGTT ATCCTGT CT ATCTnTTAAAGATTT 
CTGTATAAGACTG1 * GAGCAGTTTTTAAAAT AGTGT AGGATAATAT AAAAAGCAGATAGATGG 
CGCTATG1 1 1 GATTCCTACAACGAAATTATCACGAGC I M i ll I CATTCTT AACTCTTT AAAGG 
^TTCAAACGCAACTCAAATCTGTGCTGGACTTT AAAAAAACAATTCAfit^ ACGAAATTTTTTCTC 
AGTGTGTGTGTTT ATTOCTT ATAGGTGTAAATGAGAAGAUj I G 1 1 1 1 1 I ICCI ICAOCGATGC 
TCCATCCTCGTATTTCTTTTTCCrTGTAAATGTAATCAGATGC^ 

TA TACT GG CCAAACAT AI 1 1 1 1 I U I I r T GTCCCT n fTTTCTI IOCI MC I 1 1 1 1 AC 1 1 U C1 > I AT 
TTCTTTATTCCTTC CT n I t A^ II 1 1 H ICI M GN M H LI U I M M 1 1 1 1 M 1 1 IG GGAAGTTGT 
TGTTAOCCACOCX ^TTTTAO GTCTCCTTCACTGAAGGGCT AGAGTTTT AACTTTTAATTTTTT A 
* TATTTAAATGTAGA^ 1 1 1 liAACACTTTTAAAAAACAAAAAAAGACAAGAGAGATGAAOCOGTT * - 
GGAT TA Hi LLt UAGGGT A TT TTGGT AACACAAT AT AT AAGGGGGGT GT AATOGGGGT AACT - 
C^CTGTTOGATTOGGA^^ 

ATCGAGCCCCTAA ITTTCl ULAT GGTGT AACCCTTCATTCTGCCGGGTTXjrTCAAGGTT GAC 
AGATATCCCTT AACTGGCACGAAGGTTTTACCCTTCGGGCCCCG 1 1 1 1 G I M AAAGGCGCGG 
GGGACATTGAGGCAACAAAGGTTGCGTTTTAGGAACACCACAGGCTTTTCGGAAATATGGC
GCTGTCCAGAGGGTTAAATTGTGCX^TCAGCCAAAGGGGTGTGGCTTAATGTGAACCCACT 
TTTGGCCNNNNNNNNN 

>285 

CGACOCACGCGTCCGGCGACGCCAAACATGGCGTGTTCCTAGAAGCOGCTTTCGG 
CATCAGTAGGCGGCGGCGTGGGGTCTGGCAGCGTGGGGAGAGGGACCAACCGACGCCAC 
TTCGTG7T GGGAAGTGGGAGCGGGAGGGCCGGGCAATTCCC6ACCGAACCAAACGGTTTC 
CATGGATCTCAATAGTGCCAGCACTGTTGTTCTTCAGGTGTTAAOVCAGGC^ACCAGTCAGG 
ATACTGCTGTGTTAAAACCAGCTGAGGAGCAGTTGAAGCAGTGGGAGACACAGCCAGGTTT 
CTATTCAGT GTTG CTGAATATTTTCACCAACCACACTTTGGATATAAATGTAAGGTGGCTTGC 
TGTACTGTATTTTAAACATGGAATTGATCGCTACTGGAGACGTGTAGCACCTCATGCTCTCTC 
AG AGG AGGAGAAAACT ACTCTG CGTGCAGGGCTCATCACCAACTTCAATGAACCAAT AAACC 
AGATTGCAACTCAGATTGCAGTGCTCATTGCAAAAGTTGCT AGATTGGATTGTCCCAGACAG 

tggcctgaactaattcccactcttatagagtctgttaaagtccaggatgatcttcgacagca 
ca6agcatt actta ccttctatcatgttaccaagacact ggcatctaaacg acttgctgctga 
T AGAAAACT atttt atg attt AGCTTCTG gaattt at aattttg cctg ctctctgtg g AATCAC 

CAC ACAGACACATTCCTG CAAGAAGTTTCTTCTG GCAATG AAGCTGCAATTTTG AGTTCACTA 
GAACGAACACTGCTATCATTGAAAGTGCTGCGTAAGTTAACTGTTAATGGA77TGTGGAACCT 

cataagaatatggaggtgatgggttttttacatggaatatttgaacgtctaaaacagtttctg 

GAATGCA GTAGAAGTATAGGTACAGATAATGTGTGTAGAGATAGACTGGAAAAGACCATCAT 
TCTTTTTACT AAAGTGCTTTT GGACTnrCTTGGATCAGCATCCTTTTTCATTTACTCCTCTAATT 
CAG AGATCACT GGAATTTTCTGT AAGCT ATGTTTTT ACAGAAGTTGGTGAAGG CGTTACATTT 
GAACGATTC^TTGTCCAATGTATGAATCTTATTAAGATGATTGTCAAAAATTATGCTTATAAGC 
CATCCAAAAATTTTGAAGATAGCAGGCCTGAAACT CTTG AAGCCCATAAGATTAAGATGGCAT 
TCTTCACATATCCTACTTTGACAGAGATATGTAGAAGATTAGTCTCTCATTATTTC^TATTAAC 
TGAAGAAGAACTGACAATGTGGGAAGAAQACCCAGAAGGCTTTACAGTGGAAGWAAGAGGA 
GGAGATTCTTGGAAATATAGTTTGAGGCCATGCACTGAAGTATTATTTATAGATATATTCCAT 
GAATATAATCAGACT CTT ACTCCTGTACTTCTAGAAATGATGCAAACACTTCAAGGACCCACA 
AATGTGGAAGATATGAATGCACTGTTAATCAAAGATGCT GTGTATAATGCT GTTGGATT AGCT 
GCTTATGAGCT CTTTGACAGTGTTGATTTTG ATCAGTGGTTTAAAAACCAGCTTCTTCCAG AA 
TTACAAGTCATTCACAATAGGTATAAGCCATTGCGACGCAGGGTGATTTGGCTCATCGGTCA 
GTGGATTTCT GTGAAATTCAAGTCTGACTTAAGACCCATGCTTT ATGAAGCAATCTGTAACTT 
GCTTCAAGATGAAGATTTAGT GGTCCGT ATTGAAACAGCTACAACTTTGAAGTTAACTGTTG A 

tgattttgaatttagaacagatcagtttctaccgtatttggaaaccatgt^ 

cagttactgcagcaagttacagaatgtg^cacaaagatgcatgttttgcatgtcctttcttgt 

gtgatcgaaagagtcaacatgcagatac^ccatatgtgggatgtttggtacaatatttgcc 

CCTCCTTTGGAAGCAGAGTGAAGAACACAAT ATGTTGAGATGTGCT ATTTTGACAACACTTAT 
TCATCTTGTTCAGGGATTAGGAGCAGACAGCAAGAACCT gtaccctttcctgctcccagtta 
TTCAACTGAGTACAGATGrrrrCACAGCCTCCACATGTTTAT 

atatgtcaccacttcttgaactaagttcagaaaatctt agaacttgctttaagatgatcaatgg 
ttatatcttttt atcatca^^gaatttttacagacatacgcagtaggtctatgccagtccttt 
TGTGAACI 1 1 1 aaaggaaattactacagaaggtcaagttcaggtgctcaaggttgtggaaaat 
gcccttaaagtgaacccaatact aggtccacaaatgtttcaaccgattttaccctatgttttc 
aagggtattatagaaggggagaggtatcctgtagtgatgtccacgtatcttggagttatggg 
tggagttctact agaaaacactag i i i iii i iu ii cact acttaatgagatggcccataaattt 
aatc>ggagatggaccagcttttg^gaaatatgattgaaatgtgggttgatcgaatggacaa 
cattacccagcctgaaagaagaaaactttcagctttggctttgctct 
taatagtgttatccaagataaattctgtgggattataaacatttcagtagaaggcctgcatga 

TGTCATGACGGAAGATCCTGAAACAGGAACTTATAAAGACTGT ATGTTGATGTCTCATCTTGA 

ggaacgaaaaqtaacagaagatgaagaaccaccc^cagaacaa 
ctggccctgaaggaccctgttcatagagtgt cactggagcagttcatctacgagaagctcaa 
gggacagcaggagatgctag gagaagaaggtttccagtccct catggaaacagtg gat acg 
gagattgtcaoccagctacaggagtjtttgcaaggattct aagagcacatgacatgtggctg 

CCTG<XXTTTCAGAAACAA(«^QAQTAACCCAGCCTC 

GATGAAGAAATCACTTCATGAAAATAAGCAAAGACCACACATTTTTTACTACAAA^ 
TAATGTAATCGGATAATAAGATAAGCTATTCTCAAAGAT7GATTCATTTATGGAGGGGCTTCA

taaaagtcgtagttacttgagaaaggggtgttcgttagagaaaaatagttagtggctaattga 

GAGAGAACGGAAGTTGCGAAATGAAGGGCTCTTACGGAAGGGAAAGAGGGGAAAGGGTAA 

attagaaacaggggagaagtaataagaagtaagagaggagtgtannn 

>286 
>287 

nhlatgtggttacgacccactgtattgaggtgacgcgatccataggctgtggtgttt 
gttttcgctgatccacacaaacg ttggg gcactgtctattcatgtgttcaagctgaaggctc 
gttctc»gttgtca7tttacagtg7nrtttctac^gggt7acattacaggaatgttgtagcgac 

GTTTAGCCCGTGGAGTATCAACGTCTT GAG ACTCCGTGTG AG ACT CCCT GGTTTGTTCCACA 

acagtgtgtttttagattccgtacctttgattaaggaacacatcatgccgtgaagccaattta 
tattctgcaattccgtagtgcatgtaatgtattctgccgtctcgtagtgtgaagccatgcttg 
gcacatcgagttctttgatgtctggctgccttctgcgggccaactgtcttgtggaattcgtt 
gctcccagagatagcttgaactgcagatcccgcacagcattgcactgagctgtcgttgtatc 



GCTCTCCATTC! I CI ICT I GGCTTT ACAG GTTCCCAG GTCAAGAGCTT CACCCAT AATTAAGA 
CCTTCTGAGGATGATCGATAGATAAACAC ACCTCCTCTGAACCATCCTT GGGCTTCATGGGG 
TTGGCATTGAGGATCCCTACGACAGTCCCCTGCTCGGTCTTCCAGAGCGCTrTGTGAACTTC 

tccaa ataag aagaaggacagacattgtgtcaggtcacgaagatcattcagtttccatatgct 
GAAGGi i I r l cgactattcacactctgtggcgtaaccttcttcaatataaccccaaatgtcac 
ccaa tctatttcttccagcttctctctggccatcttttccttgatctgagacagtct gatcag 
ttitcggccggtcatgtgtcttcgttcatattctctggaggatactcgagcgcgcctcgagc 
cgc^gaccaggagaaggcttccacacagatggcgattgagtcgtttcctcacagaact^ 
actcggggtccaccacatatttgacctctagttatcccactaggtttgttccgagaaatcgt 
gtgta gggg ttgggagggtgcacttgtcatocttgaagatgaggttttgggatctggaggt 
gaagccttt ggtgttcgagccaccctcttggttctttggtagcgcagggacatcaagctccg 
cagaaaagcatgttgactcctgaattctctgaactct cct ctcctt aagaggtggccgggga 
gacttctcagg g gatttttgcagacgggctgggcttg ctgtctgtttaattgttgtta ct 1 1 1 
agctgctctrgtaaggcgttcatttgctcttgcaaattccttaattcct 

TCGTTTTCTCTCGCCTGGGGGCAGGAGCAGGGAGGACCCT ATTTTCAGTTGACTCTGATGC 
GGGAAC II C I I C I J CATCTGTTAAGTCXrrCCATATCTCCAAAGAGAGTGGCCAGATTTTCCTT 
TTCGTCTCTTGT CTCT CCTGTTTCTCCATCATCAGCCTCTTCTGT ATAAGATTCACCGTCGCC 
GTCGGGATCAAAGAGCTCATCAAATGCGTCGGGCTCGCCATrrrCCCGCGTCAAGAAGTTAT 
TTTCTTCTGAATTACAATCCAAGGCTGACTCA I I MC I I CCAGCAGTGCGGTCAGCAGAGACA 
GATTGTCTTCCTCCTCATCCATGCTGTCAAGAGGAGAGTTGTGACAGGAAACTTCGAAGGAG 
GCTCAGATGCCCAGATGAGCAATGTAGAATCTTGGCCGATGC^^GACGCCTTCTTCGTTC 

acagctgaacccgccaaaatcggacmn 



NAGTCGCGCGCGGTGCAGTCGGGAGGTGGAGGCAGCGGCTGCATTGTTTTCGGGA 
TCGAGGGkTTGAGGGCGCTATGGCACCCGGCTGCAAAACTGAGTTACGCAGCGTGACAAAT 
GGTCAGTCrAACCAAOGAAGTAATGAAGGTGATGCCATCAAAGTTrrTGTGXXSAATTCGTCC 
TCCTGCAGAAAG ATCTGGGTCAGCTGATGGAGAGCAGAACTT ATGCTT ATCTCTGCTGTCCT 
CCACGAGTCTCCGGCTGCACTCCAACOCTGAGCCCAAGACCTTCACGT^ 
GATGTGGMTACCACTGAGGAATCTGTATTTGCAACTGTGGCTAAAAGCATTGTGGAGTCTTG 
CATGAGCGGTTATAATGGTACCATCTTTGGATATGGACAGACTGGCTCAGGGAAGACATTTA 
CTATGATGGGACCATCTGAATCTGATAATTTTTCTCAT AACCT GAGAGGAGTAATCCCACGAA 
GTTTTGAATATTTGTTTTCCTT AATTGATGGTGAAAAAGAAAAGGCTQGAGCTGGAAAGAGTT 
TCCTTTGTAAGTGTTCCTTT ATTGAAATCT ACAACGAGCAGATATATGATCT ACTGGACTCT G 
CATCGGCTGGACTGTACTT AAGGGAGCATATCAAGAAGGGAGTU I U G I I GTTGGTGCGGT 
GGAGCAGGTGGT AACCT CAGCTGCTGAAGCCT ATCAGGTGCTGTCTGGAGGATGGAGGAAT 
AGACGTGTGGCATCAACATCAATGAACAGiAGAATCGTCTAGGTCTCATGCCGTCTTTACAAT . 
TACAAT AGAGTCAATGGAGAAAAGT AATGA6ATTGTGAAT AT ACGGACCTCCCT, ACTGAACCT 
GGJGGATTTAGCJQG^ 

CGTGGGT AATGGAAAACAGAGACATGTTTGCTACAGAGACTCCAAACTTACXnTCTTACTAC 
GGGATrCCCTTGGAGGTAATGCXIAAAACAGCCATAATTGCAAATGTT(^TCCTGGATOCAGG
TGTTTTGGGGAAACCCTATCAACACTTAACTTTGCTCAAAGAGCCAAGCTGATTAAAAACAAG 

ggagtagtaaatgaagacacccaaggaaatgtgagccagctccaagctgaagtgaagaggc 

tcaaagaagaactggcggagcrtgcttcaggac^gaciaccaccagaaagctrcctgac 

agacaaaaagaagactaactatatggagtattttoaggaagcaatgttattctttaagaaatc 

tgaacaggaaaagaagtctctgatagaaaaagttacccaattagaagacgtcaccctcaaaa 

aggaaaaatttattcaatctaataaaatgattgtgaaattccgagaggatcaaataatacg<rrt 

ggaaaagctccacaagx3aatcccggggaggttttctgcctgaggagcaggatcgtttgctc 

TC^GAATTAAGGAATGAGATTCAAACTCTGCGAGAACAAATAGAGCACXiACCCCAGAGTTGC 
AAAGTATGCTATGGAAAATCATTCCCTCAGGGAGGAGAATAGAAGACTGAGATTATTAGAGC 
CTGTGAAAAGAGCTCAAGAAATGGATG CCCAGACCATTGCAAAACTAGAAAAAGCTTTCTCT 
GAAATAAGTGGCATGGAGAAAAGTGACAAAAATCAGCAAGGATTTTCACCTAAAGCTCAGAA 
AGAGCCATG 1 1 1 G 1 1 1 GCAAACACTGAGAAGTTAAAAGCACAACTCCT GCAAATTCAGACAG 
AGCTGAATAATTCAAAGCAAGAATATGAAGAATTCAAAGAACTTACTAGGAAAAGGCAGCTA 
GAATTGGAATCAGAGCTTCAGTCTTTGCAAAAAGCGAACCTT aatcttgaaaaccttttggaa 
gcaacaaaagoctgcaagcggcaagaagtttctcagctgaataaaattcatgctgaaac^ct 
taagattataactacaccaaccaaggcctaccaacttcat^^ 

CCCTGAAATGGGAAGCTTTGGCTCrrCTATAC^CTCAGAATTCTAGCATATTAGATAATGATAT 

attaaatgagocagttcxrrcctgagatgaatgaacaagcttttgaggccatttcrr 
ttagaagagtgcaggaacaaatgagtgctcttcaagc<^aactggatgaagaagagcataaa 
aacctaaagcttcagcag CATGTTGACAAACTGG AACATC^TTCTACCCAAATGCAGGAG ct 
tttctcatcagaaag^ttgattggaccaaac^caggaagagcttctct 
ccttgaaaagcagcttcaagag actcaaact aaaaatgactttttgaaaagtgaggtacatga 

CCTGCGAGT AGTCCTTCATTCTGCTGACAAGGAG C 1 1 I C I I CAGTGAAATTGGAATATAGTTC 
ATTCAAAACGAATCAGGAGAAAGAATTCAACAAACTTTCrGAAAGACACATGCATGTAGAGCT 
TCAATTAGATAATCTCAGGTTAGAAAACGAAAAGCTGCTrGAGAGCAAAGCCTGOCTAGAGG 
ATTCCT ATGACAACTTAGAAGAAATAATGAAATTTGAGATTGACCAACTTTCAAGAAACCTCC 
^^GTTCAAAAAAGAAAATGAAACTCTGAAATCTGATCTG AAT AATTTGATG GAG CTTCTTG 
^GGCAGAAAAAGAACGCAATAACAAATTATCATTA<^GTTTGAAGAAGATAAAGAAAACAGTT 
CTAAAGAAATCTT AAAAGrrTCTTGAGGCTGTACGTCAGGAGAAACAGAAAGAGACGGCCAAG 
TGTGAGCAGCAGATGGCAAAAGTAGAGAAACT AGAAGAGAGCTTGCTTGCTACTGAAAAAGT 
G ATCAGTTCCCTGGAAAAGTCTAGAGATTCT GATAAGAAAG TTGTAGCT GACCTCATGAACC 

agatccaggagctaagaacatcggtctgtgagaaaacagaaactatagacaocctg^ 

GAACTGAAGGACAT AAATTGCAAAT ACAACTTJTGCTTT G GTTGACAGAGAAGAGAGCAGAGT 
GTTGATCAAGAAGCAGGAAGTGGATATTCTGGATCTGAAAGAAACCCTTAGGCTGAGAATAC 
TTTCT GAGGACATAGAGAGGGATATGCTCTGTGAGGACCTGGCTCATGCCACTGAGCAG CT 
GAACATGCTCACAGAGGCCTCAAAAAAACACTCGGGGCTGCTGCAGTCTGCCCAGGAAGAA 
CT GACCAAGAAGGAAGCCCTGATTCJW^GAACTTCAGCAC^^ 

AAGTAGAACAGAAGAAGAATGAATATAACTTCAAAATGAGGCAACTAGAACATGTGATGGATT 
CTGCTGCTGAGGATCCCC^GAGTCCTAAGA(iACCACCTCACTTT CAAACACATTTGGCAAAA 
GTCCTGGAAACACAAGAACAAGAGATAGAAGATGGAAGAGCCTCTAAGA C MCI I I GGAACA 
CCTTGT AACAAAGCT AAATGAAGACAGAGAAGTCAAAAATGCTGAAATCCTCAGAATGAAGG 
AGCAGTTGCGTGAAATGGAAAACCTACGCCTGGAAAGTCAGCAGTTAATAGAGAAA^ 
CTCCTGCAAGGTCAGCTGGATGATATTAAAAGACAAAAGGAAAACAGTGATCAGAATCATCC 
AGATAATCAACAGCTGAAGAATGAACAAGAAGAAAGTATCAAAG^ 

AAAT AGTTGAAGAAATGCTGAAAATGAAAG CAGACCTAGAAG AAGTCCAAAGTGCCCTTTAC 

AACAAAGAGATGGAATGCCTTAGAATGACTGATGAAGTGGAACGAACCCAAACT 

T AAAGCATTCCAGGAAAAAGAACAACTGAGATCAAAGCT GGAAGAAATGT ATGAAGAAAGAG 

AGAGAACATCCCAGGAGATGGAAATGTTAAGGAAGCAGGTGGAGTGTCTTGCTGAGGAAAA 

AGGAAAATGTCAGGCTTGCTGAG<iAGACAGAAAAGTTGCGTGC^ 
GAAAAGAAAAGAAGTGAATCTTGAGGATTCXXiGTC^ 

GAIUI I ICI ICICI 1 1 1 ACAAGTAAGACCTACTCCTGGCCACTTAGGAGAGCTGAATTTATGG
ACCTrAATTATTAAATGTTTATAAGGT GGTGGTAACCAGCTCAAGTTTCTGATGAAGATTCTG
CATCX^TATACACCCTGTG ACAGT CAGCAGTCTGCTrATTAAGTGGCCT ACTTCAAGGCTTTG 
AATCAACTT AAGGGAAAAC^I 1 1 1 GTCTTTGTAAAAAT AAAAGCCT GTAGCT AAGGTTTACAG 



TGGACATTAGCCAGATCATTTTCTTCTnTAGATTATG(X^ATAATCTCCTTTGATTCTTATGGAAG 

TTCTAACAATATATGGTGGTTCCAACACCTGC^GTGAGTTTAATGACTGACTTAGTAGCAGGT 

ACAAGAAGCAAACTTGTTAATATAG ATTATTTTTGTATTCTT ACTTT AGGTATTTTCTTGAGCAT 

TTTCCATG A CTGT AAAT AAAGCCATTTTTT AAG AT AAAAAAAAAAAAAAAAAAAAAAAGGG CG 

GCCGCN 

>290 

>291 

NNACACATTGTAATATTATATCATGTATAGTTGTACGCAGCTCTGTGCATAACTGTGG 

taactttgtgggtcgctcctgtgggtcctgaaacaatgcagttctccccx3cgtatgcgacat 
ccggctgatggtcatggagatccgcaatgcttatgctgtgttatatgacatcatcctgaaga 
acttcgagaagctcaagaagcccaggggagaaacaaagggaatgatctattgagagccctc 
tctcccattctgtgatgagtactcttctgcactgtt c i m c i i ICTAATAAAA C I I I CI 1 1 iic 

GAACCTATACTGTCTTCTGTAAATTCTTCTTACTACCCT^^ 

CGATGCCAGGGTTCTGACACCTCACCTGGCATAATATAAAGTG rTTTTTm I ATACCCTTCC 

ACTTGGAAAG ACT ACAG AGG AATCTTGCACTGCATAGTTCAAACT AAAAAG AGAAG AGTTAAT 

TACCT GAAAAGCAAGAGAAAACAAGAAGGGGTAAATTTTG AACCAAGGGAAATCATTTAAGA 

AGTGTCTGGTATTTTTCAAATTTCTGTCAGTTGTTACATTTGTCATAAGTAAATGTTTAGGAAT 

AAAGGATGGAGAC^TGCTTATTTTATTTAACTCCCCCAAAAAAAAAAAAAAAAAAAAGTACCT 

GCCCGGGCGGCC^CXXGGGGCGGCCGCTCN 

>292 

NNNNNNNNNAAAAAGGGGGGGGAAAAAAACCCOCTAAAGGGGGAAATGGGGGTCC 

CCCCCGGGGGGGGCCCCCCN I 1 1 » I HUM NNNTNGGATATGTACCCAGAAA 

TGGGATTGCTGAGTCAGATGGTAGTTCTATGTTTAAATTTTTGAGGAAC^ 
CCATAATGGAGCACTGTATTCTTGATGrTACATWCAAAGAGCTGGCTTCCAGGTAC^ 
AAGACATCTCTCGTC TCTCT ACAATCCTACAGTTGGTTGAATCCACTCATATGGAACTCATGG 
ATACAGAAGGCCGCCTTTTGGTGACTTGGTGCTCCTTGGAGTCACTGGAGTTCT ACTTTGAA 
TCCCACTCTGAGATCAATCGACTGCCTTAATTCCTGGTCX^ 

CCCGCTCTTTTCCTCAGGTCGAAGG 1 1 ICC II I AAGATCACGCT GACGTCGGACCCACG GCT 

GCCGTACAAAGTGTGAGTAGCTCGGCCGAGATGGGCCTTTTGGGGCCGGACAAGACGGGG 

CTGGGTTGGGGATGATCCGAGCCTTTCCAACAACTACCCXIACGCAGTCTTCATCTCTTC^ 

TCATCT ACrrTCCTGGCTCGCGCCCTTCCAGGAGCCTTTCCCACCGG^CCTGCGAG 

GGTCCGTACT GCTCGCT AGTGTCAGCCGGACNN 

>293 

NNNN^NGTGAGGNNmACTTGACGTGGGGAGTGTCrrGGAGACG^TCTCAGTTNN 
CAGCACAG AGACAATAAACCTCTTATTCGT ATGAACACCCWGCCCCAGCAATTGGATTGG GC 
AGCOCG CCTTGACACACCACTGTGCTG AGTGCTT GAGGACGTGTTTCAACAG ATGGTTGGG 
GTTAGTGTGTGCGAGCACATTCGAGTGGGGATrAAGAGAAGGAAGGCTGarTTGCTGGAGC 
TGTGTGGTCT TCTC CAAGTGAGAGTCGCAGGCAATAGAACT ACTTTGCTTTTGGAGGAAAAG 
gaggaattcattttcagcagacacaagaaaagcag ll l l ll l ll CAGGTGCTGACGGCCACC 
CACCATCATCT AAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATT 
GCAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGAT^ 
A TTATT CAACCAGGATACCT AATTCAAGAACT CCAGAAATCAGGAGACGGAGACATTTTGTCA 
GTTTrGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGT 
GGGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGG 
ATACAGCAGGAACGAAAAAACATCCGACCCAACANNNNNN 
>294 

TGGCGGCCGCCCGGGCAGGTACGCGGGAGGCACATTCTTTTCT acgtgaagagtt 
TTGTAAACTGAACTTT GTTTTCAGTTCCGGCTCC^GCCATCCTCGGGTAGCTTG<X^ATAGAT 
GAATGCCACTCGTTTGACCCATGACG C rOCTTCTTTI CATTTCTCCCTCTTTCCCCACAGCAG 
TGCAT GTCCA CCATAGCACCTGAGAGTCTGTGGAATCTAATTTTCTGTT ATACTTCTTTCCTTA 
CACTCATTTTCCTGTCTTTATTATGATAGTCTAAC 1 1 1 1 IC I C C I CAAAGGTATAGCTGCCTTG 
<^<>TGAAAACACA^^ 

AGATGI I 1 1 1 ATAACATGAAATATTCTACTGCGTTAAGATCAAAATGCT GACTATACTTGTTCC 
gtataccttcagaccactgttaatgt AATATTTTGGCAAGGTGAATGGTCTTTTTTGG^^A 
AAATCTCAGCAT AAGCCAGG CATGGTG GT ACACAGCTGT ATTCTCAGCTATTT AAGAGGATT
^CTnaAGTCCAXaGAAATCAAGAOCAGOCTGGGTAACATAGTGACACCCAGTCTCCANNNN 

NCGCCTGTG GGAGGACGTCCG GG TG G GCGGAACTCCT AGCG GACACCT CGTGGA 
GTCCGGCCGGAAGAGCAACCGAGATGAAGGTGAAGATGCTGAGCCGGAATCGGGACAATT 

<Xi ii i GAGGTCCCACGAG AATAT ATAAGAGCTTTAAATGCT ACCAAACTGGAACGAGTATTT 

ggaaaaccattccttgcttcgctggatggtcaccgtgatggagtcaattgcttggcaaagca 
tccagagaagctggctactgtcctttctggggcgtgtgatggagaggttagaatttggaat 
ctaact cagcg gaattgtat ccgtaca atacaagcac^tgaaggctttgtacgaggaatatg 

TACTCGCTTTT GTGGGACTT CTTTTTT CACTGTTGGTGATGACAAAACTGTGAAGCAGTGG AA 
AATGGATGGGCCAGGCTATGGAGACGAGGAAGAGCCATTACATACAATATTAGGAAAGACA 
GTGTATACTGGGATTGATCATCACTGGAAAGAAGCTGTTTTTGCCACATGTGGACAGCAAGT 
AGACATTTGGGATGAACAAAGAACTAATCCTATATGTTCAATGACCTGGGGATTTGACAGTAT 
AAGTAGTGTTAAATTTAACCCAATTGAGACATTTCTCTTGGGAAGTTGTGCATCTGAGAGGAA 
TATAGTACTGTAOGaTATGaGGCAAGCTACTCCTTTGAAAAAGGTTATCTT AGATATGAGAAC 
AAATACAATCTGTTGGAACCCT ATGGAAGCTTTCATTTTTACAGCAGCAAATGAAGATTATAA 
CTTATAT AGTTT TGATATGCGTGCACTGGACACTCCTGTAATGGTCCATATGGATCATGTATC 
TGCAGTGCTTGATGTGGATTACT CTCCX^CTGGGAAGGAGTTTGTGTCTGCTAGTTTCGATA 
AATCT ATTCGAATCTTTCCTGTAGACAAAAGTCGAAGGAGGGAGGTATATCATACAAAGAGAA 
TGCAACATGTTATCTGTGT AAAATGGACTTCTGA CAG CAAGT AT ATTATGTGTGGATCTGATG 

aaatgaacattcgcctgtggaaagctaatgcttctgaaaaattgggtgtgcttacatca 
gaaaaagcagccaaggattataaccagaaattgaaggagaaatttcagcattatcctcatata 
aaacgt atagctogtcatcgacatctaccaaaatctatctatagccagattcaggaacagog 
^^catgaaagaagctcgtcgacgaaaggaagtgaatcgtattaaacacagcaagcctggat 
ctgtcccacttgtgtcagagaagaagaaacacgtagtggcagttgtaaaataattggtattc 
ct aacaatcctgatgtataatt atttg ttacttttgatttgagaactctacaaataaaagtgct 
gggact agattaattgcaaa^ttttagttatatgtgtagagctttattgttac 1 CC 1 1 I IAGC 

TACCCTGAAAAATGATCCTTAAAGGTGGCCTAGTTGGTAAGACT GTTTTATGCTTAATCTGCA 
TTC TTCT TT CATTG T AGAATACAGTATTTG CAACTCA III 1 1 ! CT T GTTTTT ATTACAGATATAC 
TTACTTTCI Crr TGAT CT ATTATTGT AGACACTATACATTCAAATTGACATTTAAGAGCAAACAT 
CTCTT ATGTTATCTTT AATATTACTTTGAAT AATG ATTGCAATGATGTTTCTTCCTGTGATTCCA 
CATAACATTTAGAATAATGATGTCAATTTTTTACAACTrGAATTTATTTCT AGTGCTTTACTTATA 
TTTGGoi 1 1 1 iGACTCTTTT AAAAGAATCAGCCTGCATTTATATAACTTTT AT AAATAAT AATAT 
AATTTGGGTCAAGTTAAGATATTAAAAGTTCCTTTCAGCA^ 



GCGGCCGCCCGG gcaggt ACGCGGGGCTCCCTTGTGAG T AG actatgcaaagaaa 
AAGTGGGCCAOCATATCTGGAAACT ACAGTCT ATGCTTTGAAG OGCAAAAGGGAATAAACAT 
TTAAAQACTCXXXXXiGGGACCTGQAGGATGGACTTTTCG^ 

ACAATGAAAAATCAGAGACTGGTGCTCTTGGAGAAAACTATAGTTGGCAAATTCCCA 
^^^^^^^^^T^^^M^^yMyGMjOGTrCAG^ GTGTGAAGTCCTCCAGAATAAGT 
TTGGCTGTATCTCT ACCCTGGTCTCTCCAGTTGAGGAAGGCAACAGCAAATGTCTGCCAGTG 
TTAACAAA^TGCTGACTCCTAGGATAGAGTTATCAGTGT^ 

TGCTGTI gatgctgtggtgaatgcagccaatgaagatcttctgcatgggggaggcctggcc 

CTGGOCCTGGTAAAAGCTGGTX^TTTGAAATCCAAGA 

ATATGGTAAAGTGTCAGCTGGTGAGATAGCTGTCACGGGAGCAGGGAGGCTTCCCTGCAAA 
CAGATCATDCATOCTGTT GGGCCTCGGTGGATGGAATGGGATAAACAGGGATGN 
>297

CTGCCGGCCGCCCGGGCAGGTACGCGGGC^

GCGGGAATTGAAACCGCCGCTGAAGCCAACAAGAATTTGAGAACT GTAAATACCAAGCCTTG 
AAAGGGACCATGGTGC«GCCTGTCA^ 

• CTGTGACAGTGATGATGA \ TTTGU I C I GCAACTGT AOCTTT AAACAAGAAATCCAGAACAGC 
ACCAAAGGAGTrAAAACAAGATAAACCAAAACCT AACTTGAACAATCTCOGGAAAGAAGAAAT 
CCCAGTACAAGAGAAAACCCCTAAAAAAAGGATGGCTTTAGATGAGAAX5CTCTA 
ACTT AGAAGTTGCACTAGCTTTATCAGTGAAGGAACTTCCAACAGTCACCACTAATGTGCAG 
aactctcaagataaaagcattgaaaaacatggcagtagtaaaatagaaacaatgaataagtct
cctcatatctctaattgcagtgtagccagtgattatttagatttggataagattactgtggaa 

GATGATGTTGGTGGTGTTCAAGGGAAAAGAAAAGOW^TCTAAAGCTGCAGCAa^GCAGA 
GGAAGATTC7TCTGGAAG<UiAGTGATGGTGATAGTGCTAATGACACTGXACCAGACTTTGCA 
CCTGGTGAAGATTCTGAGGATGATTCTGATTTTTGTGAGAGTGAGGATAATGAC«AAGACTT 
CTCTATGAGAAAAAGTAAAGTTAAAGAAATTAAAAAGAAAGAAGTGAAGGTAAAATCCCCAGT 
AGAAAAGAAAGAGAAGAAATCTAAATCCAAATGTAATGCTTTGGTGACTTCGGTGGAGTCTG 
C^^AG^^^C^^^CAAATCAGAATCTCAGTCCTrGCCAAAAAAGGTTTCTCT GTCTTCAGATA 

OCACCAGCGGCATCTGGAGGTAGCAGAAGTAGCAGCAGCCCACTGGTGGTAGTGTCTGTG 
AAGTCTCCCAATCAGaGTCTCCGCCTTGGCTTGTCCAGATTAGCACGAGTTAAACCTTTGCA 
TCCAAATGCCACTAGCACCTGAGTGTGGTACAGGAGGAATGTTTGGTTGGGAGAATCACAG 

ctttacaagggtgtttatatttgatttgtgtttatatttgaggcaggtattgtaatataaagga 
atccattaccatgtcctataaatgacctctagccattttatgattatgttctctgtaaaactct 

TCAAGACTTCAATGAGAAGTTTGTTTATAAGAATT ATCTT CT CAT ACCTTTCCTTGTGAAGAGC 
GTATTCT Gi I 1 1 1 CTA TCAGT TCGACATGAAGTCCACATCACATGCTGTTCTrrrCTAGTTACA 
TGATGTGCCTTTCT AGCTTTGTCTAGTTTATAGCACCTTAACTTTAACTGTTCAGTTTTATCTG 
GCAGAGGAAAACATTCTTATTTCTTTCAGAAGACATTTCTGAAATCTTATAAGCTA^ 
ACGTTG TCAGT TTTATCGCAAAGATGTTTTGTATTTTAGCGAAATCTTrTT ATAGTACAAACTT 
AGAAT7 ATTTTACACACTAAAATGGTTGCAGTTTTATGGCATATGTCTCCGATTTAG^ A 
TTCTCTAGAAAATAGTATTTAAAGACATTTT ATGAAATCTT CATTGTCAAAACCTTTAATAAAAG
TGGAAATATTTGAAATGCCC I 1 1 1 ICI I GATACCACTCATCCaCGTGTTCCT GATTTGTCCAC 
ATTTCATG ATAAAA TGAGAG CTCTG CAGAGAATGTTAGCCTTTCTGTGTAAATGTAATCTTCA 
AGTAGTCAC 1 1 UGI I AAGTTCTTTAGAAAGTAGTTGTCAAGTACTT AGTCATCCCTATTATGA 
T ATGAG ATAGTACAGI 1 \ ICAGGAAGCTTAGATCrTGAATTATTGTGAAAAACAATGGTATGAA 
TATTTTATATTTACATGAGAATTAATAGTTCTGATCAATTTTATAAAAATTTTCAATTCATG 
GTGTTAAAATGTTATAACTCAGTTrCTGGTTATGACTATCTTrrTAGGGA 
^SI^pGCTCTTAAAAAAAAAAAAAAA 

GCCGAGAAAGGACGGCGGAACGCATCAGGGTCACAAAGCGGCGTOGAACAGGiGGGTCGT 

TAGGAGCAGCCN 

>298

WW NCGCGGCCGCG TNCGACAGAAGACGAGAGAAG GGT ACGGCT GCGAGAAGAC 
G^CAGAAGGGTACGGCTCGAGGTAGGCGGGGAGGOGCGGCXUSCCGGAAGATGGOGGCG 
GCCGCAGAGTT GTCGCT ACT GGAGAAGTCCCTGGGACTGAGTAAGGGGAATAAAT AGAGTG 
CTCAGGGCGAGCGACAGATTGGAGTTCTTC^GACAAACAATGGTCCAAGTCT AACAGGATTG 
ACT ACT ATAGCAGCTC^TCTAGTCAAGCAAGCCAACAAAGAATATTTGCTG^GGAGTACTGC 
AGAAGAAAAAGCAATCGTTCAG^CAGTG^TTAGAATACAGGGTC^CTCAAGTAGATGGGCACT 
CCAGT AAAAATGACATOCACACACTGTTGAAGGATCTT AATTCATATCTTGAAGATAAAGTCT 
ACCTTACAGGGTATAACTTT ACATTAGCAGATAT ACTATTGTACT ATGGACTTCATCGCTTTAT 
AGTTGACCTGAGAGTTCAAGAAAAGGAGAAATATCTTAATGTGTCTOGCTG G 1 1 1 I UIC ACAT 
JCAGCATTATCCAGGCATCAGGCAACATCTGTCT AGTGTTGTCTTCATCAAGAACAGACT ATA 
TACTAATTCCCACT AGAAGCTGTCCATGCCATACAGAAGATCTATTAAAAATGTTTTAAATGG 
AAAATGTA CTCT AG ACCACACGACTAATGTAAATTAATATACAGTCATTCATTATTTGTTGAAQ 
TTGATAGAA i nn G AAGTGTAAAC I rSTBI CTGAATGTTTTATTTGTTCTTTAGCTGAAGTTTT 
G CAAi i 1 1 1 ATGTCAAAATTCAATTGCTATTAAACAAGTTGAGATCCAGTTATAAATTAACCTT 
GTTTI I AGTAGATGACATTTATTTCAATAAAAGTroCAAATCGGGCTT AATCTT AAAAAAAAAA 
*^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAAAAAAAAAAAAGAAAAAAAAAAA 
A AGGG GGGGGQGGGT AAAAAATTTCCCCGGGGGGGGCCAAAGTTAATGAXX5TGACACCCG 
GTTTTGTGGTGT AAAAGGGGOGCCAAAATAGAGGGGGAGTAATATAAGAGAGAGAGCAGAC 
GGGGCGG I g 1 1 ATAAACAGTTGTTGTGTG GTGGAGAAACTTGTGTGTGQGAT ATTATTGGQA 
AAAGACCCCCI IC ITG1 GGGGT GAATTGTGGACACACCTAGAGAGAATTTAAGCCAGGAAAT 
TAACTTAGGGTGTATGGTGGTAACGCGATATTGTGGAAAATTTGTCAGATTGTTGAAATAKN 
>299 
>300

NNACTATAGTtaAATTTG^aCCCTCXiAGGCCAAGAATTGGGCACG 
GACTTCAGAATTG CAT ACQ CCATCTGTTTTATTG ACCCAACACAGTTTT AAAT ATTTTCATCCC 
tattttgtccacgtgttgagatcattgctacaatgaaaaagaagggtgagaagagatgtctg

AATCCAGAATCOAAGGOCATCAA GAAT TTACTGAAAGCAGTTAGCAAGGAAAGGTAGGTTTG 
CTGTTGCCTGCAGAAGAATTGCTCTTTAGGAAACGGCAATCTTGGGAGTCAGA^ 
ATTGTGG 1 1 I UC I G 1 GCAATCGCTGGTTTAAAAGTATGTT ACCACCACGCCCTCCCCT ACCT 
CCATTTATTTAAATGCTGAGGCACCATCTTGTGTGATAAGTATCAGAAGTT ACCCTGATTACC 
AGTCAACCTTGAAGTACAGTTATAACGATCTAAGCAAAACTGACAACATTTTTCCCAAGTCT 
>301 

. AOGOGGGGGGGaGGTAAGGTGGGGGGGTGAAAGAAG HI G O I GACGAAGATGGC 

CTGCAGTTTTGTGATCTGCAATGATTCTTCCXjrTCGAGGTCAGCCCAT^ 

CI II II I GTGGAGAAACTCCG ACATGAGAAACCTGAGATTTTCACT GAG TTG GTGGTCAGCA 

ATATCACAAGGCTCATCGATTTAOCTGGAACTGAGTTGG CT CAGCTGATGG GGGAAGTGGA 

CCTT AAGTTGCCTGG(X5GGGCTGGCCCAGCATCAGGATTCrTCCGGTCTC^ 

AAGCOAAAGGAAAAAGGAGTGATATTTGGGTCCCCACTGAC»GAGGA^ 

TATACCAACTGATTGAGTATCTACAGAAAAACTTGCGAGTAGAGGGTTTGTTTAGAGTACCG 

GGTAATAGTGTCCGACAGCAGATTTTAAGGGATGCTCTCAATAATGGAACrc 

GGAATCAGGGGAATrrCACTCAAATGATGTTGCCACTTTGCTGAAGATGTTT CTAGGAGAGT 

TGCOGGAGCCTCTGCTGACACATAAACACTTCAATGCACACCT CAAAAT CGCTGATTTGATG 

CAGTTTGATGATAAAGGAAACAAGACCAATATACCAGACAAGGACCGGCAAATTGAGG ctct 

CCAGTTGCTCTTCCTCATTCTCCCTCCTCCTAATCGTAATTTGCTGA 

OCT ATAC CIAGACAGCAAAGAAACAAGACAAGAACAAGATGTCAGCCTATAAOCT^ 

IGTI FGCACOOCACGTOCTGTO GCCAA AAAATGTC 

ACAAAGTT AAACAGTGGGATGGCTTTT ATGATTAAACACTCCCAGAAACTTTTT AAGGCTCCT 
GCTTATATTCGGGAGTGTGOGAGATTGCACTATTTGGGATCCAGAACTCAGGCATCAAAGGA 
TGACCTrGACCTCATAGCTTCATGTCATACTAAGTCCTTTCAGCT 



GAGAGAGCTGTTTCAACACGTTCATGAT ATGCCAGAGTCAGCAAAGAAGAAACAACTT ATT A 

gacagttt aataagcaatcattgacccagacaccagggcx5agaaccttct 
caaaagag^igctcgttcgcgctcxntcagtgggctt attaagcggaaggtcctgggaaatc 
agatgat gtcagaaaagaaaaagaagaaccct actccagaat ct gtggccat7 g gt gaattg 
aagggaaccagcaaagaaaataggaactt attattttctggctctccagctgtcacgatgac 

AOCAACAAGATTGAAGTGGTCTGAAGGGvAAGAAAGAGGGGAAAAAAGGAI I nCTCTGAAGG 
ATCCAGA QTTGTC rCC r A TGGTOCATGCAGA Al 1 1 I C I UI I I A GTGGGCAGGTGTTATTCCTG 

cccacagcaaagcttggacttgcagcttgcttgctgcattttga^ 
taccgtgacccgactgatacctctaacc ccactc actg 

ctgagagagtgcttaggccctgtctc I C 1 1 1 1 1 1 AATATTATGGGGAAAOCACTAACTATCCA 
ACCAGCTT ATACAGCACACT AAGGTGGGCTTCAGTGCTC1ACTCAATGTGTTTAGGCAGATTC 
CACTTTTGAAAAAAAATATGAAATGTGTGCTCAACTGCCAGTAATT^^ 
CAGTGGATTGATG I IGI 1 1 1 1 AATGGATATTTTGGG I HI IHUG1 1 1 1 GATAGTATTGGGTA 
II IGG HGI 1 H IGI | IGI 1 TATTTCTTTGTTTTAAAAGOCATGTTTTT GGTTGGGCT CT AAGCT 
AGATAICI | roOC TCI 1 1 rrCAC TrroAGCTirGGGAAAACTCTTTATCT^ 
CCTCAATACCTAATTTtrTGTCCAAAGAATT^ 

GGTG TGACA TCAG AGTATTTGACCTGCAGT ATTGAAAAAGGAGAATTCAGAATGATACAGTA 
TTTTAACAAATCTTAATTATTAAACTCt 1 1 FOCI I C CTTCCATTTCTCOCI COC I I G I C C ATCTC 
TCTCT CIC1 1 IUX'1 I rOCTCAGTGATGTGAAAATAAl HGTG1 1 1 1 GCTGAACTTGTT ATCTTC 
ATTCAA 1 1 IUCHT 1 G ACTAAAACA1 CT C 1 G G I GCCAACGTAATACTTCTGAAOCACATCACT 
GTGACTCAACGAAAGTCACTGACAGCATAAGAGAAGTTTGCT AAAATATTTGT AT6TGGGGG 
AAGC1 CI GGAGTGTGCCT AGGAGGGGGCTGGCTGOCTTT ATGTOOCAGGATGACTCTTTAT 
GGGTGGX^TTACATTGCACCCTCTGAGGGTGCAGGCTAGACCGTCT CCT GAGAGGAAGTTA 
GGATCAQAAAOAAO AAGCAA G C^ G CA G CCrCTGCAQGGCTGACAGGATTTAAAGGAQAGAA 
TCTTCTTATTTGGAAGCAGCTGTGGCTTGT 
uiuiui i rGTCAGAACGGACACAGGTGGTAATTTG^ 
• GGATJTGGTGTG 1 1 CMC 1 1 G I G M GGAT AGAGCAGT GGCATGGCTTTGACCTAGTATGAACT 
GGTGTCTGCOCAGAGAGCAQCATGTAGCAGGGGG^AATGCTCAG G 1 1 1 U Hj CCTGGCTCT 
GTGGAGCTGTACAACU;! I C I CACCCTGTGGGTTGGAGCGGAGTCAGX3GCACTATGGGGAA 

gcagttgccccacaaaatgtggtttgctgacct atttct aaactgttgaat atgctgcaccat 
jgctgaaatgaaagatgactctgggg^

cctctgcgagcctttctgctgc t t t tgctgctgtaacagcaatagtggagaaaaatgtaaaat 

t tggtcttc cagcttaatgcagtgtgaacaatagatggttaggaaaacaaaactgcttagaa 

gcccctjrctctagagcagttttatgtcatt^ 

TAGGTTTCT ATTCAATATTTGAC 1 1 1 1 1 1 1 1 ICI I attaagaaaatgaaatccctt acaccag at 

ATCAGTTAATTCAAACAGAAAACOCTTTGGGTATCANNNNN 

>302 

>303

>304 

>305 

CGACCCACGCGTC^GCGCGAATCCGTCGCGGAACCTGTCTTCTGTCTTTACCCAGA 
GCTACCATGAGCAAGCGGAACCAGGTATCGTAOGTGCGGC^AGCCGAGCCGGCGTTTCTG 
GCCCGCTTCAAGGAACGGGTCGGCTACAGGGAGGGACCCACCGTAGAGACTAAGAGAATT 
cagcct CAGCCCCCAGATGAAGATGGGGATCACAGTGACAAA GAAGAT GAACAG CCTCAAG 
TGGTGGTTTTAAAAAAGX5GAGACCTGTCAGTTGAAGAAGTCATGAAAATTAAAGCAGAAATA 
AAGGCTGCCAAAGCAGATGAAGAACCAACTCCAGCCGATGGAAGAATCATATATCGAAA^ 
AGTGAA^CATCCCTCAGATGAAAAATATTCAGGTTTAACAGCAAGCTCAAAAAAGAAGAAGC 
CAAATGAAGATGAAGTAAATCAGGACTCGGTCAAAAAGAACT CACAAAAACAAATTAAAAATA 
G TAGOC TCCI I ICI II I GACAACGAAGATGAAAATGAGTAAGTGTAAATATTTTGAATTTAGTC 

TACTTTGAAAGT atatggagtgttcattaaaatcacattttttcctattataaagatact ACAA 

GTTCTTT ATAGAAAGTTTAGGAAATAGAGAAAAAAATTTAATAAACT ACATCT ATTCATCAATA 

CCCCTCTGACTT AAAATGCCAACTCT ATAGAAATTAGCTAX5TATTAACATTTTGTTATTTCCCT 

TGTGTGGTTGT ATATATATGTAAATTATATTTTTAAGCAAAATACATTTTTTGTGTGTAAACAAA 

ATTTTATAAAT ACAACT GTATTGCAAATG7TCTTTGTCCTGCTTCT CACTT GACATTGCATTAT 

GAGTAI ICI I CCAGGTCAGTAAATTTCAAAAACCTGACATTAATAGCTACAGATAATTTCATAA 

ACATCTCATTGTATCTTTTrCATTAGCAATAGCTCCACTTTGGGTGG 

CCTTGTT AAAAATACCTCCIXAACTOCTGCTAAGGGTGGCCATGAGACT^ 

TTAAGAAATACAGGTGG AATTC TGC1TGATAAAGCTGCTGGGTTTTTTGTTA 

ACTTGGCAAACATGAGarrTTGCTCTTATCTTTTCAT 

TGAGTACCTGCCGGGC6ACCGCCACCGCGTGGAGCTCCAA 

>306 

NNGGAGCTCCCCGCGGTGGCGGCTCGAGTACGCGGGGAGGCA^ 
GCC CATGTGAGGTGCCTCCTGCCAATCACAGACTACCCTTCXXrr GGTCCTGGAGGTTCAAA 
GAATTGCAGGAGGGTAGAAAAGCACCTGGGTCGGGTGCAGACTGCGGAGCGGG^ 
GTGTGCGCAGAAAGAGGAGGCGC I I GCC 1 1 CAGCTTGTGGGAAATCCCGAAGATGGCCAAA 
GACAACTCAA CTGTTO GTTGCTTCCAGGGCCT 

TGC GGCAT TGCCCTGACTGCGGAGTGCAICI ICI 1 1 G I ATCTGAOCAACACAGCCTCT ACCC 
ACTGCTTGAAGCCACCGACAACGATGACATCTATGGGGCTGCCTGGATCGGCAT^ 
G GCATCT GCC1CI I C I GCCTGTCTGTTCTAGGCATTQTAGGCATCATGAAGTCCAGCAGGAA 
AATTCTTCTGGOGTATTrCATTCTGATGTTTATAGT AT ATGCCTTTGAAGTGGCATCTTGTATC 
ACAGCAGCAACACAACGAGACTTTTTCACADCCAACCI CI ICOGAAGCAGATGCTAGAGAG 
GTACC I LAiCCCGGCGGCCCCTCGACCGCCO 

TrCAA A^CC AGCTGAAGGAAA GAGGAAGTGCTAGAGAGAGCOCCCTTCAGT G I GC I IC IGA 
CTTTTACGGACTrGGCTTGTT AGAAGGCTGAAAGATGATGGCAXySAATGAAAATCCAGCTTG 
TATGCATGCTACTCCTGGCTTTCAGCTCCTGGAGTCTGTGCTCAGATTCAGAA^ 
AAAGCATTAGAAGCAGATTTCTTGACCAAT ATGCAT ACATCAAAG^TTAGT AAAGCACATGTT 
OCCTCTTGGAAGATGACTCTGCTAAATGTTTGCAGTCTTGT aaataatttgaacagcccagct 
gaggaaacaggagaagttcatgaagaggagcttgttgcaag^ 
aga tggcttt agcttggaagcaatgttgacaatataccagctccacaaaatctct 

GGGoi i i rCAACACTGGGAGTTAATCCAGGAAGATATTCrrGATACTGGA 
GGAAAGGA AGA AGTCATAAAGAGAAAAATTCCTrATATTCTGAAACGGC^ 
. TAAACCCAGAAGACCCTACATACTCAAAAGAGATTCTTACTATT 

^atttacatgtgattgtgattcatcatcccttaattaaatatcaaattatatttgtgtgaaaatg . 

TGACAAACACACTT ATCT GTCTCTTCT ACAATTGTGGTTT ATTGAATGT GATTTTTCT GCAjCTA 

atataaattagact AAGTG 1 1 1 1 CAAAT AAATCT aaatcttcagcatgatgtgttgtgt at aat 

TGGAGT AGATATTAATTAAGTCACCTGTATAATGTTTTGTAATTTTGCAA^ 
MAA rGTTTGATATnTAl ACCAGCTTATGAGCTCAAAGTACCTCGGCCG
CCACCGCGGTGAGCTCCAANNNN 
>307 

n - A "^ G7CCA I CAAGAGTC ^ 

CAACCCTCACGCAGAG ATTCCAATCAAAAGTCCAG CTCT GTGCCATGTGCCAATTGTCATCG 

gaatgtggtcatacctctggtggtgctttcagaatatcagtgccaacagtatgacagggata 

G I c / GGA I ACA j^ GATGTG TGGAAGTA™^ 

AGCATGGTTTCCAAGGTGTA TATAT^ 

7GAA I GCCATAAACCACA ™TCAAGTCCT^^ 

CAG 2 7CATAAG TT^^ C ^ 
AC77 7, c l AC ^ T7 f c ^^ 

^ C ^^^ C ^ CG T G ^^ GAAATACTCTGTAATGGAATGC ^^ 

J CG ^ G j^ CA7 J CCCAAAACC ^ 

TGC ? CCA II ATA ^ GCGAC ^ 

GAAATC^O>AAGAGCTCTCC^^ 

IrlPrflL GAATGTCCCu ' 1 1 ' 'Cagggtataaaagtggtgcgagcaccacgggacagcgt 

! CTTG J^ G J^ ATCT ^^ 
A £IS2 AA ®2!S^ 

5 G S^^Z™I G ^ 5G 5 rGCAGG< ^ AAACTC ^ CT 
CTG ^ c7GATGGAAA7cmGG CACT^ 

T r^ Q £f? CQ J£J AAA J^^ 

C£«iGCCAGCX3AGAGCGAGTCAAGATO 
G ^ AA II^ GGC J A I A I CTGGTATCTCTA TAATATC^ 

J 377076 ^^! 60 *^^ 

CTOgACTOCGGCTCT^C^CCCCCTCTrOM 
GCGCCCGCOGCGTAOCTCGQCC»CCACCGCGGTGAGCTCCAA 



A77777A7G7 Z^ A77 ^ 
^?Z G ^ GG i^ GCAC ^ G ^^ 

j 0 ™ 0 ^ 777 ^ 70 ^ 

GAAGAA TGGAJ£A^^ 
TTCTT^CTGTCCTCTTATTT^ 

T J^ A S 7G <nJ CT I A £ AC ^^^ 

T GCCC ^ GTTATAG Z C ^^ 
C JI C I G I ATCTCCCTACCACCACGCCC ^ 

2^ AC ^^ G ^ AA ^^ 

aaatctagt atttgtagttt atttctaggtctgctagaacctgaaaacctgaatgaaaaaagg 
^TS CTGAGAC T AA X CAAGTCCTGTGAGG ^ AAA ^ A ^GA
^I AC ^ A ^ AT ^ ACAACAAGCCTGGGTAAGA ^ C ^ 

GG J^ A ^r^ GGGGAGAA GCACTTTAAACATAAGAAATGTGGTGGCAATATAAAGTTTGTAGOCT 
^ CAATAAA GGTTTATAAAACAGTTAAAAAAAA 

>311 

>312

™,^ CA ^ CA ^ G ^ GTGGTCGGTCTGTCCCCTCCCG CT 

C/XTCTCAACCCATGTTTTAAACGGACX^C^ 

CATG T AA J G ^ GGG GAATTCCCAATOT 

A ^ G Xr G ^ A ^ TG ^ GGGTG Z G ^ AGGT ^ AGGGAGAAGG7 aaaggctcaactcgtttcttat 

5 TGT7 H GTT S TGGT A AA ^CT rTTTTCTAAGATATGAATGGTTTCTT CCCT AATCTTCCCTTTC 
CTTri^AAGTTTTAATCl 1 1 ICI I CI IGTA U I I lb I I M 1 1 G ATCCAAATTTGCTAGCTTATG 
TX9I G !? G ^ GTA G ^ ^^ G ATTCTGT ACTTCTTTTCTCTGTTCTT CTTT CTCT ATTAATTGG GCA 

A 1 AGT77777CA77T ^^^ 
CAG i G ^ TCACA I GTG ^ 
AG77 /^I G7Aa3A7777GT ^ 

G7 J G ^ AGATG ^ CAA7A ^^ 

A ™ AC J AAGC I CTTCA ^ " 1 1 'Ct^caacttctctcaacatactagccaagtgtccgtgcc 

7C 1 G ^ C AG , CTTG AGTT CTTT GACAGCGATGCTGAGCAATCAGTCTATCAGCCTGTGCAAG 

^?^^ c jttt^tii ccaaaagatc ^ 

! 77c £ 7ttc a tatacat ^ 

A77C *^ AC7Ga ^ gtt tc^ 

s£ q ^ g ™ aacac77 ^^ 

7ggg ™ 7a7g77ctg ™ 

TAAAGCAGGAAAATCTGGCAGTGGAGCAGCCTCCCAATAATATTCCTCAGTC 
TTG TTCT CTATTATCTGAGGTTAAAGCAAAAGCCAAAGGAGTAATCAAAOfrrftfi 
GTATTrTGT A GAAGCTTACTTCCAT ACCAG GAACC AATGGTTT AAGTTTGTTAATCAAATCAA 

GAG77 T T £^ T I ACA ™ 

ATCCCAGGTCAATCTTGCCATATGTAAATTGTTGTTCTATAAGAGTGGTAriAfrrTnAr^nTTr^ 
S GC 5nZ G P C ^ TCCTTTCACA 1 1 ' " I ICI I GTTAAAGCCTCATCTAGATTTTGTTCTGTGAAC 

T . GAA ^ G £I C ^ GA ]I G T AGG ^^ 

TAGGAGGATCT a ^gacatcctcaaatatttccttgaacaactccaatgct aaaacagaacagt 

ss? A J c ^ c ^ A ^ GG II GGC J rAACCAGG ^ 

^^IxI G 5 c 3TfS^ GA H GGGG 

g ^™T ggcgcag ct g ^ cac ^ 

GAA J™ G ^J ccmccA ^ AA6AAGACCTW 

ata IH GGTGagataatca6Gaat ^ ag <^ttcttaaggagatc^ 

A7AC777 <™^ A i^ TG ^ 

tgmtgtttcgagc>tggaatagcttttcccccacctctt^ 

a TIa^a^I qc £ a £ cac ^ c JS ac ^ 

aag ^ 777GCG ^ 7aa tc^^ 

! A7 j^ G J* AACGTA ^ T ™ 

AA ^? TG ™^ G77GAATGCAC ^^ 

CGAG7Cc ra cG ^ GAG ^ 

cjactgct agttgagacagcaaaccgataatacttaagatcagtgaagcacttatgttgggg 

CC7C T G ^! T <£ CCG £ C ^^ 

ggccttt^ctttcgS < ^^^ GOT 

>313 

A^AT^SISKSSiSS^ 

agatgtacagctcagtcaaagac^ctaaattcttcttagaaaaatagtgctaaggagtatagc 
* G ^[ G ^ CCTATATGTGTG ^ GGCT ^ GA G^

GACAGTTGAAA I GTCAATGCTA ^ 

A ^I A ^ A A TATAGAGAAAAGAAAAATAGAGCAG ^ GAG rrCTATGAGGTAT^ 

S AGAC ^T A ^ GTATG ^ CG ^ AGTOTGC ^ CCTG 

yCCTGGAATTTGATACAGAGAGCAATTTATAGCCAATTGATAGCTTATGCTGT^ 

AATTCGTGGTAAATAACTTAGGAACTGCCTCTTCTITTTCTT^ 

G ^ 7AA ^ 7G7GTA ™ G ™^^ 

S^?^5I^ AC ^™ t f 1 V ' ' CA J AA ^C ^^CAGGTCTAC^TTAATTTATTTTGACG 

7C S AG £^ 7AAC7CA ^ 

CG I CA J5 G ^I CCTGCAACCAAAAGCCAT 

TCTGATGAGTCTATGGACCAATTTGTGGAGGACAGTAGATTAAATAGATCT 

IJ^ G ^ AAA J G ^ G ^i^ G ^ 

I C ^ C ^T5 CCACTTCCTTCCTCAACACA ^ 

c ^?iSS G ^ TGCAGTGTAGGCAAAATTGCAA ^ 

>314 

NGGAG CTOCACCGCQGTG GNCGGTCGAGGTAGGCGGG GGGTCCTGGAQGTTCA a 

AG ^J 7GC ** G / GG 5 m ^ 
G ^ G i?^ AG ^^ GAGGCG 

T GT 4 G ^GGCTTTTC^GGTGCTTCAAGTGCA 

ACC7T7G7G TJ CC77 ^^ 

A7GGGGAAAGAGAAG ^GAACTGTTC6TTGCTTCCA 
T^rrG^TTGTTGCG^TTGCCCT 

G ^ C ™ CCC ^ GC ^ GAAGCCA 

C ^ G ^ G ^^ TTCTTCTGGCGTATTTCATTC I G ^ 

CA7CTTG7ATCACAGGAGCAAG ^^ 

ATGCTAGAGAGGT 
>315 

^i g ^ cc ^ atatg j gtgttggctgggagaatatca 
GACAG I7 GAAA7G I C ^ GCT 

TAGAAAGTGACTCTTTCTGACAGAAATTTGTAG 

AA L AA ^ 7A7 , AGAGAAAAG ^^ 

GAGACATAAGT ATGTrCCTTrAGTCTTGCTTCCT GT GT GCCACACTGCCCCT^^ACiCScATA 
G^TGGGG<^TTGTrTAAAGTCATTTTGTTC<X^ 

T7CC7 ^ GAA77TGA7A g GAGAG ^ 

A^TC^GGTAAATAACTTAGGAACTGCX^CTT CI I I MC I 1 1 GAAAACCT ACTTAT WTGTT 

AA J A p GAAGAG GGGTT ACTTjTGCTGGGTAATGTGT AAACTT GT G I CI It ill I AGAAAGATAAA 
TTTAAAGACT ATCACATTGCTTTTTCATAAAACAAGACAGGTCTACAA 
C^TTGATAGGGGGGCCaAGTAAGCOCCA^^ 

Z c ^ agc ^^I^ ac J caatctaAtgct 

CGTCATGG^TCCTG<>ACCAAAAGCCATG^ 

tctgatgactctatggacxuatttgtggag^^ 
GAA ^GTAAGGAGGATAAAAACTTGCATACCAATTGTACACCCTrGCAAAATCTTTCTCTGATG
TTGGAGAAAATGGGCCAGTGAGATCATGGATATAGAAGTACAGTCAATGTTCAGCTGTACCC 
X5^ AGM7GGGAG ^ GG ^ TCCT CAACACAATTCAAACJ\AAT AGACTC^GLACTGTTTCAGGCT 

ccaggacaggaagtgcagtgtaggcaaaattgcaaaaattgagggcacaggggtggaggt 

G 5^^ G Jj! G ZZ G ^ A7 AAGAA GCTG TG CT AAAT AATTA GG TGT AAAT AT ATTTTTTCATTTTT AAAA 

ATACATCCAAGCCAAAAAAAAAAAAAATTCCTGCGGCCGCAAGGGANNNNNNNNNMNNNNN 

NNNNNNNNNNNNNNNNNNNNNN 

>316 

cgtecgcacataTTTGTG GAGTG CCTATT ACGTGCCAGAAGCTGTTCTGG ACACTGAG AA 
ACAGGGATG AAG AAG AAACAGATCCAAGCCTTCCTGAG AGT AACCTCCCCAGG TTT CATGG 
ATGAGGAAACTGAAGGTCGTCCTGACTCAGGCTCATGGCrrCCGACCCCGGCTTCTGTGGTT 
G <^ GGGAGCACC77AC77AGACT 

GGCTGCGC»CG<XACCCGGGCCTCTCCAGTGCCCCGCCTG^CTCGGCATCCA 

CCCG^C^C^CGTG<3GTTCCCGCAC»TCCGCCGGCCCa^^ 

TGTTCCaCTTAAGGCCCCTCCCGCGCCCAGCTCAGAGTGCTGCAGCCGCTGCCGCCGA 

cgggatctcattgccacgcgcxxxxx^cgaccgcccgac^tgcattcccgat^ 
ttccaagtccaatatggcaactctaaaggatc^gc^^ 

gacccccgagaataagattacagttgttggggtrggtgctgttggcatggcctgtgccatc 

AGTATCTTAATGAAGGACTT GGCAGATGAACTTGCTCTTGTTGATGTCATCGAAGACAAATTG 

aagggagagatgatggatctccaacatggcagccttttccttagaacaccaaagattgtct 
tggcaaagactataatgtaactgcaaactccaagctggtcattatcacggctggggcacgtc 
agcaagagggagaaagccgttjttaatttggtccagcgtaacgtgaacatcrrtt aaattcatc 
ATTCCTAATGTTGT aaaatacagcc^aacttgcaagttgcttattgtttcaaatccagtggat 

ATCTTGACCTACGTGGCTTGGAAGATAAGTGGTTTTCCCAAAAACCGTGTT 

TTAAGCTGTCATGGGTGGGTCCTTGGGGAACATGGAGATTCCAGTGTGCCTGTATGGAGTG 
GAATGAATGTTGCTGGTGTCTCTCTGAAGACTCTGCACCCAG ATTT A GG G ACT GAT AAAGAT 
AAGGAACAGTGGAAAGAGGTTCACAAGCAGGTGGTTGAGAGTGCTTATGAGGTGATCAAAC 
TCAAAGGCTACACATCCT GGG CTATTG GACTCTCTGT AGCAGATTTG GCAGAG AGT ATAATG 
AAGAATGTTAGGCX3GGTGCACCCAGTTTCCACCATGATTAAGGGTCTTTACGGAATAAAGGA 
I 0 ^ 070770077 AGTG n-CCTTGCATTTTGGGACAGAATGGAATCTCAGACCTTGTGAAGGT 
GACTCTGACTTCTGAGGAAGAGGCCCGTTTG^AGAAGAGTGCAGATAC^ 
CAAAAGGAGCTGCAATTTT AAAGTCTTCTGATGTCAT ATCATTTCACT GTCT AGGCTACAACA 
GGATTCTAGGTGGAGGTTGTGCATGTTGTCXrrTTTT ATCT GATCT GTGATTAAAGCAGTAATA 
TTTTAAGATGGACTGGGAAAAACATCAACTCCTGAAGTTAGAAATAAGAATGGTfTGT AAAAT 
CGACAGCTATATCCTGATGCTGGATGGTATTAATCTTGTGT AGTCTT CAACTGGTTAGTGTGA 
AATAGTTCTGCCACCTCTGxACGCAGCACTGGCAATGCTGTACGT ACT GCATTTGCCCCTTGA 
GCCAGGTGjGATGTTTA<XX3TGTGTTATATAAC^ 

CCAACAi 1 1 1 1 r t^CCAGTGAGTGACATCCTGGGATCCAGTGTATAAATCCAATATCATGTCTT 
GTGCATAATTCTTCCAAAGGATCTTATTTTGTGAACTAT^^ 

Z G T AAAAAG i^ AC i 7ACAAACAA7GCAA ^ 

CCAATAAACCTTGAACAGTGAAAAAAAAAAAAAAAAAAAAA Aaaaaattg gQy w 



accgg 



^^NW^AGCATTGT^^ 

CCC Z C T C £ AC ^ 7G . 7T Z^ A ^ 

CATGTAACAAGGGGAATTCCCAATCCTCCGGAAGTTTCCGCAGGACC^ 
AAGACGTTC^AATGGCGCT GTGGAC CTTTAGGGAGAACCTAAAGG 

CTbU CIui iCTGC TA AAl ICI I M I CTAAGAT ATCAATG G 1 1 IC I I CCCT AATCTTCCCTTTC 
oi 1 1 iGTAAjji i i iAATCI 1 1 ICI ICI IGTACI TTTGTTTT1 1 GATCCAAATTTGCTAGCTTATG 
TT CTCTGTCT ACTAGCTGATTCTGTAC I ICI T 1 1 CTCTGTTCTTCTTTCT CT ATTAATTGGGCA 
AT ACTTTTTT CATTTTGTTCCTTGAGTG^CTCATTCAACTITTTTC 
CAGAGAATCACATGTGATCTGTAAATCTTTATTCTTCTTTTCA^ 

AGTTCCATGT AGGATTTTGTCAGTATTTCATGfTTCTTCAGCCACAGACT CT AACTTCCTATTAT 
gttgaaagagatgctcaatatcactctgcgctctttctgattcaacttgctgcgocttcagca
acacactaagctcttcat ttttt ctctcaacttctctcaacatactagcgaagtgtc 
tctgtttcagcttgagtt ctttgacagcgatgctgag caatcagtct atcagcctg7gcaag 

GGCTAGAG C1 HIGH I OCAAAAGATCTTGTAGCCTG C I 11 C I I T GGAAGCTAATGTGGATAG 
TTTCATTTCATATAGATCCATTATGTCAGATATTCTCACATCACAAATCTGATCCTTAACCACC 
ATTCCAGACTGAAG 1 1 1 C I C 1 ATT AATTCTTCAAT ATTCAA TCCAGG AACACCATCTTTCAAAT 

GAGGAG tt aaacacttt attg atgttg gaaaact gtgattt gatgattgccag ggcatttttc 

TGGGTATATGTTCTGTTTCCTGTTGTCT ATAGGCATTGTTTGCTGCTATACTTT CTCCAAGTAC 

TAAAGCAGGAAAATCT ggcagtggagcagcctcccaataatattcctcagtccagactgtac 

TTGTTCTCTATTATCTGACGTTAAAGCAAAAGCCAAAGGAGTAATCAAACGTGGGTCCTGAA 
GTATTTTGTAGAAGCTTACTTCCATACCAGGAACCAATGGTTTAAGTTtGTTAATCAAATCAA 
GAGTTTTCAAAATTACATCAGCAGCAAGTTTGCATAATTCAGAATCTGCAACC 1 1 I G 1 ICCAA 
ATCCCAGGTCAATCTTGXX^TATGTAAATrGTTGTTCTATAAGAGTGGTACAGTTGACAGTTG 
TCAAGATTTTTGCAATATGCA! 1 1 1 JAGTGTATCATCTCCACAGAGAGTTAACAAAACTTCAAT 
GGOCTTGGCAATCCTTTCACA 1 1 1 1 1 1 ICMGI IAAAGCCTCATCTAGATTTTGTTCTGTGAAC 

tgaagttgatcaaggattgtaggcagcagaagggtcacaaaacgatcagccgaggaacagt 
tagcagcatct atgacatcctcaaatatttccttgaacaactccaatgct aaaacagaacagt 
tttctgatcogtccaaaggttggcttaaccagcgcagtagagcgacagtaggttctaaacat 
ttagt atggcttcccagagtggcgctgccaggtggagactgttcaaacatcatgtgagtga 
gcatatggcgcagctgagtcactgaacagaaggcaagaagtaattctaaaactnttgaagag 
gaatcaggatcctrocattaagaagacctaatacttggtgaagacatgaagaaaa^ 

ATATCTGGTGAGATAATCAGCAATTTTAGGATTCTTAAGGAGATCCATCAGTAGGTCAAGTGA 
AT ACTTT CT AGTT AGAGTGCCAT CACCG TTT ATGAGAAT ATT AAAT ATT AGTTGAAAAGTCT GA 
TGAATGTTTCGAGCATG^IAATAGCTTTTOCCCCACCTCT^ 

ATTGAAAGTGCAAACACAACCACAGTTAAACTACTATGGGCGAACAAGGTGATAAGAGTTCG 
ATAAAAAGATTTCACATTACTCAATGTCTrTATGTGCGTTTGAACAG 

AAGATTTGCCAATAATCCTAGACAAGGCATTTTTAACTCATCTT CAGAAGATTGAATGTGATC 
TATCAGGAACGTAATTAATTCATCTATATTGGCACCAGAATAGAAAATTTTGACATTATATGTT 
AACTTCTGT AGAAGTTGAATGCACCTGCAAAAACACCGAATCAGTGTGGCTGCTCCGACAAA 
CCACTCCCGCCAGCACACT ATTCAG ATTAT ATGT ATTCTGAAGACAATCTCTGGTTTCCATGT 
CT ACTGCTAGTTGAGACAG CAAACCGLATAAT ACTTAAGATCAGTGAAGCACTT ATGTTGGGG 
TCTrCAAGTAGCTCTACAAGGCAACTCAAGCATTCACTTGTT AATATCTGATTTGATGT AAAT A 
GTCGTGTGAG III CI G 1 tX^GAAATTACCTCCAGAGTGCCGCAAAAGCTGAGTGGCGTTCG 
CCTCTGACTTCACCGG CTTT GT ACTGACTGACAGTCAGGAG CAAG GACTT CAAGCAGGCAG 
TGGAGTCCATTGCACCG^CCGCGXSCCCGGCTTAGGGA 
GGCCTTTAGCTTTCGCC 
>318

GCCGCOCGGGCAGGTACAAGGTTTGGTGATGTAGGCGATTTTTTTAC 
ACAGTGATGGTCAGTCACTTTCTGTAGAGTATCTGTGTGCATATTAACGCTCT AATTATATTTT 
AAAATCTTATTTTATATAGGATGTAATTGGAAAATCGATGAATCAAGCAATTJTGTATT 
TATAGAGATACCAGAAGTGAGGATTCT ACACGT AGATTGTTCAAATTTCCTTTTCTGTGGAAT 
AATAAAACTTCAAATCT ACATTATC 1 I CI M I 1 ACT ATTCT AGAAGATCCACTTT ATAAAATGTG 
CATCTTAAGGAGAi^TAGriT^TATTTCTCAATCTGTGAGT AATGGACT AATTGCTATTAAATTT 
GGG AGCT rrACATATGCCACAACAGAAAAAGTCAGAAGAAGCATCT ACAGTTGTTTAGATGC 
ACAG 1 F1 1 ATGATGATGAAACTGT AACAGT AGTTCTT AAAGACACTGTAGGACGTGAAGGAA 
GAGATAGACTCTTGG7CCAQCTGCCTTTGTCTTT 

ATCAGTTCACTGGGACTTATTCTACAAGGCTAGATGAACAGTGTAGTGCT 
AGCACTGCTCATCACTGAATGACTrrATACATGGATAATAATGAOCA 

GAATGAACCATAACAGAGAAAAGTTCATAGATCAAGGAA I I C I 1 1 1 C AAOTCCAGATTTTGAG 
ATCTrTTCAGTTCCACTATTGGTTTX^VAGACAGATTCCTT C^TGAACACCATCATCATTGGAG 
AAAT CGTTGCTCATCTGATGACTGTGACAACT ACCTTCATTTTCTGCTCCACTCTCAGTGCAA 
CTTTCAGTCTGAGGAGATTTCTCATCTTCAACATCAATAAAAGTA 
GGGAGCTCCAA 
>319 
>320 
>321 
>323
>324 

W <ACACATTGTAATATTATATCATGTATAGTTGTACGCAGCTCTGTGCATAACTGTGG 

taactttgtgggtcgctcctgtgggtcctgaaagaatgcagttctccccgcgtatgcgacat 
ccggctgatggtcatggagatccgcaatgcttatgctgtgttatatgacatcatcctgaaga 
acttcgagaagctcaagaagcccaggggagaaacaaagggaatgatctattgagagccctc 

TCTCCCATTCT GTGATGAGTACTCTTCTGCACTGTTC 1 1 I C I I I CTAATAAAAC I ITCTTTTTC 

gaacctatactgtcttctgtaaattcttcttact accctatgacccgtgagccaaccactttc 
cgatgccagggttctgacacctcacctggcataatataaagtgi iiiii i h iatacccttcc 

ACTTGGAAAGACTACAGAGGAATCTrGCACTGCATAGTTCAAACTAAAAAGAGAAGAGTTAAT 

tacctgaaaag caaga gaaaacaagaaggggtaaattttgaaccaagggaaatcatttaaga 
AGTGTCTGGTA 1 1 1 T I caaatttctgtcagttgttacatttgtcataagtaaatgtttaggaat 
aaaggatggagacatgcttattttatttaactcccccaaaaaaaaaaaaaaaaaaaagtac^ 
gccxx^gcggccgccccgggcggccgctcn 

>325 

knctctgaagatgataccaaaattccttttgata^ 
cgaaatgttgtattcrrtatttcagtgtttccttccagacatttttaagot aattggcttactac 
aatatggacattcaccaaat atgatgttaaaactctgtct act agtt agtagtcctctcagcc - 
actcatataagcatatttgatggaaaggttgtcx^cactgagaattatcacacacttgatcag 

GAATGGTACCGTCAAGTTGATAAGCATAACAAATTCCAGAATCCATAGTAAAATCAGATTrTT 

CCAGGATAGCACGAGCTGGAAAATCAATTTCTAAAACATCTTTCAAAr^ 

TTTCTGGATCCCTGAAAGCATGGGGAAAAAAATT ATGCTGTGAACTTT GTAAAATCAOCAAAA 

AGT AAAAAT TAT ATTGC CAAGGTACCT ACCACAAATGTATGTTCCTGCTCAGCTTAATTCCCA 

GGGGTTTTACX^CTTCAGATTAAAAAAAAAAAATTTAATAATTGCATGCTCT actcttggtttc 

TAAAGCCACATTTGATACAGAATGTTTCTTAACTACrrfGATTAGAAAATCAAA 

TATGTTAAAAAACAAAATTATACAGTTCAGAATGCTGAGACATTGTCATTTAAAATAA^ 

TT ATTGCGGTGAACCGAGATCGCGCCATT ACACTACAT ACT GGG CAACAAGAGCAAAACTCC 

GTCTCAGACAAAAAATAAAAAXn-AC<>TTAATATACTAATATTGTCTAATAATT^ 

AGGGCAAGGATTTTATGACTCTATTAAAAAACXSTTTAAATCrCAGATGTCATAC^ 

CTCCAAGAAAGAAGC^CTCAGGAAGCAT AGTAGGATGCCTGGGGTCT ACCTCTATATTTATG • 

GCAACATT ATT ACCTAATGCAATTCTGCG TGCATGTTGCACCCCGGGG AGG Mil 1C1G GCT 

CAAGTACCCAGG I C 1 1 C T CATCGATTTCATCCATAACATCCCAGAATGCCTTTAGTGATTCTA 

TTGCTGCCAAAAACTGACTATAAATGCTT ATT AAGG AGCTCTGAGGAGAATTTACCTGAGGT 

GTCCAGGAGGCACAAAATGGAACAGGAAAATCCACAAAATAATCTGGTtV^ 

CTTTGCCTT CAACITGAGAGTGATTAAATGCTCTCT ACCAGAAGCATCTTCTGCTT^ 

GATGGT ACCTCGGCCGCCACCCGCATACACAAGTTTATCCCAACCAAGAGTTCCTATCT 

CAATA AGGC TTGA G TAGAAC TGGGGAGTGATTCAAAACTTTrAAATTrGAC^ C I CT 

1 1 1 1 1 CTGT ATTTTCA I I 1 1 CATCTATTGAAGGAAGATATTTATTTCTAAAATGGCTCCTGCAG 

CAGCCAAAATACAAACAGCTATTATTGGTGAAGTTAGCCGX^^ 

>326

nnacacattgtaatattatatcatgtatagttgt acgcaggtctgtgcataactgtgg 

TAACTTTGTGGGTCG<rrCCTGTGGGTCCTGAAACAATG 

CCGGCTGATGGTC^TGGAGATCCGCAATGCTTATGCTGTGTTATATGACATCATCCTGAAGA . 
ACTTCGAGAAGCTCAAGAAGCCCAGGGGAGAAACAAAGGGAATGATCTATTGAG 
TCTCCX^TTCTGTGATGAGTACTCTTCTGCACTGTTC 1 1 1 C I I I CTAATAAAA C I ITCTTTTTC 
GAACCT AT AC IGTCTJ CTGTAAATTCTTCTTACTACCCTATGACCCGTGAGCCAACCACTTTC 
^^^^^^^^J^^*^CCTCACCTGGCATAATAJMfiXjJG 1 1 1 1 1 II 1 1 IATACCCTTCC 
ACTTGGAAAGACT ACAQAGGAATC1TGCACTGCATAGTTCAAACTAAAAAGAGAAGAGTT AAT 
TA CCTGAAAAGCAACAGAAAACAAGAA 

^^^^^^[l^^^^'l^^T^^TT^TT^'" ATTTAACTOOOCC^MAAAAAAAAAAAAAAAAAACSTMCXn" 



atcataatattgctgacagc • 



AATAAACTGCCACATCTT CAGCCTGC*GGCTGCTGGTGGTGAGAGTGAAATCTGTCXX*GA 
cccgctgccactgaatcggtcagggaccccggattcccgggtagacgcccagtaaatgag
cagtttaggaggctg gcccg gtttgtgctggtacgcgggagaatggctc^ 
gtgagctcggaaaicci 1 1 1 aaaagaaattgaaatctcactttttatttggttttaagtacctc 
ggccgccaccgcggtgagcnkn 

>326 

tgtcacagacactcxtgggtttggaattttgttgtt C T C > GTCTCTTTGATTTCCTGG 

AAGACGACACCATGACAATTTC^^GAAAATAGAACAAAATGAAGGAAAAAGAGGCTCTGTC 
TTAGCACATTCCTGTGACCAGCCTGCTGTCTGTGGCGTG<XCTCCTG gcccggccttggca 
CATGTT CGi 1 1 1 IGTGGTTGTTGGCTGGACAGGCAACTCTGCAGGGCTGCT^ 
CCCTTTG



CTTTCTOCAAGGCTCCCTGCAACAGCTGGCCTGTCCCTGGTGGGGCTGACAG C ITTCTTCTI 

ACCCTGCCAGGCTGGCCAAGCCCCAGAGGTGACCTATGAGGCAGAAGAGGCCTCCT^ 

GACGTTGCTACTCACTAGCTTGGATGGGCACCTGCTGGAGCCAGATGCTGAGTACC^ 

TGGCTGCTAACCAACATCCCGGGTAACC^^ 

TCCCCCCCHTCCCTXSCCCGAGGCTC^ 

GGACCAGCCGATT GACTT CTCTGAGGACGCACGCCCCTCACCCT GCT ATCAGCTGGCCCAG 

CGGACCTT COGCACt \ 1 1 GATTTCTACAAGAAACACAAGAAACCATGACTCCAGCCG^SCTTG 

TIXTTCTTCCAGTGOCGCTGG^TGACTCCGTCACCTACATC 

GCGGGAGCCGGTGN 

>329

TCCGAAATGGGGGAAAATCACTGCTCTTCCCAGAGCGCATAC^ 

GGATCTGCCTCCCTACCCATCCTTTTCCATGCTTTATGAAAAACT 
AACCAGT ACTTTTGGACTTGAGTGACCTGGAAGCTGAATGCCCATCTCT 
TTCAGAAGCTGCCTTCT AGAAGAATGATTGAACATTGGAAGTTTCAAGAGGATGCTT CCTTT A 
GGATAAAGCTACGTGCTGTTGTTTTCCAGGAACAAGTGCT CT GTCJ^CATTTGGGGACTGGAG 
ATGAGTCCTCT I GGAAGGATTTGGGTGAGCTTGATGCCCAGGGAACAACCCAA^ 

aatcaacagttcttgactg cx^aaactttttccatttgtt atg ttccaagacaaagatgaaccc ■ 

atacatgatcagctccaccgtaatttttagggactc^ 

cgtggttcaagccaaactggcagcatttggcocaatct 

ATAATAAA AGTAA ATATATn^CXTCAAAGTACATTCATGTAAGCCCT 

tcatttcttgctt atgagtgcct gcatggtgtgcaccatagg 1 1 1 ccgctttcatgggacat 

GAGTGAAAATGAAACCAAG TCAA7 AT CUuGGTACCTTTACAGATTTGCAAT AAGATGGTCTGTG 
agaatgtatatgcaagt GGTATGTGTGT AATTATGGCT AAAGACAAACCATTATTCAGTGAAT 

T A 5I AATGA f^ 1 ^ ATCCm ^ 

ACAGWATCAGGAAAAAGTACACAGTGA G I IC I G I MAI 1 11 1 1 G 1 A GGTTCATTATGTTTAT 
uiiiji i I AAGATGT ATATAAGAACCT ACCTATCATGCTGTATGTATCACTCATTCCATTTTCAT 
GTTCCATGCAT ACTCG XSCCATCATGCTAATATGT ATCCTTTTAAG^ CAAGGAAACAAA 
AGGGCCTTTT AM J 1 1 ATAAAGGT AAAAAAAATTCCCCAAATATTTrGCACTGAATGTACCAAA 
GGTGAAGGGACATTACAATATGACTAACAGCAACTCCATCACTTGAGA 

AGci i UiAAATC AAACI I CC I T CACAGTGCCGTGTCTACCACT AGAAGGACTCTGCATCT AAG 
TAATAATTTTTrAAGATTCACTATATGTG^TAGTATGATATGC^ 

TCTCri CCATCCATCAAATACTTTACAGGATGGCATTTAATACAGATATTTCGTA 1 1 1 CCCCCA 
CTCCTTTTTATTTCTACAGCATCATTAAA^ 

AGAACTAT A JJTOAAGCCTGC*TTTTCACT^^ 

U 1 1 1 I I U I ATGGCATTJTCCAAAACTGCATTjACATCACTAGTrrrTACTTCT 
GAAGGTATTCTTCATTTTAAI I GUI 1 1 1 GGGATTACTC<>CATCTTTGTTTATTTCTTGACTAAT 
J^TJjreAATAGAGJGAAGTTA^ 
GCCAGCCTATj^GTrrACAGGC^^ 

CAT^ 1 ^' 1 1 * G T AAGAACTGATTGC1 C I CC I G I GAGTTAAGCTATGTTTACTACTGGGAOC 
CTCAAOAGGAATACCMCTTATGTTA^ 

AATTGGAAGAG AAAAAAAGAAAAACTTTTAACTGGAAATGTT AGTTTGTACTTATTGATCATGA 
ATACAAGTATATATTTAAT Til GCW— P— XHNNMI 9tHMHMd9000WHMinMMMNriMNH ' 
GGGCTTACAGTGGCGGGAGTTGGAGGCGATAACGATTITjTGTTGTGA^

AGCTGCGATTTCTGCTGAACTTGGAGGC^^ 
JCCGACCCTGATGCTCTT(^TTCGGTGCT^ 
GGA I GG T!^ CACAATACCCT OTC^^ 

GAACACACTTCTT ATGGAGAAACAGGAGTCCCAGTTCCT CCTTTTGGATGT ACC 1 I CI CIICT 

C^CAGAATCACAAAGTTrCAGAAAGAAGTGCTTCAAAGAATGGATGGCT 
^CTITGACCTGGCCTGGGTTCCTGGTGAACTTAAACTTGTTACAGCAGCAGGTGATCA 

S^ AAA I^^ GACG J AAAAGCTGGTGAG ^ 

S 0 ** 0 ! 0 * 01 " 1 ' 0001 1 1 1 CTAAGTTTGAGAAAGCTGTATTCTGTACGGGTGGAAGAGATG 
GCAACATT ATGGTCT GGGATACCAGGTGCAACAAAAAAGATGGGTTTTATAGGCAAGTGAAT 
StiSSifl fGAGCTCACAATACCTCAGACAAGC^ 
GAA JJP* AAAGGA £I TG <ra 

GTAAGAATTATACTGCTTATC6ACAAGAACCCATAGCATCCAAGTCTITCCTGTACC 

6 P A ?^ A i? ( 5? AAA ^!?^ GGATA ^ cAAGTCT 

CTAATTOCACAGACGATAACATCTACATGTTTAATATGACTG« 
CTATTTTCAATGGACACCAGAACTCTACCTT^ 

CAAOCT(XTACTGTGCTCCTGGGTCATTCTCAAGAGGT TC TGTCT GCTGGTGTCCATC 

T? a £!1^ caaaga ^ gctacctg ^ ctgat ^ c ^ 
aggcttagaggagaaaccaggaggtgataaactttccacggtgggttgggcctc^ 

^?I G 5I G 5 AG ^I C 5^ C ^ CC ^ CAAAT 

CAAGGCCCG^TCTCCCATCAACAGAAGAGGCTCTGTCTCCT 

TCATCTTTCAAGATGTCGATTAGAAACTGGGTGACCCGAACACCTTCCTC^ 

^?IS A ^^^ GGAGACCAAGAT ^ TO ^ 

A ^^ C ^JJ5£ CAAGCAGAGGOTGCTCT 

CTCAAGCTGT CTGGAGAGT GTGAAACAAAAGTGTGTGAAGAGTTGT AACTGTGTGACTGAGC 

^^^f^^JIS AAAA IE T IS A ^ GGA 

CTTAGT AAGGACTCTCT AGGTCCTACCAAATCAAGCAAAATTGAAGGAGCTG<5TACCAGTAT 
CTCAG^GCCTCOGTCTCCTATCAG^ 

^^ ( ^ M: ^CCTJGTGGA^AAGGGXfrrCAAATGGT AGGCAAAGAGAATAGTTCCCCAGAGAAT 
AAAAACTGGTkui I^GGCCATGGCAGCCAAACGGAAGGCTGAGAATCCATCTCCACGAAGTC 

CATCACGCCCAGCT CCATGAGGAAAATCTGCACAT ACTT CCATAGAAAGTCCCAGGAGGACT 
I£I 6 I!^ CCTGAACACTCAACAGAA ^ AT 

TCCACT AAAA CAAG CTGAGC I II GG 1 CCACTAAAACAAGATGAAAAATACAAGAgrnACTTrra 

^T^ TT ^ T ^ A I^^ 6 ^ TCCCA67 ^ 

GAAACTTTCTATGCTCAGTGGAAAGTATCTGCCAGXXAC^ 
TGCT CAGATAACTACATTTATATCACTTCAGTGTTAAAATGC^ 

^^^^^^ A ^ T -^* GAG T G ^T A ^^ TG ^ G ^ GT CTTCGGAGAAACTAG^WaAACAACAACAAAAA 

^^l^V^^^'J^lSi^ J ^^ C1CCC ^ ' ATGAGATTTCAGGTCOCCATGTTTTC 
ACCAAOCAATCTGCTATT^ 

•GTTCTTT AGGGG CTGCAAIG 1 1 1 1 AAAAAAAATAAGTCATCAGATTTTAAGAAAAAAGTGATG 
AAATCTTGAG 1 1 1 CM 1 1 CATGT ACAT AAAAAGCAATAGCCTTTT ACT AT AGAT AGCCCTGAGC 



caaaaagtaat agaattttctctag atatttaatacagagagtgtatag actgact ct aagtta
ataatgt gcaa aa tatctta a acatc cctccccttattcaacaattatgtatcagtgatcttga 
accattgttttata1 i 1 1 1 caccttt gtaacctcatggaaagag gcttt acatactttctatgt 
actatttacttagaagggagcccccttccagtcatgaaacttcatttgttttatccatatccc 
T GA( *» A f^I G J AGAGTT ^^ 

aaaatctattctgacaactttttaattcctitgatcttataagttaaa 
tccatggatcaagtaagcatagttttatccagggagaaaaataaaaggaagccatagaattg 

CTCTGGTCAA AACCAAGCACA CCATAGCCTTAACTGAATATTrAGGAAATCTGCCTAATCTGC 

TTATATTTGGTG 1 1 iU m r i r igactgttgggctttgggaagatgttatttatgaccaatatct 
gccagtaacgctgtttatctcacttg ctttgaaagccaatggggg aaaaaaatccatg aaaa 
aaaaaagattgataaagtagatgattttgtttgtatccctacccatctcctggcagccctact 

™ T S^^ GGGAT ^ni GGCTGTCAG ^^ ATA <^ A G T CTACTGG 
C J^°JI GG ^ GCTAGTAC1 ^ AAATGGGTGCCAAA 

CCT GCCTGTGTC C^ TGTCAG ACTTTGAGCTGATCCTGAATAATAAAGCCTTTTACCTTATCTG 
ATGTCCI 1 1 1 1 IGAGCTTTTTGCATTACCTAGAAGCAGTCTACAAAAAAGAACTATAGTAGTCA 
AGAATCCCTTCT AC7TGTTCATTAAAATGTTTATTCCCCAGTTATAATCTATTTCAAGCTGAAA 



>332 
>333 

^^T^^CCQGGTCCCATTCTC^TGTCA 
GAATGCAGAGGGTTGCAAAAGAAATCA^ 

TAGTTTTOCTCCCAGCCCTCTCTCTCTTTACCGAGGGTGAGCAATACAAAAGGGGTGTGCTG 
CAGCT CCAGCTTTCA GAGT ACCGCCACAGATCCACAGCCTGTCCTGATCTGAGAGCTGAGC 
CGATGCCI UICII I C rGGCTGTGTCTTTTTACCTTCTGG ACAAGTAGGATG AGGTG AAAGG 
AGCTGTCCTTCT AA GTCTT T GTATCTCCATTTGTCTG CAGTTTCTCCT CGGTCTCGTGTGTCT 
GTGTGTGCC AGTCX XTr reTGCr rGACACAGGACGTCCCTTAAAGTCCAGTCCTGGCGATTTCT 

GA ^^9^^III CCCTC ^ AAAG ^^ 

GGCTTAGTCTCTTTGTTTCCGGGCGTAAAAGCACTGGGATTAATATGTrTTC^ 
GAAAACACAGGAATGTGATGTCGAAAAGGGAC I II 1 1 1 1 M ICI 1 1 CACTGTGCTTCTCTCCT 
CCCTrrATTTCTCCCTACCTTTTrcCTCCT U I UNCI 1 11 II I 1 1 1 ICI IC I I ICTCTTGCCTG 
GATTGAGTCCCAGAAATGTrAGGACTACCTCAGTTTTGCTCCAAACCAAACTCAAACAACAG 
CAGCC ACT GG AAATCAAGGAAACTTCACTAAGAATTTAACAGATCAGCAAAACACCGCCTCC 
TTCCCATT TTAGCA CCTTCAGAGTGGACTCAGTGAGGAGTGAGAAGGCT GTTCTTTGGGQTR 

TAACCCCTT CCAAACCAAGTCATCGGGGTAACTTCTCTTCCAGTATGTTTTTACCGCGTCGAC 

TCCNNN 

>334 

>335 

^^^i^S?I!! A P U "^ 

°t gg j gaaaa <^tag^^ 

C ^I GCAA 9 CCG g AGA 7 AA X A TAT^ 

CCTCTGCrrCTCCCTAGAAAGCTTCATTATGTGAGTGATACATXTTTT^ 
GTGTGTGTGGT ATCATCAGCCTCAACATCTGAAGCAAATGTTGGGTGGGGGTN 
>336 

GACCCACG<X3TGCGGTTTGTGGAGTCGCGAGTGCTGGTGTCCGGCTGGAGTGCAA 
TGGCG I^ A IE ACAGCTCACTGCAACCTCC ^ 

CATGGCTTOCCAGTGATGTGGGAAAGATGGACTCTGATCTCCAGGGTAGAAGTGGCAACCA 
GAGT AAGCCAGTTCGTI GAAGCAAATGATGGCTGCAGT6AAGAGTACCGAGGCACACCCAT 
CJTCJAACAAGG^rO^^ 

CGAGGCCTCCCGTCAGCGCTTCAGGCAGTTTTGCT ACCAGGAGGTAACTGGCCGAGATGAA 
GCTmAGCAAACTCTGGGAACTCTGTTGT^ 

m ~ 4 ™— — GTTTCTGACTATCTTGCCAGAGGAGATCCAGA 
CCTGGGTGAGGGAGCAGCATCCAGAAAACGGCGAGGAAGCTGTGGCTCTGGTTGAGGATG
TACAGAGAGCTCCTGGACAACAGGTTCTAGATTCTGAGAAGGACTTGAAAGTACTCATGAAG 
GAGATGGCCCCTTTGGGAGCAACCAGAGAATCACTGAGATCCCAATGGAAACAGGAGGTTC 
AGCCAGAGGAACCGACTTTTAAGGGATCACAGAGCTCACACCAAAGACCAGGGGAACAGTC 
AGAAGCCTGGCTTGCTCCTCAGGCTC«>GGAACCTGCCTCAAAACACAGGTCTCCACGAC 
CAGGAGACAGGTGCTGTGGTCTGGACAGCTGGGCCCCAGGGACCAGCCATGCGTGACAAC 
AGAGCTGTATCCCTCTGTCAGCAAGAATGGATGTGCCCAGGCCCTGCACAAAGGGCCCTCT 
ACAGGGGTGCCACCCAGAGGAAGGACAGTCACGTCTCGCTGGCAACAGGTGTGCCCTGGG 
GCTATGAAGAGACCAAGACGCTCCTGGCTATTCTTAGTAGTTCTCAATTTTATGGAAAACT^ 
A GACC TGTCAGCAGAAGAGCCAGATCTACAGGGCCATGGCGGAAGGACTCTGGGAGCAGG 
GTTTTCTGCG GACCCCAG AAC AGTGTCG CACCAAGTTCAAAAGCCTACAGTTG AGTT ACCGC 

aaagtgaggagaggccgtgtgcctgagccttgtatcttttatgaggaaatgaatgctctttc 
aggctcctgggcctctgcacctcctatggcaagcgatgctgttcct ggccaagaaggaagt 
gatattgaggctggagagctgaatcaccagaatggggaacccacggaggtagaagatggca 
ctgtggatggtgcagacagggatgaaaaggacttcaggaatcctggccaggaagtcaggaa 

ACTA6ACCTGCCAGTGCTGTTCCCAAACAGACTTGGTTTTGAGTTCAAGAACGAGATTA 
AAGAAAATCT AAACATGGGATGATTCAGAGGAAGTAAG AAATAAACCAAGGC7TTTACAGAG 
GAAAGTTCCCGAGGAGTCTATTGGCACTCTGAGGCT ACGAACAGGCTTGGG AG AGTGAGCC 
AACATCCAGAAGG G C AATGT AG AAATTCTCCCAGGCAGAGTTGAGGAGAAACCCCATTCCC 
AGAGAGATGA GTCCCGAAGTTTGGTGCCGGACAAGCCT^ 

tgtccgatgtgctctcccaagcaggtcaatggaaaatttcaagtcggttgggaaactttcca 
gtctattgtcggnnn 

>337 



CGGCCGGCXXX^GCCTCGCCGCCCCGCGCAGTA . 

CT ACTGCCGGCTCCG CGCCCTTCCCCGAGGGCTGG ATGATGGGCT GTTT CGCCCTGCAAA 
CGGTGGACACCG AGCT G ACCGCGGACTCGGTGGAGTGGTG CCCGCTGCAAGGCTGCAGG 
CACCTGCTGGCGTGCGGGACCTACCAGGTGCGGCGGCCGGAGGTACGCGGGATAATCAAG 
GTGTCA<^TCCC^TGGCTGGACATGCCCTCTTGGGCTTGGCAGATGCCAGTGGATOCATA 
CAACTACTCCGCCTGGTG 



CGGGGACGAGCCCTTGAAGATCATCAGCAGTGACTCCACAGGGCAGCTCX^CCTCCTGATG 

GTGAATGAGACGA GGCC CAGGCT GCAGAAAGTGGCCTCATGGCAGGCACATCAATTCGAG 

GCCTGGATTGCCGCTTTCAATTACTGGGATCCAGAAATTGTGTATTCAGGGGGCGACGATG 

GCCTTCTGAGGGGCTGGGACACCAGGGTACCCGGCAAATTTCTCTTCACCAGCAAAGACAC 

^CCATGGGTGTGTGCAGCATCCAGAGCAGCCCTCATCGGGAGCACATCCTTGCCCACGGG 

AAGCTATGATGAACACATTCTACTGTGGGACACACGAAACATGAAGTAGCCGTTGGCAGATA 

^^CTGTGCAGGGCGGGGTATGGAGAATCAAGTGGCACTCCTTTTCACCACCAACCTGTTC 

TG G ACGTCTG CATG CACAGN 

>338 

>339 

>340 

>341 

CCCAAACAGGTCTTTTTATTTAACATAAGGCCAA^^ 

CCACTAACTGT ACAAAATATrGACTGCATGCCTCGCAAACACCAAAATATCCGCTGGAATGC 
GATAGAAATAAATAACTTCTGCTATAAACAGATGAAAACATATCAAACTGTTATCTrCTTTAAAC 
^ A ^GTAAATAAAAAAATTACCAGTACTTCTACACAATAAATATTAAGAAAC<^TTGACATAG 
TTGAAATGCACT CATATAAATTAACAACTTTAATTACATTAGCCAAACAGAC^TTGGrrTAAAGA 
ACTGCATGTAGT ATGGAAAACAAAACAAAACAAAACAAAAAACAAAGTAAAAAACCAACAAAA 
TAGAAACAAACAAACAAACAACATGAACCACAGAACATAAAAAGTTTTAA^ 
TGAGAT TATCTT GGCTTTCATAATTATA 1 1 men 1 1 aaagaaaaatatcaacccattgtcaat 

GCACT Gr T I I 1 CAAAGCATTTAAATAGAGGGTAAAACCCTTTGGAAATTAATACAGAAGAAAT 

GATTCACTTTATGCATAAAAAATAAATAATAATATAGCTGAGACATGTGGTTTGC^ 

TGAAGATGTGAACAGCTTCTAAGCATTCAtTTTCTCTGACCCAT^ 

AGAGGGTTTAATTTAAACACATACAATGTCCACCCCCAAACCTTCTGCCCACATCT ACAAGTT 
^ATTTATTTTGTGGGTTTTG AGGGTGA CnAAGTTTTTCCCT ACATTGAAAAGAGAAGTTGCC
AAAAGGTGCACAG G AAATCATTTTTTT AAG TG AAT ATG AT AATAT G G GTCCGTGCTT AAT ACA 
ACT GAGACATATTTGTTCTCT G 1 1 III II AGAGTCACCTCTTAAAGTCCAATCCCACAATGGT 
GAAAAAAAAATAGAAAGTATTTGTTCrrACCTTTAAGGAGACTGCAGGGATTCTCCTTGAAAAC 
GGAGTATGGAATCAATCTrAAATAAA TATGAAA TTGGTTGGTCTTCTGGGATAAGAAATTCCC 
AACTCAGTGTGCTGAAATTCACCTGAC 1 1 1 1 1 1 GGAAAAAATAGTCGAAAATGTCAATTTGGT 
CCATAAAAT ACATGTT ACTATTAAAAGATATTTAAAG ACAAATTCTTTT CAGAG CTT CT AAGAT 
TGGTGTGGGCAGATTTTTAAGAGCCTAGAGTTTAGTCTTAGAGAAAGAGTGAGGAGATAGTA 
AGGTTAGATAGAG CCACT GAGTTTCAAGAAAAAAACATACTACT AAGAATCCCATATGTT ATA 

at ttaa agcctttaot 1 1 1 ggcctcatgctgct aggtg aaagagtggttgttcacaggacttg 
tattttccaagatgattaaagatggtaaaatactatctttcaatgttatcaaaaaatggtagca 
acttatacttctatttcaaagccatataaatttaacaaaattaaagtttgtgggtgtttgataa 

CCGAATCACTCAATATCCAATTAAAATATGGAATAAGTTTCAAATA^ 

CTCTGCTTCTGATAACTGTGGTCACTAATCAACCCCCATGTTATCCCCGATATGTCT AGGACT 
TAGCTTAAAAAGAT AG^GAT GTATTTGAGATGAGTGGGAACTATACATGGGTATCAGCTTTC 

ttggtaaatttctgctttttccatggcctt gaccatctttgct actgcgcagtgaaccaggac 
tt aattt ctcacac t gttt ctgcaggttccggtaccaaagaagatgcagttcaaaatactgcc 
agttttccaagaaattttgt aaagttgaacatggccatctactcttgccttaaaacttttctca 

CCACACCCACCTTGCCACATGCATG AT ATCCAAGGTCGACAG ACCT GGATTAGAATCCACTC 
TCAAGCTTCTCATGCAGTGCGTATTGTATTTTCTGCATAAGAAAGGGCTGCCTCTAGAACACA 

GTAAGTGTATTTGCCCAGT agtg^acattgcctacatatagccaagtgttatagtataccaact 

TAGT ATAi 1 1 1 1 CAAGGAGAGCTAAACCACCTTTTGTAATGTTTGGTTTCTCACTGTTATCTTC 
CTTTCCT ATAATTAATTTATTTTAATCTACAAATTGACATAGG GCTAAAAGCTTCAATATTTTAC 
^ AAA T*TrAATTAATGTAATTGTrcOCAATTATTAGAAACTTTTTT CCATTTTTCAAAATGTTTG 
CCAACTTCAGACAAGTGTGTAAAAATAGGGCTCTGGATTTTCAAAAGCACATACATGAATAAT 
TT ATT AGCT ATTCCAGGCAAGCTAAGTACTAGAATAAACTAGATAAAAACTTGGCTTTAAGCA 
TGT ACTTT GATATTTAT AAAACAAAG GTGTTTTTTTTTCATTTCT G CATCT G AATCAATACAAAT 
TTACACATGAAGAATTTTGGCTGTAAATTAAAATTTTAGAGATTTrrACT 
TGCCTCCTCTT AACTGTTCT CTTCCTTACTCT CCCTCTCTTAACACTTT CT CTGAAATTAATTAT 
CTTTT AGT T AGTG ATGACATTATTTrTCCCAGCTGGTACAGGTAAT ATTTCT CACAACGAAGC 
CTTAATCATTTTTAAAAAAT ACACTCAACT CT AATTCTG GCCT AAAT ATT ATTTTTCT GAAAAAG 
CTTCTGATTTAAGATTGATTCCAATAGACACTAAGTTGGAAAAACATGAG 
TTATCATCCTGTTTT<XXX2ACAGGGGATAACTTGTAGAAGTGGGAGGGGCACAAAAAAGAGG 
AGGAAATTTCGGACXjOjTGGGTCGACTCCCTATAGTAGN ^v^wwv^oc 

CTG GCGGCC &AGGT ACAGGTTT AGTCTGAATGCACTGTCATGAAATTTAAACTTTCA 
TT AT A AT ACT G 1 1 1 1 AAGAACTTACAGCATCTGCTTTACAAATGGTGTTAGCTACATGTCGACA 
CAGCAi^i i lAGCCAGI 1 1 ICI 1 1 1 GGAAGTTCATCTGATGTCATCTGGAAACTGAGTAGCAC 
ATTTGCCTGCTCTGTTGGTGGCCTCACAAGCAAGGCAAAAG CATTATGGCAATCTA 
Cj^TAACCATAAACATTAAGTGTCACTCCTTGGAAAATGACAG^ 

CCCTCAGAGCAATGAAATTCCAATGAAATGAACT ATCACTTCTCCAC 1 1 1 C C 1 1 G I CCTATTTT 

TAATAAGACAAAGAACATCACCATATTAAGTTGAAGT 

>343 

AC ATCAGAGATGCTCACACATTCTTTGAGTAGTTTAAAAACTCATTT^ 
ATTCTTTGTATTCAAACCAATCACTGGCAATAGCTCT AAGT AGGTCATCAACTCTCCTCCATG 
TCTTCTTTCTAATTCTGCCACAGACTCACTTCTr 

TGAGTTCTT AGAATCTCAAAAGGCATGAG GATAAAGCTTT CCT GGAGATAAT ATAAGTGGTG 
GCAGGAAGATTTGGGAGCCAGATGATACTCTTTTCCTCTT AGAGAAACTCTGTGGAAGCTCT 
GCCTATACTGT GGGAAATAAATTCTAGACGCTGGCTTCTTT CTGT AGTAAACATGTGGGCCC 
TmAAAATGTTGAACCAAAATGTGCTTCAAATATAGTTT agttataaaagatttatgggggagt 
atgt atgtgccaactacagaggcttcagagatgaagaaacagttctt ACCCTAGTGTTGCTT 
agaat ctagt agt agt aagtaataattactaacatatgcatttact atat AGGCAATACTAGGG 

- TAAATATTTTACATAGATTACCTTATTTAGTAGCT CTTAGCT GCrrAAAAAAAAAAAAAAGATAA 

QCCATGGAAACACAGQGOCG<»G^CCCTCAGCTCAGTTTACGAAG 
GGTGTCTTTAACTGGTAA1TACATGAAGAAGAAGAGGTTGGGCCACCAC 
GACCACAGGCAGGGCACAGGAGTATGCATTTGT AGGCTGTATATGGGGAAGTAAAGCAGAT

TTGTCTTGCTAGAGCTAAAGATGTAGACAAGAGTAAGTATTATGTAGGAACTTCGATCTGACA 

TGTTTAGCCArrGGG(^TTGGC^TCTGGTGGAGAGCTGTTGACAAAAAATAAGTAATTTGG 

ATTTrAAAAGGATCGACTCGGTGGTATAGATAATGATGGCTGGAAAATGAGGCGATAGAGGT 

AGACAAATGAAGAGGTTATGGAAATAATCCAGGCAAAAAGACAATGAAGGGCAAGGATGGAT 

AGGAGTATATCCCAGTATTCTGGCATGCTAGAGTTCAATTGAAATGTTTAACTTGATGAATAA 

TAGGTAAGGAAAGTGAGTGAAAACTAAACTACTAGATTCTT AACTCTTAAACCCTTCACTGGG 

TCAAAAGTAATGAGTTTG AACATAGTGTCGAAATTAACCT G ATTCT ACCTT GTGTTTG AAATTC 

TAC ATTC^ TAAAGAAAAAAAAAAAGGAAATGTTATGGTCGGTCCCAAATTATAGTCAATTATAA 

CTATTTTCCTCAAAGTTGGTCTT ACCT ATGTTTAGTGGTAAGAATTGATTGTTT AATAT AGGAA 

AAAAGACCTAAAATTGGTGAGATAATGTGATTCAAATTCCAAAAAAAAAGGAAAATTAGATAG 

CTCTGCATTCAGAAGAATTGAATGAfATTTTAAAGACATTCA^ 

ATTCTTCACATTTACTTGTATTCCAAAAACAGAAAACTATTGA 

AAAGT 

>344 

NNACACATTGTAATATTATATCATGTATAGTTGTACGCAGCTCTGTGCATAACTGTGG 

TAACTTTGTGGGTC^CTCCTGTGGGTOJTGAAACAATGCAGTTCTCCCCGCGTATGCGAC^ 
CCGGCTGATGGTCATGGAGATCCGCAATGCTTATGCTGTGTTATATGACATCATCCTGAAGA 
ACTTCGAGAAGCTCAAGAAGCCCAGGGGAGAAACAAAGGGAATGATCTATTGAGAGCCCTC 
TCTCCCATTCT GTGATGAGTACTCTTCTGCACTGTTC I NC I 1 1 CT AATAAAA U I MC I I II IC 
GAACCTATACTGTCTT CT^TAAATTCTrTCTTACT ACCCTATGACCCGTGAG CCAACCACTTTC 
CGATGCCAGGGTTCTGACACCTCACCTGGCAT AATAT AAAGTG I H 1 1 U I I IATACCCTTCC 
ACTTGGAAAGACT ACAGAGGAATCTTGCACTGCATAGTTCAAACTAAAAAGAGAAGAGTTAAT 
TACCTGAAAAG CAAGAG AAAACAAGAAGGGGTAAATTTTGAACCAAGGGAAATCATTTAAG^ 
AGTGTCTGGTA > 1 1 1 1 CAAA TTTCT GTC^GTTGTTAC^TTTGTCATAAGTAAATGTTTAGGAAT 
AAAGGATGGAGACATGCTT ATTTTATTTAACTCCCCCAAAAAAAAAAAAAAAAAAAAGTACCT 
GCCCGGGCGGCCGCCCCGGGCGGCCGCTCN 
>345 

tATAGGGAGTCvaCCCACGCGTCCGGGGGCTT AAGCGGGGGGAGTCGAGCCAGCG 
TCGCCGCGATG GTGTTGTTGGAGAGCGAGCAGTTCCTGACGGAGCT GACCAGACTTTT CCA 
CAAGTGCCGGACGTCXSGGCAGCGTCTATATC^CCTTGAAGAAGTATGACGGTCGAAOCAAA 



GAGCT ACCGATGGGAAGAAGAAGATCAGCACTGTGGTGAGCTCCAAGGAAGTGAATAAGTT 
TCAGATGGCTT ATTCAAACCTCCTTAGAGCT AACATG GATGG GCTGAAGAAGAGAGACAAAA 
AGAACAAAACTAAGAAGACCAAAGCAGCTTGCAACCACT AGCCTGGGGAGGGTCCGCATGT 
GTCAAGGGTGAGGGCAACAGATGCTGGACCCAGGGAGCTCTCTGCCACAGGTCAGTCTAC 
AAG GCCT CAGGGACCAACTTGCCAACAGCT GGACTT GATCACTAGCTGGCAAACTGAGCTC 
ACGTATCGGGTGGAATAACAAGCGGACTTTGCTCTCT GCT gtgcaaaacgctgtttttagag 

catttgccacagcagcggatagagc^ggagagcacc>cxx5gagccctt gagacatcctt ga 
gaagagc^cacagcataagagactgccctgcttggtgttttgcaggat^ 
gaggag^cttctg^attgctggttctgttccttgcagcttt^ 
caggacccagccatggtgcattacatctaccagcgctttcgagtcttggag^ 

AAAAATGTACCCAAGCAACGAGGGCATACATTCAAGAAT^ 

CT GTCATGCTG<5GAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAA 
GGCACTGAGAGTTGAACGTGCOCAACGGGAGATTGACTACATACAATAGCT 
GACGAGTGCATCGAATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAG 
AAGAGAAAAAGATCCGGAGTCTGCTGAATGCAAGCTGTGACAACATGCT gatgggcataaa 
GTCTTTGAAAAT AGTGAAGAAGATGAT GGACACACATG GCT CTT GGATGAAAGATG CTGTCT 
ATAACTCtcaAAGGlgtACTT ATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAACAT 
ACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCT AACACTTTCCT 
GGCACGGAACAGG(X>AGTGATCTACAAAGGTTTTCT ATTTTTTGATAACCAAGCAACTTCTA 
ATGAGAT AATCAAATAT AACCTGCAGAAGAGGACTGT GGAAGATCGAATGCTGCTCCCAGGA 
GGGGTAGGGCGAGCATTGGfrTTACCAGCACTCCCCCT 

ATGAGCATGGGCTCTGGG*XATCCACTCTGGGC^ .. 
AAAGA1TGAGCCGGGCACAC7X3GGAGTGGAGCATTCATGGGATACCCCATGCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCT ATGTGGTCTACAGTACTGGGGGCCA 
GGGCCCTCATCGCA TCACCT G CAT CT AT GATCCACTGGGCACT ATCAGTGAGGAG GACTT G
COCAACnrrGCTCTTCCCCAAGAGACCAAGAAG T GACTCCATGAT CCATT AGAACCCCAGAGA 

taagcagctctatgcctggaatgaaggaaaccagatcatttacaaact ccagacaaagagaa 
agctgcctctgaagtaatgcattacagctgtgagaaagagcactgtggctttggcagctgtt 
ctacaggacagtgaggctatagccccttcagaatatagtatccctct aatcacacacaggaa 
gagtgtgtagaagtggaaatacgtatgcctcctttcgcaaatgtcactgccttaggtatctt 
ccaagagcttagatgagagcatatcatcaggaaagtttcaacaatgtccattactcccccaa 
acctcctggctctcaaggatgaccacattctgatacagcctacttcaagcctttrgttttact 
gctccccagcatttactgtaactctgcx^tcttccctcccacaattagagttgtatgccagc 
ccctaatattcaccactggctt ttctc t cccctggcctttgctgaagctcttoooc c i m ic 

AAATGTCT ATTGATATTCT CCCATTTTCACT GCCCAACTAAAATACT ATTAATATTTCTTTCTTT 
TCTTTTCI MINI IGAGACAAGGTCTC^TATGTTGCCCAGGCTGGTCTCAAACTCCAGAGC 

tcaagagatcctcctgcctcagcctcctaagt acctgg gatt acagg gatgtgccaccacac 
ctggcttaaaatactatttcttattgagxstttaac 

t aagcttgg tagatg t aataat aaagtgaaaat att aacatttgaatatcg ctttccaggtgt 
ggagtgtttgcacatcatttaattctcgtttcacctttgtgaaacatgcacaagtctt^ 
ctgtcattctaga gtttag gtgagtaacacaatt acaaagtgaaagatacagct agaaaatac 

TACAAATCCGAT AG T 7 T T T CCATTGCCCAAGGAAGGATCAAATACGTATGTTTGTTCACCTAC 
TCTTATAGTCJ^TGCGTTCATCGTTtcAGCCTAAAAATAATAGTCTGTCCCT^ 

catgtctgcacaagacctttcaataggcctttcaaatgataattcctcx^gaaaaccagtcta 

AGGGTGAGGACCCCAACT CT AGCCttctCTTGTCTTGCtOteCtcTgnTCTCTCTTTCTGCTTTAAAT 
TCAATAAAAGTGACAGTGAGCAAATAACCTCATCAGgttATATtlGCCcaCATACc 

>347 

NNN AGGCAC CCAAAmCATGAAGGCAGATTTCTCCCTCA^ 
TATAAGGGCC7TrTAAAAAGCCAACAAACTGT(^GAGAGGATCCTTTACCAG 
TAGGAAAGAAATGGAATAAGACA 1 1 1 ICI 1 1 I GTCGTGACGGTGTGTGTGTGTTCTGTTATGA 
GATGGAGTCTCACTCTGTTGCCCAGGCTGGAATGGAGTGGTGCGATCT CAGCTCACTGCAA 
CCT(XX3CCTCCCGGTTCAAGCAGTTCTCCTGTCTGAGCCTCCAG^ 
CAGGTGCGTGGTACC^CGCCCAGCTAATTTTTTTATGTTTGTAG^ 
TGTTGGTCAGGATGGTCTCAAACTCCTGACCTCA.GGTGATCTGCCTGCTrCG GCCT 
GTGCTGAGATTAGAGGCATGAGCCACCATACCTGGCTCTTTTGCTTCATC^ 
CTTTGCTGGAGCATTTTAAAGCAAATATCAGAGAT ACCCTTTCACGCCTCACACTT CAACATG 
CGGCTTGTTGAAATTCGTGCTCCACTCCAGCAAC I GC 1 1 1 CAATCGGAGTTCCATOCTCCGC 
CGCAGTATGCCCTAACGCAGCGTTATCTTCAGAGCTAC^^ 
GGAGCGCTTCGCCACCACCTGCACGGCGAAG 

CTTTGCGCCTGOGT QCCG<X>GCTCCQTTTGGTTCCGCTT CGCT GGCGOCCGCOGGGCTCT 

>348

ACTTGACTGCT AACAACTTTCAAATTCTTCTACTT ACTCCC IC I I CI I CAGCTTCACAT 
CTGGGAAAACTGAT AGGGAAGCCTAGGT AGGCCTT ACtnTTGGTGCCAGAGGGAAGCTCAAT 
CCATGCAAGCCCCAGATAATATATGAGAACCTCCCCAACCTT ACCCTAGACCGCTCACCTCC 
CAATOCAAGCCAGTCTCCT 1 1 CCCTGCTTT CT CAAACGATGTTTGGACCTGCTTGGAAGGTC 
CCTCTGCT CTCCCT AGAAAGCTTCATTATGTGAGTGATAGATCTTTTCATA T CT TCT TG GTGT 
GTGTGTGTGGTATCATCAGCCTCAACATCTGAAGCAAATGTTGGGTGGGGGTN 
>349 

GTCCGGGCTGGGTCTCCGAACCT GAAAOCGGGAGCTT CCT GCTOGTGTTCGCTGTT 
GAGAAGCT ACCCG CGGGGTTG T AGACCT CGGACCTCA1 GGCAGAGAT AATTCAGGAACGGA 
TAGAAGATCGGCTCCCGGAATTGGAACAGCTGGAGCGCATTGGACTGTTCAGTGATGCGG 
GATTAAGGCTATCATTAAGAAGGCTTCCGATCTAGAGTACAAAATCCAGAGAAGAAC 
CAAGGAAGACTTT ATCAATTATGTTCAAT ATGAAATT AA I C 1 1 1 IGGAGCTGATCCGGAGAAG 
AAGAACAC GCAT TGGAT ATTCATTTAAGAAGGATGAGATTGAGAATTCT ATTGT ACACCQGGT 
ACAAGGTGTTTTCCAGCGTGCCTCAGCAAAATGGAAAGACGATGTTCAACTTTG 
ATGTGGCTTTTTGTAAGAAGTGGGCTACTAAAACTCGACT^ 
TGGCG ATTCA TTCCAACAAAXCAGCTTTGTGQATTATG 

CGATTGTCTTCAGAAAGCGCAAGGCAACTAI M CT TCX5CGCACTGCGCTTTCATCCAGAGTG 
CCCAAAACTTTATAAAGAATACTTT AGGATGGAGCTGATGCATGCT GAAAAACTGAGG AAG G
AGAAGGAA^TTTGAAAAAGCCAGTATGGATGTGGAGAATCCTGATTATTCTGAAGAAATC 
CTTAAGGGCGAGTTGGC^TGGATCATCTACAAAAATTCTGTAAGCATAATTA^ 

atttcacgtgtcactgctttcgattgcacagct atttgactttgccaaagatctagaaaaaga 

^T~^ a TGATGACCTTCAGGCTCTA(>*CACAGATG ATCCTCTCACTT GGGATTATGTG GCAA 
GGCXSAGAATTAGAGATTGAGTCACAGACAG AAGAGCAGCCT ACAACGAAACAAGCCAAAG C 
AGTGGAGGTa^CCGGAAGGAGGAGAGGTGCTGTGCTGTGTATGAAGAGGCAGTGAAGAC 
TCTGCCAACAGAGGCCATGTGGAAGTGTTACATCACCTrTTTGCTrGGAAAG ATTTACTAAGA 
AGTCAAATAGTGGGTTCCTTAGAGGGAAGAGGTTGGAAAGAACCATGACTGTATTCAGGAAG 
^^^^^-^^^^^^^-^^^-^GCT"^ ^TTGTCAXiAATGCX^AAXAOA^GCAXjTTGAGTGTnrTC^TX&CTGTG 
TTATAACTTCCTGAGGGAAGCTCTGGAAGTGGCAGTAGCT GGAACT GAATTGTTTAGAGACT 
CTGGG *£^ TGTGG ^GCTGAAGCTGCAGGT^ 

CATGCTTTTT GAAGAAGCCTTTGTGCACCTGAAACCCCAGGTTTGTCTGCCATTGTGG^TTTC 
CTCGGCAGAGTGG^TGAAGGTGCCAAAAGCCAAGAAGAC^ 

GCTCTCTTA<^TGTCATAGGTGCCGACTCAGTAACCCTGA\AGAATAAGTACCTG GATTGG GC 
TTATCGAAGTGGTGG CT ACAAAAAGGCCAGAGCTGTGTTTAAAAGTTTACAGGAGAGCCGAC 
CATTTTC^TTGACTTTTTCAGGAAAATGATTCAGTTT^^ 
TjGGCGAACATAAGAGAATATTATGAGAGAGCTTTGAGAGAGTTTGGATCCGC^ 
CTTTGGATGGAT7ATATGAAAGAAGAATTGAACCACCCCCTTGGT agacctgagaactgtgg 
ACAGATCT ACT GGCX5AGCGATGAAAATGTTGCAGGGAGAGTCAGCAGAGGCATTTGTAGCT 
AAACATGCT ATGCATCAGACTGGCCATTT ATGAAGATGAAGAATACAGTCAGCTTTGTG aaat 
AGTATTGCAAGCAAGCCCCGTGGGCAAATTTGT ATTGAGTCCATCTGTAATTTGCTCAGTGA 
TGGCA GACAAG ATGGC TGTCT GGTTTTGAGACACACTTTAATTn"ATGTTAACTTGTTAAATC 
1 1 1 1 1 AAAAATT AAAAAATTTTTATGATTGACT AAAAAAAAAAACCT ACAAACCTTCCN NNNNN 



NACAGCT GGCGAGTGTACCCGTTTCTGCGAGAGAAGCT AAAGAATGCCCTTGCGTG 
AGG^AATTAAATTTATGTTCAAGCTAAATGTCGTrTATC^CACT^ 

T AACAAGTTCTGGTGGAAATCCGT ACATGGAT ACG TTC T C I I C I GGGGGCGGTCTCCAGTCC 
TTTCT C*TGAGGGAGCACACTCCTCTGCCTCATTG CAGTGGCCTCAGGGAT ATG GAATTAAG 
ATCGAC ^ GTGTGATG **TAAA 

?™?J G *^ TCTGG ^ T ^ GG ^ TCCTC ^ TCC ^ 
ATCGACtUUVAAAAAAACACTGGG 

ctagaatttccagtagaccagcagacagccgggaaaccagat^ 
gaaaaaaattcxjaagccagcctgagaagggccctatttcaatgctgt^ 
gatcxttcaacagtgagao^ggatggaccaacatactcaagctcaaggaa^^ aacaccg 

aggg ^ ac tctcatot 

CTGGGCATAGGGACCTAGCTGACAAGACGGAGTACCAAG 

C 5*^! C T CGGG ? C £ GG ^^ 

G«**AATCAGAGGACACCGGCCGCCTTGGT^^ 

AAAGAATGAAACGTACAGAAACnrTCAGCAAGTGATCTG C 1 I ICI I GTGTTGTCTTCTCCTGAC 

CNNN 

>351 

KN^WMN NNCCGC CCOTGCTTO^ I rCTG GATGTGA 

CCATGGTGATGATGATGAG GAGGAGGATGATGATGATGAGACACCTCTAAACTTGGAACAA 
GTTT AAGACTTT ATG AGAGAAG AAAAAAAATCACCAACAAGAATTGTTTGAGGAAAAATCATA 
ACTATCtJrGTGTTCAI " " " " nATAAACAATAAGAAAAAGTTGTTGG AI 1 1 1 1 II I I AATG 
Ai i iu 1 1 1 1 iGGGGGAGGGAAl 1 1 IGI I GCAGTTTTATGGTGGAAAATGCAAAAACCAGAGC 

^ A ^ G 5 A T^JST GTAA I CTGTGGATAT ^ CCTG ^^ 

i n i QGG GATACACATGTGAGATACT AAGTACTTGCAGAAG A 1 1 1 1 fGTCI C TfTTTTTTAAA 
AATGCATCACCAACAGCGAATGGCTGCCTT AGGGACXjGACAAAGAGCTGAGTGATTTACTG 

gatttcagtgcgatgttttcacctcctgtgagcagtgggaaaaatggaccaa 
aagtggac>ttttactgggctcaaatgtagaagacagaagtagcrrcagggtcctgggggaa

tggaggacatcccagcccgtcccaggaactatgg^gatgggactc^ 

agcagggacctgggtcacatgaccatctctctgcacctttg^ 

TACGGAAGGGGGCCATTACCTCCTATGGAGANN 



CXjCGGTGGCGGCCGAGGTACACCXIAGCTTT GTCrrCXTrGGCCCCAAATCTCCTTTTr: 

cttactttgggcattaactgctgttgaggtctcac^gcctgatggtcattatccct^ 
cataaatcaacaggctgtatgagcattgtgtgagattct acatgagggagagcatttcaaac 

CCATGACAGATGAGAGAAGTT AGT ACACTCT CACTGAACTGGGGATGTTTGACTT AAAATGA 
JGGACAATA^^AGTGAGCAGTAAGTGTGCTCTAGGCTAGGCTACGAGAGGCCATGAGCT 
CCTCATCTC I ICi CTGTTCT GAGC TCTCTrGATCCACCGCACTTGGGGCAGGGGGTGCATTCT 
CTGTGCCTCTCCTGAGTCT ACTTTCTG CATCATT GG TT CT CCCAG CT CACTTCCAT AATGTCC 

I2£L!P GCTG ^^ GGAA ^ GTGTG ^ GTCTAGACCCA 



^.^^J ' • " » 1 1 • 1 1 1 1 1 1 1 1 1 1 i 1 1 GCCTTTAGAAGGTTAAAATGCCAATATAAAGCT AAA 
^^^^X^^Z^^^^^^^^^^^^^^^^^^^ ^* ^^^QCT ACTGTTTTTACT AT ATAAATCTTT AC 
GTGTTAATGGAAAGAAAATTAATTCATTCTGTTACTCCATTTTTTTCTCTCCATA 
TGA^TGAGCTGATGAGGGGCAGAAAGATCATACAGTTAGGAATGAAGACATCAG^ 

^^^l^^y^^X^AGCTCACACMAJGAAACGCACATOTCCATGAATGCTTTC^ AATAAAT 
GCTTCCAGGATAGT ATCATAAAGAAAGTCAAAATTAAGAAAAATCACCT(X^TGGCATGCTGG 
TOOTCTCCATCAGCTCACC I I ICI ICI I ATCAGAATCCACAACTGCNNN 

>356 

JWGTCGCOJTTGTGAC^^ 
*CCGJCCX*CC^CWrrGT(XXXrrCAATTTCT ACCCCT CGCCCCTT AAOCCAAAGCT AAATCCA 
CCATCTGGTATTC TCCCC T AGAGCACCAGATGACCATCT GAAGCTGGACAAGTGTCTCTAAC 
AATAAAACATT ACT GTTT ACAAAACAAAAAGCACAAACAT AATT ATGGAATAAAT AAAAAACAA 
GG^CAAACAGOCAACTG^ 

AATGCr rCTGA AAGTTT CTTGGCCCACAGAGGACTAGGGTGCAATCATTCCXTGTGTTAGTG 
^IJ^^^^f^^^^^^f^^J^^ 001 ^ AA^GGGACTTGGTGAAATGTTTACATTAAAT 
ATTTCACTCC TACCCATTCTTC AGGAAAAAAGGTGAGCTCAGC^ 

AGA I A T rTACTGI 1 1 101 1 1 1 1 C ' A ^*GCTAAAAAAGCAAACTTTACACGAAGAAGCTCTTGAT 

T AAGG ^Tir c ^ T ^ T ^I A ^ 

AAAATGGGGGAGAAAAAATAGGTCIGGI IGI IGTCCCCCI MM CCACCTGCTGCTGGACAG 

TGATGAGATGCTCACAGAAGAAAAAGGCCTGGCTTTGT ACCAGGCTGGCGACAGGTGCTAC 

CAGGAGTOGGCTGAC«5GA^ 

AOTOC^TTCCCIT^CCCACT^^ 

OCCCTC °CAAOCCCAACCCCAGA^ 

GCAATCAAAGGAAACAGTT^ A AC ACCAAACAATTTCTTAAAGOCAAAAAATATTTTTCATGGA 
^''^ AGA * t ' ' '^GAGTlilGI I IIM ICAAGTGTAAAAGCAGTGAC AI FTTG1 ICAAACAGAA 
G^fCCATCTAGGAATI ulGGCACTTGGGTTCT AGGGGGTT ACAGGT ATGCATCATGGATTCT 
ZVjWX^CGTA^ 

TOTCOTCTATOGGACAGTJGG^^ 

AGGAGACATCCCTCATTCACXrrGCTGGGCTT GCM ICI GATTCAGAGGTAAGTOGAAGTGCA 
CAGAAAGAAACJTACAAAAOC^ 

actg^ttgctggagactaocctct^ 
AA JTI S ^ TCAAACATGGGGGAGG ^^ 

GG *^^^^TCTACnrrCTX3CCTCA^0CTGCCCrAA OL 1 1 1 rCTG A GTO OC TD O CTCTCI I G TG 

C AAGC j£TOAfiTTTAAAAAGG 

ACTDjGWATAAAGTrrGCT^ 

~VZ-f£Zl? iGCMXX ' 1 1 CC I CI CATOQGATATGGAGTG A 1 1 iiucic i o gctgctgogacq 
cagatctgagccacagtcaggtaccaatgta^ 

AGGTGGGCTGCTGC I ICI MCI CTCTGCCOCT AGTCCAGGCTCCTTTGCTTCAOGTAAGATT 
AACACTTT CCCATTCCTCTGAAGTTGCTGGAAGGACATTTCCCAGGAAGAAACAATTCCTCA

CTGCCT ATAAACTGTAGTCTCATGTGGGATAGTCAATTGAACATGAGAATCAGAACAATCTG 

GGCAAATGGGTATGGCAAGAATGG^GAACACCACAACAGGAGAGATGCCAACTCT CATTCAT 

GCCAGGCCTTTTGGCATCTGGGTGCCTTCTGTGTC I I CI I I U CACCTCTTCCTTCAGTCTCAA 

CATCCA CTTGTAG CCCCAGCT AGCTCCXATGTTTCCAGGTATT^TTGGCTCTTAACTCCCSACA 

AGCCTGCCI 1 1 IGGCTACCCATCCCAACAATATCAAGAGGGAATGACTAAGTATCAGCTAGA 

AACTT AGCCATGTCTCAACATTCCT ggattatctgaaaagct gtcgatgccctttt acaggtt 

tatggtgacagacccgtatcatcttaaagtatgttcatagttaaggc^ 

agagaaccagacataatgisaaagacctcttcaataatgttgtcatgcctctc 

ctcacagtcacacttggtttggctccccaaacccacaatag^ 

gttt1 1 catctgttttgtatttaaaggcattgggttacttcctcctgccctc 1 1 mc i 1 ccctg 

AACAAGAGTTTACAACT CCT CATGGCTTCTT AAT AGGTGAAGT AGGTGAAAAGTCTGAGAAG 

CnT^ACAGCAGGGTTTGCCGTCCCAACTATGCAGCTGAGAGGTCGCC^ 

CCAGCCCCACT AT AATTGGCAG T ATGTTTGTT CATGTTTCCTGAAAACA 1 1 MC I 1 1 AAAAAG 

GAAAAGAAAAAAATGCCAAACAACACCAACCAAAAAAAAAAAAAAAAAAAAAAGAAGG^ 

GACAAAGAGCAACATCCAAACATTCCCCAAGCCCX^ACCCX^AGTAAGTGT GAGATTATCTTAT 

TCCTTCCCTGAAATAATTAT AAAGAAGCATTTCAGGCAAAATACTT AGTATTAATG CTTA 

CTGCTCAA CCTQ(X SAAC€ATGCCCTTTTCCCTTTTATGTGTATCTCT^ 

^ATGGCTTTCXiACAT ACAAATACAATAGAAAAGAAAGAAGTCTGGAACXJT GACT ATCAT 
GGGACCAAAAAGTATCTTGGCCCTTTGGGAG 1 1 1 CC 1 1 GTCAGAAAGTATAAGCCTCAACAG 
GAAATAGAGGC ICCI ICIUCI 1 1 GAGTTCAATACCCTCCCTGTGTCCTACTCACCAGGAAAAT 
AAGTGTGTTCATATCCCACCTAATTTACAACAGAAGATAACCC^ 
AAAATACAAGTCTATGCCCATAGAACN 
>357 

*CGATCTCAGCTGACTGCAAOCTCTGTCT0CCGGTCAAGTGATTCTCCT GCGTTAGC 
CTGCCAAGT AGCT AGGATT ACAGGCACCT GCX^TCAOG^XH'GGCTAATTTTTGTATTTTTGT A 
GAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCT CAGGTGATCTGC 
CCACCTCGGCCT CCCAAAGTGCTGGGATT ACAGG CATGAGCCCCX3AOGCCCAG CCCACACT 
TTT AATT AATGGAGAATTGAGGCT ACT AAACACATACTTCAAGGTATGAAAGGAAATTTGA 
CACTGTT AAGAATTATTCTGGAGAATAAAAGGATCT GAAGGTGGCATCAAGAXaAATCACCCA 
TCTCCATATATGGGAGCT ACTCTAGAAAGAAATGCAAAAGGTATATTTCAGT AATCTTTTCCC 
AACTTCTCTGn CCCCT GACAATACT GAGATCAACAGAAGCATTGGAAGGTI I CT1 1 1 AGATA 
ATAGTGAGAGATAGAGGCAGATTGCTAGCCAAATACAGCTCTGGGTT ACTXTTGATAACACAG 
GTTCCT TCG AATGT ATTTATTrTCCGAAATTAAAAATAAGCAGGGATGGAAGTAGTTATTATGC 
Aim l CAAATTCTCCTTTCAACAGTCT ATATAAATGTTGCAGAATGCT AGACTAGGATGAAAA 
gtaggagtc^tgagaacttgcagaagcctgagat CCTAAAGGAAGTGT ACCATCTGACTTGG 
CAATGT AAGACACACACGTTAGTGTGGGGCACAAACGTGGAATATTAGGAGAGAGCTGGTT 
CCAGGACCAAATTXXAGAGTCACTCGGGGAAGGAGGT ATGGTGGCAACACTTT ATGCTTAATA 
TTCAATTCTGCTCCA^AGAACATGGTACCTOGG 

tc ai rrrg i cuccattcctct aatatgtcaccaattctgctgataca i i c t i ro i a atctctc 

CATCTATTTTAATCTGT T ATTCAXXrmA GCTACAC AA 

CACAAACCAACAATTATAATACACAGGGATAAAAAAAfiTAGAGG 

>358

KMAGAAATTGTTTTCAAGATTGG IGCI 1 1 1 GCACCATTGATTAGACGAGGCAACCAA 

gtgcatttattttgatgc^tgcctggcoxtaatctg 

caacctgagt AAAAGCAGCACT ggtittgagttttctaagctcagggtcttcattagagacc 
tctggaaatacattaaggatggtgggggtagataatcc^ 

gctctt aaaat aagaaagctgagactgaggaggtgaaactg>aaaat aaaaacagaaagttca 
tcctctaatccccagtg ct gta tttcttcaaggtgcatcaggcct aggacagtggttaagtg 
ttgggagctggagatcattttatcactgagttgaga 1 1 1 ci i m i cctgcagagtgtgggag 

A ^ AT ATTT AGGGT CAT CAG T CACT GTCXX>ACTGATCACT AAGGCAAGTTT GCTT CCTTCT A 
• CI ICI I ACAAAGCAAGAGGTTTAGGGGAGCI IC I I G CCCAGAAACAAAGTTTATGCTCTATGT 
GATCAGCT ACATCTCCTT AGGTTCCATACTTATCAGGGGCTTTTGACATGGAAGAATGTGAT A 
ATTGGGGAGCAACCI I U I GAGAATTCAGOCTITGAAGTAATTCAAAGACAG 1 1 IC I I A GCACA 
CACGTTAACTCCAGACACAGCAGTGGTGGGTGCCTCCATTG^ 
tc 5^T gaagggcccmg * tcga ^ g g^

AG^TCTGGCCACTTCGTTTCCATGGTAACTGCC^CGTGTACTTTTCTAGCAGTCTOTGGCC 
GTGTA^CTCTT^T^sl^^ 

GTCAGGCTGTGGTGGGGGGCAGGGGTGTTGGAGGAGAAGTTGAAAATOCGTGTGTTCTCT 
GCCCCTCTGCTpCTOCATCTTAGCTTCTGGAGGAGTTAAGGCACCAAGGGCACCAAGTCAG 
GTrGGCAGTTTTGCTGCXXITTGCCCAAGCCTCCAGCAAAACTAGCTGGTCCCCTGCnTrGCT 
GGGTGCCAGCCAGGTGTGATrTGGTTGTTGATTAGTAACX^CACATTGTTTTTCCCCACCTTT 
TTCATTAGTGGAGGTTTGTATGTGTGTTTGTRGTTTGAGACAGAGTCTCACGCTCTTTrCAGA 
CTGGAGTGCAGTGGTGCGATGTCAGCTCACTGCAACCTCCACTTCCCGGGTTCAGACGATT 
CTCATGCCTCAGCXTCCTGAGTAGCTGGGATTACAGATGTGCACCACCACACCTGGTAA7TT 

tgtai 1 1 1 lagtagagatggggtttcaccatattggcaggctggtctcaaatcctggactca 
ggcgattggccacttggctcocaaagtgctgggattaaaggtgtgaaccaacgtggcctgc 

G CCCATTAGT GGAGTCTGATGGGCCGGGCTCAATATTATTCATGGAAf^fiAfiAriATfiTTnfrT 

1^' I'JJ ^caactccttcatgaagaagggaggggggcgactctacatcgacgacaatg 

G i GC A 7GG A Q ^ TGGG ^ 

§ T T ^^ G ^™ AC ^^ 

>359 

T oo-nrJ^J^L™ lu 1 caggggccaaggttggacccagaagtgcatggggtgcttggtg 

7G 9 t I cag t ga j^ gtaagg a^^ 

^ G JI C ^ GCA . CCT77CCT acamtc ^ a ^ ct ctaccattcccggagtaataaatccagttg 

GAG^AG^^^raATGAGAA^GGGGATOGACACGTTCCGATCACAACACCAGTACCTGCCCG 



CGGJ^CCttlNTTG^^ 

GCGG 2IZ GA 1 ' 1 1 ^ 1 CACTTTGGACT GGTTTTTACTTCCCGACTT CTGGACTCATCTTTCAAG 
AG 9 AC TTI AG ^ AA ™ 

7G ^I QACA7 2 fiAAG j™ T ^ 

AGAAAGGATGTTGTCCCGAAGGTACGAAGGCGAAATACCCAAAAATATTTGCAAGAGGAAGA 

AAACAGTGCACG ^^ 

GTTGTTCTCATAATAATTCAGATGGAGAATATATGGCTGGACAGCTAGCTGCTTATGGCT^A 

AAATTACAGAAAATGCATCCGATGCAX^ITTATGGCTCCTGAACA 

^TGAAGACCJCm^^ 

?i 6 j5 G ^ GCTGGGTcAGAA ^ 6 ^ TMT 

7GGCGAAGA7TAGGAAGA ^^ 

GCTTGTAGCTACTGCAAAACTAAACACGCCAGAGGAAATTTGGCCAGTTATCCAATTGATGA 

A ?T AG J A ^T AGAGGCAAA ^ 

AAGACACGGGGGCTTATGGCAGW^TATTGGCACCAATCTCCCCACACTC^ 
GGTTGAAGTGATTCCTGAGGGAGCAATGCTGAGGCTTGGOAT^ 

ATACCAGTCCAGTpGCCrX»3ACAGCGTACTC^^ 

A ^ A i^^JJ c £ CAA i™ G ^ 

AAAAATGGAACAAGTTCCAGCACAAGTGAAAAAGCAAAGGACAAAAGATCTTTCT 

ttcattcttacagtccatatgat^acaagattggtgaaagacaacaa^tgttagtaa^gaag 
r ^^^^cc^gttttatgttgcacacaatcaattctatg

5?£ ACCC T G ^ TT ^^ 

tgaaagggcagccagtatctgatgccaa>gtgtacacgccctccatcagcaaaccgctagc 

aaagggagaagtctcgggtttgacaaaggacttcagaaatgggcttgggaaccagctgagt 

tcaggatcccacacctctgctgcatctcagtgtgactcagcgagttccagaatggtgctgc 

ccatgccaaggctacatcaagactgtgcgctgaggatgtc^gtgggcttggctctgctggg 

tcttctitttgctttttttgtcaaggtctataattagaatacaactaatggaaacatctataaa 

gaagaatacatttct aattaaaatcttcaatgaacaggaaagcgacatctccattctccaagg 

gcaataat^gtactggtcatgctgcctccttctcagcc^ctcttcttaatgaggctccccct 

?Z^ c ^I rG ^ TTGGGCCCA ^ GG ^ A1 TI GAGCTA ^ 
acatccttcaaattaaactgcttttggtttactttta 

J3 G ^ G ^^ G ^^ G ^^ G ^ G ^^^ G T^^GTCAGAACCTAATTGT ACAGTGGCT CT GGCCATCT 
TTTCCTCATGTGGAAGAATTTTCTATCTTTAATAAAC I 1 1 1 I C I M G I I 1 1 1 1 H 1 1 CCAGATGG 

^JII9 GGTCTTGTCCCCCAGGCTGGAGTGGTGCAGTGGG ^ 

C J 1 C I GCC I CCTGGG ^ CAA>CGA ^ CTCCTACCT 

s CA I A J^ CCACCATGCCCAACTAA ' nm ^ AA "^ 

CAGGCTTGCCTGGAACTC^AGCCTCAAGCAGTCTTCTTGCCTCAGCCTCCCA^ 

GGAT 7 ACA i^ CGTGAGCGA CTcc^ 

>361 
>362 

™ ^^Jj^^TTrATAGCTGTTTATTTTTAAATGCAGGATCCAATCAA ATA 
Z9Lt9 T ZH T i^J TCTL ' ' 1 ' 'C^CC^AATTGATTAGAGTTGCCCTTTTGTGTGTGTCTGAG 
TGGTGGGTGGTGGGGTGGTGTTTGTGTATGATGTGCAGTGTGTGTGTGTGGTGTGTCTTTC 
A J GA 5* CTGACA ™™AA* G A G ^CAGGCAAGT^ 

TTACTGATTGTTTCCTCATTACGAGATACAGGTTAC A I HUM I CCCAAGACTGCTACATGTG 
TGATGTTGTTTCCTTACT ACTGCCTCACCTCAGGAAGCACATAATGTCTACTTGTCTCTTTTCT 
G^ATTAGGACTGATCAGGTGTTGTCTGCCTGATCC^TCATTATCAAGTTCCATAATTCATT 

AT J^^?I G ^^ CAGGACACATACA ^ ATAGAGTGTG ^ 

AT .°£ C 7I AA T£ TTCT ^C^QCCCCAGGATCCCCAA^ATATTATAAGCATTGCAGTCAGAT 
^^1^95^^ AGCAAAGCA7TCATCCCTTTCCATATAAGAGGAGGGAGGGGAAGAAGGAA 
GTTTCCATGTGCAGGCAATACGTGAGAGCATGGGCAAGTGAG^^TTTGTATGCTOCCTGTA 
GGC ^ G <^ GAAATA ^ G AACGCTT™^ 

CCCATAGTTTATTTATAGAGAGCACACACTCCCTAATTGTCAGATG G1 I II I 1 I GGGAAAACT 
CJT/^^TACAGTGAGTTGTG ACCCTCCAAAG GATCACAAGTACATAAACAGATACAAAG 

TATTT ATG T GGTATTA * AA ^ 
JCJTAAA£«GCTGTA^ 

TTTCTCTTCOCTGGGTATTTTCTCATTTCCAAAGACX^GAGAGTCT GAGAGCCAGTTTAAGTGG 
AAGAAACTrrGAGAGGTTTCTTAATCAATTCCTTC^^ 

JGGTGAAAAAATCT ATACTCCTCTCAGAGTTCTTCGG^CATGGAAAT/TCTATAGCATTCTTTG 
GTATAACCAACCCACTGAACCCJCCACAACATC^ 

I™ 1 ^f^^^^^^^^CTTrfX^AQTrGTCCCACXTTT AGAAAGGTCCAAAGATGATATA 

I AA I A ^ GAA ^ AGAA ^ AAT ^^ 

Z accag 2J g J ggttcag cctc^^ 

tcttgttgctccattgttgaccacrrgatgtaactcaacgtgaaaccttcccaaagct 
atcactctct gttat ggctgtgaccagcagtgaacatgaaaagaactt atttcatgccttctc 
caaactatattgi i ici i rgctaagtccagacctgctttttctacatt^ataggttttctatttt 
AA T^ c ^SS AAAATTCrrTATCATACTCAATC ^ 

TTAACATCCCTGTTAAAGATTACACTGTTGTTGTTTCAAAAAAAAAAAAAA 
GGCAAAACATTAAGGGAC^ 

A7CAT JH GG7 Z A7 Ji^ 7GGAAGC ^ 

TACT CCATG AGT ACCACACTGGGAGAAAACAAAAGCAAAAAGATTGTGGGAAAAGTATAGCC 

GTTGACACTGTTGTCATTCTTGTTCTGATCATTTTGGACCTTGAAGAA^ 
CTAGAATTAGAGAAACAAAGTGTGTTTGGAAATAATGATTGTTTT^ 



PCT/US00/33312 



Table 4 



TA ^9^ G ^ AAGCTTTT ATAACAGGCTG ^OCCTCTGGACAGGTATTAATTCTGAGTAAGAATT 
TTCAGT GACT ACATAAGGATTTGTGTAACTT ATGAAG GAA G AGTCCATTTCTAATCAAATAATT 

^ C ^ G77 ^ AC7A !^ A7A ^^ 

AG AAAAAGAAATGTTTACT ATATTTTTGGTTCCATTTATGATTGTATTAAGCATTTGACTATAA 
GGA*AACTAA(^TTAAATCAATTAGA^ 

TT^ATATGTGAGCTTGGGTATTCAAATGTCACAAATAAAAAGCATATAACCAAA 
>363 

. a~ta a ^^3^^^^?«^ 7 ^ G ^^^^ GGG ^ GGGGGGGG AaGG ^^ Ggg ^^^^GATAAXATTGCTGACAGC 

CCCGCTGCCACTG AATCGGTCAG G GACCCCGGATTCCCGGGTAG ACGCCC AGTAAATG AG 
CAGTTTAGGAGGCTGGCCCGGTTTGTGCTGGTACGCG^ 

iGCC^GG^AG^ 

>364 

>365 

^^^ G / VGCGACGCGACATGTGGAGGGTCGAG ^ 
AAAA TTC^TTACTAAAAATJACT^ 

TAGAGACAACAGACACGCAATTCTGACGAGGACTCCTGTT^ 

ACAAG A G * A ^ 7A 7 C ^^ 
G ° CGG7 ^ ACA i? AC ^ GTCTCC ^ 

xI A ^ AT ^ r II AA ^ GTAAACAATATGAG ^ 

TATATTTTGACTGATCTGAATAAGCAGGTTACTGTGGAAGCATATAAGATAAAACAGCTAA^T 
T^IT^ 7666 ™^^^ 

II A 5 A ^5^^ G I GG i! G X ATTAAACATGCATCT ^ 

ttVZITggg^ 

>366 

, ^ . ^j^ GAGC l CCCCGCGGTCGCGGGGG AGGTACT^ 

ACACTCAGTATTAACCATCACAAGGCGTGAGGACA^ 

CAAGA^TAGTCCCACACCAAAGTCAG<^TC^ 

T AAG J GAAGC 5*?^ CCGT 
AG ^2£ ACCA TI AAC ^ 

AGTCTJTTAGG^^ 
>367 

QTC^TCAATAATATCCTTGTGTCTCAAA»TCTCX>TGGTGAAW 

Q S2 A SS^ AA ^ CT6<3TCTTGCT 

GTTT(WGWTCCAG^TAA<^ 

TGTCCTCAGCT ACCATTTTCTCAAGTCCTTCTCTTGCOT 
CAC7GG J CAC J TCA ^^

SIJ^^JJJ 07007 ™^ 

G J A ^II CCA , CGTCCACmGTGCAGGG AGGTACACAGCT 

^STfT^P^lj!^?^^!^^ ?? / ??5f!l?S5 AGAG 7 G GCCACATCAG T G CTGAG TTT AG T^ATGTC 

TGTTT^GTAGTTCTAAACCATTTAGTAAAGC^TGATCT^ 

S^ G ^ AG ^ CCC ^ GTTCAGACTGCTGC ^ CGATAGC ^ 

caaagcj^gttacaagtgtata^caagtgttggaagtttgagca^ 

TCACAACCCGCTTG^^ 

JT» r 4 A 5 G ^^ CT Z GAAA ^ 

S^? 7000 ? 677 ^ 0007 *^ 
JJi*^ 7 ?** 7007 *^ 

tct aagagtgaagtaggtgtgacttg aact ax^tt gcatgcttcctcctttgctcttaggaa 
gaccagctttgcagtgacagcttgagtgggttctctg^jagccctcagat^ 

J 07 ? 07 !^!?™ 70 ^ 7 ™^ 

AG ^ GACT T C I GGC ^ GAAGTGG ^^ 

AGG7ACA £ G7GGC J GAAAATC ^^ 

JT^^ 06 ^ 07 ; 37 ^^ 

r^ A7T P^ 7GG7AGGA7A7cTTG ^ c ^ G ccA 

catgtgactggcagtattaagtgtgcttattttaaatgttactggtagaI^ 

il 67 ?!?™^^ 

£ GG7C ^J^ AA I AAGAAC ^ 

?£I AAC 1 C £ 7AG ^^£ 

TCAGAGATGTTTGAGGCTTTCAGAAGCTCTTCTAA^ 



^ 0 J?JSi^JS 7 5^ 77Gccc ^ 

CCTCCGCCTOCCGGGTTCATATGATTCT CCTGCCTCAGCCTCCCGAGTAGCTGGGACTACA 

JGJSCC^GCTGGTCTCX^ACTCCCGACCTGAGGT^ 
GCTGGGATTACAGGCATGAG<^A£C^ 

CT ^ C7 ^ 7 J77 C P T J^ AAAAGA ^ CTAA7TG ^ 

ACAAGACACAAATATGCTCTTATAGGCTGGGGAAATAAGAAAATATGA^ 
?JS G ^ CA ii AGAAT7ACCTGG6 ^^ 

AGGTCAGCATCGOCCACAAACCGGCCGTTCTGAAACACAGCC^^ 

7GTGG ^ C7777G ^ A7G ^ 

AAG i AGA I A I G J G7G i 3CC7GGAG 

GTCTTAGCACTTTCCTCAAAATTGAAAATCAGGCCTGGAA^TCT 
AACATAAGM^J^A^ 
GTCCCTtGA^GTGCCTGGCACATAACAGGT 
CAGACX^AGATAATTTCCCTTACATGCAACCTTTTTGGAAA^ 
TGGAAGGGCTAAGTTGGGGGGTTTCCAAAGAGCTTT atagtctcatgaatcttcatgcagta
CT ACCACTGGAAACTCCCATTCCAGG CAGGTTTTT AAAGAAAGAAG CAAGAT GGCT ATCATG 
TCCTAATGTAAAAAACAGAAATTTAATTAAAGAAAAAAGCAAGAGATACGAGTCTCOCTTTCC 
AAGTACAAAAACAAGCACCCCAACTCCOCTCAAGAAAACAGAAGGCTCAATATCTTT ACCAA 
CAATTATTAGAGCCACAGAATACATTCCAAAATGTCATOCAGAAGTTGCTGTCTTAAGCAGCA 
GGAAATAATCTAATGACATTATACCACATAAAAATATATTTGCCTGCTCCTCTTCCAGTAGAAT 

agtgagatccaacx^xttgggagactcttcctccaccccactcccattacatagagtgtaa 
tgctgcagggagaggaagttaaacattactttcaggaagaaggaagggtgtggttttactgc 
taacggacgcgtggtcgactccctann 

>369 

acgcgggggtttcxxsgtttgggtgtggccgcatggcgtgctggggtgcaggtgg 
cxx5aagggggcgttactgttgc«actggcatccgcatccggcagatgtagatggaaccaaa 
gtccagaagttacgcgtcacxxrrtgctctacagccaaacatgcaggactct agtaacccgc 
gaaatgatgggatagcgttgcaaatcctt aaaagagtctt aacggagaaggaaaaatgttac 

agatcaaggaaggaaatccagattttactttaagtgc^ 

tagatacagcaaaaagacaacaatcttcaagccacaatggccctcaccagaaccgagccatg 

TGGTCAGCCTGATCTCGGACTT CACAGCCAGCAGAACTGTGAGAATTAAATCTT ATGGGTTC 

CTTAAGCTGATAAGCAACNNNNNNNNNN 

>370 

TTGGAGCTCCACGCGGTGGCGGCCGAACGTTGGCTTATCATAATATTGCTGACAGC 
AATAAACTGCGAGATCTTCAGGCTGCAGGCTGCTGGTGGTGAGAGTXjAAATC I G I CO C AGA 
CCCGCTGCCACTGAATCGGTCAGGGACCCCGGATTCCCGGGTAGACGCCCAGTAAATGAG 
CAGTTTAGGAGGCTGGCCCGGTTTGTGCTGGTACGCGGGAGAATGGCTCGCAAGCrrGACT 
GTGAGCTCGGAAATCCTTTTAAAAGAAATTCAAATGTCACTTTTTATTO 
GGCCGCCACCGCGGTGAGCNNN 
>371 

AGGATTATTTTGAAACAAiGCCTACGTCGCTGACTAACCGAGTGGAAGGTGTGAGTGG 
CACTACAAATTCACAAAAGAACT gt agcctcaga t aatcaaaggagagaaggtcagatgcaa 
TCACTGATG CATGCT AG TAA7TCTCAAACCTTCGTTTTCAGAAACGATTGGATTTTCAGATAG 

atttgcagtaagagaataacaagtctttatttttttcatcocaao i rcnrci I GCACATTTTTC 

TTCTAGCT ATATTT AAT ATCTGTTCT CCCCACACACTTGCTAATCTACATTTCACAATCTTCTT 

CCACTTCACTTTGTCTGCAAAGAAATCTACCTGGACAGAATAGCA I C I C I 1 I I 1 1 ICCCCCTG 

ACCCTTGQCATTTCCTCTCCCTCCAACTTCTGCCTGATCCT AGGATGGACTC1 C I CATCCCT 

CATTCTCT AT CATT AGCTCTCAG GCTGCCCTTCCT GCACTTTACCTTCCOCCACACCTTCTT A 

CAGGAAAGCAGAATAGAAAGGCATCAGTGAGGGATGAACAGGAGAGTGCCTTGAGAGAAGA 

TTCCCAGGAAGACAGG GAGAGCCAGACAGTGGACTGGTTGCAACCCATGTAGCOGAATCAT 

AGTGCTCATTGCTGTGGAGGCTCTGGTCTGTCT AGCCACACAGACCAGTGAAGTGAGGGGT 

ACATAGGNN 

>372 

>373 

- AOGCGGGGGAAGAAGAGGAAGAAGAAGAAGCAGAAGGAATGGAAAGCCTGGAGAA 
AGAGGATGAAATGACGGATGAAGCAGTTGGAGACTCTGCTGAGAAGCC I CC > ICIACTTTTG 
CCTCACCTGAGACTGCTCCAGAAGTGGAGAOCAGCAGAACTCCACCAGCCTGTGAAACCAC 
GAACCCTTCAATCAAGAAAAGACCTTTGATCAGGAGAAGA 

CACATTCAGGATTTCTCCAAAGGAGGTGAAGGTACG^rrCGAGCGTGGTCATTGAG 
TCGACGAGAGATCCCGAGT ACATCT ACAGTCAGCCTTACGTCTGCAGGTGT ACOCAACAGC 
TCCGAAGAGACAGCGACCATCGAGAAOGGGCCATGATGACGATGGCGGT 1 1 I G I O GAAAAG 
AAAAGGGGGAAATGTGGGGAAAAGCAAGAGAGATCAGATTGTT ACTGTGTCTGTGTAGAAA 
GAAGTAGACATGG^aAGACTCCATTTTGTTATGTGnrTAAGAAAAATTCTTCT 
>374 

tt a ac tW taa iM^t il llu>la a5a o ctaag «wr . iMiirjwK ^ 
cATGTCACTGTCl I CCACCTCCATATCCTGTCCCAclGGAAGTCT ACAGGGGCAGTAACCCAG 
. GCGGAGCCATCAGCTCCT ATGAT CACAATG CC ITCTTCl GGATACCTCCTGAAGGACCTGCC 
TGAGGCTATTTt ACAATT AA rTTTGTGTI I AATAAGTATAAGGAGTACGCGGGGAGCACCATG 
,..„_ , _ - ™ - ^CAGCOCGGCGOGGCCCTTTTCCGTTAGCGTTG C ITSC1 I 1 1 GTT 
CCTGCTCGGCCCCAGATTGGTCCTTGCCATCTCCTTCCATCTGCCCATTAACTCTCGCAAGT

gcctccgtgaggagattcacaaggacctgctagtgactggcgcgtacgagatctccgacca 

gtctgggggcgctg<3cggcctgcgcagccacctcaagatgacagattctgctggccatatt 

ctctactqcaaa gagg atgcaaocaaggggaaatttgcctttaocactgaagattatgacat 

gtttgaagtgtgttrrgagagcaagggaacagggcggatacctgaccaactcgtgatccta 

gacatgaagcatggagtggaggcgaaaaattacgaagagattgcaaaagttgagaagctca 

AACCATTAGAGGTAGAGCTGCGACGcctagaaoaociff^ 
AGAAGAGAGAAGAGGA^tflq^tacraacpag treuiran^^ 

*BgB*c *»9ctaoctgQcaQgteBctao ctg o g^^ 
ctett g9g I 099 a 99Ct999gtteac<tCCT»cagt« 



_ ~ PNpTAATCTCTT 07 AGGCCCCGCCCCTTCTGAGCCCCCCCTCCTTCGGCCTGTATG 
ATAGGCTCTTCCT CCATTTCCGGCTTCTGGGACTGGGGTGCACCACGGCTTCCGGTGTCAT 
<^^GCTTGAAGTCCCGGGAGTCGGTGAGGCGG^GCAGGTCCCTCCCTGCGGAGCCGCT 
GGTCCGGCTGGCGGAGATGTGACCGCGGGCCCGGCCGGCCT6CCTCAGGCGTCGCGTCA 
GCTCCCGTGTCC GTGC CCTTAAOOCACACCGATGGCGGGATCCGGCTGCGCCTGGGGCGC 
GGAGCCGCCGCGTTTTCTGGAGGCCTTCGGGCGGCTGTGGCAGGTACAGAGCOGTCTGGG 
][^ G ^®GCTCCTCCGCCTCGGTGTATCGGGTTOGCTGCTGCGGCAACCCTGGCTCGCCCCC 
CGGCGCCCJCAAGCAGTTCTTGCCGC 

G I A I^i , 1 1 t^CGCAAAGAGAGGGCGGCGCT GGAACAGTTGCAGGGTCAGAGAAAGATCGTG 
^CTITGTATGGAGTGTTTACAATCCACTTTTCTCCAAATGTGCCATCACGCTGTCTGTTGCTT 
GAACTCCTGGATGTCAGTGTTTCG GAATTG CTCTTATATTCCAGTCACCAGGGTTGTTCCAT 
GTGGATGATACAGCATTGTGCCCGAGATGTTTTGGAGGCC C I rGCTTTTCI ICATCATGAGG 
GCT ATGTCCA TGCGGA CCTCAAACCACGTAACATATTGTGGAGTGCAGAGAATGAATGTTTT 

AAACTCATTGACTTT g<^cttagcttcaaag aaggcaatcaggatgtaaagtatattcagaca 

GACGGGTATOGGGCTCCAGAAGGAfiAATTGCAAAATTGCTTGGCCCAGGCTGGCCTGCAGA 
GTGATACAGAATGTACCTCAGCTGTTGATCTGTGG - 
TTCTCAGGAATGAAAGTGAAACATACAGTCAGATC7GAGGAATGGAAGGCAAACAGTTCTGC 

tattattgatcacatatttgccagtaaagcagtggtgaatgccgcaattccagcctatcacct 

AAGAGACCTT ATCAAAAGCATGCTTCATGATGATCCAAGCAGAAGAATTCCTGCTGAAATGG 
^^^^p^^XCATTC^XTTAGCATTCCrnTTGCCCCTCAXATTGAAGArCl GG I CATGCTTC 
CCACTCCAGTGCT AAGACTGCTGAATGTGCTGGATGATGATTATCTTGAGAATGAAGAGGAA 

ACTTGTTCCAAAGGAAAATCCTQGCAGAGGACAAGTC ITT BT I G AGT ATGCAAATGCTGGTG 
ATTCCAAAGCTGCGCAGAAATTACTGACTGGAAGGATGTTTGATGGGAAGTTTGTTGTGGCT 
ACATTCT ACCCGCTGAGTGOCT ACAAGAGGGGATATCTGT ATCAAACCTTQCTTTAATCAGTA 
ACCTA ^ GG ^I9' m ^?II TTTCTCC 1 C ' I CCATTTCTTGGGTTATTCCACAT ATGAATGCAG 
S^tv^SS^^IX*2l£i^^** G ^* G ^^^'^ atacatttatttaatcctactaatgtgcagc 
^^^*CCCAAGCAGTCACTGCGTrGCATACATlTGGCACTGAGT AGGACAAGACCTCTCAGC 
TAT ACATTGAGGGGTTTT AGAGCATGCATGTGGGCAACO C 1 ' I 1 1 T T GTGCGGGAGAGCAGGT 
Gl IUCICI I CAGTATQT AGCCT AAAAAAATCTTAATTATTTCATGGATCATQAAGGAAGGATG 
AATAAJATCATGT CTT GCTAAATACTAACAAATTrGTTAGGTTTGGTGACATCATTTACAGATr 
Aiiiyi 1 1 ATGTTGTXXlAGTGGTTCTTCCTTATTGTrGATATCCATAAGCTGGCACTGGATGC 
TCT CAGTAATGT TAAGT AATTGTCAAGCAGCAGTT ACCT ACTG TGTT CTTAACACTGAGTTGT 
GAAi 1 1 1 1 1 CTTAA AGCAGTACrGTAGTACTGAATATTCCTTT AAAGGAACTGCAGTGAGCCT 

ATGT CT AGTATGTATGCTTXSAGTGAATGTGCGAGTATGAATGATTAGAGAAAATTTGAGTCAC 

TGTACTTT^AGTGTGAATCCTGTGAfiCTAATACAGTCT 

<^ T ?^fAGATlTAAGATATAC^^ 

ACAAAACATAATTCCTGAGAGGCCCAGAACAAACTGGAGTCT AGCCTGGAGTTAAATTGAGA 
CTTCTAAAATGATTGGAACAAAGACTAAGTTGTGCCAGATGTAAATCA^ 
^J^J^^^J^t^^^ 1 1 1 1 1 J f*jj^ TA AATCTATAGTTrrAAGGTTTUTjCAAGA 
TGTGGCTUTACCTACTATGATGAAAATTGAAGTGGGTCAAAAGAATT 

ggaaaagaaaaaaaatgggcgaagagagggtggaaaataaaaggati — 
CTGTTTCTCGTATCCCTGCTCCCrrTTrCCrCCCCTTCCCT£^

tgaagacaaact agaggagcagcatcccaggtagtttggcttttgactgcaaggtagttaag 
aaaggtgtagatataatgaggtagaagtagaaaggaagaaaaactcaaagaattctgaaaag 
gattcatagcaacataatgtgtccctgagtagaggatgctgctatg 
caagttgattacatgg tit 1 1 agaattataattatggattcttctt atttcatggtaggttgtc 
tttaaagatataaaaatt aggatgc ctttatgaagcactgattataccaaaaaaaaaagacaag 
ttatatacaggttattaai 7 1 1 m i atttatttattttttcctaaggaaaaagtcttctatctttc 
cctgct ggaagcctcctcatatttcctt atgtttgccatgcaggttgctgagagtccagtta 

AAATTTGCATTTT ACAGAATGAAATACTTT ACCCCATTT^AAACAATTATTGTTTGAGATTTT AGT 
TATTTATAATTGTCAAArrCAGGACTCCCCTrTAATGTTTATTATGAAACCAAAm 

ggaoxsctgattta tgaatta ccaaagggtcttgtggcatgttccccaatacatgcccttaga 
aggaagaact att a 1 1 1 1 1 lattttggcoctttcaggagttgattatcaattggttcgttttca 

AGTCACAATT CACCT GGGCAGAACCCGTATTGTGAAGACCTAAACTTTCCT aaatgttcatat 
GGGTAGCA GATTT TGTGGTGATTAGAAACATCAGGTCCTT AAATACGATGAACATGGGATAC 
AAAGGAATTCTTT ATAAGGGCAAGTATCCTAAGTT AGCACATTTACTTTT CTCTCCCCTCCGC 
CCCCAAAAGAAAAATCCTT ACAAATAAACTGCAGGTAGGCTTCTAAGCCTAGTOCTGCAGTA 
TGCrGCTAACATCTTGATGCCAATCTTCACAGCATTCTTTGATTGT GATCTTATTGCTGATACA 
TTCATACATATATTrAGTGCTTGACACTGTAGAATTTTGTT ACAGAAGATGGTTACT AGATTTT 
AAGGGAGCTGAGGGAATAATTGATGAGCCTTGAATTAACCATC^ 

TGTTGTTGTTGTTGTTTTGAGATGATGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCT 

CAATCTCGGCTCACTGCAACCTCTGCCTCGQVGGTTTGAGCGATTCTC^ 

CCAGTAGCTGGGTTTACAGGCGCTTGCCAOCACACCTGGCTAGATGTT^ 

GAGAANNN 

>376 



"Oca^acctetcaggtalgag^ 

ACTT AGTAATGCACACATGGGGGTTAATTTTATT A I II 1 1 1 1 C I AOCTGTGAAGCTGTTTTAAA 
TTTTCCAAACAAAAAAGAGTCACCCAAAGATCCTCATCCATtnTTCT 

T CATTTT CTCCATTCAGTAACT CATAGATGATAATTCAATGTTACACACATCTTAAATTGCTGG 
T TCI 1 1 ICTCAACOI 1 ICICCACAGTCI C I I ICC 1 1 1 CTAAACTCAAAAT A 1 1 1 1 1X11 rGATTCA 
TCTTTTGCTCTTTCTGTAAACATAGATATTTATGCAGGT 

GATCTCAGCCTAAATTACTGGGCCACAGAGACCtGGGGGTATCTT ACT CC 1 1 1 ICC 1 1 IA CTC 

agaagaggggtaacgtgatgagctacttgacatct gcctctcactaacaaacagtgatgctt 
gaggataacaaactctgtcactcacctat^ 

ACT T IGIGCTAACAGCCAGAGACACAAAGACAAATGGGATGTGATCI CIG I 1 1 IGCCACACA 

aaactgagtagtaggttaatatcaggagtg I I gc n I agtatttjttgaagcagcgaagagtg 
atatct aacgaggttaggcaggaaactaggtgaactaggttggtagaaaaaaactaggct 
gtaggaagaacaccttagaggtacctctcccacaccgagctggct 
tcatccccatggagcattgcaccaccog ci 1 1 1 1 u gagaoctgtoagctggacaatqacaag 

TACCTCGgocGCCACCGCgg 

T AGAAGATCGGCTCCCGGAATTGGAACAGCT GGAG06CATTG<>ACTGTTCAGTCATGCGGA 

CAAGGAAGACTTT ATCAATTATGTTCAATATGAAATT AATCTTTTGGAGCTGATCCGGAGAAG 

AAGAACACGCATTGGATATTCATTTAAGAAGGATGAGATTGAGAATTCTATTCT 

ACAAGGTGTTTTCCAGCGTGCCTCAGCAAAA 

tl^B^il' •JJ^ AAG **G Tt * GG CT ACT AAAACTOGACTT AGCAAGGTATTCTCTGCCATGT 
TG GCGATTCAT TCCAACA A AOCAGC fTTQI G GATT ATGGCAGOCAAATGGGAAATGfiAAGAT 

cgattgt ctrc agaaagc g caa gg caact a i i ic i fco cgcactgogctttcatocaoactg 
cccaaaactttataaagaatacttt aggatggagctgatgcatgctgaaaaactgaggaagg 

«AAGGAAGJWTTTGAAAAAGCCAGTATGGATOT 

CTTAA GGGCGAG rrGGCATGGATCATCTACAAAAATTCTGTAAGCATAATTAAAGGT GCAGA 
£™CACGTGTCA<^ 

ggcgagaattagagattgagtcacagacagaagagcaggctacaaogaaacaagccaaagc 
agtggaggtcggccggaaggaggagaggtgctgtgctgtgtatgaagaggcagtgaagac

Jjg^CCAACAGAGGiCC^GT^AAGTGTTA^ 

GCACATGAACTGAAGCTTCTrGTCAGAATGCCAATACAAGCAGTTGAGT GTTTCGTTGCTGTG 
TTATAACTTCCTGAGGGAAGCTCTGGAAGTGGCAGTAGCTGGAACTGAATTGTTTAGA^ 
CTG GGACAAT GTGGCAGCTGAAGCTGGAGGTGCTGATCGAGTCAAAGAGCOCTGACATAGC 
CATGCI 1 1 ! I GAAGAAGCCTTTGTGCAOCTGAAACCCCAGGTTTGTCTGCX^TTGTGGATTTC 

ctgggcagagtggagtgaaggtgccaaaagccaagaagacactgaggcagtctt^ 
gctctctt agct gtc^taggtgccgactcagt aaccctgaagaataagtacctggattgggc 
ttatcgaagtggtggctacaaaaaggccagagc iotgt ti aaaagtttacaggagagccgac 
cattttcagttgaci 1 1 J icag<5aaaatgattcagtttgaaaaggagcaagaatcctgcaata 
tgggg aacataagagaatattatgagagagctttgagagagtttggatgcgcagattctgat 
ctttggatggattatatgaaagaagaattgaacx>cccccttggtaga 
acagatct actggcgagcgatgaaaatgttgcagggagagtcagcagaggcatttgtagct 
aaacatgct atgcatc^gactggccatttatgaagatgaagaatacagtcagctttgtgaaat 
^t attgcaagcaagccccgtgggcaaatttgtattgagt ccatctgt aatttgctcagtga 
tgggagagaagatggctgtctggttttgagagacacttt aattttatgttaacttgttaaatc 
i i i i iAAAAATTAAAAAAI 1 1 1 1 atgattgactaaaaaaaaaaacct acaaaocttccnnnnnn 

NHNNN 
>378 

NMNNNNNNNNNAAACCA^ 
AGATATCACTATACTTTT AAACCAACTTTGTGATTTTTATTt^TGGGOGACAACTTTATACTCC 

tagatatcactaaactgtgtacaattagggacgcagcattagagaagagcagac^ 
taaacagaggagagtgaaggagagatcttcattatttggccatt^ 

CT aacattttaact CCAAATATTTGATCTGCAATGTGTACGTAAGCAGTTTCTCTCAGTACAAT 
TATTAAAAi iiii ^TGTTAG&AACCAGCAACTT ATTTTTTATGTTT A \ I H I CI 1 1 IGAAGTAA 
GAACT AGTTCTTCTTT GATAACTGGCTCATTTTTATCATTT ATCAAAAACTAAAGGGTAGGGAA 
GAAAAGTGTGATGGATTAAAAAA \ I ICI 1 1 1 1 1 AAGGAAAGATAAAATTCATTTTCACAAATTT 

ACAA GTG I mil gctggtgcacgatttattctactaagcaatgacactggggatcaaatoca 

CTTTCTT ATCTCAGGAATCAGCATTATTTt>GAAATATGGGTr^ 
AGCAACAGTCTGTTTCAACCAAAT GATTT TGATTTGGAAGTTAGACT 

GTGCACGAAOCCCAAGGCATCTCCI 1 1 1 CATTCT AGCOCATTTTTGCAAAGGGAGAAAGAGT 
TTCTCTCT AAAGAGCCACTAAAGAGAGGCGGGAAGTGAGTGTGAGGTGGTGTCCTACGAAG 
CGGGCACTGGGCGCACACACAGCGCACACAGGGTTTACAGCAGGTCCACTCG^ 
ACAGGAGGTAGGCTG I GOG 1 1 CAX3CAGTTGGTTTCACCACCTGGTACTGGTTGATCACCTT 
GACTGTCTGGTCAJTCGATCCGCAG<XAGCCATT 

T^TQQCOGCCCQ ^ OtU^ACTGTrGOCGTGATGQTAGACCACT GCAAACAG COGAT AGGTTC 
GGTGGCATTTAAAA1 ICI I ATriTTAACCCCTGGAGAAAGCA b I ICT 1 1 ACTAATTTCCAAGTC 
CACACGATATTCAATATTTTTGATAAGCITCTGG 

X ^ I CAGGTGCA GCAOGAGGACAGGAiSGGAGTriTrCCAGAGTCACT OGACTTATCTCAA 
OCTGTTGT1 1 Ublll 1 1 1 ti IGGTATAACCTTngACJ^TTCT^rrTTnr^i^^ *qrTTTCCiV5TC 
CATCCTGGACI 6TGCGT M C I I GIC IGACTGCATATCCAACTGCAACGTGAAAAATGGCTGC 
AAACTGGCAGAI ICI 1 1 1 QAA^CTCTGCTGQTAAA^OCAGAGA^OCTGATGTGrrOCA^CCJWUUUff 
GCOGGTGATTGGAGTCTGAACAAAATCCGCCTGGOGGGTGACQGAAGT C 1 1 G 1 1 CCGGGG 
GCOGAC TTGTTCC CATTCATCCTtXSCTrCCTTCAOC 1 1 C 1 1 C H C CTGCTCTTCTTCATTGAC 
CGAGTGttT I > I IUGGGCCGI T GGAAATOGT AAGTTCTGAAAAGGAGAGCATCTGCTT ACACT 
^CTATTTAOGACAAGAACAKNKW#0000#QO€K>0#0004 

yCACOGOGGTGGOGGOCQAGGTACTtaCT ACCCAGOCAATAAAAT ATAAACTCCAT 
TTGTCTT AQTTAT ATAQAA CTG I U T I T CCAGCTTAGAAAAAGTCAAACCAATGACTTGTAGAA 
CAATCTACTCTCAI IIII I ATTCAGCCTCTAGAACATGGAAGCTTTAAAAGTGAATTGGCTAAA 
TAGGCAAGACC 1 1 G I GCCCATTAACAGGTCTTAAAGT AACAAGAGGGGCTGGOGGCGGAGG 
TACGCACAACOGAAGOGT AGACTCACACTTGCAACAAAAGGTTACAATATTGTAATtjCGCTC 
TGTO0GGTTCTGC1 1 G IGCAGCnSGAOCATCTATTTC A TCCT C C I CC I CI U AGCTGTCATTTA" 
ATTGCTCATAACAGTAGAGATCAGTTGTCTCTGGTTGCAAATCTAACAT ATATTT 
TAGGTGTATCTCCATGCATGATTACAGCTGGCTTTCTCTAOyrGTTCTTGATGATCTGCAACA
AGACATACATCXSACXXKjTCCACXX»ACXXXrrTATATTATGGAATCTTT GCTTTTTGTCCAGATG 
TCTTTGCI MICH CAGGACACAGTGGCTTTTGACAGTTAATACACCTCACGTCGCaGTAACT 



9*CGACCCAC«CGTCCGGAGCGCAGAAGC»GCTCX3AGGCTGGAAGAGGATCCTGG 

GCGCCGCCAGGTTCTGTGGACAATCACAATGGGAATCCAAGGAGGGTCTGTCCTGTTCGGG 
CTGC7GCTOGTCCTGGCTGTCTTCTGCCATTCAGGTCATAGCCTGCAGTGCTACAACTGTCC 
TAACCCAACTGCTGACTGCAAAACAGCCGTCAATTGTTCATCTGATT7TGATGCGTGTCTCAT 
TAttiAAAGCTGGGTTACAAGTGTATAACAAGTGTTGGAAGTTTGAGCATTGC^ATTTCAACGA 
^^J^i^F^SP^^^^^^^^^^^^^^^^^^^^^^^CAAGAAGGACCTGTGT 
AACT^AACX^CAGCTTGAAAATGGTGGGACATCCTTAT(>GAGAAAACAGTTCTTCTGCTG 

CTCCCAAACTCCCCGTTCCTGCGT AGT CCGCTTTCTCTT GCTG CCACATTCT AAAG GCTTGA 

AGGGGCTCTGGGAGACTTTGAAGACGAGTCCTGTTTGCAGgggaAGCCCCACTTGAAGGAAG 

AAGtcTAAGAGtGAAACAGATTCCTGAAGGCGCTAAAAGACGAAAAGTTACAAGGACTGAAGA 

CCAAGC^ACCTGGAAAGAAGTCGGCCTCTCTCTCCTGAGGAGCTGCCTACCAGA 

CAGCCACCTCCTTAGGTGTTAGTGCTTAGATAATGTGTCCCATTTGTTGTCATTTCAAAGATG 

GTTAI 1 1 I C I G 1 1 CTGTATTTACCG CTGTTCTTGTTGCTACCAGCATGT 

>381 

>382 



aagqgacaagccgactgacataagccaggtcctaacggagcctatgtgtaagtccactact 

GGTGCAAGGTTGCACACTTCTAAGAAGAGCGGCGTGGGGGGCTC^aSACC^ 

^CGcroccccGTGCAGTcccra^ 

GCGGACTTGGAGGCGGCGGGAACTGCAATTGGTGGCTTTGAAGGCGCGGCGAGCGGGAA 
CAGCTCTTGAGGAGTGAGACTGCAGGAGATGTGGGCCGTGCCAAAGAGATGGATGAGACT 
GTTGCTGAGTTG ATGAAG AGGAGCATCTTGAAAATCGCCATGAATGAACTGACAACAATTXTr 
GAAGGCCT^GGATTTTTTGTCTGAAAATCAACTGCAGACT GTAAATTTCCGACAGAGAAAGG 
AATCTGTAGTTCAGCACTTGATCCATCTGTGra 

AAGAAAATTCTTCAGAGAGCATTAAAAAATGTGACAGTCAGCTTCAGAGAAACTGAGGAGAA 

T?i? AG 7 G I GGA ^ CG ^^ GCCTGG ^ 

CCTACGTGGTGTACTACT^ 

CAATACAOCGCTTCTGGGTCAGGAGTTAGAAGCT ACTGGGAAAATCTACCTCCGAGAAGAG 
C AGATCATTTTAGATATTACCGAAATGAAGAAAGCTT GCAATTAGTGAACATGAAAGGAAAAT 
AAAAATTCCT^GTCAAAAAAAAAAACAAATACACAC^ 

aaagaaaccccc»ggggggc«cccaaagttaacaggcctata7^ 



>390 
>391 



T gc J[ ga a cctgggagg T gg * gg ^ 

taatatctaattctacctt agaactttcctttggtggtgggggtgggttctgggagtttgaga 
gattgttaggttttggtataaggtcataaaatcctgtgcatatgggatttctgaccatttgcc
ctgcctcttgcaaaataggtctaatggcaggatggtgtcataattaaggctaccaagactgc 
ccat t gttcc aggctgggcagttcataatgggggcagacaatagtgcaaaaaaaattttaca 
ttttatcttt AG agtgtcagggtcaaattgatttccatggtt GaGGATGTAG ccaagtgtgga 

ATCAGGTGGAATAGGTGGAGAGTTGCCCATAGTGGTTTGGAAAAGAGAAGAGGACTTTGAA 
AAGTGGAGGGCTCATTAGGTGACCCAAATTTTACCTGGGGCATCCCCCCTTTAGGGCCCCA 

acttagtctgtcagacatctctgaccttagatgggtgctggcaccactttggaatggttccc 
tccatcactgaggacctgacttaaagtttttctatctcacttaaaacaaccctttaacgctct 
caacttaggcaataataaattccttttcatgaattccctccaccacx^tgcaccacacagacc 
acatgcccggaccctctgacttgtgtaaccttttgtgcatagctaggtgggttttctgtcct 
ttgcccagttgggtgggagaagggaagaatttagcataaataaaagcagcataagtcacctg 
aaacctgtgagtttgccctggacgagctgctgctgctaactgcatcacacatagggatcag 
ggattataactgaaaaggacagaaatgagtcttrrccctttctgggcagggcagccatccct 
gttcactccttggccttcagagg^taccagagagtggccccagtcagttcccctcaattacc 
aacgagctactaggaaacggccgctgaaagactgaaaaaggagaaaaggaaaataactcag 
aaaaaaaaaaaaaggaataggactcagaaaaacaaaaacgaggaaaaggactcaggttcctc 

ACCTTAACCAAGCGGTGGCAGTT AGGCACTTCCACACGGAAACCTTCCAGTTTCACCAGAGA 

atggccctggccagaaacttgcaagtgtctttgtgcttaggtgctgtccactgagggtcct 
cagttggaaaggaaaagatggagagaggaagatttccctgtatgtaaaaggggaaaggaga 
aaaagaaatcccaaacttrgggcctaccttctctcctggctagctcaccaaaatatgttacca 
gtagagggtctt gactgcaagttgtttaggttcttggcattttgaacaaagacttggaagaa 

AGACCCAGCAAAGCAATGAAAGAATGCAGCATCGAAAGAATGAAAACAGGAATTTATTGAAA 

ATGAAAGTACACTCCACAGTGTGGGAACAGGCCCAAGCAGTGGCCCAGATACAGAATCTTC 

TTGGGTCCAAATGACTACT AAAGGTTTCCCGTTGGCTGCTTGGTGCTGACCTAANN 

>392 

>393 

NNNNGTGATATGATAAAACTGCGTCAGAGATTGTCGTGGCTGTGTCTTTGTATTGTT 
TTGTGTATGGGTATTCTG CTGTG TGTCTTGTTGCGGCGTCTTGCCTTTCTGTGTGGTGCTTTC 
GTCCGTATG CACGG TGTGTTTTCGCGGGATGATTTGGTTTGCCTCTGTCTCGGTCAGTTTTA 
GCTTGGTCTT CTTTGTTTGTTTGGGTCCTGTCTATCCT CT CTGTTTGGTGTTG GTTGTTGTGG 
GGGGTGTGCCGATGTGTCTGAGTAGTTTCCGTGGACACTCGGGCTGGCACACATGGTGTTC 
ATTTGTTGGTAGAGCTCCGAGGCTTCTCGGGAACCU>GAGCCAGTTGTGTTCCGTTTGGGT 
CAGTGTGCT AGT GGATCGCCTGGTCAACATATGGTGGAGCTCCCCAGTGTGTGCnTAGGA 
TTCCCATATTCTGTCCAGAATGTTGGTGTCACCACAGCTCGTGTTATGTTCGCACCTACTCC 
GGTGGTCTGTAGCCCAGAGTTTCT CCTTGGT ACCCCGGAGCAGAGTTT CCATGTGAGACCA 
AGTTCTGCACGCATGGCAGCTOCAATCCTTGGCCATAAATAGTGTAACTCCAGTTTATATAAA 
TATAAATAAACGTCAATGGAACACTAAATTGCCTGTCTTAACATTGTACATTCGGGGCCTAGC 
TGCCCTTGAGGATGTCCTAGTTAGACCCTCTCTGATACCTGTGGAGTTTAAGCACCATTCCT 
ACCGCTGTG TCCCTT GGGAGGGGGCTGCAGTGGAAGCTCTT AAAGGGGAATGCTTGCTCTG 
CCTCTGTGGCTTTTT GTTTGGGAAAGGGAGTTGGGATTGGAGGATTTAGA7TTGAGGTCATG 
ATGTCAGAGCACACCAGGAACTCCCAAGGCTGTGCCTGAAGATGCCCAOGCTGTCAAGTAG 



GGAGGAGGGGAAATGGAGCTATTCCAGGATACAAGGGATGGCACTGAGGGATGCATAAGT 
CCCCTGCCTOCCTT GTCTCAACATGTTCTCCTCTGCCAGCCCAGTCAGCTTGGGGAGCTAG 

GAGATCCTGGGGCTCCTCCTATTTCTAGAAGGGAT^G^^ 

GTGTAACAGTTTAGAGAAGGAAGTCACACCTGTAGCCTGGCTGGCAGGCAGGTGGACATGA 
GGCT GAG AAGGGAAGCCAGATGTCAGAACATACT AGGCTAGCATGCCTGCTTCT AGCTTTC 
AGGACCCAATGTACACATACACATGTGTAT^ 

GGAGGGGTTATGTGCTCTGATCTTTGAAACCAGAGAAGGGAGGAAAAGCCTTATGGAGGTC 
CCTCCCCCACCCT AATAGGTTGCTACCCTCAATTGOCATCTGCCCACCAAGGCCCCTGCCC 
AGTGCT AGGCAGATAGCCAAGTGGTAGGGGTGACATGACTCCACTAAAGCAGCAGCCCAAG 
GAAGTGAGGGA GGGG ACAGAGGCT GACCCTCCTCAAAG CCCCTT GTTGT AGT AGACCACTT 
CAAOGCTGGGGTTTTGCTCATGGATCTGAGCrcTCAGCTCTGGT^ 



CCCACCAGTCCCACCTGCAGGGGTGCCGCCAGCCAGGGGCGGCGCTrCAGGAAGTCTTTC 
ttctccagagtgtccatgatcagtggtgggatggccatggcaggaatcgccatgcagattc

JJ^yj^TCACCACCTGGAAGATTCCCTGCTTGGCTGCAGTCACCGAGTAGCCAAGCCTCTGA 
CCTGCGTCATCAGCCACCGGGATGC^CCACCTGCAGCTCTCTCTGCCTCATCAGGGGGATGT 
TGATGCAGTrGGCAGCTGCCACTGCTGCAAAGGGCACAAATCTGCCGAGCAAGGGGGGGA 

ggtgcttggtgagggatttgagtcx^gggccgtggccacagctccagtggtggcactca 
cataggctgtccccagctgcctcacagtgatgggagtgtcaccactgcggttggagtagtt 
aagaatggcattgaaggactgattcacccact gccagaacaccacggttggggtcttcctgt 
agaatgtgagcatgcagccagtgatggtcatgttcatgggcagctgggctgagatgcggcc 
aatc^gaccaccttctcccctgtgtccggatggaaggcggagtcatacacatacttggcc 
ctccacagctggtcctcggtgatccctg gggtcaccacg ccg g ccct gt agttct g cacga 

GAAAAAGTGCCGGGCT CT GCCCAGG AAAGTACTTTGGTCCCAGCGAGGTTCCTGGATGTTG 
ATGTCTAAAGGCAATTGACCGATTTTGCTTTCCATCGCTCTCTGAAGCTCTGAGAACCGCCTT 
CCTCTCTGCCTCCGCTGTCGCCGCCGGGCCCTCCCAGCCATCACGCGGGACGTCACGCGT 

GGATTGGC^GAACCCAGCCACCACACAGAGCACGCCTGCAGCAAACTCAGGCTCACGTCC 
CGCTCTGCCGCGTGACGTCCCTCCACGGAGGCCTCAAG GGTGCCCAGGGAGGGGTGGCAA 
TTCCAGGAGCCCCGGTAGCTGGCTCGGCTCAGAGAAGAGGTGGTTCGAGCCGGAGGGGAA 

caagccagacccaagccctaaatttgcggaatgagctagggcgtc^gaaccagaactgggg 
tgggtaaagaactccgctgcccgggaccaggtaacgccggacgcgtgggtcgac 

>394 

nngtcgccattgtgacccgagaatttaatatacttgtgccccgcgcgtgttttgtta 
accgtccgcccgagtgtcccctcaatttctacccctcgccccttaacccaaagct aaatcca 
ccatctggtattctcccctagagcaccagatgaccatctgaagctggacaagtgtctctaac 
aataaaacattactgtttacaaaacaaaaagcacaaacataattatggaataaataaaaaacaa 
gggacaaacagccaactgactctacccacttggtgagaagtgatatacttcaactatttt^ 

AATGCTTCTGAAAG I MCI I GGCCCACAGAGGACTAGGGTGCAATCATTCCCTGTGTTAGTG 
AGTTGGGTTTAATGCAGCTTCAAAATTAGGGTAAAGGGACTTGGTGAAATGTTTACATTAAAT 
ATTTCACTCCT ACCCATTCT TCAGGAAAAAAGGTGAGCTCAG CAAG GCTGGATGCCATTAAG 
AGATATTTACTG 1 1 1 ICi 1 1 ! ICTATAGCTAAAAAAGCAAACTTTACACGAAGAAGCTCTTGAT 

taag gaaatttcaatagattcatatttataaaattttaaacatttggcacagcaaaat^ 
aaaatgggggagaaaaaataggtctggttgttgtccccctttttcgacctgctgctg 
tgatgagatgctcacagaagaaaaaggcctggctttgtaccaggctggcgacaggtgctac 
caggagtgggctgaggggagaaaaact atctcccact ctttt gg cccaggcaatgtcaacg 
acttccacattccctggcccacttcctgagcaaccccaggtttc 



gcaatcaaaggaaacagttaaacaccaaacaatttcttaaagccaaaaaatatt^ 
GTTGAACATTTTTCGAGTGTG 1 1 1 1 1 1 icaagtgtaaaagcagtgacattttgttcaaacagaa 
gcagcatct aggaattctggcacttg ggttc tagggggttacaggtatgcatcatggattct 
tctccctcgt atttaaaaaggcctcgtgttt ctattcctgagttcataccaacacctgctagc 
TCTCCCCTCT agcggacagtgggtggccagccagcctccctggttagattgggcaatgcca 
agcagacatccctcattcacctgctgggcttgg > 1 1 C I GATTCAGAGGTAAGTCGAAGTGCA 
gagaaagaaacttacaaaagcacaaccaccaaaggcagcctg 
actgagttgctggagactaccctctcagtcccattcctg gggaaaggtaggtgact 
aatttgaatcaaacatgggggaggacacctgccaagcaatagtatgatggactcaagtcatc 
ocaggccatc tactg tctcx3ctcacctgccct aaocttttctgagtccctggc, i c i c 1 1 gtg 
caagcactgt agttt aaaaaggaaaaaacaccccccaccattaaaagacaaccrrcaaat g tt 
actcagt atataaagtttgcttaggctaaggtggagtcagaaatgtctctaattgtagagacc 
atctctgtgccacccttcctctcatcggatatggag 

cagatctgaggcacagtcaggtaccaatgtacacgacataggcacatgtgcaaacacaaaga 
aggtgggctgctgcttctttctctctgcccct agtccaggct cctttgcttcacgtaagatt 

AACACTTTCCCATTCC ! C I GAAGTTGCTCGAAGGACATTTCCCAGGAAGAAACAATTCCTCA 

CTGCCTATAAACTGTAGTCTCATGTGGGATAGTCAATTGAACATGAGAATCAGAACAATCTG 

GGCAAATGGGTATGGCAAGAATGGGAACACCACAACAGG 

GOCATOCCTTrraG^ 

CATCCAoi i u i accoccagctacctcccatgtttccaggtatcattggctctt aactcccaca 
agcctgccttttggctacccatcccaacaatatcaagagggaatgactaagtatcagctaga
aacttagccatgtctcaacattcctggattatctgaaaagctgtcgatgcccttttacaggtt 
tatggtgacagagccgtatcatcttaaagtatgttcatagttaaggcttgacttaagaaaata 
agagaaccagacataatggaaagacctcttcaataatgttgtcatgcctctcagtgaacgtg 
ctcacagtcacacttggtttg gctccccaaacccacaat agaaaagg aaaaatg agtatttt 
G 1 1 1 M CATCT gttttgtatttaaaggcattgggttacttcctcctgccct ctttt cttccct g 

AACAAGAGTTTACAACTCCTCATGGCTTCTTAATAGGTGAAGTAGGTGAAAAGTCTGAGAAG 
ctcacagcagggtttgccgtcccaact ATGCAGCTGAGAGGTCGCCAGCTCCTGTGCCTTC 
CCAGCCCCACT ATAATTGGC^GTATGTTTGTTCATGTTTCCTGAAAACA 1 1 I I C I 1 1 AAAAAG 

GAC^WVGAGCAACATCCAAACATTCCCCAAGC^CCACCCCAGTAAGTCTGAGATTATC^ 
TCCTTCCCTGAAAT AATTATAAAG AAGC ATTTCAGGCAAAAT ACTTAGT ATTAATGGTCT CTT A 
CTGCTCAACCTCCCAACCATG<XXrrTTTCCGTTTT ATGTGTATCTCTTGGAGT A 



GGGACCAAAAAGT ATCTTGGCCCTTTGGGAGTTrCCTTGTCAGAAAGTATAAG CCTCAACAG 
GAAAT AG AG G CTCCTTCTCCTTTGA GTTCAAT ACCCTCCCT GTGTCCT ACTC ACCAG G AAAAT 
^GTGTGTTCATATCCCACCTAATTTACAACAGAAGATAACCCCATCCCATCCCCAAAACATA 
AAAAT ACAAGTCTATGCCCATAGAACN 
>395 

NNNNAACACATTTCCC<n"CCCGTCGCTGGGTCCCTGCGATCGCCCCCAGCTGGTG 
GGGCTCGCX^GCTCAGGGGAGCGGGGTCCTCTGCGCCGCTGCAGC<X^GTTAATAAACA 
GTTAAGTTTGGAAGACTCTGCAG ACACGTTGAGGGGGAGTTACC AAGCCCAGG CAGCAAAA 
ACATCCTGGCACATTCCTGGGGAGTCCTCAGCTGCCAGCATCTGATTAGAACCATATCTCTC 
G CCGGGAGTGGCCGCGCGGCT(XXSAAGCTCCt^G^X^GCGGCTATTTAAGCGAGGCCCG 
CCGCATCCGCTGCGCTGTAGCCTGGAGGCTCCGGGCGCGGGGAAGTCATGCTCGC 
GGAGGCAATAGCTAGCCGGTGTCTGTGGGAGGTTATGTTTATTTGAGACTTCTCCATCGGGA 
TCGCCTGGTGTCACCAAGTGTCCACTG GTACTGAGGTTTGCT GCCTGCCTTCTTGCCATGTC 
TAACGAAGTAGAAACAAGTGCAACCAATGGTCAGCCCGACCAACAGGCCGCACCAAAAGCA 
CCCTCAAAGAAGGAAAAAAAGAAAGGCCCTGAAAAGACAGATGAATATCTCTTAGCAAGGTr 
CAAAGGCGATGGTGTAAAATATAAGGCCAAGCTGATTGGCATTGATGATGTGCCAGATGCAA 
GAGGGGATAAAATGAGCCAAG ACTCTATGATGAAACT AAAGGGAATGGCGGCAGCT GGT CG 
GTCTCAGGGACAACACAAACAAAGGATCTGGGTCAACATTTCCCTrrCTGGGATAAAAATAA 
TTGATGAGAAAACT GGGGT AATAGAGCATGAACATCCAGT AAAT AAGATTTCTT 1 CATTGCCC 
GTGATGTGACAGACAACCGGGCATTTGGTTACQTGTGTGGAGGAGAAGGCCAGCATCAGTT 
TTTTGCCATAAAAACCGGGCAACAGGCTGAACCATTAGTTGTTGATCTTAAAGACCTTTrrc 
AGTTATCT ATAATGTAAAGAAAAAGGAAGAAGAAAAGAAAAAGATAGAGGAAGCCAGCAAAG 
CAGTTGAGAATGGGAGTGAGGCCCTAATGATTCTAGATGACCAAACT AACAAACT GAAATCG 
GGT GTTGACCAGATGGA 1 1 1 til I \ GGGGACATGTCT ACACCTCCT GACCT AAAT AGTCCAAC 
AGAAAGCAAAGATATCCT GTT A GTG GATCT AAACTCTGAAATOGACACCAATCA GAATTCTTT 
AAGAGAAAATCCATTCTTAACAAACGGCATCACCTCCTGTTCT 

GGCA TCCTT CTTGOCTGAAAATGOCTTTTCTGOCAATCTCAA C I ICI 1 1 CCCACCCCTAATCC 
TGATCCTTTCCGTGAi5GATCCTTTCACACAGCX>W^a 

TCTCAAATCTCCAGATCAGAAGAAAGAGAATTCGAGT AGCTCGTCT ACT CCGCTGAGTAATG 
GGCCCCT GAATGGTGATGTTGACTACTTTGGTCAGCAATTTGACCAGATCT CT AACCGGACT 
GGCAAACAGGAAGCTCAGGCAGGCCCATGGCCCTTTrCAAGTTCGCAAA 
TGAGAACTCAAAATGGGGTATCTGAAAGAGAACAGAACGGCTTCTCTGTCAAATGCTC^ 
AACCuk ' • I GTGGGAAGCCCTCCCAAAGGACTGTCCATACAGAATGGGGTAAAGCAGGACTT 
GGAAAGCTCT GTCCAGTCCTCACCACATGACTCCATAGCCATT ATCCCACCT CCACAAAGTA 
CCAAACCAGGAAGAGGCAQAAGGACTGCTAAGTCTTCAGCCAATGACTT GCTTGCATCAGA 
CATCTTTGCTCCTCCCGTCTCAGAACCTTCAGGCCAGGCGTCACCCACAG ACA 
GCCCTGCAGCOCAACCCTCTGGATCTCTrCXAAX 

TGGTGGGTCTAGGT GGTGTAACTGTCACACT CCCTCAGGCAGGACCATGGAACACAGCATC 
TTTGGTCTTCAATCAGTCCCCTTCAATGX^CCGGGAG 
GTTTTAGTC*GCCCGTC*TTTTTG^TACAAGT 
CCCTTTGCAGCCTCAACTCCCCCTCCACT 

CACCCAATGCTT GGTCAACAACAAG CCCTTTGGGGAATCCTTTT CAGAGCAAT ATTTTTCCAG 
ctoctgctgtgtccactcagcccccatccatgcactcctctctcctggtcactcctoctcag
ccacctcccagagctggccctcccaaggacatctc^^ 

cacrrtggggataaagagatcaaggatgtgaaagaaatgtttaaggatttccaactgcggcag 

CCACCTGCTGTGCCCGCGCGGAAGGGAGAGCAGACTTCTTCTGGGACTTrGAGTGCCTTTG 
CCAGTTATTTCAACAGCAAGGTTGGCATTCCTCAGGAJGAATGCAGACCATGATGACTTTGAT 
GCTAATCAACT ATTGAACAAGATCAATGAACCACCAAAGCCAGCTCCCAGACAAGTTTCCCT 

gccagttaccaaatct actgacaatgcatttgagaaccc 1 1 1 ci 1 1 aaag attcttttggttca 
tcacaagcctctgtggcttcttctcaacctgtatcttct 

aatccttttgxx^aaatrctgaacttggtctgcagaccatccagaggaataaaaaggttggc 
cttagtagtcaaaaacaaagctgatagccagacacgttctgatttctgcccttgttccagctt 
tgacgtattatctgttgccttatttctcattgcctcttct acttgtaaaatgcttttcactttct 
gtctaggttaaagctaaactgaatctatggctttaaataaattaagatcctaaactctctagct 
taagtgtaaatgaagtacagtagtttccct actgaaccctgcctcttgtgtccxtggaacctt 

CT AGAACACCTGCCTT CT ACCCTCTG GTTG GGAGATG CAG CCACCAC ATCCCTT CAT ATCAT 
ACTGTrTTGAATAAATTTTCAAATCCTT ATTGTTCAGAGTTGTTTGGGGGTTCTGTTTCAGAGC 
ataaaacct aaaggttat AGTAGAACAAGGCACCTTCTTAAAAGAAATCTTGCTTCAGACCAT 
CAGTTACAGAGAATTTCTAAAGTAAAATTGAAGCAACTACAACTTCTCXTT^ 
ATCTAACCACTTAAGGACCTTTTTAAAGAGATAGCTTCTCTTCXrTCT 
CAAGGCCAAGATTGTCCTTTTCTCCCATTTCrrGCTAGCTATTGCAAATGAGGGAA^ 
ATTCATCTCTCCTCCCC I HUM ICIU ATT CI 1 1 1 1 I CAGTCAGTTTTGCTCCTGGGTTCAAG 



TGATTCACAGACAGATCTGGAGCCTCTCTTCATTCTCAGTAATTGCT AGTCCCAAGAACTAGA 
ATTGCAAATGGGCACAACCT ATATCCTTCCT GTGGAAGAGGAGGOCACTCTCTTGAGCTGAA 
GTTC CAGAAGAGCAGTTAATGTTCAAGAGAAATTGAACTCAACTCAGCAACAAAGGACTCTA 
TTTTGAAGAGCAACATATCACAAAGCTAAATGTGATTGTGCCAAACACATTAGGTGCTTATTT 
G GGGTCATGCT AGG CCTTTATCAAGT AACT GGAAAAC 1 1 1 TCI I GCAGCCACAATCTCAATGT 
CGTTAGTAGGAAGATAAGAGGGGAGAAAAAGCTGTAGAACAAATGTTTGGGGTTACCATTGA 
AAATCTAATGTCTGGAATATTTTTCTCCTCACAACTTGGAAACGTTCCCAGTTC^ . 
CTGTTGTGAGCACAGTTCTGAAGG G TTT ATT ATTGTCAAAAT AAGTTTT GTTTT GTTTTGTTTA 
TGTTGGGTTTTTAATGTrGTCTCTrGACCCTTAATGCTCAGGTTCTTGTGGGAGTTAA 
CACATCCAACGTTACCTTGAGGGGGAAGAAGAGGGTGATGCTCAGAAGCTAAACAAGACAG 
GGGCCACATGAGCCTCTATTGATTAGCCCCAAGTAGAAAGTCCTGT GGTTTTATGTTTAATG 
GTAATAGTTGATCATATATGGCATAAT7TTCTATCAGCTTCCTACTCAGTCACT ATAAACACAG 

acttqaaatagt actttaaatqtocaaatacct aaatqtgctaaactggaggtaactatttcta 
ggt agttgaatttttgaaagtcatgatcagccacacaactgttttgt acat actt att7tctga 
tgcaci 1 1 jcigi atgcaaat aaagct at aaattt actcatttcaat aaact ggagtgggagaa 
taacaaaagaaaatataaataataaaaaaaaagcggcc 

>396 
>397 
>398 

NAGCG TT ACTCT GACAAAGAC I IG I I fCTGI I GCAT AGCAACTGAAQATAAGTATGC 
AGAACTTAGCAAAGATTTTGAAGAGAAAATAATGTAAG<^TTACGTAATTCGGGAAATAAAAA 
OCTGAAGGGCAATTTCAACTAAGTATGGCGACTACCTGCT CACCA I C 1 1 NG GAGACCGAAG 
AAGAAATGAATCGTCTGAACAGACTTAGCCCGACAGAGXrTGCCCAAT 
TGTTAAATGACCACAAGAAATGTAACATCICI 1 1 1 1 aggacaccctagataagaaatcttcca 
ACTGGTCCTTCAGGAGGGTGTCTAAGCAGTCTTGGATGAGCTACATCAACAGGT^^ 

ccacagagccagcaatcagggaccacattocccgaggt acaaaatttagaggtttccccttt 
atgaacaagagacccaggtgccagcatgttactaccagatccaqi 1 cmc ! iaggacagtgt 
ggctcaaagggatgagaccttccagacactgctatctgagcatct gtggcct gcooctgag 
ttgtcaagat aatttcctt atctctgaaggagtccagacaggaatgcttccact gct gggt g 
ggtgctcgcccctcttgctcctraagcgcccggctc^pccccttgct agcacagggtgtct 

T ACACAGTTT ATGGGAC 1 1 1 1 CI GTGAACTACCTGAQGGCAAGAACCATGTCCCACTCCCTC 
CTTGCTCC1 CAAATATTTTATAGGAAAGGAGTCCAGAGTCTCACACAGAGGAAACATGAAGTT 

GTTCAAGCTGAGGAGAGTTCTC 1 1 1 CCTTAT * 1 ~ ~ ~ * — ~ — 

OCTGCCAAGTCACCCTTGACTTACN 
nnnnnnnnnpwngcxx^ggtacgcggggagagaggaaaagaacacagatctcgca
TGGTTCAGA I rTTTCTTTl i aggtccaggagtaagatatatcatacgaaaatgaaaattataat 

TCTTCTTGGATrCCTGGGAGCCACAWGTCAGCOCCACTTATCCCACAGGGTCTCATGTCTG 

ccagcaatagcaatgagttacttcttaatcttaataatggtcaacttttgccactacaacttc 
gggcccacttaattcatggattccacctrrctctggaattttacaacagcagcagcaggctc 
aaatto^ggactctcccagttctctitatcagctctagaccagtttgctggactg 
atcagatacccttaacaggagaggccagttttgcgcaaggagcccaggcaggccaagttga 

TCCCTT ACAGCTT CAAACACCGCCTCAG ACAGAACCAG G CCCCAG TCACGT G ATGGCCT AT 
GTATTCTCCTTCAAAATGCCTCAAGAGCAAGGACAGATGTTTCAATACTATCCAGTTTACATG 
GTCCTACCCTGGGAACAACCTCAGCAAACAGTTCCAAGGTCACCTCAACAAACAAGACAGCA 
ACAGTATGAGGAGCAGATACCATTCTATGCTCAATTTGGATACATTCCACAACT AGGAGAAC 
CTGCTATATCAGGAGGACAGCAGCAACTAGCTTTTGATCCCCAACT AGG CACAGCTCCTGAA 

attgctgtgatgtcaacaggagaagagataccatatttacaaaaagaagcgatcaactttaga 

catgacagtgcaggagttttcatgccctcaacttcaccaaaacccagcag 

cacttctgctgtagaccaaactattaccccagagctcccagaagagaaggacaagactgaca 

gcctaagggaaoga taag aagttgccctgatcattcagacattttgggaaaaagatgtggcc 

atgccttggatataattttaggctattagcttcctcaatactagtatcagttctttggaataca 

tgaaat atcttgactcttctcxrraaatttgttttt actt at acatgtt att aaact cttt aaata 

TGTCATAGAAAATAATACAATCATGTAATGAGTC I IGICI I ACAAAATTATATGTCTCTTCAAA 

TATCCTATCATTGT AT AATATGGAATATAATAACACAGAAT AAAGCT AGTATCATTAAATCAAT 

TG<^TAATTGCATTAGTAAATGATGCCTCTGCAAAATGGTAGTACCCATGAAGATATGTATAT 

TGTCATTGGATGTATGATGAGTGTTGTGATTGGAACTGATGAAGTAAAATAAGTATCTAGATT 

TGAAAAAAAAAAAAAAAAA 

>400

>401

CAATCCTGCAG AACGGCCATACAAATTGCCAGACCTGTGCAC>ACXiCTG^ G 

CAGGACATTACAATAGCCTGTGTCTATTGCAGACGACCACTACAGCAAACC^^GGTGCATGT 

GTTGTCTGAAACCGCTGTGTCCAGCAGAAAAATTAAGACACCT AAATAGCAAACGAAGATTT 

CATAAAATAGCAGGAAGCTATACAGGACAGTGTCGACGGTGCTGGACCACAAAACGGGAGG 

ACCGCAGACT AACACGAAGAGAAACCCAAGTATAAC>TCAGATATGCGTGGACCAAAGCCC 

ACCTTGCAGGAAATTGTATTAGATTTATGTCtTrTACAATGAAATACAGCCGGTTG 

TGTCACGAGCAATTAGGAGAGTCAGAGGATGAAATAGATGAACCCCACCATGCAGTTAATCA 

CCAACATCAACTACT AGCCAGACGGGATGAACCACAG CG TCACACAATACAGTGTTCCTGTT 

GT AAG TGT AACAACACACTGCAGCTGGT AGTAGAAGCCTCACGGGATACTCTGCGACAACTA 

CAGCAGCTGTTTATGGACTCACTAGCCOGGCAGGTACGCGGGTCTGCAAC^ 

AACCT GCT ATGGCCAATTGTGAAGAGATGGGAGTCT CCCCGTATTG<XX>GGCCGGTCTCA 

AACTCCTGGGCTCAAGCAATCTrCCCGCCCCACTTCOCGAAGCCCT AGGATT ACGGGAGTG 

A GCCAC CGCACOCAQCCAGAAAAAOQ TTT CAAAT ATTGGAAAACCTT A C HUH C AATGAGC 

ATTTTTGCATCAAGGGGTAAGAGGGACATTAGGC I 1 1 1 1 1 1 CTCTTAGACTCCAAACAGTAAG 

GTGAGAATTTATCAAGACATTACATAGGAGTAAGGGCACAGCCAGG 

ACATTTTCX^AGCACTAATTAAC^GGTTTTATGATTCACT AGGTTGGCCCAACT ACTGTTCTCA . 

CCT AA7TCOCAGGCACAGCGTGTCAGGAGGCCAAATGACACTTTCCAGTC 

GT ATGAAGGGGGCAGAGATCACCTAGTGACCATCAAGCAGGCCATCCAGAGGCAAAACTCC 

TTATCTGAGGAATTTAGAAGT AATTAGACTCCCCTGTTATTTAAAGCT GGCATCTGGGTCCAG 

GCTTCTTCCCCAAAAACTTGTAAGTAACTAGAATTTCTACATGTCTCCCA 

AAACTTATCGTGCAACCCTTGCTGACATTAAGACACCAAAATGTCCAAAAATCTAATGA 

TCATGACCTAAGTGGTTAATATGGCCCAAATTCCCTTT AAGCTCCTGCTTT AAGGTTCATAAA 

TATCCCT AAGGAAAATCCACTCAGTCCTCTCTTGCTGAGGCACCCTGCT 

CGGCCGCTCCACCGCGGTGAGCNNNNN 

>402

AGX5GAGTCGAOCCAGGCGTCCGGAGCTGAATCACCAGAATGGGGAACCCA 
GTAGAAG^TGGCACTGTGGATGGTGCAGACAGGGATGAAAAG^I^^ 
AGGAAGTCAGGAAACTAGACCTGCCAGTGCTGTTCCCAAACAGA 

GCTCTGATGGTCAAAGGATGGCAAAGTCCACTTTAGAGGCCATTAGCCT AGCAGTTGAGTAG 
GAGCTGTCACTGATCCTGGCATCATCttTCATCTTGATCT^

TCATGGAGTTAAAG^TTATTCCCCTTGCATTTTACAGCCCTGATTTTGCATTrGCC 
TTCAACTAAGTTCATGTCATCTTTTCCCATGGAGTTTCACTATGCATTGTCAGACTGCT 
AGGCCTCCAGACCAGGTCCATCTTCTCCCTTTTTCTCT AGCATGGCCTTGCCCTTT CCACAG 
CCCTGAGCCTATGCCTGGCCTGG GTACCTT AGCCCCTTGCCTTTCCTTCCCTGTTTGAAGCC 
TGCCCAAGCCCCAGGGTTGCCC I 1 1 IGI I ATTCTGCTGCCT AGAAGTTAGAACATTCAAGAA 
TTGAGGGT AGACCTAT AAACAGACTGGCAG ATGATATAGTGCTTAATGTCTT GCCTCAGTTTG 
TCATTGTGTCAGAACATCCGAAAAAAGATAGGAGAAAAGTTGGAGTTTAGGGCATAATTTTCC 
TGTAATAATTTGGAGCTCAAAAATATTACAATGAGAAAATAAAAAGCTATTAATTAGCAGGAG 
ACTT AATCACATGAAGAATTTCAATTAGATATTTCTrTCTACGTAGTATAGTTATAACTTTGCATT 
CCATTGH I ICC I ? CTCT ATAGTCCATTGT ATTTCT GTGGGG I 1 1 I I U I II I CAGGTTTTGAG 
TTCAAGAACGAGATTAAAAAAGAAAATCTAAAATGG GATGATTCAGAGGAAGT agaaataaac 
AAGGCTITACAGAGAAAGTCCAGAGGAGTTTATTGGCACTCTGAGCT ACAAAAAGGCTTGGA 
GAGTGAGCCAACATCAAGAAGG CAATGT AG AAATTCTCCAG GGGAGAGTGAGG AGAAAACC 
GCATCCCAGGAGAAGATGAGTCACCAGAGTTTTTGTGCCAGGGACAAAGGCTGTACACATAT 

GAAAAAGTATCTCAATGTCCTGAATGTGGGAAAACCTTT AGCCGAAGTTCTT ATCTT GTTCGG 

C^TCAAAGAATCCACACAGGCGAGAAGCCTCACAAGTGCAGTGAGTGCGGGAAGGGCTTTA 

GTGAGCGCTCCAACCTCACTGCCCACCTACGAACTCACACAGGGGAGA^ 

TGGGCAATGTGGGAAAAGCTTCAACCAGAGTTCCAGCCT"CATTGTCCACCAG 

A<X»GGGAAAAGOCTTACCAGTGCATTGTCTGTGGAAAGAGATTCAACAACv^ 

CAGTGCTCACCGGCGCATCCAGACTGGGGAGAGCCCATACAAGTGT GCAGTGTGTGGGAA 

AA T CTT CAACAA T AGCT CCCACTT CAGT GCCGACCa^AAAAACCCACACT GG TGAAAAGCCTT 

ACAGGTGTTCrCACTGTGAGAGAGGCTTCACTAAGAACT CTGCCCT CACCCGTCATCAGACA 

GT ACACATGAAAGCAGT ACTCTCATCACAGGAAGCAAGAGATGCGTT ATGAGTGTGTCGGT A 

AACT GTCAGATTAAGTTCCTCAGGTCAGCATGT ATGAGCTTT CTT CT GCT GT GGAGAGAT CT A 

GCCAGTCCCTGACTTTGCAACAGACCTACTGACTACT GGATCTTAAGACCCATGTCT AGGAC 

CAGGAGTCAGCAT AAGGaCGCTGACCTCTCCTGGCT GTGCCTGTGACTCGAGAGTCCTATC 

TTACTGT GACTT AAAGTTTGATGGAGAGAAAGCTGTAGGATCCATAAATTCT ACCAGGAAACC 

AGGGTCTTCCTGTTCCTAGCACTGAGAATGGGCACCCAGTGGTC^^ 

CT AACATAGTCCTCACACAGGGCTGAGAAAGAAAGTGTCTCCTTTCCT ggaaagcacatgta 

aaagttaagggcctgaatctcttctaaaccaataattgacctct aggcacccaccataatact 
gct accttcagggcagaagacactgctctttgtaacccaggcaocacy 

AAGAAGGAAGGATGGTTAAGOCATTGGATAATACTGAATCTG 1 1 1 C OG I AAGTGACTT AACCT 
TAGAGCAAGCACAACATCCAGGTTAA \ 1 1 1 1 1 G I AAGATTTCT CCTTTT ATTATTGCCCTCATC 
AT AGTTOCTGATTGTCTCTT AAAGT AAG I GUI 1 1 ATAGACATTACTATTTCTGATAATAATTTA 
CAAACTACT ATAAACAAATTTATAAACATTACTAATTTCTGATGAAAATAAAG U G 1 1 I C TO CC I 
CCACAAANNNNNNNNNNNNN 
>403 

fr^lACX^ACGCGTCCGGATTAATCTATTTGCCTAAATGGG I M G 1 1 CAGGTATCCATT 
TTTAACAAAGAAG I f IGIGTI CATAT AGT AAAAGACCTATCA G I G 1 1 1 C CACCATGCACTTCTA 
1 1 1 1 1 1 AGGAGTTTATAATTTTAAGTCTT ACATTCCT AGTAACATTTGGG C 1 1 MCI I AGGTATG 
1 1 rCTAGTCl 1 OCUC I CAAATCATGCATGCAAATAGGAGGACAGT GGTGGTGACTCAACTGG 
ATACAGGTGCT CAAT AGTCAGGCTTGATAGTGATGTCAGGACGCATT ACAAGCTtaT AAGCCG 
ATACTGACTGGCCATTGGCACCATCCTTGACTAACC rTOCTCTTTI I CTCT AGTGTGCCT ATG 
GTGAAATGGC AATAGCA TTCACTGTCGTATTTTGCA<nG 

TGAAGGTGCTTGTTTGT ATTAGCTCTGCTAGGTTTACCTCTACAACGTAGATTTCAGCAGCTA 
TGCTGACTGACACTACATTCT AGTTCTT AAGATTTTTTTTCCAGATCCCCCCTTCOCCAGCTA 
GACATACGTAGCATACTTTCATCTT ATTCAGTCTTTCT GT AACCTGCT GCTGCTTTT AGTCCT C 
CT CACCT CAGAT CGGAAT CAATGGAGTGGGCCCAGAGGATACATTTTAATTCCAGTAATGGT 
AGGT AGATTTGTCCTGCmCTAAAACATCTCCTCATT^ 

ATAAGGQAAAATTAATGGG I G I I fOCl OCTTT AGGGAGGT AATGCAAAGAGTGTGQACATCT - 
TCT AAT CTTGAGGAACAGT AG TTG A 1 1 rO OCI I G AAGGAGCTTACATATTGA UI G 1 1 1 1C ACA 
ATAACC I G i H CiCaXAGTTCAATOCTCATTTTAATACTT AATTnjGTACTGGCTCAAATAGCA 
1 1 1 ICI I ACAGATAACAAATCAAGAGTGAAATTrGAGGTTATACTCCAGT AAAGTTTTT AACAC 
TTGTGAATATGGTCAGCTAGACTAAACTTGACT C 1 1 1 1 f 1 1 1 AATGG C 1 1 III I ATCTGTGAAC 
ATTCAGATAAGTGGATTTTCAAGTACT GGTTGG GGATGGGAATCGTGCTTTT CTTT AAACTT C

AGTTT ACGA GATGCTTTGAGAGCGTTAGGCAAAAGCAGAAATAAATATCAGGAGCAACGGG 

GAAAGCTTTATAAAAGATCATGGTGGCCACTGTTGCAGCTTTGAAGAATGAGTGCTGGCTTG 

AACAGTTCTTTGCCTGCATCATTGGTAGCTGCACTGAAAGGAAAAAACTTTCAC^ aagaat 

TTGAAAAGGAAGAAACCTGGGCTCTGGTCTTCATGGCATTTAGACTGAGATGCTTAAACAGA 

Ai^GAAGTAATACGCATTTCCTGCCATAGGATAGGGAAAATGTAACAAGCTGGTTGCT 

AGGTTAGAAAATTGTCTGTTTCTCTCTGGATGAAGCTGGATTTACTTGAAAATGGAGAGTTGG 

CTT ATTGTT TGAATATTGGGACATCAAGCTATCT ATAGCCAAGTTTC^GTCCCAACCAGTTTT 

CCCTTTGTCTG GGGTAAATTCGATACAAAATGA I fCTTTI I GAATCCTGAATCCATAAATTACA 

CI II I M M 1 1 CAAATTCACAAAATTCACAGTGGTGCTGACTGTGTAATAACCACT ATTGGGAA 

ACATCCCGTAAACCTGCCTGTTGC^TGCCAATGGAGTGACTGAACTGGTGACATCTGTTTG 

AGCATGCTTT GTGTGGCTGGTCACAATGACACGGTTGGTGCTCACCTTGACCTCCGGGCTA 

CGGGACGGAGGGTTTTCTAGAKNNNN 

>404 

tAGGGagttacccacGCGTCX^SCCCTGCGCCAAAGTGAGCAGTAGCCAACATGTCAGG 
GTGGGAGTCATATTACAAAACCGAGGGCGATGAAGAAGCAGAGGAAGAACAAGAAGAGAAC 
CTTGAAGCAAGTGTGT ATCCAAAGTGTGTACATCAGT AAGATCATAAGCAGT GATCGAGATC 
TCTT GGCTGTGGtGTTCTATGGTACCGAGAAAGACAAAAATTCAGTGAATTTTAAAAATATT^ 
OSTCTTACAGGAGCTGGATAATCCAGCTGCAAAACGAATTCTAGA 

GGCAGCAGGGACAAAAACCTTTCCAAGACATGATGGGCCGCGGATCT GACT ACT CACTCAG 
TGAAGTGCTGTGGGTCTGTGCCAACCTCTTTAGTGATGTCCAATTCAAGATGGGTCATAAGA 
GGATCATGCTGTTCACCAATGAAGACAACCCCCATGGCAATGACA^ 

GGCCAGGACCAAAGCCGGTGATCTCCGAGATACAGGCATCTT CCTTGACTTGATGCACCTG 
AAGAAACCTGGGGGCTTTGACATATCC 1 1 G 1 1 C I ACAGAGATATCATCAGCATAGCAGAGGA 
TGAGGACCTCAGGGTTCACTTTGAGGAATCCAGCAAGCTAGAAGACCTGT^ 
CGC<K3CAAGGAGACCAGGAAGCGAGCACTCAGCAGGTT AAAGCT 

TAGTGATCT CT GTGGGCATTTATAATCTGGTCCAGAAGGCTCTCAAGCCTCCTCCAATAAAG 
^C TATCG GGAAACAAATGAACCAGTGAAAACCAAGAA^CCGGACCTTTAATACAAGTACAGG 
CGGI ITGC1 1 CTGCCTAGCGATACCAAGAGGTCTCAGATCTATGGGAGTCGTCAGATTATAC 
TGGAGAAAGAGGAAACAGAAGAGCTAAAACGGTTTGATGATCCAGGTTTGATGCT CATGGGT 
TTCAAGCCGTTGGTACTGCTGAAGAAACACCATTACCTGAGGCCCTCCCT 
AGAGGAGTCGCTGGTGATTGGGAGCTCAACCCTGTTCAGTG CTCT GCTCATCAAGT GT CT G 
GAOAAGGAGtrrroCGGCATTGTGCAGATACACAC^ 

GGCTTTGGT GCCACAGGAAGAAGAGTTGGATGACCAGAAAATTCAGGTGACT OCTCCAG GC 
TTCCAGCT GGT CTTTTT ACCCTTTCCTGATGATAAAAGGAAGATGCCCTTTACTGAAAAAATC 
ATGGCAACT (X^AGAGC^GGTGGGCAAGATGAAGGCTATCGTTTGAGAAGCTTCGCTTCACA 
TAC AGAAG TGACAGCTTTGAGAACCCOGTGCTtSCAGCAGCACTTC 
TGGCCTTGGATTTGATGGAGCCGGAACAAGCAGTGGACCTGACATTGCCCAAGG 
AATGAATAAAAGACTGKKSCTCCTTGGTGGATGAGTTT AAGGAGCTTGTTT ACCCACCAGATT 
ACAATCCT GAAGGGAAAGTTACCAAGAGAAAACACGATAATGAAGGTTCT GGAAGCAaAGGC 
CCAAGCTGGAGTATTCAGAAGAGGAGCTGAAGACCCACATCAGCAAGGCT ACCT GCCOGG 
G06G006aagACAOCTaAAGGQ6QCTGAATCCAGGG^ 
aoQccaCAGCACCTC^caAGTTAac^TGCTGGCTTGG 
GTtG^CTCOSCetaj^TGGATTGAGTTCCC^ 

gteagcUTtecAtf e p QcqgCl I tptagAG TCCAG aaoQATGaGQAajGG T CCCcacCATGCAGCATA^jet 
GTgctaccAAAAAGCAGCca^TacfrcmAAGTGGGC^ 
CTCCCAACAGGGGTCTCCAGCCACCT CCT ACAGtf 

KNCCAATTGGAACXTTCCACCGCGGGGGCGGCCGAGGTACGCG 
GGAGAGAGCTGGCTCAGGGCGTCCGCT AGGCTCGGACGACCTGCTGAGCCTCXXAAACCG 
CTTCCATAAS^ttTTT G^CTTT^ 

GCCCAAGGCIGI rCTCI 1 1 1 I AGAGtTTGCT ATCTCCA G I 1 LCI I GCACT CC I G TTAACAAGC " 
ACCTCA6<*iAGAGCAGCAGCAGa^TAGCA 

GTC^TGACCAGGGCG^3GAGATCACAACCGCCAGAGAGGATGCT GT GGATCCTTGGCCGAC * 
TACCTGACCTCTGCAAAATTCCl I CICI ACCTTGGTCATTCTCTCTCTACTTGGGGAGATCGG 
ATG i GGCACTTTGCGGTGTCTGTGTTTCTGGTaGAGCTCTaTGGAaACaGCC I CCTTTTGAC

AGCAGTCTACGGGCTGGTGGTGGCAGGGTCTGTTCTGGTCCTGGGAGCCATCATCGGTGA 

CTGGGTGGACAAGAATGCTAGACTTAAAGTGGCCCAGACCTCGCTGGTGGTACAGAATGTT 

TCAGTCATGCTGTGTGG AATGATCCTGATG ATGG I I MCI I ACATAAACATGAGCTTCTGACC 

ATGT ACCATG G ATGGGTTCTCACTT CCTGCTATATCCTG ATCATCACTATTGCAAATATTG CA 

AATTTGGCCAGTACTGCTACreCAATCACAATCCAAAGGGATTGGATTGTTGTTGTTGCAGG 

AGAAGACAGAAGCAAACTAGCAAATATGAATGCCACAATACGAAGGATTGACCAGTTAACCA 

ACATCTTAGCCCCCATGGCTGTTGGCCAGATTATGACATTTGGCTCCCCAGTCATCGGCTGT 

GGCTTTATTTCGGGATG GAACTTGGTATGCATGTGCGTGGAGTACGTCCTGCTCTGG AAGGT 

TTACCAGAAAACCCCAGCTCTAGCTGTGAAAGCTGGTCTTAAAGAAGAGGAAACTGAATTGA 

AACAGCTGAATTTaCaCAAAGATACTGAGCCAAAACCCCTGGAGGGAACTCATCTAATGGGT 

GTGAAAGACTCTAAC^TCC^TGAGCTTGAACATGAGCAAGAGCCTACTTGTGCCTCCCAGAT 

G GCTGAGCCCTTCCGTACCTTCCGAG atggatgggtct CCTACTACAACCAGCCTGTGTTTC 

TGGCTGGCATGGGTCTTGCTTTCCFrTATATGACTGTCCTGGGCTTTGACTGCATCACCACA 

GGGTACGCCTACACTCAGGGACTGAGTGGTTCCATCCTC^GTATTTTGATGGGAGCATCAG 

CT AT AACTGGAATAATGGG AACT GTAGCTTTT ACTTGGCTACGTCGAAAATGTGGTTTGGTTC 

GGACAGGTCTGATCTCAGGATTGGCAC^GCTTTCCTGTTTGATCTTGTGTGTGATCTCTGTA 

TTCATGCCTGGAAGCCCCCTGG ACTT GTCCGTTTCTCCTTTTGAAGATATCCGATCAAGGTT 

cattcaaggagagtcaa7tacacctaccaagatacctgaaattacaagtgaaatatacatgtc 
taatgggtctaattctgctaatattgtcccggagacaagtcctgaatctgtgcccataatctc 

TGTCAGTCTGCTGTTTGCAGGCGTCATTGCTGCTAGAATCGGTCTTTGGTCCT^ 

tgtgacacagttgctgcaagaaaatgtaattgaatctgaaagaggcattataaatggtgtaca 

GAACTCCATGAACTATCTTCTTGATCTTCTGCATTTCATCATG GTCATCCTGGCTCCAAATCC 

tgaagcttttggcttgctcx^attgatttcagtctcctttgtggcaatgggccacattatgta 
tttccgatttgcccaaaatactctgggaaacaagctctttgcttgcggtcctgatgcaaaag 
aagttaggaaggaaaatcaagcaaatacatctgttgtttgagacagtttaactgttgctatcc 
tgttactagattatatagagcacatgtgcttattttgtactgcvvgaattccaataaatggctg 

GGTGTTTTGCTCTGTTTTTACX^CAGCTGTGCCTTGAGAACTAAAAGCT 

agtcagcagaaattaactgattaatttcccttatgttgaggcatggaaaaaaaattggaaaag 

AAAAACTCAGTTTAAATACGGAGACTATAATGATAACACTGAATTCCCCTATTTCTCATGAGTA 
GATACAATCTTAGGTAAAAGAGTGGTTAGTGACGTGAATTCAGTTATCAT7TGACAGATTCTT 
ATCTGTACT AGAATTCAGATATGTCAGTTTTCTGCAAAACTCACTCTT GTTCAAGACT AGCT AA 
TTTAI 1 1 III I UCATCTrAGTTATTTTTAAAAAGAAATTCTTGAAGTATGAAGACTAAATTTTGA 
TAACTAATATTATCCTTATTGATCCTATTGATCTrAAGGTATTTACATGTATGTGGAAAAACAA 
AACT AGAATCTCT AT AGGGTAGGTAGTTAAAG^GACTTGCTTTTATATCAATGGGGACT ATGG 
TTGGAjGCTTTGTGATTAAAGGGGGTTAATGAACGGGGAAAACAAGGAATGATTTAG 




NACAATTGCCTTCCTTTT AAGTTTCAGGGTCAAGGGAAAGTGTGGAAAGGTTTTTTTC 

NNNNNWIWNNNNWNNNWNNNNNNNNNNWNNNNNWN I MM M VmmTm UO r UG ^ATAA 
TGTTTATTTAAATAGTAATCTTAAAAAGGCTTT ATACAAATCATATAAACACAGTTTGACATTGT 
TGTTTATTTTTTACATTGAAGAATCACAGTAAAAGCTCTCAAGTTCCCGT GATCTGGCTCAGC 
CAGTGAATGGTCGAATACCTCTAACTGCAAACCCTGAAAAAGTTAACCTGCT ATGTGACAGG 
TTAACTCTTTCCCTTG II II CM GTCATTCATGCTTATTCCTT GGACCCTGATCCCTGTCAACT 
CCCCTGCCCCACCAAAGACAGAAAGCAACCCTGTGGCCCTGTTGACCACCTAGTCCGGG 
AGTGTGGAGGTAATGTGTCATTCAGTACAAGGCAAAAAGGAGATCAATACTGGCAGTGGGG 
GAGCCCCGGCGACCCGGGCTTCTGTGAAACATGGCGGTAGGCTGGGACCATAACACAAGC 
ATGACT ATATGAAGGAAGAGGAAGGTTTTCCTGAAGATGAGGCGACTGAATCGGAAAAAAAC 
TTTAAGTTTGGTAAAAGAGTTGGATGCC 1 1 1 CCGAAGGflTGCTGAGAGCT ATGT AGAGACTT 
CAGCCAGTGGAGGTACAGTTTCTCTAATAGCATTTACAACTATGGCTTTATT 
AATTCTCAGTATATCAAGATACATGGATQAAGTATGAATAGGAAGTAGACAAGG AI iui CI a 
GCAAATTAAGAATTAATATAGATATTACTGTTGCCATGAAGTGTCAATATGTTGGAGCGGATG 
TATTG GATT TAGCAGAAACAATGGTTGCATCTGCAGATGGTTTAGTTTATGAACCAACAGTAT

TTGATCTTTCACC ACAGCAGAAAGAGTGGCAGAGGATGCTGCAGCTGATTCAG AGT AGGCT 

ACAAGAAGAGCATTCACTTCAAGATGTGATATTTAAAAGTGCTTTTAAAAGTACATCAACAGC 

TCTTCCACCAAGAGAAGATGATTCATCACAGTCTCCAAATGCATG CAGAATTC ATGG CCATCT 

ATATGTCAATAAAGTAGCAGGGAATTTTCACATAACAGTGGGCAAGGCAATTCCACATCCTC 

GTGGTCATGCACATTTGGCAGCACTT GTCAACCATGAATCTTACAATTTTTCT CAT AGAAT AG 

ATCATTTGTCTTTTGG AGAGCTT GTTCCAGCAATT ATTAATCCTTT AG ATGGAACTGAAAAAAT 

TGCTATAGATCACAACCAGATGTTCCAATATTTTATTACAGTTGTGCXiAACAAAACTACATACA 

TATAAAATATCAGCAGACACCCATCAGTTTTCTGTGACAGAAAGGGAACGTAT^ 

GCTGCAGGCAGCCATGGAGTCTCTGGGATATTTATGAAATATGATCTCAGTTCTCTTATGGT 

GACAGTTACTGAGGAGCACATGCCATTCTGGCAGTTTTTTGTAAGACTCTGTGGTATTGTTG 

GAGGAATCTTTTCAACAACAGGCATGTTACATGGAATTGGAAAATTTATAGTTGAAATAATTT 

GCTGTCGTITCAGACTTGGATCCTATAAACCTGTCAATTCT 

CA G ACAAC C ACTT ACCT CTTTTAGAAAATAATACACATTAACACCTCCCGATTGAAGGAGAAA 

AACTTTTTGCCTGAGACATAAAAC C I II I I 1 1 AATAATAAAATATTGTGCAATAT ATTCAAAGAA 

AAGAAAACACAAATAAGCAGAAAACATACTTATTTTAAAAAAAAAAAAAAAAGTCGACGCGGC 

CGCGAATTTAGTAGTAGTAGGCGGCCGCTCTAGAGGATCCAAGCTTACGTACGCGTGCATG 

CGACCTCATAGCTCTTCTATAGTGTCACCTAAANNNNM 

>409 

NNACACATTGTAATATTATATCATGTATAGTTGTACGCAGCTCTGTGCATAACTGTGG 
TAACTTTGTGGGTCGCTOCTGTGGGTCCT GAAACAATGCAGTTCTCCCCGCG TATGCGACAT 
CCGGCTGATGGTCATGGAGATCCGCAATGCTTATGCTGTGTTATATGACATCATCCTGAAGA 
ACTTCGAGAAGCTCAAGAAGCCCAGGGGAGAAACAAAGGGAATGATCTATTGAGAGCCCTC 
TCTCCCATTCTGTGATGAGTACTCrrTCTGCACTGTTC I MC I M CTAATAAAA C II IOM M I C 
GAACCT ATACTGTCTTCTGTAAATTCTTCTTACT acoctatgacccgtgagccaaccactttc 

cg atgccag ggttctgagagctcagctg gcat aat at aaag tg 1 1 m i i 1 1 i ia tacccttcc 

acttggaaagactacagaggaatcttgciactgcatagttcaaactaaaaagagaagagttaat 

tacctgaaaag caaga gaaaacaagaaggggtaaattttgaaccaagggaaatcatttaaga 

agtgtctggtatttttcaaatttctgtcagttgttacatttgtcataagtaaat 

aaagg atggagacatgctt attttatttaactcxxxxaaaaaaaaaaaaaaaaaaaagtacct 

gcccgggcggccgccccgggcggccgctcn 

>410 

WNCAGGTACTGTGCAGTAGTAACCATAATTCT AAATGAGGATTATGGA7TTTTCTG 
GAAGATTCI MM I CCTGTGGAACATGATGAGAAATGTTTAGGAGAGGGG ACATAGCCATTTT 
TGTATGAAGACCAATTCAAGAAAAAAATATATGTATGTGTGTGGGTGTATATGTGTGTATATA 
TGTATATATGTGTGTATGTTCATATGTCTATGTATGTTrATATATGTGGTTATACACACGCACG 
CACACACTCACACACGCATGCAGACATGCACGCACACTCTCACTCTAT^ 
TTCCTGGGGGATTGATTCCAGAACCTC^CrrGTAGATACCAAATCCGAGG^ 
TGATGTAAATAGCTTAATATGGCTGGGCGTGGGGGCTCACATCTGTAATCCCAGCACNNNN 



NNCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTGCTGGGATTACAGGCACGAG 
CCAGTGCGCCCAGCTGCCTCTG I MCM 1 1 ATTAGCTGTTCTGGACT GTGGGGCTCCTTGGG 
CAGATGCT GTATTATGGGGATAAGCCACACACTTTTTGAACTGG CCCGGTGAGGG GGGACA 
TAACCATTTCCTGTGCCACCCCATCAATOOCCACCT ATTCTGAGTGTAGGCTCCTCCCCTGC 
TTGAGTAATGGCCACAGATCTTGGCTCtSGCACTCCTAAGCTGCATGTTGAAT^ 
ACAAGACTGGCTTGTt^TTCX^TTCTCX^GATCCTTGGGTTGG CTTCTGG G TG CACT AGGAG 
ATCTGAAATGCTCTCAGGCCACCAGG AAAGTACT GGAAGT AAAGTCT GACT CT AAAGAAGAT 
GAAAATCT AGTAATTAATGAAGTAATAAATTCTCCCAAAGGGAAAAAACGCAAGGTAGAACAT 
CAGACAGCTTGTGCTTGTAGTTCTCAATGCACGCAAGGATCT GAAAAGTGTCCTCAGAAG AC 
TACTAGAAGAGACGAAACGAAACCTGTGCCTGTAACTTCTGAGGTGAAAAGATC^AAAATGG 
CTACTTCAGTGGTCCCGAAAAAGAATGAGATGAAGAAGTCGGTTCATACAC^GTGAATACT 
AACACAACACTCCCAAAAAGTCCACAGCCATCAGnrGCCTGA^ 

GCAAGCAGXSAAAGAGCAAACGAGGTAGTATTCTCCAGCTCTCTGAAGAAAT^ . 

ATTGAGTCAGATAATGTAGAGGTAAAAAAGGAATCTTCACAAATGGAAAGTGTAAAGGAAGA 

AAAGCCCACAGAAATAAAATTGGAAGAGACCAGTGTTGAAAGACAAATACTTCATCAGAAGG 
AAACAAATCAGGATGTGCAATGTAATCGTTTTTTCCCAAGTAGAAAAACAAAGCCTGTGAAAT
GTATACT AAATGGAATAAACAGCTCAGCCAAGAAGAACTCCAACTGGACTAAAATTAAACTCT 
GAAAATTTAACTGTGTGCAGCACAATAAGTTGGACTCTCAAGTTTCCCCT AAATTAGGCTT AT 
TACGAACCAGTTrTTCACX^CCAGCTTTAGAAATGCATCATCCAGTGACT 

tagggacaaagctac^tgatagaaatataacttgccagcaggaaaaaatgaaagaaattaatt 

ctgaagaagtgaaaattaatgatattacagtagaaattaataaaaccacagaaagggctcctg 

aaaattgtcatttggccaatgag^taaaaccttctgacccacc^ 

attcttttg attcag catcaaataag aatttcagccaatgtttggaatccaagct agaaaaca 

gto^tggaaaatgttactgctgcttccactctgctcagtcaagcaaaa^ 

gagaataaatttccaggttcagctccccaacagcatagtattctcagtaacx^gacatctaaa 

agcagtgataacagggagacacc^cgaaatcattctttgcctaagtgtaattcccatttgga 

g at aacaattccaaaggacttgaaactaaaagaagcagag aaaactgatgaaaaacagttgat 

tatagatgcaggacaaaaaag^tttggagcagtttcttgtaatgtttgtggaatgctgtatac 

agcttcaaatccagaagatgaaacacagcatctgcttttccacaaccagtttataagtgctgt 

TAAATATGTGGGCTGGAAGAAAGAAAGAATTCTGGCTGAATACCCTGATGGCAGGATAATAA 

TG G TTCTTCCTGAAGACCCAAAGT ATG CCCTG AAAAAGGTTG A CG AGATT AGAG AGATGGTT 

GACAATGATTTAGGTTTTCAACAGGCTCCACTAATGTGCTATTCCAGAA 

TTCATTTCCAATGAGAAAAAAGTAGTTGGCTGCCTAATTGCGGAACATATCCAATGGGGCTA 

C AGAGTT AT AGAAGAG AAACTTCCAG TT ATCAGGTCAGAAGAAGAAAAAG TCAGATTTGAAA 

GGCAAAAAGCCTGGTGCTGCTCAACATTACCAGAGCXn"GCAATCTGCGGGATCAGTCGAAT 

ATGGGTATTCAGCATG^TGCGTCGGAAGAAAATrGCTTCTCGCATGATTGAATGCCTAAGGA 

GTAACTTT ATATATGGCTCATATTTGAGC^AAGAAGAAATTGCTTTCTCAGATCCCACTCCTG 

ATGGAAAGCTGTTTGCAACAC^GTACTGTGGC^CTGGTCAATTTCTGGTATATAATTTTATTA 

ATGGACAGAATAGCACGTAAAACAAATTCTrGCCTACACCACTAGAAGACATCT ATTGAAGA 

GAATGGATTGGTTGCT GACTTTAACCAG GAACTAG GGCCATTTTTATTACAATGAACTCAGG a 

CT GGCAACAACC ATATGGTTGTTCCATTTTCATAAAATTGGAAACAATGCAGTAATAGCTTAT 

TGTTTTGj ? I F 1 1 AAAGAAGAT ATTTTATTATCTTTTACAGAAATTTATGATTGATGTATTTTATC 

TATAGTTATTTAGACATGTTTACATGCAGCAGATAATTGTTCATAGTGGACTGAAAACTAATG 

CAAGGACTATGGTCTCAGTGATAAGTATATTTTGAAGTTCTTAATATGGAAATATACCAGTGT 

AGCTTGGTACTGTAJ MM 1 1 ATATTGATCTGCTGATACCAGTGATAGGCTTAAAGATTGTATT 

TTCACAGAGTGGAAACCAAI M I M I AGTTATTGTTCAAGGAGGGTGCAATATTAAGTGTTTT 

GGAATTTGAAGCTAATTTTTAAAAGGCCTGAACTATACTTTGAAGAAACCCCT 

AAAGCTCCAGCTAAATAGGAAGAATTAGAATATTGAGC I I I M I I I CC I GATTTTTCTCTTTCC 

TATCTTTGATGGAAGGAGGAAGTAGAAAGTGGTAAAGAATTGAGGCTTTCCTTCrr^^ 

CT GT AAATGACAAG CATT AG GAAAGGT ACCXTTCCTAGATTCATT ATTCTTT CATTCTGGTTT C 

ACTTTT AAAATAAATGGCAACTTGGCACACCTAGGCTGTTAACAAATCTCAAAGAGGTTTATA 

AAAACX3T AT AGAAT ACTTGGAAGCAAAGTATGGATGACTCGGTATCTGCTTTGTTATTCCT 

GAAATACTGCACTGAGTATATGCXXTCATTACTGGACTTCATTTTGATACTTGTCTA 

TAGTGCCCTCTACnTTAAAGGGTTTATATGTTGAAAAACTGCTGTGGCCTTTTA 

ATAT AATGT AGAATAAAAAT AAT AAAAT ACTTGAT AGCTTTTT CTAAGT GACCAATGTACTACT 

GAGGTGTAANNNNNNNNNNNNMN 

>412 

NAT«5AACCCCCCATCCCCCATCGACCXXXXrm 
GTTGGTACCt^GGAATTCG^jCCATTATGG<XXjGGTGGAAAACTGAGT ATCACGTATTTAAAC 
An 1 1 1 1 nui 1 1 1 AGCCATGTAGAAACTCTAAATTAAG<X^AATATTCTCATTTGAGAATGAGG 
ATGTCTCAGCTGAGAAACGTTTTAAATTCTCTTTATTCATAATGTT CTTT GAAGGGTTTAAAAC 
aagatgttgataaatctaag CTGATGAGTTTGCT<>AAACAGGAAGTTGAAATTGTTGAGACA 
GGAATGGAAAATATAATTAATTGATACCTATGAGGATTTGGAGGCTTGGCATTTTAA 
GATAATACCCTGGTAATTCTC^TGAAAAATAGACTGGGATAACTTTT GAT AAAAGACTAATTC 
CT GGCCACTTT GTTCCTGTCTTTAATATCTAAATACTTACTGAGGTCCTCCATCTTCT 
TGAATTTTCATTT ATT AAGCAAATGTCATATTACCTT GAAATTCAGAAGAGAAGAACCAT ATAC 

tgtgtccagagtataatgaacctgcagagttgtgcttctt ACTGCT aattctgggagctttca 

CAGT ACTGTCATCATTTGT AAATGGAAATTCrGCTTTTCT GTTTCT GCT CCTTCTGGAGCAGT 
GCTACTCTGTAATTTTCCTGAGGCTTATCACCTCAGTCAI I I CM 1 1 1 1 AAATGTCTGTGACTG 
GCAGTGATTC 1 1 1 1 ICI I AAAAATCTATTAAATTTGATGTCAAATTAGGGAGAAAGATAGTTAC 

tcatctt gggct ctt gtgccaatagcccttgt atgtatgtacttagagttttccaagtatgtt 
CT AAGCAC agaagtttct aaatgggg ccaaaattcag acttg agtatgttctttgaatacctt

aagaagttac^ttagccgggcatggtggoxxstgcctgtagtcccagctacttgagaggc 

tgaggcaggagaatc^cttcaacccaggaggtggaggttacagtgagcagagatcgtgcca 

ct gcact ccagcctgggtgacaagagagacitgtctccaaaaaaaaagttacacctaggtgt 

gaattttggcacaaaggagtgacaaacttatagttaaaagctgaataacttcagtgtggtata 

aaacgtggtttttaggctatgtttgtgattgctgaaaagaattctagt^ 

ctctttcxxcaaattaagtgcctggccagctgtcataaat™ 

aaggttacatgttcaagagtgaaaataagatgttctgtctgaaggctaccatgcccggatct 

gtaaatgaacctgtttattgcttgtatttgctccaacggcttactatagaatgttact^ 

atat cat actt attacaatttttactataggagtgtaataggtctgctgtg aa i m m i i gtcat 

gacttgaaagcaagg^tagagaaacacmagagatatgtggggtgttttttaccattc^aga 

GCTTGTGAGCATAATCATATTGGCCTTTATATCTGATGGTCCTGAACTCCTAAGTGGGCGGC 

ctaccacccaggacccataaatggtgcgttctgcttcttgtaattcatcrctgct aataaatt 
ataagaagcaagga aaatta gg gaaaa tattttatttggatggtttctataaacaagggacta 
t aattctt gtacatt atttttcatctttgctg m icm i gag cagtct aatgtgccacacaatt 
atctaaggtai mgm 1 1 c i ataagaatrgttttaaaagtattcttgttaccag agtagttgtat 
tatatttcaaaacgtaagatgatttttaaaagcctgagtactgacctaagatggaattgtatg 
aactctgctctggagggaggggaggatgtccgtggaagttgtaagactttt atttttttgtg 
ccatcaaatataggtaaaaataattgtgcaattctgctgtttaaacaggaactattggcctcc 
ttggccct aaatggaagggcxx5at attttaagttgatt attttattgt aaattaatccaaccta 
GTTCTTTTT AATTTGGTTGAATGI MM I C 1 1 gttaaatgatgtttaaaaaataaaaactggaag 
TTCTT GGCTT AGT(>AAAAACTTAAAAAAACGAACGCGTGGGT(><N Wh^r^NNNNNN NNN 
NNNNNNNNNNWNNNNNNfWNrW^ 

NNNNWWNGGGGGGGCCGTTAA CI MM I AAAAAAAAAACCTT CCCACCCTTCCCCTGG ACC 

CGGAAAAAAAAAAGAAAGCCATTTTGGGGGGTAAACTGGTTTTTGGGACTTAA^ 

AAAAAAAGGCATTGGCTCCCAAATTTTCAAAAAAAAGGGNNNNNNN 

>413 

NNCGTCCGGTAGGAAGCT AGGGTrAGAGGCAACCCGTGAGGCT AGAACCCCGAAC 
GTGGTCGGTTGGAGAAAATATGTCCCTCCGGAGGCACATTGGGAACCCTGAGrTATCTGATG 
AAAAGGATACCACAGAACCCAAGATACCAGCAT ATCAAATCAAGACT GGACACTGGTGCCT G 
TGTGTATCTTACCAGCTCCCCAGCACTGCCAGACTGTCCCATGjAATGGATTATGCT^ 
GAAATCCC TGGC TG CGTrT TGTTCAGTCACCACCGCAGGGTTCTTGTATATTCT 
CTCTCTG GGTT TCTGTTTTGCAGCACTGCAGGCTGATTAAGGGA^ 
GCTCTCCCTTT ATTAGTCXXX>CAGTCTTTCATTGGCTGCATTGTCCTCCCATTTCT 
TTCAATCTAATTTCCAATTTCTTTAGCTTTCTGAAATTCTGT AGG CTTTCTCCACTCAGTCTTTT 
CT AGCCTT ATTTTTGTCTTTTTAAAAATTCACCTTCCTTTT ATTATCTCT AGACAAACT ATTCAT 
TCCCTCI M I CI I CM I AGGCTMACTTTTCCTGAATATTTGCTAATAGCXrrrGACTGTAAGCA 
TCAAACTCATTATAGACTAGGTACTCTTCATTTTTATTTGTTGTTTAA I ICI G M ICI ICCGTCA 
AAGGAAGAAATATATGT GAAAGGCTTTTATAAACCAGAAAG CACTAT ACACACATATCACTTA 
TTGAGGTTAACAAAGGACCACTGTGCCTCTTTTCA^ 

CTTCTCCATAGAGGAGGGAAATATTGAAGAAAAATAAAGCGATGGAGTTGT CT CTT AGCAAC 

TGAGTT(XJTGCCTTATAAATCTCATCTACTTT AGATT ACTGCTGCACAAAATCACTTGCTTTTC 

TTATTGGCTCAGTATAAATAATGTTATTAAATTATAGGAAAACAATGGAAAAAATCT AAAAAAA 

AAAAAAAAAAACTCGAGACTAGCNN 

>414 

>415

>416 

NNNr#«^NNNNNCGANAAATCTTCTA^ 
GTT ACT GGCTGCGGAGGACCGTGGGCAGCCAGGGTCGGTGAAGGATCXCAAAATGGCTGG 
GCGAAAACTTGCTCT AAAAACCATTGACTGGGTAGCTTTT GCAGAGATCAT ACCCCAGAACC 
AAAAGGCCATTGCTAGTTCCCTGAAATCCTGGAATGAGACCCT CACCTCCAGGTTGGCTGCT 
TTACCTGAGAATOCACCAGCT ATCGACTGGGCTT ACTACAAGGKXAATGTGG£CAAGGCT g 
GCTT GGTGGATGACTTTGAGAAGAAGTTTAATGCGCT GAAGGTTCCCGTGCCAGAGGATAAA 
TATACTGOCCAGGTGGATGCCGAAGAAAAAGAAGATGTGAAATCTTGTGCT 
CTCTCTCAAAGGCCAGGAnGTAGAATATGAGAAAGAGATGGAGAAGATGAAGAACT^ 
CCATTTGATCAGATGACCATTGAGGACTTGAATGAAGCTTTCCCAGAAAC^ 
GAAAAAGTATCCCTATTGGCCTCACCAACCAATTGAGAATTTATAAAATTGAGTCCAGGAGGA

AGCTCTGGCCCTTGT ATTACACATTCTG GAGATT AAAAAT AAT AATT AT ACAAAAAAAAAAAAA 

AAAAAAAAAAAAAACAAACAGCACGCACAGAAAAAGACGCCAACAAGGCGAACAAAACCGC 

CCGCAAGCCOCAGCGGAGAAACGCGAGACCaCCAGCGGCACAGACACCAGAGGACCAGCG 

GACGCGAGCCAGATCACCAGCCGGGGGCAGAGACGOVCCGAAC^ 

ACGCCGAAAACACGACGGCGACCACCGCGACGAACCAG^ 

CAACAGCACCCAGaGCCCACAAACCGCANN 

>417 

>418

GAGTOGACOCACGCGTOCQOcataGCCGaacaogTGGATAAAGCTGTGTCCAAAACcagc 
AGGGATGGAGCAGAGCAACAGGGACCTGAAGCGACTGTAGAGGAAGCTGAAGCTGCGGCT 
TTCGGCT CAGAAAAGCCTCAGAGCATGTTTG AGCCACCTCAGGTGTCTT CTCCTGTTCAAGA 
GAAAAGGGATGTATTACCAAAGATTCTGCCTGC7GAAGACAGGGCGCTCAGGGAAAGGGGG 
CCCCCCCAGCCACTGCCAGCTGTGCAGCCGAGTGGCCCGATTAACATGGAGGAGACCAGG 
CGCGAAGGAAGCT ATTTCAGCAAGT ACTCG GAGGCAGCTGAGCTGAGAAGCACAGCCTCCC 
TCCTGGCCACTCAAGAATCTGACGTGATGGTTGGGCCTTTCAAGCTGAGGTCCAGGAAACA 
GCGGACTTTGTOCATGATTGAGGAAGAGATCCGAG CAGCTCAG gaaag g GAAGAGGAG ct 
GAAGAGGCAGAGACAAGTCTTGCAGAGTACGCAGAGCCCCAGGACAAAGAATGCCCCATCA 
CTG CCCTCCAGAACATGCT ACAAAACTG CTCCAGGGAAAAT AGAGAAAGTCAAACCTCC7CC 
ATCCCCCACCACTG AAGG CCCCAGCTT G CAGCCTG ACTT AGCCCCTGAAGAGG CTGOCGGA 
ACCCAGCGGCCCAAGAATCTGATGCAGACCCTCATGGAAGACTATGAGACACACAAATCT AA 
AAGGCGCGAGAGAATGGATGATAGTAGTGTCCTCGAGGCCACACGGGTTAATCGAAGAAAG 
AGCGCACrGGCTTTGCGCTGGGAAGCAGGGATCTATGCCAACCAGGAGGAAGAAGACAAC 
GAATAAACTTCCTTCAACCCAGGAAGCGTCTTTGGTGCTTGGGAGACCAAGAAACCAAGAAA 
TTAACAACTGAAAGCATTTTAATGGACTATTTATTAAAGTGCAAACAAACTCAGCAATCCTTAT 
GT AGAOCAGAAGCTGCAAT AT ACAAT G ATGAAAATGAAACGAAAAGGAAGTCCCCCCATCAG 
ACTCT GGACACCCAGG AATCAAAAGAGAAAGGAC 1 1 1 1 1 1 UGTGACTCAATCCCAGTGTCTG 
GTTTGGGGCCAATCT ACAATGGATCCAAAAG GACAAAACATGTT ACAAG CAAATCAACATAG 

CACTTAGAGAGCAGTAGGGCCCAGGCTGGTGCAGCCTACATGGGACAGGATATTTCCCTGC 
ACCAGT7TCCCACCTGTCACTTGCAGCATCAGAGGGTGTTGCATTAACTCCATTTTCTCCAG 
CAGCCATAAAATGAATGGCGTTTCTCTCCATATTTGCCCAGAAGAGGCAGTTATGATTCCACT 
TCTCCTACATTTAATTATrmAATTGAGAACAACTCATGATCACAATTCATAAAA 
TAAGACAATAGTTCTTCATGTCACAAACACAATCTATTCTACATGAGGAGGCTGAGAAAGGG 
GAAGAGAAAAACAAACCAAATGGATGATGTTAGCTTTAGGATCGTGTCTGCTT AT AATATTCA 
CTGATCTGCAGTTTATGAGAACACATTTTCAGAGTTATGTTTCTTCATAGGA 
GTGGGCAAGAACT A7TTTTATGATGGGATGGGGGAGT AT AT ACACGT ATAGAATCTGT ACGC 
ATTGAACAACTTGGTTCAAGATGGTGGGGGCATTTTTAGAGCGGCAATAATTGAAAAA 
GCGAACTCTGCCTTGGAGAGGTAGATGATAAGAAATAAAAAGGTGTTTATAACTATTTTGTAT 
TATAAAGTGGGCCTTAGAGATAGGAAGAAGAATGATGGATTCCTTTTGGATCAATCAGAAAG 
GAAACACGAAAGAAAAGTCAGGAAGGT AGAGAGAGAAAAAGGGAGGGAAGGAGAAAGAAT 
GGGAATAAAAT AAGGAGGT AAGAGAT ACTATTTTTG CT GAGCAACCAGT GTGTTTCAGGATG 
ATACAAAQAAAAATATAGAATAGAAATAAGTGGAGGCTTGGAATCAGCTACAAATCCT AAAGA 
TGGGGTGTGTGTGGATGTGTGTGTGTGTGTGTGTACAGCATTGT GTGTTTGT AAAATGTGTA . 
TGTTCATGAGTAAGGGTGTGTGTGTGTGTGT ATTAAAATTDCAGAGTGACCGTGGCACTTGG 
GTGTACAGGTAATTCCTCCAGAGCT GTTTGCTGG CTTCAGGAGTGG AGTGAG aatttctttt 
TTATGAAAAGGGAT AT AAAGGCACCGAGCTGATGCAGTATTTGTAATATTAAGTTGACCT AAC 
AAGGTATTTGCATGAGTCACAATTACAAAGTTTTGAGCGGTTTTGTAAT^ 
A GTCTCCT ATTTATTCTCATACTTTACATTCATGCTTAGT ATACT ATAGAGGATGOCAGCTTTA 
AICI I I C I G 1 GATTTAAAGCAATATGATAAGGGTATTCAATAATTGGGTGCCCTAAATTTCTGG 
ATGAGAAAATTTTCAATT CTGGCCATGA GAAAG AAAAAAAAATAAACAGO G I III 1 1 1 1 Ml I C 
CTTTQTTI I A AAACTGTGGTTTTTT AAAAAAGGAATAATTAACTCAGAQCTCACT AAAAATCAT. 
nTTGm I J ATATTGTTATGTCATAAGCTCTATTATGTTATTCTAACAAGTAGCAATTTCAC^ 
AATTTGTATGTAGATGTTAACGCACATTTCCTTT GC 1 1 CI 1 1 J ATT AGACT AGTGTT GACTTTG 
GGGGGGGACATTTATTCAGAAATGAGAAGTAGGCACAAAGTAAAAAATGGAAC^ 
ACAAGGATCCTTTAAAACTGCCAAGGGAGCTCT AACTTGAAGCCACATCCT ACAGATGGCAG . 
CCCAAATAGCACATGGGCAATTGGCACCATCTTTATATGGTTGAGTCTCGTGAATATTTTGAA
TGAATTCTCAACAAAATGTGCTAGCCACTGGGGACGCAAAACAAGTAAGATCCCTGTTGCAA 
GAAATTC^TTTTATAGTGAGGGAGGTTGGCATGGAGACTAAAATTCTCAGGAAAATGAGATC 

cgtgttagattagaagtcctgatgtgaaatgggaggactcaggaaggaggatcgtctttacc 

tgaggatttctagccagaggtcccagatgcctgggctgagaacccagcgataagggggcg 

ttcccaaagcagacacagggataagaacagaggaggcagcagcattgcacagccccaggc 

acagtggcagttaggatggctggagagtaggatagttctatgggttgcccaaaaaatgtgat 

gcgcttcatgttttctctgactcatggatctggtagagaccatagacatgatatagactaact 

TCCCCATI 1 1 1 cacaagaggaaaccatccttatgacttaccttaaagttttttgttctgttttg 

aaggaaacx^tgtgcttcatgaaacctacagttgacaagagaatgtacagctaagagaaaag 

cttaagaggccacactattcgcggaatggctttagaggcagatgaagto 

gttgattgaaccagagcacttattgcttaaagaataacagagttctagagctgggggttctt 

gggccatgctccgtgtgtggataaggaaagaaatactgtttctgggactctcccagagtca 

caaagctgttttcactgtggcccctacatctcttaacttttgct atgctgcctt 

ccggatt actgctgt ct atcttcttgct ccactcactgaagat cct ATT at aatcccatgaaa 

atgtaaattacagtttacttgggagagccaga i 1 1 icier gtgctcttgagttttttattcatt 

caagaaacxttgggccaccgctttgtacatagcaccgtgct 

gaccci 1 1 r aactttctgaagatgggaccgtc^cctggaggaaagtcattcctgcct 

atcgjvgagaaagagg^ttacgaaaaactttgccrrctgatgcrrcagccc^ 

acacaagctrgttaacgccacctcttacaaaatgftttagattct^ 

ct ggaagtattgcattctgccgtgtttataggtgttcactttcctccagagctga 

tgacatgacttggcrrttctcatcxagaaattatggaaacagggtctgtcagtggcaggaggc 

cgtgctgtgttrtacttggatgacacaatgcagtttacttgcctc^ 

gctcaccctagacaatgacatataagccgtatatagatcaatgtccacatatatatacacaca 

cacatatatatata taaagtgta acaaggaacactaaaacagtgttgattcttgtctctgaag 

acaaataattaaacc1 1 1 1 1 1 1 1 cccaact aaagaatggatttaattaaact atqt attgaaaaa 

a aagta gcctaagtgttagagatggtgaatatattccattttagtt aaagaacaaatttcctg 

aatt r raagcattcagtg agctgccaatttrgattttgtgttgctcti^ 

ctttgi h 1 1 ci 1 1 1 1 1 t gggggaggaggggaaaaaagcagcaatactgtgtttggaaattat 

actctgtatctggttttcctgtgtatgttaaccactraaatgttatta 

gagtgattgtgaggcattcaatggaagtatacagttattrtctcattaaaatccaatgtgtgtc 

>419 

KN^OVNNNNNNNCCGAGGTAC>GTATATTGACCTTAAAAATCAGTAAAGCAGTCATG^ 

ACCTTAGACCAGATGGACAAGCCAGATACTGCAGAGAA G I 1 I C I G GGCTTTTOGGGGAAGA 
CTCTAGA TTCAA TTCTGTAAAGTTATGATGCAQ I 1 1 ICICC I I C CTCT C CTC1CACCTCCTCTG 
AGCACAGCTTTCAACAAAAACTTTGCATACCCCG 

T ACTT CTCTGAGCATTGGCCTCTGGCTGGGATTATGCTTCAACAG TCTTGAAATGAGGTCCC 
TGGCTCCCTCTGTTACAAAGTCAGGGAATGTGAATTCAACCCGTGATATTC^ GT AGGTCT 
CnTGGTATGTGTTTGCCTCAAAAGGAGGCTTCCCAACT 

GGCTCCAGAGATCCACCTTCTCATCATGCATGCGACCTTCAATCAT^ . 

>420 

>421

N^ICGTCCGCCCGTCTGA AGCC GGCTGTCGGTCTCCGTGTCGCOGCCGCC<»CCCG 
GCATCGTGGAGCTGGGGCCXSCtrrTTTGCCTGGG^GTTTTGT AGTCGCCT AGGGTCAGCGGT 
GACATCCCAAAG<5GGAGGCCCGGCAGCCGCCATGGTGGCCAAGGATTACC^ 
CACGGTGAAGAGAGCGAACTGGAGCCT GGAGCT ACCT CCCGCCAGCGGTCCGGCCAAGGA 
CGCTGAGGAGCCT AGTAATAAACGGGTCAAACCCCTTTCCCGAGTCACGTCGCT AGCAAAC 
CTCATCCCGCCCGTGAAGGCCACGCCATTAAAGCGCTTCAC^ 
TTAGCTTtXXSCAGTGAGAGCCGCCXTTGACATCCTCGCCCCCCGA 
CGCCCCCTCGAGCACGAAACGGAGAGATAGCAAGCTGTGGAGTGA^ 
CGTCMTCAGATGCTTACATCCAAGGAAATC^ 

AAGGAGAAGAAGACTTGATAGAAGACTTGAAATTAGCAAAAAAGGCCTATCATG 
CTGAAACTCTOCATAATGACAGAACAAGAGTTGAATCAAATTTTTGGAACACTGGACTC^ 
ATTCCTCTACATGAAGAGCTCCTTAGTCAGCTTCGAGATGTTAGGAAGCCTGATGGCTCGAC
TGAACATGTTGGTCOCATCCTCGTGGGCTGGCTOCCTTGCCTCAGCTCCTATGATAGCTACT 
GCAGCAATCAAGTAGCCGCCAAAGCTCTGCrrGGACCACAAAAAGCAAGATCACCGAGTC 
GGATTrCCTACAGCGATGTTTAGAATCCCCCTrTAGCCGCAAACTAGATCTCT GGAATTTCCT 
CGATATTCCAAGAAGCGGCCTGGTAAAATACCCTCTGCrrCTCCGAGAAATCTTGAGGC 

cacxaaatgataatocagatcagcagcacttggaagaagct AT AAAT ATCATTCAGGGAATT 
gtgggagaaatcaacaccaagactggtgaatctgaatg cggctattat aaagagcggcttct 
tt acttggaagaaggccagaaagactccctrgatcgacagctctcgagtcttgtgttgtcat^ 
gtgaactqaagaacaatcggggcgtgaaactgcatg i i i i cc i gttccaagaagtgcttgtg 
atcactcgagccgtcacccacaatgagcagctttgctaccagctgtaccgtcagccaatcc 

CCGTGAAAGACCTCCTG CTG GAAGACCTCCAGGATGGaGAAGTGAGGCT GGGTGGCTCCC 
TGCGAGGGGCATTCAGCAACAATGAGAGAATTAAAAACTT cttcagagtcagtttcaaaaat 
ggatoccaaagtcagacccactogctacaagccaatgacactttcaacaaac^ 

TAACTGTATTCXrrCAAGCCAAAGAAACAGTTTTGTG TGCTGCCGGG CAAGCTGGGGTGCTTG 

ACT CCGAGGGATCGTTCCT AAAT CCCACCACCGGGAG CAG AGAG CT ACAGGGAG AAACAAA 

ACTJGAGCAGATGGACCAATCGGACAGTGAGTCAGACTGTAGTATGGACA 

AGCCTCGACTGTGAGCGCATGGAACAGACAGACTCTTCCTGTGGAAAGAGCA^ 

AAAGTAACGTCTGACAGAAGCATGTGGACTTCGGGAAGCAGGCCTGCATCTT ACCTGTACA 

GTATTrGCATTCCACAGATGGAACGGTTTGGAGAAGCACTTTTTCATACTTTrGTGAA 

ACATGTTGGCCCAGTCTCTCGTATCTGTACCTTTGTCCCTAGTACTGTAACTG 

TGTGTAAGCTGGAATCTGTGGCAACTATTACCCTGTGTTGTATTTCCCAAGTGTCTGGATGG 

ATGGAGAGGTACTCAAACAAGTTACTTTCAGTTGTCCTGCTGGATTTTAAAAAAATAGAAA^ 

GAATCTCAAAACTACTGTTTT ACATAGATTGTTTGAAGAGTCCTTCCTCTTGTrGCnTCTGTACC 

ACTTTCCCAGCTCTTAGATGTGGTAGCTAAAGGCACGG^TTr 

GCATGAGGAACTCATCTGTOTATTGGGATGGTATTAGAGAGAGAATCAGGAAAGACCAACTC 
ATGAAGTG AACTT GGTTTGATCTTACTCAACT AGAAAGCTTGAAAACATCCCTGGGGATTCTG 
AAGGCTTAATTTTGCAAAGGAGGATGCATTGnrCTGAACTTTGCAACTTGAT 
TGATGCAAGAATGTATTAGGACATAAAATAGAGGCTGAGCTTAAAAGGGCOKGGAC 
GGCTGCCAGCTCT G^TCTTTAACTGAAATGCACATGGCACCAGGAGGTGTCTCTCATAGTT 
GGTTGCTAGCCT AAAACATCAGAATAGAACCCAAAGGGCTTAGGAAGGCCTGCCAGGAT AA 
CAAGAAGGCCCTGTATTCATTGTGTTTCATCTGCCTAGGCCTACTCAT^ 
ATGAAGCAACAAGGAAGAGAGACCATGACTCTATCGATGACACTGT™ 

gaggaagaatttggaatgaaaagcactt(xn"okgaacctr ct gtgggagccattgagagaaa 
agcatggtccagtgccttctgagaaaggccagagctttgggcttt cctg<tt ctgcttttgg 
gtc gtcaa tttgccatctctggttctgtgct ataatcagaattgt aattatgttctccagagg 
ccaatttgattaactctgattaattagaatcagct agccagattagtaacct ctttgtccagc 

CTTGATTTACAGTGCAGGGTAAAGTGCAGACCTTAAAAACAGCTAAGT ACCTAGAAGAGCTC 
CCTGCAAGTGTAAATATTAAGGATGACCTGTGCAAAATTATAGCCACACCAGCACT agtggt 
AATTATTCTAAATTATTGCCAAAAA G 1 1 1 1 1 1 1 1 AATCT GTCTTT CAAGTTTACAGAAAAGAAAG 
CAGTAAATGCATTGATGTCATTTTATTATGTAC*TATATCA 

GATAT ATGAATATAAAAACAAGGT ATATACTTTATTATTTTTTGAAAACAAGGATATTGTGATCA 
ATTTTAOCCTGTAAAACATATTTCTGTATTTATAGGTCrrrAAACATGATGAA I 1 11 ! I C I A TT AC 
AAGTTT ATTTAAAACT GCTTTCT CAAGTCGTTATTGATACAGGAAGT GAAOCTGCTGCAGACA 
g AAGCAG>GGAAAGCCAAGAACAGCCTTTATTGGTGAAGAAAAGAATGAATG^ 

ggcg<xlatcagccacttrtagaagccatcagccagtgtg ttgggaaaagaggtttgtcaag 
tgttggcctatgggaaggtggtcaatgaatgttttgatgaaatgaatgtttttct 

CTT AAACT 1 1 1 U I GGAAGT ATTTCAAAT AAATT ACATT ATTAAGT CATCAAAAAAAAAAAANMN 

MNNNNNNNNNNNMNNNNNNNNM4 

>422 

NNNNNNNGCANCNHITNGAAGTGTCXXCT^ 
GTTGCAGACOCAGGAGTCTCrGAAGGCTrCCGCGGC^ M GI GAGATGCCA 

CQt>GAGGCCGCTGCCCGGACTGOGGCTCCACGGAGCTGGT GGAAGACTCGCACT ATTCG 
CAGAGCCAGCTGGTGTGCTCCGACTGCGGCTGOrrGGTCACCGA 
AOCTTCAGOGACGAGGGCAATCTCXSAGAGCT 

^^^^l^l^COCAGCCAGCAACGAGCTCTCCGCCGWTGM»AXiA£Xti 1 I G ICGAGTTCT 
GCAGTTGCCACCAACATTTGAGGAT ACCGCGG TTGCCT ACT ACCAACAGGCATATCGGCAC 
TCACCTGGCGAC AGCAT AACTGGCCCCTAACAATGGGGGCCATCTGCACGCTGTTGTATGC
AG^TTTGGATGTGTTTTCTAGCACTTACATGCAGATaGTGAAGCTCCTGGGACTGGATGTGC 

catctctgtgcttggcagaactggtgaagacctattgcagcagcttcaaactgtt^^ 

TCACCT7 ci g i gccagcx^aatacgtggaagacaaagagaagatgctgtctcgaacaatgca 

gttggtggag ctggc aaatgagacgtggctggtgaccgggaggcatcccttgc 

cactggtgcgactttccrrggcttggcagtcg ctgcagcct ggagatcxsgctttcatgttcc 

cttgcccgattttgtaaattggcaaatgtggacctgccct acgcggcgtcctcc gc 

aggagcrgctggctgtgctgctgojgatggctgagcagctggcctggttacgagttctg 

g^cttgacaaac ggtc tgtggtgaagcacatcggtgaccttctccagcaccgocagtcact 

ggtccgctctgcctttcgggatgggacagcagaagtggagacccgagagaaggagccacc 

ggggtggggacaggggcaaggagaaggggaggtgggaaataattccttaggtttacccca 

ggggaagcggccggccagtcctgcccttctcttgccaccctgcatgt^ 

GCGGATCTTGCCCTGTACCCCCTGTCrrcCACrrGTCACTGGAG^TGA 

AAATAGAACAGTATTTGCGTACCCCTCAGGAAGTTAGGGACTTTCAGAGAGCXXAG GCTGCT 
AGACAGGCTGCCACG^GTGTCXXrTAACCCT COCT GXTGGJVTATCCACTGGGAGCACTTCAT 
CCT GTTCTG ACAGC TTGATAACATTCCrrGTTATAACCAAGGATGGA 

TAGGTATTGCI 1 1 ICI I GTTTGAAGGAAOCAAGAGGGGCTCTGCCATTAGTTGGACCCTGGG 

TCCTGGAGT AAAGTCAGGAGTGCAGGGATGACTATAGGTAGGAGAGATTCCCATCCCTTGG 

TGTGGGAGAGC AAGTTGC CTATGTCCATGTTCTGTGAGATGGCTTTCCTCATAGATGGATGG 

GAAAATGTCAGGCTCTTT GCTGCTGGTTTGAATTGGACACACTGCTGCGGCTCCTCCTGC^ 

GCCTGAGGGGGCTTCCCTCTGCTTGTGGAGTX^TTGGCATTCCCAGCAGT ATCAACCCTCA 

GAGGAGCX3GGAACTGGGGAATTCTGGCCCTACGTGCATTCACAGGGAATGATGGGTTTGTG 

TGT ATGGTGTCATGAGATCCTCrr ACCTCAT AACAAAAGGACAGTGGGT AGACTAAGGCAGTA 

GCTCAAAGGGCTTTGCAAAATTTTAATATATT AAAACAAGAGGCATCTGCT AGAAAACATTCT 

ATTGTAT AAAAC CCGAGTACCCTATAAGGTCCTGGATAAI IMIUU I GATTATTCATTGAAGA 

AACATTTATTTTCCAATTGTGTGAAGTTTTTGAfrTGTTAATA^ 

AANNNKNNNNNNNNNMNNNNN 

>423 

>424



... — .... . _ „ „ . . „ -^jcTToa 

calCCAGCAGGATGGAGGACTTGGCCACCAGTGCAGGCTTCTTogi » a lit toogcGTACTGCCGT 

AGCCGCTCCTCCCGCAGCTGTGCCGCCTCCTTGTOCI CCTCC 1 CATTGTCACTGCCAAACA 

GGTCAATGTCATCATCCTCGTCATCCTCTGCTGGTGTGGCTGGCTTCC^^ 

TGG GCTAO GGTATCCOGAAGCTACAGATTCAGTGTGTGGTGGAGGACG 

CAGACTTGCTGGAGGAGGAGATCACCAAGTTTGAGGAGCAC^ 

AGCI 1 1 CAACAAG^TCTG^VAGCCTGAGTGTGTGT ACCTG^CCCGGGCGGCCGCTCGA 



^*GGCCGG<XXiCCCCGGCAGGT ACTGAATGTGGGAAATGCCTTTT GCCAGAAACCA 

cacctgaccaaccatc^gcgaacacatacaggagaaaaaocctatgaatgt AAGCAATGTGG 

AAAAACATTCTGTGTGAAGTCAAACCTCACTGAACATCAGAGAA 

CCT ATGAATGT AATGCATGTGGGAAAT<XTTCTGCCACAGATCAGCCCT^ 

AGAAGACACACAGGGGAGAAACC II 1 1 GGATGTAATGAATGT GGGAAAAUC 1 1 OGG I CA GAA 

GTCGGCCCT AATTGTTCAOCAGAGAACTCATATAAGACAGAAAjCCCT ATGGATGT AATCAAT 

GTGGAAAATCATTCTGTGTGAAGTCAAAACTCATTGCACATCATAGAACACACACAGGG 

AAACCCT ATGAATGT AATGGTTGTGGAAAATCA7TCTATGTTAAGTCAAAACTAACN 

>427 

j*OjOOCOCC GC O G CO S AGOGCOGGAGGGCAfiGC*fiQGC 
CCGGGGCGCGGCGCGGCATGTAGCTGCGGGCTCCCGCGT^^ 

gt^GCCC<^CCATGGCTG<^TGGACAGTGGGAAC^ ' 

TACCGOC 



attgaggcactgaacctggggaacaacggcctgga 



p*gt i3i ct au 
. i ! UCTGG1 CC l GCGCA

GGAACCGCTTCGCXX3GGCTGCCCCCGGCGGTGGCCGAGCTCGGCCACCACCTCACCGAG 
CTGGACGTGAGCCACAACCGGCTGACCGCCCTGGGCGCGGAGGTGGTGAGTGCTCTGAG 
GGAGCTGCGGAAGCTCAACCTCAGKX^CAAOCAGCTGCCCGCCCTGCCCGCCCAGCTGGG 

cgctctcgctcacctggaggagctggatgtcagctttaaccggctggcgcacctgcctgac 

TCCCT CTOCTGCCTCTCCCGCCTGCGCACCCTGG ACGTGG ATCACAACC AGCTCACTGCCT 

tcccccggcagctgctgcagctggtggccctggaggagctggacgtgtccagcaaccggc 
tg(^gggcctgcctgaggatatcagtgccctgcgtgccctcaagatcctctggctgagtg 
gggcxxlagcttggcacgctgcccgccggcttctrgggagctgggcagtttg gagag cctca 

TGCT AG AGAACAACGGGCTGCAGGCTCTGCCCGCCCAGTTCAGCT GCCTGGAGCGGCTCA 
AAATGCTCAACCTGTOCTCCAACCTCTTCGAGGAGTTCCCTGCCGCGCTGCTGCCCCTGGC 
TGGTCTGGAGGAGCTCTACCTTAGTCGCAACCAGCrCACCTCX3GTGCX>TCCCTTATCTCG 
GGCCTGGGCCGGCTTCTCACCTTGTGGCTGGATAATAACCGCATCCGCTACCTGCCGGACT 

ccatcgtggagctgaccggcctggaggagctcgtgctgcaggggaaccagatcgcggtgc 
tgcxc^ccactttggccagctctcxxxjg^ 

gatccagccccctrracgaggtctgcatgaaggggatcccctacatcgcagcctaccagaag 
gaactggctcattcccagccggcggtgcagccccggctcaagctgctcctgatggggcat 
aaggctgcaggaaagactttgctgcgcgactgcctcaccgaggagagagtggagggatgc 
ccaggaggaggggacaaggagaagtgct acccac^cgtcacctccccctgtgagcaagggc 
atcgaggtgaccagctggacggccgatgcctcccggggcctgcggttcatcgtgtatgac 
ttagctggggatgaaagttatgaggtgatccagcccttcttcctgtccccaggggccct ATA 

CGTGCTGGTGGTCAACrrTGGCCACCrrATGAGCCTa3CCACTTTCCTACCACCG 
TTCTTGCATCGGGTCGGGGCGAGAGTGCCCAACGCGGTGGTGTGCATCGTGGGCACCCAC 

gcagacctgtgcggagagcgtgagctggaggagaaatgtctggacattcaccgccagatc 
gccctgcaggagaagcacgacgcggagggactgagccgcttggccaaggtggtggacga 
ggcactggcccgggacttcgagct gggctctgccagcccccacgcagcctactatggcgt 
ttcggacaagaaccttcgacggcgcaaggcccattttcaatacctgctcaagc g 

gggacaagttgctgtcagttgctgagcaccgagagatcttccccaacttacacagagtact 
gcctcgatcctggcaggtgctggaggaactgcatttccagccacctcaggcccagcgact 
gtggctaagctggtgggactcggcgcgcttgggcctgcaggcgggtctgaccgaggacc 
gactgcagagtgcgctctcctacgtgcatgagagcggcaagctactctactttgaggacag 
tccggctctcaaggagcacgtcttccacaacctcacccgcctcatcgacatcctcaatgtct 
tcttccagagggatccctctttgctgctgcataagct gctcctagggaccagtggagaggg 

tcggggccacccagctccatcagtatgtggagggctttctgttgcatgggctcttgc^ 
tcatgtcattoggttgctgcttaagcctcatgtcgaggcccagcaggacttgcagctgt^ 
tggagctgctggagaagatgggactctgttactgcctcaataaacccaaggg 
gaatgggtccacagcttggt acaagttcccatgctatgtgcagaacgaggtgccccatgca 
GAAGCCTGGATTAATGGGACCAACCTAGCTGGGCAGTC 1 1 J igtggctgagcagttgcaga 
TTGAATAT AGCTTTGCTTTTA C 1 1 1 1 CCACCTGGGTT GTTT GCACG CTACAGTGTCCAQATCA 

cctgtggttgtgagttacagacctgccaggggagt cctgcagccagacaccctgtccattg 

CTAGCCATGCATCATTACCAAATATATGGACCGCATGGCAAGCCATAACCGCCTTGGTGGAG 

GAACTGAATGTCCTACTTCAGGAATGGCCTGGACTGCACTACACCX3TGCACA7TCTCT 

TAAGTGCCTTAAGAGAGGATOGCCCAATCCACATGCTTTTCCAGGGGAGTTGCTGAGTCAG 

CCCAGACCGGAAGGAGTGGCAGAGATCATTTGCCCCAAGAACGGCAGCGAGCGAGTAAAT 

GTTGC C I red 1 1 AGGCACGCACGCCGACTGTGATCAGCCCCTGTTCCAAGAAQAATGTTGG 

TGAAAAGCAGAQAAACCAGGAAAAGATAGTG C1 1 UN 1 1 CCTTATGCCATCATTTTTACTGTA 

CCCAAACTGTGACTTTAGGAGCACCAATTTTTAAGGGAAGAA 

ACCAGTGACGTTTGTGGCTGTGGAATTTCGATGGAG^AAAGAGAGCATCTGAACACCTGGA 
CCATC) 1 1 I GCACCTGGCAGACCCTCT GCACTCACCXXAGCGTGTTCTGTGAACTTGAGTGA 
CAACGCGTGCTTGCAGGGTGC 1 1 IJ I GGATGACTGGGGAAGAGGTGGGGAGGGGGTGGTG 
. GGGGGAAGC^TGGACGAGAACATGGAGCAAATGTTTTACA^ 
TCCTCCAAGGAGCGCGCT ACTTGAAG AAAAGAAAAAAAAAAAAAAAAGTCGAATGGC II C I C 
AGGGATTTTGTTTTCCGTGCACATATAAGCCATAGTTACAGTACAGGTGGCAGTATTTAGAG 
CACTCAGA i i i CAG 1 1 C i G 1 1 AAAT6 1 GAAAGAGGGA7CGAC » GACG i i CATTGui G i i CCA7
AACTAGTGTAAAATATGTA7ATGTT7ATC7TTATTTTTATAATATGCAAATACATTTAAATTTATA 
CAGTTTG AAGCTTCT AGCGTTAGTT CACACT GGGTGCAAT ATTCTGCATGAGAACTGTGCCA 
AGATGGGGCTG A I I IC I l ATTTTAGTATAAGA C I rTTTTGTTTTTTCTI I ATTCTTT AATCCTCT 

tcacattaaaaaaaaaaaatctctctgttagcccatggattaagtgttggttcatagagattg 

ccaataatcagaaagaaccttaaatgtg<>7ttaagacagtgtcccttccui iciill caatg 

aaggtccctggccttattattaaattcaa7tctgggcacgctgtggtgggggaaaactcctt 

ttggctctttcccaaccggtgtttataaagtgcgctggggccaaagatgtgtcgggcacac 

ttcggggccaaaagaaggaccaaaaaggtccattgcaaacatttggagtcttaaaggggtaa 

agatatcagggagcaagtctgaactttacaaagtgctctgtaaataagacgcttacttagca 

gagcactttgcaaacatattacttattagcagagctctttgtagacct^ 

cagatcttaaggttgtgaatttaggctccagttatattcactggagagcataatcccacacg 

GGTTATTTATAAATACAGAGCXITCTGATTGGACGGTCTCCrrGCCAAGAACrrAGTAATACCCTT 
GTTTTAAAATCrrrCACAAGGTAAAACTTAAAAAGCCAACCAAACAAATTGCT 
TTTAATTGGGCCAAACAGCATATGCTACAGTAGTAACATGTTTTTCGGAGAGTGTAAAAAACT 
CTGTTTACATTTGCCTCCTCCGTGGGTTGATCXSAAAATGTA^ 

GCCTCAGACAAGAAGAGTGAGTTGCTGGTACTCGCTACTCTTTTA C I I CI I I I GTAAAGTATT 
GACTCTTGGAAGGCTACAGTATACAAAGTCTCAACATGT7TTTTAAAAGAAATAAGGAGCAAG 
CGACTGCCCTGCTAGAAATCACAAACCGATTTTTGTAGAATATTTTGTGCCCGAGGCATTAAT 
TTCACTGACTCCAGAACCTGCAGTTCAGAGAATGA7TTCTTATGATGATAAAAATCGAATGGG 
ATCAGACGATGTTTGC A I 1 1 1 1 1 1 AAT ACTTGAAT AG G ACAGCT CAAGTTTGAGATTTCATTTT 
C7TTTAGAACACAGTCACAAGATTAATCTGGTGAATCCT7TTGTCACAGTTCTCGTGTGTGTG 
TGCGCGTCTCCGTGTGTGTGTGTGTGCATGTGTGTAAAACTGAATGGTCACATTTAATTGCT 
T7TTGGACCATTGAATAGTTGGGAAGTAAGAATTTT7TAATTGGCATGAGACGGTTCCTCAAC 
TGTTAAATTAACCAACT7TGAGCTGTCTTT AGAAAAAGGCTTA7TTGTATGATTTTGGGCTAAC 
TCCCCGCGGAGCATATTAAATGACAAAAATGCTC^rrTTGGGTGACACACCCT ACAAAGTATT 
TG CTGTTT ACX^AACATAAAC^CXXIAC^TTCTTAATATCT AAT ATTTTTGACGAGTG ATGTTTT AT 
GCTGTCATCTGAACOCTAGAGAAGC^GTGTCAGAGGAAACCTTGGTGTCACATGTGrrCTTAG 
CAAAAGGGTTACCATGATCGAGGGTCATGTGACCAAAAGATGCTCCAGAGAAGCTTGAGAAT 
TTGT7TCAAGTTGGGAGGAGGGTTGGAGATACAAAAATCACTCTGCTCTACAGGACTCTTCA 
GCTGTCT ATGCAAGAAA7TCCGTTTTCTCTTTCAGCACCTGGAAAGACACAGCAGCXXJAGCG 
AGGCGATAGGTGATTCACTAAGCACAAGAGGAATGTTTTCTAAGCAAGGCGTCCCTTGCCTC 
TCAAACAAATGCCCTCCAAGTTTGTTAGGGTTTCTA TTCCT 

TGACAGTGCTCATATTTGACACTTGTGTATTGGACTCTCTTTTGAATGAATAAAAAGGAAA^ 

GGGTTTGGTGTAATTCCTGATGGGGTGCGTGTT GI I » 1 1 CATGCCATGGTTTGTGAATTTTAA 

TTGTGGTTTCCCATTTCGTTGTTGT AACTGGGCAGAAATTAAAAAAGAAAAATCAAT AAAAATA 

CAAAGAAATGGTTAAANNNNNNfWNNNNNNNNNNNNNNNNNNNNN 

>428 

>429 

NNCGCCGCCGACGCGGGGGGAGGCGTCGGCCACGTTCAGCGGACACGGGAGCAA 

gatggcgatt(xx3ggcaggcagtatgggcttattttg(x^aagaaaacacagcagttgcac 
cgtgttttgcaaaaaccatcagtgtttgggaatgattctgatgatgatgatgag^cctctgtg 
agtgaaagccttcagagggaagctgct aag aagcaggccatgaaacagaccaaactggaaa 

TCCAG AAGGCCCTTGCAGAAGATGCT ACTGTGT ATGAAT ATGACAGT ATTT ATGATGAAATG 

cagaaaaaaaaggaggaaaataatcccaaattgcttttggggaaagacagaaagcccaagta 
tattcacaacttgctaaaagcagttgagatcagaaaaaaggaacaggaaaaaagaatggaaa 

acatctgcatataagaaaaaactgcaagagagagctgaagaagaagaaagagaaaagaggg 
ctgctgcactggaagcatgtttggatgtaagcaagcagaaagatctcagtggattttatagg 
cacctattaaatcaagcagttggtgaagaggaagtact aaatgcagtttcgtgaagagagat 
ctgggt attacaggggcggacacatccagggggttctccaatggcagt acgttccaaaaac 
gaattaccccagagcaactgcttgcttccaactggctgtgcacgttgaggaaaacccggat 
ggagcacagtgaagctcgcagctaaagcaggcgattatgcatcacaacaggaacatacagt 
ggaagcgcgcagggcacAgggcctt aglaacacccggaagatgctcacgcacacacggaG A 
gaatcgaccgccggacctacgcaagcacagaggc*g^ 
c»caggcagagaaggcggagcggagcccaagagcgcacagcagccaccaaccgtgcgga
aagcagcagaagcgacagcacacgggacacaaagcgacagccgtgcgaggannnnnnn 

>430 

nncacgcgtccggctaatgaatcttggggccggtgtcgggccggggcggcttgat 
cggcaactaggaaaccccaggcgcagaggccaggagcgagggcagcgaggatcagaggc 
caggccttcccggctgccggcgctcctcggaggtcagggcagatgaggaacatgactctc 
c<xxtttc<3gaggaggaaggaagtcccgctgccaccttatctctgctcctct 
tgttcccagagci 1 1 1 ictctagagaagattttgaaggcggcttttgtgctgacggccaccc 
acgatcatctaaagaagataaacttggcaaatgacatgcaggttcttcaaggcagaataattg 
c ag aaaatcttc aaagg a ccct atctgc ag atg ttctg aat acctctg agaat agag attg at 
tattcaaccaggatacctaattcaagaactccagaaatcaggagacggagacattttgtcag 
ttttgcaacattggaccaaat acaatgaagtattcttgctgtgctctg gttttg gctgtcctg 

GGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGA 

T ACAGCAGGAACGAAAAAACATCCG ACCCAACATTATTCTTGTGCTT ACCG ATG ATCAAGAT 

GTGGAGCTGGGGTCCCT GCAAGTC ATG AACAAAACGAG AAAGATT ATG GAACATGGGGGGG 

CCACCTTCATCAATGCCTTTGTGACTACACCCATGTGCTGCCCGTCACGGTCCTCCATGCTC 

ACCGGGAAGTATGTGCACAATCACAATGTCTACACCAACAACGAGAACTGCTCTTCOCCCTC 

GTGGCAGGCCATGCATGAGCCTCGGACTTTTGCTGTATATCTTAACAACACTGGCTACAGAA 

CAG CCTTTTTTG GAAAATACCT CAATG AAT AT AATGGCAGCTACATCCCCCCTG GGTG GCG A 

GAATGGCTTGGATTAATCAAGAATTCTCG CTTCT ATAATTACACTGTTTGTCGCAATGGCATC 

AAAGAAAAGCATGGATTTGATTATGCAAAGGACTACnTTCACAGACTTAATCACT 

ATT AATT ACTTCAAAATGTCT AAGAG AATGT AT CCCCAT AG GCCCGTTAT GAT G G TGATCAG C 

CAO3CTGCG0CXCACGGCCCCGAGGACTCAGCCC^ 

CTTCCCAACACATAACTCCT AGTT AT AACT ATG C ACCAAAT ATGGAT AAACACTG GATT ATG C 
AGTACACAGGACCAATGCTGCCCATCCACATG GAATTT AC AAACATTCT ACAGCG CAAAAGG 
ctccagactttgatgtcagtg GATGATTCTGT GGAGAGGCTGT AT AACATGCTCGTGGAG AC 



TTGGACTGGTCAAGGGGAAATCCATGCCATATGACTTTGATATTCGTGTGCC1 1 1 1 1 1 1 ATTC 
GTGGTC^XAAGTGTAGAACX^GG^TCAATAGTCCCACAGATCGTTCTCAAC^TTGACTTGGCC 

cccacgatcctggatattgctgggct CGACACACCTCCTGATGTGGACGGCAAGTCT GTCC 
TCAAACTTCTGGA(5CCAGAAAAGCX^GGTAACAGGTTTCGAACAAACAAG>AGGCX^AAAATT 
TGGCXjTGATACATTCCTAGTGGAAAGAGGCAAATTTCTACGTAAGAAGGAAGAATCCAGCAA 

gaatatccaacagtcaaatcacttgcccaaatatgaacgggtcaaagaacttatgccagcagg 

ccaggtaccag^cagcctgtgaacaaccggggcagaagtggcaatgcattgagga^ 

tgg(^gcttcgaattc*caactgtaaaggac<x^t^ 

acgcggaacctctacgctcgcggcttccatgacaaagacaaagagtgcagtre 

ctggttaccgtgccagcagaagccaaagaaagagtcaacggcaaitctt^ 

gactccaaagt acaagcccag atttgtccat actcg gcagacacgttccttgtccgtcgaat 

TTGAAGGTGAAATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGTTGCAACCA 
AGAAACATTGCTAAGCGTCATGATGAAGGCCACAAGGGGCCAAGAGATCTCCAGGCTTC 
GTGGTGGCAACMGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGG^CCCACCT ACCA 
CTGTCCGAGTGACACACAAGTGTTTTATTCTTCCCAATGACTCTATCCATTCT 
TGTACCAATCGGCCAGAGCGTGGAAGGACCATAAGGCATACATTGACAAAGAGATTGAAGC 
TCTGCAAGATAAAATTAAGAATTT AAGAGAAGTGAGAGGACATCTGAAGAGAAGGAAGCCTG 
AGGAATGTAGCTGCAGTAAACAAAGCTATTACAATAAAGAGAAAGGTGTAAAAAAGCAAGAG 
AAATTAAAGAGCCATCITCACCCATTCAAGGAGGCTGCTCAGGAAGTAGATAGCAAACT GCA 
ACTTTTCAAGGAGAACAACCGTAGGAGGAAGAAGGAGAGGAAGGA 

GAAGGGGGAAGAGTGCAGCCTGCCTGGCCTCAC H GC 1 1 CACGCATGACAACAACCACTGG 
CAGACAGCXXXXnTCrrGGAACCTGGGATCTTTCT GTGCTT GCACGAGTTCT AACAATAACAC 

ctactggtgttt gcgtacagtt aatgagacgcataa i mem » ctgtgagtttgctactgg 

CTTTTTGGAGTATTTTGAT ATGAAT ACAGATCCTTATCAGCTC 
AGAACGAGG<>TTTTGAATCAGCTACACGTACAACT 

at aagcagtgcaacccaagacctaagaatcttgatgttt^gaaataaagatg at 
gacct acacagaggacagttatgggatggatgggaaggtt aato^gccccgtctcactgca 
gacatcaactggcaaggcct agaggagct acacagtgtgAatgaaaacatctatgagtacag 
acaaaactacagacttagtctggtggacrrggactaattacttgaaggatttagatagagta^ 
TGCACTGCTGAAGAGTCACTATGAGCAAAATAAAACAAATAAGACTCAAACTGCTCAAAGTG

ACGGGTTCT1GGTTGTCTCTGCTGAGCACGCTGTGTCAATGGAGATGGCCTCTGCTGACTC 

AGATGAAGACCCAAGGC^TAAGGTTGGGAAAACACCTCATTTGACCTTGCGAGCT 

AAACCCTG CATTTGAACCGACCAACATTAAGTCCAGAGAGT AAACTTGAATGGAATAACGAC 

ATTCCAGAAGTTAATCATTTGAATTCTGAACACTGGAGAAAAACCGMAAAATGGACGGGGCA 

TGAAGAGACTAATCATCTGGAAACCGATTTCAGTGGCGATGGCATGACAGAGCTAGAGCTC 

GGGCCCAGCCCCAGGCTGCAGCCCATTCGCAGGCACCCGAAAGM 

GGTCCTGGAAAGGACATTTTTGAAG^TCAACTATATCTTGCTGTGCATTCCGATGGAATT^ 

GTTCATCAGATGTTCACCATGGC^CACCGCAGAACACCGAAGTAATrCGAGCATAGCGGGGA 

AGATGTTGACCAAGGTGGAGAAGAATCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAG 

CGCCTCCTCTTCACTCTCCTCTGATTAGATGAAACTGT^ 

TTTTTAACTTTTTTATTTGTAAACTAATAAAGGTAATCACAGCCACCM 

TGGGTACCTTTGTGCAGTAGAAGCTAGTGAGCATGTGAGCAAGCGGTGTGCACACGGAGAC 

TCATCGTTATAATTrACTATCTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGC 

TTGGTTTTGATTTTTTGCTTGTITGTTTGTTrrGTACTAA I 1 1 I GAATATCGT 

AGGGACATAAGTATATACATGTTATCCAATCAAGATGGCTATAATGGGCTTTCTCAGAGATAA 

AACTTGACCCCCGTGTCAAATTGACATCACACrrCTGCATGTCTGCGTAATGAAGGTACGATG 

CAACTATAACCAGTGCAATATGACACTGACACTATATTAAATTCAATAATACNN 

>431 

>432 

CGGGCAG GTACGCGGG ATTT ACCG GGCAGTCAAAG ATCT ATT CCT ACATGAGCCCG 
AACAAATGCTCTGGAATGC G I I I CCCCCTTCAGGAAGAGAACTCAGTTACACATCACGAAGT 
CAAATGCX^GGGGAAACCATTAGCCGGAATCrTACAGGAAACGAGAAGAGAAAAGAAATGCT 
GGGAACGCAGTAAGC^GTGGTAACAACGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAG 
AGTCCCGGGAAGCAGTGGTACCTCGGCC 

TACCTCCCCACCAAATGTCCTTAGAGGGCC 
AAAGATGGC C I MUM IC I I CATGATAACATCGC C 1 1 I CI H I II I I 1 1 1 1 GAGACACGGTTTC 
ATTCTGTCACCCAGGCrrGGAGTGCAGTTGTGCATTCATGGCTCACCACAGCTCT^ 
CAGGCTC^GGTGATCCTCTCACCTCAGCXrrCCCCAGTAGCTGGGACTACA 
TCAAGCCCCGGGTAA I Mi l GAAA 1 1 1 1 1 ATAGAGACAGGATTTTAOCATGTTTCCCAGGCTG 
GTCTTGAATTCCTGGGCTCTAGTGATTCCTCTGCCTTGGCCTCCCAAAGTGCT 
GGCATGAGCCACCACACCCCACCTGTCT ATTTTACAATTTTCCTTTGAGCTC I I I I I I CCAGC 
AG TCATGAAGCTGGCAAAT GGCAGAA CTGGAGCT AGAAACTG CT GACTCCCTTT ATCTTTTC 
CATAGCACCCCAAGCCT AAAACCAG ACTGGCACAAATGGTA 
>434 

GGGAGTCGACCCACGCGTCXXTTCTAGATCGCXXXXXM3 
AATAGTATCTAAAAAATCATTTACTTCT CCGAGCCCTTCTT AGCTATGGTGAATGTGATGGAC 
TTAAGTGCTTTGAGATCTCTAGTATCAAAGATGTATTTA 

TCAATTAAGAGCTCCAAAATATGAAAATCCAACAAATAGAATTTrTAAAGCCT GTTCAAAAGAT 

TTATATT7TTCATAGGGGGTGCTCAAAGGATTTAAAGGAGAACT AATTCATGT ATTTAACAAAC 

ATGATGGTGKXnTGAGGAATACAGAATATTTCAATCAACT AAAAGACAAT AGT AACATAATTC 

TTCTGGGAGACTCCCAAGGAGACTTAAGAATGGCAGATGGAGTGGCCAATG 

T CT GAAAATTGG AT ATCT AAAT GAT AGAG T GGAT GAGCTTTT AGAAAAGT ACATGGACTCTTA 

TGATATTGTTTTAGT ACAAGATGAATCATTAGAAGTAGCCAACTCT ATTTTACAGAAGATTCT A 

T AAACAAGCATTCTTCAAGAAGACCTCT CTT CTGTGGGTGCAATTGAACTGTTCATCCGTT CA 

TCTTGCTGAGAGACTTATTTATAATANN 

>435

>436

>437 

>438 

NNNNAGGTGACACTATAGAAGAGCTATGACGTCGCATG^ 
GATCCTCT AGAGCGGCCGT CT ACT ACT ACT AAATTTGCGGCCAGCGT CGATCGAACTGGGTT .- 
. AGGTGCCGC I G n QCTGCTCGTGTTGAATCT AGAACCGTAGCX^QACATGG^GACTGGAGGA . 
CGAGCAAAAGATXSCTTAOCGAATCOGGAGATCCTtaAGG 

ATTAGTGGATCCCCTAACAACAGTGAGAGAGCAATGCGAGCAGTTGGAGAAATGTGTAAAG 
GCCCGGGAGCGGCTAGAGCTCTGTGATGAGCGTGTATCCTCTCGATCACATACAGAAGAGG
ATTGCACGGAGGAGCTCTTTGACTTCTTGCATGCGAGGGACCATTGCGTGGCCCACAAACT 
CTTTAACAACTTGAAATAAATGTGTGGACTTAATTt^CCCCAGTCTTCATCATCT GGGCATCA 
GAATAI I ICCI I ATGGTTTTGGATGTACCATTTGTTTCTTATTTGTGTAACTGTAAGTTCACAT 
GAACCTCATGGGTTTGGCTTAGGCTGGTAGCrrCTATGTAATTCGCAATGATTCCATCTAAAT 

aaaagttctatgatctgcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaacaaaaaaaaaaca 
aaaaaaaaacacaacaacaaagaaagaagagacgagcaagcaaaaaggaaaagagaaagaaa 
aagaagcaggg aacacgaggaaagag agg g acag ag aag ag agaggggtgaagaag aaag 
taggaggggaaaaggagagaaaacaaaagaaaaagaacaggaacacgagacagaaatagaa 
ggagaaggaaggacagaaaagaacgagtgaagaggacaggaggtagagacgatgatgaga 
gacagacagtagaagggaggaggaaggagaggaattcagagggagatagaggagaagaaa 
tcaatcagcgataggatggaaaacgaaagagtgatgtgtttgtcaggtggagaagan 

>439 
>440 
>441 
>442 
>443 
>444 

nngagtcgacccacgcgtctgi miiiiiii igctgataagaattcttttatgttatt 
ccaataaaaaatacattcatacagaaatataacaatcttgcaaaaaacaatttcaaataaaatc 
ttgtaaaacaaaattttacaaaaatcttacaaagattctttagataacagggtgc^ 

AAAAAAAAAAGAAATTTCACTAATAGAAA 1 1 1 1 1 1 1 1 1 AATTTCAAGCAAAAAGTTTCTGCTTGA 
TTGAGGCTCAGTTATCACCTGAACAGAATGTACTTCTTTATGTACGTGCT AATTATGAAAATC 
ACAGGCATTGACAGGTACGGTA CCCAG CCOCACCCAGGCAAACAGCTCCGJ^CATGTTTCGT 
AAGTGAGACAAGCCAGTGCAAG 1 1 II 1 1 1 1 II ICC I M 1 1 1 1 1 1 I I 1 1GTCI I I I GCI I A CCTTC 
TTGCTTAATGGAATTGTTATGGCTAAGCACATAGAAGGCCAAAAAAGGAGTTTTCCAAACCCA 
GCAAATCAAGTGCTTGGATTCTGAACTGCCAAAAGAAAACTGCACTTCCCCTCTTAAGTAAAA 

cgaaatgagtttcttaggtaaatgtattcatcagcccagataaaaaaaaaaccagttatgtga 

GCGTTAGTCACTGCTCftTTTCCAGGAAGATCAAACAAAATACCAGCCCAGCCAGACTCACAT 

gtgtgtatatatatataaagcaaagagccacacccacaag ccag cagctggggtgaaaatat 
cagctgttcgacgccgtgggtattggoccattcgcggcggaaatttactccttggggaaaa 
actgggagggattctacttggctgggggaaaaattggg N I I C M CI I ggcctttaaaaggt 
Gl I ICI I I ocaaaaggtiisicccgttggggttaggcntactctttggqggtt^iggtccniw^ 

NN 
>445 

NNNNNNNNNNCTCACTTAGATTTCAGACCCC^ 

atgcagcagcaggaggctacagatgtccacaacagaggagtcgactcagaggaagccgag 

TTCTTAGGGCATGAGCGAACCCCCTTGCTTTCCCCTTCCCCTGTCCTCCC^ 
AGGACATGGACACAGCTGTGGAAACTGTTCAACAGAGTAGGGTAACTAAAGAACCAGCTT^ 

gggacacagacccgaaagaggagttct aagggctggaaaatagcaaggaactggcagaaag 

AGGAAGGAAACCCATAAAGTTCTATAAAC rCC TBI I I ATAAATCAAAACACCAAGCCCATAAA 
GCTTAT AAACTCCTGAGCTCACCCAAAGTTGT ATATATATGTGT ATCTGACCCTAAACAGTGC 
AGTAAAGACGTTGAGAATGAACTATGGGATAGCTCACCACCTAGGTCCCAGACT 
GGTCGCATATACCGAGCAAAGATTCAAATAGCATCACA^AGGCTTTGAAACAjGAATTGATATT 
GAACCTTCAACCCACAGAAGACTGACCAGAACTTCTCACTTGAACCC^ 
ACCCAAGTCAGAGAGCTCTGCCATTAG GACGGGCAGAAAOCGTCAAGATTCATACTGCTGTT 
TCTCAT AAGGACTCAACAAAAATGAATCTTT AGTGACTTGAACCATAGACT GAAGGAATCCAA 
AGAATGCAGTCATTTTAGT AAAGTACTATCAGGCTTTGTGCTG A I I I C C I G AACAAACTGCAT 
TATATTATGAATCCTAACGGGGGCCGGGAAGGCCGTCTTGGGTCCTCGCTGCT6X3CTGCT 
CCCGTGGTGGTGGATCCTGGAGTTCTCTC^CGCAGGAGCCAGCGCTCTCCTAGAGG^ 
CT CAGATCCTGC<WKXXJAGTTOCTTG GAGGGACATGAGTAATGAGTOG^TCTTT ACTCAAT 
TTTTATCAGTTTTAT ATTTAT AAGCCTGATTTATGACTGTAT ATTTAATGTTAATAGT GTGCACA 
CTTATCTTTAAAACGTGCCTGGTGTCT GAG CNNNN 
>446 
>447

QGAGTCGACCCACGCGTCCGGTGGCTTCCTGCGGCGTTtCCACTCTCGCTCTCCTG 
GCGTTGCCTGATCGCCGCCCATCATGGGTCGCATGCATGCTCOCGGGTG AGCT GGGGGCA
TCAAGCCGGATTGCTGGGCGGGGGGTGGGAGGAAGACAGGGAGTGTGGGCAGCGGGCCG 
AGGGGATGATGTTCTGGGCTGCTCTGGCACTAGCCGCCAGCTCACCTCGAGAGTGCTTCTC 
TCCCCGGGAAGGGOCTGTCCCAGTCGGCTTT ACCCTATCGACGCAGCGTCCCCACTTGGTT 
GAAGTTGACATCTGACGACGTGAAG GAGCAGATTT ACAAACT GGCCAAGAAGGGCCTT ACT 
CCTTCACAGATCGGTGTAATCCTGAGAGATTCACATGGTGTTGCACAAGTACGTTTTGTGAC 
AGGCAATAAAATTTTAAGAATTCTTAAGTCTAAGGGACTTGCTCCTGATCTTCCTGAAGATCT 
CTACCATTTAATTAAGAAAGCAGTTGCTGTTCGAAAGCATCTTGAGAGGAACAGAAAGGATA 

aggatgctaaattccgtctgattctaatagagagccggattcaccgtttgggtcgatattata 

agaccaagcgagtcctccctcccaattggaaatatgaatcattttattgcttg^ 

caaatttatgcgaccagggcagaggctgtagatgattcactgaaaaagaaaaaaaggggaaa 

G AAAGAAAATCAggATTAATACT AGTCCCTCGACATTCATTAACAGAATCT CCCCACT AATGGT 
GCTATTCTCAAACAGTTAATAAATCCAAACTATATAAAGTGCACX>AACTTCTGAATGCJCTGA 
ACCAACTCATTGTGTTCAAGCACAATGAATGAAGATGCTCACTCAGaCGTCCAAGGAAGAAC 
TGGCCAACATAAGGAAAAACATTTCTGGTGGAAACT GCGAATGTCTGGCAACCAATCATCata 
gTGAGCAATTCATGTGAAAATTAATTCAAACTCCTGGGAGtaaagGTATGAAAGGAAAATCTAC 
CT ACT ATCTAAAAGAGAGTCAAAGGGCTTATTCATGGTCT AAATCT GGGGTCT GCAACTATG ' 
GCC CTCT GCTT GTCTTTATT AAAGACAAGCAGGCACACCCATTCGATTATGTATTGCCTGTGG 
CTGCTTTCCCACTACCAGGCCAGAGCTCAAAGAGCAGTTAACAAAATAGCCCCCAAGGCTG 
AACAT ATTT ACT ACT AGCCCCCAAGGCTGAACATATTTACTACT AGGACTTTTACT ACTGGCT 
GACTCCTATTCTTGATAACAACGACAAAAGAGTTGATACTTACTATTTCCAA 
GACTCGCTTGGTCTTATAATATCGAGCCAAACGGTGAATCCCGCK7TCT ATTAGAATCAGACG 
GAATTTAGCATCCTT ATCCTAAAAATAAAATTTGGTGCAATTCAAGAAAACCACCCAAAATGA 
CCTAACATTATCAAACATTAATAAGAGAATGTAGAGCTACCATGjCACTTTCT ATAATGAAAATC 
TCTTTG^ATAGAATTTGAACTTCAAAG<^GCACTACACTGGTTTGTGTACT AAGCCTTGGATA 
ATTCAAACCAG ATGCATTACATTTT ACCGAAAAAATTTAGTACAAGCAGTCCT ACTT AGCACC 
AGGTTTT ATTT ATT AGCTTACC I FTCTOn CCTCT CAAGATGCTTTCGAACAG CAACT GCTTT C 
ITAATTAArTCTAGAagtACtctogagggggggc 
>448 

CGCGGTGGCGGCCGAGGTAC 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 J GTT AGTGTTTTCTGATGTC 
TTTTC T AACAAA TCTTTGCCTGCCCAAAAGTCTCAAAAACATTCTCACGT^ AGATTTTTAG 
CTTT AG CTTTT GTGTTTGGGACTATG^TCCAT ATTT AGTGAATTTTATTTTTG GGGGGGCAGA 
GTCCATGrrTGCCCAAACTGGTCTGG^ACCACCACACCCiAGCT AATTTTTGTGAATTGCGGGT 
ACCAGCACACCGGCGCXIX5TCCTGGACTGCGCCTTCT ACGATCCAACGCATGCCT GGAGTG 
GAGGACT AGATCATCAATTGAAAATGCATGATTTGAACACT GATCAAGAAAATCTTGTTGGGA 
C<X^TGATGCCCCTATCAGATGTGTTGAATACTGT CCAGAAGTGAATATGATGGTCACTGGA 
AGTTGGGATCAGACAGTTAAACTGTGGGATCCCAGAACTCCTTGT AATGCTGGGACCTTCTC 
TCAGCCTGAAAAGGTATATACXXTTCn"CAGTGTt7rGGAGACC<5GCT GATTGTGGGAACAGCA 
GGCCGCAGAGTGTTGGT GTGGGACTTACGGAACATGGGTTACGTGCAGCAGCGCAGGGAG 
TCCAG<X7TGAAATACCA<^CTCGCTGCATACG^GCGTTTCCAAACAAGCAGGGTTATGTATT 
AAGCTCTATTG^GGCGAGTGGCAGTGAGTATTGGACCCAAGCtXT 
TATGCCTTCAAATGTCACAGACTAAAAGAAAATAATATGAGCAGATTTA 
TTCT 1 1 1 CACAATATCCACAATACATTTtKX^ACAGGTGGTTCTG^TGGCTTGT AAATATTGGG 
ATCX^TTTAACAAAAGCGACTGTGCACTTGCATO^^ 
CAGCATGATGGATAOSCTTCGACAAGCGCTCATTGTTGAC^ 
GGTTTCACTCCGCAGGGCGTGGGAACAACCAGGCACGATTAGACAAT(^ 
GCX^TAAACACAGACCCAGGGGGTTCATTAAACCQCATTTAGACCGCGAGC^ 
ACCCGCGGGGGGAXSGCAGCAACGCACTTGGTAATG^GCACAGNNN 
>449 
>450 
>451 

G<XSTCCG ATCGCT lTAAAAAGOCGGGCTCT AAAACTTCCAGAATATCTTTCCACCAT 

cagccactctgcotctttocaacattcco cci reran ACAGTGGATGATCTGAAGAACCTTT 

TCATAOAAGCTG^TGTTCAGTGAAfiGCTTTTAAA 

CTCATTC^TTCGGATCTGTOGLAAG^ 

CCTTGGAGAAAATCACCACCTCAGAG 1 1 r C CI I CI CAAAATCTACAATCT GACTTTTCTGTGA 
ATTTTTCT CCTAA AACTGGACCATAATTTCAGTAAAAGCTTCAGACATAGACT GAAGCAGCTC
AAGACCAATTTTGCCTCTTTCACAAAAATAACTCTTTCTGAGTTTG^ 

AAATCAAGGGA I I I f 1 1 1 1 1 1 I GTATTCCCCCTGCCCCCGCCAGTCAAAGG 1 1 I C I I I ICCCT 

ttttaccatggtttctacaaaaataaccttcaggaaaaagaaaatcaggaaaaaaa r mini 

CAAT AATCTTATTCCCTATATTAAATTAGATTTGAAGAGGATTAACGTTGTTTT AGTTTGGGTC 
CAGATCAGCCTT AT ACAACATTTCT AAA CTCATTTGTACTTTT AAAAAATTT AAACACAG ACTT 
CT AAAATTACTTGATGT AAGT AATTTAAATCACTTATGACCAAGTTATT AACCTT ATGAATCAG 
AAGTCTGACCCTTGTAGGAAATTATATTCACATATAAAGTACATGAGATCTTTGCCATATATTG 
^TGG TTATTATGCATAAACACATTGAGTTGTGTTG<aAAGCAGATTTATAAACCTGCATGTTTT 
CTTTGAATGAI I ICI I II II I CACTGTAAGACACTCCTTTAAATAATGCCTATCTTT AACTTTTT 
AAGACT ATTTGGAAAAATGCAGTGTCTCAGCTGTCXXX^GGGAAATTAAGTGGAATTCAACT 
AAGATCTGTTAATAAGATGTCAGAATAACT AATAATTTTATTAGGAAAAAATCATGTTTTAAATT 
TCAAAATGA CACTT ATT TGTCAA GTAA T ATG ATCTTGGAAAATTTT AAAGAAAAATAATCCT AC 
TTATAAACTACTrTTTTATAATTGTTTTCAGAAAAAAAGTTTACAGTCTT AAGGAAAAT ATTCAG 
GTCT ATCATATGGTTTGACAGATTTTTTAAAAGTTATTTTTGGTAAGGTC I r C TTI I AGAAAAA 
AATTAATCTCAAGGG rTTTTTGl ACCACTATAATCTCTAATACTTACTCAGAATTACTGTGTAT 
TTACTTAAl MUI F ATTATGTGCCTTATTATGTGCTTAAGATACAATAGGTTAGAGTTTAATCT A 
AATATCTTGAAAGCTATATTGTGGGCTTGGTAAGCATTTTG I M l II C I 1 1 U I CTGTTTTGGTA 
AGGATTT AAAATTTTTTTCATT G CAATTTT AAG TGGTTTTCAAT AAGT AAT AGTTTTT AT CAAAT 
TTTrGGTGCTTGGTGC AGAGA CGGTGTGGGGAAGGGTGAATGGTTTTGGGAATAATTC^AGT 
GCACACCTGTAGGCCTCTTTACATTGTGACTGATA . 
GTAGAGTGCAATCTT^GTTTCATCTTTITCACCCATCAGAAT^ GATTACTTGGTTT 
TTCTCAGTCCTCAAGCGAGAACTTGCI I I ICI II U 1 1 AATGTGACTTTCATTACTGAGTACCC 
ACGTATTTGGAGTATGAGAAGGTGGGTTATTTCTCATACTCTGTCCCT CATTGAA 
TGTAAGAGTACATTTTAATGTTGCTTCAGTGATTGTATAATGTAAAATTG rTCTTTl IAATAAGA 
AACTTTGCT AT TATTC TTTCACTGGTTGGCAAAATGTAAAGCGAAATAGGACAGsGTCATTATG 
GTTCCCCTTAACTTTACAATTTATCXiACTGTCT 

ggggcactcaocatacacacatagacggaacagam 

>452 

N CI 1 1 1 1 1 1 II II M 1 1 1 AT AG I I GI C I III ! I GGTAACAATGTGT AAAGTGAAAACAG 
CAGAGTGCT ACT CXAT ACCACTGCGATCTTGTCCAGT AAACATCX^AGAGAGT GAGGTTAGGA 
AATAAAAAGTATATAAATATTAGATGCCTAGAAATGGAAGTCACTTT aaagattttatgtgaaa 
TAGAAAAAAAAGAGAGGAGAGGGACTCA r TGT CTI GTAATGGGTCCTT CCCAGAGAGAGGT 
GACTGTCCAGTGGCACOGGGCCC 1 1 1 1 OCTCCTTCCC G TTI T ACTCTT ATCGACTAGGACAG 

aaact aagaattttggcttcaagtggct aaaagact gat ggg ggaaaaagaaaa t agaaaaa 
a ataa cagagagactgacgctctaggcagttacaact 

ACTTT AAGT ATTGAG CCAAAACCAiM T CT AC^AAACAT AATGCTGG<XXTT AGATT ATTT ATT A 

ATT TATG AAGAAACTTCTAGATATGGGCGTGACAAAAGGAAATTAAATCCATTATATATG<XAT 

ATATTTTAATGTAAATATATAATAGATAAATTATGTATACATAATATACAACC^AATTGA 

TTTTACAA I fTGGI 1 1 GACTGGAAATTCAAAATCCATATATTAATTTTTGTAGT AAAAGTTTATG 

TAAAAAAAGACAAAAAGGACACTGATTGGCI 1 1 IGI ICCICI ACAAT ATAGGTTCTAACTT AA 

CTTCTAACATT AGCGGCCGGACGCGGTG<»CTCACGCCCCT AATC^ AGCAGGC 

CGAAGTG^CAGATCACCTGAGGTTJAAG^GTTCGJVG^ 

COCATCTCT ACT AAAAG T ACCTGGGCCGGCCGCCGCCACCGC 

>453 

>454 

MNfOOWCTtGAAATNAACCCTCACTAAAGGAACAAAAGCT 
TGGCGGCCGCTCTAGAACTAGTGGATCCCCXX5G<5NGCAG 
ACGGCTCCGAGAAGACGACAGAAGG<^^ 
CTCTGATCOTGCACCCTGGO*GGAA^ 
^rTOj^AAA Q AA^GAC T^ 

AAACTTGGTCG^TCTAAGCGGTGGCTTGGAGACAGA^ AG . 

<*£J*GTGCT*3GGGAT ACATGAGAGGTGATGGATACTCAfiGCGGGCTCCGTGGATGAAGAGA. 
ATGGCCGACAGTTGGGTGAGGTAGAGCTC^ 

TGACACATTTGGCATAGATACCTCATCCTGTCTACCrrTTCATGACCAACT 
CAACXjTCTGCCA TCAC AGTGGGAATACCTATTTCCTCCGGAAGCAAGC^

TGTGCCTT AGTGCTrTGGCAACCTGACATGGCAGTCCCGAACACAGGATGAACATCCGAAG 

ACAATGTTCTCCAAAGATAAGGATATTATACCATTTATTGATAACT ACTGGGAGTGCATGAGA 

ACCAG AC AGAGAACT GGGAAATGACTTGG GCAAT AACTGTTT AAACATG AGT ACG AAGAG AN 

>455

N AGCAA AGCCACATCAGGACTAATCAGCTGGGCXjTAGCCAAAGTTACCTCCTCTCT 
GTTCTTCCI I ICI I CTCTGTAGCCCCAAATCACAATGTTTGGTTCGTTTCCTCCCCTAAGAAC 
AGCTTTT ATTGTCTCTGCT CCCT ATCCTGACCCTT CATTGCTGAGGCCT GAG GATCTCT GTCT 
TTTGTTCCCTC*CCTGTCTGCCTGTCTCCT 

ACATCATCGTCCAGTTCCTCTG<iATTTGAACAGCTGTTCCCCCACCC^ 

GCAGAAGCCAGCAAATACT ATCGGTCAGGGACAGATAGAAACTATTTTCGGCTTCATGGGCC 
ACACAGCCT C^TTGCAGCTCCTCAAATCTGCTGTTGTAG CAAGAAAGAAG CCAT ATACCCTG 
TGT AAACAAATGAATATGGCTGTGTGCCAATAAAACT ATTCACAAACAT AAAGAGTGGGCTG 
GATATGACTCAGATACTGTTGTTTGACAACCCCTGATCTAGAGTAAAAAT(XX 

gcctccagcctgg^agacagagcgagactccgtctcanaaaaaataaataaataaataaata 
aataagtaaaaaatcatatccgagctcggcgtggtggo-cacgcct 

GGGAGGCTT^AGGAGGGGCAAATCACTTCJ^TCAG^GGTTCA^ 
>456 

TTCTCTGACAGGTTTATTAGCTTTCATGTTAATGGATGTTTTTAAACCCT 
CT GTCAACTTCTTTCCACATCAAGAGGCCATGAGATACAGT AATGGCCTCTTAAGAGTCATG 
CCACATAAAG^TGATXaACTTTGATtSTCCrGGCCTGCCTCCT GTAACAATGTGAGGCTGTTTT " 
GGGT ACATGCTGT AATAACAACAGGACTATCACAGGAACAATGAAGCAGA<xAAGCAGAAGG 
TGCCT ACAAAGTTTTACCTAAATGTC II G I II GTCAGGATGGAGCTGATGCGCCCATACTGG 
CAGAACAATTAGCACAGAGTAfiTTCTGAAAAjGGAGGAAGAATTCACAA^ 
GTGCAGGGAACTCGCCATCTGX3ccaATG<iGCACCAGAGTGTCACTG GGAAGGGGAAA 
AGGAGTGGGCT GGATGAGATCCAGGGGCCTCTCTG<XX^TGriTCAAAGTCAGTAG<rrGCTC 
TGCCTGCCGGCATCACAGTGTCAACACATTTTGTGTGTGTCTCACTC^ 

aaagtaaaaacaattg^tgtaggaataaggggaatggataaagaatggtacatgcttattcta 

aagcag<1agagcctacagattgttgtcaaagctcatgtct atctctgtcctcgaggoctccac 

accatctccctctccctcctccacaatccaagggatgagccc^ 

gtggagcagt acctct agtttttggct gctcatggcctcagagttgctgtttggaaaggaaa 

AAGAGCAGCAGAGGCAGCAAAGAAAAGTACTAGGGCAGCACCTTTAGGT^ 

CCAGAGTT AAAAGCACAAGT AT AAT ATGCATT AAACAGAGAAGGGOCT CAAT AGAATGGTCA 

C^TATGCAAGTGTGTCCCACTCATTTACAATATTAG^TTCCAG 

C ^TCT CATTAACATACCTACCTGCAGTTCTG>TGTCTGGCT 

TGTTTCTCT AAAATTTAGAATCTT AAACT AAATCCTTTATTTCAAAAACAAAC^ 

CCCAGGOAGAGAAAAGGTTTGAGATGGAAGCGTI CI I G 1 1 AGCAGTCCCTT CCT GCATAAAT 

GGGGTTGGAGAGAGAGGAGAOAGGGAATGGCCAAGGCTATGGAAAGCT^ 

GCCGAGTGTGAAGT GACACOCCGAGCAGATGGGGTTT ATCATCTTT ACTT AGT CACACAACA 

TCAAGGACTGGTTAGTTCCAGGGAAGGGCTT^CATTTCATT ACCTGGGT GAGTTCTCTTCCCC 

CGCATGCTCCACAATGCAGT AGAAOCAAACAACACATTCATTTACAATAQAATGTTT AAATAA 

CACCT GTCCAATAAC T GCCC 1 1 AC I ICI I IGIGCTGTCGGGAAAAGAAAAAACAGAAAGCAA 

T AAAACCCAAACCCTTTGGCAAGCCAGCCtXSTGCCT ATTCTCCTG^ GAA 



GGTC ATCTGC AGTGG^TAGAAAGAGGAACCAAACACGATTCG^ 

GCACATTTTT AGATTGGATTPGTTAAAA*^^ 

n II III rTTGAG<SGACATAATGOCTATAGGC 

GGAGCTCCCTGTGT AGGGACOGAOAAACTAGGGGGCCT C&AGGAAGGTGTTGGTTCGGAA 
GATGGAGGAGACAATCTTTCOGGTCGCATTGATGGTG^ 

GAGTCGCT GCT CCCAGA^ AGGCACCCAGGCCTGGGAGGAT GCCGATACAT ACTGCAGCA 
GA GGGGCA GT G^GAnSG AC^SGGOCACTCAGGGAGG I SI I ICXX»AGAGACTTGCAGGATGAG 
GGCl CT IGATTCTTG TCAT CTGGCTCAGGGTITXXaGTTTCA G I CT T T T CACACTGTTGCCACT - 
CTCTGAGCT C1 1 AT GCTTCA^AfiCTCCTGOCAACAGCTrOGCCTGGGAGAA C I »G I rCTTG G
nTCTATAGGCrrCACAGTCA fa 1 4-1 C 1 1 1 ICC A CI IUCI IC I I GI I CTC 1 1 G AGAAATTCCAAC

cttcttgaggttatttctgtattocttcag i ici i rc A Oi ici rcrrcrci rcGi reci 1 1 i c ta 

TT AGTT CCTGCT GTCGAGAAACCTCATCAAGGAAGTT GGT CT CATCTTCATCT AAGCCTCTT A 
CX-ATGTTTTTGAATTTGAACTGTTCCTXrGTACTC^

GCCTTTCATATAGAGATCGAGGGTCATAAACCTCCTCTGGACATTCrrTCTGGATCTTCAGGTT 
TTCGAACTTTCTCCCATTC U C I I GCCTCCTTTTGCGCCGTTCATCTAGTTCTGCCTCAGACA 
CAAACCTCTTTTT GATAATAAGGTTACCATCATCCCCTCCATCCATAATGAAACAACCAATCT 
GCTCTTGCCTCCAAGTAGATTAAG ATGGAGAATCTCAGCT ATTGAAGACTT AGTGTTCAAGTC 

GCCACCGCAGAGCCGCT C^CTCTTAAAAAAG AAAAAAAAATGAAAGGAAGAAAGAAAGAAAC 

AGAGGAAGGAAGGAATGAAAGAAAGAAAAAAAAAAAGAAAATAGCCTTTGCTTTTGTATTTCT 

TTTGAcCcttCAGG GCTTcctcgGAcgc 

>457 

>458 

NNCACGCGTCCGGCTAATGAATCTTGGGGCCGGTGTCGGGCCGGGGCGGCTTGAT 
CGGCAACTAGGAAACCCCAGGCGCAGAGGCCAGGAGCGAGGGCAGCGAGGATCAGAGGC 
CAGGCCTTCCCGGCTGCCGGCGCTCCTCGGAGGTCAGGGCAGATGAGGAACATGACTCTC 
CCCCTTCGGAGGAGGAAGGAAGTCCCGCTGCCACCTTATCTCTGCTCCTCTGCCTCCTCCC 
TGTTCCCAGAG C 1 I 1 1 1 CTCTAGAGAAGATTTTGAAGGCGGCTTTTGTGCTGACGGCCACCC 
ACCATCATCTAAAGAAGATAAACTTGGCAAATGACATGCAGGTTCTTCAAGGCAGAATAATTG 
CAGAAAATCTTCAAAGGACCCTATCTGCAGATGTTCTGAATACCTCTGAGAATAGAGATTGAT 
TATTCAAOCAGGATACCTAATTCAAGAACTCCAGAAATCAGGAGAOGGAGACATTTTGTCAG 
TTTTGCAACATTGGACCAAATACAATGAAGTATTCTTGCTGTGCTCTGGTTrrGGCTGTCCTG 
GGCACAGAATTGCTGGGAAGCCTCTGTTCGACTGTCAGATCCCCGAGGTTCAGAGGACGGA 
TACAGCAGGAACGAAAAAACATCCGACCCAACATTATTCTTGTGCTTACCGATGATCAAGAT 
GTGGAGCTGGGGTCCCTGCAAGTCATGAACAAAACGAGAAAGATTATGGAACATGGGGGGG 
CCACCTTCATCAATGCCTTTG TGACT ACACCCATGTG CTG CCCGTCACG GTCCTCCATGCTC 
ACCGGGAAGTATGTGCACAATCACAATGTCT ACACCAACAACGAGAACTGCTCTTCCCCCTC 
GTGGCAGGCCATG CATGAGCCTCG GACTTTTG CTGT AT ATCTT AACAACACTGGCT ACAGAA 
CAGCC mil rGGAAAATACCTCAATGAATATAATGGCAGCTACATCCCCCCTGGGTGGCGA 
GAATGGCTTGGATTAATCAAGAATTCTCGCTTCTATAATTACAC 1 G I I I GT CGCAATGGCATC 
AAAGAAAAGCATGGATTT G ATTATGCAAAGGACT ACTTCACAGACTT AATCACT AACGAGAGC 
ATTAATTACTrCAAAATGTCTAAGAGAATGTATCCCCATAGGCCCGTTATGATGGTGATCAGC 
CACGCTGCGCCCCACGGCCCCGAGGACTCAGCCCCACAGTTTTCTAAACTGTAGCCCAATG 
CTT CCCXACACAT AACTCCTAGTT ATAACT ATGC ACCAAAT ATG GAT AAACACTGGATTATGC 
AGTACACAGGACCAATGCTGCCCATCCACATGGAATTTACAAACATTCTACAGCGC^^ 
CTCGAGACTTTGATGTCAGTGGATGATTCTGTGGAGAGGCTGTATAAGATGCTCGTGGAGAC 
GGGGGAGCTGGAGAATACTTAGATCATTrACACGGCCGACCATGGTTACCATATTGGGCAGT 
TTGGACTGGTCAAGGGGAAATCCATGCCATATG ACTTTGATATTCGTGTGCC 1 1 M l I IA TTC 
GTGGTCCAAGTGTAGAACCAGGATCAATAGTCCCACAGATCGTrCTCAACATTGACTTGGCC 
CCCACGATCCTGGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCC 
TCAAACTTCTGGACCCAGAAAAGCCAGGTAACAGGTTTCGAACAAACAAGAAGGCCAAAATT 

GAAT ATCCAACAG T CAAATCACTTGCCCAAAT ATGAACGG GTGAAAGAACT ATGCCAGCAGG 
CCAGGTACCAGACAGCCTGTGAACAACCGGGGCAGAAGTGGCAATGCATTGAGGATACATC 
TGGCAAGCTTCGAATTCACAAGTGTAAAGGACXXJAGTGACCTGCTCACAGTCCGGCAGAGC 
ACGCGG^ACCTCTACGCTCGCGGCTTCCATGACAAAGACAAAGAGTGCAGTTGTAGGGAGT 
CTGGTTACCGTGCCAGCAGAAGCCAAAGAAAGAGTCAACGGCAATTCTTGAGAAACCAGGG 
GACTCCAAAGTACAAGCCCAGATrTGTCCATACTCGGCAGACACGTTCC I fGT COGTOSAAT 
TTGAAGGTGAAATATATGACATAAATCTGGAAGAAGAAGAAGAATTGCAAGTGTTGCAACCA 
AX»AAACATTGCTAAGCGTCATGATGAAGGCCACAAGGGGCGAAGAGATCTCCAGGCTTC<^ 
GTGGTGGCAACAGGGGCAGGATGCTGGCAGATAGCAGCAACGCCGTGGGCCCACCTACCA 

ctgtccgagtgacacacaagtgtttt attcttcccaatgactct atccattgtgagagagaac 
tgtaccaatcggccagagcgtggaaggaccataaggcatagattgacaaagagattgaagc 

TCTGGAAGATAAAATTAAGAATrTAAGAGAAGTGAGAGGACATCT GAAGAGAAGGAAGCCTG 
AGGAATGTAGCTGCAGTAAACAAAGCTATTACAATAAAGAGAAAGGTGTAAAAAAGCAAGAG 
A AATTA AAGAGCCATCTIGACCCATTCAAGGAGGCrrGCTCAGGAAGTAG 

TGGCCTCACTTGCTTCACGCATGACAACAACCACTGG 
cagacagccccgttctggaacctgggatctttctgtgcttgcacgagttctaacaataacac

c tactg gtgtttgcgtacagttaatgagacgcataa 1 1 1 ici 1 1 1 ctgtgagtttgctactgg 

ctttttggagtattttgatatgaatacagatccttatcagctcacaaatac^ 

agaacgaggcattttgaatcagctacacgtacaactaatggagctcagaagctgtcaaggat 

ataagcagtgcaacccaagacctaagaatcttgatgttggaaataaagatggaggaagctat 

gacctacacagaggacagttatgggatggatgggaaggttaatcagccccgtctcactgca 

gacatcaactggcaaggcctagaggagctacacagtgtgaatgaaaacatctatgagtacag 

acaaaactacagacttagtctggtggactggactaattacttgaaggatttagatagagtatt 

tgcactgctgaagagtcactatgagcaaaataaaacaaataagactcaaactgctcaaagtg 

acgggttcttggrrgtctctgctgagcacxsctgtgtcaatggagatggcctctgctgactc 

agatgaagacccaaggcataaggttgggaaaacacctcatttgaccttgccagctgaccttc 

aaaccctgcatttgaaccgaccaacattaagtccagagagtaaacttgaatggaataacgac 

attccagaagttaatcatttgaattctgaacactggagaaaaaccgaaaaatggacggggca 

tgaagagactaatcatctggaaaccgatttcagtggcgatggcatgacagagctagagctc 

gggcccagccccaggctgcagccu^ttcgcaggcacccgaaagaacttccccagtatggt 

ggtcctggaaaggacai I 1 I I GAAGATCAACTATATCTTCCTGTGCATTCCGATGGAATTTCA 
GTTCATCAGATGTTCACX^TGGCCACCGC^GAACACCGAAGTAATTCCAGCATAGCGGGGA 
AGATG1TGACCAAGGTGGAGAAGAATCACGAAAAGGAGAAGTCACAGCACCTAGAAGGCAG 
GGCCTCCTCTTCACTCTCCTCTGATTAGATGAAAGTGTTACCTTA^ 

TTTTT AACTrTTTTATTTGT AAACT AATAAAG GTAATCACAGCCACCAACATTCCAAGCTACCC 

TCATCGTTATAATTTACTATCTGCCAAGAGTAGAAAGAAAGGCTGGGGATATTTGGGTTGGC 

TTGGTTTTGAI UN I GCTTGTTTGTTTGTTTTGTACT AAAACAGTATTA t C I 1 1 I GAATATCGT 

AGGGACATAAGTATATACATGTTATCCAATCAAGATGGCTATAATGGGCTTTCTCAGAGATAA 

AACTTG ACCCCCX3TGTCAAATTGACATCACACT CTG CATGTCT G CGT AATGAA GGT ACGATG 

CAACTATAACCAGTGCAATATGACACTGACACTATATTAAATTCAATAATACNN 

>459 

accatgggtggttagtagaccctcagattgatgacattgtaaaagctgttggtaact 
gcagctacaaccaactagtgg^gaagatcatctcttgtaaw;agtcagacaatagtgagctg 
gttagtgaaggctttgtagctgagcagtttctaaataacacagccactcaactgacataccat 
ggattatgtgaactaacttc^acggttcaggaaggagaactttgtgtg i ici 1 1 cggaataat 

cattttagcaccatgaccaaatacaagggtcaactgt ATTTGTTGGTAACGGACCAGGGGTT 
TCTT ACTGAAGAGAAAGTTGTTTGG GAAAGCCT ACACAACGT AGATGGTGATG GAAATTTCT 
GTGACTCAGAATTTCATCTTCGACCTCCTTCAGATCXrrGAAACTGTATACAAAGGACAACA^ 
ATCAGATAGATCAGGA7TATCTTATGGCATTATCTCTACAACAAGAACAGCAGAGCCAAGAG 
ATCAATTGGGAACAAATCCCGGAAGGAATCAGTGArrTGGAACTAGCAAAGAAACTCCAAGA 
GGAAGAGGACAGACGGGCTTCTCAATACTATCAGGAACAGGAACAAGCAGCAGCTGCTGCT 
GCT GCTGCTTCT ACACAGGCTCAGCAGGGCCAGCCAGCACAAGCCT CTCCATCAAGTGGAA 
GACAATCTGGGAAT AGTGAACGT AAACG GAAGG AACCACGAGAAAAAGATAAAG AAAAAGAA 
AAGGAAAAAAATAGCTGTGTTATTTTGTAACAAGTC 

TTTCCTAATGGATTTTGTTCG I H it I (^GGGGAACGGTTQTTACTTAGTTACAATCAGACTTT 

TTCAAGTCACACAATACA CICI 1 1 ATGAGCTGGAGTTTCATGTTACAAGTTGGAAATGCTGTG 

TGTTGTCATTCATGAAAAATACTGCACTTGTAGCCAGATTAGCAAA 

TCATAGTGACATTCATAACTCATATCAGTTAGTAAGCTATTATATCTTCTGTTCTAACA^ 

TGGAGGTAATTGATTTAGTCTGATTCCTTCCTGAAATCTAAATATTAGCACAATAGTTTCT 

ATTTT ACAATG TTAAATTATGAT CTT AATTCATGAGAAACCACGGGTTT AACATAGGGATTCAAA 

AAAACAAAAAC 

>460 

TGGCGGCCGAGGTACGAATGTGCAAAATTAAAGCATGGTAAACTGATATTTACATAA 
ATATCAAACCAACAATTAGTTTATACATTGTCAATGACCTTCT AAGATATGTCATGAGTGGATC 
CAAG AATATCTTrCCCCCAATGGAGAAGGT ATTGAGAGGCTAAATTCCGACyWrrrT AAAATGA 
CACACATCATAGGCTTTACCTGTTrGACCACTGCCTCAAATGTGTGAGATGTGATTT^ 
.. GCCGCGTNNNI4NNNNNNNNKW 
GAAACATGATTCCCCTCAAGCCACAAAGGATTTTCTCATCAAGTC 
ATTTGGACACAAGAAGAGGAGAGCATTTACTCAGGTAAAAATAGTTCTCTTA 
TAGTTACTAATTTTTAATTTAAAAATACAATTAACTGATCTAGCTGATAAAAGTCACAGACAGA

AATAAGCTAAGTTCTCTCrrCCTTTAGGGAACGCTGGTGGCAATTCACCATATAAACTGGATG 

GAAGAANNNNNNNN 

>461 

NCGATTCTGTGTTTCTGCTGAAGTGTAGCTTGGTCTTTACACTTGTTrATGGTTATCT 
GTTGTG^GAGACCCTGCCCTTCCGTAAAAGTGTAAGACTTCCGCGAATTGGTCACCGAGA 
ATCTTGCCGAGAACTTGAGAGTCCAGGGAAATTCCGGGCCGCTGTTCAAGATGTCGGACGA 
GT AG AT ATTCT CGGCCCCGGG G AGAAGGTG AACCG G GCCT CT AAAG G CG CTTGCCAG TGC 
AATCTGGGCGATCGCTTCTGGTCCTCGCCTCCTCCGCTGTCTCCCTGGAGTTCTTGCAAGTC 
GGCCAGGATGTCTCAGGCTGAGTTTGAGAAAGCTGCAGAGGAGGTTAGGCACCTTAAGACC 
AAGCCATCGGATGAGGAGATGCTGTTCATCTATGGCCACTACAAACAAGCAACTGTGGGCG 
ACATAAATACAGAACGGCCCGGGATGTTGGACTTCACGGGCAAGGCCAAGTGGGATGCCTG 
GAATGAGCTGAAAGGGACTTCCAAGGAAGATGCCATGAAAGCTTACATCAACAAAGTAGAAG 
AGCTAAAGAAAAAATACGGGATATGAGAGACTGGATTTGGTTACTGTG(X^TGTGTTTATCCT 

aaactgag acaatgccttgtttttttct aataccgtggatggtgggaattcg ggaaaataacc 
agttaaaccagctactcaaggct gctcaccatacxigctctaacagattaggggct aaaacga 
ttactgactttccttgagtagn i ttatctgaaatcaattaaaagtgtatttgttactttaaaaa 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaacaaaaaaaaaaaacaaaaaaaaaagaa 
GGAGACAAACAAGGG gaacg aagtag aaaaag agag aaaaagcaaaagg aaaagaagaaag 

AGAGATACGAGGCAGGATAACAGAGACAGCGCAGAAGAGGAGGAGAAGAAAGAGAGAGAA 

GAGAAGGAACAGACGAGAGAG gagacaggaaagaagagaaggaagaggggagaggagaaa 

aaagagagagacagagacgcggagagaaaagagagaagagagaaggaagagaaaaggcga 

ggaaaaagaagaacagaaagagaaaagtagagagagagagagataccgaacgaggaggag 

caaaagaggaaagaaagaggacgaaaagaagagaggaaagcacaacgaggcaagaaaaaga 

gagaagagcgagggagaagagaaacataagaaggagagaagaaagagcgaacgaacgaaa 

gaggatgagaan 



gagcagacaggtcaggtggaggagtgcctcaaggtcaacctgctcaaagatcaaaacagaa 
ttgtggaaaaaggaggtgctaaacatgctgaaggaaagcaaagcagacatctttgttgacgc 
ggtacttcatactgcttgtgccctggacattaaacacx^ 

gcgggcgtcaaatgtcctgtctcatggaagcactggaggataagcgggtgaggttagagcc 
cgagtgcaaaaagcgcctcaatgaccggattgagatgtggagttacgcagcaaaggtggcc 
ccagcagatggcttctctgatctt gccat gcaagt aatgacgtctccatct aag aact acatt 
ctctctgtgatcagtgggagcatctgtatattgttcctgattggcctgatgtgtggacggat 
caccaagcgagtgacacgagagctcaaggacaggctacaatacaggtcagagacaatggct 

T AT AAAGGTTTAGTGTGGTCTCAGGATGTGACAGG CAG TCCAG CCTGACC > 1 I C IGCACACT 
GCAGACAAACI I UCCAGACAAGCTCCTTTGTG CCTCTACGTGGAGAGGGTGTGGAAAGTTAT 

cacattaaaagatggaggatttacaaaagagaaaaaaaaaaaaaaaaannaaaggggccccc 
cccgggggggaggggggggccctctactatttta 
ctaggggggaaaaaaaaaaanggggggccgttaaactttttta 

TTOOCCTTGAACCTGAAACNflttfMN 
>463 
>464 

NNGTGC A rGTGTTTCCTI CCCCAAGAGAGCTATAAACAGCAAAAATTCTGCTGTAAG 
CGACCCCCGACCCAGGTCTCCCCCAGCACTGAGGAGCTCG 
GGGAAGCAGCACCAAGTTCACGGCCAACGCCTTGGC^CTAGGGTCCAGAA 
GTCXXTGATGGTTGCCGCAATGGCCTGAAATCCAAGT ACTACAGACTTTGTGATAAGGCTGA 
AGCTTGGGGCATtXsTCCTAGAAACGGTGGCCACAGCCGGGGTTGTGACCTCGGTGGC 
CATGCTCACTCTCCCGATCCTCGTCTGCAAG<rrGCAGGACTCCAACAG 
CCTACTCAGTTTCTCTrCCTOCTGGGTGTGTTGGGGAT 

ATCTGC I ICICCI GCCTGCTGGCT CATGCTGTCAGTCTrGACCAAGCTCGTCCGGGGGAGGA 
AGCCCCI 1 1 CCCTGTTGGTGATTCTGGGTCTGGOCGTGGGCrrrCAGCCT AGTCCAGGATGT 
TATCGCTATTGAATATATTGTCCTGACCATGAATAGGACCAACGTCAA 
rrCCGCTCCTCGTCGCAATGAAGACTTTGTCCTCCTGCTCACCT 
cgctgaccrrcctcatgtccrccttcaccttctgtggttccttcacgggctggaagagacat
ggggcccacatctacctcacgatgctcctctccattgocatctgggtggcctggatcaccc 
tgctcatgcttcctgacttrgaocgcaggtgggatgacaccatcctcaggtccgccttggc 
tgccaatggctgggtgttcctgttggcttatg1tagtcccgagtt1tggctgctcacaaagc 
aa cg aaac^cccatggattatcctgttgaggatgctitctgtaaacctcaactcgtgaagaag 
agctatggtgtggagaacagagcctactctcaagaggaaatcactcaaggttttgaagagac 
aggggacacgctctatgccccctattccacacattrrcagctgcagaaccagcctccccaaa 
aggaattctccatcccacgggcccacgcrrggccgagcccrracaaagactatgaagtaaa 

GAAAGAGGGCAGCTAACTCTGTCCTGAAGAGTGGGACAAATGCAGCCGGGCGGCAGATCT 
AGCGGGAGCTCAAAGGGATGTGGGCGAAATCTTGAGTCTTCTGAGAAAACTGTACAAGACA 
CTACGGGAACAGTTTGCCTCCCT CCCAGCCTCAACCAC AATTCTT CCATGCTGGGGCT gatg 
tgggct agtaagactccagttctt AG AGGCGCTGTAGTA 1 1 M m II III i gtctcatccttt 

GG AT ACTTCTTTT AAG TGG GAGTCT GAGGCAACTCAAGTTT AGACXXTTT ACTCTTTTTGTTTG 

rrm igaaacaggatcttgctctgtcacccaggcttgagtgcagtggtgcgatcacagccc 

AGTGCAGCCTCGACCACCTGTGCTCAAGCAATCCTCCCATCTCCATCTCCCAAAGTGCTGG 
GAT G ACAG G CGT GAG CCACAG CTCCCAGCCT AGG CCCTT AATCTTG CT GTT ATTTTCCATG G 
ACTAAAGGTCTGGTCATCTGAGGTCACGCTGGCTCACACAGGTCTAGGGGCCTGCTCCTCT 
AACTCACAGTGGGTTrTGTGAGGCTCTGTGGCCCAGAGGAGACGTGCATATCTG^GCAAAA 
ATAGCAAAAGCCTCTCTCAGCCCaCT GGCCTGAATCTT ACACTGGAAGCCAACTTGCTGGCaC 
CXCCGCTCCCCAACCCTTCTTGCXTrGGGTAGGAGAGGCT AAAGATCACCCTAAATTTACTCA 
TCTCTCTAGTGCTGCCTCACATTGGGCCTCAGCAGCTCCCCAGCACCAATTC^CA 
(XTCTCTTCTTGCACTCTCCCCAAACTTGCTGTCAATTCC 

CT GCCAGGAATTCTTTCAGACCTCACT AGCACAAGCCCGGTTGCTCXjrTGTCAGGAGAATTT 
GTAGATCATTCTCACTTCAAATTCCTGGGGCTGATACTTCTCTCAT 

GTAAATAGAT^ACCGCATTTATCGCTGCATTCTGTAAGTGGGCATGGTCTCCTAATGGAGG 
AGTGTTCATTGTATAATAAGTTATTCACCTGAGTATGCAATAAAGATGTGGTGGCCACTCTTT 

GCCCTGAGAGCTTCATGTTGG AGCTGAAGTTCAAGGTTCA C I ICC 1 1 1 GGGTTTGT ACTTGA 
CC CI I' d I CATGTGTCTCTCCCGTTCCCTCTAAAACAAGTGTGTTTCCCCTC AI 1 1 1 IG AGGC 
TGTCAATGGTGTGAGAGCGAGGATCATCACGGGGCGTGAGGTTTTACTCCAGAAAAGCAGA 
GGAGTGGCAACCTTGGCTTGGGGTrTGGCAGCCGAGGAAAGGCAGGGAGGAGAGCTCAAA 
GCCGGTTTCATGTTTCACCCAAGGTCT AATTGTGGGAGAGGACAAATCX1AGATCCCCTGTTT 

gacagaattagttcacaaatgtctcrrtggcaaaaacatgtgacacct 
ttaatccaagaaagagctctgtagggcagagcaataggaaatc 1cici 1 1 cgttatggaaaa 
aaaataatccctctacatagaaac tgagtg acatgtaaaaatgtgtagctaagtcagggagtt 

cgagggaggtggaggtgagtggatgatgtgctgagttagggatgtttatatgccgcaaagg 
tttggttggtaggcttcgcgctaagattagcctggctttgtactgaaa^ 
catagtgacacagatcctrctggtctttcccaggaggtgtgacctaatgagg i i igi igtga 
gaataaagcaatgtggtttcacatatattatctatcataggctagggactcaagaaatgcata 

^ OCCX3CCCCCA C 1 1 3 c _ n TCTCTCTT11 SSUSS - ccnrT0 C i Ai c 1 1 CCT T£? T T U I EES ' _ 1 °. 

CI I I C I I IC I I IGAGAGATAGAATCTCACTCTGN 
>465 

magtgcggagcgccgaggctcgcgtt CCGCCGCTGCTTCCGTCGCCGGGGCGGG 
ggccgcggtgtccctgccgagcg tgagcc ccgaactgtcagcgcgccgagcc 

ATCCGTOGGl GTCCGCGGTCCGAGCn I CCCCGCCTCGCTGG 
1TAGCTAJ 

GCGACGGCGGCCTTGAGT CTCGCT CCCCCGGT GGGAAACGT AGAACTGGTT ACT CGCCAC 
CTT AGGCGGTGGGGTTTGAATGAGAGCGGTGCCTCGGT AGGTCGTCAGAAAGTT ACAGTCG 
CCCATCTGAGCGTTTGTAAGGGAOGOGGGCGCCAGGAOOGGCGGAACGC 
TTCACCACACTGAAACCATTAGGAAAAATCCTT GTGGTTAACAGCAGAGGCTTCAGAGTGTA
ACCTGTACTCGGGCCTAGAAATTATTTAAAATG^CGACTGATACGTCT 
CCATCCTAAGGCACTCCCACTTATAGTAGGAGCTCAGCTGATCCACGCGGA^ 
GAGAAGGTAGAAGATAGCACCATGCCGATTCGTCGAACTGTGAAT^ 
CT CCCAAAAG CAAGCTTGCTGAAGGGGAGGAAGAAAAGCCAGAACCAGACATAAGTTCAGA
GGAATCTGTCTCCACTGTAGAAGAACAAGAGAATGAAACTCCACCTGCTACTTCGAGTGAGG 
CAGAGCAGCCAAAGGGGGAACCTGAGAATGAAGAGAAGGAAGAAAATAAGTCTTCTGAGGA 
ATGAGAAAGATCAGTCTAAAGAAAAGG AGAAGAAAGTGAAAAAAACAATTC 
rrACCCTTTCTGCCAGCCAGCTAGCCAGGGCCCAGAAACAAACACCGATGGC 
TTCTTCGOCACGTDCCAAGATGGATGCAATCTTAACTGAGGCCATTAAGGCATGCTTCCAGA 
AGAGTGGTGCATCAGTGGTTGCTATTCGAAAATACATCATCCATAAGTATCCTTCTCTGGAG 
CTGGAGAGAAGGGGTTATCTCCTT AAACAAGCACTGAAAAGAGAATTAAATAGAGGAGTCAT 

caaacaggttaaaggaaaaggtgcttctggaagttttgttgtggttcagaaatcaagaaaaa 
cacctcagaaatcx^gaaacagaaagaataggagctctgcagtggatc^ 

AAATTGGAGGATGTCCTCCOACTGGCXrrTTACTCGCCTTTGTGAACCTAAAGAAGCT^ 
CAGTCTCATCAGGAAATATGTGTCTCAGTATTATCCTAAGCTTAGAGTGGACATCAGGCCTC 

agctgttgaagaacgctctgcagagagcagtagagaggggccagtt agaacagat aact gg 
caaaggtgcttcggggacattccagctgaagaaatcaggggagaaacccctgcttggtgga 
agcctgatggaatatgcaatcttgtctgccattgctgccatgaatgagccgaagacctgctc 
taccactgctctgaagaagtatgtcctagagaatcacccaggaaccaattctaact atgaaat 
gcatttgctgaaaaaaaccctgcagaaatgcgaaaagaatgggtggatggaacagatctctg 

TATTATCCCAGCCCAGGAGTTCTG

atgattctagagatgaggatgaagatgaagatgagtcatcaga 

AGAAGACTCTGAGGATGAAGAGCCGCCAGCTAAGAGAAGGTTGCAGAAGAAAACCCCAGCC 
AAGTCCCCAGGGAAGGCCGCATCTGTGAAGCAGAGAGGGTCCAAACCTGCACCTAAAGTCT 
CAGCTX5CCGAGCGGGGGAAAGCTAGGCCCTTGCCT AAGAAAGCACCTCCT AAGGCCAAAAC 
GCCTGCCAAGAAGACCAGACCCTCATCCACAGTCATCAAGAAACCTAGTGGTGGCTCCTCA 
AAGAAGCCTGCAACCAGTGGAAGAAAGGA AGT A AAATTG CCG GGCAAGGGCAAATCCACCA 

CAAAATCTT ATTTTCT AAGCACTTTTGATATCAAGCAAGTGGC1 ICC 1 1 1 1 1 GAGATATTAAAA 

CNNACN 

>466

>467

>468 

>469 

>470 

NCTGGGCTCAAAGCAGCCACTCATTATACCATGACCA7GCGCGGGGTCACTCAGGA 
CTTCAGCAGAACOCCTCTCTCTGTrGAAGTCTTGACATGAGGATCTCCCAGAGCTGGGAGAT 
TTAGCCGTGTCTGAGGTTGGCTGGGATGGCCTGAGACTCAACTGGACCGCAGCTGACAATG 
CCT ATGAGCACTTTGTCATTCAGGTGCAGGAGGTCAACAAAGTGGAGGCAGCCCAGAACCT 
CACGTTGCCTGGCAGCCTCAGGGCTGTGGACATCCGGGGOCTCGAGGCTGCCACGOCTT a 
TAGAGTCTCCATCTATGGGGTGATCCGGGGCTATAGAACAOCAGTACTCTCTGCTGAGGCC 
TCCACWSCCAAAGAACCTGAAATTGGAAACTTAAATGTTTCTGACATAACTC^ 
AATCTC7GCTGGATGGCTACCGATGGGATCTTCGAGACCTTT AOCATTGAAATTATTGATTCC 
AATAGGTTGCTGGAGACTGTGGAATATAATATCTCTGGTGCTGAACGAACTGCCCATATCTC 
AGGGCTACCCCCTAGTACTGATTTTATTGTCTAOCTCTCTGGACTTGCTCCCAGCATCCGGA 
CCAAAAXX^TCAGTGCCACAGGCACGACAGAAGCCGAACCGG AAGTTGACAACCTT CTGGT 
TTCAGATGCCACCCCAGACGGTTTCCGTCTGTCCTGGACAGCTGATGAAGGGGTCTTCGAC 
AATTTTGTTCTCAAAATCAGAGATACCAAAAAGCAGTCTGAGCCACTGGAAATAACCCT ACTT 
GCCCCCGAACGTACCAGGGACTT AACAGGTCTCAGAGAGGCTACTGAATACGAAATTGAAC 

tctatggaataagcaaaggaaggcgatcxx>gacagtcagtgctatagcaacaacagccat 
gggctccccaaaggaagtcattttctcagacatcactgaaaattcggctactgtgagctg^ 
gggcaoccacagcccaagtggagagcttccggattacctatgtgcccattacaggaggt ac 
accctocatggtaactgtggacggaaccaagactcagaccaggctggtgaaactcatacct 
ggcgtggagtaccttgtcagcatcattx3ccatgaagggctttqaggaaagtgaagctgtcn 

>471

NNNATATCTATGTCTGGTGAGGGACTGCTTACTGGCTTATAGATGGTGCCTTCTCAC
TGT G TCCT C ACATGG TGTTAGAAGT GA GCAAG TTACCTTGGACCTATT^ 
TCCCAT I TGTGAGGG^I 1 1 iCCCTGAAGi 1 1 1 AGTGAACTCCTGAAAGGCCCCATU I ICI AAT 
ATCATCACCTGGGGCGTTAGGATTTCAACATATGAATTITGGGAAGACAGAAAGATTCAGAC 
cacagcctacagggagagaggatttctgaggatggtggtgcactgtgagtccacgcaggc

ctcctgggcataggatggagcaattctatctcagctcaggcctagcacaaagggcttcagta 

aaccactggagtttccttcattaggattccatcccaggatatccagaggacaagaggctggc 

caactgcaggattagcctatgctcccgtgctggatataggctacacgcaagagaaagcttg 

ggtgggatctcctgatcgcgcgt 

>472 
>473 

ACCGCGGTGGC«GAC»AGGTACAAAATAATTATAATGTATTAACTCATACTGCCTGT 
CTTTTATAGGGGAAAAAAATAAC C I UN I ATTTTAAAGTTATAAGGTGGGTTACCTTTT AGTT 
GCTTGGATGACAGGGAATTAGCCTACCCCATTTTGGTCTGGAACAGAAGACTTrCAAATTTA 
ATATGGCCCAAGTGTCTTCCT ACTTAAGTGCAAGATCATGCT ATGTCAGTTACCCAAGCTGG 
AATACCGTGaCaCGATCGTGGCTCGCTACAGCCTCCATGTCCCAGGCTCGAGCAGTTCTCC 
CACCTCAACCTTCCGAGTAGCCGGAACCACAGAACCACAGTCTrCTCATTTTGAAAAGACAT 
GUI 1 1 1 ICI I AAAGCAACAAAGGTGGTAGAGGAAATTTCTTAAACTTTCTCAACGAN 
>474 
>475 

AGGTACATGGGACTACGTCCTTTAAACTACCCAGGAGCAAAGTTGGACTTCCATGAA 
AAAGGTTTGTGCAGATCTAAAACCTCTCATCTATTTCACTAAAAGGACTTCCCCTTCTAACT 
TCACCCTTACCAATGTAACCCTGTCCTCTCTATCACCACCTOCACCTGTCTTT AACT AACTCT 
AGACCTGTCCTTAATTGCTTTTATG GTATTGGAAT AGGAATT AAAAGAAATT AAAGAATGTGTA 



TOCTTT AAAAATTAACT GCCTG I I I I I C 
GACATTGTGAAGACCCTGTTTCTCTAGCTGTGCAGCTGCAAGGTCACTAGACAGATAAACT 
AAGTTGT AAAACA TG 1 1 III CC I I GAAAAGTAAGAAATGCTGTAATGCATTTCTCAATTAATTA 

TACAAAATGCTTAAAAGGTAACTT AACTCTTTGTTCTGGGCTCAAGTCCTGTCCGGTCAGCAG 

CCGCCGCGGCCACGTCCACGCGTCNN 

>476 

CGACCCCGCGTCCGAGATAGCC 1 1 1 1 GGAGTCGAAGATGAGGAAAAGCCTGTATTT 
TATAGTCTTGGAAGTG ICI ICI 1 1 1 GOCAGGACAGAGAGAGGAGCTTCAGCAGTGAGAGCAA 
CTGAAGGGGTTAATAGTGGAACTTG GCTGGGTGTCTGTTAAA C 1 1 1 1 1 1 CCCTGGCTCTGCC 
CTGGGTTTCCXXTTGAAGGGATTTCCCTCCGCCTCTC 

AGTGGTACGCCTCCGCGGTATCTGCATCGGGCCTCACTGGCTTCAGGAGCTGAATA^ 
CCAGGCACACACAGGTGGGACACAAATAAGGGTTTTGGAACCACTATTTTCTCATGACGACA 
GCAACTTAAAATGCCTGG GAAGA TGGTCGTGATCCTTGGAGCXrrCAAATATACTTTG 
TGTTTGCAGCTTCTCAAGCTTTTAAAATCGAGACCACCCCAGAATCTAGATATCT^

ttggtgactccgtctcattgacttgcagcaccacaggctgtgagtcccca i rrrrcTCTTGG 
agaacccagat agat agt ccact g aa tg g gaaggt g acgaatg aggggaccacatct acgc 
tgacaatgaatcctgttagttttgggaacgaacactctt acctgtgcacagcaacttgtgaat 
ct aggaaattggaaaaag gaatccag gtg gagatct actcttttcct aaggatccagagatt 
catttgagtggccctctggagcctgggaagccgatcac^ 
tatacccatttgacaggctggagatagacttactgaaaggagatcatctca 
gaatttctggaggatx*cagacaggaagtccctggaaaccaagagtt^ AC 

TCCTGTCATTGAGGATATTGGAAAAGTTCTTGI 1 1 UCCGAGCT AAATTACACATTGATGAAAT 
ggattctgtgcccacagtaaggcagg CT GTAAAAGAATTGCAAGTCT ACATATCACCCAAGA 
ATACAGTTATTTCTGTGAATCCATCC^CAAAGCTGCAAGAAGGTGGCTCT GTGACCATGACC 

tgttccagcgagggtctaccagctccagag a unci u gagtaagaaattagataatqqgaa 
tctacagcacctrtctggaaatgcaactct cacctt aattgct atgaggatggaagattctgg 
aatttatgtgtgtgaaggagttaatttgattgggaaaaacagaaaagaggtggaattaattgt 

TCAAGAGAAACCATTT ACTGTTG AGATCT CCCCT GGACCCCGGATTGCT GCTCAGATTGGAG 
ACTCAGTC^TGTTGACATGTAGTGTCATGGGCTGTGAATCCCCA I CTTTCTOC IGGAGAACC 
CAGAT AGACAGCCCTCTGAGCCGGAAGGT GAGGAGTGAGGGGACGA^TTCCACGCTGACC
CTCAGOCCTGTGAfiTTrTGAGAACGAACACTCT^ 

GAAACTGGAAAAGGGAATCCAGGTGGAGCTCT ACTCATTCCCTAGAGATCCAGAAATCGAG 
ATGAGTGGTGGCCTCXiTGAATGGGAGCTCTGTCACTGT AAGCf GCAAGGTTCCT AGCGTGT 
ACCCCCTTGACCGGCTGGAGATTGAATTACTTAAGGGGGAGACTATTCTGGAGAATATAGAG 
TTTTTGGAGGATACGGATATGAAATCTCTAGAGAACAAAAGTTTGGAAATGAra

ACCATT GAAGAT ACTGGAAAAGCTCTTGTTTGTCAGG CT AAG TT ACATATTG ATGACATGGAA 

TTGGAACCCAAACAAAGGGAGAGTACGCAAACACTTTATGTCAATGTTGCCCC^ 

AACCGTCTTGGTGAGCCCTTCCTCCATCCTGGAGGAAGGCAGTTCrrc 

TGAGCCAGGGCTTTCXTTGCTCCGAAAATCCTG TGGAG CAGGCAGCTCCCT AACGG GGAGCT 
ACAGCCT CTTTCTGAGAATGCAACTCTCACCTT AATTTCT ACAAAAATG GAAGATT CT G GG GT 
TT ATTT ATG TGAAGGAATT AACCAGGCT GGAAGAAG CAG AAAGGAAG TGGAATT AATT ATCC 
AAGTTACTCCAAAAGACATAAAACTTACAGCTTTTCCTTCTGAG 

CTGTCATCATCTCTrGTACATGTGGAAATGTTCCAGAAACATGGATAATCCTGAAGAAAAAAG 
CGGAGACAGGAGACACAGTACTAAAATCTATAGATGGCGCCTATACCATCCGAAAGGCCCA 
GTTGAAGGATGCGGGAGTATATGAATGTGAATCTAAAAACAAAGTT GGCTCACAATT AAGAA 
GTTTAACACTTGATGTTCAAGGAAGAGAAAAC^ACAAAGACTATTT^ 
TGCTCTATTTTCCATCCTCCTTAATAATACCTGOCATTGGAATGATAATT^ 
AGGCAAC^TGAAGGGGTCATATAGTCTTGTAGAAGCACAGAAATCAAAAGTGTAGCTAATGC 
TTGATATGTTCAACTGGAGACACT ATTT ATCTGTGCAAATCCTTGAT ACT GCTCATCATTCCTT 
GAGAAAAACAATGAGXrTGAGAGGCAGACTTCCCTGAATGTATTGAA 

CATCT ATGTCCCTTGCT G TGAGCAAGAAGTCAAAGT AAAACTTGCTGCCT GAAGAACAGT AA 
CTGCCATCAAGATGAGAGAACTGGAGGAGTTCCTTGATCTGTATATACAATAACATAATTTGT 
ACATATGTAAAATAAAATTATGCGATAGCAAGATTGCTTAAAATAGCAACACTCT ATATTTAGA 
TTGTTAAAATAACTAGTGTTGCTTGGACTATTATAATTTAATGCATGTTAGGAAAATTTCAC^ 
TAATATTTGCTGACAGCTGACCTTTGTCATC IllCn CTATTTTATTCCCriTTCACAAAATTTTA 
TTCCT AT AT AGTTT ATTGACAAT AATTTCAGGTTTTGTAAAGATGCCGGGT^ ATATTTTT ATA 
GACAAATAATAAGCAAAfiGGAGCACTGGGTTGACTITCAGGT^ 

GT ATAATGGTTGACTGGGTTTCTCTGT ATAGTACTGGCATGGT ACGGAGATGTTTCACGAAG 
T TTGTT CATCA GACTC CT GTGCAAC TTTCCCAATGTGGCCTAAAA^ I I C I I I I I ATTT 

TCTTTTGTAAATGTTTAGG 1 1 1 1 1 M G I ATAGT AAAGTGATAATTTCTGGAA7TAAAAAAAAAAA 
AAAAAAAAAAAAACCACTACAGGAGAAAACACCCCAAAAATAAGACAAAACAACAGAGGAAG 
CCGAAAGTGACACGCAACACACAACAGCCATATOM^AAAGA^ 

CAACCATAAATAACACCACAGCCCAGCCACATCAGATAAAGCAGCCAGATAACGAACAACCC 

atcaacatgacaatgcgaccaaacgaccacxatgcx3ggagaaacag 

agaattgcaaccagcaaacaacaagccaccgacacgac^^ 

aagoccagacctcgaaacwi 

>477

NNGACTOACTrGTAGCTGGAAGAAATCATCGGATTTTTA^ 
AAATTTGAAATGCCTTCCATGTG CCAAGCACT GTGTCAGGT GGGAGATGACAGCTTGGTGAA 
AOCTC TGT CAGG CT G I CTT 0C1 I XiGCTTTCTCTATOCCTGG G 1 1 1 CCOCCTGOCTAAAAAGG 
AT TTrGTGCTTOG TGGCTTGTCCAGGCAAGCAGGCCGTCGCGGGACCT AGACCGAGACAGT 
G^GTCTCTCTTTJCTCOOGGGCCTCCCTTCrGTTTCCT 
GGGGCGAGATTCCCGCXX5CX3GACCCGCACTGCCGAAGCCTG 
OOCAGOGGGGGGGCGCAACOGOCAOGCCTCCTCACCCCTCCCTCC^ 
GACAAC*GCCGCCCC I COCACCrrTGGCGTCACGTTCAAAACAATCXrrTTGACT ACAACTCC 
CAQAAOGCCQAOCGGCTTAGCQAQTGCACCCGCT C I C GGCTGCTCCGGCAAACTACACAT 
CCCAAAGGGCAGCQOCGACCGCGTGT CCTTrCACAGCAAAGTGCGGAACTGCGT. I I G 1 1 I C 
CGGCGTGGGTCCGGGCAAGAACCGCI IGIAGI I IGGI 1 1 AAATTCTGCACGGGAGGACCTT 
CTGAGTTTACCTGT IGGGCTGCTGGCTGCGCAGGGACAGOUSCTACaCAGAAGAGATGGGA 
GAAGAGGCT AATGATGACAAGAAGCCAACCACTAAATTTGAACT AGAGCGAGAAACAGAACT 

tcgctttgaggtggaggcatctcagtcagttcagttggagttgtt GACT GGCATGGCAGAG 
ai c m M jCACAG AGCTGACCOGAAACAAGAAATTCACCTTTGATGCTGGTGCCAAGGTGGC 
To m i CACTTGGCATGGCT G TTCTGT GCAACTQAGCG G OOGCACTGAGGTGGCTT atgtct 
CCAAGGACACTCCTATG 1 1 GUI I IAOCTCAACACTCACACAGCCTTGGAACAGATGCGGAGG 
CAAGCGGAAAAGGAAGAAGAGCGAGG T CXXiCGAGTGATGGT AGTGGGCCCCACTGATGTG 
GGC AAGTC TACAGTGTGTC GC CI FC rS CTCAACTXOGCA GTS O GT I rGGGOOGCCGTCCCA 
CTTATGTGGAGCTGGATGTGGGGCAGGGI rC T GT CT CCATCCCTGGTAOCATGGGOGOOCT 
CTACATGQAGCGOOCTGC^GATGTCGAAGAGGG 1 1 1 CTCI ATCCAGGOCCCTCTGGTGT AT 
gai 1 1 reg I CC^CCACTOCTGGCACTAACATCAAGCTTTATAATAAGATTAG A I C I OS 1 1 1 A 
GCAGATGTGTTCAACCAAAGGTGTGAGGTGAACCGAAGGG^ 
TTAACACCTGTGGCTGGGTCAAGGGCT CTGGTTACCAGGCT CTGGT GCATGCAGCCTCAGC
TTTTGAGGTGGATGTCGTTGTTGTTCTGGATCAAGAACGACTGTACAATGAACTGAAACGGG 

acctccoccactttgtacgcactgtgctgctccctaaatctgggggtg^ 
caaggacttccgccgggaatgtagggatgagratatccgtgagt ai i i i i atggattccga 
ggctcnttctatccccatgcxrrtcaatgtcaaattttcagatgtgaaaatct acaaagttggg 
ggacccaccatcxx^gactcctgtttacctttgggcatgtct 
gctagtacctgtcactcctgggcgagatatggtgcaccaccrractgagtgttagcactgcc 
gagggtacagaggagaacctgtccgagacaagtgtagctggcttcattgtggtgaccagtg 
tggacctggagcatcaggtgtttactgttctgtctccagcccctcgcccactg 
ttccttctcyvt(^tggatatcxx^ttcatggatct 

tgcctgggacatagagatgatctggccacccctagaggcagatgggctgagataaaagact 
gttggggccacctgaccagtaaactgtggactagtagaaagttcatattctacctctaaaaa 
caggtagtggtaacctgactcttctaatcttc^ccaaaaggaaaaccatgaga 
ggtttctt agaccacctaagatgccactttgaattctctaagacx2ctggagaattgcatttct

TTCACT GTGCTACTATGTGGTTTTTAAAAAATCAATGCTTT AT ATTCCAT ATGTGGTTCTTACC 

CATTTA TCTA GGATGAAAGTGTGAATTAGAGGGACTOnTCCAATAAAGTTCAAAC^ 

AAATCATTTTAATAAATATTTTTGCCATATCAAAAAAAAAAAAAAAGGGCGCCGCTCGCGATC 

GGACGCGTGGGTCGACNN 

>478 

>479 

NNNfrlGG CGTTGTAAACT AATGTGGG CCCCTGGAAAAACTITrrCCCCGGAAGGTTT 
GAAAAGAGGCCACI 1 1 1 1 1 1 1 1 1 1 1 II I I 1 1 I I I I GAGACAGTCTCACTCTATTGCTGAGGCTG 
GAGTGCAGTGGCX3CGATCTTGGCTCACTGCAACCTCTGCCGCCT 
CTGCCTCAACCTTCCCGAGTaGCTGGGATTaCAGGCGCATGCCATC^ 
TGTTATCTTTACTAGAGACGGGGTTTCAGCATCTTGGCTAGGCTGCT 

TCATGAT CCACCCACATT GGCCT CCCAAAG T GCTGGGATT ACAG GTGTGAGCCACCGCGCC 
CGGCCAG CAT AGTATTTTT ATTT AT AACGTGAAAAAAATGAAAACAGAT ACTT CACT 
TTGAACCCAAGTTAGAAAATTTTTATGCAGCCAAACCATGACTTTAAGCTCT 
GGCGGCCAACTGAGATGCAGACCACACCCATGTGGCGTGCAGCAAGCAGGACATGCAGGG 
AGCCTTCXTTTGCTCACAAGGCTTCAGAGCAAGCG^ 

ATCATCATCTGCCAGGGCACGC^GTGCTTCTTCAAACATATCTrTTAGTAATTGCT ATGTCAGA 

TTGTCX^CCGAATATCTTCAAAAAGTTGCTGGTATTTTAGAGCTGGTGTTTTG 

AATAAGC mil CAATGCTTCAGCTAATTC I ICI 1 1 CCGTTTACGAGAGGTGGCACTCATCCC 

CGTAGTAAGAATAGATATGTCCACSGATGCCAGTCCGGGGATCAGTTGCAGACTGCTTCAGA 

GCTTCCCGATGGAGGCGTTTGGCXTCrrrCCACATCAATG GCTT CAACTTT G TT AGACAATCT 

TACTTT AGCATGGGCTTCT GCTAAGCGGATTAATGACTCT AGCTGTCGAGGGT ATGCAGAAA 

CCATTCXXXX5CTACTGCCAATCTTCCTCATGTCTACATAAGCCT ATTGTAGGTAGAAAAACAA 

AAGCAAAAGCAAACCTCCTTGGTGACAACTTTAGTTTTGTGGGTGAA 

TTCCATTAAGTAAACCGAACTCAGATATGGTCCTGGGCCCWiGAG 

>480

GTCAGATCCGAGGACATGTTGACGTT^GTCCGAGAGTCTT AAAATCCTGCTCTGGCC 
GGATTCCAGACTCGTGGGGGAAAGG4TrGGATTTCAACAGATCACA 

GGAGGTGGTTGGGGATTGACGTATGCGCTGGAOCATCTGGAGCCCAAG I I (jC 1 G CAGATCC 
TGTCACCACGAGTGGACACCCTCTGGCCACCATAG 
CAAGGCAGGAAAGGTAAAGAGACAAAGAAAGTGCCCGGGC^ 
GTGTAGACTGTGCAGCTGCCAAAGTGGTGACAAGCAATCCAC^ 

AATGCAAGTt^TGAACTTGAATGTGCC<^TXiAGGCCTGGCATTCTTGT CCAGAGACAGAGTA 
AGGAAGTGTTGGCCACACOCTTAGAAAACAGAAGGGACATGGAGGCAGAAX^ 
AATAAATGAGAAGCAAGAGCCTGAGAATGCTGGAGAAACTGGTCAAGAAGAGGATGATGGT 
TTGCAGAAAATACAC ACAT CTGTCACTAGAACTCCTTCAGTTGTTGAAAGCCAA 
TTAAAAGGAGTGACATTTTCTAGGGAGGTAA I IGIIU1 GGATUTTGGGAATGAATACCCTACA 
CCTCGAAGCT ATACTCXiAGAACATAAAGAGAGGAAATGAAGCTCAAAAAAXX^TAAGAGTGA
AAGAAAATGT AACGTTTGACT AACGTTGAAAGACTGAGGGT ACAAAATCATGTTGAAACAACA
AAACAATGGGGAATTAAGCAAATAAAGATATT ATTTTACC I J 1 G I G CAGAAAGGAGTGAGOCA
TGTGCAAAATTCTGT AAGT aaaatacttagagcttgaat at aatttttt aaaaattgaaatctg 
AGTTCAAGAAATTGATTGTATTGCCCTTAAAACTATCTACTCAAACACTGTTCTGGCATGTGA

ATAAAGTGAl 1 I 1 1 GTTTGTACCCGCCCGGGCGGCGGCTNNN 

>481 

>482 

nnnnnnnnnnnnnnnaactcctagcggacacctcgtggagtccggccggaagagc 
aaccgagatgaaggtgaagatgctgagccggaatccggacaattatgtccgcgaaaccaag 
ttggacttacagagaggttggtgatgacaaaactgt6aa6cagtggaaaatggatgggcca 
ggctatggagacgaggaagagccattacatacaatattaggaaagacagtgtatactgggat 
tgatcatcactggaaagaagct gtttttgccacatgtggacagcaagtagacatttgggatg 
aacaaagaactaatcctatatgttcaatgacctggggatttgacagtataagtagtgttaaat 
ttaacccaattgagacatttctcttgggaagttgtgcatctgacaggaatatagtactgtacg 
atatgaggcaagcta ctcct t tgaaa aaggttatcttagatatgagaacaaataca atctgt t 

TATGGGTGCACTGGACACTCCTGTAATGGTCCATATGGATCATGTATC7GCAGTGCTTGATG 
TGGATTACTCTCCCACTGGGAAGGAGTTTGTGTCTGCTAGTTTCGGTAAATCTATTCGAATCT 
TTCCTGTAGACAAAAGTCGAAGCAGGGAGGTATATCATACAAAGAGAATGCAACATGTTATC 
TGTGTAAAATGGACTTCTGACAGCAAGTATATTATGTGGCACTGAGAGATCCCCTCATAATTT 
CCCCAAAGCGTAACCATGTGTGAATAAATTTTGAGCTAGTAGGGTTGCAGCCACGAGTAAGT 
CTTCCCTTGTTATTGTGTAGCCAGAATGCCGCAAAACTTCCATGCCTAAGCGAACTGTTGAG 
AGTACGTTTCGATTTCTGACTGTGTTAGCCTGGAAGTGCTTGTCCCAACCTTGTTTCTGAGCA 
TGAACGGCCGCAAGCCAACATGTT AGTTGAAG CATCAGG G CG ATT AGCAGCATGAT ATCAAA 
ACXaCTCTGAGCTGCTCGTTCGGCTATGGCGTAGGCCTAGTCCGTAGGCAGGACTTTTCAAG 
TCTCGGAAGGTTTCTTCAATCT GCATTCGCTTCGAATAGATATTAACAAGTTGTTTGGGTGTT 
CGAATTTCAACAGGTAAGTTAGTTGCTAGAACCCATGGCTCCTrTGCCGACGCTGAGTAGAT 
TTTAGGTG ACGGGTGGTGACAATGAGTCCGTGTCGAGCGCTGA 1 1 1 1 1 1 CGGCCTTTAGAGC 
GAGATTTATACAATAGAATTTGGCATGAGATTGGATTGCI I M AGTCAGCCTCTT AT AGCCT A 
AAGTCTITGAGTGACTAGATGACATATCATGTAAGTTGCTGATAGGTTTCCAGTTTTCCGCTC 
CTAGGTCTGCATATTGTA Cl 1 1 1 CCTCTT ACTCGACTT AACCAGT ACCAACCCAGCTTCTCAA 

cggatttataccatggcactttaaagccaggatcactgacaatgagcggtgtggtgttactc 

ggtagaatgctcgcaaggtcggctagaaattggtcatgagcr ricl 1 jgaacattgctctga 

aagcgggaacgctttctcataaagagtaacagaacgaccgtgtagtgcgactgaagctcgc 

aataccataagtcgtttttgctcacgaatatcagaccagtcaacaagtacaatgggcatcgta 

ttgcccgaacagataaagctagcatgccaacggtatacagcgagtcgctctttgtggaggt 

gacgattacctaacaatcggtcgatrggtttgatgttatcttttgttctcgctt^ 

ggttacggccaagttcggtaagagtgagagttttacagtcaagtaatgcgtggcaagccaa 

cgttaagctgttx^gtcgttttaagtgtaattcggggcagaattggtaaagagagtcgtgta 

aaatatcgagttcgcacatcttgttgtctgattattgattttrcgcga 

gacaagatgtgtatctaccttaacttaatgattttgataaaaatcattaggggat^ 

ttatgtgtggatctgatgaaatgaacattcgcctgtggaaagctaatgcttctgaaaaattgg 

gtgtgcttacatcacgagaaaaagcagccaaggattataaccagaaattgaaggagaaattt 

cagcattatcctg^tataaaacgtatagctcgtcatcgac^tctaccaaaatctatictatagc 

cagattcaggaagagcgcatcatgaaagaagctcgtcgacgaaaggaagtgaatcgtattaa 

acacagcaaggctggatctgtgccacttgtgtcagagaagaagaaacacgtagtggcagtt 

gt aaaataattggt attcctaacaatcct gatgtataattatttgtt actttt g atttgagaact 

ct acaaataaaagtgctgggact agajtaahnnnhnnhiwnnwfnhhhnmhnnhnnhh 



GTCCTCAGGTTATTAATGCTGCACTGGCTTTAGCAGCAAAACCACAGAGTAAACTGGCCCAA 
GAGAAGATGGATCTTTTTAAAGAACAATGGGAAAAAGAAGTCCGTGTTCTCACAG^ 
CGATGACATTACTTCCATTGATGACI I CI IGGCTGTrCTCAGAGAATCACATTTTGGAAGATGT 
GAACAAATGTGTCATTGCTCTCCAAGAGAAGGATGTGGATGGCCTGGACCGCACAGCTGGT 
GCAATTCGAGGCCGGGCAGCCCGGGTCATTCACGTAGTCACCTCAGAGATGGACATCGAG
CGgccGCCCGGGCAGGT ACACAAGCTTT ATTGGGCAACAG CAACG AGCCACG CTGGCAAAC 
AATGAAAGTAGAGTCGCTCAGAAACACGAAAGATCATATGTGTGTCATCaCAGCATCGAGAA 
tttaaatcatctggaagttcctgct AAATT AAAGCATACT GTGCCAGAGCTCCCCTCTAATCA 
AAAAACGCTGTCCT GGTGAAAATTTGCAATGaG GATTACAGAGAGAGAG ATGAACCAATG AG 
GAAATCACAGACrCTTACATGAGTTTACAGTTAACCCCACTGCAACAAAATAAT AAATT AGCC 
ATAATTTG I H II I I IGCAAATACCATGCCCCCCACCTGACGCCACAAACACAACAGTCACTG 

acatggccc agctat att aacag actgcg cctcaaaccatg ctgtggcgg agaagac ag at 

tcacgggtaatgcogctttggcctgagaagatgcttcggagctccaggtggaggagctgct 

cacgctgcaggatgctrtgcaggttcattcaaaaccaaatttttttccaagaagaactggaca 

taattcatgggaatattttaagtatatataagtggcaaaaaagctgtacattaaaagaagggg 

gagaaataaaatgtgttgtatatacaaagagctttgtatatttttttaaatgaaaacaaagctg 

ggagtgatagaattttaatgcaatgaaaaccaaataattcttcaaaaaactttagatgacatca 

ITClclogcWTTCACACACACACACACCCCCCACACACTTCAAACATACagaaaagGTrAAAGA 
aaaatAGCTGATAGGTGTttaaOlAGGATGGAGCasAAQAATCGTAr 

>485 

CGTCCGTGCCAAGTCGCTGTTTCCTGGGACAAGAGGGAGCCTCACTGAAACGAACT 

GCCTCAGGACAGACGGGGAAACTGAGGTCCAGAGCCGGACATCCAGCGCCTGCGGAGGGA 

ACGAGAACGCGGCGCGTCCTGCCTTGCGGGCCGAGCGGCGCCAGAGCCGCCTCCTCCCG 

CCCGCCGCGCTAGATCCCCCCGCCCCGTCTTTGCCCTCGCGACGCC»CCACGTCGGGAAC 

AAGCCATGGTG^CGGKXaACGGTGGCAGCGGCGTGGCTGCTCCTGTGGGCTGCGGCCTGC 

GGGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGGTCAACATCCGGGGCAAACTGGTG 

TCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTGAATGTGGCCAGCGAGTGCGGC 

TTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCGAGACCTGGGCCCOCACCAC 

AGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCCATGTTTAGCAAGATTGCA 



CGACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGGTGGTAGGGGCTTGGGA 
CCCAACT GTGTCAGTGGAGGAGGTCAGACCCCAG ATCACAGCGCTCGTG AGG AAGCTCATC 
CTACTGAAGCGAGLAAGACTTATAACCACCGCGTCTCCTCGTCCACCACCTCATCCCGCCCAC 
CTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAGAGACCCACTGACTC 
TCCTTCCTTTACTCTTATGCCATTGGTCCC>T(^TTCTTGTGGGGGAAAAATTCTAGTAT^ 
ATT ATTTGAATCTT ACAG CAACAAAT AGGAACTCCTGG CCAATGAGAGCTCTTGACGAGT GAA 
TCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAATAGAAGTATATCAAGCA 
ATAATCTCCCACCCAAGGCTTCrrGTAAACTGGGACCAATGATTACCTCATAGGGCTGTTGTG 

aggattaggatgaaatacctgtgaaagtgcctaggcagtgccagccaaataggaggcattc 

AATGAACATTTTTTGCATATAAAAGAAAAAATAACTTGTTATCAATAAAAACTTGCATCCAACA 

TGAATTTCCAGCXX^TGAT AATCCAGGCCAAAGGTTT AGTTGTTGTTATTTCCTCTGTATTATT 

TTCTTCATTACAAAAGAAATGCAAGTTCATTGTAAGAATCCAAACAATACCTCAGGATATAAAA 

TAAAAATGAAAGTATCCTCAAAAAAAAAAAAAAAAAAANNNNNN 

>486

iGGCGgoogcCOGGGCAGGTACGCGGGAGTGTGgattoAACAGAAAATTGGAAATCATa 
gtoAAAGGGCTTCCCTTGGTTCGCCACTCATTTATTTGTAACTTGActeggg^ 
ATTT CAAKc T CG T G GT AACAACGCAGAGT AGAAGGAGAGGGTGACTTT ACCGAACTGACAG C 
CATTGGGGAGGCAGATGCGGGTGTGGAGGTGTGGGCTGAAGGTAGTGACTGTTTGATTTTA 
AAAAGTGTGACTGTCAGTTGTATCTGTTGCTTTT CTCAATGATTCAGGGATACAAATGGGCTT 
CTCTCATTCATTAAAAGAAAACGGGAGATCTTTCT AAGATTCTCTGTGGGAAAATGACTGTCA 
ATAAAATGCGGG7TTCTGGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



>487

c^CCCACGCGTCCGGGGGGAGCGGCGCXX^AAGCGGGGCCACATAAAGGAGGGG 

GGAAGCCTAACAGTCGCCCCGACGCTAGTGAGGGACCCAATCTGAGTCCCCGGCCAGCCG
AATCCAAGCCGTGTGTACTGCGTGCTCAGCACTGCCCGACAGTCCTAGCTAAACTTCGCCA 
actccgctgcgtttgccgccaccatgcccaaaacgatcagtgtgcgtgtgaccaccatgga
tgcagagctggagtttgccatccagcccaacaccaccgggaagcagct atttgaccaggtg 
gtgaaaactattggcttgagggaagtttggttctttggtctgcagtaccaggacactaaagg 
tttctccacctg gctg aaactcaataagaaggtgactgcccaggatgtgcggaaggaaagc 
cccctgctctttaagttccgtgccaagttctaccct gagg atgtgtccg aggaattgattca 
ggacatcactcagcgcctgttctttctgcaagtgaaagagggcattctcaatgatgatattt 
actgcccgcctgagaccgctgtgctgctggcctcgtatgctgtccagtctaagtatggcga 
cttcaataaggaagtgcataagtctggctacctggccggagacaagttgctcccgcagaga 



gaggaacaccgtggcatgctcagggaggatgctgtcctggaatatctgaagattgctcaag 
atctggagatgtatggtgtgaactacttcagcatcaagaacaagaaaggctcagagctgtgg 
ctgg gggtgg atg ccct gggtct caacatct atgagcagaatg ACAG act aactcccaaga 

TAGGCTTCCCCTGGAGTGAAATGAGGAACATCTCTTTCAATGATAAGAAATTTGTCATCAAGC 
CCAlTGACAAAAAAGCCCOGGACTTCGTCTrCTATGCTCCCCGGCTGCGGATTAACAAGCG 
GATCTTGGCCTTGTGCATGGGGAACCATGAACrATACATGCGCCGTCGCAAGCCTGATACC 
ATTGAGGTGCAGCAGATGAAGGGACAGGGCCGGGAGGAGAAGCACCAGAAGCAGATGGAG 

cgtgctatgctggaaaatgagaagaagaagcgtgaaatggcagagaaggagaaagagaaga 
ttgaacgggagaaggaggagctgatggagaggctgaagcagatcgaggaacagactaaga 

AGGCTCAGCAAGAACTGGAAGAACAGACCCGTAGGGCTCTGGAACTTGAGCAGGAACGGA 
AGCGTGCCCAGAGCGAGGCTGAAAAGCTGGCCAAGGAGCGTCAAGAAGCTGAAGAGGCCA 
AGGAGGCXrTTGCTGGAGGCCTCCCGGGACCAG^AAAAAGACTCAGGAACAGCTGGCCTTGG 
AAATGGCAGAG CTGA CAGCTCG AATCTCCCA G CTGG AGATG GCCCGA CAGAAGAAGGAGA 
GTGAGGCTGTGGAGTGGCAGCAGAAGGCCCAGATGGTAGAGGAAGACTTGGAGAAGACOC 

gtgctgagctgaagactgcgatgagtacacctcatgtggcagagcctgctgagaatgagca 
ggatgagcaggatgagaatggggcagaggctagtgctgacctacgggctgatgctatggc 



gcacctgaaggccxrrcacttcggagctggccaatgccagagatgagtccaagaagactgcc 
aatgacatg^tcc^tgctgagaacatgcgactgggccgagacaaatacaagaccctgcgcc 
agatccggcag ggcaacaccaagcagcgcattg acgaatttgagtct atgtaatgggcacc 
(>GccTcrTAGGGAcxxxrrccTccc > ih i ccttgtccccacactgctacacctaactcacct 

AACTCATACT GTGCTGGAGCCACT AACTAGAGCAGCCCTGGAGTCATGCCAAGCATTTAATG 
TAGCCATGGGACGAAACCTAGCCCCTTAGCCCCCACCGACTTCCCT 

TGACTATGGTGCCAATGGAAOCTCCTrrCTCTTCTCTGTTCCATrGAATCTGTATGGCTAGAA 
TATCCTACTTCTCCAGCCTAGAGGTACTTTCCAOT 

GTTGTCCTATGGGAGTCAAGTGTrGGAGTTAGGTTGGAAGCT AGCTCCCCTCCTCTCCCCTAC 

CACTGTCTTCTTCAGGGrrCCTGAGATTTACACGGTTGGAGTGTTATGCG^ 

G^CAGGACCTAGG^TATCTTCTCCAGGATGTCAACTGACCTAAAAITTGGGCTCCCATCCCG 

TTTAGAGTTATTTAGGCTTTGTAACGATTGGGGGATAAAAAGATGTTCAGTCA rTTTTOTTTC T 

ACCTCCCAGATCGGATCTGTTGCAAACTCAGCCTCAATAAGGCn GTCGTTGACTTTAGGGA 

CTCAATITCTCCCCAGG«TGGATGXK5GGAAATGGTGCCTTCAAGACCTTC 

AGAAGG GCATTG GCCATTCT ATTGTGGGAAGGCTG^GTAGAAGATCCT ACCCCAATTGCTTG 

TAGGAGTATAGGCCGGTCT AAAGTGAGCTCTATGGGGAGATCTACCCCTT ACTT ATTATTCC 

AGATCTGCAGTCACTTCGTGGGMTCTGCCCCTCCGTGCTTCAATAGCCAAATCCTCTCCAGC 

T ATAACAGTAGGGATGAGTACCCAAAAGCTCAGCCAGGCCCAITV^GGACT CTT GTGAAAAGA 

gaggatatgttcacacctagcgtcagtattttccctgctaggggttttaggtct 

CTCAGAGCTACTT GGGCCAT AG<rrCCTGCTCCACAGCCATCCCAGCCTTGGCATCT AGAGC 
TTGATGCCAGTAGGCTGAACrrAGGGAGTGAGTGCAAAAAGCTGAGTATGGTGAGAGAAG^ 
TGTGCCCTGATCCAAGTTTACTCAACCCTCTCAGGTGACCAAAATCCCCTT CTCATCACTCC 
CCTCCAAAGAGGTGACTGGGCCCTGOCTCT GTTTGACAAACCTCT AACCCAGGTCTT GACAC 
CAGCTGTTCTGTCCtJrTGGAGCTGTAAACCAGAG AGTCCC 

TGCTC1 IUI ICI IIU I ACAGT ATT ATGTACTCTACTGATATCT AAAT ATTGATTTCTGCCTTCCT
TGCTAATGCAGCATTAGAAGATATTAGTCTTGGGGCAGGATGATTTTGGCCTCATTACTT^ AC 
CACCCCX^ACACCTGGAAAGCAT AT ACT AT ATT ACAAAATGACATTTT GCCAAAATT ATT AAT AT 
AAGAAGCTTTCAGTATTAGTTJATGTCATCTGT CACT ATAGGTt^TACAATCCATTCTT AAAGTA 
CTTGTTATTTG 1 1 1 1 1 ATTATTACTGTlTGTCTrCTCCCCAGGGTTCAGTCCTCAAGGGGOCAT 
CCTGTCCCACCATGCAGTGCCCCTAGCTrAGAGCCTCCCTGAATTCCCCCTGGCCACCACC
CCCCTGCCTCTGTTTGACAAACCTCTAACCCAGGTCTTGACACCAGCTGTTCTGTCCCTTGG 
AGCTGTAAACCAGAGAGCTGCTGGGGATTCTGGCCTAGTCCCTT<^CACCCCCACCCCTT 
GCTCTCAACCCAGGAGCATCCACCTCCTTCTCTGTCTCATGTGTGCTCl 1 CM CI 1 1 C f ACAG 
T ATTATGT ACTCT ACTGATATCTAAAAATTGATTTCT GCCTTCCTTGCT AATGCACCATT AGAA 
GATATTAGTCTTGGGGCAGGATGATTTTGGGt^CATTACTTTACCACCCCGACACCTGGAAA 
GCATATACTATATTACAAAATGACATTTTGCCAAAATTATTAATATAAGAAGCTTTCAGTATTA 
GTGATGTCATCTGTCACTATAGGTCATACAATCCATTCTTAAAGTACTTGTTATTTGTTTTTAT 
TATTACTGTTTGTCTTCTCCCGAGGGTTCAGTCCTCAAGGGGCCATCCTGTCCCACCATGCA 
GGTGCCCX^TAGCTTAGAGCCTCCCTCAATTCCCCCrr^^ 

TGACCTTGAGGAGTCTTGTGTGCATTGCTGTGAATTAGCTCACTTGGTGATATGTCCTATATT 

GGCTAAATTGAAACCTGGAATTGTGGGGCAATCTATTAATAGCTGCX JrTAAA GTCAGTAACTT 

ACCCTTAGGGAGGCTGGGGGAAAAGGTTAGATTTTGTATTCAGGGGTTTTTTGTGTACTTTTT 

GGGTTTTTTAAAAATTonTTTooAGGGGTTTATGCTCAATCCATGTTCTATT^ 

>4B6 

NCGTCCGTCTCACCCGAG<TrGTCCGCACTCCAAAGAACTGGGTACTCAACACTGAG 
CAGATCTGTTCTTTGAGCTAAAAACCATGTGCTGTACCAAGAGTTTGCTCCT 
TGTCAGTGCTGCTACTCCACCTCTGCGGCGAATCAGAAGCAAGCAACTTTGACTGCTGTC^ 
GGATACACAGACCGTATTCTTCATCCTAAATTTATTGTGGGCTTCACACGGCAGCTGGCCAA 
TGAAGGCTGTGACATCAATGCTATCATCrrrcACACAAAGAAAAAGTTGTCTGTGTGCGCAAA 
TCCAAAACAGACTTGGGTGAAATATATTGTGCGTCTCUTrCAGTAAAAAAGTCAAGAACATGTA 
AAAACTGTGGCTTTTCTGGAATGGAATTG^ACATAGCCCAAGAACAGAAAGAACCTTGCTGG 
GGTTGGAGGTTTCACTTGCACATCATGGAGGGTTTAGTGCTTATCTAATTTGTGC^TrC^CTG 
GACTTGTCCAATT AATGAAGTTGATTCAT ATTG CATCAT AGTTTG C I ITGTI I AAGCATCAGAT 
TAAAGTTAAACTrGTATTTTATGTT ATTT ATAGCTGTAGGTTTTCTGTGTTTAGCT ATTT AAT ACT 
AATTTTCCATAAGCTATTTTGGTTTAGTGCAAAGTATAAAATTATATTTGGGGGG GAATA AGAT 
T AT ATGGAC F TT C7 1 GCAAG CAACAAG CTATTTTTTAAAAAAAACTATTTAACATTC I T 1 1 GTTT 
ATATTGTTTTGTCTCCT AAATTGTTGTAATTGCATTATAAAATAAGAAAAATATTAATAAGACAA 



NNN^AATGAAGACTAAGAAAGTTGAAAAAAAAAAACTATTTTTGAAAAGTTAAAAAAGGGGG 

GGGCCGrTAAACTGGTTTAAAAAAAAACCTTCCCAACXXTCCCC^GAAC 

NNNNN 

>489 

NAACTGGACCTTGAANCGAATACCGTTAATNATGTCXTTG ^GTCC CTCttCAATAATA 
ACTCCAANCCCAACCAAGAAAGAATGAhrTAAAAATGAATCAAATCCTTTTCCTGGCAAC^ 
ATTTCAATC^CAAAACCCATCTCTGTTT AAATATTGATGGAAAAC>GAACCTGTGAAGTCCGA 
TGCCCACCGTCGAAACCAAAAATTTCATTCTGGAAATTGCAAATGGATACTCTGAOGXSCC^A 
(XXiAAACCTGGTGAAGCCCTTTGGGCGATTGGTGATCACCTCTAGATCC^ 
TGCCCCCCCATCCG GGCAAGCAGGGCCAAGGTGGCATCTT CACATTCCTGGAACCCACCCA 
GTAACAGCAGCAGGT A fTTCTTCI GGTAAATGAGAGCCTTTCGAAAACTTTCTGCCCTCAAGT 
ATTTACCATAAATTCTCTTTAAAGTGACATGTTCAGAATCAGGGCrCA^ 
. AGTCATTTCTTAGTTCAGCTrTCAGTTTGTATAirrTCAGCCT 

GAGAGTCAATTTTTCTCTGTTCCATAC 1 I U I H I CAGAGGCAATGATGGCTTGAATGTTGGC 

ACCACCATTCAATGAAAACCTGGAAGAATTATCTCTACCAGCTCCTGTCT 

GATCTGCTCTTCCAGCTTCATAACCATGTTCCTTAAGTCATTC 

GATATGCCCTGTCAGCTCAGCATITTGTCTCAGTAGTCTTTCAGTCAATGAGCCACAAGAAG 
TACTT GGGGACACCAAGCTGGGCTCTTCACCTTGCT GGCCAGTTAATTTCTGTAGTTGTAAA 
ATAATTTCATCAATATTTTCCTGAACCCAAATGAAATCTTCATCATCTGCTC 
GTCTCTCAGAGGCTTTCTGGGGT AGAACCTGTA G 1 1 1 » GAAGCTACACATTGAAGCTTTTGT 
CrTGATCATT TCTAATTCATGATTACAGCCGGTT CCTCCTCCATTCA T^ 

AGTTTGATTCTTTT GTTT CTCCTTCTATTTTCTGTTGAAGAACCCAATTTCTAGTTCT ATCACTG 
GTTAAGCTCCACGTT3GTTGGCTT^TTAAGGTrCTGAT 1 1 1 1 

TCTTCTCGTTCT AGTTCTT GCTTCTGGAATTCCTGCATGATTCCTTGTAGTCGCTG I CC I ICA 
AGGTCT AACTT ATAAAC I I GUG T CI.I I I C I C I I CCAGCTGGCGCTGAAGATCTTCCACTTTG 
GACTGGAGCTCATGCA I I 1 1 1 1 1 CTGTCCCTCAGTG t T GGCCTCCTGTTCTGTCTG CAGTGTT 
TTCCT GTGAACCTGCCT ATC 1 1 1 1 1 CCATTTGCTGTCGTGTTTGCAAAGAATCCAGTTTATATT 
TTTCAGTCTCATTTAACAATTCTACTATGCCACTGTGTTTTTCCTCT^

TTTCAGT AGGTCCTCTGAGGGCAAGGGTGGCCGAGACTGTCCAGTaCCATCACTGCTCTGC 

AGCTGTGCGTGCAATTCCCGCTCCCTATCTAGGGTACTACTCATTTCCCGAATTCGAACTT^ 

CTCAGATTCAAGAAGTACCTGAAGCTCTAAGTTTCGACCTTGTTCTTCTGAGAACTGGGCATC 

ATATAGCATTCTCTGTGATTCT ATTTTTTGCTGGGATTCGTTCAGT AGTTG 1 1 1 C I GTTGTTCC 

AAAAGTAGATTTAGCTGAAGATTCCTTTGTTTCTGACTCTCAAGTGAAAACTTCAGATCCTCA 

AGTT C I I C I I 1 ATCCCGTTCTrCACTGCGTCCCAACTrGGCTTTCTCTTTCTNCGAAAGCCCA 

CTGGAGCCTCTCTTG 1 1 1 I 1 1 GGGACNN 

>490 

>491 

>492 

>493 

cggggcaggtaggcgggggtggcggcgttgggttgagcggg c i 1 1 1 iggaagttt 
gtggcggagttctgtgatatgagcaacaatggaccagaagattttatctctagcaggagaaa 
aaacagcagacaaactgcaagaatttcttgggcagggcctggggaatgct7ttttatctcat 

ATTAGTGCCTGTGATGGCATCTTTCATCTAACACGTGCTTTTGAAGATGATGATATCACGCA^ 

gttgaaggaagtgt ag at cctatt cgagat at agaaat aat acatg aagag cttcag ctt aaa 
gatgaggaaatgattgggcccattatagataaactagaaaaggtggctgtgagaggaggag 
ataaaaaactaaaacctgaatatgatataatgtgcaaagtaaaatcctgggttatagatcaaaa 

GAAACCTGTI'CGCTTCTATCATGATTtaGAATGACAAAGAGATTGAAGTGrrTGAATAAAGACTT 

ATTTTTGACTrcAAAACCAATGGTCTACTTGGTTAATCTTTCTGAAAAAGACTACAT^ 

AAAAACAAATGGTTGATAAAAATTAAAGAGTGGGTGGACAAGTATGACCCAGGTGCTTTGGT 

CATTCCTTTTAGTGGGGCCTTGGAACTCAAGTTGCAAGAATTGAGTGCTGAGGAGAGACAGA 

AGTATCTGGAAGCGAACATGACACAAAGTGCTTT GCCAAAGATCATTAAGGCTGGGTTTGCA 

GCACTOCAACT AGAATACTrnTCACTGCAGGCCCAGATGAAGTGCGTGCATGGACCATCAG 

GAAAGGGACTAAGGCTCCTCAGGCTGC AGGA AAGATTCACACAGATTrrGAAAAGGGATTCA 

TTATGGCTGAAGTAATGAAATACGAAGATTTTAAAGAGGAAGGTTGTGAAAATGCAGTCAAG 

GCTGCTGGAAAGT ACAGACAACAAGGCAGAAATTATATTGTTGAAGATGGAGATATTATCTTC 

TTCAAATTTAAAAACTCAAAAACOGAAGAAGAACATAAATm 

AACTTCAAAAGGCATCTGATTTrrACACATTAATTTCTGAAACCAATGCGACAAATAAGTCGG 
GACAATGGGAATCrrTGACAACAAATATTTTTTGGTTGACACTCAATATT 

TAAATGGGTTCATAATGTGACCGGTTGGTTCCGTTrTAACCCTCCATTGGAGTTTCGGACCT^ 
ATCCGAAACT ATCCTTGTTCAGCGACAT GAGAGAACGGGCCTG CTCAAAGGTTGCCGTTCAA 
TGAAAAAGGGGANNNN 
>494 

, • ACGCGGGGACTCGCGTCGGTTGGCGACTGCCGGACGTTAGGTAGTTTGTTGG^^GG 
GGTTCTQAOGCCTTG C I I 1 1 1 1 1 1 ACTTTTCCACTCTAGGCCAOGATGCCGCAGTAOGCQGG 
GGGGT GAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGATGGCAGCCACCACGGG 
CTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTT GGAAGAACTCGAAGAAGGCCAGAAA 
GGAGT AGGAGATGGCACAGTTAGCTGGGGTCT AGAAGATGACGAAGACATGAGACTTACAA 
GATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGAAAACCGAATATACAGCCTT 
AAAATAGAATGTGGACCTAAATACCCAGAAGCACCCCCCTTTGTAAGATTTGTAAG 
AATATGAATGGAGTAAAT AGTTCTAATGGAGTGGTGGACCCAAGAGCCATATCAGTGCT AGC 
AAAATGGCAGAATTCATATAGCATCAAAGTTGTCCTGCAAGAGCTTCGGCGCCT AATGATGT 
CTAAAGAAAATATGAAAGTCCCTCAGCCGOCCGAAGGACAGTGTTACAGCAATTAATCAAAA 
AGAAAAACCACAGGOCC 1 1 CCOC ITCCCCCCAATTCGATTT AATCAGTCTTCATTTTCCACAG 
TAGTAAATTTTCTAGATACGTCTTGTAGACCTCAAAGTACGGGGGGAA 
AATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAAGGGCTGCCTAATTTA 
CAGCAACCATGAGTAGAAATGGTGATGATCATCAGGTCAAGGATAGTCTGGAGGAATTGAGA 
TGTCACTTTACATGGGAGTTATCCATTGATGACGATGAAATGCCTGATTTAG^ 
TTGKIATGAGATTGAATTCCTAGAGACCAAATACAGTGTG^QAAT AGACAACCT ACT AGCCT AT 
GTGAAAGACCTGAAAGGCCAGAATGAGGAAGCCCTGAAGAGCTTAAAAG 
T AATGCAGGAAGAACATGACAACCAAGCAAATGTGAGGAGTX7TGGTGACCTGGGGCAACTT 
TGCCTG GATGT ATT ACCACATGGGCAGACTGGCAGAAGCCCAGACTT ACCT GGACAAGGTG 
GAGAAC^TTICCAAGAAGCTTroAAATOCCTTbCGCT ATAGAATGGAGTGTCCAGAAATAGA 
CTGTGAGGAAGGATGGGCCTT GCTGAAGTGTGGAGGA 
TGCTTTGAAAAGGTGCTrGAAG TGGA CCCTGAAAACCCTGAATCCAGCGCTG GGT ATGCG A

TTCCCCTAAGGCAGGCTGTCCGCTTAAATCCAGACAATGGATATATT AAGGTTCTCCTT GCC 
CTGAAGCTTCAGGATGAAGGACAGGAAGCTGAAGGAGAAAAGTACATTGAAGAAGCTCT AG 
CCAACATGTCCTCACAGACCTATGTCTTTCGATATGCAGCCAAGTTTTACCGAAGAAAAGGC 
TCTGTGGATAAAGCTCTTGAGTTATTAAAAAAGGCCTTGCAGGAAACACCCACTTCTGTCTT A 
CTGCATCACCAGATAGGGCrrrrGCTACAAGGCACAAATGATCCAAATCAAGGAGGCTACAAA 

ATTTTGAATCTGGAGTGGAAAAAAAGCCCACATTTGAGGT GGCTCATCT AGACCT GGCAAGA 
ATGTATATAGAAGCAGGCAATCACAGAAAAGCTGAAGAGAA7TTTCAAAAATTGTTATGCATG 
AAACCAGTGGT AGAAG AAACAATGCAAGACAT ACATTTCCACT ATGGTCGGTTTCAGG AATTT 
CAAAAGAAATCTGAOGTCAATGCAATTATCCATTATTT AAAAGCT AT AAAAAT AGAACAGGCA 
TCATTAACAAGGGATAAAAGTATCAATTCTTTGAAGAAATTGGTTTTAAGGAAACTTCGGAGA 
AAG GCATT AGATCTGG AAAGCTTGAGCCT CCTTGGGTTCGTCTACAAATTG GAAG GAAATAT 

gaatgaagccctggagtactatgagcgggccctgagactggctgctgactttgagaactct 

CTGAGACAAGGTCCTTAGGCACGCAGATATCAGCCACTTTCACATTTCATTTCATITTATG^ 
AACATTTACTAATCATCTTTTCTGCTT ACTGTTTTCAG AAACATTATAATTCACT GT AATGATGT 
AATTCTTGAATAATAAATCTGAGAAAATATTAGTTGTGTTCAACAATTAGTGAAACAGAATGTG 
TGTATGCATGTAAGAAA6AGAAATCATTTGTATGAGTGCTATGTAGTAGAGAAAAAATGTTAG 
TTAACTTTGTAGGAAATAAAACATTGGACTTACACGAAAtgtltaj 



NNNNCGCGGCCGCGTCG ACT ACGGCTGCGAGAAGCCCCGGGAAT AGGAGAATAGG 
AGCAAGCCAGCACTAGTCAGCTAACTAAGTGACTCAACCAAGGC C I Mil f CCTTGTTATCTT 
TGCAGATACTTGATTTTCTTAGCGTTrCTGGAGATTACAACATCCTGCGGTTCCG 1 1 fCI GGG 
AACTTTACTGATTTATCTCCCCCCTCACACAAATAAGC^^ 

CAAGATCTGGACTACTGTTGAAAAAATTTCCAGTGAGGCTCACTTATGTCTGTAAAGATGGG 

AAAAAAATACAAGAACATTGlTCTACTAAAAGGATTAGAGGTGATCAATGATTATCATrTTAGA 

ATGGTTAAGTCCTTACTGAGCAACGATTTAAAACTT AATTT AAAAATGAGAGAAGAGTATGAC 

AAAATTCAGATTGCTGACTTGATGGAAGAAAAGTTCCGAGGTGATGCTGGTTTGGGCAAACT 

AAT AAAAATTTTCGAAGAT AT ACCAACGCTTGAAGACCTGGCTGAAACTCTT AAAAAT 

GTTAAAAGTAAAAGGACCAGCCCT ATCAAGAAAGAGGAAGAAGGAAGTGGATGCTACTTCAC 

CTGCACCCTCCACAAGCAGCACTGTCAAAACTGAAGGAGCAGAGGCAACT 

GAAAAGAAAAAAATCAACCAAAGAAAAGGCTGGACCCAAAGGGAGTAAGGTGTCCGAGGAA 

CAGACTCAGOCT C CCTGTOCTGCAGGAGCCGGCATGTCCACAGCCATGGGCCGTTCCCCAT 

CTCCCAAGACCTCATTGTCAGCTCCACCCAACACTTCTTCAAC^ 

GCCAAATGTGAGGTAACT (XX^GAAG AAATGTTCTCX^AAAAAGGCCCAGTGATAGTGAAGGT 

ACTGAGTACAACAAAGCCATTTGAATATGAGAOCCCAGAAATGGAGAAAAAAATAATGTTTCA 

TGCTACAGTGGCTACACAGACACAGTTCTTCCATGTGAAGGTmAAACACCA^ 

AGAAATTCAATGGAAAGAAAAT CATCATCATATGAGATTATTTGGAATATGATAGTCTCCTAGA 

GGTCAATGAAGAATCT ACTGTATCTTGAAGCTGGTCCTAACCAAACGTTTGAGGTTCCAAATAA 

AATCATCAACAGAGCAAAGGAAACTCTGAAGATTGATATTGTTCACAAAC^ 

TATTGTATATGGGGTATTTATGCTACATAAGAAAACAGTAAATCAGAAGACCACAATCTACGA 

AATTGAGGATGATAGAGGAAAAATGGATGTAGTGGGGACAGGACAATGTCAC^TATCCCCT 

GTGAAGAAGGAGATAAGCTCCAACTTTTCTGCTTTCGACTTAGAAAAAAQAAC 

AAACTGATTTCAGAAATGGATAGTTTTATCCAGA7 AAAGAAAAAAACAAACCCGAOAAACAAT 

GACOCCAAGAGCATGAAGCT ACCCGAGGAACAGAGTCAGCTTCCAAATCCTTCAGAGGCCA 

GCACAACCTTCCCTGAGAGCCATCTTCGGACTCCTCAGA 

AGTTTCTTCACCAAGAAAAGTGAAGACACAATCTCCAAAATGAATGACT^ 

GATACTGAAGGAA^GGAGTCATTTTCCAGGACCGn(^TGACCAGCAT 

AGCCATCOCCACACTCCTGAGATGCXrrCXMTCAACACCAAGCA 

GAAACCAAGACTGAAGACT GAACCTGAAGAAGTTTCCATAGAAGACAGTGCCCAGAGTGAC 
CTCAAAGAAGTQATGQTGCTGAACGCAACAGAATCATTTGT ATATGAQCCCAAAGAGCAGAA 
GAAAATGTTTCATGCCACAGTGGCAACTGAGAATGAAGTCTTCCG^ 
TGACCTAAAGGAGAAGTTCACCCCAAAGAAGATCATTGCCATAGCAAATTAT 
ATG GGTTCCTGGAGGT AT ATCCTTTCACACTT G TG G CTG AT GTGAATG CT GACCGAAACATG .
GAGATCCCAAAAGGATTGATTAGAAGTGCCAGCGTAACTCCT AAAATCAATCAGCTTTGCTC 
ACAAACTAAAGGAAGTTTTGTGAATGGGGTGTTTGAGGTACATAAGAAAAATGTAAGGGGTG 
AATTCACTTATT ATGAAAT ACAAGAT AAT ACAGGGAAGATG GAAGTG GTG G TG CATGGACGA 
CTGACCACAATCAACTGTGAGGAAGGAGATAAACTGAAACTCACCT^ 

GAAAAGTGGGAATACXXMGGAGTTGAGATCTGTAATTCATAGTCACATCAAGGTCATCAAGA 
CCAGGAAAAACAAGAAAGACATACTCAATCCTGATTCAAGTATGGAAACTrCACCAGACTTTT 
TCTTCTAAAATCTGGATGTCATTGACGATAATGTTTATGGAGATAAGGTCTAAGTGCCTAAAA 
AAATGTACATATACCTGGTrGAAATACAACACTATACATACACAOCACCATATATACTAGCTG 
TTAATCC TATGG AATGGGGTATTGGGAGTG C 1 1 1 1! I AATTTTTCATAG 1 1 1 1 1 1 1 1 1 AATAAAA 
TGGGATATTTTGCATCTACAACTTCT ATAATTTGAAAAAATAAATAAACATTAT C rTTTTTGTGA 
AAAGAAAAAAAAAAGGGCGGCCGCNNNNNKNNNNM 
>496

gaaaacaggtctatgttttgttacxstgcxrtgggcagggcgcttagcgtagctggaa 
tgcgcaggactggcagttgcacaataataaattagtcct gaacgttgtattccagttgaccc 

AGGTCCTCTCGAAGAACAGAGGGGCAACTGTGCAAAATCTTG GT GACTTGCT GT ACCGTGA 
AAAGGCAC I rCTCCT I GAGAAGCCTGACAGTGTCGTTAATGTCCTGCTGGCGCATGGTGAAA 

atttcagggcaacagtaaagcaccctctttaatttcccttctccaagcocaagct 

TAACTGGAGCGCTT <XTT CATTTGCAT AAT AGGCAGTTTCAATAACT GGGGACTTTTCTTCAAG 

accacacacacaggctctggattcagacccaagagaataaattctgaaatgatgtc^ 

ct gttgaagactggcaccttjgccgtagact GAGGAATTCATT AAT atgggcattgctgaaac 

ccatgtccaggagggaactcatgaccctctctaggtccaaggaccct 

cccctgcttctcaaggaggcactgaacaagattcctc^ 

t attggatctaacacaagataactcctcaatgacccctcx^ttggaggctgt agtcagtttg 



ggcccaggtgagggggatcaggc^tgccaatcaaggacctg 
agcgcggagaagatggcagccggacgcgtgg 



NNNNNGATGGGG«50CGGGTCX>^GCGACCGAC»CGTCXX3G^ 
TCATGGAGCTGGCACTGCGGCGCTCTCCCCTCCCGCGGTGGTT GCTGCTGCTGCCGCTGC 

tgctgggcctgaaggcaggagctgtcattgactggcccacagaggaggggaaggaagtat 
gggattatgtgacggtccgcaaggatgoct acatgttctggtggctctattatgccaccaac 
tcct gcaagaacttctcajgaaxttgcxixxrtggtcatgtggctt cagggcggtccaggcggtt 
ct agcactggatttggaaactttgaggaaattggggcccttgacagt gatctcaaaccacgg 
aaaaccacctggctocaggctgccagtctoct atttgtggat aatcccgtggggactgggt 
tcagttatgtgaatggtagtggtgcctatgocaaggagctggctatggtggcttcagacatg 
atggti CI 0C1 GAAGACC 1 1 C II cagttgccacaaagaattccagacagttccattctacatt 
ttctcagagtcctatggaggaaaaatggcagctggcattggtctagagctttataaggccat 

TGAGCGAGGGACCATCAAGTGCAACTTrGCGGGGGTTGCCTTG 

O CTSn GATTOGGTGC I C f CC1 GGGGAOCTTAOCTGTACAGCATGTCTCTTCTCGAAGACAA 
AGO I U I GGCAGAGGTGTCTAAGCTTGCAGAGGAAGTACTGAATGCCGT AAATAAGiXiGCTC 
TACAGAGAGGCCACAGAGCTGTGGGGGAAAGCAGAAATGATCATTGAACAGAACACAGATG 
GGGTGAACTTCTAT AACATCTT AACT AAAAGCACTCOCACGTCTACAATGGAGTGGAGTCT A 
GAATTCACACAGAGCCACCTAGTTTGTU 1 1 1 UI C AGCGCCACGTGAGACACCTACAACGAGA 
TGCCTTAAGCCAGCTGATGAATGGCCCCATCAGAAAGAAGCTCAAAATTAT^ 
AATCCTGGGGAGGCCAGGCTACCAACGTC 1 1 1 G I GAACATGGAGGAGGACTTCATGAAGCC 
AGTCATTAGCATTGTGGACGAG TTGCTGGAGGCAGGGATCAACGTGACGGTGT ATAATGGA 
CAGCTGGATCTCATCGT AGATACGATGGGTCAGGAGGCCTGGGTGCGGAAACTGAAGTGGC 
CAGAACTGCCT AAATTCAGTCaGCTGAAGTGG A AGGCCCT GT ACAGTGACCCTAAATCTTTG 
GAAAC ATCTG CI 1 1 fGTCAAGTOCTACAAGAAOCI fGCI 1 1 CT ACT GGATTCTGAAAGCTGGT 
GATATGG rTOCTTC IGACCAAGGGGACATGGCTCTGAAOATGATGAGACTGGTGACTCAGC 



- TGAGGCCGCTGAAGCTGTAGGAAGCGCCATTCTTCCCTGT ATCTAACT GGGGCT GTGATCA 
AGAAGGTTCTGACCAGCTTCTGCAGA GGATAA AATCATTGTCTCT GGA 

T ATTTCTGCTT CTT AAAAAAACCT AAGATTTTTT AAAAAATTGATTTGTTTTGATCAAAAT AAAG 
gatgataatagatatta 1 1 1 1 1 1 cm atgacag aaggaaatg atgtgattt atagaaaaact gg
gaaatacaggtacccaaagagtaaatcaacatgtgtatacccccttcccaggggtaagcact 
gttaccaattt agcatatgtccttgcagaa i i 1 1 1 1 1 1 1 ct at atatacatat at attttttacc 

AAAATGAATCATTACTCT atgttgtttt ACTATTTGTTTGACATATCAGTATATCT GAAACACCT 
TTTCATGTCAATAAATGTTCTTCTCT AACATTTTTA 



N NNNNAAAAAACCCCCCAAAAAGGGGGGGGAAAAAACCCCCCAAAAGGGGGAATT 
GGGGGCCCCCCCGGGGGGGTGTTTTTAT1TCAATTTTTTAAATTTG 1 1 1 1 1 1 1 1 1 ATTAACAA 
GTGTAACTCTTTATTAAGATGGAATTGTTCTTATTAAAGAAATAGATGAAAATGGTTAAGT ACA 
ATTAAA TGGCT CCAAAAGTCTTACAATGAAAACAACAGTCCTGCCAGTT U 1 1 CI MCCAGAGG 
CAAATACTTTT C ATTCTC TTAGTTTTTCCTTCCGTTAGTTACCTTCAT 

TG 1 1 1 1 II I I I AGTTTTTCAAGTGAATGCATATATTAATACATAAAATTTTAAAAAGGCTTTTCA 
GTTTATAATGCATCCTAACAGTCCCCTGCCCCATCCCTCCT AATTCTCCAGAGCAATGACTTT 
T AACTCTTTT AGCAATGTCTTCTG I M il l 1 CCTCACATAACTTAGTCATTTCATTATTATATAT 
TATATAGTGATTTCTGGATATGACAGATGAGGATTTAGCTCTTACACCATCACAATCTTCACTT 
TTCCTCCCATATTGTGCCAGAGTAGTTACCATATTTAGGGGCT AAGCAGT CACATCATT ATG A 
CTGTGTAGATACTGCTCCTTGTTGAGAAAAACAGTGTATTATGCTCTTCCTATCCTGTACTTC 
TrTCTGCCCTAGAATGATTGCCTAACCAACCCTTTCCOCTTG 1 1 1 1 M 1 1 1 1 I I I 1 1 1 1 1 A CTTT 
TGCTTTATCITCAATGAACTTCTTTCCAAATGCCCCAAATCTGGCAATAAGTTATTATTT^ 
GAGAAGGGATCTGACT AAGGTGGCCAGGCTGGTCTGGAACT CTTGGGCTCAAGTCGTCCT G 
CTGC TGAGC CTCCTGA GT AGC T GGGACTACAGGCATGGGCCATTCCACCCAGCTAACCGAT 
T AAT A I \ 1 1 1 ACTGTTCXTrTTTTAAAGCrrAGCTCCATAGCTGGAATATTGCCTGTGTTGGATCC 
TATTTT^CJ^GATTCATGTCATTCTCTGGTTTGTGTACTCCTT CATTTTGCT GGAGTATTTTCTC 
CAACAGCTTCCCTAAAAAGGGCACATGGAGGTAACCCCTGAGTCTTTGCn 01 C I AAAAATG 
TATTATTTTACCTTCACCCTTGATTATTTGGGTGAATATAGATTTATCAGTTGAAAATA 
TAGAATTATGAAGGCATGGTTCXATTGTTTTCAGATrCCTTTTC^ 

CACCCCGGCTGGAGTGGAGTGGCACAATCACGACTCACTG^AGCCTTGAACTCGTG GGCT C 
AATTAA TTCTrC CA TCTCAG CTTCCTGACTAGATGGGAATACAGGT GTGCGCCACAATGCCC 
AGCT AATTTTTGTGTTTTTTGTAGAGATGGGGCTTTGCCATGTTGCCCAAGCTGGT CTCAAGT 
GAT CTGOCTGCCTTAGC CTCCCAAAGCGCTGGGATTACAGGTGTGAGCCACTGTGCCT GGC 
TGGTTTTTCTTTT ATTTTCTCTCT ACTTCATTCATTTCTrCAAAGGCATCTGAAACTCCCTGGGC 
CCAAAACCAAAGTCAAACTCCATCAGAAT ATCCTGCT ACTT ATAACTTTAAAAT AT ATCTT AAA 
TCACAATTTCTT ACT ACATGCAACCCTTT GGTTCAAGCAACCATCAT(XXrrCACTTGAACTT AT 
AATTG GTTTTCC 1 GC I ICTGTAI 1 1 1 1 1 I AAATTATAAAATACTTCAAATAAACCATGTGTGGC 
CTT AT AC UII I ACTACATAAAAAGAATGAAGATGTGCCGGGTGTGGTGGCTCAAGCOGGTA 
ATCCCAGCACTTTGGGAGGCTGAGGTGGGTGAATCGCTTGCGG 

CCTGACCAAGATGGTGAAACCCCATCTCT ACT AAAAAT ACAAAATACAAAAATTAGCCGGGC 
GTGGT GGCX5CGCACCTGGTCCT AGCT ACTTGGGAGCCTGAGGCAGGAGAATGGCTTGAAT 
CTGGGAAGGCGGAGGTTGCAGTAAGCCGAGGTCACACCATTGCATGCT 
AAGAGCGAAACGCTGTCT AAAAAAAAAAGAACTT ACA T ACTT AAACATCGAATTGCTTT AAGT 
GTTTGGCATTCACTGTTCCCGTGCATGCCAGCAGCTT^ gc 

tat agaggagct ggactatgttccgcocacacacaggacagggagctgacaaccaaggagg 
ggggcgggatcaatactccagc 1 1 1 occooctoca 1 1 1 ucc i gtatctcagcagt atggagg 
ttgttatcaatgagaacctgctcgctgactttaaattgc i rnci iaccttctcagttcagttt 
ctccgtagctcaattgatgatttctggtatgacctagcaaatacact 
cagtcttgcaatctgctttgggtt ccccaatct aagacagaaacat actcattttcccatcac 
tggacttccagg i i u 1 1 i i caatttttcactgttacaaacaaggtggcaacatttatctacaaa 
cctctggatatacacgtaggaagcttttggtatttccactagtgaaact 

GT ATGTGGATCTT CATCTTT AAT AAAT ATT ACCAACATGTGAAAAGCCOGACAATGTCAAGGA 
CTGGCAAGAGTG CCACA TGTGATGGGTGTGGAATGGCAGCTCACTGT AGCAGGT GCTGGG 
GACrCAGTTGGGGTCTTGGAGAAGCACTTAGTT AT AGCAAGAATGTCTCAT AAATGGT ATCT 
GAT AGAGACAAGAGT AGTGGGGAATAAAAACT AGTTGCTTAGAAAT AATT AT AGATCT AAAGT 
CAACAAAAAACCTTTTCTAATGT AAAAAT ACATAC 1 1 1 I I I CA GAGGGiAGOGGGAACAACTTA 
AACT AGAAAACACCTTCATATTAATCA I ICI ICI CAT AT ACTTCAAATTTGT ACTT AATGCCTTT 
CTCCT CCTGGACATCAGAGAGAACACCTGGGTATTCTGGCAGAAGTTT AT ATTTCTCCAAAT 
GATGGCCTGGGTGATTCATGGCTTCCTTATAAACAGAACTGGCACCAACTATCCAGACCATG
TCT ACTTTATTTGCTAATTCTGGTTGTTCAGT AAGTTTTAAG GCATCATCTAG ACTTCTGGAAA 
GAAAATGAGCTCCTTGTGGAGGTTCCT^GAGTTCTCTGCTGAGAACTAAATTAATTCT ACCCT 
TTAAAGGTCGA7TCTTCTCAGGAATGG AGAACCAGGTCTTCTT ACCCATAATCACCAGATTCT 
GTTTACCTTCTACTGAAGAGGTTGTGGTCATTCTCTGGAAATATCTGAATTCATTCCTGAGCG 
GTGGCCAGGGCAGGTCXJOCGTTCTTGCCGATGGCGATGTTCTGGGACACAGCGACGATGC 
AGTTTAGCGAACCAACCATGACAGCAGCGG6AGGACCTCCGAGCCCGCTCGTTACAGCAGA 
ACGCGCGGTCAAGTTTGGCGCX^AATTGTGGCCGCCCCGCCCCCCTCGTCCOCATTTGT^ 

CAGACGCCTGGGAACTGCGGCCGCGGGTTO 

ggctgccatccttgccctgccatgtctcgccggaagcctgcgtcgggcggcctcgctgcc 
tccagctcagcccctgctmggcaagcggttttgagccgattcttccagtctacgggaagcc 
tgaaatccacctcgtcctccacaggtgcagccgaccaggtggaccctggcgctgcagcgg 
ccgcagcgcccccagcggccgccttcxxx3ccxx>gctgccgccgcacgtagctacagaa^ 

gtccaacaaaaggaaggaggaagtgatctgggaatgtctggcaactctgagccaaagaaat 
gtctgaggaccaggaatgtttcaaagtctctggaaaaattgaaagaattctgctgcgattct 
gcccttcctcaaagtagagtccagacagaatctctgcaggagagatttgcagttctgccaaa 
atgtactgattttgatgatatcagtcttctacacgcaaagaatgcagtttcttctgaagattc 
gaaacgtcaaattaatcaaaaggacacaacactttttgatctcagtcagtttggatcatcaaa 

TACAAGTCATGAAAATTTACAGAAAACTGCTTCCAAATCAGCTAACAAACGGTCCAAAAGCAT 

ctatacgccgctagaattacaatacatagaaatgaagcagcagcacaaagatgcagttttgt 
gtgtggaatgtggatataagt atagattctttggggaagatgcagagattgcagcccgaga 
gctcaatattta7tgccatttagatcacaactttatgacagcaagtatacctactcacagact 
gtttgttcatgtacgccgcctggtggcaaaaggatataaggtgggagttgtgaagcaaact 
gaaactgcagcattaaaggccattgg agacaacag aagttcactctttt cccggaaattg ac 
tgccctttatacaaaatctacacttattggagaagatgtgaatcccctaatgaagctggatga 
tgctgtaaatgttgatgagataatgactgatactrctaccagctatcttctgtgcatctctga 
aaataaggaaaatgttagggacaaaaaaaagggcaacatttttattggcattgtgggagtgc 

AGGCTGCCACAGGCGAGGTTGTGTTTGATAGTTTCCAGGACTCT GCTTCTCGTTCAG AGCT A 
GAAACCCGG^TGTCAAGCCTGCAGCCAGTAGAGCTGCTGCTTCCTTOGGCCTTGTCCGAGC 
AAACAGAGGCGCTCATCCACAGAGCCACATCTGTTAGTGTGCAGGATGACAGAATTCGAGT 

cgaaag gatggataacatttattttgaatacag ccatg ctttccagg cag ttacagagtttta 
tgcaaaagatacagt7gacatcaaaggttctcaaattatttctggcattgttaacttagagaa 
gcctgtgatttgctctttggct gccatcataaaatacctcaaagaattcaacttggaaaagat 
gctctccaaacctgagaattttaaacagctatc^agtaaaatggaatttatgacaattaatgg 
aacaacattaaggaatctggaaatcct acagaatgagactgatatgaaaagcaaaggaagttt 
gctgtgggttttagaccacactaaaacttcatttgggagacgg aagttaaagaagtgggt ga 
cccagccactccttaaattaagggaaataaatgcccggcttgatgctgt atcggaagttctc 
cattcagaatctagtgtgtttggtcagatagaaaatcatct aggtaaattgcccgacatagag 
ag ggg actct g t ag cattt atcacaaaaaatgttct acccaagau » ic m ' u i i gattgtcaaa 
actttatatcacctaaagtcagaatttcaagcaataatacctgctgttaattcccacattca^ 
cagacttgctccggaccgttattttagaaattcctgaactgtrrcagtccagtg 
ttaaagatactcaatgaacaagctgocaaagttggggataaaactgaattat7taaagacc7t 
tctgacttccctttaataaaaaagaggaaggatgaaattcaaggtgttattgacgagatccga 
atgcatttgcaagaaatacgaaaaatactaaaaaatccttctgcacaatatgtgacagtatca 

GGACAGGAGTTTATGATAGAAATAAAGAACTCTGCTGTATCTT GTATACCAACTGATTGGGTA 
AAGGTTGGAAGCACAAAAGCTGT GAGCCGCTTTCACTCTCCTTTTATTGTAGAAAATTACAGA 
CATCTGAATCAGCTCCGGGAGCAGCT AGTCCTTGACTGCAGTGCTGAATGGCTTGATTTTCT 
AGAG AAATT CAGTGAACATTATCACTCCTTGTGTAAAGCAGTGCATCACCTAGCAACTGTTGA 
CTGCAI 1 1 1 CI LXXrrGGCCAAGGTCGCTAAGCAAGGAGATTACTGGAGACCAACTGT ACAAG 
AAGAAAGAAAAATTGT AAT AAAAAATGGAAGGCACCCT GTGATTGATGTGTTGCTGGGAGAA 
CAGGATGAATATGn"CCCAAATAATACAGATTTATCAGAGGACTCAGAGAGAGTAATGATAATT 
ACCGGACCAAACATGGGTGGAAAGAGCTCCTACATAAAACAAGTTGCATTG^ 
GGCTCAGATTGGCTCCT ATGTTCCTGCAGAAGAAGCGACAA7TGGGATTGTGGATGGGATTT 
tcacaaggatgggtgctgcagacaatatatataaaggacggagtacatttatggaagaactg
actgacacagcagaaataatcagaaaagcaacatcacagtccttggttatcttggatgaacta 
ggaagagggacgagcactcatgatggaattgccattgcctatgctacacttgagtatttcat 
cagagatgtgamtccttaaccctgtttgtcaccx^ttatccgccagtttgtgaactagaaaa 

AAATTACTCACACCAGGTGGGGAATTACCACATGGGATTCTTGGTCAGTGAGGATGAAAGCA 
AACTGGATCCAGGCACAGCAGAACAAGTCCCTGATTTTGTCACCTTCCTTTACCAAATAACTA 

gagga attgcaggaaggagttatggattaaatgtggctaaactagcagatgttcctggagaa 

attttgaagaaagcagctcacaagtcaaaagagctggaaggattaataaatacgaaaagaaa 

gagactcaagtatntgcaaagttatggacgatgcataatgcacaagacctgcagaagtgga 

cagaggagttcaacatggaagaaacacagacttctcttcrcattaaaatgaagactacattt 

gtgaacaaaaaatggag aattaaaaat accaactgtac aaaat aactctccagt aacagcct a 

tcttt gtgtgacatgtgagcataaaattatgaccatggtatattcctattggaaacagagagg 

tttttctgaagacagtcnttttcaagtttctgtcttcctaacrrttct acgtataaacactctt 

gaatagacttccactttgtaattagaaaattttatggacagtaagtccagtaaagccttaagt 

ggcagaatataattcccaagcttttggagggtgatataaaaatttacttgatatttttatttgt 

TTCAGTTCAGATAATTGGCAACTGGGTGAATCTGGCAGGAATCTATCCATTGAACrr AAAAT AA 
TTTTATTATGCAACCAGTTTATCCACCAAGAACATAAGAATTTTTTATAAGTAGAAAGAATTGG 
CCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTAGGCAGATCA 

cctgaggtcaggagttcaagaccagcctggccaacatggcaaaaccccatctttactaaaaa 

TATAAAGTACATCTC 

>500

>501 

>502 

>503 

>504 

>505

>506 

acgcgggggtctgccgcggtggccgggtgccgatttgacaagatcaaagctgcag 
gaaaatggacagtgaggttcagagagatggaaggatcttggatttgattgatgatgcttggc 
gagaagacaagctgccttatgaggatgtcgcaataccactgaatgagcttcctgaacctgaa 
caagacaatggtggcaccacagaatctgtc^aagaacaagaaatgaagtggacagacttagc 
cttacagtactcgtcttggtgagagcgtgagctgctgagatttgggagtctgcgctagggc 
cgct tggagttctgagccgatggaagagttcactcatgtttgcacccgcggtgatgcgtg 
ttttcgcaagaacaagactctcggctatggagtccccatgtt^ 
aaaaactgaaagagaataaaatatctttagagtcggaatatgagaaaatcaaagactccaagt 
ttgatgactggaagaatattggaggacccaggccttgggaagatcctgacctcctccaagg 
aagaaatccagaaagcctt aagactaagacaacttgactctgctgattcttttttccttttttt 
ttttttt aaataaaaatact attaactggacttcctaatatatacttct atcaagtggaaagga 
aattccaggc ccatg gaaacttggatatgggtaatttgatgacaaataatcttcactaaaggt 
catgt acagg tttttatacttcccagctattccatctgtcgatgaaagtaacaatg 
cgtatattttacacctcgaaataaaaaatgtgaatactggccagacaaaaaacaaaaaaa^ 
gggggggcctcga^tgtttt 

GGGQTQ ^G^^ 

GGGGNNNN 
>507 



>510

r#MNNNrWNNGCTATGTTGCCCAAGCTTGTCTCAAACrcCTGGTCT 
TCTGCCCTGGCCCTCCCAAAGTTCTGGGTATTACAGGTGTGAG<X^GCACTCCrrGGCCCAT 
CACAGTCTTAAAACCAAAAGTTCTGTGTCCGAGGAAAACCAGGAGTGATTGGTCACTCT ATT . 
TATGACTCATAGCACTTACAGGCTACTTCGGCAGGGACTnJGGGTACCCCTGTTCTTGGATG 
TT AGCT ATTTT AGACTTGGGGTCAAAG AAGAG AAGCCT CTTGCCC AACTCAGCAACACCAGA

CAGGGGCCTCATATCTTGGCTCGTGGAAAGTAGCTTTTATACCAAGCCCTCTCCTAGGGGCA 

TAGGAGCCAGCATTCCCAGTTCTGGGGACAGAAAAGGCAGGTGCCCCATTGGAGCCGAGAT 

CCTCTACCGCAGTCGTTTGAGGAGGCGGAACTGAAGI rTTTTCI I A ATTATCATGTGACGGG 

TTCTGGATTTAATGGGGGGAAAAGGGCGGAAAAGGACAAGGATCCAAACTGGCGAATTTGC 

TGATCTTCGCGTCCCTCTCCGCTTT CCGGCCGGCAGCGCT G CCAGG GTATATTTCCTTTTTT 

CCGATCCTGCAACAGCCTCTTTAAACTGTTTAAATGAGAATGTCJCTTGGCTCAGAGAGTACTA 

CTCACCTGGCI 1 1 ICACACTACTCr ICI I GATCATGTTGGTGTTGAAACTGGATGAGAAAGCA 

CCTTGGAACTGGTTCCTCATATTCATTCCAGTCTGGATATTTGATACTATCCTTCTTGTCCTG 

CTGATTGT6AAAATGGCTGGGCGGTGTAAGTCTGGCTTTGACCCTCGACATGGATCACACAA 

TATTAAAAAAAAAGCCTGGTACCTCATTGCAATGTTACTT AAATT AGCCTT CTGCCT CGCACT 

CTGTGCTAAACTGGAACAGTTTACTACCATGAATCTATCCTATGTCTTCATTCCTTTATGGGC 

CTTGCTGGCTGGGGCTTTAACAGAACTCGGATATAATGT C I UN I GTGAGAG ACTGACTTCT 

AAG T ACATCATCT CCTTT CT ATTGCT GTTCAACAA G TT ACCATT AAAG TGTTCT G AATCTG T CA 

AGCTTC^GAATACCAGAGAACTGAGGGAAAATACCAAATGTAGTTTTATACTACTTCCATAA 

AACAGGATTGGTGAATCACGGACTTCTAGTCAACCTACAGCTTAATTATTCAGCATTTGAGTT 

ATT GAGATCCTT ATT ATCTCTATGTAAAT AAAG 1 1 I G 1 M I GGACCCCAAAAAAAAAACGAAAC 

ACAAAAAGGGGGGCGCAGAAATCAACAG GCGCACACTTCACCACACTTTTG AAAGGCCACA 

GGAGACTT AAAACGGGCACCAAACTGTTG G ACACAN 

>511

>512

>513 

>514 

ACTGAGGCTCGA^^ACACCTGGCCTCGAAAGACCCCTGTGTC<^AAAGGAGATTT 

GGGTTCGTTTTCAAAGTTGTCGTAGAATGTGTCTGGATACATCCGCTCCATCG GATTTCTGAT 
CTGTGGATTCATGTTCAGCCTCTTAGATACGCTTGTCCCTGTCCCGCT C I 1 1 1 M 1 GAAGCTC 
TGTGGTTrCACTTCCTGCTTACCCCGCGGACTTGCTTAGAACTGCCAACAGACTC^ 
AAAAAGAOGTGTGAI I ICC 1 1 CCAGACTTT AAAGAATACT GTCAGAAGTTCTATCAGGCAGAG 
CTGGAGGAGTTGTCCTTTGGTGAAG ACACTGAAGAGTG CAGGAAGCAT AT AAATGACTGGG 
TGG CAGAGAAGACT GAAG GT AAGATTTCG GAG GT 
>515 

TTGGAGCTCCCCGOGGTGGCGGCCGGGTTATTCrrTTCACT^ 
TTCmGATTGGT CAACGC TCTATTTGAAAAGCCCGCX^AGGGCTG 

CTCATCCGGGI rOTTTI J CGAAGAGGTTCTCCCG GAATGGAAATG ACCG TTATCTG GAGGAG 
TGCATCGGACTCTGGCGCCAAGAGCTGGAAGCTGTGAATTAACCGTCCCTCTT 
GTACCTCGGCCGCGACCACGTCXiAGCGGCCGCCCGGGCAGGTA>WNNNNNNNNN^NNN 
NNN NNNINN NCACT AGAGGCATCTCAGGTGCCT G CCCATGG GCCACAC7 CTT AGAGGGGG A 

ClClGGr IUI i IliACCAjJl I \\Jl iCt IlIlUI f CTCTGGGAGCCCAAGGTCT ACTTTGTGT 

GTCTTGGTGATCATTCCGGTGGGCGAAATTTCCCCGCCT^ 

GTCCCCGCGTAOCTCGACCGCCACCGCGGGGAGCTCCA 

>516 

>517 

NNGGGATTCAGCATCACTTCCAGAGATGTAACAATAGGTGGCTCAGCTCCGATCTAT 
G TG AAAAA GATT CTCCCOCGGGGGGCGGCCATTC>GGATGGCCGACTT AAG GGAGGAGAC 
AGACTT ATAGAGGTAAATGGAGTAGATTTAGTGGGCAAATCCCAAGAGGAAGTTGTTTCGCT 
GTTGAGAAG CACCAAGATGGAAGGAACT GTGAGCCTT CT GGTCTTTCGCCAGGAAGACGCC 
TTCCACCCAAGGGAACT GAATGCAGAGCCAAGCCAGATGCAGATTCCAAAAGAAACGAAAG 
CAGAAGATGAGGATA 1 1 G 1 1 C 1 1 ACACCTGATGGCACCAGGGAATTTCTGACATTTGAAGTC 

CCACTT aatgattcaggattjtgcaggccttggtgtcagtgtcaaaggtaaccggtcaaaaga 

GAAOCACGCAGATTTGGGAATCTTTGTCAAGTCCATT ATTAATGGAGGAGCA AAAG 
. ATGGAAGGCTTCGGGTGAATGATCAACTGATAGCAGT AAATGGAGAATCCXrrGTTGGGCAA 

gacaaaccaagatgccatggaaaccct aagaaggtctatgtctactgaaggcaataaacgag 
gaatgatccagcttattgtigcaaggagaataagcaagtggaatgagc^ 
agcccccctggacctgagctgcocattgaaacagcgtt ggatgat agagaacgaagaattt 
cccattccctctacagtgggattgaggggcttgatgaatcgcccagcagaaatgctgccct

cagtaggataatgggtaaataccagctgtcccctacagtgaatatgccccaagatgacactg 

tcattatagaagatgacaggttgcxjagtgcttcctccacatctctctgaccagtc^ 

gctcccatgatgatgtggggtttgtgacggcagatgctggtacttgggccaaggctgcaat 

cagtgattc^gccgactgctctttgagtccagatgttgatccagttcttg c i i i i c aacgag 

aaggatttggacgtcagagtatgtcagaaaaacgcacaaagcaattttcagatgccagtcaa 

ttggatttcgttaaaacacgaaaatcaaaaagcatggatttaggtatagctgacgagactaaa 

ctcaatacagtggatga(x^gaaagc^ggttctcccagcagagatgtgggtccttccctgg 

gtctgaagaagtcaagctcgttggagagtctgcagaccgcagttgccgaggtgactttgaa 

tggggatattcctttccatcgtccacggccgcggataatcagaggcaggggatgcaatgag 

agcttcagagctgccatcgacaaatcttatgataaacccgcggtagatgatgatgatgaagg 

CATGGAGACCTTGGAAGAAGACACAGAAGAAAGTTCAAGATCAGGGAGAGAGTCTGTATCC 
ACAGCCAGTGATCAGCCTTCCCACTCTCTGGAGAGACAAATGAATGGAAACCAAGAGAAAG 
GTGATAAGACTGATAGAAAAAAGGATAAAACTGGAAAAGAAAAGAAGAAAGATAGAGATAAG 
GAGAAGGATAAAATGAAAGCCAAGAAGGGAATGCTGAAGGGCTTGGGAGACATGTTCAGCC 
TTGCCAAACTGAAGCCCGAGAAGAGATGAACAACAAAGCGATTCAAAACATGTCTTGAACAG 
cacatattgcacagttgttgi mm I AAACAAACAATAAATTTACTTTTAATGAAAAAAAAAA 
AAAAAGGGCGGCCGCN 

>518

>519 
>520 
>521 

NNNNNCC^CCGAGGGTTCXXSATTAAGTGCACGGCACCTCXaACCCX^AAAAACTTGA 
TTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCT gatagacggtcgcccttt aacgttga 
GTCCCACGTTCTTT AATAGTGGACTCTTGTTCCAAACTG GAACAACACTCAACCCT ATCTCGG 
TCNATTCI 1 1 IGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGAT 
TTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTC^ 
GG CTGCGCJAACTGTTGGGAAGGGCG ATCGGTG CGGGCCTCTTCGCTATTACGCCAGCTGG 
CGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACG 
ACGTTGTAAAACGACGGCCAGTGAATTGTAATACGACTC^CTATAGGGCGAATTGGCCAAGT 
GAGCTCGAAT TGCGGCGGC ATGCATA AG 

CAGACAGGCXsTAGCCCCGGGAGGAACCCGGGGCCGCAAGTGC^TrCGAAGTGTCGATGAT 
CAATGTGTCCTGCAATTCACATTAATTCTCGCAGCTAGCTGCGTTCTTCATCGACGCACGAG 
CCGAGTGATCCACCGCTAAGAGTCCCCCTTCTGTCGTCTTCXCGCAGCC^ 
GGGCGTCCCGAGACNAAGGGNTTCGATGGACAAGAGACCOAAGCATAGAANNNNN 



MACG CGGGGAAG T G AGAG GAACCGAGAGT AAGAGAAAGAAAGAAGTGAGGGGATG 
TAAACTCGAATAAATTTCAAAGTGCCTCCGAGGGATG CAACGGGCAAAAACTGAACT GTTCA 
GGCTTCAGATTGTAACTGACGATCTGAGGAAAAATGAGGTTTGTGTGATTTTGCTAAAATGC 
TCA(XiAACAGCX3AATGGCTGCCTTAGGtGACGGACAAAGAGCT GAGTGATTTACTGGATTTCA 
GTGCGATGTTTTCACCTCCT GTGAG CAGTGGGAAAAATGGACCAAC I I C I I I G GCAAGTGGA 
CATTTTACTGGCT(2AAATGTAGAAGACAGAAGTAGCTCAGGGTCCTGGGGGAATGGAG6AC 
AT CCAAGCXXX3TCCAGGAACT ATG GAGATGGGACTCCCTATGACCACATGACCAGCAGG GA 
CCTTGGGTCACATGACAATCTCTCTCCACC I II I fa I UAATTCCAGAAT ACAAAGT AAAACAGA 
AAGGGGCT CATACTCATCTTATGGGAGAGAATCAAACTTACAGGGTTGCCACCAGGAATTG 
CGTGATTCCTCACAGACACAAAGATTTCCTCTGCTGAGTAAGC^ 

GAAAGCATCATCCAGACCGTGTGAGTCTGTCTGTGT ATGT GCAGAACACAGACCCTCCTTTC 
TCCGTTTGTGGGGAATACTTCCCTCGGGTGAAACTGAAGTTA A I I 1 1 1 1 HI 1 T T ICCAAACA 
CAAGCAGAGTCTCCTT GGAGGTGAC^TGGAT ATGGGCAACCCAGGAACXXrrTT CGCCCACC 
AAACCTGGTTCCCAGTACTATCAGTATTCTAGCAATAATCCCtt 

TAGTGGCATGGAGGTACAGACAAAG AAAG TTCGAAAAGTTCCT CCAGG TTTGCX^TCTTCAG 
TCTATGCTCCATCAGCAAGCACTGCCGACTACAATAGGGACT cgccaggctatccttcctcc . 
AAACCAGCAACCAGCACTrTCCCTAGCTCCTTCTTCATGCAAGATGGCCA^ 
CCCTTGGAGCTCCTCCAGTGGGATGAATCAGCCTGGCTATGC^^ 
TCTCATATTCCACAGTCCAGCAGCT ACTGTAGCCTG CATCCACATGAACGTTTGAGCT ATCC
ATCACACTCCTCAGCAGACATCAATTCCAGTCTTCCTTCCGATGTCCACTTTCCATOGGAGT 
GGT ACAAACC ATT ACAGCACCT CTTCCTGT ACGCCTTTJTGGCAACGG GACAGACAT AT AATG 
GGAAATAGAGGAAGCGGGGCAGCCGGCaGCTCCCaGACTGGAGATGCTCTGGGGAAAGCA 

cttgcttcggaaagccgaattgaagatcgtttagaaagactggatgatgct attcatgttctc 

cggaaccatgcagtgggcccatccacagctatgcctggtggtcatggggacatgcatggaa 

tcattggaccttctcataatggag ccatg ggtggtctgg gctcagggtatggaaccggcct 

tctttcagccaacagacattcactcatggtggggacccatcgtgaagatggcgtggccctg 

agaggcagccattctcttctgccaaacx>ggttccggttccacag^ 

gacttcccctgacctgaacccaccccaggacccttacagaggcatgccaccaggactacag 

gggcagagtgtctcctctggcagctctgagatcaaatccgatnacgagggtgatgagaacc 

tgcaagacacgaaatcttcggaggacaagaaattagatgacgacaagnaggatatcaaatca 

ATT ACT AGGTGAAG fiTCT AGCAATAATGACG ATGAGGACCCCCTCCCCGCTT CCCTGCCTCA 

CCCTCAATAAATAAATTAATTGTTGTCANNN 

>524 



GCAGGAGATGTGGGCCGTGCCAAAGTGATGGCTGACACTGTGGCTCAGTTCATCAAGAGGA 
CCATCTTGAAGATCCCCATGAGTGAACTGACAACAATCCTGAAGGCCTGGGATrTTT^ 

CCATCTGTGTGAGGAAAAGCGTGCAAGTATCAGTGATGCTGCCCTGTTAGACATC^ 
AGTGCTGGAGTGCAGTAACGCCATCTGAGCTCACCGCGACCTCTGCCTCCTGGATTCAAGT 
GATTCTCCAACCTCAGCCTCCCG AGTAGCTGGGACT AT AGCAGTGCACCACCAT ATATGCAA 
TTTCATCAGCACCAGAAAGTTTGGGATGTTTrrGAGATGAGTAAAGGGCCAGGTGAAGATGT 
TGACCTITTTGATATGAAACAATTTAAAAATTCGTTCAAGAAAATTCTT CAGAGAGCATTAAAA 
AATGTGACAGTCAGCTTCAGAGAAACTGAGGAGAATGCAGTCTGGATTCGAATTGCCTGGG 
GAACACAGTACACAAAGCCAAACCAGTACAAACCT ACCT ACGTGGTGTACTACTCCCAGACT 
CCGTACGCXTTTCACGTCCTCCTCCATGCTGAGGCGCAATAC^ 

tgacaattgctagcaaacaccatcagattgtgaaaatggacct gagaagtcggtatctggac 
tctcttaaggctattgtttataaacagtataatcagacctttgaaactcacaactctacg 

CT CT ACAGGAAAGAAGCCTGGACT AGATATAAAT ATGGATTCAAGGATCATTCATGAAAACAT 
AATAGAAAAAGAGAGAGTCCAAACGAATAACTCAAGAACACATTTGGLAGATTATC<rrCAACCA 
CAACTAGGAATTGCACAATATAAGCTTGAAACGAAATTCAAAAGTGGTTGAAATGGGAGCAT 
CTT GGCTGAGAGGAAAGAACCCCT COGATGCCTAAT AAAGTTCTCT AGCCCACATCTTCTGG 
AAGCATTGAAATtXTTAGCACCAGCGG^TATTGO^ 

TGCATACCCAACAAGAGAATGAATTATTTTAAAGATTAGAGATAAATAAGACGTGCGTC 
CTT AAGCACAGCTCCTCCTTCTTGATATTGCACATGCACTT CAGTTCATGGCT AGCTGTATAG 
CTTCCGTCTGTAAACTTGT ATrrTCAAGAATCCTTGGT ATTGAATTTTT AGAAATGCTCACATA 
ATTGTTGGGACTGATTCATTCCTCGACGATATGCCTCCTCTCTCTGATATCTGCT 
CCGTGTGGCATTTGAGATGACAGGACATATTTATATATGGCCCAC^ 

AATGCTCTGAAAGCAAGCATATGGCACAGCGCT CAAGAACT GGCGTGGGGGCCTT ACCATG 
GGGACCGCCCAGT CGGGAAGGGCGGGCAAGGGGGGTCCCGGATAGAGCCCAACGCTGGA 
CGGGAAAGCAC»SGGAAACGACCCAACGTGCAACCACCAA^ 

CCGAGGGGGGAGAGGGCGTGGGGAACAACAGGTGGGACACCCACACAAGACCCAG 

AAGACAAGACGGCGAGAACAACAAGGNNN 

>525

>526 

>527

NNATGTGQTTACGACCCACTGTATTtSAGGTGAOGCGATCCATAGGCTGTGGTG^ 
GTTrrCGCTC^TCC ACAC AAACXSTTGGGGCAGTGTCTATTCATGTG 
GTTCTCGGTTGTCATrtTACA0TGTTTT^ 

GTTTAGCCCGTGGAGT ATCAAGGTCTTGAGACTCXX3TGTGAGACTCCCTGGTTTGTTCCACA 
ACAGTGTGTTTTTAGATTCCGT ACCTTT GATTAAGGAACACATCATGCCGTGAAGCCAATTT A 
TATTCTGCAATTaXTTAGTGCATGTAATGTATTCT .. 
GCACATDCA<nTCTTTGA7GTCTGGCTGCCTrCTGCGGGCCAACT G 1 C I TGTG gXwTOGTT 
GCTCCCAGAGATAGCTTGAACTGCAGATCCCGGACAGCATT^ 
TGAGCCTGGACATGGCGGCCGAGGTACTCACAGTCACGCAAAT^ 
GCTCT CCATTCTTCTTCTTGGCTTT ACAGGTTCCCAGGTCAAGAGCTTCACCCATAATTAAGA
CCTTCTGAGGATGATCGATAGATAAACACACCTCCTCT^ 

TTGGCATTGAGGATCCCT ACGACAGTCC(Xn"GCTCCGTCTTCCAGAGCGCTTTGTGAACTTC 
T CCAA AT AAG AACAAGGACACACATTGTGTCAG GTGACGAAGATCATTCA GTTTCCAT AT G CT 

gaaggi i i 1 1 ccactattcagactctgtggcgtaaccttcttcaatataaccccaaatgtcac 

pcaatctatttcttccagcttctctctggccatcttttccttgat^ 

ttttcggccggtcatgtgtctrcgttcatattct^ 

cgcagaccaggagaaggcttccacacagatggcgattgagtcgtttcctcacagaa 
actcggggtccaccacatatttgacctct agttatcccact aggtttgttccgagaaatcgt 
ctgtaggggttgggagggtgcacttgtcatcxrrtgaagatgagcttttgggatctggaggt 

G AAGCCTTTGGTGTTCG AGCCACCCTCTTG GTTCTTGGTAGCGCAGGGACATCAAGCTCCG 

CAGAAAAGCATGTTGACTCCTGAATTCTCTGAACTCTCCTCTCCTTAAGAGGTGG 

G ACTTCTCAGGGGA I f I \ TGCAGACGGGCTGGGCTTGCTGTCrrGTTrAATTGTTGTTACTTTT 

AGCJGCTCTTGTAAGGCCTTCATTTGCTCTTGCAAATTCCTTAATTCCT 

TCGTTTTCTCTCGCXTGGGGGCAGGAGCAGGGAGGACCCTATTTTC^ 

GGGAACTTCI Id ICATCTGTTAAGTCCTC<^TATCTCCAAAGAGAGTGGCCAG^TTTTCCTT 

TrCGTCTCTTGTCTCTCtnGTTTCTCCATCATCAGCCTCTTCTGTATAAGATTCACCGT 

GTC<MCATCAAAGAGCTCATCAAATGCGTC<5GGCTCGCCATT1TCCCGCGTCAAGAAGTTAT 

1 1 ici i ctgaattacaatccaaggctgactcattttcttccagcagtggggtcagcagagaca 
GATTGTCTTCCTCCTCATCCATGCTG t caag aggacagttgtgacag gaaacttcgaaggag 

gctcagatgcccagatgagcaatgtagaatcttggccgati 

acagctgaacccgccaaaatcggacnn 



nagtcgacccacgcgtccgatttcatcctgttgcccctccgaaatct attgctttag 
attcagagcccaagcotctcattgatgtgagtgtcat^ 

Terr ACAGTTCAAGTATTTTATGGTCXrrCATTCAGGAAATGGCCTT AAAAATTGATCAAGGGT 

TTCTAGGAGCT attattgcactgtttaccccaacaacagaccctgaagctgaaagaagacgg 
acaaagttaatccagc^gatattgatgctctaaatgcagaattaatggagactt^ 
gatatgtcaattctt AG I IIU1 1 1 gaacatttccatatttctcct gtgaagttgcatttgagtt 
tgtctttgggttccggaggtgaagaatcagacaaagaaaaacaggaaatgtttgcagttcat 
tctgtcaacttgct gttgaaaagcataggtg ctactctgactgatgtggatgaccttatattc 

AAACTTGCTTATTATGAAATTC^GATATGAGTTCrTACAAGAGAGATCAGCTTATATGGAGTGT^ 
GTTAGGCATTACAGTGAACAGTTCrTGAAACAGATGTATGTCXrrTGTATTGGGGTTAGATGTA 
CrrTGGAAACCCATTTGGATTAATT AGAGGTCTGTCT GAAGGAGTTGAAGCTTTATTCTATGAA 
CCCTTCCAGGGTGCTGTTCAAGGCCCTGAAGAATTTCC^ 

GAAGCCTCTTT GGACACACAGTAGGTGGTGCAGCAGGAGTTGT ATCT CGAATCACCGGTTC 
TGTTGGGAAAGG7TTGGCAGCAATTACAATGGACAAGGAATATCAGCAAAAAAGAAGAGAAG 
AGTTGAGTTC<iACAGCCCAGAGATTTTGGAGACAGC<rTGGC<^ 

GCGAGGAGTTGTTGGTGGAGTGACTGGAATAATAACAAAACCTGTGGAAGGTGCCAAAAAG 
GAAGGAGCTGCTGGATTCTTrAAAGGAATTGGAAAAGGGCTTGTGGGTGCT GTGGCCCGTC 
CAACTGGT GGAATCGTAGATATGGCCAGTAGTACCTTCCAAGGCATTCAGAGGGCAGCAGA 
ATGAACTGAGGAAGTATCTAGCCTCCGTCXXXX^^ 

GTCCTT ATGACAGACAGGAATCTGAGGGCT CTGACTT ACTTGAGAATCATATCAAAAAGTTG 
GAAGGAGAGACTT ACCGAT ACCACTG TGCT ATTCCT ggaaggaagaagacaatcctt atggt 
TACAAATAGGCGAGTGTTGTGTATAAAGGAAGTTGAAATOCTGGGCCTTATGTGTGTAGACT 
GGCAATGTCCATTTGAAGATITTGTATTTCCTCCTA^ 

CAGTTAAGGAACAGGGTCTGTTCCACAAAAAAGACAGTGCCAATCAAGGCT 
GTTTACCTGAAGGACACCGCCACAGCAGAGAGAGCATGTAATGCCATTGAGGATGCACAGT 
C^ACGAGACAGCAGCAAAAATTGAT GAAGCAG TCATCAGTGAGACTTCTCAGACCCCAATTG 
CCATOTAATCACA^SACCTCAGGG^ 

TGTATATGCT AGCAGTCCACTTCTACGCCACCACCCAAATGGGTCAGACCtrTTGAAGAAACG 
TCACTTCAAACTCAGAATGAAATTTTCATTAATATTAAAATTGTGAAGCAAA 
TATATTTAATTAAAGCCTTACTGAAGAATAA

AGGJ 1 1 1 1 1 1 ATTTTTGTTTTTTTGGGG I 1 1 1 1 1 1 1 1 1 1 1 1 1 J GAGACCGAGTCTTGCTCTGTCG 

CCCAGGCTGGAATGCAGTGGTGCGATCTTGGCrrGACTGC^ATCTCTGCCTCT 

GCGGTTCTACTGCCTCAGCCTCCTGAGTA^ 

GGTAAI I I 1 1 II 1 1 1 1 1 1 1 GTATTTTTAGTAGAGATGAGTTTCACCATGTTGGTCAGTCTAGTC 
TOGAACTOCTGAGCTTGTGATCCGC ATGC CTCAGCCTCCCAAAGTGCTGGGATTACAGGCAT 
GAGO^CCACGG<XXXJCGAAAAGGCTTT AAGCCATGAAGAAATGTTGGATCCTGACATTTTG 
TTTAAGAGTGATTTGTTGAATAATTGAACTGAGTTAAC^TTCrrTGGTAAACCAGGT 
GAAGAAAGGTCACT AAA GGGAGAAATGACATGTTTTCT A \ M I C I 1 1 1 CATG AAAAC ACTGTT 
TTTCCCCCTAATAAAGCATATTTTACTrTGGTGCTTATTTTTC<^^ AATAAAAA 
AATCTGGACAATCAAACCTTAAAATAGCTACACTCTGCCCTCTGTAATGTAGCATTCATAAAA 
ATTTGGAAGT ATTT ACATCCTCTTTCAAG ATGAG CTTATATGA CAGAATT ATT ATTTG CTGAT A 
CATGAAAATACTGCACTTTAAGTTTCTC^AGAXrrCTGAAATATGTAAAAT^ 
TCCCAGAAATTGTTTCTT ACAGGTTGAAAGTCTTTTAAGG GCATCACAAATT AAGATTT ACT CC 
TAATGCACGCCTAGAATGTATTTTAAATACTTACTAAGAAGAATGAAAATTCTTTG 
ATATATAAATAAGGCATATATAATGACACTGTGTTCTGTGAGGGAGCAGGCCCTGTGAGAAT 
CAATTCAGGACAGTA 1 1 1 1 1 1 1 1 I GTCCTTTCTCCATttrrTGATCAGAGATAAACTATTAAAAC 
TTTAAAAAATACTCAAAAATATGTAAGTTTTTTGGTTGAAC^rrTT AGATTTGCTCATAATGTTTA 
ACATAAjCAACATTTAmCAAATCACT 

A i n 1 1 toT TT ATGT GTGATAGTGGTTCTGTCATCATCATTCA I G I 1 1 I I I AAGGCCTGGTCATA 
AAACTTT AAATTTTACTAGTGTTACTTAATGTATATTCTAAAAAGAGAATGCAGTAACT AATGC 
CCTAAATGTTTGATCTCTGTTTGTCATTAG 1 1 1 1 IC AAAATT A I M ill 1 CTQT AAAGT ATAAT A 
TATAAAAot IV/M GCTTAAATTGAATTTCTATATTAGTGGTTAATTGCAGTTTATTAAAGGGAT 
CATTATCAGTAATTTCA TAGCA ACTGTTCTAG I G 1 1 1 1 G TGTTTTTAAAACAGAATTAG GAATT 
TGAGATATCTGATTATATTTTTCATATGAATCACAGCTGT^ 

AATTAT ATCATACTGATACT ATTTGT AACATTTTGATTTGATTTAATCTCX^AGGGACAGAAATA ' 
ATTCATTGGT AAAGTGTAATAATGCGTTTTTTAAAAATGCTTT GAGAGGTAATTACTTGCATAT 
GAGAGAAATAAAACATTTGGCACATTGTTT ACAGGTGAAAAAAAAAAAAAAAAAAAAAAAAAG 
GNCGGCCCGCTKT AGAGGATCCAAGCTTACNT ACGCGNGGANNNNNNNNNNNNNNNNKMN 
^mN^O^^O^^OO^^{KNN^WGGGGGGGCCCTTAAA ^ inn i aaaaaaaaaacctcccacac 
CTCCCCTGTGCOCNNNNKNNNffNNNNN 
>531
>532

tttttattcaatttgtgaaggacagtttt agaacaaaatgtt agtaacactcttagaacactgg 

TACAATATTTGCAGAGTATAACCACTAGTTGC<rTAGACAAAAGCT^ 

ACTTAATGCAGTTTT ATTAAGAGAGTCAAAATTCTCTCAGTTAACTGGATATACATAGTGGT AT 
ATATCTT AAAG CACA AAACCCCAAAAAACAAAAACAAGGAAAAAAGAAAATACATGT CAACAG 
TCAGTTAAATATTTTGACCTGACAGTTTCTACAAATA^ 

t^acatgtjotaaaacttccag^ 

^y^g^j^gpCAGqCTGGGCCTGI 1 1 1 C CATGAAGOCCAAGGCAOTGATCTTCATCATTAA 
GGAGGGACCACTGTGTCCACAACT AAAACCTT CAACCACATGGTGATCTGCAAAGCTTCATT 
GAAAAAGACAAACATTCI CT ICI I CACACAAATGACTGGAGAAATCTTTGT AAGGCT GT ACCT 

AAAATGATGCCCATGTGCAAAGAGATAAATTACCAGGCAC^ 

TTGATCATATAATCI 1 1 T AACAAAAAGAAAAATGGCAATATCTGGT GACAGTATTAAAAAGACA 
ATGTTGAGATTOGCATCAGAAAAACTC<*^^ 

QAAACATCTATGCACTTAGTAAGCCCT ACTCATGCACAAACAAGCAGTTGCTGGGAACTTGT 

CGAAGTTTAH 1 1 1 1 CAGAAAGTTGAAGTATATAAAAAAAGT ACAAGTTGTGACATTCGTTCAA 

6TTOC1 i^ACATGGGTATTCTGAGCTGGCTACCAGGAAGGAC I IX I G OG 1 1 1 CCGAGAGC 

CAGGTCAGCGTCTCTAGGAGCCGTGG I GCAGGTGCATGGTGCCCAGCAOGGCACr G TCGT 

f£OGCTCCTfTGCACCTTCACG^ 

TCAGCACTTGTOCCCAAACOC^ 

CCAGGAAGgOCTAjCTGGCTATGCACCTACTTGGA^ 

AACTCAGGGTT AGCTCT CAGGGTCTTCTCTGGCTCCTCAGTCXJCACTAATGCAGAGAGA^ 
CTGGCTACTCGGAAACGGCTGTCCtGGGGCTAATCCACTGCTAATTCCGTTAGTTCCTGTCA
AGCACTGGCCATTCCTGCTTTCCATCCCACTGAGCGGGGTCGTGGAGG^G 
TC^^GATGACAC^TCXlATTTGTCiAATGTATTCGATACGGAGCT ACTGTnTTCATATCTGTT 
ACAACAGCAAACT C^TATCAACGTGCATTTTGT(^GCAT(^TCTTCTGTGTCGCCATTC^GT 
GAGTGGCACGGCAGAATCCAAGCTTTGAGCTTCAGTTCTAG C I HUM I ACTCCAAAGTCTT 
CTGAGTCCTCCAAGCATCTCTTTCGATTTGTTCCCAGACTAATATGA^ 

GATGA«X^X>TTTAAGATGCCfTTGTTATAGAAATGGTTCAGATGACAATTCTGGAGGTTCA 

AGAGAGACT GGGCACACTCTTGGAAACCCTGGGTTTGTGCAATGTCTGCT GCTGTCAGGCC 

ACTGGCATTTCTCAGGTCGACGTGAGCXXCATTCGCCACAAGGGCACTG^ 

CTCCCAGAGCGAG CTGCCTTGTGAATGG GAGTTTCACCCT CACAATCCGG TTTGTT AATGTT 

GGCTCCTGCTTGAATCAGCX^GACCAGGCACTGAGGATGTCXXXXXAAAGGCTGCAATGTGG 

GCTGGC^TCTGCGCGTACCGTGTGGTGGAGACGTTGAGTGTGGCTCXX^CTCTCACCAACT 

GCACTAAGCACTCCAACTTGCCGAAATGCGCGGCCCAGTGCACGGGCGTCC^ 

GGAGTCXTTC^GAGGCCAGGTGGGCMSTGGGGTGTCTGCTGCAGCAGCGAGCAGAGCGTGG 

CCAGGTCCCGGTCGCGGCAGGCGCGGTGCAGCGGGAAACGG^CGAGAGCAGCTCCTCG 

CTGGAGAAGCCCGCCTCTACGCCCGCGCCCGCTCC^ 

AGAGGGCGGGGCTCGCTGGCCTAGAGGACGCGTCGGGGAGGACT CGAGAAGCCGCCGCC 

6CAGCACAAAGGAACGAGACTAGCGCCG<X3GTCGCGTCCCACAGGCTGCCGAGCGGAGC 

GOGNNN 

>533 

>534 

NNNNGCTCAACCTTGAAGACCCACAGCGGCXXACAGACAG 
CCTGT ACTCACATCTCTGCAGATGAAAAAGCAGCT GAAAGGAGTCGAAAGGCTCCACCAATT 
CCTCTAAAACTGAAGCCT GATTACTGGAGTGACAAACTACAGAAAGAAGCAGAAGCGTTTGC 
TTATTATCGCCGGACACACACTGCCAATGAGCGGCGGCX^CGTGGTGAAATGAGGGATCTC 
TTTGAGAAATTAAAGATCACATTGGGATTACTTCATTCTrCCAAGG 

tt act cgagccttcag tgaaattcagggact aacagatcag gcagacaaattgat aggacag 
aaaaatctcct gactcgaaaacjggaatattctgatacggaaagtatcgtctctttcaggtaag 
^^gaagaagtggtcctgaagaagctagagtatatttatgcaaaacagcaagcactagaggc 

ACGAAAAAAATGAGGGAACTAAATAACGN 

>535 
>536 
>537 
>538 

^*cgggccgggggtgaatcggccgagggagggactggcagtccccrrcct 

cggcccaaccctgtcgctgccgccgcgct 

TCGCTOGCTCAGCGTCTCCCTCTCGGCX^ 
CGCGGCCGCCGCCTGGGGTTAGCCCTGGGGCTGCT 
GGTTCI GCGGQCGAAGCCCACCGTGGGCAA 

tcggcgagcggccccctgaggagaaccagagct^ 

GCAAGGAGGACTCCAAGACCTTCGACCAGCTCACCCCGG 

ggaagattgtt GATCGAATCCACAATGATGGGGATGGCTTTGTCACTACTGAGGAGCT 
ACCTGGATCAAACGGGTGCAGAAAAGATACATCTTTGATAATGT CGCXiAAAGTCTGGAAGGA 
TTATGATAGGGACAAGGATGATAAAATTTCCTGGGAAGAATACAAACAA^ 
ACT ACCT AGGAAACCCCGCAGAGTTTC^TGATTCTTCAGATCATCACACCTT^ 
TGCCACGTGATGAGAGAAGATTCAAAGCTGCAGAOCTCAATGGTGACCTGACAG ACTCG 
G«AGGAGTTCACTG«rrTTCTGCAT«r^ 

TGGAAACC CT GGAG GACATCGACAAGAACGGGGATGGGTTTGTGGATCAGGATGaGTAT AT 
TGOGGATATu fill C<X^TGAGGAGAATGGCCCTGAGCCAGACTGGGTTTTATCAGAACGG 
GAGCAGTTTAACGAATTCCGGGATCTGAACAAGGACGGGAAGTTAGA 

GCCACTGGATCCTCCC I CAAGATTATGATCATGCACAGGCTGAGGCCAGGCATCTGGTATAT 
GAATCAGACAAAAACAAGGATGAGAAGCTAACTAAAGAGGAA^ 
. G M i u l i GGAAGCCAAGCTACCAATTACGGGGAAGATCTCACAAAAAATCATGATGAGCTTT 
GATAGACACTCACCAGAATATGGCAGACTGTCATA^ 
CTACAATTGT CTAATTTAC AGCAGTTGTGATCCCACAAAAA^ 

GGT ATAAAAA I IGTTTI I CX5CTCAGTATTTACTGGAAAATGGACATCACTAGTCTTTCAGT AA 
CA. . f C . CAA AAC ACGTGAAAAC CTTGG I AAA 1 1 GCaaj I UT I 1 C I GGGGATAT ATTGGTA

caacatgacttaaaac 1 i i i 1 1 1 i u ctatt aaaacttaaaggggaacaaaactt g aaaaagcc 
ctgttcttcag aaggtg agtgggttgagggaggcagtaatatgaagtgactgctgtgtattt 
taact accagattttttatatttgccactgttaaatagttggaaaggggaaattctgattaagc 
gaaagtggtatcatcct aggtaagcttatttcagaacaagtctaatatttcagatt c 1 1 1 C I 1 1 

TCGACTTT ATACT CTGAGTT ATT ACTTACTGTAAGTGGTGT ATATGAAACCTCCATGCATTTTC 

cagtatggatctgctaatatgcacagtaaatccatgtctttgtttgtttttctattaaaaagca 
atcaagaaagataatgtgaaaaagaaaggaatttagaggtagggaaaagatgaatgtcagac 
atttgaagaactatagtaaaatgataaacactaaatatacttgagaaaactttcttaatatgcc 
aatgaggtaggcctgatctttgaaatagtgaataggaatacaatgcatttcctcagtgatcac 
tgattagaatgagttggtgggatccttgggaagccaaacggagcggagttctggatcatgt 
ccc>tccagtccagtgaatccacgacxxgcagacctgcc«xxx^caacagct^ 

GGAATG AGGAC AAGGTGATACTCTGAGCTGTGGACTGAACTGGCAGACACAACCTGTACAG 

attgaaatttx>c«ttgtaaggaggaagtgaatgaaataaaggatccccctaaggaattaaaa 
aaa aaaaaaaaaaaaaaaaaaaaccaaagaaaaaaaaaaaaaaaaaaaaaaaaaaaacttggg 

TTTAAGAACGWN 

>539

>540

>541

9^ T^^^^ ^^^^^^^^^^^^^^^^^^^agcgaacgaatggaaactg 
aaagtggaaatcaggaaaaggtaatggaagaagaaagcactgaaaagaaaaaagaagttgaa 
aaaaagaaacggtcacgagttaaacagatgttgatatatgact acttgtgtcttttaac 

>542 

nnnnncgccggcaatcgagtggcttcctggcxatggagac^g^tctggacccga 
ggacgccagggactcctct c^tctccctggaccgctgtgt aggcacctgagttaacggac 
G £I* A *^ GGGTAGGGGAAG ™<>^^ 

ccctgcjttcctccaacctcagaatggaaatagtc^tcacccatctggctatgttccaggg^ 

aggttgtco^ttgcctccccctcctcctccaa^gagtca^ 

a T? agacgagaagacc <^ caa c<^ 

AAA ^ CAAAAGCAAAAG^GACACAAAAATACTn"CATTrGTTTTGCT ATTACTARTTTfrrnATr 

c ^ G ^ GGAGA I A J G i^ GCGT crGAOT 
GA I GGAGG ^$*^ TA ^ GGG <^ C ^^ 
T9 A II ACA I C H TGATCCAGGATGGTGGGGCGC ^ CATA ^ GG ^ 

^TCCAAAGCGACAATTAC<JTTGTATGGCAAGTCAGATGAAGGTGAAAGTATG<X]AACATTT 

GGCAAAAAGTTTATTGGTGTGGAAGCTGGCGGGACACTGGAGTTACATGGGGCACGGAAG 

GCATCGTCGACGTTGTTGGCAAGGACCCTGAATTCCT 

CTTTGAAAAGGACTTTTCCCGGGGGCTCAATGTGAGGGTCATTGACCA^ 

ATTTTGGAAAGTGAGAOATTTGATACCCATGAATACCGCAAT^ 

GTTTCTGAGATTCCAG GATCCAGGTCG GATTGTTG ccat agct gtcg gggattgag ccg ct 
AAAAGTCTCTTAGAAGGAACCATCCAGATGATCCAGGAACGGTTGGGAAGTGAACT6ATCCA 
AG GACTGGGCTAGAG G GAAGCTTGX3GCTTT AGTTG G TGTCATTGATG GTGGAAGCACTTCTT 

GCAATGAATCX>GTGAGAAACTATGAAAATCATAGCAGTGGCGGGAAGGCrTCrrTGCCCAAAG 
AGAATTTTATAjCT^ 

^m^br' 1 1 CAGGATTCCGGGTAGAGGTTGTAGATGGAGTGAAGCT AAATTTGCTAGAT 
GATGTT AGTAGTTGG AAACCTGGAGACCAGATTGTGGTCGCAAGCACAGACTATTCCATGTA 
CCAAGCAGAGGAt j i iuA OICI iCi iCCCTGt I CI GAATGCAGCCATTTTGAGGTCAAAGTCA 
AAGAAACCCCTCAGTTCCTGCACATX^GTGAGATCATAGACGGTGTAGACATGAGAGCTGA 
. GG7TGGAATTCTT ACCCGGAATATTGTGATCCAAGGAGAAGTGGAGGACTCATGCTACGCA 

GAA ^J? AG J GGCAA ^^ 

TTACTTCAGTC CA I C 11 1 C H ATGTGGAATTGAAACACATGGGTCAGCAGCAGATGGGGnfiA . 

TACCCTGTTCATTTTCAtXrrGTGTTGGTGACGTGGATTATAAAGGAGGATAC^ 

ATTTGTGGACGGCCTGTCTATTCATCAGAGCTTCTCAAGGTGCATCACTGTGCATGGGACAA 
atggcttgctaataaaagacaggattgggtttgacacactaggtgattg i iTcrm IGGAAG

^G^ A ^GAACAGAGGAATACTTTGTTCCACAATCT GGGACTCCT CACCAAGCCGGGT ACT 

ctcctgcccaccgataggaacaactc^atgtgtaccaccatgcgagataaagtgtttggaaa 

TTACATTCCTGTGCCTGCTACTGACTGTATGGCTGTTTGAACTTTCTGGATTGCTCATC 

caataatctgattaataatgcagctgcaggctcacaggatgctggaatatggtatttattcca 
caaggaaccaactggggaatccagtggattgcagctcttggcaaaaccagaactcactcx> 

TTG GGTATATTTrATAACAACAGGGTCCATTCAAATTTTAAGGCTGGCTT ATTTATTGACAAAG 

gtgtcaaaacaaccaactctagtgctgctgacccaagggaatacctctgtttggacaatagt 
gcaagatttcgacctcatcaggatgcaaacxxxxaaaaaaccacgtgttgctgctctaattga 
caggctcattgcttttaaaaataatgataatggagcttgggtcagaggaggagatattatcg 
ttcaaaattcagcatttgcagataatggaataggactgacctttgccagtgatggaagcttc 
ccaagtgatgaaggttccagccaagaggtatctgaatctctctttgttggggagagcagga 
attacggctttcagggtggtcagaacaagtatgtaggcactggaggaatagaccagaagcc 
tcgaacatragcgaggaacaggacgttcccaattagaggcttrcagatttatgatgggccca 

TTCATCTCACAAGGAGCACTTTCAAAAAATATGTGCCAACTCCAGATAGGTACAGCAGTGCA 
ATTGGCTTCCTCATGAAGAATTCCTGG CAGAT AACCCCCAG G AAT AATAT CTCCCT CGTGAA 

gtttggtccacatgtctctctgaatgtctrrtrtggaaagcctggtccctggm 
tgagatggatggtgataagaactccatattccatgacattgatggctctgtgacaggataca 
aggatgctt atgtgggaagaatggacaactacctgatccgccatccaag ct gtgtaaatgtg 

TCTAAGTGG AATGCAGTGATCTGCAGTGGGACCTATGCACAG GTCTATGT ACAG ACATGG A 
GCACTCAGAATCTTTCTATGACCATTACACGAGATGAGTATCCGTCCAACCCT ATGGTGCTC 
CG AGGT ATTAATCAGAAGGCTGCCTTTCCACAGTACCAGCCTGTCGTCATGCT GGAGAAGG 



AACAAGAATGACTGGATTCGAGTTGGCCTTTGCTATCCATCAAACACAAGTTTTCAAGTTACC 
TTTGGCTATTTGC^GCGGCAGAATGGCTCATTATCCAAAATC<1AAGAATATGAGCCTGTGCA 
TTCACTGG AAGAA CTGCAAAGAAAGCAATGCGAGAGGAAATTCT ATTTTGACTCCAGCACGG 
GGTTACTG 1 1 r I iGTATCTCAAAGCCAAAAGCCACAGGCATGGCCACAGTTACTGTTCATCTC 
AGGGATGTGAAAGAGTCAAGATCCAAGCAGCCACAGACTCAAAGGACATCAGTAACTGCAT 
GGOCAAAGCATACCCACAGTACTACAGAAAGCCGTCAGTGGTCAAGCGGATGCCGGCCATG 
CTCACTGGACTCTGTCAAGGCTGTGGCACTCGGCAGGTGGTGTTTACTAGTGATCCTCATAA 
AAGTTACCTCCCTGTGCAATTCCAGTCACCTGATAAAGCAGAAACCCAGCGTGGAGACCCGT 
CTGTTAI 1 I C I GTCAATGGCACTGACTTTACCTT CCGAAGTGCAGGCGTCCTCCTCCTTGTTG 
TGGATCCGTGCAGCGTTCCATTCCGCTTGACGGAAAAAACGG i mTCCTCl I GCTGATGTC 
AGTCGCATTGAAGAGTATTTAAAAACAGGCATCCCTCCAAGGTCCATTGTTCTGTTGAGCAC 
AAGAGGAG AAAT AAAG CAGTT AAACATTTCACACTT ACT AGTACCT CT GGGATTAGCCAAACC 
AGCTCATCTTT ATGACAAAGGGAGTACCATATTTTTGGGATTCAGTGGAAACTTTAAACCATC 
ATGG ACT AAGCTATTTACCAGTCCTGCTXaGACAGGGCCTrGGGGTGCTrGAACAATTCATAC 
CTTTGCAfiCTGGACGAATATGGTTGTCCCAGAGC^CCACTGTCCGCAGA 
ACT GCT AAAGCAAGCTTCAAAAGCACATTAGAGACT AACTGTAACTTAAGTGCTGGGGGAAA 
AAAAATGTGAACT AACnTATTTAATTTATGGCATTTTAAAATGACACTGTTAACCCAACGG 
CATTTTCCAGTTTGAT ACAGAATGGG GAGAAAAGAAAG CG TTTGAAATTATTGCTTGGATACC 
AGCTT CATGCACCTT CT AGTTGTACAAAATGTTAAAGACG TTGTTTGT ATTTGTAAGGCTGGT 
GTATT CAGAGAGCAGA TCTCTTATTCCTCACTTTCCAGC^ 

CAAT G 1 1 1 1 1 AC 1 1 1 1 1 GTATAATGGGGTGGGGTGGAGTGGGGGCTTCTGAGAGTCAGCCTG 
AGGTCTTT AGAGGACCAGCTATTGTAGCACCTTGGATACTT GAAGTTTAATGCTCAGTTGGG 

tcgg gtgg cagt tgact tggtggctggcatgttcaqcagtgcctggggc^ 

cagcctttgaggattttctatgatattgaatgacagttttaagtggc 

atgccci 1 1 1 1 1 gcctggagatgtgctatttttattcactt atatatcaattacttgtaagggt 

taaacttt caaacaggaagtatattgggacaaaag^gctcttggggattagatatccct^ 

tctgtgaccattgggcaaaaaattttcctgcagcaaaagtctgaggct g gaccatttt 

TGCAGCTTTAATCCTTAGCCTCTTTTGACTGTATA I I I G l ti l M AAAATGCAGAGCTCAACTGA 
ATATTTCCTTTTTTG Jill G 1 1 M GTTTtGTTTTAAGAAGTAGGTTGTTTTCCTGAACCGTAAAC 
TTGTATCATTTTAACTTGCACAAAGGAAGTCTGTTCTTGGTATTQCTCT^ 

T g ™ttgttttgtgtggatt^^ 

gaaagctt aatgataccccaaaatgmttacacccagxagaggaaaaaaaggagcxscttrctag 

GGTCAGAATC^^GGAGAGAATACTCAG^AATGAACCTCTTTAAAGCCTTGCAGGAATGAG 
ACTCTTACTTAATGAAATGTTAAAGCCAATTAAAAAGCATGCTGTGATGCCCAGCITCCCTT^
W^CAGGGTGCATGCGTCTCCTGCTGGTGAATCA^ 

A ^^5I5 GGA ^^ G ^^ G ^ AGGAAGAGGAAAGAAG ^ AGGAAAA ^ GG ^TTT ACGTTGTTCTAAA 
CCCCCGACGCATAAAGTGTAGAGGAGGGATGGCCAAGGGTGGGTGGTAGAAAGTGTGTTC 

AGG H^ CACTGGCAATGAGTACAGATAAmcACm ^ 

GGCCTCTACCTTTGTATCCAGGAGAAACTGCAGAGC^GCC^ 
ACCTCAA^GTGCTACCGATAAACCTTTCTAATTC^ 

AATGAAAGTTAG I I I 1 1 1 Ul 1 1 ICTGGTTTG 1 1 1 1 1 1 1 1 1 II I GT ATATTGATGAATGAGATCTT 
ACCTATTAAATATAJTATTGGATTATGGTTCCT^ 

TGTGTGTGTGTGTTTTATGACTTAAATATCTmAC^TGTGTTTTTTAGAGCTTG 

GA I![I GGAGAAGATATG T AA *^^ 

GGGGCTAAG ™ AATAAAAATAT GTGTGC^ 

GCCGCAAGCTTATTC 



x^9cacgcaaaACaflflTTGCAATAATGTGATAAGTTCTTTAAAAGaogt*«GAGcaaC 
fltGCJTTGGGAGC*GAGAA«aGGGAGAAAGcag^^ 
AAGCCCACT ATCTCATTTAATCTTTACAaCtCTCTTGCA^GGTTCC 

AgaTAAATCATGAAG GCCACTATCATOCTCCTTCT GC TTGCACAAG TTTCCTGGGCTGGACCG 

TTTC^ACAGAGAGGCTTATTTGACrrTTATGCTAGAAGATGAGGCTTCTGGGATAGGCGcaGM 

G ™ C J^ TGACCG<5GAC TO 
TcTTCGAGToteagtgtrtfl*ttp^g»f^^ 



T&GCGGjCCGAGGACACAATACTTACTTA^ 
^ AA ^V73^ A ^ C ^ GAT ^ GTCTGGTTAGGATA 1 ' "G ill I AGTGGATATGCTTT AATT 
CG ^£]^™CT GCAGTA ^ 

G ^™I?^ C ^ CACCTAAGGCAATGCA ^ CACATC ^ 

ATGWACAG^TCANN?^^ 

>545

>546 

t 4 CCGGGCAGG T AC J^^ 
TAGJSATGGCCATCTGCCC^^ 
TTTCACTCTCCAAAGAGTCCCTGTO 

ggccggtaaggaaagtggagtgagggaagcagggtaggtggaggtgtgaaagggagaag 

ggcctcatctcaggctggctggacctgcaccagcatcggcctgcatga^ 

ctt gcccaggctgagtatcaagagaagcaagaaatctagataaaaatcca 

TCAGCGTTTTGAGGTTAACATGTTGGCAATTATTCAGCNNNNNNN 
>547 



>551 

* * /v GG ™^ AfiCAAA ^ CA ^ G ^^ 

AACCCTACTTTCTAAGTCTCTACAAGTAATATATAGTGGQTGAGGTL, I I CI i I CI 1 IGTTCTGT 
T>£rCGGATGTGAAACTCTCCT F 1 1 GTAGATGAAACCATTGCGT AAGGAATATAAAGACTTTT 
SSEJ AGTrA TCTT ACAGACTGGAGAGAGTG CTAGTGAATGC 1 1 1 I G I CT ICAATGCCCATC 
TCTTGGAAATATTGAAGGTGGAGTAGCAACCGGGCATT AT ATT ATCT CTTGGAAAAGGACCT 
CAGCAATGGAGAATATCCCCATCATCACAACTGTCATCACTCT GCCGCACGTGATTGTGGAG 
AATATCXXrrCTCCATGTGAATGCAGTGTGTCATC 1 1 I C I U I GTAGATCTGCCGTCATTTGGGC 
GTGTCAGAGAGTCGTTACCTGTCAAG™^ 

TTTCGATTTACGT ATTCCTCCTGGGCACGGGAGGCAGGGAAATGGTT ATATAATTTTCTATTC 
CT CT GGATGGCT GGGATAMN 



Table 4 



PCT/US00/33312 



AGACAGATACAGTGAACCAAGTGCAATATGTAAGGATGAAAGAAGAAGAGATGACAAAGAAA 

^°^ G J AAA J G i^ G J^^ 

TATGTGTAGTGTTTTCATGCTACXnTGGTAG^AAACTTATTTAC^CC 

TTTAAATATAAATAATATAAAGTGCTrCTGAATAAAGCAGAAATATATTACAGTTCATTCCACAG 
A ^ G ^ CGAAA f CACCC ^ 

GAA ? GAG T AGGGA ^^ 
A I AAA £ G J AGAA G GC ^^ 

A ^ AGG T GAAA ^ A ^ GACTGC ^^ 

TAGTGGAAGATTTTAGGTATGTAGAGC^ 

AA T GAG Ji AA II GC H^^ 

AAACAGGTTTGTCCCATAAAGTATTTTTTCAAAGAC^ 
GTTCACTTCCCTGCTGCCATGAAACrrTTGCCTTAAGAAGGTGCTGG^ 

H G JiS^ AGTAGTATAAAACATGAGGC ^ 

TT3S^"^ SS^^^^^^^^il^T ^^^'"'^^^^^^"'"'^^TAAGTGGCCTCrrCTTAGCTCAGTTACTC 
A ^^I CATACGTAGTA, 1 ' J Ji A AAATAATTrrATATCTGTGTACCAC^X^TATA7^CATATTA 

c J?JJ T ^ CA I GTACAGCmcTAC ^ c ^ GTAAGAAGA c^ 

™ AA Z AG ^ C ^ GAGCAGGGGG TGGCAGATGTTCTATC 

GCACTT ATATG CATTGCTAATATGGAATTTAAGATACCATACACAGTCTCTCATGGACCTATC 
^ CT " ATTGTAGAATTATGACTT AGGTCTTT ACTTGCCAAATTTTTCTGAATGTGACCTTTTTTTGCT 

A ^ G J^^ A T C ^ A JP ATACT ACTTT GGATTGT^GTGCTGGTGTAATGTGGAT^AAC^TOAATAA 
ATATTTGACAAATAAAAAAAAAAa 

>553 

~. . . T GGCGGG CGAGGTACCCATCTCTGCC^ 

GAAAAGATCTGGGACT ATCTGAAACTAGTGAGAATTTACACCAAAC<X^AAAGGCCAGTTACC 

AGATTACACATCCCCAGTGGTGCTTCCTTACTTCXiAGCGGCCGCCCC^ 

ACCAAACACTAGCTCAAGCACTGACGTTATTCTACAGGACTATGA 

AGAG TCCGGACAGATAAAGGAAAACAAOCCAAATCCAGGAGGCAATATAAAAGGAAGAGAA 

CAAAACACACATTCATACACTCACACTTAAAAATA^ 

TCTCTT CT GGATGTCTACTTAAAAATCCCATGTGGT 

>554 

_ ^NNNMNCATGAGTT^ 

AAGATCTGTATCAAGATAAAGGAATTCAAATAGCATATATATGACCATGTCTGAAAATGTC^ 
TTCTCTAGGATAATTTGTATTGAAATGAAACJCACX1AGTGTTATCAACTT gaatgtaaatgt aca 

^ ga ^ a /^ tatctgatactacaaacaggatataatc ^ 
gatgtcacaqagaaaagag agc>ggaaaatggc^ 

^^rr G J AGAG ^ >GGAAACAATci ^ cAAM 

GATTCACT AACTGTTGTAAGAGTT AGTGCTCCCATTACT ACAGTGCAAAATGTATTGTTGTTT 

CAATACAAAATTTAAAGCAATCTCAAGAGTACCTCGGCCGCCA 

>555

TGCAGGAGTCCGACCTCC^TCTGTTCCTGGACGG^ 

AGTG I9 GAAG . GCT G CTACTCX^CAT«>CCTCAGCTT«>GCAGTTrCT 

CTACGCCCTGGAGAAGGAGGAGGGGGAGGACAGGGACGGCCAC^ 

ACGTACAGAAGCTGCTTTC<X5GAGAAGAAAGACTCAAG 

ACACTTCTT ATTCGGCCTCGCTAACGAGAAGAGAGCXXAGGAGTTGGAGGCCACTTT^ 

TGCCGGATGTCACCGGACATCAAACAGGAATTGCTGCAATG(2AAAGCA 

TAAGCCCTT ATCCGT GACCGACCTGAAGGAGGTCTTGGGCTGCCTGTATGAGTCTCAGGAG 

GAGGAGCTGGOGAAGGTOGTGGTGG 

CTTCT GAAGTGATGCATTG 1 1 LCI I CAGCCTGAAGCATTGTCAAGAXrTTGCAGAAACTCTCAC 
TGCAGGT AGCAAAGGGGGTGTTCCTGGAGAATTACATGGATTTTGAACTGGACATTGAATTT 
gaaaggtgcacttacctaaccattccgaactgggctcggcaggatcttcgctctcttcgcct
ct6gacagatttctgctctctcttcagctcaaacagcaacctcaagtttct ggaagtgaaac 
aaagcttcctgagtgactcttctgtgcggattctttgtgaccacgt aacccgtagc acct gt 
catctgcagaaagtggagattaaaaaogtcaccoctgacaccgcgtaccgggacttctgtxt 
tgctttcattgggaagaagaccctcacgcacctgaccctggcagggcacatcgagtgggaa 
cgcacgatgatgctgatgctgtgtgacctgctcagaaatcataaatgcaacctgcagtacct 
gaggttgggaggtcactgtgccaccccggagcagtgggctgaattcttct atgtcctcaaa 
gcx^ccagtccctgaagcacctgcgtctctcagccaatgtgctcctggatgagggtgcca 
tgttgctgtacaagaccatgacacgcccaaaacacttcctgcagatgttgtcgttggaaaac 
tgtcgtcttacagaagccagttgcaaggaccttgctgctgtcttggttgtcagcaagaagct 
gacacacctgtgcttggccaagaaccccattggggatagaggggtgaagtttctgtgtgag 
ggctrgagttagoctgattgtaaactgcagaccttggtgttacagcaatgcagcataaccaa 

GCTTGGCTGTAGATATCTCTCAGAGGCGCTCCAAGAAGCCTGCAGCCTCACAAACCTGGAC 

ttgagtatcaaccagatagctcgtggattgtggattctctgtcaggcattagagaatcgaaa 
ctg t aacct aaaacacct acg gttgaagacct atg aaact aatttggaaatcaag aag ct gtt 
ggaggaagtgaaagaaaagaatcccaagctgactattgattgcaatgcttccggggcaacg 
gcacctocgtggtgtgac 1 1 1 1 1 i igctgagcagcctgggatcgctctacgaattacacagg 
aagcgggattcgggtctctaagatgtcttatgaatgcaggtcagagggtcacatgttaacac 
tagagtctgtcgagaggtaggatttgacactggttttctcactatttttgggagattctgcac 

GAGTCACGCAOOCCCTTCACATGAOGCT ATGTACTTTCTCACAGGGATAATAAAGTTAGAGC 

ACTCTCGNNNNNNNNNNNhfNNNNNNNNNNNN 

>556 

NNNNNNNNNI^NNNCGGGGTACTTCT AGAATT AATT AACGCGGGGGAACCGAGGTAC 
GCGGGGGGGGAGTGGCACTCGCAGCT GCAGCAAATCTCAAAATAAAGAGGCAACGGCCTT 
TCTCTTOCTCTCCATCTCOCTATAGCACAC C 1 1 1 1 A l I ICII HCIICII 1 1 I I I I A AGCCTCAC 
GAAAGATTTTACTTGTAG*TCAACTTTCAAAATGTAGG^ 

AAAAATATGTGGAGCTGATCACAAGAAGTGAAGAACCCAGAGCACGAAAGCGGTTGTGACT 

CCTGGGCCGAGGGAGTTGAGAGCGTCTGGGCTTCAGAGGAGCCAGCGCCT CCGAG7TGTC 

TTQAAGTGAGGCTCTGCTGTAGTCCTGTTCTTCTGGCn'CTAAGATCTGAATGTTGrGACCAC 

TAATTTGCTCTTTCCTGGAGGGTAACCCCAGTTTGGTCCACAAGGCTTGCTGCCCAATCTTT 

GGGAACAGTTGAACCAAGACTCTGAGGCTGATATGCAATGTCACTTACAACATTAATCAGATA 

CTGCA TTTAAGATCTTGATGTGGAAGAAATGGAGGACTCAGAACCAAGGATTTCCAAGTGAT 

TTCTTCCAAAGCACAGGAATCTCACTCT GTT AAAGCT GGTCTGTT CT AACTGAGCTGGATGG 

TCCAGGCCCTATTTCTATGATCTGTTTTTTGGAACTGTTGTATTTGTGTGACAC^ 

GCATGATGTCAACTCCCACATACTGTAACCCAGGAAGGAGCATGATGTCAGCTGAGGATACC 

GGAACCGATAAAAACTCAACAGAGCT CAACTCCAT AAAGGACTTCTGTTTTGT AATCT GGTTT 

CGAGATCTGATGCGTGTGTTTCTAAGACCCCGTCACGT GAGGAGGTCTGACGTAGTCACCC 

AACATGACTATGAGAAAATATACTGTAAAGATGTGTGTGGGAN 

>557 



TGGCGGCCGCCCGGGCAGGTAGGCGGGGGGTGCCTGGCTGC AjI I rCCTQCTTTTQ 
GTTCTlTA CAGTAGrrOGGOG TAGGKXrrT 

CTGGGATTTTTTACTGTCTCCTGAACAATTTAACA^ 

AAT TTATATCA A CAATA TGAATGACAAAATTAAAAAGGAAGAATTGAAGAGATCCCT atatgc 

CCTGI 1 1 1 c I cagttggt catgt ggtggacatgtggctttaaagaccatgaagatgaggggg 
caggcctt gtcat attt aaggaactgggct catccacaaatgccttgagacagct acaagga 
tttccattttatggtaaaccaatgcgaatagagtatgcaaaaacagattc 

AAATGCGTGGAACr TT GCTGAGAAAGAAAAGAACAAAGACCACGAAAAAAGCCAAAACTGTG 

gaacagactgagaaacacaaagaaaaagcctggccagggaactccaaattcagctaatag 

aaggaaattcaagacaaatcctcaggtcctgattacctccaaaatatatttitat^ 

actt accagaagagactaaatgagatgatgttatccatgttgttaatgagttcct ggttcagg 

aagtaxxitctggtaccgggagggatgacaatgctttctgtcgaxntgaaatcgttggcagg 

TGGAGTGGCAGGGTGCTTACAGGGTTTTAAATCA^ 
gaataattggggagggttaagactggggtctacccgctggggacacaggggggct AACGA

GAGGGGGGACGGCTGGGGGGAAACCTGTTCNNNNNNNNNNNMN 
>560 

NNfMNCCCCCGAGGGTTCCGATTAAGTGCACGGCACCTCGACCCCAAAAAACTTGA 
TTAGGGTGaTGGTTCACGTAGTGGGCCATCGCCCT gatagacggt cg ccctttaacgttga 
GTCCCACGTTCTTTAAT AGTGGACTCTTGTTCCAAACTGGAACAACACTCAA CGG 
TCNAI IU 1 1 1 G ATTT AT AAGGGATTTTGCCG ATTT CGGCCT ATTGGTTAAAAAATGAG CTGAT 
TTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTT ACAATTTCCATTCGCCATTCA 
G GCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCT ATT ACGGCAGCTGG 
CGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACG 
ACGTTGT AAAACG ACGGCCAGTG AATTGT AAT ACGACTCACT AT AGGGCGAATTGGCCAAG T 
GAGCTCGj^T TGCGGCTOC ATG<^TA A^ 

CAGACAGGCGTAGCCCCGGGAGGAACCCGGGGCCGCAAGTGCGTTCGAAGTGTOGATGAT 
CAATGTGTCCTGCAATTCACATTAATTCTCGCAGCT AGCTGCGTTCTTCATCGACGCACGAG 
CCGACTGATCCACCGCTAAGAGTCCCCCTTCrGTCGTCTT CTCGCAGCCGTACCACACGGC 
GGGCGTCCCGAGACNAAGGGhnTCGATGGACAAGAGACCCAAGCATAGAANNNNN 
>561 

AAGATGGCTCACCAATGGCGAAGACGGTGTCCCTGTCAATTGTG^ 

AAATAACAAAGAAGCAACTTCATCTCT AAATAAAAG GCTGG CATCCCTTT GAGGGAGGGCGA 
G TCGTTG CAGCTGCTGGGCT AAG GACX5TCATCTTTCACGCGAGCGGAGTTTTAATGACACG 
GGCTGGAACTGGGAATTTGGGTATATCTTGAAAGGCAACAACTCTTCACAGCGCTTCC 
TGGTACCTOGCCCGOCACCCG 



BtAGGGAGTCGACCCACGCGTCCGCGGACGCGTGGGCGCCGCGGGAGCAGCCGCT 
ATCTCTGTGTGTCCGCGTGTGCGCCCGGTCCCCGGCTGCCGCACCATGGAGAGCT ACCAC 
AAGCCTGACCAGCAGAAGCTGCAGGCCTTGAAGGACACGGCCAACCGCCT ACGTATCAGCT 



AGATCATGGCTGTCC I C 1 I 1 1 I CCACACCATGCGCTACAAGTCCCAGGACCCCCGGAATCCG 
CACJ^GAOOGCTTTg^ 

GGACXSGGCACOCGGTCCCGAnAACOT^^ 

CTTCGACAAGGCCAG CT ACCGAG T AGCTGGGACT ACAGCT ACCGAGTCT AT7 GCTTGCTGG 

GAGACGGGGAGCTGTCAGAGGGCTCTGTATGGGAGGCCATGGCCTTCGCCAGCATCTATA 

AGCTGGACAACCTTGTGGCCATTCTAGACATCAATCGCCTGGGCCAGAGTGACCC^ 

ACT GCAGCACCAGATGGACATCT ACCAGAAGCtiGTGCGAGGCCTTCGGTTGGCATGCCATC 

ATCGTGGATGGACACAGCGTGG^GGAGCTGTGCAAGGCCTTTCGCC^ 

CCAAGMiCCAACAGCCATCATTGCCAAGACC^ 

GATAAGGAGTCTTGGCATGGGAAGCCCCTCCCCAAAAACATGGCTGAGCAGATCATCCAGG 
AGATCT ACAGCCAGATCCAGAGCAAAAAGAAGATCCTGGCAAGCCCTCCACAGGAGGACGC 
ACCCTCAGTGGACATTG^XAACATCCGCATGCCCAG<XrrGCCGAGCTAC 
AAGATAGCCACCCGXiAAGGCCTACGGGCAGGCACTGGCCAAGCTGGGXX^TGCCAGTGAC 
CGCATCATOGCCCTtSG^TGGGGACACCAAAAATTCCACCl I CI CGGAGATCTTGAAAAAGGA 
G CACCCGGACCGCTT CAT(XiAGTGCT ACATTGCTGAGCAGAACATGGTGAGCATCGCGGTG 
GGCTGTGCCACCOG<^ACAGGACGGTGCCCTTCTGCAGCACTTTTGCAG<X; HUH C ACGC 
GGGCCTf I GACCAGATTCGCATGGCCGCGATCTGGGAGAGCAAGATCAACCTCTGCGGCTC 
CCACT QC<KK?GTTTCCATCGGGGAAGACGGGC<XrrCCCAGATGGCCCT AGAAGATCTGGC 
TATGTTTCGGTCAGTCCCCACATCAACIGIUI 1 1 ! ACCCAAGTGATGGCGTTGCTACAGAGA . 
AGGCAGTGGAACTAGCXXaCCAATACAAAGGGTATCrrcCTTCATCCGGACCAGCCGCCCAG^ 
AAATGCCATCATCTATAACAACAATGAGGACTTCCAGGTCGGACAAGCCAAGGTGGTCCTGA 
AGA6CAAGGATGACCAGGTGACGGTTATCGGGGCTGGGGGTGACCCTG CACGAGGCCTT G
GCCGCT GCCGAACTGCTGAAGAAAGAAAAGATCAACATC<X5CGTGCTGGACCCCTTCACCA 
TCAAGCCCCTGGACAGAAAACTCATTCTCXiACAGCGCTCGTGCCACCAAGGGCAG^ 

caccgtggaggaccattattatgaaggtggcattggtgaggctgtgtccagtgcv^ 
ggcgagoctggcatcactotcacccacctggcagttaacogggtaccaagaagtgggaag 
ccggctgagctgctgaagatgtttggtatcgacagggatgccattgcacaagctgtgaggg 
gcctcatcaccaaggcctagggcgggtatgaagtgtggggcgggggtctatacattcctga 
gattct gggaaaggtgctcaaagatgtactgagaggaggggtaaatatatgtttgagaaaaat 

gaaaaaaaaaaaaaaaaaaaasaaaastcgggggoocoooocgggggoggacaocaaatgo^^ 
cccoctcagoagaog gmn a a a o^anr wartagAATGGATGGTATAGTCAAAATGCT 
TGTCAGCTAAGAATACTGTACCCAGCAAAGCT ATCCTTCaaaAATGAACGTGAAATAAACTCT 
TTCCCAGACAAGGAAAAGCTGAGATAATTCJVTCACCACTAGATTGGCCTAACAAG AAATGATT 
ATGCTGAGTCCTACAGCAAGAAGTGAAAAGCAGGTAATTACTGTCATGAAAACATCAGAAAT 
TATAAAACTCAGAGATAQAGATAAATTTGTAaTCAAAcTcGTaalACTCCAATACTATGATGATGC 
TgtgTaAATCTTtC*gacctCAacictpa*9gfTaAAAGTcaaaaTAGtnalMCta - 

NACATACTGTGCTAGGTGCGAGGCTGCCTCGTGAATTCCTGG^ 
GTCTGGTTATTATACAGACGCATAACTGGAGGTGGGATCCACACAGCTCAGAACAGCTGGAT 
CTT GCT CAG TCTCTGCCAGGGGAAGATTCCTTGGAGGAGGCCCTGCAGCG^CATGGAGGG 
AGCTGCTTTGCTGAGAGTGTCTGTCCTCTGCATCTGGATGAGTGCAC nTTOCTI GGTGTGG 
GAGTGAGGGCAGAGGAAGCTGGAGCGAGGGTGCAACAAAACGTTCCAAGTGGGACAGATA 
CTGGAG ATCCT CAAAGT AAGCCCCTOGGTGACTGGGCTGCTGGCACCATGGACCCAGAGA 
GCAGT ATCTTTATTGAGG^TGCCATTAAGTATTTGAAGGAAAAAGTGAGCACACAGAATCTGC 
TACrrCCTGCTGACTGyVTAATGAGGCCTGGAACGGATTCGTG GCT GCTGCT GAACTGCCCAG 
GAATGAGGCAGATGAGCTCCGTAAAGCTCT GGACAACCTTGCAAGACAAATGATCATGAAAG 
ACAAAAACTGGCACGATAAAGGCCAGGAGTACAGAAACTG Cil I I C I GA AAGAGI I fCCTO GG 
TTGAAAAGTAAGCrTGAGGATAACATAAGAAGGCTCCGTGCCCTTGCAGAT^ 
GGTCCACAAAGGCACCACCATCGCCAATGTGGTGTCTGGCT CTCTCAGCATTTCCTCTGGC 
ATCCTGACCCTCGTCGGCATGGGTCTGGCACCCTT CACAGAGGGAGGCAGCCTTGTACTCT 
TGGAACCTGGGATGGAGTTGGGAATCACAGGAGCTTTGACCGGGATTACCAGCAGTACCAT 
^^TACGGAAAgAAGTG^ 

GGCAATACmCCAACTCACACGAGGCATTGGGAAGGACATCCG 

GAGCCAATCTTCAGTCAGT ACCGCATGCCTCAGCCTCACGCCCCCGGGTCACTGAG CCAAT 
(TTCAGCTGAAAGCGGTGAACAGGTGGAGAGAGTTAATGAAGCCAG 

AGAGGAGTCAAGCTCXCGGATGTG6CCCCTGTAAGCI I CI I ICI rGTGCTGGATGTAGTCTA 

cctcgtgtacgaatcaaagcactt acatgagggggcaaagtcagagacagxttgaggagctg 
aagaaggtggctcaggagctggaggagaagctaaacattctcaa 

ggGgGGACCAAGAACTGTGA GC^ 

AAGGTGGAGCTTGCAGTGAGCC&AGAOT^ 

GAGj£Tr^TCTCAAAAAAAAAAAAAAAAAAAAAAAGAATATAT^ 

A^SCTTGAAGaAACCAGCAAT 

CAGGTTCAATGTCCTCCGAAGAATGAAGTCTTTCCCT 
CCAGCATCCACTCTCCCTTGTCCTC 



GACCCACGC GTCCGCTCOCGTGTCGCTCGCI I ICIGTCAGO CIC1 CI CCCTC T CCC 

ICIUXXJICIGCI rOCTCTOGCI l€C1CICIOGCAOCTQAGOGTAOGCAOCTG000GGG0C 
CGGCTTOXT C C IOLI UICXXXIOCCI C I I l 4 XA^:?XXre«CrxreyWAfflXTC^^ 
CGTCACOGCCGCCCOCCCAQACAAGATQGACACOl^ 
• T CSGTC ACAAQQA^CAOCTGAQAAACOGCTTTATCATU C I I UIU I ATQTACTGGCAGTATTA 
AGTTTATCX>TGAAG^TGCTTAGTTCAATGGCT 
TATGCAAGCACAGATTTGCTTTrACACCAATTTATTCTCCAGA 
TTCAAGlACATATTTGCTGGACTGvGTTACAAGTATTGGCACTGCAATACGATATTGGTTTCjATT

ATACACTTGTGGCCTTTGCATGGTTGGGAGTTGTTCCTCTTACAGCATGC^ 

TGCTTGTrTACTGGCTCCGTGAGXn^CACTACTGACGCTGCCATTAGATATGCTGTC 

CAGCCTGGTGTG^^ 

GCTGCCCCACCGTTCAATGCTGKXiGGGCATCACCAAAATGAGGCTCGAGCA^ 

GTGCAGAAAATGTTGCTGCrrGATCAGCCTGCT AACCCACX^GCTTGAGAACCCAGTGGTGGG 

GGAAAACCCTGLATGCCCAGGATGACCAGGCAGAAGAGGAGGAGGAGGACAATGAGGAGGA 

AGATG^CGCTGGTGTGGAGGATGCGGCAGATGCTAATAACGGAGCCCAGGATGACATGAAT 

TGGAATGCTTTAGAATGGGACCGAGCTGCTGAAGAGCTT ACATGGGAAAGAATGCTAGGAC 

TTGATGGATCACTAGTTTTTCTGGAACATGTCTTCrG^GTGGTATCTTTAAATACACT 

TCTTGTTTTTGCATTTTGCC^rrTACCATATTGGTCATTTCTCCCTTGTT^ 

GAACACGTCCAAGCATCTCATTTTGAAGGCCT AATGACAACCATAGTTGGGTATATACTTTTA 
GCAATAACACTGATAATTTGTCATGGCTTGGCJWUrrCTTGTGAAATTTCATA 
TTACTGGGAGTCrTGCTATATTGTTGTTAAGGTCrrCTTTGTTAGTGGTGGTAGAAATTGGAGT^ 
TTCCCTCTCATTTGTGGTTGGTGGCTGGATATCTGTTCCTTG<iAAATGTTTG^ 
AAAGATCXSAGAACTGAGCTTTCAGTCGGGTCCAGGTACTACX^TGTTTCT AGT 
GGGAATGGTATATGTCTTCT ACTTTGCCTCCTTCATTCTACT actgagagaggtacttcgacc 

tggtgtcctgtggttitcrraaggaatttgaatgatccagatttc^ 

ccatttgccaatatataggcatctccg^gatttattttct 

gtcct cctgatgcrrttggcrrrcctat acgt ataattaagagtgtgct gccta a i imli i cca 

tacaatgtc^tgctct acagtgatgctccagtgagtgaactgtccctcxiagctgct^ gct 

tcag4otx3tcttgxxagcattactcgaacag 

gtgcgagcgtggactgtgaccgccggatacttgctggatcttcattcttatttat^ 
ccaggaagaaaatgaaaacagtgcaaatcaacaagttaagaataatcagcatgctcgaaataa 
caacgctattgctgtggtgggagaaggocttcatc 
ggagggcctgttggctttc*gccttacccc^^ 
gttgattgtcrrcatgtgtataacattactgattgccag^ 
atttgctggccgttggttaatgtcgttttggacggggactgccaaaat 
cagctgcttgtggtctctatgtttgctggctaaccataagggctgtgacggtgatggtggc 
atggatgcctcagggacgcagagtgatcttccagaaggttaaagagtggtctctcatgatca 
tgaagactttgatagttgcg^gtgctgttggctggagttgtcxxrrct ct ggggctcctg 
tttgagctggtcattgtggctcccctgagggttccctt ggatcagactc u i u i llll 1a tcca 
tggcaggactgggcacttggagtcctrgcatgccaaaatcattgc>>gct ataacattgatgg 
gtcxrrcagtggtggttgaaaactgtaartgaacaggtttac<3caa^ . 
gaccttcactatattgttcgt aaactggcagctcccgtgatcrrctgtgctc ccct g 
tgtgt acctt atgt catagcttctggtgttgtt ccttt act aggtgttactgcggaaatgcaa 

AACTT AGTCCATCGGOGGATTTATCC^TTTTTACTGATGGTCGTGGTATTGATGG 
TCCTTCCAAGTCCGCCAGTTTAAGCGCCTTTATC ACCTT GTG 
GGTCAACGACTCGTGAACT ACGAACGGAAATCT GGGAAACAAGGCTCATCTCCACCACCTC - 
CACA GTCA TOCCAA G AATAA AGT A GTTGTC TCAACAACTTGAOCTTCCGCTTT ACATGTCCTT 
TTTTGTGGAtil rCTCTCI I IGQAGATTTTTCCCAGTGATCTCTCAjGCtil IUI 1 1 1 IAAGTTAAA 
TGT ATTTGACTT GTGTTCTCAGCATTCAGAGAGCAGCGGTGT AAGATTCTGCTGTTCTCOCT 
GGAT CTTCTGACATT ACTGCTGTCTGAGATTTGT AT ATGTGT AAATACAAGTTOCTTGATACC 
CTAAAACCTTGGATTAAACAGAATGTGCATTGTACATCTTTAAACAAAATCT 
T AAATCTAGTT GTCACTTT ATTTTGGACCTGCTCTGATCTCGACAGGAAACGTGCOU^AGJ^G 
CAGT AGT GCGCAGGCAAG>CTTTTGAGT GACGCCTTGTGGAACGGAGTTCATGATGTCCT A 
GCAGCTCTCACT AAGGGAACTGT AGATTCTTT CTTTCTTGGCT ATTCAGACCTT ACCAAGAAC 
GTT AAAG^jAAACAAGTAGAAATCXGGAGTGGAGTGTCTCTG^n" AAGAAAACATGAACTTT AT 
GCTTCACTGTTAG QG TTTO1 GGAAGTTATTTTGTATAACACCAAAGGTGTTGTACATTTCCTA 
CTGCCTGAI 1 1 1 1 1 1 C ATGTGTC 1 G I U I II GT AAT ATTGT AT AGT ATCTTGTGCTAGGTGAGGA 
AATT ATTTTT AATTTTGATAATTTAAT ATTCCTAGTGTGATCAGCATT GGGTTT CAG 
TGGGGCATGTCTATACTT AGAGAAAAAAAGTCCAAATGAAGATTTTCATGAGTCAGC 
CeOOOGCOOOCAOOOCACAOOCACMCCT CTCI 1 1 lOC ACACACAACT ATCTgl 1 1 ATTTTTT 
GTAGCAGTGGCCGAAAGTCCTGCAAGGTCATAAATCTTTCAGAGTGACAT GT AC 

TGCATCTTACTGGATTT AGGACTTCTGJ^TGCTTGTG^AGTATAGATCTGGTTGTGGTCTTA 
„ r „ A - , 1 AG AGAAGAC i GG i i AGAAUAI CTGGTCTCGCTGGTTAGTGCCTCGTTGG

CTCAG^ACTAGGTGTGCATTTCTCCTAGCTTrTCATCAGGAAATOOCAAAGTlTCCAAAGC^ 
I^ G JIT ACAGAA J AAA ^^ 

GAGCTGGCCTmAAC^TAGGAATGTATTTCGTTGGAAACATTCTGAAAA^ 
G ^ CCC £ A £ AAAC ™ GT ™ CCC ^^ 

GAATGGTTGGGTACATGATCTAAATGTTTAATGCTAAAGGTATATCGTAAGGGTAGTGTTTGT 
!I™ G ^ ACGATAA ^ AGAAG TTCTCAT^^ 

AGAA I7^ CATATAGTA ™ AAAATG ^^^ 

CAGCCAAACATAA^CTACGAAAAT AAAGAGCAATGTGTTCTGGCTGTTTTAT ACTT CAACAAT 
T7TTTCCCT AAGTGGTAAGCAATTACTTTAAAACATATTTTTAAAAACATCGGTATCGGGAGCT 



AC H AGGCAAAATOGAAAAAAAAAAAAAAAA ^ 

AGTGTTATACTCTTT GGCACTTT 

>570 

>571 

>572

™ . ^^^N^NNCAAATTCTCTTTTATTGCTTTATGTCCCTGACTrCATGCAGGCX^^ 

I ^9, 9J?^^ AA ^ AGGGAAGAAGAGGG ^ G ^^^^ AAA ^ AAA ^GGCTGCGTGGGTCCG 

AGGWTGGGTGCCCAGCCAGGCGGAGTGCGGGGCGGGGCCTCTOGGGCX^CAAGGTGG 
SP^P 01 " GGGCCGGGAGGTGGGAGGTGAGCTCCGCCAGCCCAG CCCACTGCAGACAGG 

AGGGAGCCCCCGCCCCCTCAGCTGTAACCGGGCCGGCTTCAGTGTCGCCGC^ 

GA J G 7 G I CAG ^ CCG j* GGGGmG ^^ 

]££ CA ? CG 5 GGTCT ^ CCT ^^ 

gtaaaacgaagatcaggagcagatttgaagaattacaaagtgaattggtgccagtcagS^g 
aaggaagactcatgcaac iigiiii ctggcaatgaaagcagcaaattagaaaatgagtccaa 

AC J1^7 CA ™ AAACACTGATAA ^^ 

GCCCAGGj^AAATTATATTCCAGATCATGGTGGAGGTGAG gattcttgtgccaaaacagacac 
aggctcagaaaattctgaacaaatagctaattttcctagtggaa^ 

AA ^ CAAA I G ^ CAG ^ CAGAAA ^ 

CCAGAATCAGCTAATGAAAAGACTTATTCAGAAAGCXXXrrATGATAC^ 
A TTTATTrCAAAAATAAAGAG 

GCTCTTATCT AGGGAG7TTGCAGAACATGGTGT7TGGGGACAGGTTGTGGTGAAGGTGGGA 

AAATGGGGGGCTTTAAGGTTTAAAAAAGCCGGGTGGATTTGANNN ^^^^^ 

>573 

>574 

>575

>576

>577

cyfccgcacaUTTTGTGGAGTGCCTATTACGTGCCAGAAGCT GTTCTGGACACTGAGAA 

GGAGGGGAGGA CCTTACTT AGACTCCCAGCGCACGTGGAGCAGTCTGCCGGTCGGTTGTCT 
Cj^GCGCGCGCCAOCttSGGCCTCTC^TG^ 

ccc 5^? A £* c ^ G i^^ 
S G ^TH^" G0CACGCGCCCC ^^ 

GACCCCCCAGAAT AAGATTACAGTTGTTGGGGTTGGTGCTG7TG GCATGGCCTTVTCJr^lATr 
AGTATCTTAATGAAGGACTTGGCAGATGAACTTGCTCTITGTTGATGTC^ 

T G ^ CA ^ AGA ^ r ^7 ^ T ^AACTGCAAACTCX>AGCTGGTCATTATCACGGCTGGGGGACGTC 
AGCAAGAGGGAGAAAGCCGT<TrTAATTTGCTCXXAGCGTAACGTGAACAT 
A III^T AATG ^®^ AAAA ^ A ^GCCCGAACTGCAAGTTGCTTATTGTTTCAAATCCAGTGGAT

A J™ GAC 5 TACGTGGmGGAAGATAAGTGGimcc <> A ^ 
ZZ G P^ C 7 GGA ^ CAGCCCGA ^ CCG ^ ACCTGATG ^ 

TrAAGCTGTCATGGGTGGGTCCTTGGGGAACATGGAGATTCCAGTGTGCCTGTATGGAGTG 
GAATGAATGTTGCTGGTGTCTCTCTGAAGACTCTGCACCCAGA7TTAGGGACTGATAAAGAT 
AAGGAACAGTGGAAAGAGGTTCACAAGCAGGTGGTTGAGAGTGCTTATGAGGTGATCAAAC 
TCAAAGGCTACACATCCTGGGCTATTGGACTCTCTGTAGCAGATTTGGCAGAGAGTATAATG 
AAGA '^ G 33 AGG ^ GGG ^ GGAGGGAG ^^ GGAG ^ A ^ G A ^ AAGG ^CTTT ACGGAATAAAGGA 
V^L^LSP 0071 AGTGTTCCTTG CATFTTG GGAC AG AATGGAATCTCAGACCTTGTGAAGGT 
GACTCTGACTTCTGA GGAA GAGGCCCGTTTGAAGAAGAGTGCAGATACACTTTGGGGGATC 

caaaaggagctgcaattttaaagtcttctgatgtcatatcatttcactgtctaggctacaaca 
ggattctaggtggaggttgtgcatgttgtccttritatctgatctgtg^ttaaagcagtaata 
ttt^gatggactgggaaaaacatcaactcctgaagttagaaataagaatggtttgtaaaat 

CGAC £ GC I A J A I C £^ 

™ AG ^ CTGCCAOTCTGACGCACCACTGC ^ T ^^ 

GGCAGG J G <^ G TI TA <^ 

G Z GCA 7 AA ZT^?£ AAAGGATC ™ T ^ GTC 

TGTAAAAAGATCT AC^TACAAACAATGCAAfX^AACTATCCAAGTGTTATACCAACT AAAACCC 
CCAATAAACCTTGAACAGTGAAAAAAAAAAAAAAAAAAAAAAaaaaattggggcgcgccoo 

TaogoccaoaoaoacoggocgcaaogQggtncocraaaaaaccog 

G J A ££ AG £ AGGA J C IIH CT ^ 

GGTrCTTCCCGAAACTCCCAAATATCTATGGAGAGCTGTTCT AGCTTTTGCACAGGG AACCA 
9T GG ^ CAGAGG T ATCATTAAACATGTCCATGTATTGCGAAGTCTGAGGAAACTCAAGCTCCT 
CCAGTCCTTTTAAAATCTTT GCAATGTAGGGATAATTTTTCTGCAGAATCCTTGCGAACAACC 

T C I CC I CA ^ TCC ^ GAAACTG ^ CCCAATGATGACC ^ 
AGTTACTCCATAAAAGATTGTTGT ACCTCGGCCG CT CT AG 
>579 

AACTT CTGGTTGCT AGGTGTGGCTTCCTTTAAAATCCTATAAAATCAGAAGCCCAAGTCTCCA 
CTGC CAGTGTG A AATCTTCAGAGAAGAATTTCTCTTTAGTTCTTT GCAAGAAGGT AGARATAa 
AGACACTTTTTIWAAAATGG^ 

! GAAG /^ CAG j^7 A 7^^ 

AG f*^ A 7 c 5 rAC 5^ CAA T c ^ 

I AAAGG ™T GGA I GA ^ CAAC ^^ 

A ^S AG ^T C ^ AAGGAG ^ TATCTCCAGGAAACAG 

GCCCTT ACAGGTCACCTTGAGGAGGTTGTTTTAGCTCTGCTAAAAACT CCAGCGCAATTTGA 
T^GGCATCAAGAACTAACAAAGAAATCAGAGACATTAACAGGGTCTACAGAGAGGAACTG 

^ 6A ( G ^2^ G «S AAA ^ A JP ATAA ^^ 

IISS^ A iS^ a ^? 3CGATCTGAGGAC ^ GGTCT 

A T GGC * GGGCC 77^ TGA ^^ 

TACCATCCTT ACCACCAGAAGCT ATCCACAACTTCGCAGAGT GTTTCAGAAATACACCAAGTA 

TCACAGCT ATCGTGAAGTGCGCCACAAGCAAACCAG C 1 1 I CI 1 1 G CAGAGAAGCTTCATCAA 
GC^TGAAAGGTGTTGGAACTCGCCATAAGGCATTGATCAGG^TTATGGTTTCCCGTTCTGA 
AA TL GACATGAATGATATGAAA GCATTCTAT^ 

CATCCTGGATGAAACCAAAGGAGATTATGAGAAAATCCT GGTGGCTCTTTGTGGAGGAAACT 

AA ^II C ^ C Z3^T GG IEI9 AAGCTATQATCAG ^ 

^S c J^ AA ^I AG ^ A i AG Z !T ^ CAACAGGA ^^ 

AGC <^ AAA I CA T^ 

TTTCCCCAAACCATAAAACOCT ATACAAGTTGTTCTAGTAACAATACATGAG AAAGATGTCTA ' 
TGTAGCTGAAAATAAAATGACGTC^C^AGGCAAAAAAAAAAAAAAAAAAAAAAGaaagaaaaaaa 
_ _ A NNCTG^TmAGCAGGACTGATCGTTTTTCTAAGTAGACCTG AGCTTTGTTrATCAG

CCTGCCAAAAT^CAGTTGGATTTTTGGAA 

TTTTC^A^C^fSI?^^ 

iii^v;iS^ G £i GA ^^ 

^ A T£E C Z A £ G 7£ ACA ^^ 
G 5?5SS A J s ^ G Z A i^IS^ 

AACATTGACCCAGTCAAAAGCTTCTAATGGT 1 1 C 1 1 1 1 IC TrCCTC<>GTTTTAGTrrGCTTTT 
CA^TGGTATCAAAGCACAAATACTTrGGAGGGGAATGCGTTGAACTG 

TI AC I r J£^I^^ G . CCCCCACC ^ 

r^^^^^I^^^^^^^^^ 1 ^ G ^ rn ~'^ r ^ G ^GATTGCCTCATTCTCCAGTCAACT 
C ^T C T GGC ^ T ™^ CAGAATCAGGGAA ^^ 



- ^ . .'i A ii CTA £ CCGCGGT GQCGGCTGAGGGACATTTCGTATCGGTTATACAACATTTGTTT 
fGTCCCTGCATTAACCTC^ 

^t^tgtt^aS^^ 



CCAAGAGGCATCCATACCCTGGGTTTTCCTCCAA^ 
OCT^GGTTGATGCTGTATT^ 
T ^^^^^ NNNCCGCGCGGCAGGTACGCGGGGAT GACG

C Z C $II^I GGTGTAGAGTCTGCTCTCTCCTCG ^ GAA ^ 

A ^ TA f? GGG ^IL^ 
r£ AAC JJ£^ A 7 AAAG irc 

CG ™^P^ AAGTGTGA ^ GTA ^ GTCCA ^ GCGGGGGAG <5GG^ 

?? GG ^ GC ^J A 7?Z C J AGCACAAAAAAGGGGAG 
G I ACGA f^ c J TGTA ^ CGGCCACATAGGAGTA ^^ 

ag^S^ 



GGGG ^ G ^J^J£ CCGGATAAGA CCCGCTGTCTGGa^ 
AGCCGC A G ^i GGAGCG ^^ 

£ G ^ GA £ G £ G ^I GA ^ G ^ AG ^^ 
GCC IS^I G ^^ CGCTCTGACAC ^ 

^°3 G 77. C J CC ^ G ^^ CCGAAATCACCTG ^ GCAT 
C I A T A ^ AA ^ GC ^ GAGGAA ^^ 

^ G 7i G ^T A 7^ G ^ c ^ CAC I rCA ^^ 

AGGGTGACACATCTGGAGACTACAGGAAAGTACTGCTTGTT CTCTGTGGAGGAGATGATTAA 
AATAAAAATCCCAGAAGGACAGGAGGATTCTCAACACTTTGAAT/TTTTl^ 

* GAG 7 GG Z A ^ ATGA ^ ATGrcAGAATGG ^ a ^^ccaawa>CC<Sg^Sa^Sctqgctccta 

G ^7 A ^ AGAC 7?J C JF A ^ A ^ A ^ CACCTATA ^^ 
?JIS A ^ AGA ZI AC PTSI A £J^ 

TTTC77TTCAGTGAAAAATTTTTTAAATGGAAGACTGTTCT aaaatcacttttttccctaatcca 
TTTGTCCATAGCATCAGTTGATCCTTACTAAGTTTTTCATGGGA 

^ G T7 G 4 AA I C ^ C1 ^ GTAGTCAAAGTATA ^^ 
G J A Z GG 7 TA I ACA>A ^ TATA ^ TCTGG ^ A ^^ 

^ C ^ G ^^I G ^ A ^ G HHT^^ AGTATGATATAAA ^^ 

^^^^^^^^^^^^^^^^ GGAT ^ AA ^AAAAAGTT cacttagtaaaaaaaaaamaaagg 
CGGCCGCTAGAACTATCTAGAGAAAAAACTCCCAACCTCCCCCTGAACCNNra<INNNNNNNN
N 

>568 

cacnTTAATTCTTATGAACTT GGGCAAAAGGATTAAAATTCTACTTTGGT ATCTTTTGA 
AAGAAAACAGAAAGTGGACATTTTAACTATAA6TTTCTTATCAGCCTACAATCAGATGAA6GA 
AACTGGTAAAAATTTCTCACTGTAATAATTXiAAGAGTCTGATATTCCACCATAC^ ATTTT ATG 
GATGAAGTAATATTTGGGATAATATAAA>«3ACTTGCATAGATATGTATATCAAATTAATAGTTT 
AACTCGGATGTGAGCTrTTAATTTCTTATTTAGTTATTAAGCATACTGCCTCCTGAATTAACT 
CTTTTCAI 1 1 101 I CAAGATCTAATTATCATGGACCTGCAACAGTTTCTTATGTGCCTGTCCCT 

gtgcacagcctttgocttgagcaaacccacagaaaagaaggaccgtgtacatcatgagcct 

cagctcagtgacaaggttc^caatgatgctcagagttttgattatgaccatgatgccttcttg 

ggtgctgaagaagcaaagacctttgatcagctgacaccagaagagagcaaggaaag 

gaaagattgtaagtaaaatagatggcgacaaggacgggtttgtcactgtggatgagctcaaa 

gactggattaaatttgcacaaaagcgctggatttacgaggatgtagagcgacagtggaagg 

ggcatgacctcaatgaggacggcctc«tttcctggg^ 

ct acgtttt agatgatxx^agatcctgatgatg g attt aact ataaacagat gatggtt a gaga 
tgagcggaggtttaaaatggcagacaagg atggagacct cattg CCACCAAGG AGGAGTTC 
acagctttcctgcacxx^gaggagtatgactacatgaaagatatagtagtacaggaaacaat 
ggaagatatagataagaatgctgatggtttcattgatctagaagagtatattggtgacatgta 
cagcx^tgatgggaatactgatgagccagaatgggtaaagacagagcgagagcagtttgtt 
gagtttcgggataagaaccgtgatgggaagatggacaaggaagagaccaaagactggatcc 
ttccxh'cagactatgatcatgcagagxscagaagccaggcactrrggtctatgaatcagaccaa 
aacaaggatggcaagctt accaaggaggagatcgttgacaagtatgacttatttgttggcag 

CCAGGCCACAGATTTTGGGGAGGCCTTAGTACGGCATGATGAG1TCTGAGCTGCGGAGGAA 
(XCTCATTTCCTCAAAAGT AATTTATTTTTACAGCTT CTGGTTT CACATGAAATTGTTTGCGCT 
ACTGAGACTGTTACT ACAAACTTTTTAAGACATGAAAAGGCGTAATGAAAACCATCCCGTCCC 
CATTCCT CCTCCTCTCTGAGGGACTGGAGGGAAGCCGTGCTTCTGAGGAACAACTCTAATTA 
GTACACTTGTGTTTGTAGATTTACACTTTGTATTATGTATTAACATGGCGTGTTTATTTTTGTAT 
TTTTCTCTGGTTGGGAGTATGATATGAAGGATCAAGATCCTCAACTCACACATGTAGACAAAC 
ATTAGCrCTTTACTCTTTCTCAAOCCCTTATATGATTTTAAT AATTCTGACTT AACTAATTTTGT 
AAGCCTGAGATCAATAAGAAATGTTCAGGAGAGAG GAAAGAAAAAAAAT AT ATGCT CCACAA . 
TTTATATTTAGAGAGAGAACACTTAGTCnTGCCTGTCAAAAAGTC5CAAC^ 
AGGGGCCACATATT ACATTCAGTTGCT ATAGGT CCAGCAACT GAACCT GCCATT ACCTGGGC 
AAGGAAAGATCCCTTTGCTCT AGGAAAGCTTGGCCCAAATTG A1 M ICI 1 U I 1 1 1 1CCCCCTG 
TAGGACTGACTGTTGGCTAATTTTGTCAAGCACAGCTGTGGTGGGAAGAjGTT AGGGCCAGT 
GTCTTGAAAATCAATCAAGTAGTGAATGTGATCTCTTTGCAGAGCT AT AGATAGAAACAGCTG 
GAAAACTAAAGGAAAAATACAAGTGTTTTCGGGGCATACA I I I 1 1 H I CTGGGTGTGCATCTG 

. TTGAAATGCTt>AGACTT AATT ATTTGCCTTTTGAAATCACTGT AAATGCCCCCATCCGG TTC 
CTCTTCTTCCCAGGTGTGCCAAGGAATTAATCTTGGTTTCACT ACAATTAAAATTCACTCCTTT 
CCAATCATGTCATTGAAAGTGCCTTTAACGAAAGAAATGGTCACTGAATGXSGA A i rCTCTTAA 
GAAACCCTGAGATTAAAAAAAGACT ATTTGGAT AACTTAT AGGAAAGCCT AGAAOCTCOCAGT 
AGAGTGGGGAI 1 1 1 1 1 1 ILI ICI IOCCI I ICIUI I f I GGACAAT AGTT AAATTAGCAGTATTAG 

' TTATGAGTTTGGTTGCAGTGTTCTT ATCTTGTGGGCTGATTTOCAAAAAXXiACATGCTGCTGA 
ATTT ACCAGGGATOGTCATAGCTCACAATGCAAAOCACTTACTAOCAGG C i C 1 1 1 1 I C I UIti lC 
CACTGGAGAGCTTGAGCTCACACTCAAAGATCAGAGGAC^ 

TGAGGACCATGGCTTACCT7T CCTGOCTTTGACCCATCACACCCCATTTCtTrCCTCTTTCCCT 
CTCCCCGCT GCCAAAAAAAAAAAAAAAGGAAACGTTT ATCATGAATCAACAGGGTTTCAGTC 
CTTATCAAAGAGAGATGTGGAAAGAGCT AAAGAAACCACCC HI G H U CCAACTOCACTTTAC 
CCAT ATTTT ATGGAACACAAACACTCTCCTTTTGGGTC 

GAAGAATTATCQTATTCCACG 1 1 1 1 1 AGCCCTCAGGTTACCAAGATAAATATATGTATATATAA 
CCTTT ATT ATTGCTATATCTTTGTGGATAAT ACATTCAGGTGGTGCTGGGT GATTT ATTAT AAT 
CTGAACCTAGGTATATGCTTTGGTCTTCCACAGTCATGTTGAGGTGGGCT 
AAAAAGCCAGGTATAATGTAACTTCAOCCCAGCCTTTGTACTAAGCTCTT GATAGTGGATATA 
CTCTTTTAAGTTTAGCCCCAATATAGGGTAATGGAAATTrCCTGCOCT 

TTACT ATT AAGAAGACCAGTGATAATTTAATAATGCCACCAACTCTGGCTTAGTTAAGTGAGA 
— - TAGGTGGAGAGAGCCCAGGGCTTAT 
GTTAAAATCATGGACTT GAAAA GC AAAOCTTAATCT GCAAAGACAGCAGCAAGCATTATACG
GTCATCTTGAATGATCCCTTTGAAA I rTTTTTTTTTGTTTGTI I GTTTAAATCAAGCCTGAGGC 
TGGTGAACAGTAGCTACACACCCATATTGTGTGTTCTGTGAATGCTAGGTCI C 1 1 GAATTTGG 
ATATTGGTTATTTTTrATAGAGTGTAAACCAAGTTTTATATTCTGCAATGCGUVACAGGTACCTA 
TCTGTTTCT AAATAAAACTGTTTACATTGATTATGGGGTATGTATGAC^rrTCATTTTCCAAGAA 

atagaactctagcttagaattatggatgctct aaaatgtcag aatgggaactctccttgaagt 

TCTCO^AAACTCAGAGACAGCACTGCCTTCTCCTAAATGATTATTCTTTTCTCCC 
ggtattttct AGCATCCTTCTC^UXACAGCCATAACCCTTTTTTACTTCCATTAGGGCGTATA 
actg gagagacctgct g ctcgtt at at AATT atct GAT ACCAAACACAAGCGATTCTGAATG t 
ACAGACACGTTTATCT ATAGAGCAT AAGGTAAAACAATTTTGTTTAACATTTTGGT ATTGTTTA 
GAAATGTTATTAAAACATGCAAATTCATTTTGAGAACCTCGAGAGAATtaacclATTAACTTT^ 
AAAGACACAGTTTAAGACAAATATAAAGGTTTAGCCAAATGCAACGGGGAGGAAGCACCTGG 
AATAATTTGTG CTTC CTCAGGGGCTTGTGGAGATGGCTGTTCCTGCCAAAGCTGTAGCAGTT 
ATCCAAAAG TTCTTT CAAACTCATATTTG GG GCCACTGAgTtanngtmttmagTTTtgttttgTTTTTTCT 
TTATGTCTCTCCCATTTCTAAAGTGI 1 1 ICI lAAGTTTGTTCTAATTCAGCATat 



>590 
>591 

CGACCCACGCGTCCGCGCGAATCCX3TCGCGGAACCTGTCTTCTGTCTTT ACCCAGA 
GCTACCATGAGGAAGCGGAACCAGGTATCGTACGTGCGGCt^GCCGAGCCGGCGTTTCT 
GCCCXaCTTCAAGGAACGGGTCGGCTACAGGGAGGGACCCACGGTAGAGACT AAGAGAATT 
CAGCCT GAGCCCCCAGATGAAGATGCGG ATCACAGTGACAAAGAAGATGAACAGCCT CAAG 
TGGTGG1 1 1 I AAAAAAGGGAGACCTGTCAG7TGAAGAAGTCATGAAAATTAAAGCAGAAATA 
AAGGCTGCGAAAGCAGATGAAGAACCAACTCCAGCCGATGGAAGAATGATATATCGAAAACC 
AGTCAAGCATCCCTCAGATGAAAAAT ATTCAG GTTTAACAGCAAGCT CAAAAAAGAAGAAGC 
CAAATGAAGATG AAGTA AATGAGGACTCGGTCAAAAAGAACTGACAAAAACAAATTAAAAATA 
GTAGCCTCCI I ICI I 1 1 GACAACGAAG ATGAAAATGAGTAAGTGTAAATATTTTGAATTTAGTC 
TACTTTGAAAGTATATGGAGTGTTCATTAAAATCACATTTTTTCCTA 

Gi i til 1 1 ATAGAAAGTTTAGGAAATAG AGAAAAAAATTTAATAAACTAGATCTATTCATCAATA 
CGCCTCTGACTTAAAATGCCAACTCT ATAGAAATTAG CT AGT ATT AACATTTTGTTATTTCCCT 
TGTGTGGTTGTATATATATGTAAATTATATTTTTAAGCAAAATACATTTTTTGTGTGTAAACAAA 
ATTTTATAAATACAACTGTATTGCAAATGTTCTTTGTCCTGCT^ 

GAGTATTCTTCGAGGTCAGTAAATTTCAAAAACtrTGAGATTAATAGCTACAGATAATT^ 
ACATCTCATTGTATU 1 1 1 1 ILATTAGCAATAGCTCCACTTTGGGTGGGGGAGATGATAATGTG 
CCTTGT1 AAAAATACCTCCCCAACTCCTGCTAAGGGTGG<X^TGAGACTO^GCTCTGGCAAG 
TTAAGAAATACAGGTGGAATTCTGCTTGATAAAGCTGCTGGG 1 M 1 1 IGU A CAAAAGGACAG 

acttggcaaacatgagcl. nTGCTcn atcttttc^tcctacttggagtgcagagataaaacc 

TGAGTACCTGCCGGGCGACCGCGACCGCGTGGAGCTCCAA 



CTGGAATTCTTTTTAGCGTGGTCTOGGCCG 1 1 1 1 1 1 1 1 1 1 1 1 M I IGATTTTTTAGGAG 
CTTTT ATTGTTTTAGTAATCTTAACATAACTT AAAATAAGAGAGGGGAAATGACATCTGGAGAT 

ct aggtatgtggcccattgcaattgagcagatttcttgggt c hi 1 1 r c tcta tctctaaggg 
gagtctx^aaacxxx^gctcaaaatacgacactaacatcatgaa 

AGTGCTCTGTAGTCTrATGATGATCTAGAAGAGCACTGTCCAATAGAACTTTCTGTGATGATG 
AAAAGATTCT ACTTCTGACCTATTCAATAGGGTAAGCACTAATCATGCATGGCTCTCAAGCAC 
TTGAAATGTTGCT AGTGTGATTGGGGAGCTGCGTTTTGAATGTT AACT AATTTAAATTTAAATC 
TT AAAAAGTTACATGTGGGT AGTGGCT GCCGTACGGAACAGTACAGCACT AGAAT AATGCT G 
TGTCCTGCTGTTACACATACTCATGGTCTCTCCT ACTCT CCCTTCATCACACCTGTGACATCT 
TGCAATTTATATTTGATTGTGTCATTATCTAATGTATI CCTCCTGGACTCT AAGCTCCAAATGA 
ACAAATGCTTTGTCTACTTPGOCATAACTGTGTC 

GAGT AAATACATCAT AAATATTTTGTGAATGAATOAATGAATGAATGACCATGATTAATAAAAG 
GATATAGCTGCTCAGTCTGGTGCTGATAATGGTGGT AGTT^TCAAAAGGGTTAAACCACAGG 
TCATTAAATGAAAACAGCT AACTTAGTGCTTACAGGAACGAT ACACTCCCT GT GTATAAACAT- 






gccggtgtgtgtgtagaggtgtgtgtgtgcaaaagcataataatgtg1ttgcaaaggtttgc 
atttgtgtagtcatgtgtw5acatgtatgctcatgtgtttttgtgtcaggataataaggtgac 
taagact aagacacatgcaaatcaccgaggaaggcagtttct aagtcatt acttttf attttg 
aaggatttgtgaaactcttcagatcatggtaatcatacaaactctcaccatgatgtgaagagt 
ttcacaaatccttcaaaataaaaagtaatgactt agaaact gcctt cct gggtgatttgcatg 
tgtctt agtcrtagtcaccttattatcctgacacaaaaacacatgagcatacatgtct acaca 
tgact acagaaatgcaaacctttgcaaacacattatgcttttgcacacacacacct gtacaca 
cacaccggcatgtttatacacag ggagt gtatggttcctgtaagcact aagttagctgtttt 
catttaatga cctg tggtttaacccttttgatcactaccaccattatcagcaccagact 
agctatatcxrrtttattaatcatggtcattcattcattcattcattcagaaaatatttatgatgt 
attt act ctgcaccaggtcccatgccaagcactggggacacagttatggcaaagtagacaaa 
ggatttgttcatttggagcttagagtccaggaggaatacattagataatgacacaatcaaata 
taaattgcaagatgtcacaggtgtgatgaagggagagtaggagagaccatgagtatgtgtaa 
caggaggacacagcattattct agtgctgtactgttccgtacggcagccact acccacatgt 
aactttttaagatttaaatttaaattagttaacattcaaaacggagctcccca^ 
aacatt tcaa gtgcttgagagccatggatgattagtggttaccctattgaataggtgagaagt 

AQAATCTT TTCATC ATCACaGAAAGTTCTATTGGAC^TGCTCTTCT AGATCATCATAAGACT 
ACAGAGCACTTTTCAAAGCTCATGCATGTrCATCATGTTAGTGTCGTATTTTGAGCT 
TTGAGACT GCCCTTAGAGAT AGAGAAACAGACCCAAGAAATGTGCT CAATTGGAATGGGCC 



NC TACCC ACCTTTCCCCCCCGCTTT ggaccctccccgtctg 
CCCCAACATTJTAATTTCGCCCACCCAGAATTTTCAGTACAATTCA 
ACAAAAGATrrCCAGATTTATTTCCAATTTCAGTTrCAAAAGAAACT 

ATTTACT ATCTGTGAGAATGACCAAT AAAATGTGT AAGAATAT AAAAAATAAATTCATTCATTG 
TTGGTAGT ACTCCAAATTCAATGCATAAGAACCTTTTCT ATCTAAAGCATTTTCCT AAAACTGC 
AAACT AAAAATCTGATATTTAAAACATTTATGTGGTAAAATAATCTCATTTGATTATGGATATCA 
TAAAT AGAAT AT AAAAGATATTACATTTCTTCAAATACCA I ITGTI I GACT GTGTTCAATTTTGA 
GTTTCCT AACATCCATTCTACAGGTCAATCAAAAT ACAAAG^CGTGCAAATGAAACAAGCT AT 
TGAAATGAAATAAAATTCAAGAATAAAAGTT AGAATTAACTTATGAGGTAACTTTT ATGTAAGC 
AAACATCACAGTTGTAAGAAATGCCTCTCTTTAC^AAGAGTCATTAAGATG 
CAGCTATTATTTCATGTGCCCAAAATCAAGCAAAATAAAATCCCATGC I I OCT GGAATATTAA 
ATACCATG CAGTGTACT ATTT AAGAAAAGAACAAGGTT AACT AACT AAAAGCAGAACTCACTT 
ATTTTTTGC1 CCCTAGCCAATTAAAAATAAGTTCATTAAAAGCACTTGAAATTATATATTTAACC 
TGAAAAAAAGTTGCTAAAATTCCAATATAAATGTAAATATCTTrAACTTGCTrAACCCAGCTAT 
CCCCAAAACAGTGTAGTGGGGCAAAATGTTCAAAAGAAAAATCATCCAGTGCACGTAGATGXS 
G CACCAAGAAG CTAAGCrroOCTOAGOGOCT 

CTCC IIIICI UGGGATGAAAAGGTGCTCACCAGATCTTTCT ATGCAACTGAGAGCTCCT AAA 

TCAAGTCAAAGGGGACCATCGTACTGAAATATTCCATATTTTC^^ 

CTCAG^GG CCAGGAGCTGTTCCCCTrGCCCAGGCTGCAGAOCAACCTG^ 

ATGTTCTT ATGGT ACAGAGTGGTGGAGGATGGAAGCCATTCAGAGCCT GATGAAATGGAACA 

G AAAACTCCC A 1 1 1 ICIUI CACCTGTCAACTTCCTGTAATTCAAATCACCCTTG 

AATQT GOJI fCTBl AGTTCAGCAT AT AAGTCAGGTGCAACCTGAGGCATnKSACAGT AOCTC 

OgOGg iOSOOGOGOGGGOOSOOOGGOCACGTAC Al 1 1 GI 1 1 1 GCOCAGCCAATCACTCTTT 

TTTGTGAGGAGCCTAAATACAN 

>597

HAOrCQAOCCACGOnxX3GGTGAGCACCGCCCGGGAAC^ 
TGGAAGCTGCAGGCTTAGTCTGTC«GCTGCGG^ 

GCCTT AACATCCC1TGCATTTGGCTGCAAAGAAAT CTGCTTGGAAGAAGGGGTT ACGCTGTT 
TGGCCGGGCAGAAACTtXXiCTGAGCAGAACTT GCCGCCAGAATGCTCCTCCT GTTGCTGAG 
TATCATOGTCCTCCACGTCGXXXJTGCTGGTGCTG 

GGATCGT GGGCAATGGACACGCAACTGATCTCT GGCAGAACTrGTAGCACCTCTT(XTCAGG 
AAATGTCCAOCAC I G I H C I CATCATC*OCAAACGAATGGCTGCAG1 C I GI OCAGGCCACCA ' 
TGATCCXGTCGATCATCTTCAGCAT | C 1 G 1 C I C T ti HCX i lUI I CI fCI LJ XAACTCTTCACCC ' 
TCACCAAGGGGGGCAGGTTTTACATCACTGGAATCT^ ' 
ATGAGTGCTGCGGCCATCrrACACGGTGAGGCACCCGGAGTGGCATCTCAACT 
CCT ACGGTTTCXjCCTACATCXTrGGCCTGGGTGGCCTTCCCCCTGGCC^TTCTCAGCGGTGT
CATCT ATGTGATCTTGCGGAAACX3CGAATGAGGCGCCCAGACGGTCTGTCTGAGGCTCTGA 
GCGTACATAGGGAAGG^ 

GAAGATGTAT ATAATAT CTCCGGTTTATAAAACCTATTTATAACACTTTTTAC^^ 
AGTATTGTTTGCTTTTT ATGTTGACCATCAGCCTCGTGTTGAGCCTTAAAGAAGT AGCTAAGG 
A ACTT T ACATCCTAACAGTATAATCCAGCTCAGTAT Fill GTTTTG I T TT T IGTTTGTTTGTTTT 
GTTTTACCCAGAAATAAGATAACTCCATCTCGCCCCTTCCCTTTCATCTGAAAGAA 
CC CTCCC AGTCCACCTCATTTAGAAAACCAAAGTGTG GGTAG AAACCCCAAATGTCCAAAAG 
CCC» 1 1 rCTTGGTGGGTGACXX^GTGCATGCAACAGAAACAGCCGCTGCCXXlAACCTCT GTG 
TGAAGCTTTACGCGCACACGGAC AAAATGCCCAAACTGG AGCCCTTGCAAAAACACGGCTT 
GTGGCATTGGCATACTTGCCCTTACAGGTGGAGTATCTTCGTCACACATCTAAATGAGAAAT 
CAGTGACAACAAGTCTrrGAAATGGTGCTATGGATTTACCATTCCTTATTATCACTAATCATCT 
AAACAACTCACT GGAAATCX^TT AACAATTTT ACAACAT AAGAT AGAATGGAGACCTGAAT A 
ATTCTGTGTAATATAAATGGTTTATAACTGCTTTTGTACCTAGCT AGGCTGCT ATT ATT ACT AT 
AATGAGTAAATCATAAAGCCTTCATCACTCCCACA I II I I C I lACGGTCGGAGCATCAGAACA 
AGCGTCTAGACTCCTrGGGACCGTGAGTrCCTAGAG^rn'GGCTGGGTCT AGGCTGTTCTGT 
GCCTCCAAGGACT gtct g gcaatgacttgt ATTGGCCACCAACTGT agatgt at AT atgg TG 
CCCTTCTGATGCTAAGACTCGAGAC C nTTGTTTTI GCTTTGCATTTTCTGATTTTATACCAAC 
TGTGTGGACT AAGATGCATT AAAAT AAACATCAGAGT AACAAAAAAAAAAAAAAAAAAGGGGG 
GGGCCGTTAAANNNNNNNNNN 
>598

CCACXiCGTCCGCrrCTCAAAAAATTTTTAAAATTAGGGTGGTGGTGCACACCT 
CCCAGCTGCTTGGGTGTCTTAGGCGGGAGAATCACCT^ 
TGAACTATGACCACACCAGTGCGCTCCAGCCTAGGCAACAGAiS 

AT(1AGTT(^TT^TGGAT^TTGTGG CTTCAGATGTGAAA GAAAAGAT ACGGGGCTAAAATTCT 

GT ATTTTCAGTGT ATTCTGGT AACCAGCT AATCX^kAGCAGTAATTTTTGAAATG TGAAGTTT A 

GAATATATTCTAAATAAAATAOGTGCAACCTCCAGTGGGAACCTCTTAT^ 

GTAGGAAAGGCGCTGGGTGTTCTAAAATAGGCTCTCCTGGCCCACGGCTGACT 

TGTGTCTCTGCAGTGGACGTGACTCTGGACCCAGACACGGCCTACXX3CAGCCT 

CTGATAATCTGCGGCAAGTG<XK5TACAGTTACtrrCCAACAG<LACCT 

GAGGTTCAATCTGTTTCCCTGT GTCTrG^GGCTCTCCATGCrrcATCGCOGGGAGACATTATT 
GGGAGGT AGAGGT GGGAGATAAAGCCAAGTGGACCAT AGGTGTCTGTGAAGACTCAGTGTG 

cagaaaaggtggagtaacctcagccccccagaatggattctgggcagtgtc I TTG 1 UGTAT 

GGGAAAGAATANNNN 
>599 

W^NNGGAGGGGCCGTTTACGGAAGTGTAACGTTGJU^CXX: rrcTTBi g tatctg 
gagaaaatagaggttctgactcctcaggagcaaaaaacataacct 
gattt ggggtt caccatttctt ggggcacacttgatt gaaaact gagactt ctgaagagaag 

GCCAGAAGATACAAAGACAGACCATCCCAGTTGAATGCT GT cttcgaagaacagaagaaaat 

gatccaggcccaggaatcx>itaacact ggaggatgtggctgtggacttcacttgggaggag 
tggcaactcctgggcgctgctcag^aggacctgtacx^ 

gcaacctggtggcagtggggt atgaagccagcaaaccggatggactcttcaagttg?gaaca 
ag<3agaacaactgtggacaattgaagat6gaat«>cagt 

AAAGTTGATCATGTGCTGGAGCGCTT GCAGAGTGAAAGCCTGGTGAACAGAAGGAAACCAT 
GTGArGAACATGATGCATTTCAAAATATTGTTCATTGCA I I rcTOi I A GGGC 
AAAATCATGATATATTTGACTTTACGTGGAAAAAGTTT GAAAT CCAATTT AACTTT AGTT AACCA 
GAGCAAAGGCTATGAAATAAAGAACTCTGTTGAGTTTACTGCAAATGGG& 1 1 ICI I C 

ATGCT AACCATGAACGACTTC^T ACTGCAATTAAATTCCCT 

CT AAGTCCCAATTCATCAGTCCCAAGCATCAGAAAACACGAAAATTAGAGAAGCATCATG^ g 
'TX^GTGAATGTGGGAAAGCCTTCATCAAGAAGTCTT ggct aactgatcaccaggt aatgca 
T ACAGGAGAGAAACCOCACAGATGT AGTCT ATGTGAGAAAG CC 1 1 CI CC AGAAAGTTCATGC - 
TT ACT^AACATCAGCGAACTCAT ACAGGAGAAAAACCTT AT GAATGCCCTGAATGTGGCAAA - 
GCC1 1 ICI CAA GA AATCACXXSCTCAACATACATCA G AAAACACAT ACCGGAGA A 
T AT ATGCAGTGAATGTGGAAAAGGCTTCATCCAGAAAGGAAATCTCAT^ 
TTCATACAGGTGAGAAACCTT AT AT ATGCAATGAATGT G^iAAAAGGCTTCATTCAGAAGACGT 
CTCTCATAGCACATCAGAGAi i • CAC^CAGGAAAGACGCCCTTTGTGTGCaGCGaaTGTGGa
A AATC CTGTAG6CTAGGTACGAGGCTTGGTGTTGAATTCGGCCGAGGCACAGGAGAGAAAC 

CCTTTG aatgtagtgaatgtgggaaagcctttacacaaagcaaaagct cattgtccatcaaa 

GGACTCATACAGGAGAGAGACCCTATGGCTGTAACGAGTGTGGGAAAGCGT7TGCGTATAT 

GTCGTGTCTGGTTAAGCATAAGAGAATACACAC^GC^AGAAACAAGAGGGAGCCAAGGTG 

GAAAATCCTCCTGCAGAGAGGCACAGCTCATTACACACCAGTGATGTCATGCAGGAGAAAAA 

CTCT GCT AACGGGGCGACTACACAAGTGCCTTCTGTGGCCCCTCAGACATCATTAAACATCA 

GCGGCCTCCTCGCAAACAGGAACGTAGTOrrTGTGGGACAGCCAGTGGTCAGATGTGCAG 

C^TC^GAGATAAC^GAGGATTTGCACAGGACAGAAACCTTGTGAATGCAGTGAATGTGGTT 

GTGCCTTCCGTGATCAATTATGTCTTATTTTATGTTACAGAAAACCCATAGGAAGAAAACTCA 

GATCTATGTGGAAAAGGGTTGAGCAAAAATTTGTAGTTCATTATGTGGCCGAAAAGCATACA 

CTGAGAGAACATGTATAAGGCTGAGATAGCCTGATAAACTCATCCTATTAAAATGTATGCTGT 

GATACACAGGCAAATTTGATGTTAACCTAAGCACATACACAGCAATTGCTCGACTGTGTCAAT 

TAAATGAGTAAAGGAAGCCCAAGTACTTTTAAGTGGGAGGAAAATGACGTACCTGTGCAATC 

TTTTACAGCAGATGAGATAATTGCTGTTCCGGAGTGTTGCAGTGGAGTGAAAAGCTTCATGA 

GGGGCATTGGTGGGGAAGGGGTAATCCTGTAATGGTGGCATGTGGGCAC C 1 1 I G I I I CACT 

GCGTTTCCTTAACCATGCCCCTGGGGGATATTGAGACCCGACCTTCTTCGAATTGTrr^ 

GGGTTTGGACCCCTCTTCTTCACCTTTrATTCAAAAACACCGCTTTCGTCGAAGAAAA 

CTTCCCCTAGAGGAACGCN 

>600 

NNNAAGAAGTCTGAGGAGAGGGAAACGGAATGCTCGGTCGCCCCCAGACCTGGCC 
CACCGCGGTCACG^TCTAGCCTATTGGCCGCACCGGTCTGCCCGCCCG^G<3GCCTGCACCA 
GCCAGX3CCCCTCCGGCCCGACGCXX5CCGCGGGGGCGGAGTCGACGCAGTOGCTTTCGCT 
TTOGCTCGGGAAOGGGTGCAGGAAGGTCOCGGCGGCCGCCATCTCGCCGTCGTCCCGCG 
GGGTGCCCGGGGCGTTGCTCAGGCCGGCCACGGCGCCGGGGGAACTCTTCGGGAAGCCG 
TCCATGTCGCCCGAGCCCAGGGATCCG7TCACGTGGTGAGGCTCCATCGCGCTCATGGCG 
GCCATGGGGCCCTCCGGGCCAGGGCCGAGCGGGAAATTAGCCCTGCCGGCGCCCTGCCC 
GATGGGGTTCATGATAGTGTACATGTTTTC^CTGGAGTTGGTGGAATCTCCAGGGCTAGGC 
ATGATGGGTGTTCCAGGGGGCCCACCTCCTCCTGGGGGTCGGGTGTAGCTGCCGGGGGAT 
GAGGAGG AGT AG GGGATCGAGTTTCCACTGG GG CTGGCCCACGG GCCACGAACTCCTG GG 
GCCATG7TCATGGCAGGCAGGCCTGGGCCGGCGAGGGAG7TGGGTGGGGGTCGGATGCC 
ACCTCCATAGCTCTGGGGCCCCACGGTGGCCATGCCACGAGGAGGCGTCACCCTCTGCA7T 
GGGCCGCCCATGCTCGGATGCOCCTGGGCTCGTGGX5GAGGGCTCCATGGCGCCAGGGAG 
GAGGGGCTGGGAGCCAGGGAGGCCTGCGGGAGGCTGACTCGGCATCCGCAGGGTGGGCC 
GGGGGCCCCCTGGGAAGCGCGGTGACATGAAGGGCTGGAAG^AGCCAGCCGCCATGGAG 
CCTGCGGCCATTGTGTCACCTGGGGCCATACTCCCCATAACGGGGCTGGGGGCGGCTGCA 
GCACT ATAGTCCTGGAAGGCCTTGGCCTCGCCGGAGTGCTCG CAGGCCT CT CTTCTGTCAG 
GCGCCGCGCAGTACAGGTCCCAGAAGACGCACCACGAGGAGTGCAGfiAACCCAGGGGGC 
TCCCCCAGCGTX^TGTTCTTCTCCCATCGGATCTCAGACAGGAAGGTCT GGGCTGACTTCT 
GGGCACCGATGTGCAGC^GGTACTCATAAACGTACAGCGCCAACTTCTGGCGGGCCTGGCT 
GTCGGAGGGCACGGCCGAACCCTTGCCCCCCTTGGCGT ACATGCTGCTCGACGCCGCCGC 
CGCCGGGGCCCACACCTGTCAGGC«GCGCCGCGGGCCGCGCTT^ 
CCCCGCGCCCCGGGCCTCACCGCCCCGAG 



G^TTCCCCTCCTCC^ 

>601 TGGCGGGCGCGCGGGTGGGTNCCCNNN 

NNATAAGAACTT ATGGG^TTTCCTACACGGAGAGAAAAAAAGATATTCCTTTATGTTG 
TTTAAAAGTGGGAGCTGCTC I IICII 1 ATTCCATTTTAATCAATGAGTATTGATTCAAGTTTTC 
CTTTCTAi 1 1 1 1 CCTTATGATAAGTTrCTTACAGTAGCTTATACAACAACAAATAGCATAGAAA 
AACTACTGGATTCAATTGATCATCAGGAATAAGTTCTCAGAAAAACACAGGCGGAAAA^ 
CAAGAATCCCAAATACAGAACTT7 ACAAGCTGTGAAACTTGGTCTCTT GCAATCATGTT ACTG 
CTCAAATTAGAGAGAATCTTATTGCTGT C7 ATTGGAGGAGCA7CGTGGCAGATCAGCAGGGA - 
ATGATGATGTTGA GAAC AGCAGCAAAAATAAACAATCtyfATGCT 

ACAAAATGGATACCTTTCAGAAAAGTATATACTTAAAAGACCCAAGACGTCAGGATAATAAAG 
CTCTGTATTTATAATCTTTTATATGTCCTATTGTGGCTATTATGC77AAGTAAAATAGCTAAAG
AAAAAAAAGAAAAAAAAAACAGAAAAGATGACAATATCATAAAAATGTAGCTGTCTATTTTGG 
CAGCTATATTGCAGAATTTCTTGACTATCTTTTAATCAGCTGGGAAAAAGTCAATAACTGTAT 
GCAAATGAATAAACTGTCCATATCAAAATACAAAAGTACTATCAATAATCACCTCTGACTTTCA 
GATTTAAATTCAGTGCAATTGACAAATGCATCACTAAAGGAAAATGCAGCTTAAAAGATACTC 
aaa acatttgtg tct ATTCCTGGG AGCACATTTTAAAATTATTGCACAGATTTTT^AATTTTTA 
TTTAi 1 1 1 1 1 1 1 AAACAATAACAGAGG TCAACCACAG ATGTG GACCTCCAG CAAT AAAAGCAG 
GAATTCAAGTGCCAGATACTCAGCATATTAGGTTTCCTACGTAAGTCACAGGGTAATATGTTC 

g cctctcttgatgt aaatt7 t7 aaaaatattatt acagtatcatagtccccact aacaacaact 
ggggtacatataacaatgtattgtgaaattaagtgtatttattctctttaccaatagcaaatgc 
taccctaccttagtaaaaccaagacttgcttcaatcaatgctgttttgtaaaaatagcaaagc 
aacg^tgctgaaatcaatcaaagctgcattacttgggttaaatcagtttctacttagaacac 

^gjta^ttttagtactaaaaggotc^^ 

GC I^ G 5 TCMCWMGC ^^ GG ^A^^ 

tatctatatctttaagaaaatacttgatattataaacagagtaaaagacatgatatagtagtga 
^AcrrAAAAAAAAAAAAATTAGCAGCTT aaatctatctatatttgaaaaaacgtagtcacaagt 
AGGAGa ^J g ^ g ^^ ga ^ < ^^ g ^ < ^ MGM( ^gttagtg caattatactttc attaaaaaa 
aattctgaatgactgctatttaaaaacaccttgaagcaagt 

^^^?^^^^^^^TITr GCCAT GTAATGQCAGTGTTATATGCCG7TATCTTGCTTTGTA 
]t^ A ^^ CAJGAGAGATT ^ AATACJGGAG ^ 

acacagaatgatggacacttcgagaagctaatccttatccagaaacannnnnnnn 

>602 

cogccggagtcgccaccgccgcgccctcgc^acccgcccgcccgccgctcccggccc^ 

t9 gcgc z cgctcccgcggccctcgct ^^ 
cctccccccgccagccgccaaagacttgaccacgtaacgagc^ 
gcccgccgctcx5ccatggatgcxx5gtgtgactgaaagtggactaaatgtgactctcacca7 
TCGGCT I C 1 1 atgcacggaaaggaagtaggaagccgaatggtgagagtcaccattcggctt 

SZX^TH5 :ACG G AAAGGA^GT AGO A^G CAT CATTG GG AAGAAAG G GGAGTCGG TT AAG AGU^A 

I9^ CTGAt ^ 6CCCCACCMTGC ^ 
^P^^T^cmcagct^tg^ 

CTGAGGCTGGTGGTGCCGGCCACCCAGTGCGGCTCCCTGA7TGGGAAAGGCGGGTGTAAG 

*T C ^ G ^ TCGGCGAG ^ TAGGGG ^^ 

aactcgaccgagcgggccatcaccatcgctggcgtgccgcagtctgtcaccgagtgtgtca 

AGCAGATTTGCCTGGTCATGCTGGAGAaSCn-CTCCCAGTCTCCGCAAGGG^GAGTCATGAC 
C ^^ C /^ ACGAG j^ 

G Z™?J C ^ CG ^ GCGGGCTACCCCCATG ^ 

TGCCTACTCGATTCAAGGACAACACACCATTTCTCCGCTCGATCTGGCC^ 
£ C ** G £TPI£?£ G ^ 

ATGAACTCACCATTCCAAATAACTT AATTGGCT GCATAAT CGGGCG CCAAGGCGCCAAGGGG 
CCAACATTAATGAGATCCG«^GATGTC^^ 

™ C X~ TCMTGCCAGGC ^ CGTCTGAG ^ 

AOGAWG^TGCACTTCCCACATATATACTTCACTGAGCCTCTGCAT^ 
AACAATGGCTCCTTCACCCCAGTAATGTTATtt^ 

TTTATTATAAGGGTAGAATATAGATTAAGATCATGAAAATCCAATTTCACATGTGGCACTAGG 
AAAATGGAAT(Kn"ATAAAATTAATCCCCATCACCCAAGTAGGGATGCTGGTGCTT7TATACT6 
AGTCTTTACTGCCAATAAGTGATGCTGCCTCCAAAGGAACAGCTCTACAGAAAGTTTTTGGG
TTAGTGTTTCCCTAGCTT ATTTCTGCAATGGAGAAAGACAATTTCATATCAAAAAAAAAAAAAA 
AAAAAACCT GGAAAAGCTACAGATGTTAACCTTTACTTT AAACACCAATAAAAT AGGTGTCAA 

aggagaaaaaatgtaaaaaaaagag^tcccaaacagcctgagagaccaaattctgcaaatgg 
aaatt7taaagcccact g aatacat7tttcatggatccaaacaaatgct acaatacttttaagt 
cctcaactcacagaattaagagcccttaagctgttaactttgtcctgtcttcx:tattcagaaa 



&ttaActnatgtaataGCatTtgggttg9ggattggtttgtTTCCAGTTGTACAAAAGATAGTTGTATTATGTTAGG 
TATAATTATGACCTrATTAGTGTCTTTATTTGAAGATTATGTATGATCTCAGGAGATGTGTATG 
GGTTCAAGTTGACAAGG GGTGGACTTGTGATGGTTAATACT G AGTGTCAACTTGATTGGATT 
AACAAATGCAAAGTATTAATCCTGGGTGTGTATGCGAGGGTACCT ACCTGATG 
gtcccatcagtaatatattgtta 




taaggacaffig; 



„ _ _ agtgactgtaatgacca&stgaggagtta 

iTcatgttaagggcATCTTCGTTGCTAGAGtaantcTTCTAAAATATGGAAG 
GTCAACActnkxx^AGACTTGTGGATAGTGATGGATAAAAATAACTCT 

ACTTGCTCAAGTGCAAAAGTTATAT7TAGCTTCAGCAATAGGAGTCTATAAAACTGTGAGAGA 
AAAGCCTT ACT AACTGAAGTCAT GCTTCAAA CCTTTCT ACCAGGGGTTCCAATAGCAATTGCT 

AAG AATACG GCAGCTTAAAAAAA 1 1| CAATTGACAGATA7TAAGAATGCCTGATATTTA 

CTGACTTGAACATTAGTTAAATTCAGTTAGGGTGAOCAGAGTG7TGCT ATTTGT AGAACTG7T 
GCTTGGTAGAGTCATCCAAAGTGTCTTCCTAAGCT GTAATAAATATGCTT ACTAATGTTTGAC 
TGCrTAATTAATGGTCAGTAAGCATTCAGTCAGAGTTCGTTTAATACATGGAGAAAGATATAT 
GCTTT AGATCAAGGGTTAGCAAACCACTGCTTGT7TGGGCCACGGCCTGTTTTTGTATAGTG 
TGGGAGCTAAGAGTGGGTGGTGTACATT7TTAAAGGGTTATGACAACGACACAGAATAATAT 
GAGACACAAACCXrTATGTGGCCC^TAAAACCTAAAATTCTACTGTCTGGCTCTTCACAGAGA 
AAGTTTGCTGATCGCCGCTTTAGATAATGGGGGTGCTCTACTACTCCCCTTTTC^ 
GTTACATAAGCCTAAATAATCACTGTAGCTGGTGGCATCATGTTTGTTACCTACTAAGTAGGT 

caaagtgattgccagacatacacatgaaggccttgaattagaaagcaaaggaactgatgatg 

ACG^TGTTTAACAAAATTCAGaCTGACTTTGToCCTGATCCT^ 

AATGTCTGAATTATTCAGCTCATATATC7TCAAGCACTCAACT AGTTCATACTTTGAAATCAAT 

TJCTAATAGACAATTCTCATAACACCTTTATAGTCTTCCCATTTAAAAGGTAAAT 

CT GGAGGGGT AAGATGCACCCTTGGTATATTGTCTGATCTCAGCAGAATCAACTACTTGGTA 

GTGT AGTCCAGAGAAAATGGGTCAAATCTATTAATTATTTTAGGATTTTGAAATTCATAATTGA 

GACTCGTGACTT AATAGTGAACTGCTCATGGTACTTTACCCAGT ctt caagttgtatgccttt 

tgtaggtaggcatttagatg ggat gcttttgaaagcataattaagaaactt^ 

G T™ A 7*^ GG< ^*A rG ™A T ^ 

tggtgccttgtttaatggtctgacactact gattttgggtaagtctacaaagccattgggat 
gaaaaattgctggaaagattgtgtgct^msgagcttagacatttragtggag 

GTATGAAAAGTAGGGGATGAAAATGTGGGCCGGGCGCGGTAGCTCATGCCTGTAATCCCAG 
CACTTT GGGAGGCCAAGGTGGGOiGATCACCTGGGGCCAGGAGTTCGAGACCAGCCTGGC 
T AACATGGTGAAACCCCATTTCTACT AAAAATAGAAAAAATTagCTGGGCGTGGTgGCGCAOG 
CCTGTAATCCGAGCTACTCCtSGAGGCTGAGGCAGGAGAATCACTTGAGCC^ 
GGJTGCAGTGAGGtKlAGATCGTGCCATTGCACTCX>GGTTGGGCAA^ 
ATCTCAAAATAAGTTTGAGGTTGTATTCTCTTTAAATAAGTTGGTGATACTGCTTCCC^ 

GGGTGGCTTT ATATGA7TGGTGCAGTGATAAATCTAACCAGGGATAGCTTT ATTTTATGAAATC 
TCACT GTGATATGATTTGAAGCT AGAAATGGTT CCT AGCT CT AATAACTGCAGCCTCACACAG 

ATTCCTCCCAAATCATTCCATTC 1 1 1 I I GAAAAGTTGTATATTTCCCTrTrTCAGCTCTCATTTCA 
CTCTTCAGTGTTCCTGTTATTrATGAACGGCATCAGGCACAGATAGATGATTATCTAGGACTT
GCAAATAAGAATGTrAAAGATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCG 
CAAAGCrrGAATGAAAACGCCCAAAATAATTAGTAGGAGTTCATCTTTAAAGGGGATATTCATT 
TG ATT ATACGGGGGAGGGTCAG GGAAG AACG AACCTTGACGTTGCAGTG CAGTTTCAGAGA 
TCGTTGTTAGATCTTTATTTTTAGCCATGCACTGTTGTGAGGAAAAATTACCTGTCTTGACTG 
CCATGTGTTC^TCATCTTAAGTATTGTAAGCTGCTATGTATGGATTTAAACCGTAATCATATCT 
TTTTCCTATCTATCTG AG GCACT G GTGG AATAAAAAACCTGTATATTTTACTTTGTTGCAG AT A 



Com i CAGTTTGTGCACTGTGTATGGTCCGTGTAGATTGATGCAGATTTTCTGAAATGAAAT 

^^TTGTTTAGACGAG ATCAT ACCGGTAAAGC^GGAATGACAAAGCTrGCTTTTCTGGTATGTT 

CTAGGTGTATTGTGACTTTrACTGTTATATTAATTGCCAATATAAGTAAATATAGATTATATATG 

TATAGTGTTrCACAAAGCTTAGACCTTTACCTrCCAGCCACCCCACAGTGCTTGATATTTCAG 

AGTCAGTCATTGGTTATACATGTGTAGTTCCAAAGCACATAAGCTAGAAGAAGAAattcaa 

>604 

AATTTCCAAGTGATCTGAACTAAAAATAGTTTCTTAGTAAGTGTAATATTAATTTTAGT 
TCAGATCACTTGGAAATTAGGGGAACTAAGGGAAACTGCAAGGGTAGAAGCAAAGAAACTAT 
TTTrGTTAATTATACTGAGCAAAGTTTGCTCCAAATGAAAAGTCATTAC 

CCCAAGAAAAGGTCAAATTGAGTCAGAAGACTGTAAGTACCACTTTTGGCGCTCCAGAGATA 

AACCTTGTACACACTGCAGTAGAACAGTGAGTCGAATATATGAGCTATAAATAGATATCATTG 

TrAGTCTGGATAGCTCTGTGTTGAAATACTTrTTTAATGCAGATAAAAATTATCAGGAAATTAA 

CCTTATGGCACAGACCAAATACTTAGTGCATGGCTGAGTTrCTACAATTCAAGGAGC^ 

AAATAGGTTCAAGATGGAATATAGTACATGCATATAAGGCAGATGCACACATGCTGGTTACTT 

TAAGA TTGTGTGTGTTGATTAGTCACAATGATAGACTATAACATGAGACAATACAAAGTTACA 

TTTTTGGACCATATTAAAACTGCAAGAAGACAGGGGTCTT ACTG AAGATCTTTT AGAAAACTT 

AAATCCTGTCACAGGATATTTAGACATGTGTAGAATGTAGCTCAATTTTTAAAAAGTAACTGA 

CCTAGAG<3GTGAAAGTTGAAACTGACACATTTTCAAATTAAGATTATGCTT attttgtacaga 

aaacaatgtttaaacacaagcagatctgttgtatgtaataagtaacacagagttttaa^ 
ttaattatttagctttattgaagtttgtttttttccttccgaacctggag^^ 
agcagtttcacaccagaatagcagtgccc 1 hui i m igtacatactgattggac c i 1 1 C I 1 1 
actgtt acgtggacacttt ct atgtt agttttgatgcat aattcttt g aatcctttt at acaaa 
ctagaatgtatgtgtaagaatacctgtccctgcaatgtagtaactacaaacttatt^ 

AATAGAAAACI IGI 1 1 1 1 CTAAGCNNNNNNNNNNNNN 
>605

GAGTCGACCt^CGCGTCCGGGACGCTGGCGGCTCCGCGACG^ 
GGTTGGGCATGCTGCTTACCGCAGGATGATGGAAGC^ACGGCXXXS 

GTTGGTTT CACTGAGTCAGTGCTGATTCT CTCAAATAG AGCTTG AAGGATAAATCTT CATTTT 

TGTTTC AACAA AACTTCGAAACAAAATGGAAGAAAATAAT^ 

CGGTAATTTTGAAGAAGTTCCCAGGGAGACXSGCAATTCAGTTTA^ 

AGCMGTACCAGTTCGTTAAAAATTTAGTGGATCAACATGAGCCT AAGAAGGTTGCAGACCTG 

GGATGTGGTGATACTTCACTCTTAAGGCTGCTAAAAGTCAATCCATGCAT^ 

GG AGTAGAT ATTAATGAGGATAAATTACX^TGGAGAGGGGATTaSTrAGCTCCTTT 

GG^TTTTCTGAA ACCTC GGGATCTGAATTTGACCATCACATTGTATCATGGCT 

AGAGAGACTCTCGTTTGCTTGGATTTGACTTGATAACGTGTA^ 

ATTCAGGTGATCTGGCCAGATTTCCTGAAGTGGTATTTGGGTACXrrCGG<XX2AAACT G 
AGCAGGCTGAGCGATATGATGGTATGGCTGCAGCCATGAAGGCAGTCACAGAACAGGGGC 
ATGAACTCTCCAACGAAa^AGA^ 

AGCAGCAGATGGGCAAAGAGT TCTCCAG ^^ GAG ^ GAAAACA6A *^ G ^ TGAGM6A 
>606 

■KyaqtgaaalQgtoUca gaBt r TGTl 1 OAflTCAAGGGCAGAAATATTTATTTTAAGATAATGT 
AAGAATGTAAGACAGGTTGCTTTGCAATTTATAATAA 

ATAAATTGCTT ATTATGAAATGATTCTGAGAAGT ACCATAAACTGTrTATCATGTATGCTTTCT 
GT AAAAAGTTT AATATGAATTTT ATGTATAACTT AXrrTTTCCAA*GAACAGTTTATAGAAG<3A 
AACACT GACAATAACAGCATACATGT AGTCAGTACATTATT ATTTTCAAACXiACT 
TAAATTGATATAAGTAACAATTCAGTGAGATATTTTTATCTCTCATTTTATAGATGATAAA^ 
AAGAGTGATAATATGATGGAAATCACTGTAATTT AC 1 1 1 CI 1 1 1 CT ATTTTATCAGAACATTTAT 
TTATTCACA7TTTAATATTATGTTGCAGCCTTACAAATGTTTTCATCCACTCACTCTGATAAGC

AATAAAGAATAAAATAAAGATTGCATATGTACAATGCTATTCTTCCTTGGGAAAATGACAGAAT 

TATAAAGGATTTCAATGGGCTAATTTAATTTTTAATCTATATTT GGAGG ATTTGCGATGTACGC 

AACATAGTATAACTTATAGTCAAGG ATTAAAGGAAACTGCA I llllll CAATTACTTTAGGTAC 

CAAATCATATTAAGAACATTCAGTGTAAGGteaoataoacctgAGGTATGTTATATGAGTGGGTAG 

GCATTTTCATTTAGTCAGTCAATTAAATCAACTTTCAAAGCAAGTGAAATTATATTCTTATACTA 

TATGGGATCCrrGAACTTATGAATAATATATTrrAAAATTACTG TCATA AATAATATTAGATAAA 

c aatg taaaaagtctaacatacaaaatcaatgct aagagtaaca r n rcTT GGAAAT AACATT A 

CTIT TATAAATAAGCTGATGCAGATCAATTTCACATTTAATAACTTATAAAAACAAAGTCAGTG 
TTTTCTAGTTTCATATGCAAAAGCAAAGTAATAGTTGGAGTGTATGTAACCTACATTAAATATG 
TCTTAGAAAGGAAGATAAATGAACTTATCTTCCATATGCAAT ATGTA AATACGTTAATTCAGAC 
T AATTGTTCT A I I 1 1 1 G TGTAGTATCTTTGTATAAACTAAGCATTTTT AATTGAACAGATTTCTA 
TTAAGTATTATCAAATTGTTTCTCT ATGTACAAATCT AATTTT GTTGTGATTCAACAGAAAGTTT 
TTATTTTCCCTTTT AAGACATATTCTTTTGAAATAATAAA7TATATTTAAATTATTATTACTGGCT 
GTTTAGCATCACTTTTCAAGTGGTCCTTTTGCAAGAAACAAAAATATTAGCTTCTACTTAATAT 
ATACATCAGTCTTATAAACCATTTTATATTAGTCTTTTACAATACTCAT^ 

ATTAACAATATGCAGTAAGTTACAGAGTGTAAGACGTTATT7TTACTAGTGTTATTAACGTGTA 
CCACCTATGATCAGGAGCAGCTTCATGTTTAAGAAGATAATGACAATGACAATGATTACTGTT 
ACAACAACT ACCACTGCTTCTACTATCAACACATGACTTrTTGGGGGAGGAAAATTCT ACACT 
TTAAATTTCTTTACCGTATACATAAAACAATAACAGACATTCTGTTGACTTTGGCAAATAAACA 
ACAATTTTTAAAAACTTTATACAAGTCAAACAACATATGTTAATTAAGTACTTTT AAAAGTT AAA 
AGTACACGT AAACAA I IUI 1 1 1 ATATAGTAGCTTAATTTCTAI 1 1 CI 1 1 1 CAAAATTATCTCAGT 
GTAATTAAAAACTGGGCAAATAACAGGAATTTAATTCTAAGGGTATTATAAATCTCAAGCAATA 
TTCATCTCTGAAAATAAGTTCACAGTCCAAAATCAAAGAAAATGCGATGTTCATTTTTAGGCG 
ATCAAGATTCCA TGAGTCATAAATT ATTTTGAAATGAG GAAAATTTCT AATGAAT AAATAAAAA 
AATAATATTGAAAGTACCTGCCCGGGCGGCCGGCCGAGGTAGGAAATTGGGATGACATGAA 

acacctgtgggactacagatttggaccagagaaacttaatatagataccagaaattgtaaaat 
cttactcacagaacctcctatgaacccaaccaaaaacagagagaagattgtagaggtaatgt 
ttgaaacttaccagttttccggtgtatatgtagccatccaggcagttctgactttgtacgctc 
aaggtttattgactggtgtagtggtagactctggagatggtgtgactcacatttgcccagta 
tatgaaggcttttctctccctcatctt accaggagact ggatattgctgggagggatataac 
tagatatcttatcaagctacttctgttgc<mggataggccttcaaccactctgctgattttga 

AACGGTTCGCATGATTAAAGAAAAACTGTGTTACGTGGGATATAATATTGAGCAAGAGCAGA 

aactggccttagaaaccacagtattagttgaatcttatacactcccagatggacgtatcatca 

AAGTTGGGGGAGAGAGATTTGAAGCACCAGAAGCTTTATTTCAGCCTCACTTGATCAATG^ 
GAAGGAGTTGGTGTTGCTGAATTGCTTTTTAACACAATTCAGGCAGCTGACATTGATACCAG 
ATCTGAATTCTACAAACACATTGTGCTTTCTGGAGGGTCTACTATGTATCCTGGCCTGCCATC 
ACGGTT GGAACGAGAACTTAAACAGCTTTACTT AGAACGAGTTTTGAAGGGTGATGTGGAAA 
AACTTTCT AAATTTAAGATCCGGATTGAAGACCCACCCCGCAGAAAGCACATGGTATTCCTG 
GG TGG TGCAGTTCT AG<X5GAT ATCATGAAAGACAAAGACAAGTTTTGGATGACCCGACAAGA 

AGCTTGTTCCCATCATACCCGTAATGC 1 1 1 C 1 1 1 1 1 1 CCTTT ATTGCGAATCTTT GAACTCATT 
CAACTCCAGGACATGGAAGAGGCCTCTCTCTGCCCTTTGAGTGGAAAGGTCAAGTTTTATTC 
TGGTGTCTTGGGGAAGCTTTGTT AAATTTTTGTTAATGTGGGTAAATCTGAGTTTAATTCAACT 
GCTTOOCTATATAGACTAGAGGGCTAAGG AI rCTGTCTGCTGCTTTGI I IC1 rCTAAGTAGOC 
ATrrAGATCATTCCTGTAGGCTTCCT ATTTTCACTTT ACTGCTCT AATGCTGCT AGTCGTAGTC 
TTTAGCACACTAGGTGGTATGCCTTTATTAGCATAAAACAAAAAAAACTTTAACAG^AGCTTT 
T AGAT ATTACTGGGATGGGGGGTGGTTCGGGATGGGTGGGCAGCTGCTGAAGCCTTT AGGG 
CATTTCCTCTGTAATGTGGCGCTTTCAACTGTACTGCTGCAGCTTTAAGTACCTT AAAGCTTC . 
TCCTGTGAACTTCTTAGGGAAATGTTAGGTTCAGAACTAAAGTGTTTTGGGTGGG ITTTGTTG 
CGGGGGGGAGGGTAACAATGGGTGGTCTTCTGATTTTTATTTTTGAGG I 1 1 IG IC AACTGGA 
GTACGTAGAGGAACTTT ATTTACAGTACTTTGATTTGGCAGG 1 1 1 I CH CTACTTGTGCTCTG 
CCTGGAGCTGTTTCCATATGATATAAAAAGCAAGTGTAGT ATTX^CATTACT ATGTGGCTTAGG 

gatttai rnsTrm I aaaatcaacgatgttagctgggattagactccctacagtccttcaatg 

GAAAAGTAACATTTAAAAATCCTTTGGGTAATTCAAATTACAGATTTAAAAGAGCT^ 
GGTGTTTTGTTAATGCTTCTrGTTTATTCCAGAAGCATTAAGGTAACCCATTGCCAAGTATCAT 
TCTTGCAAATTATTCTTTTATATAACTGACCAGTGCTTAATAAAACAAGCAGGTACTTACAAAT
AATTACTGGCA GTAG GTTATAAT TGGTG GTTT AAAA ATAACATTGGAATACAGGACTTGTTGC 
CAATTGGGTAATTTTCATTAGTTGJ II IGI I IUI 1 1 I GATTTGAAACCTGGAAATACAGTAAAA 
TTTGACTGTTTAAAATGTTGGCCAAAAAAATCAAGATTTAAI ! I HI I ATTTGT ACTGAAAAACT 
AATCATAACTGTTAATTCTCAGCCATCTTrGAAGCTTGAAAGAAGAGTCTTTGGTATTTTGTAA 
ACGTTAGCAGACTTTCCTGCCAGTGTCAGAAAATCCTATTTATGAATCCTGTCGGTATTCCTT 
GG T ATCT G AAAAAAAT ACCAAAT AGT ACCAT ACATGAGTT ATTTCT AAGTTT G AAAAAT AAAAA 
GAAATTGCATCACACT AATT ACAAAATACAAGTTCT GGAAAAAAT A I 111 I C 1 I CATTTTAAAA 
CI 1 1 1 1 1 1 AACTAATAATGGCTTTGAAAGAAGAGGCTTAATTTGGGGGTGGTAACTAAAATCA 
AAAGAAATGAT TGAC TTGAGGGTCTCTGTTTGGTAAGAATACATCATTAGCTTAAATAAGCAG 
CAGAAGGTTAGTTTT AATT ATGTAGCrTTCTGTTAAT ATT AAGTGTTTTTTGTCTGTTTTACCTCA 
ATTTGAACAGATAAGTTTGCCTGCATGCTGGACATGCCTCAGAACCATGAATAGCCCGTACT 
AG ATC TTGGGAACATGGATCTTAGAGTCACTTTGGAATAAGTTCTT ACATAAATACCCCCAGC 
CTTTTGAGAACGGGGCTTGTTAAAGGAOGCGTATGTAGGGCCCGTACCTACTGGCAGTTGG 
GTTCAGGGAAATGGGATTGACTT GG CCTTCAGGCTCC7TTG GTCAT AATTTT AAAAT ATG G G 
AGTAGAAAACAACAAAGAATGGAATGGACTCTrAAAACAATGAAAGAGCATTTATCGTTTGTC 
CCTTGAATGTAGAATTTGTTTTTGATTTCATAATT CTGC TGGTAAATGTGACAGTTAAAATGGT 
GCATTATGTATGT AT ATT AT AATTTAGAAATACCATTTTAT AATTTT ACT ATT CCAGGGTGACAT 
AATGCATTTAAATTTGGGATTTGGGTGGAGTATTATGTTTAACTGGAGTTGTCAAGTATGAGT 
CCCTCAGGAAAAAAAAAAAAAAAAATTCTGTTTTAAAAAGCAATCTGATTCTT AGCTCTTGAAA 
CTAITGCTACTTAAATTTOCAATAATTAAAAATTTAAAATTTT^ 

CTACATTTGAGAAGGG I llllll I AGAAAT ACATTTagtAAAGTcCcCaAGACATTAGTCTTACAT 

TTAAACI I I I MCI I I AAAACATG^TTGGtGGTT AACTTTTACACAGTTCTGAGT ACT GtlAAT A 

TCTGGAAAGTATCTTGAgalatCaglGGAAAGCTAAAcagtCTA^ 

GATTGagaaAAT AAAAT Caga I l l ll l l CA AAGteaaaoc 

>607 

NNHHNNNNt4ATCGGAGCATGGQAGGTTCCTAGGftACQCGGGAATT(»AACATCTAAG 
CTGTTCCCCCCACTCGAGCAGATCATACCAAGACCGCCGAAATTCGTATGCCAACATATCGA 
AGTATCAGTCGCGGACCGGACAAGCCCGGATTCGCCATGTACGGGCAGCTGTCCGGGCTG 
CrrTTCGCCTCCGCCTGTGGATGCTGCGCCTCTCOGAACGCAACATGAAGGTGCT 
GCCGCCCTCATCGCGGGGTCCGTCTTCTTCCTGCTGCTGCCX^^ 

GAGAAGAAGAAGGG GCCCIAAAGTCACCGTCAAGGTGTATTTTGACCTACGAATTGGAGATG 

AAGATGTAGGCCGGGTGATCTTrGGTCTCTTC<5GAAAGACTGTTCCAAAAACAGTGGATAAT 

TTTGTGGCCTT AGCTACAGGAGAGAAAGGATTTG6CT ACAAAAACAGGAAATTCCATCGTGT 

AATCAAGGACTTCATGATCCAGGGCGGAGACTTCACCAGGGGAGATGGCACAGGAGGAAA 

GAGCATCT ACGG TGAGC^KnTCCCCGATGAGAACTTCAAACTGAAGCACT ACGGGCCTGGC 

TGGGTGAGCATGGCCAACGCAGGCAAAGACACCAACGGCTCCCAGTTCTTC 

TCAAGACAGCCTGGCTAGATGGCAAGCATGTGGTGTTTGGCAAAGTTCTAGAGGGCATGGA 

ggtggtggggaaggtggagagcaccaagacagacagccgggataaagccctgaaggatgt 

GATCATOGCMGACTGCGGCAAGATCGAGGTGGAQAAGCCCTTTGCCAT^^ 
GGCACAGGGACATCI 11 CI 1 1 GAQTGACCGTCTGTGCAGGCCCTGTAGTCCGCCACAGGGC 
TCTGAGCTGCACTGGCCCGGGTGCTCGCATCTGGTGGA^ 
CACAGGCCCATGGACTCACTTTTGTAAC 

TCXSCCGTATAGTGAGTCGTATTACAATTCACTGGCC G rOGTTTI ACAACGTCGTGACTGGGA 
AAAGCCTGGCGTT ACCCAACTT AANN 

NACGAGGGGTACXXiCCTCACTCCCAC<3C»CCCCCCAGC^ . 
GCTGAGAGAACAGACACAGACCTGTOGGAAGGTCCTCTGCAGGTCCCCCTTCCGCT 
GATCGACTTCCGCCTtXaGGCAGTCAACATACTGOCAAGGAAATCTGATGTGGAAAG * 
TAGAAATAGTGCAGTTTGCTAGCCGGACACGCCAACTCm» 

AATGGGCT AAT AATGCTGGCAAAGTGGAAAAATGTGCGATGATTTCAAGCTTTT^ • 
CAAGCCATOCTGTTTGTGGACACTGCTGATCGCCTGGCCTCG 
. CCATGCAOGCCTGCCTAGTrTTGCCATCCCATATGCCATTGATGTACTAACT 
CCCACGGCTGCCAACCTGCATTAGGGATAAAATTATTCCTCCAGACCCAAT^ 
AAAAACAAGCCACACTOCATCAGCTGAATCAGANNNNNN 
^CGCCGCCGGACCGCGCCCCACCCCGCGCGCCCGAGCAGGGCGACTGTCATTA

gcttcctggacgggacccggggcgggatcctggtgtcct G AAAG gggcccgggcgaccc 

TAAGAGGAAGAACTTTTGGGGG<X2GGGTCCCCGGTCO^CGTCCCCTGGGCAGCCGCTAT 
TGTCT ACGCGCCTCG CTGG GCGGCGCGGGGGGCGTGATCGCGGCGGCCCCGGGCTCTGG 



GAAGAGCCTGGGGAGGAGGAGCTGCGAGCGCGGGAGACGAGCAGGAGCCGCGCGGGCC 

GCGGCCAGCGCGATGCCGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGCGGCGC 

CTGGGGGCGG«>GC»GC«CGGAGGACGCGGCCAGGCCCGC<3GC<MCCTG^ 

TT7GCTGCCTGCCTGGGTGAGCGGCGTGCCCCGC<5AGCGGCTCC<XXJACTTCCAGCACO 

CAAGCGCGTGGGCAACTACCTCATCGGCAGCAGG AAGCTGGGCGAGGGCTCCTTT gccaa 

GGTGCGCGAGGGGCTGCACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAA 

gaagagagccaaaaaggacacctatgtcaccaaaaacctgcxsgcgagagggtcagatcx^ag 

CW5ATGATCCGCCACCCCAATATCACTCAGCTCCTTGATATTTTAGAAACCGAAAAGAGCT A 
CTACCTGGTCATGGAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAG 
CGGCTGGAGGAGTCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCAC 

ctgcaccgggccggggtggtccacagagacttgaagatagagaatttgctactagatgaag 

acaataatatcaagctgattgactttggtttgagcaactgcgcagggatc^tgggttactcg 

gatccgttcagcacacagtgtggcagccctgcctacgctgcacct gaact gctcgccagga 

agaaatacgg cgcca aaatcgatgtctggtccataggtgtgaacatgtatgccatgttgacc 

gggacgctgcctttcacggtggagcctttcagcctgagggctttgtaccagaagatggtag 

acaaagaaatgaac^cctccccactcagctctccacaggtgccatc^ 

ctcctggaaccggatcctgtgaagaggccaaatattca^ 

TTAATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAAC^GGATTTCTCTG 
GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 
GACGTGATCAACACTGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACVI 0 CTC1 1 
AAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCC^ 

AGACCCGGCTCTACCAGATAGAAAAGTACAGGOCCCCCAAGGAGTCCTATGAGCCCTCTCT 
GGACACCTGGACA CGAGA TCTTGAATTCGATGCCGTGCAGGATAAAAAGCCCAAAGAACAA 
GAAAAAAGAGGGGAI 1 1 T CI I CATCGACCATTCTCCAAGAAGTTGGACAAGAACCT GCCCTC 
GCACAAACAGGCCTCAG GCTCGCTT ATGACACAGATTCAGAACACCAAAGCCC TCC1 GAAG 
(aACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCT gccgcaatatttt 
OCGCAAAACCTCAGATTCCAATTGTGTGG C 1 1 C I I C 1 1 CCATGGAGTTCATCCCCGTGCCAC 
OGCOCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCOCATCAGCCAGXaGCCCGGAAGCA 

ctggcatcccccacaaggaagaccccctgatgctggacatggtgcgc rcc i i ugagtctgt 
ggatcgcgacgaccacg t agaagtgct gtct ccctctcatcact acaggattctgaactccc 
cggtcagcttggctcgcagaaattccagc»agaggacgctgtccccgg gtctgccatccg 
gaagcatgtcgcctctccat actcctttg catccaact ct ggt ctcttttgctcacgaagata 
^gaagagcoococaaaagaggagggcctgtgttgcccacctc^ 
ocatgcagcctctggggagccccaattgtgtgaaaagccgagg 
catcggacagatgttaaggaagcgccatcagagtctgcagcc^tctgcagataggcccctg 
gaggccagcctgcccccactgcagccxxtagcccct^ 

atggg<ttcaagacccagtgct aacttgggccagcggggtttggggt atctctagaaaacag 

CAACTGAACAGAGCTCCACACATCTGTCAGGGTCTG 

AT0C1 1 AGTCCCAGCTGTAGCTGAATCCACAGACCCAAAGCCTGCACAACCCAACCTCGCTT 

*fiGGACCCCCAGAGATGCTGGAATCGCTAGGAGGGTTGGCTCCAGGGGCAGCCAATTCCT 

ATCATTCAGATC 1 1 CC 1 1 CCTCOCAAGTACTCACCAACCCCTTCCACTTCCCACTTGCCCCAG 

GCTTGGGGGGAAAACAGGGCATGAGCCTTCTGGGGCACTCAGATTATGGA 

I CI 1 1 CI I CACGCTGTGCT ACATGTGTGCCTCTCACAGCAGTTGGCCACAGTTACAGGGAGA 

GAACAATATCACAGTCATTCATCCaGGCCaCG rT T OC T Cl GOGGAGTGTAGCAGCCCTGCCT 

TTCATAGCAG<XUTTACCTGAAGGCCAGCAGGAGCCGGGGGCAGGCCC^ 

GGAAGATT3GAOAGGAGCTrCG<*AOCAA<^TCAAAOCAA . 

AGAAAGACTGAAGGAGACACTt>TTTCCCAAGCAAQATTTTGAT AGA 1 1 1 HU I IU I I UI IGI ' 

TGTTGAAACATGCT AATGATTGAATT ATCTTTTCCAAAGA 1 llllll I AAATGTGATGTCGGTA 

AATTGAAATAAGATAATrrTTTAAAACTTGGATGGAGAGATGAa 

' ' I CACAGGAGGGTTCAGTGTGGAGAGCAAAAAAGCTGACTGTGGTGA 1 1 rGCTC AGT 
GCTGTGG(XX^CAGGCAGGGCAAGTCTCGGTGGCCCTGTGTTCATCCTGTTGTTTAAGGCA
T AGCCCTGATCCTTCTGGAAGGCAT AGAACACGGTT ACAGCT GGTTCTGT AGAAAG AGGG A 
AAAGATGATTGTGACAATTCAGAGCATAACTCAGATGGOGAGAAGGCAGCATTATCTCTGTG 
CGATGCTGATTTCAAGCTGCCCACAGAACTGGTGCGGCTCAGAGCTCGGCGAGTTTGCTGG 



AG<iAAAGCCAAGCTCCTCAAACAGGGTTC^GTCCAOGTTGTGTrrrCACACC7TTCTC 
GATCGAACCAAAGATGTGTCTCCAGTATTGTGTCTGTGCCCCTGTGTGTGTTTTGTGGACAG 
CCGTGTTGTCTGACTGTATCCAGCTGGCACTrGACAGGGTGCAGTCATTGGTGAGAAGAAT 
CAGAAAAAG AAGACCC ATCAGCACAGGTTGG AT AGGGGTT AGTGTAGGAGACCAGTT ACAG 
AATGGCCTGGAGTCTT CAACTTTTGACCGGTGCAGGTGTGACCAGAGACCACCTCTGTGGC 
CACCTCAGAT^TCATCTCAGTCCAAGGATCCCAAAACAGAAATCTAGAATTGCAAAGCCAG 
CCTTT ATTTCCCTGGCAGGCAGCCTTGCCAGAAGGCAGAGAGGCAGTTCTGAATCATTCTCT 
CATTCACCAGTGGTGACATCCTTC^AGTCCCTC^CATCTCTGACAGAGCGGGACAGAATGGAT 
ATTTGGCTGACCTTGGTGAGACCTGGAGCTGCC I G 1 1 ICI TCCCTAGGGGATCACCACGGC 
TCTAGGGCATTCT AGGATGAGGTCAGACCCCTTGGCCATTGGTGTTATTTTTTGTATAGCTTC 

agactgggttccag aactt accattgaaaacagagctttt aggccaggtgtggtggctcaga 
cctgtaatccoagcactttgggaggccgaggcaggtggatc<3cttgagctcaggagt^ 
gagcagcctgggcaacatggt gaaagcccgtctct accgaaaaat acaaaaaaaaat agctg 
ggtgtggtggtac<3tgcctgtagtcccagctacttcx3gggactgaggtgggaggatclactt 
gaacct aggaggtosaggctgcagtgagcx^gatcatgctactgcactccagcttaggtg 

GCAAAGTTLAGACCCTGTCTCCAAAAAAAGAAAATAGAGCTTT ATAAGCAGAGAGAAGAAAAT 
AATCATCTAAGAC GATCT CTCCATTCGACATGAAGTGTACC I 1 1 1 C I A AAGACGGTTTCCAGC 
TGCAACGGCTCCACTTTGCAGGCTTGCAGGGTGT ATACCTGCGCATTGGGAACTTGCTGQA 
ACCCCTGATGCA 1 1 1 1 CC 1 1 GAGAGCAGGOGTACTTC0GGCTTG0CGTTAGCTTGTGGAGAA 
CGTGCTTCTT ATTCCTGGCAGGCTTCAAGAACAGCTGCACATGTGCCGCTAACTGACCGCGT 
TGCCATTGGCGACCT GGACTCTGAACTCAGGTTTATTCTAAACCCAGTGAGAGGTGAGGGG 

gagtgatgaaaggggatcagct gtatttgtgtgtg tgt gtgtgtgtgagcacctgacaaatc 
tatgaaaccgagtgaaaggagaaatgttagattctttattattttattatatttatatggaaag 
ctcx3actctccc 11 t ggtaagtccgaagcatgttgtctgttcgaccgtgactgtuttcctca 
gtctgtgcctgtgattccagtcaccctgt agttactgagagaaattgactggactgtcattg 
tgtgaagtctaggaggaaatgtc<>ttttaattgtatgatttggtcataagtaaggactatat 
ttatgtcaccattattagatatatgtacttttgtaatgact gtgaaatacacttttccctcact 

A O^ACTfjCTTCTTTA^ I l ^j£^ J^*^* J f J *^ 

GGCTGGAGTGCAGTGGCACAATCJV£^U3TTCACT GO^G^^r^ScCTCTCCA^ATTCA^G^G 
ATGCTCCC^TCTCAGCCTGCCAAGT AGCTGGGAGT ACAGTTGT GCACGACAATGCCTAGCT A 
A 1 1 1 1 1 1 1 1 G I ATTTTGTAGAAATGCGGTTTCGOCATGTTGCTCAGGCra 
GGGCTCCAAGGAJCTGCCCACCTrGGCCT CCCAAAGTGCTGAGATTACAGGCGTGAGCCAG 
TGTGCCTGGC CTT A TGT AAAGCTCTTGACCTAGCATCTGACTGGAAACAAGCGGGAATTGCA 
TTGGGGGGCATTTTCTGGGCCAG 1 1 1 CCC I GCCTT ATTTTACCT GCAAT AGGTGTGCCT ACA 
AAAGTCTTTCTGACATACACACTGCAGGTGGCAG 

GGATATT AGG ATC I I CCI 1 UGAGACTCTGAAAATGGCACACAAAGAATGGCAGTTATGAACG 
TGACI ICIAGAI 1 1 IGICCI I G I AGGAGGCCTTGGTCATGGATAGGGGAAGAGGGATGATG 
GGATGGAATGGGAGGGTGGGATtTrAGATGACCTCTTGAAATTGTCCCAGTTACT 
CATAACreTTCACAAAAOQTTTTrCATATATAOCAOC 1 1 fa 1 1 1 C CCCTGQGAAGTGAGGGGAC 
GAAGCCTTAGAGGGT GACATAGCT AGAAAGTGGAGAAGCTCACTGTTCACTGTGCAAAGCC . 
CCAGT AGTCGGATTCCGGGGAAAAGAACAAGTATCTAGTTTTGATGACAAT G I1IICII GT AA 
GAATCCCAACACTTTAGAATAGAAAAGGACCT AAGATTCTCTGGCTTiAGCCCCAGCCTTTCT 
GAAG AAGGGAAAACT AGGGACCCAGATGGTTTAAGGACACOCCAGAGTCCCCAGATGAGCT 
GGCATTTGAGCTGGTCTGGAGCAGAG 1 1 ICICCCICAAAGAACACAGAGAAATCAGAGAGT 
GGCTGCCATGGTCAGAGGGGGATGTCAACAACAGATCAAGCOSTCA 

CTGAAGTTCCT GCAGGGACTAGGTCTTGCATTTTT AAGT LC I i U CA AAACTGTGCAGCTTOC 
.TGAA CCTTATGC I <i 1 1 1 6 1 CCCAT GCACTCTTGAAGCTGGGGAAAGACAG C 1 1 I CI I TO GGA " 
Ac i c iu i ci 1 1 wi lAGTT GACTTTXXAAGTAAGCAGCAAACCCCTGGATAGCC I TGt I A TCTA 
CTTTAGGTATTAAGAGGCT ATTTiG^TTTT ACTAGATT AAATCAAT AAAACATTTGCT AT AGAGC ' 
CAT ATCATACACTCAAGCAAATTCAGAAAGAATGCAAAACTGCT A I C 1 1 1 1 GGGGGAGTCTC 
otAv»wwiG< iGGi 1AGTG1A1 « iAi i i il-i i i GTGGGG i CTTCl G I UA (J CJT ACAG G CACAG T A
AGAATAATTCAGAGCGGTACH'GGGAGTTGGTTCAATTTCATGTC^TCACTTTTCAATGAAGGG 
AGTGC^TTTCCTGAGTAGTAAATGAGTATATTATTTGTGGCAGCTTTTCTGTTAAGGGAGCTT 



TTTTGTGACACACAGTTATCTCTAGGAGTTGACGTCTGTGGGCACTAAGGGACTGAGGTTGG 
GGGAGAAAGCTGAGG^GGCTTTGGAAGAGAAGGTGAGGAGGATTTCAGGATTTGCAGTCCC 
CCTGCAX^TGTATCCACATCTGAGCTGGTGGTGTTCCAATAATGAGGC^TTTGGGGCAACCA 
ATTTTGGAyVGGATAGAATAGCTTTATGTGGAGCAAACTTGAGGATCAATTGGAGTTGAGTGG 

TTTAAAACTT aaagatggcacaggaagctgtatcatttacaggtgaaaaaaataaatgggtct 

GACNNNN 
>610

TTTTTTCCT ATATAAAGTG ATACT GAAAT ATG CT AATT AATATATT AATTTTAGTTAAAT 
GCTGCTAATATGCATAC CTCTT ACTTGAAGGTTTTrAATATGTTTTGATAACTTTAATAACTTC 
AGGTGATGTCTGTATAATTTTTAAAGTGCAGCTCTCTCTAACAAATGTGCCCTACAACTCCTG 
ATTAAACGGCGTCTTG 
>611 
>612

CCCAAACAGGTCl 1 1 1 1 ATTTAACATAAGGCCAAAGAAGCT ATCAGGCGTTGCTGAATACT^T 
CCACTAACTGTACAAAATA7TGACTGCATGCCTCGCAAACACCAAAATATCOGCTGGAATGC 
CATAGAAATAAATAACTTCTGCTATAAACACATGAAAACATATCAAACTGTTATCTCTTTAAAC 
ATATTGTAAATAAAAAAATTACCAGTACTTCTACACAATAAATATTAAGAAACCATTGACATAG 
TTGAAATGCACT CATAT AAATT AACAACTTT AATT ACATT AG CCAAACAGACATTGGTTAAAGA 
ACTGCATGT AGTATGCAAAACAAAACAAAACAAAAGAAAAAACAAAGTAAAAAACCAACAAAA 
TAGAA^AAACAAACAAACAACATC*ACCACAGAACATAAAAAG 

TCAGATTATCTTGGCTTTCATAATTATA i 1 1 1 ICI 1 1 1 AAAGAAAAATATCAACCCATTGTCAAT 
GCACTGIIII lO^GCATTTAAATAGAGGGTAAAACCCTTTGGAAATTAATACAGAAGAAAT 
GATTCAC7TTATGCATAAAAAATAAATAATAATATAGCTGAGACATGTGGTTTGCTTCTGCTCT • 
TGAAGATGTGAACAGCTTCTAAGCATTC^TTTTCTCTGACCCATACAACAGCT^ 
ACAGGG IE AA T™ AA ^^ 

^ A ^^ A ^^GTGGGTTTTCAGGG^TGACTAAGTTTTTCCCT ACATTGAAAAGAGAAGTTGCC 
AWGGTGCACAGGAAATCAT TTrrTTAA GTGAATATGATAATATGGGTCTO 

GAAAAAAAAATAGAAAGTATTTGTTCTACCTTTAAGGAG ACT GCAGGGATTCTCCTTGAAAAC 
GGAGT ATGGAATCAATCTT AAATAAAT ATG AAATTGGTTG G TCTTCT G G GATAAGAAATTCCC 
AACTCAGTGTGCT GAAATTCACCTGACTTTTTTGGAAAAAATAGTCGAAAATGTXiAATTTGGT 
CCATAAAATACATGTTACT ATTAAAAGATATTTAAAGACAAATTCTTTTCAGAGCTTCTAAGAT 

T GGTGTG /^ CAGA ' nmAAGA ^^ 

^^jJJ^^Jf'^ ^X^^ ^^TTCAAGAAAAAAACATAtCT ACT AAGAATCCCATATGTTATA 
AT TT^A GCCTTT ^TrTTQGCCTCATGCTGCTAGGTGAAAGAGTGGTTGTTt^f^QRAfrrTft 
TATTTTCCAAGATGATTAAAGATGGTAAAATACTATCTTTC^ 

ACJTATACTTCT ATTTCAAAGCCATATAAATTTAACAAAATTAAAGTTTGTGGGTGTTTGATAA 
CCCAATCACT CAATATCCAATTAAAATATGGAATAAGTTTCAAATAAAATATGGAATTACATTT 
CTCTGCTTCT GATAACTGTGGTCACTAATCAACXXXX^TGTTATCCCCGATATGTCTAGGACT 
TAGCTTAAAAAGATAGCAGATGTATTTGAGATGAGTG 
TTGGTAAATTrCTGCTTTTTCCATG<3CCTTGACCATC 

TT AATTT CTCACAC TGTTT CT GC^GTTCCGGTACCAAAGAAGATGXAGTTCAAAATACTGCC 
AGTmCCAAGAAATT^ 

CCACACCCACCTT CCCAC^TGCATGATATCCAAGGTCGACAGAGCT GGATTAGAATGCACTC 

I* 2 ™!*' 1 • « I CAAGGAGAGCTAAAGCACCTTTT GTAATGTTTGGTTTCTCACTGTTATCTTC 
CTTIGCTATAATTAATTTATTTTAATCTACAAATTGACATAGGGCT AAAAGCTTCAATATTTTAC 
A i^3 A Z! AA ^ AATGTAA1TC ^ CCCAA ^ 
CCAAC 27 CA 5t!^J^^^ 

TTATTAGCTATTCCAGGX5AAGGTAAGTACTAGAATAAACTAGATAAAAACTTGGCTTTAAGCA 
IS7^^ C r rTGAT ATTTAT AAAAC AAAG G T G 1 1 1 1 1 1 1 f > CATTTCTGC ATCTGAATCAAT ACAAAT
II ACAC J^ GAAGAAT ^6G^ 

TjGCCTCCTCTTAACT GTTCTCTTCCTT ACTCTCCCTCTCTTAACACTTTCTCTC^ 

GTTTT AGTTAGTGATGACATT ATTTTT CCCAGCTG GT ACAGGT AAT ATTTCTCACAACC AAG C 

CJTAATCATTmAAAAAATAC^CTCAACTCT^ 

c ^2 ga ™ aagat j ga ^ cca ^^^ 

TT ATCATCCTGTTTTCCCCACAGG GG AT AACTTGTAGAAGTGGGAGGGGCACAAAAAAG AGG 

AGGAAATTTCGGACGCGTGGGTCGACTCCCTATAGTAGN 

>613 

>614 

"NAAGAATTaSGCAa^GGGOSCCT^^ 
ACGGGTCGGGGCCTCTCTCCCTCT(XCTGTa>C^^ 

GCTAGCAGCGAGGGCACCTTG TGAT C^TGTTGTTAAAATTATGAATCTGATTTTTATGATGAT 
GAAAATTTTACCAGCAGAAGGATTTT TTAAA G I I I t 1 1 1 1 1 » H I 1 1 AAT AAT AATCTAGGCATG 
AAGAGCAAAAATATCCCTTCCGGAGTCTrTGAAGCTGAAAATATAAAACAAATAAAAAATAAA 
AAMTAAAAACCCACAAAAATGTTGAACCAAACCTCCCTGCTAA^ 

TCCCACCCTGTTCCCAGTCTT CT GACAAACTGTGTACATAGCGGACTGCTCC1TTrTPJTTr.f: 
G ^^ TG - G J^ AAAGGCTT ^ GGTGTATAG ^ 

TTCCTCcc^wnTocAaxxrrTcc^ 

GTTTACCCGGUIiCI 1 1 I I AAGTAATGTGGAAGAAAATGGTTTATTTTGTATTGTGGTATTGAA 
T ATTGTGTTCC 1 1 1 1 1 ATG AGGCAACCTGATTGTAAACTTCATGTAACT ATAGACTGtfaAAAAAA 

T ^? CGTGGCAAAG ^ 

CACC . A j CC ^CAACAACGCTTGTGAATGT^^ 

AGTGCI 1 1 1 GCAAGTTCAATTTGTTAGTTCCT GT ATGAAAGATTGTGGGGGAAAAATAAACGT 
^?^^ C IIIIl C 3 TAA 7 AAC ^^ 

TTAACTATAAAT ATGT ACAATTG TGG G CAT AAACTGTTTCAGATTTTTT ATTTGAAGGTTTT AA 
SI^SHE^IS 4 ' L'^' 1 GTGATGTTTTGAGAGTAATGCATACAGAAATATAATAAAATGTGTT 

G £ AA ^ CATGAA p^^ 

>615

>616

NNGCAGAC^GTGACTTCGATGCTAAGAG<^TGCGGATGATGTAATAGAAGAAACT 
AGAG * GAAC TSCAGAA^^ 

GGAGTtX^AACCACTCTCGTTCACCTAGTGTAGAAAGAGGGCAACAGTACGGiCAAGAAGAA 
GAATGA6AAAAGAGGGCTGCTGCGCTGGAAGCATGTTTGGATGTAACCAAGCAGA 

T c ^5I GGA J mATAGGCACCTA ^ AAAT ^ G ^ 

AGCTTTCGTGAAGCCAGATCTGGTATAAAGGAAG* 
AAGTTGAAAAAACAG^TACCACAAGAGAAATGC^TTCTTCAAACT^ 

AAACCCAGATGCAGACAGTGACTTCGATGCTAAGAGCAGTGCGGATGATGAAATAGAAGAA 

^ C 7 AG / GTGAACTGCAGAAGGGAAAA ^ T ^ 

ACAGGAGTC^AAACCACTCTCGGTCACCTAGTGAAGAAAGA^ 

^CGAAAGGATCACGAACGTCGAGAGGACATGAGAAAAGGGAAGATCAGCACCAGCAGAA 

GCAA I CGAGAGACCAAGAGAACCA ™ 

ATAGGCACAGAGAGGCCAGTC*TAGAGATTCCCATTGG 

ACCAAGGGCGAGGGACCAAAGAGAAAGAAGTTGACAGAGTATGGAAAAGGGAGAAAGATAG 

GGAGAAATATTCCCAAAGAGAACAAGAAAGAGATAGACAAGAAAATGATC^GAACCG 

GTGAGAAAGGAGAGAAGGAAGAG^AAAGCAAAGCAAAGGAAGAGC^TATGLAAAGTAAGGAA 

GGAAAGATATGAAAATAATGATAAATACAGAGATAG^GAAAAACGAGA^ 

CJTCAGMAGAAATCAAGACAGAAAGGAAAG<^CCCAAATTCT 

GAAACCCTCT AATTCTGAATCATCACTGGGAGCAAAACACAGvkCTrCA(^GAGGAAGGG^AA 
GAGAA ^ G J AAAG ^^ 

A J G ^ GAAA SI G 3 AA J GTCA ^ 

TGCAAAGACCTATATTGAGAAAGAAGATGATTGATGGCT ACCCCAAGAGAAAGATTTAAGGA 
AGCAC^GAAAACTG TAATT CCTGGA ACCT GCTGCGTAAAACCATAAAGGAGTGTGTTACCAG
TAGTTTGGAGGGCAJ 1 1 1 1 AAATTTATTTTCAAAATTTTAAGTTAAAAGTCAGTCTTAC^GCTT 
GG^GTTTGGATGTGGATGTTTGGCTGAATTTATATATAGTGTGTACTCATCAATACCACATT 
CTTTGTTGTATTCAAGAACCGTTAAGAGTGTGCT AATTCCCT GTAG GT ACATAATG AGGAAAA 
TTTGCT CCACT ACAACCATTAAAAAATAATTTTGGCCAGATACGGTAGCTCGTGCCTGTAATA 
CCA £? A ,^ GGGAGGCC '^ 

AGGGCAGGATAATAAGACCTTGTCTCTATTTAAAAAACAAAAAGCCTAGCATGGTAGTCCAT 



GTCGACCCACGCGTCCGGGCGGCTTCCGTAGCGGGAGGGCGAAAGATGGCGGCG 
GCAGTACTGGGACAGTTGGGTGCGTTATGGATACATAACCTGAGGAGCCGGGGGAAGCTG 
GCCTTGGGTGTTTTAGCTCAATCATATATCCACACAAGTGGTT'C ICI I GACATTTCTCGAAAA 
TGGGAGAAGAAGAATAAAATTGTTTATCCTCCACAACTGCCTGGAGAACCTCGGAGACCAGC 
AG^TCTACCACTGTCX^GACAAATAAAATATAGCAAAGACAAGATGTGGTATTTGGCAAA 
ATTGATACGAGGAATGTCTATTGACCAGGCTTTGGCTCAGTTG6AATTCAATGACAAAAAAG 
GGGCCAAAATAATTAAAGAGGTTCTCTTAGAAGCACAAGATATGGCAGTGAGAGACCATAAC 
GTGGAATTCAGGTCCAATTTATATATAGCTGAGTCCAC^rrCAGGACGAGGCCAGTGCX7rG>^ 
ACGCATCCGCTACCATGGCAGAGGTCGCTTTGGGATCATGGAGAAGGTTTATTGCCATTA 
TTGTGAAGTTGGTGGAAGGGCCCCXiACCTCCACCT G AGCGACCAAAGACGGCAGTTGCCCA 
TGCCAAAGAGTATATTC^AGCAGCTTCGCAGCCGGACCATCGTTCACACTCTATGATGAGGAG 
ATTCAGACTCCACAGTGTATATATTTTG<X^TTTATTTTCTAAA 
ACACACCCCCC<XXXX^CGACCGC^CAAAACACCC^ 
CCGACTC»3GJKCG<XX3CACAAC<>AAACACCACAG 
>619 

CGAX^CACGCGTCCGCCCTGAGCCGGCGGAGGAGGACAAAAACCGGCGCGACCC 
CGGCAGGGTGGGAAGTGCAGG^CAGCGCTCCCAAGACACGCTTGTTGGAGGTTCGGGCCT 

GGGTGCTTGGTTGTCTGAGCCTCCI 1 1 1 1 1 gtgtttgcctgggtcoctcgtggccgacgga 
agaatgaaggattggagtaacggatgctccgcagagtgtaccggagaaggaggatcaaaag 

AGGTGGTGGGGACTTTTAAGGCTAMGACCTAATAGTCACACCAGCTACCATm 

AAACCAGACCCX^TAATCTGGTTTTTGGAACTGTGTTCACGGATCATATGCTGACG 

GTGGTCCTCAGAGTtTGGATGGGAGAAACCTCATATCAAGCCTCTTCAGAACCTGTC 

ACCCTG GCTCATCAGCTTT GCACT ATGCAGTGGAATTATTTGAAGGATTGAAGGCATTTCGA 

GGAGTAGATAATAAAATTCGACTGTTTCAGCCAAACCTCAACATG GATAGAATGTATCGCTCT 

GCTGTGAGGGCAACTCTGCOGGTATTTGAGAAAGAAGAGCTCTTAGAGTGTATTCAACAGCT 

TGTGAAATTGGATCAAGAATGGGTC«>TAT^ 

ATTCATTGGAACTGAGCCTTCTCTT^ 

TCTTGAGCCCAGTGGGACCTTATTTrrCAAGTGGAACCTTT AATCCAGTGTC<XT<^GGGCC 
AATCCCAAGTATGTAAGAGCCTGGAAAG GTG GAACTGGG GACTG CAAGATGGGAGGGAATT 
ACGGCTCATCTCTTTTTGCCCAATGTGAAGCAGTAGATAATGGGTGTCAGCAGGTCCTGTGG 
CTCTATGGAGAGGACCATCAGATCACTGAAGTGGGAACTATGAAT CI I 1 1 IC I I JA CTGGATA 
AATGAAGATGGAGAAGAAGAACTGGCAACTCCT CCACT AG ATGGCATCATTCTTCCAGGAGT 

gacaaggcggtgcattctggacctggcacatcagtggggtgaatttaaggtgtcagagaga 
tacctcaccatggatgacttgacaacagccct ggaggggaacagagtgagagagatgtttg 

GCTCTGGTACAGCCTGTGTTGTrrGCCCAGTTTCTGATATACTGT ACAAVW3GG3AGAGAATA 

CACATTCCAACTATGGAGAATGGTCCTAAGCTGGCAAGCC^ 

TATCCAGTATGGAAGAGAAGAGAGCGACTGGACAATTGTGCTATCCTG^^ 

gatacaatggaa^agaggataccaactgtatgctact 

TGTGATAGATTTCTTTGGCTACCTGTGCATAATGTAGTTTGT AGT ATCAATGTGTTACAAGAG 
TGATTG .mum CATGCCAGAGAAAATGAATTG CAATCATCAAATGGTGTTTCATAACTTGGT 

AG J AG J AA S3I ACC ™ CC II^^ 

GTTAAGGTAAATATCTGI 1 1 I C I I CC I GATGT CACAG CCTGAGCAT ACT CTGTGCATT AGGAA 
ga<x)tgagtgcatttcc^accattgtcctttccacattatgttgtagctggctggctgtcag
gcgact acaagactgagggtgttgtgccttatagatctttgtatcccccatggct 
agtaggtactcagtaaatggttttataatgaatcagtgaaca i 1 1 ig ci i ctatagaagtgta 
ccttctttgtttct atattatgaaacctctttattagaatttgtgattgattctgacagtgtata 
gatttaccttatattgtctttattttccatgagctactaagtcattagagatactct^ 
agttagtttaggaaatcacttcatattgattgtattagaattatcttggaattgaagatatatc 
cctagagcaggggaccccaacccccaggccatgggccacacagcaggaagaggtgagtgg 

TG GGCCATTGAGG AGCTTCATCTGT ArTT ATGGCT ACTTCCCATCACTCG AATTACCACCTG A 
ACTCCACCTCTTGTCAG CTCAGTGGCAGCATTAGATTCT CATAG AAGCACAAATCCTATTGTG 
AACTCTGCATGCAAGGGATCTAGGCTATGCGCTCCTTATGAGAATCTAATGCCTGATGACCT 
GAG GTGTAACAGTTTC ATCCTGAAACCACCCTTCACCCTG CAGTCT GTGGAAAAATTGTCTT 
CCACAAAACTGGTCXCTGGTGCCAAAAATGTTGGGGACCACTGCTCTAGAGAGAGGTCATG 

atatcataccaaccaa atggaaatgacaaatgttttatgtcaagtgttaattgcagaaataaat 
C I M M f M I M M M iggtagaaaacaaagaggcatactctgatttttatactctgtttttgca 

GGTGCTuM I tCTTT GAATGGAGATTTGATGAGCAAGTGGTTAGGATGCAGGGAGAGCTACT 

ATGGGTGATATTTTCCTTGTTTAGGAGCTGTGAGTTAAAATTGTATCCTTTCTGGTTTATCTAA 

GGAAAGTCAAATCTTGACAGAAAACATTTTTCCTTGGAAGGTCAACTrCTCAGACAT^ 

TGGTTTCCCT GAGTCCTCATAACTTCCTTCTTGCTGAACATATTTTATTCT 

GAAAATAAAAAGGATTCTAAAAGTTTGATGCATTGGAAAAATTTCCTTGAGGCATTTA 

ACATAG*AAATGGGCTTTGATTCTTTTCCAAAACTT^ 

G ATAGT AAAATGAAAATTGAGAAATATAAGATGAAAAGGAATGGTAAAAATATCTTTTAGGGG 
G CTTTTAATTGGT GATCTGAAAT CTTG GGAGAAGCT GTTCTTTT CAGG CCT GAG GTGCT CTT 
GACTGTCX3CCTG<XSCACTGTGTACCCCGAGCAACATTCT AAGGGTGTGCTTTCGCCTTGGC 
TAACT CCTTT GACCT CATTCTT CAT ATAGT AGTCTAG G AAAAAGTTGCAGGTAATTTAAACTGT 
CTAGTGGTACATAGTAACTGAATTTCTATTCCTATGAGAAATGAGAATTATTTATTTGCCATCA 
ACACATTTTATACTTT GCATCTC CAAA TTT ATTGCGGCGAGACTT GTCCATTGTGAAAGTTAG 
AGAACATTATGTTTGTATCA 1 1 1 (J 1 1 ICATAAAACCTCAAGAGCATTTTTAAGCCCTTTTCATC 
AGACCCAGTGAAAACT AAGGATAGATGTTTAAAAACTGGAGGTCTCCTGATAAGGAGAACAC 
AATCCACCATTGTCATTTAAGTAA TAAGACA GG AAATTG ACCTT GACGC 1 1 IC MU I I A AATA 
GATTTAAGAGGAACATCTGCACATCI I 1 1 I ICC I I GTGCACT ATTTGTTTAATTGCAGTGGATT 
AATACAGCAAGAGTGCCACATTATAACTAGGCAATTATCCATTCTTC^GACTTAG 
CACACT AATTT^TCGTTTAAGGCATAAGATGGTCTAGCATTAGGAACATGTGAAGCTAATCTG 
CTCAAAAAGATCAACAAATTAATATTGTTGCTGA^ 

GTTTAATTGGGTTGATCAAATGAGATTCAGCAATT<2ACAAGTGCATTAATATAAAC 
GTGGCACTTAAAATGATAATGATTAACTTATATTGCATGTTCTCTT CCTTT CACTTTTTT CAGT 
GTCTACATTTCAGACCGAGTTTGTCAGCTTTTTTGAAAACACATCAG 

a^a^Q^ct gtcaagacgaaggcaaaacctgagcagttcct AAAAAGATTTGCTGTTAGAA 

Am i iui i GTGGGAGTCATTTATTAAGGATTCAACTCGTGATACACCAAAAGAAGAGTTGAC 

ttcagagat qtgttccatgctctctagcacaggaatgaataaatttataaca 

ct1tg ttttcaaaagcacaaagg aaaagtgaaagggaaagagaaacaagt gactgagaagtc 

TTGTTAAGGAATCAGGi i M 1 1 CTACCTGGTAAACATTCrCTATTCTTTTCTCAAAAGATTGTT 

gtaagaaaaaatgtaagacaaaaaagggaaaaaaaacaaacaggggcagaggcaggcagta 

gcaagaaagcagagcgtaaacatcagctagatggtaacatgcaawtcagctctc^ 

catgggaaacctaagnn 

>620

NNAGTGT ATTTATTAACATGACTAATATTTCATATTTTA 1 1 1 I GI I AGAAGATTAATTTG 
T AATCATTGTACCCT GTAGATTTTGAAACAATTTTACT AGAAATTACTAATGCTGGTT CTTGAG 
TAACCT AAAAAA TTGT ATTTACT CAAATTTAGAATGCTATTTATAGTTCT AAGCAGTTAGATGA 
M ^ATTCTGATT ATTCTAAAG TTGCTTTGGT AiCAAAATTT AGAAT AAAGCAAAACTGGCAGCAC 
C^CAACAAATGTGATCCrrTCACAGCAGGTCTAAAGACCCT 
TAAAACACACATAAATAAGAGCCATrCTCCTGACAATTTTXrr^ 

AAATCCACATTCAGT AAGTTATACTGGAGGCOCAAATTTC^TGCTCCATGTAATTATTCCTCA 
ATGATAGTGTAAAAAATGCTTATTATTGCTAACTGATGTCTAATTCATAGTTGAG 

A T CA J GAGGCGAA J G ^^ AGAGTGGTACAT ^ 

TTTTAATGCAAGGTTT AATTTAGTrTrTTATTACAGATTTATTGAGGTATAATTGACATACAACA 
AACTGCACATACTT AAAGTGCACAATTTGATACACAGTCTTTTAAACAGCATGAAAAGTGCCT 
GATAACTTAAAAATGAAAAAAATTTCAATTTACACATATTATTTTAAAATAGT ACATTCTAATTT
TT ATGAGACATAGATATGTATTTATAAAAAGGTAGATGGAAAGAGAAG AAATTAACTTAATTCT 

aagagccaaatttactcagaaggtttagaaacaocaaaattaacagccag I 1 1 IC I IGATTTT 

CTTCTTGAAGAAGAGATTGGTGTTGACTTATGGTGAGATATACTATGGCCTTGAGAGGCAGT 

ttcaacttgaaaaqaagatgcaggttgagcaatcggagaggacttcaaagaagctgatgagc 

tctcccgtggacttactttgacaatgttggaagaatctggctggctagtctgaactggagtg 

gcttgagaactctgggci lu ch attctcaaagtt c i i 1 1 i gtttgcaacc c i i 1 1 1 1 iagtaa 

cctgcagaggtataaactgattgtgcacacccoctggtattcgcccagccatgggcatggt 

cccagaatataaagtatgatggaagggcttcccaggaactggcaccgatggtcgccatggc 

a ttgag ccaaagagatgagacgacgaaggcattatattagctgttggtggggccaaggaaa 

atttttctti i agtataatggtatcatcaaagctctgaaatttgagacctccttctgctagcct 

TTTTGGG C I l ll ll l l I (^CCTTTGGGAAAAGGCCTACACTTACTTCACCTATAATTACTCTTC 

cttacaacattcaacctcagggatcctttcctagggactgtgtgggactatcgacacggaat 
ggaatttatcattatactgggcctgaaaagggaggaatactactacttaatatcatatgctga 
tacaaggtttagggggntatttgcaattatatttataaaaaaahn'aaagggggcagaaaagcg 
nccccngtgaggnggaagcnkn 

>622

ngccttggtggccataggcttctctttaaccaggaaaaagatatgcatgtgctgtaa 
gtccctaggtgcaagc 1 1 1 1 ici i gttatgttttaaacagctttataaactattgttcatagaa 
gatattatgtacatttatttcagataaaggacaataagtttactttgtatctgaactcaaaaca 

AAGTAGTTGTATATTTTAACATTGAAAATTGGGATTTCCCAATGTGACACATCATGAATGCAAA 
CCOCTCCAGCOCATCAGACGCCAGGCTGCCTACTGGTAATCTGTGTATAGTATATAAACATG 
TAAAAATAGGTTGT ATTrTACTCTATGTATGATGCTAATCAATGAACACTTTATTTATTTTACAG 
AGAAAACTTATCTQTO AACTTTACTATATATCTGTTTATTTTACTTT A 1 1 H 1 1 1 M I AAATAAAA 
AGGGTTTTAAATGCTATGCAGTCATTAGTAGAAAATTTTTTAGGACTCTGCCTGCTCTGTA 
TATCTTAATATGATCTGGCAGAAACTCGCATGTATCCAAGTAAAGTAGTTTAGCTAAAGAAAG 
GTTCTTCATTGCTTTTCTGTrCACAGTTGTGGCTCTGTTTrTTAAGAATGTAACl^ 1 1 1 HA G 
ATTATACTTGCATCTGTGACTTTAGTACCAGCCACGTTGACACAAAACAGGTTXrTGGTTCAGG 
TAAAGTTGGGTCAGTCACCTGCAGCAGAAATCCCTCTTCATTCCTCTTCTCTGTGTTCATTCC 
TCTTCTGTGCTGTTCT GAAGCTTCT ACCAAT ACTCTTTCCATA7TGTCTTTTT CAGTGAAGAGA 
AATGCATTCAAGATTAGGTCCCTCCTGTCTATCCAGTTTCAGGATTTTATGTTGTTTT ATACAC 
AGTTATTTCAGTATAGAAACTGGCTTT ATTGCCAAGTG 1 1 1 1 1 1 1 AAACATGTTTTAACT CTCA 
TATGAGGAAACTGTCCAACTTCAG 1 1 1 1 1 UATAAGATTAAACTTCTT ACGATCAAATTTGTCTC 
TTGCAATGATGTGATGAGTTGCCAAATAATTGAGATTATTTTAAAATG IIIIGII CATATTCTT 
GTTTTATAATTAAAATTTACATTCAGTGTGTATGGG 1 1 I I 1 1 1 H 1 1 ATTTTGACT CTT AATGT A 
AGGTGGATATTTCT GTCATTTTAGATGGTTTCTTACTGAGATTTTAT ATATAAATTATAAAATGT 
TTACCAAAAAAAAAAAAAAAAAAAGATCTTT AATTAAGCGGCCGCAAGCTTATTCCGCT AT AG 
TGA 
>623 

NAGTTAAGCAGACTTTCAACCACTGAG C: 1 ICI M AAAAACTTAAAAAAAAAATCTTATT 
ACCAAATCTCAGCTAGGACAMAGTGTTGCTACCGGGCAGGTACAGCCATTGC I C 1 1 I U AGTT 
TGGTCTGGCT AGCAAAAAGCTGGC7GTGTT ATGTAAATAAAGCCCCT ATAGT AATTAAAATTT 
AAAAAAAGTTTTTTAAGCTGGCTG 1 1 1 1 CO I AGCACTT CAGAGTCCTTGACCCOGTAATTT AG 
GGTTCCCCTTCAGATTTGACTAAGACAGAAACAAACAACAAAACAGTTAAGCAAAACTAACA^ 
TGGTCACAGAAATTATAGAATTTCTGAGTGCTCTAAGTGCATTGGAAGAAAGCTGAAACTtXA 
TAAAAACATCACCTGCCTTCCATCATCATGAAAGCAGGAAAACTTGCCTTCCTTGTTGGGAG 

caagtaaaactccaaaaaaagaggtgttgtacagcaaaataaacttccgatctctaocaaatt 
ttaagagatcagggattttctggagggagtcctctcagagctcagcaaattgtcct attggt 
ttgagccataaagttagctcatgctggtaccaagcacxx^ 

catctccacccagaatccattcatggtttccaaaatgtgaagccagaaaatccgagacaggt 

ATCAG
>624 
>625
>626
>627

acgcggggagacacacacagccctctgcccagctctgcttcctctaggaacacagg 
agttccagatcacatogagttcagcatgaattcactcagtgaagccaacaccaagttcatgtt 
cgaoctgttccaacagttcagaaaatcaaaagagaacaacatcttctattcccctat^^ 
cacatcagcattagggatggtcctcttaggagccaaagacaacactgcacaacagattaaga 

gataggtcaggaaatgttgatcaccagtttcaaaagcttctgactgaattcaacaaatccact 
gatgcatatgagctgaagatcgccaacaagctcttcggagaaaaaaostatctatitttaca 
ggaatatttagatgccatcaagaaattttaccagaccagtgtggaatctgttgattttgcaaa 
tgctccagaagaaagtcgaaagaagattaactcctgggtggaaagtcaaacgaatgaaaaaa 

TTAAAAACCTAATTCCTGAAGGTAATATTGGCAGCAATACCACATTGGTTCTTGTGAACGCAA 

tctatttcaaagggcagtgggagaagaaatttaataaagaagatactaaagaggaaaaatttt 
ggccaaacaagaatacatacaagtccatacagatgatgaggcaatacacatcttttc^ttt^ 

CCTCGCTGGAGGATGTACAGG4X^AGGTCCTGGAAATACCATACAAAGGCAAAGATCTAAG 
CATGATTGTGiTGCTGOCAAATGAAATCGATGGTCTCCAGAAGCTTGAAGAGAAACTCACTG 

ctgagaaattgatggaatggacaagtttgcagaatatgagagagacacgtgtcgatttacac 

ttacctc^ttcaaagtggaagagagctatgacctcaaggacaggttgagaac«atgggaat 

ggtggatatcttcaatggggatgcagacctctcaggcatgaccgggagccgcggtctcgt 

gctatctggagtcctacacaaggcctttgtggaggttacagaggagggagcagaagctgca 

gctgocaccgctctagtaggattcggatcatcacctacttgaactaatgaagagttcx^ 

TAATCACCCTTTCCTATTCTTCATAAGGCAAAATAAGACCAACAGCATCCTCin'CTATGGCAG 
ATTCTCATCCCCCTAGAnSCAATTAGTCTGTCACTrcCATTTG 

TTCTGGTAAACTGATTGGTGGGAACAAGAGATTCTCTTGGCTCAT A I NCI 1 1 I C I 1 1 C I C ATC 
TTGATGATGATCGTCATCATCAAGAATTTAATGATTAAAATAGCATGCCTTTCTCTC 
TAATAAGCCCACATATAAATGTAC1 1 1 1 1 C 1 1 CCAGAAAAATTCTCCTTGAGG AAAAATGTCCA 
AAAT AAGATGAATCACTTAATACC<jTATCTTCTAAATTTGAAATATAAT^ 

TTITA AATGAACCAAACCAAATCATA C I II I I C I 1 1 6AATTTAGCAACCTAGAAACACACATTT 
CTTTGAATTTAGGTGATACCTAAATCCTTCTTATGTTTCT AAATTTTGTGATTCT ATAAAACAGA 
TCATCAATAAAATAGTGACATAAAAAAAAAAAAAAAA 



GCGTCCGGGGAAAGCATCTTrGGTGAGCGAGGAAGAGGAAGATGAAGAAGAAGAT 
AAGGCT ACCCCTAGAAGAAGGAGGTCT AGAAGAAGT AGT ATGGT CTTCGAGTAGCCTTTCAG 



GCGCACGCTTAGAGAATGAGAAAAAAACAATTCTTGAAAGAAAGAAA6ACTGTAGACAGGTG 
ATACAAAGGGAAGATTCTACCTCTGAGTCTGAGGATGACTCTCGGGATGAGAGCCAGGAGA 
GTT CAG A T GCTTT GCT G AAAAGGACCAT GAACAT GAAGGAGAACAAAGCCATGCTT GCCCAG 
TTATTGGCGGAATTGAACTCGATGCCAGA I 1 1 CI I CCCAGTACGAACCCQAACCTCAGCTTG 
TAGGAAGAAGACAGTGAGGCGGGCCTrCTCGGAGGQAGAGATCACGCGGCGTATGAAGCC 
AACCCGGAGTGOGCGGOCTCCTGAGAAGTTTGCTCTAGAGAACTTCACTGTCTCAGCCGCT 
AAATTTGCGGAAGAGTTTTACAGCTTCCGAAGAAGGAAGACAATTGGGGGGAAATGO 
AGTACAGA CGACG TCACCGTAT A I C I ICI I 1 1 CGGCCAGTGGAG GATATCACCGAAGAGGAC 
TTAGAAAATGTTGCCATAACTGTT CGAGAT AAAATCTATGATAAAGTTCT GGGTAACACGTGC 
CATCAGTGTCGACAAAAGAOCATCGACACCAAGACAGTGTGTC 

GTGTGCGAGGACAGTTCTGTGGACCATGCCTGCGGAACCGCTA7X3GGGAGGATGTCAGAT 
CGGCATTGCTGGACCCGGATTGGGTGTGTCCOCCCTGTCGTGGGATCTGCAATTGCAGCT A 
CTGTCGGAAGGGAGGAGAG<>GGGCGCAGAAGACACAAACTGGAAGAAG<3TrGAGATAGGA 
ATTAGACCACAAGGGGGGAAAAACATAGAGGAGAGAAAACCCCCCAAAAAAAAAGGTATCG 
CAACAAACATGCATACCATTGGTGCCTAAGATTTTTTT ACAG TTGTGTTTTTATACAGAAATTC 
TTTGT AGAAATTACTA 1 1 M I IG 1 1 AAAGATTGTTTAT ATGCTT ACAAAGATTTCTCAGGAAGA 
CAGCAGAGCAGAGGAATCT ATAT AGATGTATGCACAGACCTGTCTGTATGCT GAACTTTGTT 
AAAAATATCTGCGAGTrATTAAAAAGGACAGTTT AAATGGGGTGGGGTT AAAGTTCAGGTAAG 

TAAGTTAGAGAGAAAACATTGTATGATCAG ctcctgcacttgatctatctttggcttcccaaa 
CAGTAACTGACT C<>^G^AAGTmGCXm^AGCACGAGTGACX>CAGrrTAATAGACCACAC 
ACATCGTTTAACCTGCTCTTGGTCATTGGAA^ '. 
AACAGTTTTATTTrrATTAAAACTTGACTGAACAAAAGGGGAGCATCAACATTGTAGA 
GAGGCCTT ACAGAGTGCT AGCCTCTTCCTTCAG GACTCACCTGG G GCCTGCTGCTTTT atat

TTTGAAAGAGTTTAANNN 

>630

NACCAAGTATAAATATAAATTACAACCCTCCCTCCTT ACT ATCAACGCGCCAACCCCT 
1 1 1 1 11 III ICC 1 1 1 AATAACGACAATTTTCATrTTTTAGGGTTCAGTTCCAGCT GATTTTATTT 
CCTT CT CAAAAAAAGTTA TTTACAGAAGGTA TATATCAACAATCTGACAGGCAGTGAACTTGA 
CATGATTAGCTGGCATGAI I 1 1 I ICI 1 1 1 1 1 M CCCCCAAACATTGTTTTTGTGGCCTTGAATT 
TTAAGACAAATATTCTACACGGCATATTGCACAGGATGGATGGCAAAAAAAAGTTTAAAAACA 
AAAAGCCTrAACGGA ACTGCCT TAAAAAGGCAGACGTCCTAGTGCCTGTCATGTTATATTAAA 
CATACAT AC ACACAA TCTTTTTGCTT ATTATAATACAGACTTAAATGTACAAAGATGTTTTCCA 
CM 111 I CAATTTTTAAAGACAACAGCTATAAACCTGAACACATATGCTATCATCATGCCATAA 
GACT AAAACAATTATATTTAGCGACAAGTAGAAAGGATTAAATAGTCAAATAGAAGAATGAAA 
AACGCAGTACATAGTGTCGCGAACTCAAATCXSGCATTTAGATAGATCCAGTGGTTTAAACGG 
C ACGI [TTTGCI lATAAAAAAAG TGCA AAAAAGATGTGGTTTACAAGTTAAAGCTACAGAATC 
CCTTTTT GCTGTAATTGCACCAGTTTTAAAGCCTCTGGACAGAGGAGTATTTCGTTTAAAACT 
TTGI 1 1 1 ICI I AAAAGCTT ACAGTCTTTGGCTAATTCTCCTCCCCTTTTT ACAAGACGGGGGC 
CGGAGGGTGGACACTGGTGGCAGGTTAAGGGATACTGTCACTTTAAGAAGCCTGCAGATTG 
AAGTGTAAACATGGAGAAA7TAGGGGCTGATTT1TTAAACTGTGTGAGATATTAACCAGCCG 
^CCTG TTATAAAATCAGGAAATCCAAACAGCGATTTACACOGATTAACACCCCCTTT AT AT AT 
TTTTT ACAAAAAT ACACTGAGAAAATAAT CAAACG TTTTCATCTCTCTT GTC I 1 1 1 II IGTTTTT 
TAAAAGTGTCA^AAGTCTACATTTAAATATAAAAAATTAAAAGTTAAAACT CT AGCCCTTCAGT 
GAAGGAGACGT AAAATGGCGTGG GT AACAACAACT ACCAAAAAAAAAAAAAAAAAAAAAGAA 
^^^TCGACACGGAAGGAATAAAGAAATAAAGGAAGTAAAAAGAAAGGAAAGAAAAAAAGGG 
ACAAAAGAAAAAATATGTTTGGCCAGTATAAATACGTCCACATATAAAATGGCATCTGATTAC 
ATTTAGAAGGAAAAAGAAATACGAGGATGGAGCATCGX3TGAAGGAAAAAAACACGTCTTCTC 

atttacacctataaggaataaacacacagactgagaaaaaatttggtcctgaa i re I l l 

TAAAATCCAGCACAGATT7GAGTTGCGTTTGAATCCTTT AAAGAGTTAAGAATGAAAN 
>631 



ATTAGCAGGTATGCAAAGAAGCCTTTTCACCX^TGATGTCCTT agagataatatggatcagtcc 
agagttctcctct gggtgaaagcagaaccctttat agtgggtgocttgcaggtcccccctc 
catccaagtttagtcttcactatctcaggaagatatccacctatgtgcaaatccgggcca 
gaaggagcttacccgogcctctactggt CT acatggaggcacatcggttgtgggaagctgc 

AGTTGGCCAAGGACCTGGCGTGGCTTT ACnTCGAAATATTTGATAGTCTTTCAATGAAGACA 
CCTGAGGAGCGOCT GGAATGGTCTGAGGTT CTGT CCAACTGCATGTCTGAGGAGGAAGTTG 
AAAAG42AGAGAAATCAGCTTTCAGTGGACACCCTACAGTTTCTGC I C I I CI I AT ACATTCAAC 
AGTTGAACAAGGTCTCCCTAAGGACATCTTTGATTG^ 

CAAATCTCAGTCTCCTGACCTGACTGAAAAATCT AATTGTCAT AATAAGAACT GGAATGATTA 
CAGTCACCAAGCI ITTGTCI ATGATG A T C T G I CI GATC I C C I GGAGCTGCTTTTAGATCCAAA 
ACAACTCACTGCATCATTTCATTCAACCCATAGTAGTCTAGTGTCT 

GCT CAGCTTCCTT ATTGAAGG T ACAATAAGTAGAGCCAGGAAGATCT ATCCACTTCATGAACT 

T^CACTGTGGCAACGACTGCATGCAGATAGTGGCTTCTCAAAGATCTCT 

CTACAAACTGGAAACCTGGTTGAGGTCCTGTTTGACT 

GCCTCAAGTCrGGAAAGAAATTGGCTTGGGCTCATCAAGTTGAAGGGAC 

^^*GATTGCTTGTAATACTCATGTGGCCCCTAGGATGCACCt»ACTGGT AGT GATGAGCCAGG 

TrTACAAGCAGA CACTGG CTAAGAGCTCAGACACTCTGGC 

CGTTGCAACGAATC 1 1 1 1 ATATATCTGCTCTCTCCCTTACG^TCTGTGACAATTGAGAAGTGC 
AGGAATAGCATC 1 1 IUICI I GGGCCCTGT AGGGACT ACACTTCACCTCCACAG TTGTGACAA 
JGTT AAAGTCATTGC I G J 1 1 GGCATCGTTTGTCCATCTCTTCT ACAACAGGTTGCATCTTTCA 
CGTTCTT ACGCCT ACACGOCCACTTATTCTCTCT GGGAACCAGACAGTAACTTTTGCCCCTTT 
TCAT ACACATTACCCAATGCT AGAGGACCATATGGX3CAGGACTGGOCTT GCT ACAGTGCCT A 
ACT ATTGGGATAATCCAATGGTTGTGTOCAGAGAGAAC^ 

TTA 9E A jj° 1 1 U ' GAATTCT ATGTATTTATTA 1 1 C CCI 1 1 GAAATGGAAGGGGACACAACAGAG 

atacccgggggtcttocatctgtatatcagaaaggactt^ggtcaaaga 

GATCTGGC^GAAAACTGT GAAG GAGGCTCATTTGA * 

GT ACTGGT agagaacaagttttatgaatggttgattaatacaggacatcxmxaaca^ 
cagccttgtaccocctgcagcaggctccaaacaagcagctggataaggatcttacatgcag
acactgggcx^ggaaatacaaggataaatggaggatggtgctcatcttgaaacttgaaact 
aaattg ggaaggtgctattgctgcttaagagacattgggacttttctgt ci 1 1 1 1 1 1 H I I U I 

1 1 1 f I JGAGGCAGAGTCTTGTTCTGTCGCXXiAGGCTGGGGTGCAGTGGCGCGATCTCGGCT 
CACTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAG^ 
GGACT ACAGGOGCCCGCCACCACGCCCGGCCAATTT "1 1 1 T I GTATTTTTAGTAGAGACGGG 
GTTTCACCGTGTT AG TCAG G ATGG TCTCGATCT CCTGACCT CGTGATCCGCCCATCTCGGCC 
TCCCAAAG TGCTGGGATT ACAGG CGTGAG CCACCGTGCCCAGCCTTCTGTTCTATTT AAGAA 
CTTTGATTTCCATTCCATTTTCATTTATTACTGTTTAGCCCGAAT^ AAACTTCTGACTT 
TCA T AT AAACAT ATTGCTT AT AGAAGATCAGAAG ACTCATCTTT GTGATGCCCAAATGAAAAA 
GTTGGTGAAGGTAGTACCATTGTATGTAT6TTAATATTAAAGCAACTGAGTGTAGATTTGCAA 
CT AT AGTGCGTCTCTCGATTT GTG TG ATCAAAGAG TTTT ATTCTTTCT AGAGTT AAAT AAGAG T 
TTTATAGTTCTTTGTAACTATTTAAAAGGAAATTGCTACTCCTTTT AAAAAATAATTGGAAAGG 
AAGTATGAI I IGI IGGTATATAGACATAGAAATAGTTTTATACTATATTTTAGTCCTTAAAGTAT 
GGAACTT AATGAAATTGTTTAAAAAAATCTTTATTTTTCACTTCXirTGCTTCAAATT^ 
>632 

GCCGC CCGGGCAGGTACT TGTT AAGCCA CATTTGAGG 
TTGAGTTTGCTCTTTGG FT I ITCH CATTCCTTTTGAGGATTCGGAAAACAGAAAGATTCTTTG 
ATTTGGGTAACGAGCGCCTGTTGGTAGGAACCTGCTTGGTCGCCTCTGAGGGGGCTTGTAG 
GTGGCTCTGGCTGAAACAGGCGCCTGCGAGAGTCTGT AGGj^GGGAAACCGCCATGGACGA 

tcagggtt gccctccgtgtaagaccaccaaat atcxxiaacccct ccttgaagct gatggtg 
aatgtgtgcggacacactctctgtgaaagttgtgtagatttactgtttgtgagaggagctgg 
aaactgccctgagtgtggtactccactcagaaagagcaactt cagggt acaactcttt gaag 

ATCCCACTGTT GACAAGGAGGTT gagatcaggaaaaaag 7 GCTAAAGATAT ACAATAAAAGG 

GAAGAAGA 1 1 1 1 CC i agtctaagagaat acaatgatttcttggaagaagtggaagaaattgtt 

ttgaacttgaccaacaatgtggatttggacaacaccaaaaagaaaatggaga^ 

gaaaacaaagatgttattcagaaaaataaattaaagctgactcgag 

agaagcttt agaagtggaacg^caggaaaatgaacaaagaagattattt atacaaaaagaag 

aacaactgcagcagattct aaaaaggaagaataagcaggc 1 1 1 I 1 1 agatgagctggagagt 

TCTGATCTCCC T G 1 1 GCTCT GCTTTTGGCT CAGCATAAAGATAGATCT ACCCAATTAGAAATG 
CAACTTGAfiAAACCCAAACCTGTAAAACC^ 

ACATATTTCACTGGCACCT attcacaagcttgaagaagctctgt atgaat accagccactgca 
gatagagacatatggaccacatgttcctgagcttgagatgctaggaagact^ 

^CCATGTCAGAGCTGCCTCACCACAGGACCTTGCTXSGAGGCT AT A C MC I ICIC I I GCTTQT 

CACAGAGCACTACAGGATGCATTCAGTGGGC IU I CI GGCAGCCCAGTTAACCATTTATAAG 

ATTTGGACCTTGGAGCT GAACCAGGGAGCTAGCAAAAGTAAAGCAGACTTAT AAAATTATAG 

CTATGTGCAGCTGCACAACACAGTCCTTCCACTAGCAGCTGTGTTAAAGTATT^ 

AAAATTTCAGAACTAAGTTGAGTAATATAGGGGATATATATTTGTGAAAAATAATTTTTA^ 

T ATTTTTCAGAGGATTTGACACGAT AAGCCTCATCTGATGGAAGAGAGGAAT 

CT ATATGTG 1 1 1 GAGGTT GTTGACAGACTTA T AAAAT CTTTTTAAAAAATAAAGCTATAATTTATA 

TTAAGTTCTGTGG mil CTCTT ATT ACAGTGTTTTACTTGAACACATTTT AAAT AC(>VTTTTGA 

TTACAACATTTACTGTTAATANNNGNNGNNNN 

>633 

>634 

tcactaaaggcaataagcttgcggccgcag n i i i i i m i i r i i i t i igca g n T T IG 

CTCTCAGCTGT AfiCCCTGACATGAAACGCTTGCTACGTTTCTGCAGCA 

GCAGT acgcacaggtgtggtgtt ag ccag ggagaccgaagcc^cacagaagcggagtcag 

TAAATGTCCATCTCCACGGTGTCTGCCTT^^ 

ATTAAT AGGATCCCAGCCT AACAGC>ACGCTTATG7 AGCACCATTCTTTAGAAACT ACCAAGTA 
TCACACAGGT ACATGTG TATGG GGACC I ICC I CI rGACTGCTTGTCAAGAAAAGGATCCAGA 
GGGAACTTCTGCCCCT ACTTCCATGAAGTGAGATGCAACAATCATTTCCAATCT AI I II I H A 
TCTCATGCT AAAATGAAAGTGGAGGGGAGAGG^GAAGAAGCCTCtXATTCTACAATTATAAAT 
<5CTGTCX^GGTAGCCTTTOTCTCTT 

GCCAGCA AATGA ATGCCCCAGG GACTA CAGOCAGAAAATTGTGCCTTjTA 
CTCTGGAGCTTGGT AATATAGAAATTTTACCTCTACAGATGAGGCAAACT AAAATATTGTTGT 
AACTCAAAAAGGCTT CTGAAGTGCTGAGTTAGAATAAATATATATAGATACATACACACACTA 
CATTGCTTTGGATGCAGGTTCTTACTAATrTATATACACATGTAAAGAGAAGATACTGAAGTG

TCTAACTGTAATTT AGCAGCAGAAAAAG AAAACTGCTTAACATATTTGCTTCTT AATGAAGTTG 

TGTTTAAAGAAATGCATTAATACTCCTATGAATTTGTATCATTGATAGTGTATCTGGATAATCC 

CAGTAACTACAGACAGAGACCTTGTGGTATGGAGGTTGCTAAATGAGGGGAACACTAATTCC 

CCCAGAGATAACATCTGAGCTGCTTTAGACTCTGGAAGTGGTTTTCAGTACCTGCCCCGCTG 

TAAACTCTGAAAACACGAACATGATTTCCAACCAAATCTTTGTGCTCAGAGCCATATTGCTAA 

AAAGAAAAAGGCATTGCAGAGGCAAAGTGGACATGGGGTT AATTAAACAAGGCCT CT GAATT 

CAGGTGTGGCATTTAGATCCTG CAATG AATG GACCT ATTTTACAG AAGTCAGCGG CAAATCT 

TTTGTATTTTTTGGTACA6ATGGGGTTTTGCCATGTGGACGAGGCTGCTCTCGAACTCCTGG 

CCTCAAGTGATGCACCTGCCTTGGCCTCCCGAAGTGCTGGGATTATAGGAGTGAGCCACCA 

CGCCCGGCTACGAGTTGGGTTTTAACAGAAGAGGACCTTGAATGCTGAAGCTTCACAGGGC 

GGCCAAACTAAOCCTCGN 

>635

NNNNNCCTGATGTGGTGTTTCTGCCACATATGTATGTTGACATAGTTCATAGTACTG 
CGCTOGTACTGCCAGrrAGACTCCACAGACCTAATrTGGATGGGGAGAA^ 
CTGACCCTTCCCGCCCCGCCAATGTTAGACGAGAGCTGTTGATC^CCACACAGTCATTGTTCT 
G AAAATTCATGAACT ATTCT CTCATATCCCTG AAATATCAAAGT AAG TG GCCCGT ATTCCTAC 
AGATTATTTAGTTTAT AAAGCTTT GCTTTT AATCATACCATAGCTTACCT GGCT AAAACTTTGC 
GTATAAAGGGGAAA CTTG CTTT AATCTGTGGGATTGCGGACGCGATG GGACTTGCAGAAGG 
TAAACCGGCGTGGG 1 1 1 1 G 1 GTGCGTTAGGTTAAGGTGTTAGTGG ACTTTT AATGTATCTTGT 
GAAGT ATGGTTACT AGGGTAG GTTCAAGAGCT AC ATG CCTCTGG CTTTT GAACCCAT CT ATT 
AAATCTTTTTTAATGCTGTACTAAGATAAATGTG AAGTCT GAGT AAGGTTGCTCT CAGAAAAA 
AAAAAAAAAAGCAT ATAT ATATAT AAAAT AAAATTT AG GCCATG G G AACTG G CATTTT ACTTG G 
T AT AGGAATTT AATGAGAACCAAATGACAAAAATACCAGATCT CCTTTT GAGATCGTG G ACAA 
TCAGGCCTTTGGACAATCACOGCTTCCAGTGCCCGCTGTGCGtTGAGGTTAGAGACCTACC 
GTTTACAC GCTAG^GCCG AGA GG^CG TTCAAGTTTCTrAAAGATGCTGTTGG^ C I I H 1 1 C I I 
ATGTTATTGTTTTAG J 1 1 1 GTGATTTTGATGAATAATTGAAGTTCAGAGTCACCCGTGGCTCCT 

aaggaaaaaacagcatcat gaaag gcaaaaagtatttgaaacgtaattgtttcaattttccac 

GCCCGTTCTCAGATGCCCCCTTTAGCnrTTATTTCTGGG AACATGTCACT AGACCTCAGCT AG 
CCTGACACCGCAATAACCCXjTTAGGCTTTGAGGCTTTTCAGTGAATTCTT g ACATTTAAATCT 

tcttatagaattgtttatgaag cctraactaat tcttgttrttgctt cct gagaaactttttatt 
gcatctgccatccttcattggatttttgttttggtttgttgttg(dcattct 

gc^gtggggctcaaagccgacctccatgcatcattctctagagt^ 

T^GTAACTACCCGGCAAAATATATATTTTCATAATGTTGAATTTGCCTTTT AGGAAAACAATG 
TTTCATGAAACACTTGTAAGAT ACTT ATTTTGAAATTAATT AT AAG GCATTCATATTTTTAGGAC 
TG AATTCAAAGGGAAATGC 1 1 LI I I GAG I 1 1 1 1 1 1 CATTGTTCGGTT6ACTTTCAGCTTTCTGG 
AGTGACAA AGGAG CAGTGATGGGTGCTATTCCTGGTTCT GTGGTGGTTTACTTTGTCTACTA 
AGTGATTTACTTTCAAACCTCATTACAGATAATGTTACTAAAAGACTT AACATGCACACTGCTA 
AAAGGTGTTTTACT G 1 1 1 1 CCATTAACGCTGTTGTATGAACAATATTTGGTGACATGTTGTCTT 
TTTGTCCTGTAGATTGTCATATCTGTTATGTAGTT ATCCTTTTGGCAAGTCCTTTGA I I I I 1 1 1 1 
TCCTCT AGAAAATCCTGAGACTGTCAATCTTCCATATTTTGAATATTAT AATG CTGTGG CCATT 
TCTATAGAAAG ATTACTT GCAAGTGTGTGTCTTGAAAATAACCTCTTGT ATTTGTGTGTTT ATG 
TGAGTTCTGTCTAAGAGATCAGATAAGCTCTGGAAGCTTt^TCATCTGTACCTACATGTAACT 
GTATCTGGGTTACATGGGCAGGCGGTACCAAGTCCTGCTCTTCGATGGGGTGGCTGAGAGA 
TAAAGGTTCCTGAACTCTCCTTGCTGTGTATGCAGCACATCTGT ACTTCAGAGAGGTGGAAA 
ATCATCTACAACACTGCGTTTCTTGTGGTTGTGCTGTTACTGGTATAAAGTCAAGTGGCTTCA 

AACCTGCATAACCA GTACTT GTGTAACACTTGTTT ACGCTTCAATTGTACATGGTGCTTCTTA 
ACGCAATTTATAG CAM 1 1 T ATTTAAGGTACCTGCAAACTTGAATTCGTCTGTGTGAAGCATTG 
TAAAACGATACAI 1 1 CI CI 1 1 1 GAGTACCATTACAGTTGTACAGAGGTTTTCACT GCTTCTATT 
TTTCTACTGATACTGACTAAGCATGNNNNN 
>636 

tATAGGGAGTCGACCCACGCGTCOGGGTTTCGGTTGGAGGACTCGTTG GG G AGGTG 
GCCTGCGCTTGTAGAGACTGCATCCCCGAGAW^TGGCGGAGGGAGATAATCGCAG<iACC 
AACCTGCTGGCTGCAGAGACTGCAAGTCTGGAAGAACAGCTGCAAGGATGGGGAGAAGTG 
ATGCTGATG GCTG ATAAAGTCCTCGGATGGGAAAGAGCCTGGTTTCCACCTGCCATCATGG

GTGT GGTT TCTIT GGTGT TTCTGATTATCT ACTATCTAGATCCATCTGTTCT GTCCGGCGTTT 

CCTGTTTTGTTATGTTTTTGTGCTT GGCTGACTACCTTGTTCCCATTCT AGCGCCT AGAATTTT 

TGGCTC^AATAAATGGACCACTG AACAACAGCAAAGATTCCATGAAATTTGCAGCAATCT AG 

TAAAAACTCGACGCAGAGCTGTGGGTTGGTGGAAACGCCTCTTCACACTAAAGGAAGAAAAA 

GCTTAAGATGTACrrTCATGACGATGATCGTTTCCCTTGCTGCGGTTGCTTGGGTGGGACA 

AGTCCAGAACCTGCTTCTCACCTACXn'GATAGTGACTTCCTTACTATTGCTTCCTGGACTA 

CCAACATGGAATCATTTTG^AGTACATTGGAATGGCCAAGAGGGAGATAAACAAACTTCTCA 

AACAAAAAGAAAAGAAAAACGAATGATTCVVTCTGCTTTAATC^GTGTGATTAATGCAGCACCC 

ATTGCCCCGGGAACCGTTTCTGCTGTACTATCTGGATACTAAAATGTTACGGAAGTAGCTCT 

TTGTTCTCCCT C ACT CT GCCCTT AGTTAATAG AAATTCAGACT CGCCAAGT AAGGCTTCGTGC 

ATAGTGT CTTC AT GTCGC GTATAGTTGAGCGCGTTCTTAGCAGTTGGCTTCATGGACAACTC 

ATTAGTGTTTTGAC I I MCI I ACCCAGCGTTAATTGAATTCTTGCTTTT AGACAACTTCCTTTTT 

GTAGTGGTGAACCTTGCCCTTTAGTACAGTTCAAGTGAATCTGGATAATTGTTCATCTTTGCT 

TTAGCTTAGATaCCATGTAGTGGTCTGTG GCT ACAG GAAG CT GGTTCTGTCTGCTT CCACAG 

TCT G CTT AAAAAAC T GTCTGACTT CGTG AAT AT AGAGACCAAGTTT ACCACTTCTGATGAAGA 

GACCAATTAAGATTCATTCCTCATTCTGI MCI 1 1CCAGTGGGAGAAGAGTCCCCATGAAATA 

AGATGAAACTGATTCCATGCACTAGTACATGTAGGCTTCTCCCrrTGTGCAAAGCTTAGCAATT 

TGTAGGAAACTTTGATCHTTTTGTCOVAGAAAAGGAATGTCT 

CCTTGCACTTAGACTCGAAGTTAG TAAAT CCTTAAAGGCTTTTTAATAGCAGACTTC^ 
ATTGCATTTAGGATTTCT AGC^TGCTrrrAATTTCAGATTTTCAGCTGACATTAGCT ATAGTAT 
ACAGTAGGTTAAGACTCATGTCT ATGACTTTCACTCT AAGACTGGCAAAAGGACAG CAGTCT 
TCT ATGTTT AGTCAAT ATTCATTTCAGTAGAAGATAATCTTATCT AATTTTTGAGACCAGAATA 
AGCCTTTT AAGGTAAACCT CAAAATTATCATTTTATGGTAATACTGACCATTTTAGTCCCCTAG 
GTTTGACATGGGAGATAGTGACTACACTGGTGTCTGACrrTTTTCCTAGAGAT^ 
AAAATACAAGGGCTGTTGGTGAGAGCAGACTTGAGGTGATGATAGTTGGCCTCTGGTCTACA 
AAGATTTCAT AACT CCTTGGAAA GCTTCTT AT AATCATTCTT AACTT CTTGGT AGCT AGAAATT 
TAGAGTAGTTGAAATCTTTAGGAATGAACTTCTGAGGGCCAAAAAATGTGACTGACGGGAAC 
AATTCTTAAACTGATTAACTAGCTGTAATATAGTTTTGTGAATTTATTGCACTGATGTTGTACC 
TrGTGGTATATCTGTCCCTATTAAATAAGTGTTGTTTTCTCCTCTT^ 

TGGTG CCCAT TGTAGCATATGTTTGAI HI M II 1 ATTATTTCATAAGAAAACT ACGTTAATTTT 
ACCTTACTTTC>TTGTAAATAAGCCTGTCTTGCTATCTGGATTTTTTGTG 
ACTG^TTAACTACTTTTGCAGTTTTAATCCTGTATTATTTCTTCTA C 1 1 I G I 1 1 1 GTGTAAAAGG 
GGAAAAAATAAAAAAAGCTGGAATGTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaaa 
aaaaaaa aaflmnaM«mM^Miia»aiMiiflAM«aa«uia^^ 

gagagggegeggacaggaaatp 
tgata 



nnnggaccgccxtttgccccctttgaacnccct 
tgttggtacccgggaattcggcgattatggccgggggcgggt aaacagacatggccggcg 
aaggagatcagcaggacgcrrgcgcacaacatgggcaaccacctgccgctcctg 
agagtgaggaagaagatgaaatggaagttgaagaccaggatagtaaagaagccaaaaaacc 
aaacatcataaattttgacaccagtctgccgacatcax^t acat acct aggtgct gat at gga 
agaatttcatggcaggactttgcacgatgacgacagctgtcaggtgattccagttctt ccac 
aagtgatgatgatcctgattcccggacagacattacctcttcagctttttc^ 

TCAGTATGGTGCGGAATTTAATTCAGAAAGATAGAACCTTrGCTGTTCTTGCATAC^GCA^ 
TACAGGAAAGGGAAGCACAGTTTGGAACAACAGCAGAGATATATGCCT ATCGAGAAGAACA 
GGATTTTGGAATTGAGATAGTGAAAGTGAAAGCAATTGGAAGACAAAGGTTCAAAGTC^ G 
AGCT AAGAACACAGTCAGATGGAATCCAGXJAAGCTAAfGTGCAAATTCTT CCCGAATGTGTG 
TTG CCTTCAACCATGTCTGCAG TTCAATTAGAATCCCTG^T AAGTGCCAGATATTTCCTT CA 
AAACX^TGTCTCAAGAGAAGACCAATGTTC^TATAAATGGTGGCAGAAATAC<^GAAGAGAAA 
GTTTCATTGTG CAAAT CT AACTTCATGGCCT(^CTGGCTGTATTCCTTATATGATGCTGAGAC 
CTTAATGGACAGAATCAAGAAACAGGTAGGTGAATGGGATGAAAATCTAAAAGATGMTTCTCT 



Table 4 



PCT/US00/33312



TCCTTCAAATCCAATAGA I I II IUI I ACAGaGTAGCTGCTTGTCTTCCTATTGATGATGTATTG 

agaattcagctccttaaaattggcagtgctatccagcgacttcgctgtgaatt^ 

aataaatgtachtcc<rtttgctgtaaacaatgtcaagaaacagaaataacaaccaaaaatgaa 

atattcagtttatccttatgtgggc^^tggcagcttatgtgaatcctcatggatatgtgcat 

gagacacttactgtgtataaggcttgcaacttg^atctgataggccggccttctacagaac^ 

cagctggtttcctgggtatgcxn"ggactgttgcgcagtgtaagatctgtgcaagccatattg 

G ATGG AAGTTTACGGCCACCAAAAAAGACATGTCACCTCAAAAATTTTGG GGCTTAACGCGA 

TCTGCTCTGTTGCCCACGATCCGAGACACTGAAGATGAAATAAGTCCAGACAAAGTAATACT 

TTGC TTGT AAACAGATGTGATAGAGATAAAGTTAGTTATCTAACAAATTGGTT AT ATTCT AAGA 

TCTGCTTTGGAAATTATTGCCTCTGATACATACCTAAGTAAACATAACATTAATACCTAAGTAA 

ACATAACATTACTTGGAGGGTTGCAGTrrCTAAGTGAAACTGTATTTGAAACTTTTAAGTATAC 

TTTAGGAAACAAGCATGAACGGCAGTCTAGAATACCAGAAACATCTACTTGGGTAGCTTGGT 

GCCATTATCCTGTGGAATCTGATATGTCTGGTAGCATGTCATTGATGGGACATGAAGACATC 

TTTGGAAATGATGAGATTATTTCCTGTGTTAAAAAAAAAAAATCTT AAATTCCTACAATGTGAA 

ACTGAAACTAATAATTTG ATCCTGATGTATGGGACAGGTATCTGTCCAGTGCT CT AAATAACA 

AAGCTAGGGTGACAAGTACATGTTCXrrTTGGAAAGAAGCAGGCATGTATATTATT ATTCT AAA 

GGGGTTTGTCCCTTCCATTTCCTTACTC^^GAATACTGTTGGACATTCGC 

GTTGGCCAATATG^TTCCTAGTGCCGTAAGGTGGGTTGCTTGAMTAGCTTGTGTCAATTTTC 

ATTGTGCATAGGAAGAAGTCCGGAAAGTTTGAAAAAAATAAGAGTAAATATAGTCAGCACCTA 

GCTAGAAACN 

>638 

CCACGCGTCCGGTTCCAAACATGTGGTGACTTCCGGAAAGCTCTGTTGACTTTGG 
AGATGGCAGAAG AGATGAAAGTCTGAAAGTGGATGAGCATCTGG CCAAACAAG ATGCCCAG 
ATTCTCT AT AAAGCTGGTGAGAACAG ATG G GG CACGGAT G AAGA CAAATTCACT GAGATCCT 
GTGTTTAAGGAGCTTTCCTCAATTAAAACTAAGTAC^AACTC^ 

ATGTTG! 1 I 1 1 ACAAATGTTTnGCATTTAGTATACAAAGATCTAGCAAAAACXiAAGTTTAAAA 
TTGCGAAGTCTGTTCCAGTTATTCACACTTCAG G AAACAAAATGG G AAAGCT GTATTGTAACT 
CAAGGAAGTACTGACACTACACTTTTGTTTTTCCATAGAGGCTG^ 

TGAGAGT AGAAGGGCTT GTTCCTGTGGAATG GLAGAAGCCCAACTGCTTCCACTCCACCT AA 
CAGTTCTCCCTAGTTAGCCAGTTCCAXSAAAAGCCAGACACTGCAGGTGAGTCTCTGTTG^ 
CATCAATACTGATTAACCTTTAGCTATTTCTGATATGAAGGTAAGTG GTAGAATGGATCAACT 
CTT ATCGTCAGACCAGCTATTACATGGAATT GTGG G CACCCG CCCTCTT G CTCATG G CATCT 
CCT AGAACT GTTCTGGAAGCAACAACACCTAGAAAACT AAACCTTCATTTACTGTGAACATCT 
TCTGACTGTGGCTTCCAGATG CT AGTTT AC^GAAC AACCACACAG C AAG A CC AAG CTT ATGC 
T GAGT T GAC GGAAC AATGAGTAAACATAAGGATATTACTGTGACTTTGAAATTCrrGAAATTGT 
TCTTTCTT AACI 1 1 1 GCATTAAAATCACATTTATTTT AT AAAAT AATGACAACAGTATGTGATAG 
TTGCTTTATTCATGTtTTTTGTTTGAAAAACAAAAACCAG 

TTU 1 1 M 1 1 AAAAATGTATTGNGGAAAAATATACTTGGCTT ACAAAATATGCCN 

>640 
>641 

GGGAGATCTGTGTCAGAACGTGCGTGTGAGCXSGATACAAAACCCGAGAGAGGCGT 
GAGCAGCGCTG TGI 11 GCGAGCGGGAGCGAGGGGCGCCGGCT GGGGTGTGTGCTCCTGA 
GCTCrTCAGAAACCAGGCTGCTTTCAGGAACATTGCT G7GGATTCCCAGCTTTCAGACAACA 
CATGACTAAGACAGAATGAGACCACTCTAGTTGCCT C^TGGGAAACTCGGGAAAAGACTGCA 
AAAACAACATTGTTTCTCCCTTTGGAATTCTGGAGTTATAAGGCAGAGGT^ 
GAACT GGCCT ATTCCACTAGAAGCAAGATGGCTGAACTCAATACTCATGTGAATGTCAAGGA 
AAAGATCTATGCAGTTAGATC^GTTGTTCXX>ACAAAAGCXATAATG 
CCAACAGTTTGATTTTAATGTGGATAAAGCCGTGCAAGCCT^ 

AAGTTCT AAAAGAATGGAATATGACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGT 
>642

NNCGTCCCCCtt^GCGCTGG^JC^ 
GGTGTTAGCAGCTTGGTCGCG^CAGGTGCGCTAGGTAGAGCGGCGGGACXrrGTGACAGGG 
CTGGTAGCAGCGCAGAGGAAAGGCGGCTfTT*^^ 

TGGAATCATCTCCATTTAATAGA^GGCAATGGACCTCACTATCATTGAGGGTAACAGCCAAA 
gaactttctcttgtcaacaagaacaagtcatcx^ctattgtgxaaaatattctccaaaacaccg

aaaatctctcccag cacttt ag aaag gg gaccctg actgtgttaaagaag aagtgggagaac 

ccagggctgggagcagagtctcacacagactctctacggaacagcagcactgagattaggc 

acagagcagaccatcctcctgctgaagtgacaagccacggtgctrctggagccaaagctga 

cc^gaagaacaaatcgaccccagatctagactcaggtcacctcctgaagccctcgttcag 

ggtcgatatccccacatcaaggacggtgaggatcttaaagaccactcaacagaaagtaaaaa 

AATGGAAAATT GTCT AGG AGAATCCAGGCATG AAGTAG AAAAATCAGAAATCAGTG AAAACA 

cagatgctttcgggcaaatagannnnmnnn 

>643 

ncgcatatctgggac^gtcaggccgggtgtgaatcgatggctgctgggcgaccac 
ttcctgacagaagaaaagatgtggagtgggctgct acctcctggcctaaatgaaagtgacg 

CTGaGTCAAACT CG GAAG ATG AAG CT ACGTTGGAGAACTCTGGACTT AACTT ACAGGAAGAT 
AAAGAGGATGAGAGCATCAGAAAAACAGAAATCATAGATTTCTC^CAGATGAACCAAAAAC 
TGAAACAGAGTCAAATGTAAATGCGTATGAAGAGTGTCCTTCTGGAATTCCCATAGATATGTG 

gaataaatttc^wsaattgcataaaaaacattctgaacagaaaagcacaacctcaagattc^^ 

agggaaaagaagaaaacgctccagaaaagataaattgaagaatgaaaaagaattacatagtg 

aacxgtcctcaaatgaaacgcagtggaaagagctractcagtattrtggagtcaatgataga 

tttgacccgcctgttaaaaggaaaaaagttgagaagtcaggccttgaaaagaggatagacca 

ggctgtggaggagtggaatattgagaaggctgaggaactcagcaaccagctagctactcga 

gagcttggtgtaaaaattgccaaagcagttgcctgccacaactttgtaaaagccaaaaagga 

ggttgaaaattcacaggctgcccgaaaaaagaagaaacttgcatgggggtttgaagcaaag 

aagagatggxuaac^aaaagcaacatgggatacatgtaacttgccagagtgcttcaagac^ 

ttgtagactcaaatgctcaatttactgagaatgttttcgtgcctatgttaatatgtca 

TGATAGCACTAAAACAAAAATAAGCATAAAATTTGGGATTTrGATTATCAGCT 

TTCTTT aaggacttt gtcct attg aagtgaataaaatgaagcattctgttatttgtataagaaa 

TGTAAGAAAAAAAATATAGAAAGAAGAGQATAGGATTTTGTTrATTTGTATGGATGGACCCTT 
GGTATAAAATCACI 1 1 1 GGCGCTTT(X>TTC^TTTGGTTTAGCACAGCCAGTGAGCTTCTCGT 
GCCTCGGCCGCCACCGCGGGAGCNNN 
>644 

GCGGCCGCCCGGGCAGGTACGCGGGGACGTGACATGGGTTGGAAGATGGCGTCT 
CCCACA GACGG GACAGATCTGGAAGCATCTTTGCTAAGTTTT^ 
ACCAGATCTTTGGCCAGAACAATTACGAGGTGTTGCTGAATTTGCAGCTTCCTTC^ 
CT ATT ACT AGTTCTCCACCCAAATG GATG GCTGAGAT AGAACGTGATGACATCGACATGTTG 
AAAG AACTGGG G AGT CTCACCACG GCT AATTTGATG GAG AAGGTTCG AGG CCT ACAGAACC 
TAGCCTATCAGCTGGGGCTGGATGAGTCCAGAGAGATGACACGGGGGAAATTCCTCAATAT 



AGCTCCCTTCCOSGTGCACCTCTAACAATGCACACCT^ 
AGGGTGTACCTCCAGGACTGCCCTACTCXXXTGCTCCT 

ATTCAG CAGCAACGAGCAACGCAATCTGCTCT ACCCAAGGATCATTGCAGTCACTCAATCAA 
CTTCAGACTTGAACCTTCTTAGCCTCGGATATCGGCCACT ATCAAACCTT 
ACAAGGATCGCAACTCCTGCATCTCTCCATCTCTGCG^ACGTGATTCAGG 
CCTTTACCGCACGCATTCGACTCAGCCATGGCAATCACTCGCGACC^ 

CCACAGCATTGGCCT GGAACTCCAT AG CAAACG G CCACCATGTT CAGGAAACCCGAAACCA 
GATTGCATTCGGAACCCTATCCCTGOSACCGTACATGGTG 

GTTCCCGCGATCCCCCT GTACGCAACCCGAACATGT CGCGCAAGAGCATATCGGTGTCAAA 
GATTCAT ACCCACAG AGCGCCACAGAGACACT ATGAGATGCAGCCACACCAATCAGACACT 
CGACAAGCACCAGCACCGGGGAAACCACTACACGCCACTCGGGTACAAG^ 



NNAAGTGGCTTTATAATAATTGCCTTACTCXrrCTGATTGGAGTAAGCCAA 
AAGGATAAAAAATAGGTCAATAGGTCTGG^TTG^TGCCATGCTGGTCCn3GGAAGTCCATCC 
TAATTGCTCAGCCGCTTCAGTGCtrrGTGGGTAGATTCTCT ACGT ACGGTAAAACAAGACTCC 
TCAACCTTCCCGGTAGGCAAGTGCTTAGAAGAAAAAGTTTCrrGGTCTAAAGGATA 
AGGGCCAGCAGTCTTTGGGTGTAACAGGAAGCCTGGCATTTCTCATGA Ci I IGIMCI 1 GTA . 
ACAAGACTCTACACTCATTGGCACTGTCAATATAYAGGAGGTG<3ACAG^ 
TTGGAGCATCtXrrTTCATTCTACCAACCTCCCAGAAATAGACAAAAA 
TAA6CACACGAGTGAGGCTAACGCTGGAGCAGCAGGTGGCAGTCCTGGCATTCCTTGGGT

GCCAATAT AT AGGCCTCCCTGAAGT ACTT CC1 1 ATTGAAGAGTTTTAGCCCTCTACGGAAAAC 

CAGCCACATGGCAGTGATTGGGAAAAAAAAAAAACTTTCTTCGGCTAACTGGTGGATGTTAG 

CTTTCAGTGCTGTGTCCTTGAAGAATGTTGCACATGCTTTCAAAGGTTGCCAGTTT^ 

AGTACAATTCTGTGGTTCTTACTTGGAACTGCTTTATTTGGCTTTGGGAGGCC^ 

CTCTGOGTTGTGGTCTTTCTAG AGTAAA TCTTCCAGGGCTGACTACATTCTGGAG 

CGAAGAGCTTCTOC7CTGT ATATTTTTTCCCACTGTTTGCTCCTCCTTCT CATGT ATTCTTAGA 

GGTCCAATATTCTCTTGAACTTTATGTCTTCTTAGTTCCCTITCAAATGCTTCAGTAAACGCCT 

GTGCTTGGAATTGGCGGACATCTGTTTCTCTTGGCTTGCGCTGCCTTTTTCT 

AACTGTTTGTGTTTGTTAACTGGAGATGGAGAGAGCGAATGGGATCTGTANGGTGGCTTTCA 

ACGGATGGAAAGTCCTGGCCTTGACTTCTTTGGCCCATACTCCACGATGCCATCACTGTGCT 

gagagagtgtctcctcagtt(x^tcctctacctnctcatttccagta i 1 1 icnci I I ATGGT 

CAG TCTTTTCTTT AATCCN 
>646 

nnnnnnggogcagggcataccgagactacacgcagtacggagagcct ACCCGCAG 

GCGGACTGTGATCTCAGGTAGCATACT AATCCT AAGCGTTGCACATATCGCGTTATGAGATG 

CCTACTGATACAATACGGATCTGTCGGCCGAAGCCGGGCTCTCT CGGGAAGATGGCGGCTG 

TGTCGGTGTATGCTCGACGAGTTGGAGGCrrTCTCTTTTGATAACTGCCGCAGGAATGOCGTC 

TTGGAAGCCGATTTTGCAAAGAGGGGATACAAGCTTCGAAAGGTCCGGAAAACTGGCACGA 

CCATCGCTGGGGTGGTCTATAAGGATGGCATAGTTCTTGGAGCAGATACAAGAGCAACTGA 

AGGGATGGTTGTTGCTGACAAGAACTGTTCAAAAATACACTTCATATCTCCTAATATT^ 

TGTGGTGCTGGGACAGCTGCAGACACAGACATGACAACCCAGCTCATTTCTTCCAAOCTGG 

AGCTOCAC TCOC TCTCGACTGGCCGTCTrCCCAGAGTTGTGACAGCCAATCGGATGCTGAA 

GCAG^TGCTTTTCAGGTATCAAGGTTACATTGGTGCAGCCCTAGTTTTAGGGGGAGTAGATG 

TTACTGGAGCTCACXn'CTACAGCATCT ATCCTCATGGATCAACT GATAAGTTGCCTTATGTCA 

CCATGGGTTCTGGCTCCTTGGCAGCAATGGCTGTATTTGAAGATAAGTTTAGGCCAGACATG 

gaggaggaggaagccaagaatctggtgagcgaagccatcgcagct ggcatcttcaacgac 

CTGGGCTCCGGAAGCAACATTGACCTCT GCGTCATCAGCAAG AACAAGCTG GATTTTCTCC 

gccgatagacagtgcccaacaagaaggggaccaggcttggcoggtacaggtgtgagaaag 
ggactactgcagtcctcactgagaaaatcactcctctggagattgaggtgct ggaagaaaca 
gtccaaacaatggagacttcctgaatggcatcagtgggtggctggccgcggttctggaagg 
tggtgagcattgaggcccagtaagacactcatgtggctagtgtttgccgaatgaaactcaac 
tcaataaaaaacaaaaaccaaattgggcagccaaccac^caaacaaaaaaaaaaac^ 

>647

>648

>649 

acgcggggagacatgtggagtcccagcagaggccaacctgtgtctcttcatctccc 
tgggaaaggtgcccccgaagtgaaagagatggcctggtggaaagcctgggagaatgaata 
aacagactaggttgaatocatacaatgqaatggtagcagacaataaaaagaaaatgaactatt 
gatgccccct actgcacagcagaagctctgaatc^tgttcctgaatgaaagaagtcagggg 
octc^tagacatt^ 

NNNNCCGAGGTGACTGCAGAGCGGCTCGCGAGAGTCTCCCTCTGTTGCCAGGCTG 
GAGTGCAGTGGCGCGATCTCG GCTCACTGCAACCTCCGCCT CCTGAGTTCAAGTGATTCTC 
CTGCCTCAGCCTCACAAGTAGCTGGGACTACAGGGTTTTGGACTGATAAAGGAATGGAAGT 
GTGOCQQOGTTGQGTI QAGCGG GC 1 1 n I GGAAGTTTGTGGCGGAGTGAGCAG 1 1 C 1 G 1 G A 

tatgagcaacaatggaccagaagattttatctct agcagcagaaaaaacagcagacaaactg 
caagaai 1 1 ci i caaaccctgagagaaggtgatttgactaatctccttcagaatcaagcagtg 

AAAGGAAAAGTT GCTGGAGCACTCCT GAGAGCCATTrTTCAAAGGTTCCCOCTGCrCTGAGG 
AAGCTGGAACACTTAGGAGACCTAAGATAT ACACTTGTTGT ATCCAG Tl GG I GGAATCGGGG 
GATTTGCAGAAAGAAATAGCGTCTGAGATCATAGGATTACTGATGCTGGAGGCTCACCATTT 
.TOCAOGACCATTATTJGQTTOMTTAQGCAATGAGTTT AtTAGTGCTGTCAGAGAAQQCAGCC 
TAGTGAATGGAAAATCrrrGGAGtTACtACCTATCATTCTCACtGCC^ 
AAAATCTGGCTTATGGAAAAGGTGTACTGAGTGGGGAAGAATGTAAGAAACAGTTGATTAAC 
acccrgtgttctggcaggtgggatcagcaatatgtaatccaactcacctccatgttcaagga
tgtccctctgactgcagaagaggtggaatttgtggtggaaaaagcattgagcatgttctcca 
agatgaatcttcaagaaataccacctttggtctatcagcttctggttctctcctccaagggaa 
gcagaaagagtgttttggaaggaatcatagccttcttcagtgcactagataagcagcacaat 
gaggaacagagtggtgacgagctattggatgttgtcactgtgccatcaggtgaacttcgtc 
atgtggaaggcaccattattctacacattgtgtttgc^tcaaattggactatgaactaggca 
g ag aactcgtg aaac actt aaaggt ag gacagcaagg agattccaat aat aactt aagtccc 
ttcagcattgcrrcttcttctgtctgtaacaagaatacaaagatttcaggaccaggtgcttgat 
cttttaaagacttcggttgtaaagagctttaaggatcn 

>651 

gtgattgctgccctaaaggagaatgctctttccttcgtcactggtacgcggacgca 
gcaggaggaaactgctcgggctgcaagcagtcttccaggctttgcggctgccaaaggaata 
attgagacgtctgagttgagcaggtggaatgtcagaagactgagaacattg 1 1 CC I rCTTCA 
TACTGCTGCTCTGT TGCC AGAGAATC^XAATTTACACTCAAAG C I I cm i gattaagtgctag 

gagataaatttgcattttctcaaggaaaaggctaaaagtggtagc^ggtggcatt^ 

gctgcagtggaagacaatcactt gctgattaaagctgttcaaaacgaagatgttgacctggt 

ccagcaattgctggaaggtggagccaatgttaatttocaggaagaggaagggggctggaca 

cctctgcataacgcagtac^aatgagcagggaggacat^gtggaacttctgcttcgtcatgg 

tgctgaccctgttctgaggaagaagaatggggccacgccttttatcctcgc 

gggagcgtgaagct gct gaaacttttcctttctaaaggagcju^tgtcaatgagtgtgattt 

ttatggcttcacagccttcatggaagccgctgtgtatggtaaggt^^ 

tttataagagaggagcaaatgtgaatttgaggcgaaagacaaaggaggatcaagagcggct 

gaggaaaggaggggccacagctctcatggacgctgctgaaaaaggacacgtagaggtctt 

gaagattctccttgatgagatgggg g cagatgtaaacgcctgtgacaat atgggcagaaat 

gccttgatccatgctctcctgagctctgacgatagtgatgt ggaggct attaogcatctgct 

gctggaccatggggctgatgtcaatgtgaggggagaaagagggaagactccxxrrgatcct 

GGCAGTGGAGAAGAAGCACTT GGGTTT ggtgcagaggcttctggagcaagagcacataga 

gattaatgacacagacagtgatggcaaaacagcactgctgcttgctgttgaactcaaactga 

agaaaatcgc^gagttgctgtgcaaac^tggagccagtacagattgtggggatcttgttat 

GACAGCG AG gcg gaatt atgaccattgccttg tgaag g tt cttct ct ctcatggag ccaaag 

gaaggatctccacagaatat acggccctatgattggcaaactcaagttct^ 
atacaaaat/tgctgatacttcagaaggaggcatct acctggggttct atgagaagcaagaag 
tagctgtgaagaggttctgtgagggcagcccacgtgcacagcgggaagtctcttgtctg 
aagcagccgagagaacagtcacttggtgacattctatgggagtgagagccac 
ttgtttgtgtgtgtcaccctctgtgagcagactct ggaagcgtgtttggatgtgcagagagg 

GGAAGATGTGGAAAATGAGGAAGATGAATTTGCC<X5AAATGTCCTGTCATCTATATTTAAGG 

ctgttcaagaact acacttgtcxttgtggat acacccaccaggatctgcaaccacaaa^ c 
ttaatagattct aagaaagctgctcxcctggcagattttgataagagcatcaagtgggctgg 
agatccacaggaagtcaagagagatctagaggaccttggacg^sctggtcc^ 
aagaagggaagc^tctcatttgaggatctgaaagctcaaagtaatgaagaggtggttcaagt 
ttct cx^agatgaggaaact aaggacctcattcatcgtc r CT I ccatcctg gggaacatgtga 
ggga ctgt ctgagtgacctgctgggtcatccci rem i ggacttgggagagccgct atag 
gacgcttcggaatgt gggaaatgaatccgacatcaaaacacgaaaatctgaaagtgagatcc 

TCAGACT ACTGCAACCTGGGCCTTCTGAACATT(^CAAAAGTTTTGACAAGTGGACGACT AAG 

ATTAATGAATGTGTTATGAAAAAAATGAATAAGTTTTATGAAAAAAGAGGCAATTTCTACCAG^ 

ACACTGTGGGTGATCTGCTAAAGTTCATCCGGAATTTGGGAGAACACATTGATGAAGAAAAG 

CATAAAAAGATGAAATTAAAAATTGGAGACCCTTCCCTGTATTTTCAGAAGAC^ 

CTGGTGATCTATGTCTACACAAAACTAGAGAACACAGAATATAGAAAGCATTTCCCCCAAAGC 

CACAGTCCAAACAAACCTCAGTGfTGATGGAGCTGGTGGGGCCAGTGGGTT^ 

GGGTGCNNN

>652

WtC CAAC TQCTOCTGCGOOGCGGTOGOGGCCGCOCGOGGCro 
GGGCCAGGTCTCA GGGCT CCTGGAGCTGCAGGCGGCGGGAGGGGCTACAAATGCTTGA 
CAGTGATGCAGAACCTTTCAGAGTTAGCrrGGAAGCCACAGCCCTGCCTCrrrc 
GGATCCAG(^CGGTGTGAAGAGGAGACCCC7T(XXrrCTTGTGGGGTTTGGATCCTGTGTTTC

T AGCCTTTGCAAAACTCTACATCAGGGATATCCTGGAGATGAAG GAGTCCCGOCAGGTGCC 

AGGTGTAI 1 1 1 IGTACAATGGAC^TCCAATAAAACAGGTAGATGTCTTGGGAACTGTCATTGG 

AGTGAGAGAAAGAGATGCTTTCTACAGTTATGGAGTGGATGACAGCACTGGAGTTATAAACT 

GCATCTGCTGGAAAAAGTTGAATACTGAGTCTGTATCAGCT GCTCCAAGTGCAG CAAGAGAG 

CTCAGCTT AACCTCACAACTT AAGAAG CT ACAAG AGACCATT GAGCAGAAAACAAAGAT AGA 

GATCGGGGACAC<^TCOGAGTCAGAGGCAGTAT(XX5CACATACAGAGAAGAGCGAGAGATT 

CATGCCACCGCTrACTATAAAGTGGACGACCCAGTGTGGAACATTCAAATTGCAAGG^TGCT 

TGAGCTGCCCACTATCTACAGGAAAGTTTATGACCAGCCTTTTCACAGCTCA 

AAGAAGAGGCACTAAGCAATCCAGGCGCJCCTGGACCTCCCCAGTCTCACGAGTTTGCTGA 

TGAAAAAGCCAAAGAATTCCT (>TGGAGAACAGAGTGC^GAGCTTTTACCAGCAG GAGCTG 

GAAATGGT GGAGT CTTTGCTGTT>XXnTGCX^AATCAGC^TGTGATTCACAGTGCCTGCTCCGA 

CCAAGTGAATTTrAAGAAGGACACCACTTCCAAGGCAATTGATAGTATATT^ 

ACAACT GCTGCAGGAAAAAGGACTTGTTTTCCAGAAAGATG ATGGTTTTGATAACCTATACTA 

TGTAAACCAGAGAAGACAAAGACCT G CACAG AAAAGATCCAGCGGATTCACTTCACGCAGG 

^CTCGCCAGAAAAOCCACATCCCCTGGAGGCAGGGCTGCCACTACCC^VGGCATTC^ 

TGGGTOCCTaAACAACAGCa^ 

CGGGAGACCCGAGGGCCATGGCGGACACGGGCCCTCAACGGCCTTC 

CACGGCGAN 

>653

NNNCAGTGGCCTGCTAAAACATGCACTCTCTGGT7TAAAGTTTGC 
TG CCCCATATGCCAC TGTGCCCAGCTGACATATTCTCACTTAATAGAATTATAGAGAAAT^ 
TCATG 1 1 1 1 1 C 1 1 1 1 1 CTCTCCCACTTTTTCATATTCCTCTTTTTCATTTTrGCG 
GTCT ATGATGT AGGCTTCTGAGGAGAACX^AGAAGCTTGG CTTTAGTGGTAGAATGACAGAA 

G II 1 1 CCAGAATTCAAAAAAAAAAAAAAATCAATCAACACAACACACACA 
ACACACTGCCTTT AAAAGGAGGAGCCAATATTTGTTG T ACCTGTGAACTGAGGAATT ATAGAT 
AAACCTT AG GTCAA ATCATTTCACAATTGCATTGGTGGTATTGAAAAATGATGAGATTTCTCT 
GACAGAGAGCTTrGTCCTAGI rTTTGTTCl ICATAGGTCAAAACTGGCAATATTCTCTTGTCT 
GCAAGATAAAGTGII I GTGC1 I CTATCACCATATGCATGAACATGT AAGAATCAGATACAATT 
TCTGCTTCATCAGTTTCAt^TGTTCATGTTGTCACT ACTGTTTATAGCTC 
OCAAGGAGACOCCAAATCCI ITTTT TI ICI 1 1 1 GAGATGGAGTCTTGCTCTTGTTGCCCAGGC 
TGGAGAGCAGT AGCCCGATCTCAGCTCACTG GATTC 
TCCTGCCI CAG CCTOCCCAGTAGCTGGGATTrACAGGTGCCCGCT ACCATGGCTGtGeTAAT 
^nTT^^Xi^Cr^™' - '^^^*^^^^^*^®^'"'"^ CAOCATGTTGGCCAGGCT CSATOTCAAACTOCTGiAC 
CTCAGGTGATCCACCCACCTC^^ 

GCCrGjSCCOCGTATTC^TTTTTTA n 1 1 CM I GCAAAGA 

AATGAGCATAAAAATGAATACTCCAAAGAAAAXSGAATTATTATGGCAA^ 

>654 

• AC CGGGGGCGGCGAGGTACCtGl I ACCACTTT AAAAGTAAGTTCTCCATCCCATAAA 
GCCATTTAAATTCATTAGAAAAATGTCC^ 

g^gtgacac^gtcattgt^ , GCX^GAATACTOCTT 

TTTTAA^TTJCTATTTCT^^ 

GT ATTOCI CI I AACGTGAACCGTCTGTTCATTGTTTTT ACC I G 1 1 I I C GI 1 1 T A GCAAGTAGTA 
CTTAATTTAAAOT 

TT 1979^ nui AGT g ^"^CTTGGAGTTAAACTCTTAAGT ATCAAGAAGTAAAAT AATTTC 
A! rei i rors i T ACT AA CATTAAAAGTGTG CAAATTGT ATAAAATTAGGT ACTTCGATTAGT AA 
AAGAAAAA^AjrTGAATGU 1 1 1 1 1 1 1 1 1 I ATTTTATTAAGCATGCCCAGTTATGCCAAGCATA 

I^S^JSf AAAACI 1 raTCTTGTCATTATAATAATAAAAAAATGAACATTTATTATGGAATT 

TCT ATGAGCCAAAAT AAAKNNNN 

>655

OGATGCACCGCGCX>GCCCCGGCCGACGGACGVVGA 
TGGCCCGGCGGCGAGGAAAAAAGGCAGTTTTGAAGCATTTAGTGGCAAAAGGGGTAGGAG
ACGGCTGCT GAAGTGTCT ACT ACATTCAGGGGACCTGGAAGGCTTCCAGATATT^CGCAATG 
AACAATGAAAAGAAATTGTGAAAGAAGGCGCTTCCCTCAAGCGGGAACT AATGAACTOCTGG 
GTCA CGTGGTGCGCACCC<5CCAAAGGCACACGATAAAGGAAAACGCGGG<rrTATCGTGGC 
CTTT A AAATCC GGCTTGGTGAGCTTGGGTCGCCTCTGAAGGAGAACCATTTTCCATCTC^ 
CATAG I I 1 1 1 1 CCCCCAGTCAGCGTGGTAGCGGT ATTCT CCGCGGCAGTGACAGTAATTGTT 
TTTG CCICIII AGCCAAGACTTCOGCCCTCGATCAAGATGGTGGTTGGACGGCCTrCCTAAC 
CTTT ACXjGGGCCTGGCGGTGCTGACGCCTGAGCTGGTAGGGGTGGAGCAGGTAGGAAACA 
GCAAATGCAGAAGCTGCTGCGOGGAAGTCGGCCATGGACTGGAAAGAAGTTCTTCGTCGGC 
GCCT AGCXjACGCCCAjACACCTGTCCAAACAAAAAAAAAAGTGAACAAGAATTAAAAGATGAA 
G AAATGGATTT ATTT ACAAAAT ATTACTCCGAATGGAAAGGAGGT AGAAAAAACACAAATGAA 
TTCTATAAGACCATTCCCCGGTTTTATTATAGGCTGCCTGCTGAAGATGAAGTC^ 
AAATTAAGAGAGGAATCAAGAGCTGTCTTTCTACAAAGAAAAAGCAGAGAACTC 

gaagaattacagaacttatggtttttgctggacaaacaccagacaccacct atgattggaga 

GGAAGCGATGATCAATT ACGAAAACTTTTTGAAGG TTGGTGAAAAGGCT GGAG CAAAGT G CA 

AGCAAI 1 1 1 ica cagc aaaaotctttgctaaactccttcatacagattcatatggaagaatttc 
catc^tgcagttgtttaattatgtcatgagaaaagtttggcttcatcaaac^ 
cagtttatatgatgtcgctgggcaggggtaccttcgggaatctgatttagaaaactacatat 
tggaactfatcxxrracgttgccacaattagatggtctggaaaaatctttct actccttttatg 
tttgt aca gcag ttaggaagttc rrcrrcTTTTi agatcctttaagaacaggaaagataaaaat 
tcaagatattttagcatgcagcttcctagatgatttattggagctaagggatgaggaactgtc 

CAAGGAGAGTCAAGAAACAAATTGGTTTTCTGCT CCTrcTGCCCTAAGAGTTTATGGCCAGT 

acttgaatcttgataaag^tcacaatggcatgctcagtaaagawsaactctc 

CAGCTACCATGACCAATGTCTTCrrrAGACCGTGTTTTCCAGGAGTGTCT CACTTATGATGGA 

GAAATGGACT ATAAGACCT ACnrTGGACTTTGTCCTTGCATTAGAAAACAGAAAGGAACCTG C ' 

AGCTCTACAATAJATTTTCAAACTGCTTGATATTGAGAACAAAGGATACCT 

CTT aattatttcttt AGG GCCAT ACAGGAACT aatgaaaatccatggacaagatcct gtttca 

TTTCAAGATGTCAAGGATGAAATCTTTGACATGGTAAAACCAAAGGATCCTTTG^ 

CTTCAGGATTTAATCAACAGTAATCAAGGAGACACAGTAACCAGCATTCT 

GGCTTCTGGACTTACGAGAACAGAGAGGCT CTTGTTGCAAATGACAGTGAAAACT CTGCAGA 

CCTTGATGATACATGATCTCTGAAAGACTAGAATTGTCTTATATTATGAGATACT^ 

CATGTAAAGCCTTTAAAGCAAAATCCTCAGAAATGGTCT AAAT AAAACACTTGAT ATGCCT AG 

AGAAJCACANHNfOWNHNNNNNNNNHNHNNNHNH 

>656 

GAGGCCAAGAATTCGGCACGAGGAGTTTACXrrGTGTGGAGCGGCG^ 
AGCTCGGCGGCAGCCTGTGAGAGCTGAGQGTCAGTTCTTCGAGT agatctcaagctgcgtt 
TTCCT CCTTCT CCAAAGCAGGGATGGGAAGGTGGAGGCTACTGGTTGAAGAGAAGAAAGGG 
GTTGGGGGAATGCAACACCTGCAAACACT AG^5GATTGTGGGTCGAG^GGAAGAGCTAATGA 

gagccgagctcaggtatcccaagtcaaccagaatcaaatg^ 

TAATGCCCACAGTAACAGGGCTGTTTGACAGTGGCAGAAGAGGACGCGACT 
GTGCTGAGAGGGGGAAAGAAGCACAAAGI I IbUICI G i I GCGTAATTGAATTTTTAACACTC 

TTATCCACAACAAj&xr rrrrrc Grr gtcctgctgtgt aaaagacatcagat atattacagattt 

TGAAACAGGTGAGCATCXrrTTTACGAGCTGGGCAGGTGGGGAG^ 
GTGAGGAGATTTGGTTGAATGAACGCTAAGATGGCCAGACG^ 

GCAGCCT GGGATTCCAGAGCT GCAAACAACCACTGAATTarTTCTGTAAACCTGTTGTCATT 
TG ACGTTTCCAG GCAGGCATCAACATTTACATTGTAATTCAATAGACGCT actactacaaagg 
AGCTTTAi i u 1 1 CCAGCTT AATATGGTTGCTGCGCGAACACTGCAGGtXlAAACGACTTTGGG 

6ACCGOCCCGCGOGGAACCCI I gggaacaacgcaaattagcaaaggcgtaatttataaccg 

CTGCT ACCAAAGAGT GAAGT ATTTGTGGAAGTTCTGACTGGAATCT AGCT CGT ACATATGGA 
TTGTGGAAATAAAGGA GCAOGTGG AAAGCTTGCCATATTTT ATTTCAAGTT aatactaaagct 
GGCAGACTT AAT ACAAC 1 1 ICI 1 1 1 AATCT/TT ATGGATCACAT ATTTTCTGAAAATT ACATGAT 
>657
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G CTG TAGGCX*>GGGTGG i iGv/i GCCGAAATGGGCAAi>i i CATGAAACC I GGGaaGGTGG I 
GCTTGTCCTGGCTGGACGCTACTCCGGACGCAAAGCTGTCATCGTGAAGAACATTGATGAT 

ggcacctcagatc^cccctacagccatgctctggtggct 

aagtgacagctgccatgg<xaagaagaagatcgccaagagatcaaagataaaatcttttgtg 
aaagtgtataactacaatcacctaatgcccacaaggtactctgtggatatccccttggacaaa 
actgtc^tcaataaggatgtcrrcagagatcctgctcttaaacggcaaggcxcgacgggag 
gccaaggtcaagtttgaagagagatacaagacaggcaagaacaagtggttcttccagaaact 
gcggttttagatgctttgttttgatcattaaaaattataagaaaaaaaaaaaaaaaaaaaaaaaa 
AAAAcAagaar 



i cc yocg gacaggcgca 



TTCCTCCTTCTCCAAAGCAGGGATGGGAAGGTGGAGGCTACTGGTTGAAGAGAAGAAAGGG 

gttgggggaatgcaacacctgcaaacactagggattgtgggtcgagcggaagagctaatga 
gagccgagctc^ggtatcccaagtcaacc^gaatc^aatgcgcagtacgccacaaggc^t^ 
taatgcccacagtaacagggctgtttgacagtggcagaagaggacgggactaaagttacttt 

GTGCTGAGAGGGGG AAAGAAGC ACAAAGTTTGCTCTGTTGCGTAATTGAATT7TTAACACTC 
TTATCCACAACAAACAut 1 1 I IU GTGTCCTGCTGTGTAAAAGACATCAGATATATTACAGATTT 

tcaaacaggtgagcatccttttacgagctgggcaggtggggagtggcgtggttttgatgga 
gtgaggagatttggttgaatg^acgctaagatggccagacgcac^cttcgatctcaactct 
gcagcctgggattccagagctgcaaacaaccactgaattcxsttctgtaaacctgttgtcat^ 

TG ACGT TTCCAGGCAGGCATCAACATTTACATTGTAATTCAAT AGACGCT ACT ACTACAAAGG 

agcttt attctt ccagcttaatatg gttgctgcgggaacactgcag gcgaaacgactttggg 
gacggccccgcgcggaaccctt g ggaacaacgcaaattagcaaaggcgtaattt at aaccg 
ctgctaccaaagagtgaagtatttgtggaagttctgactggaatctagctcgtacatatgga 
ttgtggaaataaaggaggacgtggaaagcttgccatattttatttcaagttaatactaaagct 
gccagacttaatacaaci i ici 1 1 1 aatctttatggatcacatattttctgaaaattacatgat 

>660 

NACCCACCAGAAACATATTATACTATACACACTCAGCCTGGCT GTTAAGAGCTGGAG 
ACTGAAAGTGGTGTTGATTGACCGTGGACCCAGGGAAATATCTCAGTGGTTAAGGATACTCT 
GTGTGTCTTACAAACAGAGACACAGATGCTGCCTAAGAACGAGGTCCTCCTGGAACCGGTG 

ccagggaaggattttgttgctcacactagtcttgatcttctgggctcgagtgatxx^ c 
ct aagtgttgggattac^ggcatgtaccacxxscgtctgtaccaaagggaaagttaggtctg 
gagactgagccacacaacactgaagttccttt 

tgttaag tggtt aagcattgtagcttacxx>gatgcctggtatcctggagtcccaggagcac 

ii i i i c i ggaagtaggagc ttgtt tccttscaggtgccaagacagagaccgacattgtttgtt 

ggctgggtcggtctcccagttttcagctggcto^gtctcacctgt^ 

catgtctcccatagtcccctcggtggggacagaggcactggatgaagccctgctcgtcacc 

acagagacactgtgcaacctcagtttccccaggtggagaccggacca^cagtgatggcctc 

tgcctcttcaggtcatagtaccggtgaatacaggctggcacgxscct aggcactcagtaaca 

cacggcagaggcacaggacttaan 

>661 

NNCATCA(^TCTCAACATCTCAGAGCCATCCTCTAAGTGTCACATGTrCCGTCTTAAT 

Sn^^ A 5 x ^ A ^ AGC 95 AG ' I 1 1 1 1 1 1 I cm I U I acagaaggccactccatttacccccac 
ttgccgctaacctgcactcttgtccaatatacccctgacccaaatattgtttttc^ 

ACCAAGTTAGAAATCCGGCCCTTrCACAAAGGTCAACATTTTCAGC 1 1 CI 1 1 I AGCAGGAATG 
CTOCTTCAAAGGGTATCTGCATTCCTGTTCCCACAGCCCCX^AGCAT^ 

°^ TG 55IIHI CT A ^^<^^^^CATGGT(XTGGGTTAG AAGCTGCAGAAAGGACTGT 
GCTGACTCCCTCAACTAAGGG^TGGTACTACCGCTTGCTCCTCTCAGCAGCTGACATCCCA 
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GAGATCAGTrrarmCTGGGAGGGACTCCTCATTGAGGGGGGTGCAGAAGACCTGCAGAC 
CTGCATCAACCCTGGTCTTTGAGGCTTCAACTCCTGAAAATCAATCAATTGAGCCCTACCTTT 
ACCAGACCTTT AGTCTCTCTTCTTCTG I I ICI 1 1 1 IGGCTTTCTGGACAGTGCTGACTCCAGC 
TCTTTCCTTCTTGGGATGCTGCTGCCTCTTCCTCTTACCTCTGCTGCTTGAGCC^ 
CTCTGCTCTCTGCTTCCCTAGCCCCGCTATCCTTCTCATC 

CCT CTTGATGCTGCTGTC 1 1 II CH CII GCTTTTCCTGCTTCTTCCATCACTGCATGCTCTGCT 
CTCTGCTTCCCTGGTTCCACCACCTAAAACAGTTTCCTCACCTCTATCTTCTAGGTT 
CTCCT CCT CCTGTTGCTGCCTCTT CTTT CTTCGGCTG CATGGGTCAG GATATGCCCT GCTCT 
CTA(XTCCTCTGTCCTGACACTGCCTGCAGCCTCCTTGCCTTTTCCTCCT^ 
CCCATCTTCTCCTCTTCATGGTGCCACCT CI I C 1 1 1 i I C I I CGCTTTCCTGAGGGACTGATTG 
GTrrGCTCTCTGCTATTCAATTOXCAAGCCCACTTGTTCCTGCAGCGTCCT 
GC TTTAGT TGTACCCTCTCTTTCATCTGAGACCTTTCCTTCTTGATGTCGCC I I I ICI ICI ICT 
TGCTTrrrCTGATGTTCTGCTCAGCATGTTCTGGGTGCTTCT 

ATGCTGTAGCTTCTTCCTCCTCTTTCTGCCTCCTTTTC 1 1 1 1 ICI I 1 1 1 1 1 1 GGGGGGCTTGCT 
CTCTGACTGCAGTTGAGGGGCCCCAGGGTCCTGGCCTTTGAGACGAGCCAGGAAGGCCTG 
CTCCTGGGCCTCTAGGCGAGCAAGCTTGGCX:TTCATTGT^^ 

TGTGCTGTTCGCCCCTCACAGGCTTGGAGCAGCATCTCATCAGTCAGAATCTTTGGGGACTT 

GGACCCCTGGTTGTCGTCATCACTGCAGCTCT CCAAGTCTTTGTTTGGCTT ctctcc ACCTG 

AAGTCAATGTAGCCATCTTCACAAACTTCTGATACAGCAAGTTGGGCTTGGGATGATTATAAC 

GGGTGGTCTCCTTAGAAAGGCTCCTTATCTGTACTCCATCCTGCCCAGTTTCGACTACCAAG 

TTGGCCGCAGTCTTGTTGAAGAGCTCATTOCACCAGTGGTTTGTCAACTCCTTGGCAGGGTC 

ATGTCCTACCCCATGAGTGTCTTG CTTCAGTGT CACCCTGAG AG CCT GAGTGAT ACCATTCT 

CCTTCCGGCCGAGGCCTTTGCCTTGAGTCCATCCATGCTTT AGCAGCTGCTCCT CAGCAAAC 

TTCATaXACGACTCTTGACCTCTGGGGTCAC^rrCATGGTGAGAAAAAAGTCT^ 

AG ACTCTCACCAGAGAAG AAG AGGTGATATCATCCGTCATTGCTGG GTCCG CTCCGCGGAA 

ACATGTGCCGGACCTGACrTTGTGC<5CCOCCATCTTC<XCCGCGTACCCCTGC 

C<X^CTCTTAGAACTGAGTGGATCCCC<X3G^ 

ATACCCGTCGTACCTCGAGGGGGGGGGCCCGGTACCCCCAGCTGTTTGTTCCCflNNNN 

>662 

>663 

natgcccagtcc«>c<^cacctcccactrc^^ 
gggctgcccttgacctggcct agag cxxttcccc^^gctggtggtggagct ggcactctct 
gggagggagggggctgggagggaatgagtgggaatggcaagmggccagggtttggtggg 
atcaggttgaggcaggtttggtttccttaaaatgccaagttggggwccagtggggcccaca 
tataaatcctcatoctgggagcctggctggcttgctctcctrcctgggtctgtctctgccac 
ctggtctgccacagatccatgatgtgcagttctctggagc^ggcgctggctgtgctggtca 
ctaccttccacaagtactcctgccaagagggcgacaagttcaagctgagtaagggggaaat 
gaaggaacttctgcacaaggagctgcccagctttgtgggggagaaagtggatgaggiaggg 
gctgaagaagctgatgggcagccttk^tgagaacagtgaccagcaggtggacttccagga 

GTATGCTGTTTTCCTGGCACTCATCACTGTCATGTGCAATGACT^^ 

ac cgaccc tg aagca gaactcttga cttcctgcc atggatctcttgggcccaggact 

atgcctttgagttttgtattcaataaac 1 11111 i gtctgttgataaaaaaaaaaaaaaaaaaac 

aaaagagaaaaaaccctgccgggggacataatattgcgcggggoggcgagaaacaaaggg 

ggtag attcacaggggcgcagtggcgccacaacagcggtgcttccctgtacxa^ 

tttttgtgacocaggggtggaacaactctacgcgcggcgccoccaatattgggacggcgc 

gcgataaaattcagaaaaccgcccgcgtacccctggattagagtagcgcgccgaggcaaac 

agctagcgcaatcatgagatcagcaagggcgactcacccaggcaggctccatggatgacgt 

gcatgcatctatcccacgcgctctgcctcgactrcacxact agccggcgagcgaactcacgcg 

gcccgcagctgctc<3tgtaatcccgtx3cccgcattgacacgcagcaggcccgagcaggca 

CTTCATGC<?rATamCAGCGATCC6GGACC^ 
GCGCCGCAGGACCACCACTGCTGCGGCTAAGAN 
>664

. cgaqcccgcgtccgagatagccttttggagtcgaagatgmggaaaagcctgt 

TATAGTCTTGGAAGTGTUI ICI 1 1 IGCCAG6ACAGAGAGAGGAGCTTCAGCAGTGAGAGCAA 
CTGAAGGGGTTAATAGTGGAACTTGGCTGGGTGTCTGTTAAA CI I II! I C CCTGGCTCtGOC 
CTGGGTTTCCCCTTGAAGGGATTTCCCTCCGCCTCT 
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ACTGGTAC<3CCTC<X3CGGTATCTGCATCGGGCCTCACTGGCTTCAGGAGCTGAATACCCTC 
CCAGGCACACACAGGTGGGACACAAATAAGGGTmGGAACCACTATTTTCTCATCACGACA 

ggaacttaaaatgcctgggaagatggtcgtgatccttggagcctcaaatatactttggataa 

tgtttgcagcttctcaagcttttaaaatcgagaccaccccagaatctagatatcttgctcaga 

ttggtgactccgtctcattgacttgcagca(xacaggctgtgagtccccat^^ 

agaacccagatagwtagtccactgaatgggaaggtgacgaatgaggggac^catctacgc 

tgacaatgaatcctgttagttttgggaacgaacactcttacctgtgcacv^gcaacttgtgaat 

CTAGGAAATTGGAAAAAGGAATCCAGGTGGAGATCTACTCTTTTCCTAAGGATCCAGAGATT 

catttgagtggccctctggaggctgggaagccgatcacagtcaagtgttcagttgctgatg 
tataccxattrgacaggctggagatagacttactgaaaggagatcatctcatgaagagtcag 
gaatttctggaggatgcagacaggaagtccctggaaaccaagagtttggaagtaacctttac 
tcctgtcattgaggatattggaaaagttcttgtttgccgagctaaattacacattgatgaaat 
ggattctgtgcccacagtaaggcaggctgtaaaagaattgcaagtctacatatcacccaaga 
atacagttatttctgtgaatccatcc^caaagctgc^agaaggtggctctgtgaccatgacc 
tgttccagc^ aggg tctaccagctc^gagattttctggagtaagaaattagataatgggaa 
tct acagcacctttctggaaatgcaactctcaccttaattgct atgaggatggaagattctgg 

AATTT ATGTGTGTGAAGGAGTTAATTTGATTG GGAAAAACAG AAAAGAGGTG G AATTAATTGT 
TCAAGAGAAACCATTTACTGTTGAGATCT CCCCTGGACCCCGG ATTGCTG CTCAGATTG gag 

acrrcagtcatgttgacatgtagtgtcatgggctgtgaatccx^tcttrctcctggagaacc 
cagatagacagccc tctg agccggaaggtgaggagtga6gggaccaattccacgctgacc 
ct gagccctgtg^ttttgagaacgla^c^ctcttatctgtgcacagt 
gaaactggaaaagggaatccaggtggagctctactcattccctagagatccagaaatcgag 
atgagtggtggcctcgtgaatgggagctctgtcactgtaagctgcaaggttcctagcgtgt 
acccc cttgaccggctggaqattg aattacttaagggggagact attctggagaatatagag 
tttttggaggatacggatatgaaatctct agagaacaaaagtttggaaatgaccttcatccct 
accattgaagatactggaaaagctcttgttt'gtcaggctaagttacatattgatgacatggaa 

TTCGAACCCAAACAAAGGCAGAGTAC<XiAAACACTrTATGTCAATGTTGCCCCCAGAGATAC 
AAC<X3TCTTGGTCAGCCCrrTCCTCCATCXTGGAGGAAGGC 

TGAGCCAGGGCTTTCCTGCTCCGAAAATCCTGTGGAGCAGGCAGCTCCCTAACtM 
ACAGCCTCTTTCTGAGAATGCAACTCTCACCTTAATTTCTACAAAAATGGAAGATTCT 
TTATTTATGTGAAGGAATTAACCAGGCTGGAAGAAGCVM3AAAGGAAGTGGAATTAATTATCC 
AAGTTACTCCAAAAGACATAAAACTTACAGCTTTTCXrrTCTGAG 

CT GTCATCATCTCTTGTACATGTGGAAATGTTCCAGAAACATGGATAATCCTGAAGAAAAAAG 

CGGAGACAGGAGACACAGTACTAAAATCTATAGATGGCGCCTATACCATCCGAAAGGCCGA 

GTTGAAG< 3ATGC GGGAGTATATGAATGTGAATCTAAAAACAAAGTTGGCTCACAATTAAG>AA 

GTTTAACACTTGATGTTCAAGGAAGAGAAAACAACAAAGACT ATTTTTCTCCT GAGCTTCTCG 

TCCTCTATTTTGCATCCTCCTTAATAATACCTGCCATTGGAATGATAATT^ 

AGCCAACATGAAGGGGTCATATAGTCTTGTAGAAGCACAGAAATC^AAjWTGTAGCT 

TTG ATATGTTCAACTGGAGACACTATTTATCTGTGC^AATCCTT GATACTGCTCATCATTCCTT 

GAGAAAAACAATGAGCTGAGAGGCAGACTrCCCTGAATGTATTGAACTTGGAAAGAAATGCC 

CATCT ATGTCCCTTGCTGTGAGCAAGAAGTCAAAGTAAAACTT G CT GCCT GAAGAACAGT AA 

CTGCCATCAAGATGAGAGAACTGGAGGAGTTCCnTGATCTGTATATACV^ATA^ 

ACATATGTAAAATAAAATTATGCCATAGCAAGATTGCTTAAAATAGCAACACTCT ATATTTAGA 

TTGTT AAAATAACTAGTGTTGCTTGGACTATTATAATTTAATGCATGTT AGGAAAATTTCACAT 

TAATATTTGCTGACAGCTGACCTTTGTCATCTTTCTTCTATTT^ 

TTCCTATATAGTTTATTGACAATAATTTCAGGTTTTGTAAAGATGCCXK3 

GACAAATAATAAGCAAAGG<5AGCACTGG<rrrGACTTTCAGGT ACT AAAT ACCTCAACCT ATG 
GTATAATGGTTGACTGGGTTTCTCTGT ATAGTACTGGCATGGTACGGAGATGTTTCACGAAG 
T TrGTT CATCAGACTCCT GTTjCAAC TTrCCCAATGTGGCCT AAAAATGCAA C I I C I II I IATTT 
TCTTTTGTAAATGTTT AGG I I 1 1 1 1 1 GTATAGTAAAGTGATAATTTCTGGAATTAAAAAAAAAAA 

aaaaaaaaaaaaaccactacaggagaaaacaccccaaaaataagacaaa 
ccgaaagtgacacgcaacaca^aacagccatatgacaaagaacgact ccccactcagagga 
caacxataaataacaccacagcccagccacatca^ 
atcaacatgacaatgcgaccaaac ga ccaccatgcgggagaaacagcagc^ 
agaattgcaaccag caaacaacaagccacc&acacgacgaacacacacacaaatgacaaga 
aagcccagaoctcgaaacnn 
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atagggagtcgacccacgcgtccggcggggaggtcgctcgactcggggggcggc 
cgcc<m3ggcaggtacgcggggagactttccctgccggcacatggac^ 

tct caatggtg(xx^c^aggtcttcccttctttggacgaggaggtccccccggccgaggct 
aacaaggaaagcc cctg gagctcctgtaataagaatgtggttggaagatgcaaactgtggat 

GATCATCACCTCCATTTTCCTAGGTGTCATTACAGTGATCATCATAGGCTTATGTCTTGCTGC 

AGTAACTTATGTTGATGAAGATGAAAATGAAATACTTGAATTATt^TC^^ 

at GATGCTGAAGATTCCAGAGGAGTGTGTTGCTGAAGAGGAATTGCCTCACCTGCTCACCGA 

aaggctcacagatgtgtacagtacatcgccctctctgagtcgttattttacttcagttgaaat 

agtggactt cagtg gtgaaaatgccacagtaac^tatgacctgcaatttggggttccatcag 

atgatgaaaattttatgaagtatatgatgagtgaggagttggtgctgggcatttttgctacag 

gatttccgtgatcagaatatacctggttgtgagagtctggggcttgatcx^c^^ 

gctctatgaatgaagtgatggaggctggtctctgtctgaaagc^gtgctctaccaaagtcct 

ggagattgaaggggatc^actc^ggtttgcagagaagattctgtggattaatacagaagca 

cx^agcaacac^cagaggggtggagactcctttctrctcxxx^ttct acagtct ggct ctaagc 

C^AGTAAAACAGCTCCCGAGCACTG(TrTCAGCTGGGTCC^GTCTTGACAAAGGCAGGAAGC 

CAGCTAGGGTGGGGGCGATAGGGTCAGCGGGTATGTCCCACTGTTGGAGGTCACTGGTAT 

TCTGNNN 

>668 

TCCGGAAATCTACAAGAATCATAGAATTCAATAAAAAAGGTAGAAAGTAATTTTTTTA 
CTTAAAAATATAAATTAAAATAAATTTTTAAAATCATAAGCACATAAATAGAACTT ACCAGGGA 
GAAAGAAAAACCTGAAGGCACAA 1 MCI I 1 1 CTGTTCAAAATGTGAACCCAGGATGTCTCTAG 
ATGATGATGGATGATAGGTGGGGAGA 1 1 1 1 II I 1 1 1 1 1 1 1 AATACAGAATCTCATAGTTTTGGA 
TTAATTAGCACCAATCAGTTTAAACACTGACTGTTAGAATAGCTGGATGGG I 1 1 I 1 1 I C I 1 1 AA 
ACTAATTAAGCG TTGGCTA CTTAGTATAAGTAAGTATAAGCCGAATTAAGGTTCTGCTACATC 
TGTGTTTAGAATA F ? M F 1 1 AAAACTAAATAAGTGTTGGCTAGTTTTGCGGTGTAAGCAGAATT 
AAGGTTCTGCTACCTCTGTGTTTAGAATATTCCCAATGG 

tttgaccctgtatag actttaatttaaaatgaatttg gtaacgtttctcctctgtctctacata 
tattcatgctttcacctgctcttttaac^cctgctttt agtatctgaggcactttttctgaact 

CTACTTGTGCACTGGATCCCTa^CCTTTCTCTGCCAGGCTGT^ 

ACCACTT AGTGATTCC I MCI 1 1 GT ATAAACATGGTAAATGTCTTCATTAGCCT AAAAGGAAAG 

ACCAAA TAAA ACCTTTCCTACCACTTGGATGCATTTGCATCCT 

GCTC CATTTTCTCCCTTCCCAGTTATTCCTTAGCCCAGCCATCTCTGTCT^ 

TTTTCT T AGGATATTCTTGGGAAAGATGAGCGGAGACTGCCCGCCTTGTCAAATCTAGTGTCT 

TTTTTTCAGTCCTCACACTGCTTGACCTATGTATAACCTCCTATACTTC^ I C 1 1 1 GCATACTC 

CTCTGGGTTTT CT GTG G TAGTCAAGATTCCTCCCTGAGATTT ATTTCCCATGAGTCTTGACCC 

CTCXXXrrCAGTTGGTGCTATTTaXCCTACCCtSCCCT 

GTTCAGI 1 1 ICI 1 1 CATGCT ACCTGAATGTCCTG AT AAACT GG CT CGCTCTC I ICI 1 1 ACCTTC 
CAT AATG GCATT ACCATTT ACCACGCCACCCAAG AT ACTTACT AGGAACCTCAAAGT ATTGTA 
TTCTTTTTCTCCATGACACTCATACTTAATCATCAAGTCCTTT^ 

ATGTCCCTTCTT AATTCCTGCTGCCTTCTTAGT AAAGGCCTTCA I IC I I H 1 1 C CCTAGTAATA 
ATCTTTT CCATATGTTCCAGTTAAAATACCATGTTCTCCCTATTCCTT ATTACATAGCT AGCAT 
TCCTTGAAAAAAAACAATTCTCTCAGGCCTCCATACCTTTAGCATGTT 

GCT CTTCT GGAACT AGAACACT CATCCTTGAAGGCT GGGCTTCTGT ATGAAGGTTGGTCCTG 
CCTCCTTACrTGAGGTGAAGCTTTGTACATGCCTGTATTACGGA^ 

TTGTCTCTTTCGTCATTGGGACTCCAGCACCCAGGATAGTCCCT AGTATACTAGTTGGTGCC 
GAATAAATAGT AGCT ATTATTAGAAAAGGAAGGGTGAAATTGACATGGGAGTTAGTAAAATQT 

atatggaaatgatttttaaagggaaaggt aatgattttctggcaggaaaagc^gcaatgacaa 

GATTACTT AAGTCTTGTGAAAT AACACTTCTCTTCCTTGACCTG<rrGCTTCC^ ACCAC 

acacacacgcacacataccacagccctttgagactgaaagcagctct attgagaatagtagt 
gtcaactgt attatgtagaaattct aaagtttttgggattatttcat agccctgvac^ ac 
ttctctccactttatgtggca^gtttaatctcaggtctccctcatac^ 
acctaaccctcacacaacactccagtattgatgcagtcaatctitjt 
ccaatgtgcaaagcacqatgttggaaattataca 

TCAAAGATGTCAGTCI 1 1 1 ICI 1 1 GCAAGGATAACACATGTAGAGTAAAATGCATAAAGGGGA ? 
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CTAATTTTAAATGTACAGCTTAATTAATTTTTATGTATGTT aacacccatgtcaccaccatgttt 
AGGACATTTCCAGCACCCCTGAAATTTCCrrTGATGCCCCTTCCCAGTCTGTACCTAGACCT 

aaatctattttcaatcttaatggccttttaaataactgggcttctcacaaocatagtgaacaga 
aacagctgg gttgtcaacg tct aacct aat actt cag gaaaactcatga7 g g tttccatgtt a 
agagagacatggagcagggcactggcatggtggatggatcacgcctgtaatcccagcactt 
tgggaggccgaggtagggggattgcttgag«x>ggagttcaagactagcctgggtaatat 
aaggaaaacct gtctctgcaaaaaaaaaaaaacaaaagaggttcacccccatttggaaaaact 

TCXX^GGCCC^GGGGTTAGAAAAAATCATTATCGGGAAATTGGCATTTGGCCCAGGGTATTT 
ACAAATTCGGGGCCACTCCCCATTAAAGCTACCAAGGCCTCTCTTC^CAGC^ 

ACACCTACTTT accgtcctcttgggatccaaaaaaaagcccattttgcgaagttaactcttag 

GCCAAAAAACCT AGCTGGGGCGCTCCCACTACCCGGACTTCAAATTCTGCCACAGCGTTCA 
GTCAOCAACCACGCTGGTTCTGTACCCAAGACTrrCGATCTGGAAAGAAAGACTCGAAAAAA 
GCGATATTCAAGACTGTCTTGCACCGCAAACGCAGGGAGCTCCTAACOGCGCGACAN 

GCG TCCGG GCTG ATTT AATTGACAATT ATCAAAATT ACTGAAT ACAGAGAAAGG CA G 
GCGATTTGTGCCCTAAAAGTGACTAACG AG CCTCTGTGACTCCCACT ACAGCAGTTCGCAGA 
GATGAAAGTGGCAACAGCCTCGGCAC7AGGGGGTTTGTATTCTAGGAGAACGGCCGATCCG 
TCAGCGCTGGCTTCAGCCCCAGGCCTCAGATAAGTTG GGCACTGGGAAGCTCAG ATGAGAC 

atcttgctagctcctgtttt attccgctccaaacctggagctgagctgaggccatggtcaaa 
gccgactctccagtacccccagaagacactaaggccatatgcctctcccccacgatcatttc 
ttttgtcgttttaacagctgaactttgttcgattttatctccctct^ 

GAAAGTCATCTATTTGGACGATGATGTAATTGT ACAAGGTACTCTCGCTAACTGCCAAGAACA 
CTTGGAAAAGAAGCAGCTGATAAGTAGGAATAACTGTCTCTCTrCATTTTTCAAATAACn^ 

GTGCAGCAAGTTTTGACCTGCT attaaggtgtgagngagtattgttgtaatgtggaggnnnn 

>668 

taggctgggtgcagtg gctcacgcct gt aatcccagcacttgggaggccgaggca 
ggcagatcacttgaggtcag gagttcg ggact accct gaccgacatggagaaaccct gt ct 
ctactaaaaaaaatacaaaattagccgggtgtggtggtgx^gactgtaatcccagctactt 
gggaggctgaggcaggagaattgcttgaacccagggggcgg^ggttgcggtgagccgag 
atcgtgccgttgcactccagcctgggcaacaagagcgaaactctgtctcaaaa^ 
aaaanaaagattctattagagatatggcagatgtactctctgagctatttaattgataaagaa 
aaagtaggagttcctgctttttactaagatgataatcataattatgatcatactgatgaaaata 
ataaaattgctctgaxrrgtggtgaagattttttcccatttgttgatttagct attcagaaatat 
atttcaccctcagttcataactgatatactgctagaagttgagggtcaaatcgtggcaacaga 
ccgtgcacttcatggcattgttgagcatgaaacaaxttctgatagagccagaaattaagattc 
gcttatggcttacccaaggtaaatgcgaagtcaatggcagaatgaaaattcaagcttggggc 
caggtgtggtggctcacatctgtaatcccagcac 1 1 1 uggaggccaaggcgggcagatcac 
ttaagccgaggagttagagacgagcttggtg aacatg gcaaaacccagtctct 
annknjcwnnnnnnnnnnnnnnnnnnnal-i i1 1 1 m igi 1 1 1 m h nctggtcgaaaatttt 
tgttggaAttttaaagaaaagaaaggcaaagtagcactcagatggcc T llll T I GTAAAGTGA 

AGTCAACCTAATACTCTGGTGCTTACTTTGCAAATC 1 1 1 I CCAT AAGTCAAGT ATT AGTGTT AA 
CAATACACTTAAGAAGTAAGGATAAACCCATCAAGGTCCACAGCT AAATAACCAGCAGATTC 
CCAGAAACTTrATGTAITTGGGAAAAGTAAAATATACAACAGACATATCOCTGCCCTGATTAA 
GAGGGT AGAT AAAAACAAAACAT AAAACAATTTT ACTTGAGAT AG TAAT AAGTT ATTTGAAAAA 

aatacaacagaatatagggagagagagcaactacagaaagaagacagaagggttc reciTT G 

AATAGTAAGGCTTGGGAATAGCTGAATTGTAAAACAAATCT GTCAGTCCAAAAACGAAGATAA 

TTCCATTCAACCGCTGACTACTGAATGGGAAAGCAAACGTTGTCAC^ 

GGAGT AACTATT ACTTAAAGTCTCACTTTCCATACAGAAGAGACAAAGAATCT AGTCAAAAGC 

ACATGGAATCATATCTAGTCATAGTGGTAGGTGAGGAATCAATGCCCAGGCAGCTGAAAGGT 

GGGAACTGTTAAAGCCTTACACCAAGGAAAACATAAACTTGTACCTCGGCCG^ 

GGCGGGCGAGGTACAGAGTAGGATCAATAAAATCTGTGTGTT ACAGCGGCAGACTG 
AAGGACGGGTGCCTGTTTTCAGCCATGAGGT AGTCCCTGACCATCTGAGAACCAAGCCTGA 
CCCTGAAGTGGAAGAACAGGAG^AAGCAACTGAGGACAGATGCTGCCCGCATTG 
GCAG<XJCAGGTTGGACTGAGTCACTGCCT^ 

TAGGCATTACCATTCCTGTCTAGTAGGGATACATAGTTGGTTGCGCCTAAGTTGCTTCTGGC 






AGAACCCAAGGAATAAATTTCTCC^TATC»TTrCCTAGTTACtXTAATCTCTGC 
GTGTTACAGAAGCAGATCCAGAGCTTGAATAAAATGTGTTCAAACCTTCTGGAGAAAATCAG 
CANAGAGG AGCG ANNN 
>670 

NNNNCTGAGTGGCGGCGGCGGCGACGGCAAACCCGGAGCTGCCGGCCGGOGCG 



ACGGGGCOGCCCCCATTAGACGGAGCTGCGGGC«GAGGCTCCATGTTGGGAAGCGGCGC 
CGTTCGTGCTTGTTAGCGGGAATCCGGGAGCCGCGGGGTGAGCTGGCGGGGGCCGGGCC 

ctaagtgaagatggaggccccgctgcggcctgccgc^gacatcctgaggcggaacccgca 

gcaggactacgaactcgtccagagggtcggcagcggcacctacggggacgtctataaggc 

cagaaatgtacacacaggagagctggctgcagtaaaaatcattaaattggagcctggagatg 

An n ict ttgattcaacaagaaatatttatggttaaagaatgtaaacattgtaacatcgttgc 

ctactttgggagttatcttagtcgggaaaaactatggatttgtatggaatactgtggtggcg 

gatc^cttcaagatatttaccatgttactggac^ttatcagaattgcaaatagccta 

gcagagaaacctt acagggtcttgcctatttgcatactaaaggcaaaatgcatagagatatc 

aaaggtgctaatattttattgacagaccatggcgatgtaaaattagctgactttggtgtggct 

gcaaaaataacagctacgattgcaaaacgaaaatctttcattggcac^cccttactgga 

cccagaagttgcagcagtagagaagaatggtggctacaaccaactctgtgatatctgggca 

GTAGG^Tj^CAGCAAJTG 

atggtcatcaacattccataattttgt caaaa tagcagtaaccaaaaacccaaaaaaaagag^ 

AACTGCT GAAAGACTT CTGACTCACAGI I 1 1 GTTGGACAGCCAG GTCTCTCT AGAGCCCT AG 

cagttgaactgttagacaaagtgaacaatcx^gataaccacggacattacactgaagcagat 
gacgatgacttt gagccccatgcaatcattcxstcataccatt agatctacaaacaggaatgc 
cagagct gaacgg acagcttcagaaat aaattttgacaaatt acaatttgaac^ 
aaaagaaacagaagc^cgagatgaaatgggattgtcatcagaccc^aatttcatgttacagt 
gg^tccttttgttgatggtgcaaatactggcaaatcaacctcaaaacgtgcaataccacct 
cccctacct cctaagccaaggataagcagttaccxn^gaagacaactttc^ix^atgaagaaaa 

AG CATCAACCAT AAAACATTGT CCTGATTCAGAAAGGAG AGCTCCCCAAATTCTCAGAAGAC 
AGAGTAGCCCAAGTTGTGGGCGTGTGGCAGAGAGTTCTTCTATTGGAAATGGTGATGGTATT 

tcaaaactgatgagtgaaaatacagaaggatcagcacaagcaccacagttaccacgaaaaaa 
ggacaaacxsagacttccctaaaccagccatgaatggccttccacccacccgaaaagttct 
tgggagcatgcttttcaaaagtttttgatggctgtcctttgaaaattaattgtgcaag^ 
ggatacatctttgatacaaaagatcagtacattatttttggaactgaagatggt atttacacac 
tgaatctcaatgagct acatgaggcaacgatggaacagttatttocaoggaagtgt actt gg 
ctgtatgttatc aataa tagtttaatgtcattatcagaaggaaaaacgtrrcagctct actctc 
acaatctt atagctttgtttgaacatgccaaaaaaccaggattagctgcccatattcaaactc 

ACAG GTTTCCAGACCGAAT ACT AOCAAGAAAATTCG CTTT AACAACAAAGATTCCTGATACAA 
AAGGCTGCCACAAATGTTGCATAGTCAGAAAGGCTTACAGGXSGACATAAATACCTCT 
GCTTT ACAGTCTGGAATTGTTTT ACTT CAGTGGTATGAGCCAATGCAGAAATTCATGTTGAT A 
AAGCACTTTGATTTTCXTTTTGOCAAGTCCrrT^ 

CAGGAATACCCT ATGGTCTGTGT AGCTATT AGCAAAGGCACTGAATCGAATCAGGTAGTTCA 

GTTTGAGACAATCAATTTGAACT CTGC^TCTTCATGGTTTACAGAAATTGGTGCAGGCAGCCA 

GCAGTTAGATTCCATTCATGTAAGACAGTTGGAGAGAGATACCGTTTTA G PGTG1 F IAGACAA 

ATTTGJGAAAATTGTAAATCTACAAGGAAAATTAAAATCAAGTAAGAAACT 

AAGTTTTGATTTTOSCATTGAATCTGTAGTATGCXrrTCAAGAX^ 

ACATGGGATCCAGGGTAAAAGCTTCAAGTCAGATGAGGTTACCC^ 

CAAGAGTTTTCXXSCTTATTAGGATCAGACAGGGTTGTC^TTTTGG 

AATCCT ACTGCACACAGCAATCTCT ACATCTTGGCTGGACATGAAAATAGTTACTAAGGAACA 
GAAACTGATCT CAAATGACAGGAAAATGAATATACTCCA77 G AAAGGAAAAATAAGGAAATTC 
AATACAAACTGCACT ATGATTTGCTTTAACTATTATGGGTTATATTGCAAATGATCTGTACTTT 
AGG^TAGAATTCAATATTTTCTGCAGCTGGAAAGAGCTAGTCT ATCTCTTGCCACTGTGTGGT 
GQTTATATCAAGTTTGCTTAATAAAAGCT ATGAGACAAATAGTCCTCTAGTTCCAGGAAACAC 
AGTCTTTTTTTAAAAAAAACAATGTTTGTAACAAGGGTG(X>TGGT^ 

ATTATTJTTAAGAGAGGTAAATTTACTGATCATTTTATATCATGTCTT ATTCCTTCTTAATGAACA 
TAAi i ran AAATTCTCAAGCAAGGTTTTCACTTTTATATTGGCCATTCTGTATGTTTTTGTAM 
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ACAGAATATTT AAT CCTT ATTT ATT AAT CT CTTGCTGGAGTGGTGTAATGTATCT AACTTTT AG 
CAAAGGAGGGTTGCAGAGCAGCTTAAA 1 1 i 1 1 1 1 I ATAATGTATAAGAATTTTGTTTATCTTTT 
AAGAGTAGTAAAGTACTTTGAGTGTTTGGGGGTTCAACACACACATGGAAT^ AACAA 
AAGTATTTTATAATACAGTTTCATACAGAATTACCTTAAAAGGGAGTCT^ 

AGATAGTTGTAAGGGATCATACAGAAGATATTGATGATAGTTGAAATATTCTTAGAAGGGGTG 

TGTATGTCT AGCTGTGTCTACCATGTGTATGTATTCTTGACAAGCAGTATAAAATACCTGTGA 

rTTTTCTI lACATTAGGGATAATGCATAAGGAATTAATCrrCATATATATTATCATCCCTAATGT 

AGCAGGGGGAAGTATTTAATTGCXX^TGATATGTATTTTACTTATACTATGC^AGAGAGG 

CTATAAAGTAATTAGACATGTAATCTTGGGTTTTTCACATATGTAGGTATTCATTTTGA 

TTGAAGAAGAAAAAAAATATTTAAATGAATTGAATTCXrrGATGGGATAGTATCAATAAGTATTT 

AAAAGCCAGTATTCTAAAAATAATAAAGGGTAGGGTCATTTTTGAGTTTGI nTTCTTI I GCTA 

TTGTTAATArrCAAAATTAAAGTGTTACATTGGTACCTGTTGTCrTAATGCATTTATTG 

AGCATTGAGATGATGAACAAGGGGTTAGCAATAGCAAACTCTATAATTATTTTGA 

TT AAGAGGAAAACAGTAT AAG T ATCTCATT CAGT ATTT AGCAATTCT GT AAAAT AAGTATT ATC 

TCTATTTTTC^GATGAGGAAGTAAGGGTTTAGCAAGGTTAAGAGATCTATCCA^ 

CAAGTTAGTAGTTGAGCCTGACCATGAGTCTT ' 

>671

>672 

NN ACACATTGTAATATTATATC^TGTATAGTTGTAGGCAGCTCTGTGCATAACTGTGG 
TAACTTTGTG^GTCGCTCCTCTGGGTCCTGAAACAATGCAGTTCTCCCC^ 
CCGGCTGATGGTCATGGAGATCCGC^TGCTTATGCTGTGTTATATGACATCAT^^ 

TCTCCCATTCTGT GATGAGT ACTCTTCTGCACTG 1 IUI I 1C I i I CTAATAAAAC I rrCTTTTTC 
GAACCTATACTGTCTTCTGTAAATTCTTCTT ACT ACCCT ATGACCCGTGAGCCAACCACTTTC 
CGATGCCAGGGTTCTGACACCTC^^rTGGCATAATATAAAGTG I II I lATACCCTTCC 

acttggaaagactacagaggaatcttgcactgcatagttgaaactaaaaag 
tacctgaaaagcaagagaaaacaagaag gggtaaattttgaaccaagggaaatcatttaaga 
agtgtctggtatttttcaaarttctgtcagttgttacatttgtc^ 

AAAGGATGGAGACATGCTTATTTTATTT aactcccccaaaaaaaa acct 

>673
>674

CCACCTAATAAATAAATCCTTGCATGACAAAGCTGCAAAATATTTT^ 
GGAAACTGATTTTAAGCAATTGCTTCCTCAGTGTCAGGGCACATGTGAATTT(^^ 
CAGAGCATG^AGGAACCAGTTGACATGCTGGGTTCTGACTGGCAGCTTTAGCAGCCT 
CTGAAGCCACACCAGTGTCOGGATGGAAGTCTGCA^ 

TTCATTTACACATTTTAA CTTGCATTA AAGAGCTG 1 1 CI 1 1 I C I GTGGCCT AGACTCTTTTCAC 
TGATC TCAAA AT AAACTG G I 1 1 1 1 1 ICAAAAAAAAAAAACAAAAACAAAAAAAAACACAAAAGC 
TGCATGTCTAAAATTACATGGAGTTAGTGTCTATTC I 1 1 1 I CCCC TTTTGCAGCAACTTACACA 
GCATTTTT AACA CC I HUM I CTA GI 1 f I f I IUI T CG GI 1 1 1 GTTTTCCATCAGGAATTTGAGT 
TCTCTCTAACCCAGCrrACTCTGGGACATAGGAAAACT 

GTTGAGTTTAAGTCTGATCTTGATCTTAAACTT^AGT AAGCCACT ATCTGCAATTTTGT ACATT A 
TAT AGT ATTTTGAAGATATGGAAOCTTATGAAAAAAAAAT AGCAAATT AGT T L' 1 1 1 1 IC CCCCA 
GAGGGG AAAG TTATGTTCTGCAAATAGTCTCTCTCTTATTTTACTGTTGAACAG 
TTTATTTTTITATTGCCTAGAACTTCAAC^^ 

ATGCCGAATTCTCATCTGGATGTTACCATCAAACATCAGTACACTTG 

TAATGTGACAGTTTTTCAGTACTCTATGTGTTAATTTCTAC 1 1 1 1 1 1 1 AATATTTAAAATTGCTT 

TTAAATAAACATATTCTCAGTTGATCCCAAAAAAAAAAAAAAAACCCAC^ 

GTGGCGGACNN 

>676

NNNMCT ATCCCAGGAGCCCCXSGGCAGGAGC^ 
GCCT ttCTCCCT T^^ 

AACCACATCCTG^^ 
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AATTCATAAAAGATGATTTCrTATTTTTTCTAATTTATGGAGAGGTATCTT 
ATAT^TTCQGCAAATAACTG^ 

CX»AJTTTGGGTACAAGGCAGCATACGAGGTCGCA(X^AATGTGT 

TATTTTCTAGAGTGGATTTTTCACTTAATTTTAAAATCACT 

ATTTATTTTAAAAGATTATTATTATTTTAGATGAN 

>677 

NTTTTTTTTTTTTI rO CI I I H M 1 1 H 1 1 1 1 1 M I I I 1 1 1 1 1 1 1 AAACT ATTT AATTCACT 
CCTTTATTCTGGGATGTATATTAC^GATAACACAACTCACAAATATACCATCAGACATTGAAAA 
CT AAGGCGATTCTGTGAGTT ATT1TTAAAACTTGGTGTTTTG CACATAATG ATCTTAAAAAAAA 
ATGAATTACCAAAAC^>AGATTCTCTTCTAAAATGAAAATTTAATGCAGGTACAG^ 
AGGGCTATATCTAATCTGAAGCTTATT^aGGTAGCAAAACCATTTTCGTTTTCT 
TAACAGCTCT aaggcaaccact ACCT CAGCATGAAGCTCATTTCTCCACG TTAGAGT agtgt 
TTACCTGCTACAGTGACCAGTGTTTAGAGACCATTTCCCT^ 

CCTAAGAAACACACTACATACTAC^AATCCTTGGAACAAGAAACAGAAAGGGAGCTGTAAC 
TAAGGCACT G AAAGCACATTATTTGTATAAAGAAATGTAAAC^ATTTAACACGAA(>VGGCTCC 
CTCCGTTGGAGTTJT1TATTGTAAATGCACACAGGGGCATCATAGTCAGGGCT AGAGTTTGAA 
GTCTGGGTGAAACAAATGTTTCTGCAGAGTTCAGAGTCTTGGCTGCCAATATATITCT AACAT 
GGTGCT GTTGAATT AACT ATTTCAAAGTGACTCAAGTGGAAGCTGGT ATTTCCCTTGAATCTG 
AGTTTT AATAGGATT A ACAAA CAAGGAAGAATTTT AT AGGAGTCCTAAAATTATGGGCT C 
TAACATTTATTTGAGTCTTTTATCAAGTTCTTCTAATGAA7TTTAG 

ATCAAGGAAGTTTGGTGTAAGAACTTGTT ACCTAATGAGTGCCCCTCCTTGTAAGTTCATATT 

TCAAGTTACATGGTTCATGCCATAAATAAACCACACTTGCTTT AACATCGTTCAGAATGGTAC 

AT1T AACAATTGGACATACAGCACTGTAAACAATTCAAAGTCCATTGAGCTT ACCATGAAAGA 

ACTGAAACGTATGTAGGAACTGTGTCCTGCTGC^GACAAGCX^GCTGCTCTGGCT 

GGTGGGGTGCAGTGAGGTGAGGTGGGCTGGTGTGTGAAG GGAGAATTCCTGAGGCCAGTC 

ACAATTATTGAAGCTAAATTAATTTCTTCCTGACACTTT AGCATTAATTCTAAGTAGTTCT AAG 

TTCTAAACCATTAACTGTATAGTTAANNNNNNNN 

>678

>679 

NAGTTGTGGGTCTTCXnTTGCAAOG ITTCTT C TI CGCTGTGCCT CTGGTCGCCATCA 
CAAACTGCATTAGGAAATGAGAGCGAGCATGTGGCCTGGCTTCTTGATGGAGATCCT CCTC 
GCTCI 1 1 1 CAT AAAGGGTCCT AGATGAACACCGATTCCCTCCT GTGGTGATTGGCT ATGGCA 
GCCAGTGTCCCGTGTOCTCTCCATATGAGGAGTTiAGGGCCTGCAGATCCCTGC 
GCCT GGTTCT GGGAGAGAGAT ACTGAACTCAGCTTGGCCCT GGTCCCA 1 1 I CI I C CAGTACC 
CGCTGGTGGTCTCTGCCCTCACOW5AGCT 

GT AGGATTTTCTCCCAGATGGGGATGACCAGCTGCATCACTCCCCaGAGTC^ 
AATAAAAGCATAGGCATGGCGGCT AAGGGAAGAAGGCATAAGAAGGCAGATTGAGGATTGT 
TAAGAAGACTCCGCCCTTGCCCCCAGCTCTGCCT ACCATAGTATTGCT CCCTTTTGGTCACT 
GATGCACATGACTAAGAAGAGTTCTCTCCTGAGT AACTAGATGACTTTGGAGATCTTTGTCAT 
TCTCTTCACTGCTT ATCTTTCCAGAACTCATGATT ACCCACACATGGCTTTTT ACCAGGCAGT a 
AGAAATGCAGOCOGCCTAGGTAGT AGCATGGTOOCC I CC 1 1 G GCCCACTCCACTCTTGCOC 
TCAACC I Clll CAACCAAACATTATCACCCACCTTAAGAAATTAAAAAATAATTCTGTCOCAGT 
CTGTCT GAACTT AGT ACATGATCCAGGATGTGATGGGATCTTAGGGCTTGGCTGGAAGGTTT 
CTCCAGTCAGCGATCTAGCAGAGCTGCAGATCTGGGCTGGGCTGTTGGCT AAAGTGCT CTT 
CACAGACACCTCATTOGGCTCI I CO ICAGCI ICI I CACTTATTTCTr ACTCAGTCACTACTC 
AGCTCCTTGTCCATGTGTCCTTGAAGCCATCCTAGGTCTT ATTCTGATTCTGAATTCTTCAGT 
CACCCATAAGCI tGICCI I ACGGGGAGTGAGTGGGTGTGTGTTCCCAGGT GGACTT AACCAT 
TCTTCTCCTTTATGATCCTTTGCCTTGGGT GGACAAGT GT GATTTGGTTGTAAGGCCATTTTT 
GAAGTTGCCTATACATTGATAAAAGAAATCCCACTAACGGAAGT AGACTGCATGCCAGATTTC 
AGTGTCI 1 1 C I CCAGGGGGGAAGGTTGGACCCAGAAGTGCATGGGGTGC » rGGTGTCGT I C 
ACTGA TGTAG TAAGGATACACGCTTCCAGACACGGCACAAGAAATGC C 1 1 CC 1 1 G GCAGTTC 
AGCAOCTTTTCT ACAAATCTTTATTCTCTGTGAGCATGACACCCT A COb 1 1 1 1 I I I CA ACOGG 
TTGTCGTTAAATCCTAAAAAATAAGTTGTTGTACC I I rCTGTl C I acagacgcacagatgcag 

ggcagtcagcggtggcagtgtttgaaccatcaattgxsgaggggaaggt ggtttcct gtgca 
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ACAC^TTGGTTTTCCCCCTCA^*.CCATTCCCGGAGTAATAAATCCACTTCGAGCAGCAGTGA 

TGAGAAGAGGGGATOGACACGTTCCGATCACAACACCAGTACCTGCCCGGGCGGCCGCCA 

CCCN 

>680

NCAGCTTAGTGATGACTTTCCAAGCTTCGTTG AGAATCT G AAGT CG GTC ACTCTCAG 
GAGGATAAGGCAAGGCCAAGTTTAATCCCAGTGATCGAAAAAGAAGATGCTTGGGGAAACC 
AGATTCATCAC^CTCTTTAATCATGCCAATGAAATCCATAGACCTTGTGGCGATGAACTCAGC 



ccatcatctcggtcagcagagcctcgggggcatggtaggagatgcactggaagatccagtc 

catggcaggcgggtacaaggggaggtaagatgggagctccactccttggaccaccagctg 

gttctggaccgtatccccatgaatctgtttgaacgtaaggaggaagtcaaaaaagttcttatt 

tagggtttctttgagatgtggggccacttccattcccacccggcacaggtaggcacgggca 

tacaccgacac tagtg ggtctccgatccctctgatcatgcatgtcaaccggggcaggcact 

ctgaaattcccgttrtggagaggaatttgttacatttcaggatggatgcctccacgtaaaatc 

ttggaatgagttccctgatggaggcaatcttgaaaaaccaatttaggcatgtttccttggcc 

ttgccatttgcattctctggagaaaagtgatctggtaagacgctgcggctatccacacacat 

ggaaaagatgcgctggtacacgagctttccaaatgtatcaagtatgtcggtgataaggacaa 

atttccggacgcgtgggtcgacacn 

>681 

HACCGTACAGTGCCTGCCTTTTCCGTTGTACCTGAGCATGCTCAGGGACGGTGAGC 
CTTGAGAGCTGAGGCTTOSGCAGAAAACAGCCCTCCTGGGTAAATTCTCATTACTTGCCTAC 
TCAAAAGGATTGAGAGAGTTTCCTGTTGTCTTGCCTGCACTTGGCCCCAGTTAAGAGGAGGT 
TGAGATCTATAGTACCAGCAACATAACAGGTAAAGCTCTGTGGCTGGTTTGTATGGGCCAGT 
TAGCCCTGCCAX3CTGTTTCTGCCTTCTCATGTCTGTCCTGGATCCT AGCTCGGTCTCTCTGA 
GTGGCCTCAOCTGATCGAGAAAGCAGCTTCGTCATTCTTT AGGATCAAACCT GGTGtCTCCC 
AGTGTGGGGAGAGOCTITrGTCTTTGGAGTTGGGCCTAACCTGTCCCTCGCTGTCCTTTCTC 
CTCTAGGTTGTGCGCTTGOGACTTGGACACCATGCTG AGGG CACTGTGCCCCTTrCCTGGG 
TGTGGACAAGAGGT CATAG GGAAAGGGGAGGGCTCTAGCGGGAAArrCCTTTGTAAACCAG 
TCTGTGTTOCTGTCATTTTAGATCGAAGACCAGGGGGACAAGCTGGAGATGGCGAGAGAGA 
AACATCAGGCTTCCCAGAAGGAAAATAAACAGCTGAGTCTGAAGGTGGATGAACTGGAGAG 
GAAACTGGAGGCGACCAGTGCCCAGAATATCGAGTTCCTAGAGGTGATTGCCAAGAGGGAG 
GAGGGAATCCACCAGTCCCAGCTGCGGCTGGAGGAGAAAACACGGGAATGTGGGACCCTG 
GCAAGGCAGTTGGAGAGTGCCATTGAAGATGCGAGGAGGCAGGTGGAACAAACCAAGGAG 
CACGCACTCTCCAAGGAGCGAGCAGCCCAGAACAAAATCCT GGACCTTGAGACCCAGCT GA 
GCAGAAGCAAAACGGAATTGAGCGAGCTGCGGCGGAGCX^GTGATGATGCGGACCGCCGCT 
ACCAGAGCCGGCTGCAAGACCTGAAA GATCG CCTGGAG CAGTCCGAGAG CAGCAACCGC A 
GCATGGAGAACTACGTCCAGTTCCTCAAATCATCATACGCCAACGTGTTTGGGGATGGTCCC 
TATTCCACCTTGCTGACTAGCTCTCCCATCCGCTCCCGATCTCCTCCTGCCTGAGGCCACT^ 
ATCAGGGCCTGGAGCCCTGATGGAAG CCATAGGAACTCCAGAGTTGCCAAGCCATAGCTGA 
GAAGCCTGGTXK3TTTTCCTCTCCCAGTGAAAAAATGGGTTCAGGGTCTTGTCCTTAGCTACT 
AGCTCTAGAAAAGTCCCAAAAGCAGGAGAAGGTGAAGCAGGAAGCACTTGGTTTTCTCCTTC 
CTGATATAGTCAC CTGT TGGAAGTGTTAAAATTTCCTCGACAGGCCTT AAATTTACTACTACA 
TTAGGGT ACCACATTTTACTTACATGATTTGTTCCTTCTTT CTG AG AAAAT ATTTCATGGATTT 
CCACAGTCCTTACGCTCATGTTAGGATTCTCC^GTTTTCTGCTAACAGGGAGTCTCTGGGT 
GAGCCCACTGI 1 1 1 1 1 CGTCAAGCGNN 
>682 

CTATAGGGAGTCACCCACGCGTCCGGGCGGAGCTCCTTGTTAACGACATCCCCAAC 
CTCTGCTAACGGGAGCACTTCCTGAAGCT GAAGGTCAGCATAGACAAAGGGCCTCAGAATC 
GCGGAGGCGGAATTGTGCCCTGGTTCGCGAAGATGTCG7TCCCAAAGTATAAGCCGTCGAG 
CCTGCGCACTCTGCCTGAGACCCTCGACCCAGCCGAATACAACATATCTCCGGAAACCCGG 
CGGG CG CAAGCCGAGCGGTTGGCCATAAG AG CCCAGCTGAAACGAGAG T ACCT GCTTCAG 
T ACAACXiATCCCAACCGCCGAGGGCT CATCGAAAATCCTGCCTTGCTTCGTTGGGCCT ATG 
CAAGAACAATAAATGTCTATCCTAATTTCAGACCCACTCCTAAAAACTCACTCATGGGAGCTC 
TGTGTGGATTTGGGCCCCTGATCTTCATTTATTATATTATCAAAACTGAGAG GGATAGGAAAG 
AAAAACTT ATCCAGGAAGGAAAATTGGATCGAACATTTCACCTCTCATATTAAGTCTGGCAAT 
GATGACT ATATGTATTCCTGCCTAAATAAATCATCTATTAATCATTAAAAAAAAAAAAACAAAA 
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AAAAAAAaaaaaaGAAAACAACAACAAAAAACAACACAAAAACAAACACCACAAACAAAAAAA 

aaaaaagg gggggggg cggcaaaaaaatactcttccccggggggggcacccatttcttcc 
cccccci rctttttttl i aaacgaggggggccccccaacagggggggtcataataaaaaaaa 

gtta^gcag^ 

A gCCGCCCGG^CAAAA^ 

cacactaacgtcaagccgtacggccagaaccactgtcagagacacaccacnn aatccag 



acgcgggggcggcttcttagctttacgatggcaacaagtatggcggctgctagtg 
gtagatttgaaagtgcgaagagtatcgaagagcggaaagaacagacccggaatgccagggc 
cgaggtgttgcgccaggctaaagcgaattttgaa^aagaagaaaggcgtaaagaacttaagc 
gacttcggggtgaggatacatggatgctacctgatgtgaatgagagaattgaacagttctca 

CAGGAACACTCTGTGAAGAAAAAGAAGAAAAAAGACAAGCATTCAAAAAAAGCAAAGAAAGA 

AAAGAAAAAAAAGAGCAAGAAACAGAAATATGAAAAAAACAATGAGTCATCTGATAGCTCATC 

AAGCTCTG AAGATGAGTG GGTTGAGGCTGTTCCATCCCAGACTCCTGACAAGGAAAAAGCC 

TGGAAAGTGAA AGATG AAAAGTCAGGAAAAGATGACAGCGAAATTATCAAGAGGGATGAGTG 

GATGACTGTTGATTTTATGTCTGTTAAAACTGTGTCATCATCATCACTCAAAGCTGAAAAGGA 

AACT ATGAGGAAAATAGAGCAAGAGAAAAACCAAGCGCTTGAACAGTCCAAACTGATGGAAA 

GAGAATTGAATCCGTACTGGAAGGATGGTGGGACAGGTCTTCCACCTGAAGACTGTAGTGT 

GTCATCGATTACTAAAGTTTCAGTGGTAGAAGATGGTGGATTAAGCTGGGTAAGGAAATCTT 

ATCT AAG AATG AAGGAAC AAGCTGAGAAACAAAGTAGAAACTTTGAG GACATTGT AGCCGAA 

AGATATGGGTCAATG«AAATATTTCAGTCAAAATTAGAAGATGKTGAAAAAGCTGCATCCAC 

GAAAGAAGATTATAGAGGGGAACGGTGGAGGAAACCCACATATTCAGATAAAGCACAAAATT 

GTC^GAAAGTAGAGAATCAGACTTAGTAAAATATGGTAACAGTTCAAGGGATAGATATGCTA 

CAACAGATACTGCAAAAAATAGCAATAATGAAAAATTTATTGGTGATGAAAAAGATAAGAGAC 

CTGGGTCTTT AGAAACGTGTAGAAG AG AATCTAACCCAAGG C AAAATCAAGAGTTTTCTTTTG 

GCAATTTGAGAGCT AAATTCTTGAGACCCTCTGATGATGAAGAACTGTCATTTCACAGGAAG 

ggcagaaaatttgaaccacttagttcatcttcaggattggtagctcagggctctttgtgtagt 

GGT7TTAGAAAA 
>688 

NNNNNNNNNNNNNNNNNA AATTT AATTTCCTCAGCAAGGCCATTTTTTAACTTTCTG 
CAGAAAGGGTACACTCGCGAGCGGTTTTGCCACAAGAGTATACGGAACAAAGGAGACAGGC 
TCATTTATAATCTGACGCGGCCACCCTCCTGCTGCGTTCGGTTTCCATTG GCTGG GACGGG 
ACCTCACCTTCTGT ATTTGTCCCGACTGGCT AGCACTTAGAACTTTTTAAAAG AGGCAAAGG 
CATAGGAGAACAAAGGAAGGAGGAAGTAACTTGTGGAATATTGAGAAAGGTAAAAACACCTT 
TAAATAAG GAAGAGGAACAGGCTATGACCT AATGCTT GTTTGGATCAGTATAAGCATGTTAG 
GGCAAATATTTAGGCTAAATTGTGGGAGCTAAGAACATAAAGTATATTGATTTTTTATTATGG 
CTAGCAGATATTTAAGAATGTTAGCACAGGTCTTTGAATAAATTTTGCTTCT AAGAGAAGTTA 
CT ATTGGGCCCT AATTATATGGGGAGTAAAGTCTTTGAAGAGGAACCTCT ATTTACTTTTTAG 
AATTOCCCCTTTTT ATTTTAT AAT TCCTC TTCAAA C I i 01 1 I AAT ATGTTTTGACTT AATTGCTTT 
GTTTGCXXrrrCTAAGAGAAGTAATTTTTTCG 
TTTAAGAATGGTAGAGACAAGT 
>689 

nknnngtgcagtagacgtgtgtatagttttatgaccaggaaaaagtagttgtagct ct 
gtgttgatttttctt atctcctgccctctxn^ctccaaaagaaaagacat aaatt aaactag ga 
actgagagggataagttagacctagcactgccaccagcccagtgctctctcagagaatcac 
a7tgccttgtgtcattctaggtggccccaggagatggcttctctgatcttgccatgcaagta 
atgacgtutccatctaagaactacattctctctgtgatcagtgggagcatctgtatattgttc 
ctg attggcct gatgtgtggacggatcaccaagcgagtgacacg agag ctcaag g acag gt 

AGAGCCACCTTGACCACCAAAGGAACTACCTATCCAGTGCCCAGTTTGTACAG ccctcttgt 
ATAG CATCCCCACTCACCTCGCTCTTCTCAGAAGTGACACCAACCCCGTGTTAGAG CATTAG 
CAGATGTCCACTGCGTTGTCCCATC^GCCTCCACTCGTGTCCATGGTGTCCTCCTCCTCCT

CACCGTGCAGCAGCAGCAGCTGGTCGCTGGGGTTACTGCCTTTGTTTGGCAAACTTGGGTT 

TACCTGCCT GTAG ACAAGTCTCTCTCAT ACCAACAG AACTTCCG GT ACTTCC AGAACCAACT 

CACCTGACCN 

>690

NNGCGGCCGCAGGAATCTTGCTTGTCTTATAI I I I I I I 1 1 I I M I CACACAG AAACCA 
ACCACAI 1 1 1 1 ACTGCATCTGCTCCACGCTGGATTCCAACATGCTGGCCCGGAGCGTGGCT 
GGCTGGAAGCAACTCCAACAGGTTTTTCCCTTCCCCGTCATGTAC^TTATTTATTTTTGATrc 
TACTCACTGTCCCAAGTCCAGAGGCAGTTACAAAAAACACTCTTGATGCAAACCGTGAGTGG 
CTACAACACACGGATGGGGGTGGGCGCGATTCCCACAACAGGGAGTGGAATCCGGGAAGA 
TGATATAT AGGG GC^GACGCCCCCTTACTTGCTAAGAGTATATGGAGCTCAAAACCCACAA 
TTGCTTTGTTTTGTTTCTCAGTTC^rrGGAGTATGTTGAAACTACTTGCTCTTAACATTAGTTCG 
TA» i 1 1 iCATCAGATATCTGACCTGATTTAAAACATGTTTGTTTGCATACATCTTTTTGTAGTG 
CACCTTATAGTTATTCCTACTGATATAGGATGAAGATTATAAATATCTGCATAAAAAGAGAAAC 
CATGTGACTCTATATGAAGACAACCATCACATTCCACAGCACATC^TTGGTTCATTTTCGAAT 

tgtcaag^>gtatttgaaatgagagagagagagaatataactgaacacaagcaccacgacaa 
aacaatcataacaacagaaccaaacccaaacttctctattattaatcatgttaaaattttagct 
ttgtttct aacacttttcttt acttgttattttaagctccagtagcaggatcagatttctgctg 

CCTCT AGGCAAATGAGTTATG ATCTG ATCTCG ACTTCC AAGG G AAAATG CT CAAAG TTTT ATT 

ttcc<x>gttgaataaacagtaccatgtatattatctctcgtgttagaatagtgttgtcttcac 
ataagactcaaataatggtattagtcattcatttc<xtgaacacagacaccctcatgcgtgct 
gacaggtttataaggatgc»gtggcagccgc<mgttctgggagctgctagacggccgagt 

TTGATTTCITGCAGTCC TGAGC GATG GAGCCCGGGGGTGCCTGGTrATTGTCCGCTTTCTCT 

ctc*gatgcttggcttgt t tttcaagagaac^tttttcgatattcattgctccatcgattgga 
tccagtgcttgttcag aaaat t gtttcaaggcacttaaggctgcctgaaagccttgaatcctt 

GCTAAATATTCCAGTTGTTTTGAAGGTTGTACCCTGGCCGGCXXSCCCGGGCAGGTACAT^ 
GCTTCTTAGAAAAGCTAAGTCCTGGGTTCCCT CTGATTTTAGGTTCCAGG AACTTTCTTGGAA 
CACCCGATCGCAGANNNNNN 
>691 

GCCGC^CCTCCGGGAGCCGGGGCGCACCCAGCCCGCAGCXSCCGCCTCCCCGCCCGC^ 

GCCTCCGACCGCAGGC<X^GGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCG 

GCCGOGGAGOCGGGCAACGCTGGGGACTGCGCCTrTTGTCCCCGGAGGTCCCTGGAAGTT 

T GCGGGAGGACGCGCGCGGGGAGGCGGOGGAGGCAGCCCCGACGTC<»CGGAGAACAGG 

GCGCAGAGCCGGCATGGGCATCGGGCX3CAGCGAGGGGGGCCGCCGCGGGGCAGCCCTG 

GGCGTGCTGCTGGC^SCTGGGCGCGGCGCTTCTGGCCGTGGG^JTCGGCCAGCGAGTACGA 

CT ACGTGAGCTTCCAGTCGGACAT(^GCCCGTACCAGAGCGGGCGCTTCTAGACC^ 

GCTCAGTGCGTGGAGATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGCTACAAGAAG 

ATGGTGCTGCCCAACCT GCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGGCCAGC 

AGCTGGGTGCCCCTGCT CAACAAGAACTGCCACGCCGGCACCCAGG7 CTTCCT CTGCT CGC 

TCTTCGCGCCCGTCTGCCTGGACraGCCCATCT 

TGCGCGACTCGTGCGAGCCG^TCATGCAGTTCTTCGGCTTCT 

GTGTGACAA^TTCOCOGA^GGGGACGTCTGCATCGCCATGACGCCGCOCAATGCCAOOGA 



GCCATCATTGAAC^TCTCTGTGCCAGCGAGTTTGCACTGAGGATGAAAATAAAA^ 

AAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGAAGAAGAAGCCCCTGAAGTTGGGGCCC 

ATCAAGAAGAAGGACCTGAAGAAGCTTGTGCTGTACCTGAAGAATGGGGCTGACT GTCCCT 

GCCACCAG CTGGACAACCT CAGCCACCACTTCCT CATCATGGG CCGCAAGGTGAAGAGCCA 

GTACTTGCTGACGGCCATCCACAAG TGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGA 

AGAAAATGAAAAACCATGAGTGCCCCACCTTTCAGTGCGTGTTTAAGTGATTCT 

CAGGGTGGGGAGGGAGCCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCCGGGAAC 

^GGTGGGTCACACACACGCACTGCtjCCTGTGAGTAGTGGACATTGT AATCCAG TCGGCTT 

GTTCTTGCAGCATTCCCGCTCXXrrTCCCTCCATAGCCACGCT 

GGCCGGGTAAAGC^^ 

GCAGCCACTGCACAGGAGGAGGTGACAAACCATTTCCAACAGCAAC^ 
GAAAAAGGGGGATTGGGCGGAAAGTGAGAGCCAGCAGGAAAAACTACATTTTGGAACT 
TGGTGTGG ATCT ATTGGCTGATCTATGCCrrTTCAACT AG AAAATTCT AATG ATTGGCAAGTCA

CGTTGTTrTCAGGTCCAGAGTAGTTrCTTTCTGTCTGCTTTAAATGGAAACAGACT 

CACTTACAATTAAGGTCAAGCCCAGAAAGTGATAAGTGCAGGGAGGAAAAGTGCAAGTCCAT 

TATGTAATAGTG ACAG CAAAGGGACCAGGGGAGAGGCATTGCCTTCTCTGCCGACAGTCTTT 

CCGTGTGATTGTCTTT GAATCT GAATCAGCCAGTCTCAGATG CCCCAAAGTTTCGGTTCCTAT 

GAGCCCGGGGCATGATCTGATCCCCAAGACATGTGGAGGGGCAGCCTGTGCCTGCCTTTG 

TGTCA GAAA AAGGAAACCACAGTGAGCCTGAGAGAGACGGCGATTTTCGGGCTGAGAAGGC 

AGTAGTTTrCAAAACACATAGTTAAAAAAGAAACAAATGAAAAAAATTTTAGAACAGTCCAGC 

AAATTGCTAGTCAGGGTGAATTGTGAAATTGGGTGAAGAGCTTAGG^TTCTAATCTCATGTTT 

TTTCCI I I I CACATTTTTAAAAG AACAATGACAAACACCCACTT ATTTTTCAAGGTTTTAAAACA 

gtctacattgagcatttgaaaggtgtgctagaacaaggtctcctgatccgtccgaggctgct 
tcgcagaggagcagctctccccaggcatttgccaagggaggcggatttccctggtagtgta 

GCTGTGTGGCTTrCtrrTCCTGAAGAGTCCGTGGTTGCCCTAGAACCTAACACCCCCTAGCAA 
AACTCACAGAGCTTTCCG I 1 1 1 1 1 ICI I FCCTGTAAAGAAACATTTCCTTTGAACTTGATTGCC 
TATGGATCAAAGAAATTCAGAACAGCCTG CXrrGTCCC^CCGCACTTTTT ACATATATTTGTTT 
CATTTCTGCAGATGGAAAGTTGACATGGGTGGGGTGTCCCCATCC^GCGAGAGAGTTTCAA 
AAGCAAAACATCTCTGCAG 1 T M I CCCAAGTACCCTG AGATACTTCCCAAAGCXXTTT ATGTTT 
AATCAGCGATGT ATATAAG CCAGTTCACTTAG ACAACTTT ACCCTT CTTGTCCAATGT ACAGG 
AAGT AGTTCTAAAAAAAATGCAT ATT AATTTCTTCCCCCAAAGCCGGATTCTTAATTCTCTG 
ACACTTTGAGG AGATTTATG ATTGTCCCTCTGGGCCAATGCTT ATACCCAGT GAGGATGCTG 
CAGTGAGGCTGTAAAGTGGCCCCCTGCGGCCCTAGCCTGACCCGGAGGAAAGGATGGTAG 
^^CTGTTAACTCTTGAAGACTCCAGTATGAAAATGAGCATGCCCGC^rTAGTTAC^TACCGG 
AGAGTTATCCTGATAAATTAACCTCTCACAGTTAGTGATCCTGTCCTT 

GGGGTTCT CTCT GACCTTTCATCGTAAAGTGCTGGGGACCTTAAGTGATTTGCCTGTAATTTT 

GGATGATTAAAAAATGTGTATATATATTAGCTAATTAGAAATATTCT ACTTCTCT GTTGTCAAA 

CTGAAATTCAGAGCAAGTTCCTGAGTGCGTGGATCTGGGTCTTAGTTCTGGTTGATTCACTC 

AAGAGTTCAGTGCTCATACGTATCTGCTCATmGACAAAGTGCCTCATGCAACGGGGCCCT 

CTCTCTGCGGCAGAGTCCTTAGTGGAGGGGTTTACCTGGAACATTAGTAGTTACCAC^GAAT 

ACGGAAGAGCAGGTGACTGTGCTGTGCAGCTCTCTAAATGGGAATTCTCAGGTAGGAAGCA 

ACAGCTTCAG AAAGAG CTCAAAATAAATTGG AAATGTGAATCGCAGCTGTGGGTTTTACCAC 

CGTCTGTCTCAGAGTCCGAGGACCTTGAGTGTCATTAGTT ACTTTATTGAAGGTTTTAGAGCC 

ATAGCAGCTTTGTCTCTGTCAGATGAGCAATTTCAGAACCAAAAGGGAGGCTCTCTGTAGGC 

ACAGAGCTGCACTATCACGAGCCTTTGTTTTrCTCCACyVAAGTATCTAAC 

AGACTGATTGGCCTGGTCATTGGTCrrCCGAGAG AGGAGGTTTGCCT GTGATTTCCT AATTAT 

CG CTAGGG CCAAGGTGGGATTTGT AAAGCTTTACAAT AATCATTCTG GAT AGAG TCCT GGGA 

GGTCCrTGGCAGAACTCAGTTAAATCTTTGAAGAATATTTGTAGTTATCTTAGAAGATAG 

GGGAGGTGAGGATTCCAAAAACATTrrATTTTTAAAATATCCT 

TACGTGTXKsGTTAGCATCAAGTTCTCCCCAGGGTAGAATTCAATCAGAGCTCCAGTTTGCAT 
TTGGATGTGTAAATTAGAGTAATC^CATTTCCCAAACCTAAAATCTGTTTT^ 
TGAGTAACTGGTTGCTGTGTCATAACTT CATAGATGCAGGAGGCTCAGGTGATCT GTTTGAG 
GAGAGCACCCT AGGCAGOCTGCAGGGAATAACATACTGGCCGTTCTG ACCT GTTGCCAGCA 
GATAGAGAGGACATGGATGAAATTCCCGTTTCXrrCTAGTTTCTTCCTGTAGTACTC 
AGATCCTAAGTCTCTTACAAAAGCrrTGAATACT 

TGTTGI DIM I AACTGCATT TTACCAGATGTTTTGATGTTATC<jCTTATGTTAATAGTAATTCC 
CGTACGTGTTCATTTTATTTTCATGCTTTTTCAGC^ AAAATC 

>692

NNNNNNNNNNGCCTGAGGTGGGTGGCGGTGG AAGTT A^ ATC

gctcctcgagactcgcagtcgcggxxactgcagtgactt cgccagtt agcccttagggtag 

GAGTC^XX3CGGGCAGCGGGTGTGTGGACGCCGCTTTGTTGCGTGAGATGAA^TTGGAGCC 
CTTGm i t iGA CATTGGATCXrTATACTGTGAGAGCTGGTTATGCTGQTGAGGACrrGCCCCAA 
GGTGGATTTTGCTACAGCTATTGGTATGGTGGTAGAAA^iAGATGACGGAAGCACATTAATGG 
AAAT AGATGGCGATAAAGGCAAACAAGGOGGTCCCACCT ACTACATAGATACT AATGCTCTG 

cgtgttccgagggagaatatggaggccatitcacctctaaaaaatgggatggttgaagactg 
ggatagtttccaagctattttggatcatacct acaaaatgcatgtcaaatcagaagccagtct
ccatcctgttctcatgtcagaggcaccgtggaatactagagcaaagagagagaaactgaca 

GAGTTAATGTTTGAACACTACAACATCCCTGCCT^ 

GCATTrGCTAATGGTCGTTCTACTGGGCTGATTTTGGACAGTGGAGCCACTCATACCACTGC 
A ATTC CAGTCCAOGATGGCTATGTCCTTCAACAAGGCATTGTGAAATCCCC I C 1 1 GCTGGAG 
ACTTTATTACTATGCAGTGGAGAGAACTCTTCCAAGAAATGAATATTGAATTGGTTCCTCCAT 

atatgattgcatcaaaagaagctgttcgtgaaggatctcgagcaaactggaaaagaaaagag 

AAGTTGCCTCAGGTTAGG AGGTCTTG GCACAATT AT ATGTGTAATTGTGTT ATCCAGG ATTTT 

caagcttcggtacttcaagtgtcagattcaacttatgatgaacaagtggctgcacagatgcc 
aactgttcattatgaattcccgaatggctacaattgtgattttggtgcagagcggctaaagat 
tccagaaggattatttgacccttccaatgtgaaggggttatcaggaaacacaatgttaggag 
tcagtcatgtrgtcaccacaagtgtrgggatgtggatttccaagcatttcaacagatgtgga 

agagttoccaagcttctac c i rcci 1 1 I GTCACCTTACXSTTTCATAGCTTTAGTATACTCAGG 
AAAAGAATGACCATCTTTTGTAGAATGTTTATACATTTTTGCATATTTCAAT^ 
TTTTAAAGCTTTAACTGGCTCTATAAATTAAGTTTGTGCTTTCCTTGAAA 
TACAAGCATTTTATAATTTTGTATAAATGTCTATTTTCTCTAAATATTTTGCTTrCAGTAA 
CTTTCCAACTCTGTTT AGT GT ATT AATT ACCAGTG GATTGG T AGAACTGCTTTTT ATTGACT A G 
TAAAAGTTACTGCCTATGCTTrTTACCTTAGGCTrACAGAATTAAATAAAAATrAGCCATTCCA 
GAAATATATTTTGGACTGTTGTGCACTGTGATTACT ACTTTAAGGACTAAATGTATTTCTCATT 
ATTrTGAATGAAAGTCCTCCGTTTATTAACAGCAATACCCACATOCTCTTCATAGCCTATTAAC 
AAC AGAGGT AAAACTATTATTCAAATT CAAAAACT ACGGTATTGCCTTT GCTGTGGCAG TT AC 
CATCACCTTGACACTCTAAGGTAGCAGGTGACATTTAAAGCGTGCTT AAATGTCAGAATTTAT 
AAAGTGGGAATCTCATCTGAACTTTATAGCTGATTTTTAGAAGCAAATTAGCTTCT ACCAAATT 
AGCTAATTAGCATGCCATATTCAGACTTAGAACAACTGATTAGTAAAGTCACTT GACTAAAAA 
cagaai rrcn IATAAACCACTTAACATATTTACTCCTGTACACAGACTATTCAAGAAAAACAA 
AATGXSTA AATTTA ATAGTTC AGACA TCTTAGACAAGACTTGACTTTTGGGCTTCAGCAAGATG 
TGGAAA C I I II I IAAAAGAATTTTTG C I I IC I 1 1 CTCTCT aaattttcctt ccgt gctttgatgc 

gggc rcGTTTtn cacgttccagtutgagaaaatggtccacataaggcaaggcaaagaatcgt 
ttoctattgtatcttttatttaggtgccaaggtataacccactgcttgaacttgtgccagatg 
attcttccaaagatgtctcttctccaagcaccaggtct agctctttcttgaccagtctgaaga 
agcctt agggcatcttctctitcctggagaactttatctaatgcatccatggaatctactacc 
ttatctaaccgctctggacttggcattggcaatctctgccgcitggcctcctgct ct agggt 
tagaagcatg nrcTTTCm cagtaagacataccaaagtttgtgt aaatcttcattacttttg 
ttcctt agtt gctgacaggtccatgctgctccagattttac 1 1 I 1 1 Ul I goccccagtttttt 
gggtcatcaaaaaattcttct agtccittccttg acaatgtggtatgaagt aatct atattgg 
tgaaaggatgtcacatttggtgtactcttaggcaacaaactaagagaagacctgcaacccga 
gtgagcagcgtccttgggatttactagggaagaagctgccttcag^saacg 
ctacaaatacocactagactggtcgcagccatgtttctaatggagaocgtcagtttcaccta 
ggccagcaan 



cccccgcggtggcggccgaggtacgcggggaggcattcaggcagcgagagcaga 
gcagc gtagag cagcacagctgagctcgtgaggcaogagactcagcccgagoaaatcgca 
gat aagttttt aatt aaaaagattgagcagtaaaaagaattagaact ct aaacttaagct aat a 
gagt agcttatcgaaatattacttagtctt aataatctaagaagatcttaagagataacatgaa 
ggctt atttaaacagtttgaaaaaggaaatgaggagaaaagtatttgtact gtataatggagg 
ctgaccagagcagtttaggagattgtaaagggaggttttgt 

ttgaaggtcgvgccttgtagattaaaacgaaggttacct aaatagaatct aagt ggcatttaaa 
acagtaaagttgtagagaatagtttgaaaatgaggtgtagtttt aaaagattgagaaaagtag 

GTTAAGTTGACGGCCGCCACCGCGGTGGAN . 

. TTGGAGCTTJACCGCGGTGGCGGCCGAGGTACGCGGGGAGCGATGGCGCGATTTCA 
CAATCCTGCAGAACGGCCATACAAATTGCCAGACCTGTGCACAACGCTGGACA^ 
CAGGACATTACAATAGCCTGTGTCTATTGCAGACGACCACTACAGCAAACCGAGGTGCATGT 
CATAAAATAGCAGGAAGCTATACAGGACAGTGTCGACGGTGCTGGACCACAAAACGGGAGG

ACCGCAGACTAACACGAAGAGAAACCCAAGTATAACATCAGATATGCGTGGACCAAAGCCC 

ACCTTGCAGGAAATTGTATT AGATTT ATGTCCTTACAATGAAAT ACAGCCGGTTTjACCT A 

TGTCACGAGCAATTAGGAGAGTCAGAGGATGAAATAGATGAACCCGACX^TGCAGTTAATCA 

CCAACATCAACTACTAGCCAGACGGGATGAACCACAGC^CACAC^TACAGTGTTCCTGTT 

GTAAGTGTAACAACACACTGCAGCTGGTAGTAGAAGCCTCACGGGATACTCTGCGACAACTA 

CAGCAGCTGTTT ATGGACTCACTAGCCGGGCAG GT ACGCGGGTGTGCAACTGCAAACCAGT 

AACCTGCTATGGCCAATTGTGAAGAGATGGGAGTCTCCCCGTATTGCCCAGGCCGGTCTCA 

AACTCCTGGGCTCAAGC^TCTTCCCGCCCCACTTCCCGAAGCCCTAGGATTA 

A GCCAC CGCACCCAGCCAG AAAAAGGTTTCAAAT ATTGGAAAACCTT ACTTTTTTCAATGAGC 

AHII 1 GCATCAAGGGGT AACAGGGACATTAGG L. I I I H 1 1 CTCTT AGACTCCAAACAGT AAG 

ACATTTTCCAGCACTAATTAACAGGTTTTATGATTCACTAGGTrGGC<XAA 

CCTAATTCCCAGGCACAGCGTGTCAGGAGGCCAAATGACACTTTCCAGTGCAAGTGCTTGTA 

GTATGAAGGGGGCAGAGATCACCTAGTGACCATCAAGCAGGCCATCCAGAGGCAAAACTCC 

TTATGTGAGGAATTTAGAAGTAATTAGACTCCCCTGTTATTTAAAGCTGGCATCTGGGTCCAG 

GCTTCTT CCCCAAAAACTTGT AAGTAACTAGAATTTCTACATGTCT CCCATGCATGCATGTCA 

AAACTTATCGTGCAACCCTTGCTGACATTAAGACACCAAAATGTCCAAAAATGTAATCATTTA 

TCATGACCTAAGTGGTTAATATGGCCCAAATTCCCTTTAAGCTCCTGCTTT AAGGTTCATAAA 

TATOCCTAAGGAAAATCCACTCAGTCCTCTCTTGCTGAGGCACCCTGCTGT ACCTG CCCGGG 

CGGCCGCTCCACCGCGGTGAGCNNNNN 

>697

GGCGGCCGCCCGGCGCGACCGG GTGAGACAGCG TCAG GCGCTTGATTTCCCTGA 
GTGCCGGTG CCTCA GCTGCCCAGTGCCCACGTTCGTAAGAAGGCAACAAGTTCTTCTGCTC 
TACAGAAGGATTTTGCAAACAATTCGGCAAGTTCCAAATGATTCTGATCGCAAATACCTGAAA 
GATTGGGCAAGAGAAGAATTCAGAAGAAAGAAAAGTGCCACCGAAGAGGATACAATOCGGA 
TGATGATTACTCAAGGCAATATGCAGCTCAAGGAGTTAGAAAAAACACTTGCTTTAGCAAAAT 

cttaactatagcattattctgaaggattttcaaagtctccatgtgttttgttgcatttaggat^ 
aacaatggacaacacaaagcccagtcttactatttatatgtacagtatttggtgatagagagc 
aaaggaaaaacacatcagaacagttg ccttaacactgaaaaac at accctgcattttgaaaat 
gtttatggatgtctcagcagttcttaaaagaagaaaaaaaccactaacaaatggccagtagat 
gctggacacaatcattaggggaagtgtaaatcaaaact act atgaggtcgggtgtagtggct 
catgcttgttattccagtactttaggagggtgtggtaggggatcacttgaggccagg^ 

AAGACCAGCCTGGCCACATGGTGAAACTTATCTCTACT AAAANNN 
>698

NNNNNNNNNNNNNNNNNNGCAGGTTTTTGTCCACAT^ 
CATTCTTAGCTGTCA GTCTGA TATTGTACTAATCATATTCATTATAGAC 

AAAAATTT ATTAGT AAT 1 T CT 1 GAGCATAGCACTTTGGT AGGGCAAAAGCATGGTATCAGATG 
TACTCCAGGGCAGATGTCTTATTTGCTAGCCTrCCTGGTCAGTGTGTCTrCA ^ 1 1 ICI CI I G G 
TTTGT AG GCAG CATT ACAGAAACAACAG GAAGTAGTAGGGG CTGG GTCTTCCTT GCCT ACAT 
CATCAAAAGTGAATGCAACTGTACCAAGTAATATGATGTCA6TITAATGGACAGGCCAAAACA 

cacactgacagctc cgaa aaagaactggaaccagaagct gcagaagaagccctggagaatg 
gaccaaaaggtaggttttgcatatgtgtttgtccagatgtcataattccaaatc i i g i 1 1 1 a aa 
ttcct gatagtggccattgacaagcatcactcagtgtgttgccagaattcact^ 
tataaacctgaaaatccatgcxngtagtcccagtctctggatgtagtggtgcacgcctgtaa 
tcccagct actt gggaagct gaggcaggagaatcxicttgcaccgaggaggtggaggttgc 

AATG AGCTGAGATCACGCCGCTGCACTCCATCCTGGGTGACAGAGCAAGACT 

TCTTGCACGACATGAACAGCAAGGACAGAAGATTGTCCCCAGCTACGGAAAACA 

GCTGCATTACCGTAGGGTCGTCAGGAACAAACACGATCCCTACGAGCG^ 

GGCCGCCACC 

>699 

CGCGGTGGCGGCCGAGGTAC1 I II II 1 1 1 M T I'll I T T T GTTAGTGTTTT CTGATGTC 

TTTTCTAACAAATCTT^ 

GTCCATGTTtjCCCAAACTGGTCTGGAACCACCACACCCAGCTAATTTTTGTG 






ACCAGCACACCGGCGCCGTCCTGGACTGCGCCTTCTACGATCCAACGGATGCCTGGAGTG 
GAGGACT AGATCATCAATTGAAAAT GCATGATTTGAACACT GATCAAGAAAATCTT GTTGGGA 

cccatgatgcccctatcagatgtgttgaatactgtccagaagtgaatatgatggtcactgga 

agttgggatcagacagttaaactgtgggatcccagaactccttgtaatgctgggagcttctc 

tcagcctgaaaaggtatataccctctcagtgtctggagaccggctgattgtgggaacagca 

ggccgcagagtgttggtgtgggactt acggaacatgggtt acgtgcagcagcgcagggag 

tccagotgaaataccagactcgctgcatacgagcgtttccaaacaagcaggg 

aagctctattgaaggcgagtggcagtgagtattggacccaagccctgaggtacagaaga^ 

tatg^cttcaaatgtcacagactaaaagaaaataatatgagcagatttacccagtcatggcat 

TTCTTITCACAATATCCACAATACATTTGCCACAGGTGGTTCTGATGGCTTGT AAAT ATTGGG 
ATCX^TTTAACAAAAGCGACTGTGCACTTGCATCGGGTACCCAG 

cagcatgatggatacgcttggacaagcgctcattgttgacggtggaccaaaactccggaat 
ggtttcacjo^cagggcgtgg 

acccgcggggggaggcjuscaacgc^cSggt 

>700 
>701 
>702 
>703 

TGCAGGAGTCCGACCT CCGTCT GTTCCTGGACGGAGACATCCTCCGCCAGGACAG 
AGTCTCCAAGGCTGCT ACTCCTTCATCCACCTCAGCTTCCAGCAGTTTCT CACTGCCCTGTT 
CTACGCCCTGGAGAAGGAGGAGGGGGAGGACAGGGACGGCCACGCCTGGGACATC^ 
ACGTACAGAAGCTGCTTTt^GGAGAAGAAAGACTCAAGAACCCCGACCT GATTCAAGTAGG 
ACACTTCTT ATTCGGCCTCGCT AACGAGAAGAGAGCCAAGGAGTTGGAGGCCACTTTTGGC 
TGCCGGATGTCACCGGACATCAAACAGGAATTGCTGCAATGCAAAGCAC^ 
TAAGCCCTTATOCGTGACCGACCTGAAGX3AGGTCTTGGGCTGCCTGTATGAGTCTCAGGAG 
GAGGAGCTGGCGAAGGTGGTGGTGGOCXXAj 1 1 CAAGGAAATTTCTATTCACCTGACAAATA 
CT I CrGAAGTGATGCATTGTTCCTTCAGCCTGAAGCATTGTCAAGACTTGCA 
TGCAGGTAGCAAAGGGGGTGT1 CC I GGAGAATTACATGGATTTTGAACTGGACATTGAATTT 
GAAAGGTGCACTTACCTAACCATTCCGAACTGGGCTCGGCAGGATCTTCXSCTCT 
CTCGACAOATTTCTGCTCTCTCrrTCAGCTCAA 

AAAGCTTCCTGAGTGACTCTTCTGTGCGGAT I CT rrGTGACCACGTAACCCGTAGCACCTGT 

CATCTGCAGAAAGTGGAGATTAAAAACGTCACCCCTGACACCGCGTACCG 

TGCTTTGATTGGGAAGAAGACCCTCIACGCACCTGACCCTG 

CGCACGATGATGCTGATGCTGT GTGAOCTGCTCAGAAATCAT AAATGCAACCTGCAGTACCT 

GAGGTTGGGAGGTCACTGTGCCACCCCGGAGCAGTGGGCTGAATTCTTCTATGTC^ 

GCCAACCAQTCCCTGAAGCACCTGCGTT^CTGAGOCA^ 

TGTT GCTGT ACAAG-ACCATGACACGCCCAAAAGACTTCCTGCAGATGTT GTCGTTGGAAAAC 
TGTCCTCTTACAGAAGCCAGTTGCAAGGACCTTGCTGC I G I C I rGGTTGT CAGCAAGAAGCT 
GACACACCTGTGCTTGGCCAAGAACCCCATTGGGt^TACAGGGGTGAAij 1 1 rCTGTGTGAG 
GGCTTGAGTTAOCCTGATTGTAAACTGCAGACCTTGGTGTT ACAGCAATGCAGCATAACCAA 
GCTT GGCTGT AGATATCTCTCAGAGGCGCTCCAAGAAGCCTGCAGCCTCACA^ 

ttgagtatcaaccagatagcttctgcattctggattctct 

CTGTAACCTAAAACACCT ACGGTTGAAGACCTATGAAACTAATTTG 
GG AGGAAGT GAAAGAAAAG AATC CCAAGCTGACTATTGATTGCAAT^ 
GCACCTCOGTGCTGTGACI 1 1 T 1 1 T GCTGAGCAGCCTGGGATCGCTCT ACGAATTACAGAGG 
aagcgg^sattcgggtctct aagatgtctt ATGAATGCAGGTCAGAGGGTCACATGTTAACAC 
TAGAGTClGICG^GAGtrTAGGATTTGACA LlGGl 1 1 1 C I C ACTATTTTTGGGAGATTCTGCAC 
GAGTCAC GCACCCCLI I CACATGACGCTATGTACTTTCT CACAGGGATAATAAAGTT AGAGC 
ACTCTCGNMNNNWOWNNNN?WNNN?OOOWN 
>704

. MCTGTGGCGCGOG I CCI 1 1 tX^CGTGTCCXXXaTCACCGGCCCTGCCCCCGGGCC 
CTGTOCTTATAACCTCGGATGGGCACCOCT 

CCt^GAGCCOGACGCT ATGTCCAGCAAAGGCTCCGTGGTTCTGGCCT ACAGT GGGGGCCT G 
%£?^l™J!5E A TF CtCa 1 G 1 **?^J i '***'*Q*ACAAG<XTATt5ACGTt^TTGCCTATCTGG 
OCAACATTGGCCAGAAGGAAGACTTCGAGGAAGCCAGGAAG 
CCAAAAAGGTGTTCATTGAGGATGTCAGCAGGGAGTTTGTGGAGGAGTTCATCTGGCCGGC
CATCCAGTCCAGCGCACTGTATGAGGACCGCTACCTCCT 

GGCGCCACAGGAAAGGGGAACGATC^GGTCCGGTTTGAGCTCAGCTGCT actcactggcc 
CCCCAGATAAAGGTCATTGCTCCCTGGAGGATGCCTGAATTCT ACAACCGGTTCAAGGGCC 

gcaatgacctgatggagtacgcaaagcaac^cgcgattcccatcccggtcactccca^ 
cccgtggagcatggatgagaacctcatgcacatcagctacgaggctggaatoctggagaac 
cccaagaaccaagcgcctccaggtctct acacgaagacccagga cccag ccaaagccccca 
acacccctgacattctgt^gatcgagttcaaaaaaggggtcxxrrgtgaaggtgaccaacgt 

CAAGGATGGCACCACCCACCAGACCTCCTTGGAGCTCTTCATGTACCTGAACGAAGTC^ 

GGGAAGCATGGCGTGGGCCGTATTGAGATCGTGGAGAACCGCTTC^TTGGAATGAAGTCCC 

GAGGTATCT ACG AGACCCCAG CAGGC^CCATCXTTTT ACCATGCTCATTT AGACATCGAGGCC 

TTCACCATGGACCGGGAAGTGCGCAAAATCAAACAAGGCCTGGGCTTGAAATTTGCTGAGC 

TGGTGTATACCGGTTTCTGGCACAGOXTGAGTGTGAATTTGTCCGCC^ 

GTCCCAGGAGCGAGTGGAAGGGAAAGTGCAGGTGTCCGTCCTCAAGGGCCAGGTGTACAT 

CCTCGGCCGGGAGTCCCCACTGTGTCTCTACAATGAGGAGCTGGTGAGC^TGAACGTGCAG 

GGTGATTATGAGCCAACTGATGCCACCGGGTTCATCAACATCAATTC^ 

atatcatcgtctccagagcaaggtcactgccaaatagacccgtgtacaatgaggagctggg 
^^ctcctcaatttgcagatcccccaagtacaggcgct aattgttgtgataatttgtaattgtg 

ACTTGrnrrCCCCGGCTGGCAGCGTAGTGGGGCTGCCAGGCCt^^ 
CCCCTGAAGCCTGGAAACtaTTGTCATCGAAGGGAAGGGTGGGGGGCAG 

gct AT aaaaatgacaattaaaagagacactag tccacacacaaacacaaagacaacaacaaaa 

CAAAAACAACACGCGTCTC I ICI I UCGCAACAAAAAGCGACCGAT ATTACACCACATTCATAA 

GATACGCCCTAGGATACGGAN 

>705 

>706 

^^^AACGTACGAGTAAATTTTCATTACCTTT AATTAGGCAATGTTTCTTAGATAACC 
ATAAAACTGCAAAAGCAAI 1 1 T T AAAAATGAT AAAT AGGACTTCATCAAAAAGT AAACG CTTCA 
AAAGATACT ACTGAGAAAGTCACAGAATAGGAGAAAAATCTGATGAGACTTTATGTCTAGAGT 
AATGAATTCTTGTTAACGAATAACCAACCCCCTTTTAAAAATG^GC 

ATTTCACT AGAGACAATAAACAAATGGCCTTAAGCAGAAGAGATGCTCAACATCAGTAATTAT 
TAGGGAAATGCCAATCAAAACTAGAACGAGATACCCTATATCCACTAGTATGGCT ATAATAAA 
AAAGAGT AACAAACGTTGAGGAGGATATGGAGAAACTCCACCCCTGGTCAGGTGTGGTGGA 
TCACACCT GT AATTCCAACACTTTGGGAAGCTGAGGCAGGCAGACTACTT CACTGAACCCAG 
GAGCTCAAGAGTAGCCTGGGCAAGACGGCGAAACCOCATCWN 
>707 

1TCGGAAGGGCTGGGGGCTGAGAAGAAAGGGCGGTGA1TCACAGCGCCGAGGAG 
GCAACATCCGGGATCCTCGCCCCTCTG^SGG 

ACTCGTTCCTX5GG AAOCGAACCTGGAATOCCCGGCGGCAGT GGGGCTGTTGCTGTTGCTG 
TGGCTGTCGCTGCCCGTCAGGCTGCC7 ICI 1 1 1 GT CGTTTCCCAGCGCTGCOCAGGACTTC 
TCCTGGCGGCGCTGGGGATQCAGGGGGTCGGCTG€CAGCT 

aaaaaatggaggaatctgtaaaccaaatgcagccactgaatgagaagcagatagccaattct 
caggatggatatgtatggcaagtgactgacatgaa7cgact agaccggttcttatgtttcgg 

TTCTGAAGGTGGGACTTATTATATCAAAGAACAGAAGTTTGGGCCTT GAAAATGCTGAAGCTTT 

aattagattgattgaagatggcagaggatgtgaagtgatacaagaaataaagtcattt 

AGAAGGCAGAACCACAAAGCAAGAGCCTATGCTCTT^^ 

CCGACAT AAGCACAAAACAAGCAGCATTT AAAGCTGTTT CT GAAG NIG IC GCATTCCTACCC 
ATCTCTTTACTTTT ATCCAGTTTAAGAAAGATCTGAAGGAAAGCCTGAAATGTGGCATGGTGG 
GGT OGTGCCC CTCOG GAAGGCT ATAGCOGGACTGGGACAATGAGAAAGGTGGCATGGCCC 
TTGCTCTGGCGGTTCCAAAATATAAACAGAGAAATGGCTGGTC1 CACAAAGATCTATTAAGAT 
TGT CACATCTT AAACCTTCCAGTGAAGGTATGCAT AAGATCTT CATTGGGAAGAAGGG TGGG 
TAAQGGATATTCAATAAATAGCAAATTTTTATTTAACAT AAGACAAGTGTGT AAT ATTTTCT AG 
AAATAGCGI 1 1 1 ATTCTCAAAATATACATTAATTCAAaTATATTTAATGOU 1 1 C C I bGGATTACA 
C^CCTACCACAGGCGATCCAGACATTAAT AATCAAGAATCAAATGACAGGTTT AT AGGTT AAT 
ATTCTGAATCCTAAAATGCCACCG T CC I ACTGGAATGCTCACCAAGTGATTCTGGACrGTGT 
AAGTAGCAGGGGCAAGACTAAAAGTTGCAATAAGCCTTGGATATGGGT 
>708

NNCACGAGGATTCCTTCC I ICM I I GGTCGGTTCTGAGTGTGGGGTGTCTACTGGG 

gatctgctaaggctaagaggcaaagataggcaagtcactoccctgacctcaagaaactccc 
agtctacaggcgaagatacaccacccaccggtagagtcgctggaccagaatattaggtgtt 
ccagtcaaagtcacccagatttgccaaaagacctggcacaaatgtcacttccactatgaagt 
cccactgacttccatatacaagacaatctgctggga a I MCI I gggttgacagctcttggat 

CU^ATTTTGAACAGTGGTAGTGTCXrTGGATTACTTTTCAGAAAGAAGTAATC CI 1 1 1 I ATG A 

cagaacatgtaataatgaagtggtcaaaatgcagaggctaacattagaacacttgaatcagat 

GGTTGGAATCGAGTACATCCI 1 1 1 GCATGCTCAAGAGCCCATTCI 1 nCATCATTCGGAAGCA 

ACAGCGGCAGTCCCCTGCCCAAGTTATCCCACTAGCTGATTGCTATATCATTGCTGGAGTGA 

TCTATCAGGCACCAGACTTGGGATCAGTTATAAACTCTAGAGTGGTAAGTGTCTTCACATTCT 

TTAAGCACTAAAGAAAACTTTTAATTAGCTACCTTGCTTCCAGTAATCAAACTAGAGCn'CCTCT 

GCCTTGTGTAAGTTGCTATAAAGTATTGACTATTAGAATGTCTTGAACTTTGGTTACTGTGAG 

CCAAGTCGGTGCTCAAAGTATATTTCATAGTCTCAATTATATAGTAAT7TAGGTTCTGAAAAAT 

AGGTTCTGTCTTTGCATATGTAATATTTTGTGAGTATTTACTTTGGAAAGTTTGGTCGACCTAA 

TGATAAATTTAGAGTTTATTTTCCTTTTACAAGCTTACTGCATTGCATGGTATTCAGTr 

TTGATGAAGCTATGTCATACTGGTCGATATCAT 

>709 

ACAAGCATGGTCCATACCACTGTTTACTTTTCTAGAAAGTTGTTAGACT AATTTTTCA 
ACAAAAATTCTTTATTGTCTTGGTAACAAAAG AAG CATACTAAAAATTCTCAATAAGG CACAGT 
GTCTCTAGAAGCTTGAGCATTCAACATAAACTTCTAATTAACACGAACTTGTGCTCTTATTTCA 
GCCATTGCTGTGTGGGCTTGGAGCCAGGAGAAGATGCAGAGGAATTTTACAATGAATTACTT 
CCATCAGCTGCAG AAAATTTTCTAGTTTTGG G GAGACAATTACAAACATGTTTTA 
>710 

ACGCGGG CTAATCCCAGTT ATGAGGGCTCT GCGCATG ACCT CATCACTT CCCAGAG 
GCCTTACCATCTAATACCAATACATTGGGTTTAGAATTTCAGCATGAGAATTTGGGGGAGACA 
G TCAGACTGT AG CX1ATGATTCTGGAGT ATTCATCA7TT AAGAG AC ACTT AAAAATG ATCAGAA 
AGGAGAGGATGAAGGCTAGAACTAAGACTfTAGCGTTGAACATGGAAAGGAAGTGATGACT 
G CAGAT ATCTCCAGT ACCTGGTGACC AGAT GT ACAG CAGC AG T GTGACT ACCGTGG GT G CA 

ggtaccagt aag gaaat aagaactgaagcag ggcagat atttcatttg gg cacccact gagt 
tt7tggtgatgggtatagctatttgattttggagtct ag ggtgg atttaagtctggcttagaa 
atatggctctaaggggacattcagcataaaagttaaaaaaaaaaaaaaaaaacccacagagaa 

AGAGTGAGGAAAGAGGGAG G GCCAAGGGTAAAACCTTAGAACTTGAATAAGCAAAGAAGTC 



GGAAGATGCNNNNNNNNNN 
>711 

ACI H 1 1 1 1 1 1 H II I M 1 1 1 1 1 1 1 1 1 1 1 1 GATAGCCATATACCAAATAAATGTTCTGTG A 
CTA GGGG TTATGGCACAATGGGTATTGAGACACTAAAAACTCTGCTTCAGGCTTCCATCCTC 
TTAATTTTAGAATATCTCTGATTTGCTAATTTTCTGATTGACAT O) IM G GT AGATTATCGTGTT 
TTTACTTT ATGTTATTGACT GATCCTTTAGAATGA 1 HI CI 1 1 1 1 GTTCT GGGAAAAAAAATGCA 
TTCT AAATCAGATTCACT AATACTTTGATTCACTTCCAAGGA 
>712 

AGTT ACAAAAATTTTTAACATTAGGAGGTAATT ATAAG T AGATTCT GTGATTAG GACTT 
CATTCATGTATCTrTTGCTACATAAACCTTT GTTAGATTAAATGGAAGACACCTGCT AGGTGA 
TACTT7TTATAAAAGATATGAGTAAGTCATATATCTTTGTTAAATTTCTGT ATGTTC7TTTTTGT 
ATAAAGATGGAGAGAAAGGATGGAGTGATACT AAGGACCCT AATAACATCTCTGTTCAAATTA 
ATTACTAAGTGATAGAAGT ATTCATATGCCATTAAAGATTTGCCAATTCT ATTTGAATTTTATTT 
GATAAACTTG AAAATC AAAT AACC TAACAGCTGT C I II I CM I C I 1 1 CTAAACCCTTTTAAGAA 
TAGATTTAATATTTTTCTGAGTTTTCATTAAAGAGTTATTTATGTT ACGGTTTGTTTTTATAAAA 
GTAGCATGGCAAAATAAAAAAGTCTGCATCCTTGCAAGTT ATTCACTGCTCACGTGCTTGCT C 
TTCTCTGGTAAATTAAAAAAATAAAGATCAAGAAGAGTCTGGGAGGAGGAACAGATGAGTCA 
GATGGGTTGAATCCTGTGAGTAAGTGAAAGAGTAATAGGAAAAAAAACACT ATGGTCATGAA 
AGAACTGCATCCTGAAAGGTTGTG 

>713

^NWX^CTTGTTGCCCAGGCTGGAGTGCAATGGCATGATCTTGGCTCACCACAA 
CCTCCGCCTCCCGGGTTCATATGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACA 
ggcatgcgccaccacgcccggccaattttgcattrtcagtagaggcggggtttctccatgt
tggccaggctggtctccaactcccgacctgaggtgatctgcccacctcagcct cccaaagt 
gctgggattacaggcatgagccaccatgcctggccaaaaataaagaacttttaagacaaacc 
tgaaaatcxcagtaagagccatcttttgactg^ctttatgttact 

AGGCTCAAGACCAGCCAGATTTTAGGTAGGGTTrTTTGCTGCAAAGTAGAAGCCTACTGTCT 
CTAGAAAAGAAACGCAGTACTTAGGGATCAAATGGGGTGGCAGTGGG6AGGAGGAATAGTC 
AT AAAG ATCAATG GTCATT AAAATTCATTCCAAAACAAAACTGCAT AT AT ACTTTG GA GCT CCA 
CTTCTrrGTTTTCGTAAATAAAAGAAACTAATTGACGAGTATGTACAATGTGCCAGGCACCTT 
ACAAGACACAAAT ATG CTCTT AT AG G CTGGG GAAAT AAG AAAATATGAATG AAG CAACCCAG 
GTCTTGAGCCAAAGAATTACXrrGGGGTCCGTTGAGTrTCAAATCTGAAAATTTCTG 
AGGTCAGCATCGCCCACIAAACCGGC(^TrCTGAAACACAGCCCACAAGAAGACAATTCATTT 
TGTGGCCTTCTTTTGCTCTATCAAATACAAACGAT C I M i l l I CAAACAGGAAAAGGGCTTTC 
AAGAAGATATGTGTGGCCTGGAGTCCTTGTGTCAGTACTCATAAAAGAACCATTCACTCTTTA 
GTCTT AGC ACTTTCCTCAAAATTG AAAATCAGGCCTGGAACT CTGGCAGACAGG GTCT CT GG 
AACATAAGAACGTAAACTCCACGAGGGCAGAGGGTTTTTTGGGATAAGTTTAGTTTAGTCTA 
GTGCCTTGAACAGTGCCTGGC^CATAACAGGTGCTCAAAAACTATTTGCTGAATAAACAGCA 
CAGACCTAGATAATTTCCCTT ACATGCAACCTTnTGGAAAGCCATAAAAATGTAAGGAAAGT 
TGGAAGGGCTAAGTTGGGGGGTTTCCAAAGAGCTTT ATAGTCTCATGAATCTTCATGCAGTA 
CTACCACTGGAAACTCCCATTCCAGGCAGCTTTTTAAAGAAAGAAGCAAGATGGCT^ 
TCCTAATGTAAAAAACAGAAAT^ 

CAATTATTAGAGCCAC^GAATACATTCCAAAATGTCATCCAGAAGTTGCTGTC^ 

GGAAATAATCTAATGACATTATACCAGATAAAAATATATTTGCCTGCTCCTCTTCCAGTAGAAT 

AGTGAGATCCAACCACCTTGGGAGACTCTTCCTCCACCCX3ACTCCCATTACATAGAGTGTAA 

TGCTGCAGGGAGAGGAAGTTAAACATTACTTTCAGGAAGAAGGAAGGGTGTGGTTTTACTGC 

TAACGGACGCGTGGTCGACTCCCTANN 

>714 

cccttagcggccgcccgggcaggtacatatgcactatttagaatatgacattaatca 
accactagaattaaaatcaggttataaatcctcaaaatcaccagagtataaatttaaatgaaaa 
acccagaccacagaacaaaaacagaaataccaaaaaataatcacaaaatattaaaaacagtat 
ataaacacagtgacagaattaggactaaacatatctgtaaaacaataaatgtaagggtaatct 
caccaattatgaaaaagaccttcagatcatartttaaaacaaatttaaaaactcaac^^ 
ttatgcaagagacagatttaaaaataaagagactcagaaagctggaaataaaaagaaagtgca 

AAGAAATAGCAAACAAATACAGGCATAAAAAAAAACAAAGATCCCAATAGT 
>715

ACGTGTGCTGGATATGCAGGCTTGTTACATAGAATTGGTGTAAAATTTGAAAACCAT 
GAAAAAATAAAACAATAAAGGATCTAGATGCTAATAATGTGGTTAGTTAACATGTTGACCATTT 
CAAAGCAAAATAAGTCTrTGATGTTTTATACTATTCATAGCAAGATATAAGTATTTAATCTGCA 
AAGACGTGGATTTG^AAATTCAGCTGCCAAATGTAAAGAACAGATTCCTAGATTATTAT^ 
AATATCT CTAT AAATATTAT ATTTATCAATAATGGGT 
>716 
>717
>718
>719 

NNNNNNNNNNNNNNNNNNNNNNNNNNNGGAGACAGGGTCTCGCTCT ATCACCTAG 
ACTGGAGTGCCTGGTGCAATCTCGGCTCACTGCAACCTTCACACCCCAGGCTCAAGTGTCA 
ATCCTCC<^CCTGAGTAGCTGGAACXiACACGTGCGCACCACT 

CCAl 1 1 1 1 AACCC^AAACATTAAGAAAAATATAGGAACAGTAAGTAGATTACATTTTGTAAACA 

GACAAGCTTACAAGTTTTCTCAAATATGAAAGTCATACTAAACTGGGAGACTGTTAACTTC 

GATGGGGTTAATCTCTAATATGAAGCCACAGTCATAGCTAACTACAAATTACATATACAATGC 

CAAAAATATTCAAAAATAACATTTTTTGCACCTTAATGATTACAAATGCTAACCAGCATAA 

CACTGGAAAGTTTCAGAATCTCCTCATCACATACTT^^ 

>720 

ACTT GAAGAACATG GT AAAAAT ATGTTCACAATAATATTTTATCTT AGAAATGTATTCA 
GTAAAAAATCTCTTTANTTCAACTATCCTCTrGATTCAGGGGAAAAAAGGATTAGCATGGG 
ATAA(^GAATAGGAAGTTTAGGAGATAATGAGACTTCTGTTTTAGTAAAGTAAATAAGCTTTA 
ATAGTTTTirGGTCATGTATTCAGTTTACCAGCCTTGAAGATATTTGTAGGAAATTTTAAAAGT
TTCTCT ATTTCATCCCCCATGATAAAAATTAT ATAGAATAAAAGCTGAATTGAACTTTCTTCAC 
AGCACACTGAAAAATATCTTCTATAGGATrAATCAGATCACAGAATGCATATTTAAACAAAAAT 



eogtacocccaacgctgcteacagaaQaogeaaoaftaatttcetT 
ACTAAATTTGTCCTTGTTAAATAAAGGTATAAAACAAAGATGTTTTTTCTTT^ 
TTT TACAG AGCTGAGTGTAAAAATGTTAAGTATG I INCH I AACAG AAAGTTCTGTTTTTGTG 
ATCCTTTT AAAAAT AAAGCTrCACGGAAGGTATGAGAATAGTATTTTTCAACTTT AAATTTCTC 
ATTACCAGAAGACCATGTGGTAATTCTCTGTATAGAGTTAGAACAGCACGGAAACTTGAAGG 
CCTAAAAAATTAGCTGACCTTGTTAAAAATGTTGGCX3TGAGCAGTATATTATTACCTATCTTTT 
TTTATT GTGTG TGTGTGTGTGTGTGTTTTAAACTAATTGGCTGAAATATCTGCCTGTTTCCCTC 
TTTACA I unci i GTTTCTTTCCTT A TTT ATCTTTGTCCATCTTGAGATCT A CTGT AAAGTGAA 
(MM lAATGAAAACAGTTCCAAGTTTTACTCTCAGTGGGTTTGGGACATCAGATGTAATTGA 
GAGGCCAACAGGTAAGTCTrCATGTCAGTGTTTGTTGAGGAACGAGCCTATGAGGTCAGTTT 
TCCCAAAAGGAAAAAGGGCAG AAGGGATTTGTTCAT7TTTACATCT CGTTTCT GT AAT ACACC 
TTTX^CTTCATGGTTGATCAGAC TTTG AAGTCTAAACAG AACGTAAGCACTT GGTGTATCGAT 
TATCAT ACT ACACAGTAGACATGTTTTCAAGGCATATCTTGTCACTGTGTGTGCTTCT GACTT 
GTATCTTTC CCT A^SAACTAAATATTTrGAAGGTCCAAACATTATACTTTGGGAAACTTGATTGA 
I 1 1 1 1 1 1 1 I AATC TAGCT TTTCAGCTGATAATGGTTACACXjTGGTATTATTTGAAGTCCAAG AT 
ATTATGTATTCTATTTT AACTGTTTCXCAGTTTTGTATTTTTTGTATTGGAATGATTATGAAATT 
A ATGA GAAATCTTrAQAAACAGACTACAAAAGAGTAGTTCTT AAATACAGAAGTGATTGTGAA 
ACTTTGTGATAGCAATGTGTGTAGTAGCAGTTTTGTCTGAAATACATT^ 
AAGGATGTTGA AGTAT TAAAAGTCATTATGCTGTCTGTGGAATCCTACTATTAGTACAGAACA 
CCCTGCAGTGATTTITCCGCCCATGTTAGCATTGAAGTTGTGAGCTCT 

TGGCCCTTTAATTCAAGTAATTGGTCAGTATCCGTATGGGTCTTTTATACCA(XXX3CTG6GG 
GCTAAACTAATfTAGCTGCTGCTGTATACTTACTAACAAGGAATAAATGTrAAGCTTTCTTCTC 
AGTATTGATGGATGGTATCTAAAAQTATTTTTATG I MCI 1 1 AACATGGCTT AAATTTTG AACTT 
AATGTATCAAGTTAGTATGGTCAT ATT AAT ACCT GTGCTTTTCAGATTCTTCAAACACCT AAAT 

gaaagtgataaattcaaaactgatccttttagttcctcattatatgatatgaagggattaactg 

TAGCAGGATAGTCAACCTGACCGTACGGCATGGTGC > I I I I 1 1 caggataaatcttcagaca 

aaaaagtgcaaacaaaagggaaaaggggagcaaagggaaaacaggccgaagtggctaacca 

agaaact aaagaagacttacctgcggaaaacggggaaacgaagactgaggagagtccagcc 

tctgatgaagcaggagagaaagaagccaagtctgattaataaccatataccatgtcttatca 

gtggtccct gtct cccnci i gtacaatccagaggaatatttttatcaact attttgtaaatgc 

aagttttttagtagctctagaaacatttttaagaaggaggga^ 

aagtgtaaatgc 1 1 1 1 1 1 1 aagaggtgaaatcatttgct ggttgtttattttttggtacaacca 

gttaaca n cca caoat Qgggggll a flfln g U t a l^ 

CATTTAATGTCTTGAACATTTTAAATTACTTCTATTACCATGTTGTTTT^ 
CTAAAGAAAACCACrrCTTTGATCATGGCTCTCrCTGCCAGAATTGTGTG 
TTTGTGGTAGTCC I U 1 1 1 1 CC I AATAACTTTGTT ACT GTG CT GT GAAAGATT ACAGATTTGAAC 
ATGTAGTGTAOGTGCTATTGAGTTGTGAACTG GTGGGCCGT ATGT AACAGCTGACCAACGTG 
AAGATACTGGT ACTTGATAGCCTC 
>722 

ACATGAACCTATTAATAAACCATTCATGCTTCCCAGTTTGGCAGATGTGAGCAAACTA 
TGTATAGGAATTCCAAAGGTAAC 1 1 1 1 ICC 1 1 1 CATTACTTTACAGAAATACT GTCAAGTCCAA 
T AGAGAGGACAGACTTGGGAGGCGGATTGGGTGGGTTTGAATCT CT GCT CT GCCACTTTT AT 
T AATCATGTGAGTTGAGT ATGTGACTT AATCTCTTTT AGCTCAATTTCCCCATCTG TAAAAT AG 
GAATAATAAAAATACT GACTTCAGAGAGGTTTGTGAGGATCAATT AGACAG TCAT GTT AAGTC 
TGTAAATTGTTTCTGT AATGGGCAAGATAGCAAATATTTTAGATTTTGTGGACCATGCAGTCT 
TTATCATAACTGCTTAACTGCCATTATAGTGAGAAAG<>^ 

AAGTGTG T CTCT GTTCCAATAAAACTTT ATTTTCAAAAACCAGCTGGCTTGTCACATCTGGCC 
TAN 

>723

ACttACTTTQTtGCI CI 1 1 1 1 CTAAGTTTTAAAGATGGATGCCAATCTCAGG C I I CM I 
TCGTGTGTGTATGTGCGTATGTCCATAAATTCTCTT CT AATTACAGTGT AAGCCACATCCCAC 
AAGTTTTGATAGTC^CAGAACTGTATCGTCAC^CTATTTTTTAATTTCAGTA^

ATCCCTGTGTAATTTAGAAATGTTTC^TAATTTCCCTACAT^ 

TTATTATTAATTTCTAGCrGTATTGAGCTCTTGTCAGAGAATATGGrTTATTTT 

AATTTAAGATCTGCnTAATGGCAAAATGTATGGTCAGTTTTTGTAAATGTTGCCAGTAAGCTT 

GCGAATCATATGTACTCTAGTTTTGAAATCCATTGCTCAGTGGATGTTX^TTAGGCCAATTTG 

TATAATCATGTTGTACAAATCTATTCTGTTCTTAACTGTTTTTTGTTTTAAAGGTGTGGGGTCT 

TACTATGTTGCCCGGG(^GGACTCVVAATTCCTCAGCCTCCCAAGTATCTAGAACTACAGGCA 

CGTGCAGCTTGGNNNNNNNNNNNNNNNNNNNNNNNNI^INNN 

>724 

ACTCCT(^GC7TGTGCTGCCCTTCTCGAATGACTCGCGTTTCCTGCTTTCATCACT 
CACCTCCCACCGCTCTCCATCACX^TGCTCTGCTCTTATAAGGATCCAGAGAAATGGAATAAT 
CTTATTGCTGATCTATGTAAACAAGTTGAAGAATCGTCTGAAAGAAAATACAGTGTGTCTAAA 
CTGGAAAAGTCCTGTAATAGTTTGTTCATGAGCATTTGCACAGTGGAGTTACTGTTTCATCAT 

TTGTAAGGTACTTTGTAGTGTCTGTCTAAATATTAGAy^ 

AAGCT AAG G AAAAGTTGT AG TTCTCT AATTATCAATTAGCA I M CM T T ATACTTTCAGTGTGG 
ATATTTGGGATTTCTTTATATATTTATTGCAAAGCCCTAGATCTTAGGGATTTGACN 

>725 

ACT AACTATTCCT AAATATTAAACACT GGTCAACTAAAATGC ACAAATTCATGAATTG 
GATTTGCACTCAAACAAAAAAAAATACCATAGGCAGTATCATTTCTACCTTT GTAAGAGGCAG 
GAAT ATT CATT AGA CTCT ATG CHTGACTTTTCAT ATGT ATTTT AACACTGT AGT AG G CT ATCGG 
GTCTAGTTTAAGCTTCATTTCTAAACT ACTCAACAGCTCAGAACTGACAAAG ATCACAAGAAA 
TCAACTATTAACCTCTTGCCTGAAGACACAAATGAAATATTCCCTATTTTACAA^ 
ATTCCAAGATTTTCXiAAAGCCAT ACTCCTGCAGTTCACTT GGGTTCAAACTT AAAATCATAATA 
GTAATATACACATATTTACATTATAACCGATTACACATTATTTTC^ 
>726 

ACTCACTT AAATAAATAATTGGTAAGATGATTTTATCTG ACAATT AAAAAAAGGT AT AT 
GTGAAAAACCTTAAAAAAAATCrTATTTCATTACATGTTGAAATGTTCTG^ 

AT(^T TTAA ATTCTTTTCACATTTGGACAACAGAAAAACTGAAATCT ATGGATTCCAAGCTGCA 

AAGTATTTTATCTAAATTGCAAATCAAAAAACATCTATAACATCrTGTTGGGGATACAAAGTTC 

TCCTGGCTGATTCTCATGCTACAGAAAGCCXXaAGTTTCTGTTCTGTAAATTGGGACA^ 

CCGCN 

>727 

ACATTCTATTGTTATCTCTATTTTTTGGATGAAAAAACAGCAGCACAAAGAAGTTCAG 
TAACTGGCCTAAGGCCACACAGCTTGTCTTCCTGAAGACTGGACCCAAACCCAGGC 
TAGAACATGCTGGTCXaCTATTGGGCCGCTTGCTCTATGGGGGACGGTGCT CCAGG AACACA 
GCAATGCGGTTTAG GATTC CAGGACCTGGGGCAGCTGCTGC I I CI I I CI I AGTTCTCGACAG 
ACCACrTGAGTGCAGTTTTTCTAAATCTTTTCCCCACTrrGA 

CCACACGTATAACCCACT GTGAAGTTTAAAATGATTTCATGTTTGGGCAAATTCCT ACTGAAT 

GTT AAGCT AGATAGGAAACAAGTTCTGACTAACACAAANN 

>728 

>729 

ACTTATCAGGATGAAATCAGAATCACAGTTGGCCTTTTGCCATAAGGGAAGGGTATT 
TGGAGAAGAGTCAACCACCACTCATGCCTCTrCCCCTGCCCAGC^ 

GCTTTATGCCTCCTGTTTCCXXrrGGCTGAGTAACTGCAGGCATTAGG TT C CT C T A CACACGA 

TATATTACAGGGAAATGGCAGCGATGGTCTGGAAGGGCAACACTGGCCTT ctttcctcctg a 

GCACTAAAATCCTAAACATGCAACTTAAAAAAAAAATTCTAAATGTGAACACCAC 

ATTTATATTAATGTATCATCTCACCCTCTTTTCCTCCTCTTCCAACGCCCI I C I I ICTACCCAA 

ACTCCAATATACCAATTTGTTTGAACAGTTTTACATTCTAAGTGTCCAACT ATT GCT AAAGGAA 

TGGATAAATTGTTGT 

>730 

ACTCACTTAAATAAATAATTGGTAAGATGATTTTATCTGACAATTAAAAAAAGGTATAT 
GTGAAAAACCTTAAAAAAAATCTATTrCATTAGATGTTGAAATGTTCTGT GCTT AATCCAAT AC 
ATCATTTAAATTCrrTTTCACATTTGGACAACAGAAAAACTGAAATCT 

AAGTATTTTATCTAAATTGCAAATCAAAAAACATCTATAACATCTT GTTGGGGATACAAAGTTC 
GTTTCTGTTCTGTAAATTGGGACAAGTGC



TCCTGGCTGATTCTCATGCTA1 
OCGCN 

>731

nnn^nnngtaaactagtcttctaagctttc^ttacattttgttcaaaatcagttgga 
aaactttaatcaaatcactcaacattaggaaccatataaaagctaccatct atttttgaagctt 
tttgcagactgaaattaattgaaaagtaagcctgaaaacaggaagacactaaggaattaaaca 

AATTCAATGATAAGAGTTCAAATGT ACAAATGAAATTCT GT AATTATTATTTGGAATATCTGAA 

ctcatgagaaagatct7ttgagggtcttcctatcttgattaaatttatattctgtaacaatagg 
atggaattgagaagtcctataat7t7gatgtatctaagtattggaaaacatttaaagaatcata 
aacgaacacaacttggtaagtcttgtcttataaatgttaataaatttgtcaaagtatttgttaa 
agaaagtatrrtgaaaggaggtaaagacatagaacctgtgacaatcgtggactgaattattca 
catatgttaacatttacatgtcaataccaagacaaaaaagaatttacgatatcctccacttaga 
atttactctacaattatagttcaacacttgagagtgtcttcttaattaaaaaattatgatctaca 

CAGAAGAGTCAACTGCTGATGTCAACAATTTCAAATTTAATAGTCAAAAAATGAAACX^TAAC 

tctaaagattacaagcaatattatagaaaaaattaagaatacgaccacattgaacgaagatgt 

ATTGTTCAGAAAAGTACCACTAATTCTGGCATTAGAGATAGATATAGAATTAACATTAAAATAC 

GTGAATATATTCCAGTACTATTTTGCACATGCACNNNNNNMNNNNN 

>732 

>733 

NNNNNNNN NNNN N NN NN NNNhlNCAACACAAATCGCTrTTAGATACCCTTTCACGAAT 
CTAATAGGTTGTCACACACTCTACAAAACTATGTGAGAACGTATACTACTTCTCGGCCACAAC 
TACTATTTWAGATATTCATAAAATAACCTCTGATTGTGTTTTCACATTGACCCATTCAGTTCT 
GTCCAATCrrATAATTCTGATTAAATGTTCTGGGCCTCAAAACTAATTTTTAAAAGGCCACTAA 
CTCCAAATCTAGGAACAAAACACTCTGTAAGACT ACTGTAACTTGTATAAAATTAACTTGAAAA 
ATTCACTCACTCCAATAAAACTATGATTTATGTAGCTCATAAGAGGGTGAATTTTGAATATTTA 
CTCTATGAAAAAGCCTAAGCAATTCAATAAAAACTTGATAACTGCACGTTTAGTTTGCAGCAT 
CTTGTACCTGCCCNNNN 
>734 

NGA TGCTC GAGGG GCCGNAGTTGTGAGGAATCTGCAGAATTGCCCTTTCGTGTGGA 

cggcctttcaggtact7tctctgaatttcattagctacattaaaaaaagaaaagatcaaatgc 
aatagatagcactgtaatagattttgctacattaaaaaaaatccatttgaatacacagtgaact 
aaaacaccagagtggct aaaaagtcccttcatgcatatttactt agcagagagctcttgaga 
aagacocaaccaataaaccccaaccaaagcaaatccagctacttctct agctgagagggtgg 
aatgactccaaaatattgtttcaaggtcaaaaagcctaaaacaact ccagat aaaanaacaaa 

AATCTATCTAATTGGACATTTACCTn 1 1 GGAAATAAAAGGCCCAGNNNNNNNNNNNNNNNN 
>735 

ATTTCCCAAATCTTA 



CAAGTTCTCGTCCCCTTTCCCTTAACAACTCrrrTXXSGAGTATCT 
GTAAGOGAGGAGAGCAGCCAGGCT GCACCTTTAACATTTCATTCACAGGATCT CAGCTCAG 
CCAAGTCCTCAGCCATTTTGTAATGAGGATCAUI MCI T CCGGTTCCCCGTGACCTGTCOCT 
CGCCTCCTCT AAGCCTCAGCAGAAAGG CCTT CAACATCCACTTTTCCACAACATTCTGTCTAT 
GATACCTGCATTCTCTOAGATGCTAGAAGCTTTCTCT 

CTTCACOOGAGTGOCCATTGATGTCCGTATTTTTACCAACAAGCTCTTCACCGCT ATGGAGG 

CTTTCTCCAGCAGGTOOCTGAAAACGTCTGCANNN 

>736

ACCTATTCATATAGTTrTATGTTATTTGTGTGAATTAAAAAATTTAGATGAAACAATTTT 
AGCATGTAATTTTTATT AAATTTGCATTGTTTCX^ACrrTTTG TAGAAGTTCCAAATGAGTTT AT 
AGAACTT AAOAAATT ACTCAGTC 1 1 I C I G CTGTCCAGTTTTAT AATGTATG C I rCTTTTTTTTC 

CTGGATGAAGCACOCTTCCACCAGTGAAATCAAAGACAAATTTTATTGAAGC^ 
>737 

NNNNNGAATTCTATTACCCAATGTTTTGTGAGTCT cgaaaagagtaaaggaaacaatt 
ATTTCAAGTTTCATATTTGAAAAATAACTGA 1 1 1 1 1 1 1 1 G TAGTAAAGAGATGCTTTATGTAGTT 
CTGGICI rCTGGTGCI I 1 1 ICI I I ICI 1 1 1 AATTTGATATGATATCT AATTTGATTCATGAATAA 

tgcggcaacttgttgaaattttcctttttggaaatacattaggtc 
gtggaatctacccagaggaaatgattcagactggcatttcggccatcgatgggatgaacag
tattgctagggggcagaaaattcctatcttctctgctgctgggctaccacacaatgaggtga 
ggactgggatcggtttgctatgaagtttagcagacaagaatttcttaatgcatcactgttact 
gagaaaccgaataaagggttttcaaaacaaaaaaaaaaaaaaaaaaaaannaagn 

>738 

ACT ATCTGCTCTGAATTAAAATTT AGAACAAAAATCACCTGCCGTGCCACTACACATG 
GACATAA TCAAC TGCTAAATTATGA I ITGTTTTCI I CCAGTTACTTTTCCAATTATTTTTACAT A 
TAC AAATA TTITCTTGGT AGAA GAACAAAAGTGGCACTATTCATrGTGTAGTTTTTTTGTAACT 
TATATTTTACCCTAAGCATTTTCTCGTTGTCTTAAATTA 

T AATGCCT AGG ctgccatgagtcttttctccttctat AAACCGTGTCAG CATTCTTTT AT AT AT 
ATCTrrCAGC AGATC TGCAATGATTTCTTTGGAATAAATTTCTAAAGTTCGCTGGATCGAAAA 

GATTCAGGGA1 1 F n agtgttctttcaatttggcaaagtatttttcagaaacaagcccatttca 
gttctgaataaacaaattci Mil iatgttgcatttaaaatctann 

>739 

NNCT GGGCTTTAAXTrCCrrGGGCTCAAGTAGTCTGCCX^CCACGGCCTCCCAAAGTG 
TTGGGATTACAGGAGTGAGCCACCTTGCCTGTCGCAGAGCCTGCC I C I C 1 1 AACTGCTGCA 
CT AGACT ACCTTCTT ATTTTAAAAGTTAGCCTGCATTTCTCTT ACCTGCATTTTAAATTTTATTG 
AGTAGTTATTTTATGAGCAGTCTTTGAAAAATTTTCATAAAGCGGGCTCCCCCTGAAATT^ 
AACCCACATTCCAGATTTACCAGCrrGTGAATGTCrrrGCCACACCTACTTGATCTCTCGCTATG 
CGCGTACACAGTTTCC I IC I I CGAAACAATCCAGAAGTAGGCT AGCAATGGTGACCCCTACA 
TACTTCCGGACACATCTTTAAGAACAGGACACCATTACCACACCCAAGAAA^ 
TGAATTTATTCAAGAGTATCATCCAACATACTGAAATATCCACAGCTGTTCX^AAAGTATCXnT 
CAATTCTGGATCCATTGATGGTTCACAGGTTGTATTTGGCTGTTACATCTrTTTAGTTGTTATC 
CTTCAGAGT AAAACTGGCCTG CCCCTC I I ICI I I til I ACAATATTGACT CCTTTGAGGAACC 
GGGGCTGGATGTGGAGCATTCTCCATTCATCTGATTGTTTCCATGTGACCAGATTCGGGTCA 
CAAATTTCTGGCAAGAAOCCTTCACAGATGACCATGTATTGGTT ATT AGGT AACAAT AGATTA 
CTCAAGTAGCGAACTGGGTTGGCCTTTGGCATTAGANNNNNNNNNNN 
>740 
>741 

KNWINMNNAGTAGAGACAGGGTTTCGCCATGTTGACCAGGCTGGTCTCAAACTCTT 
GACCTCAAGTGATCCACCT GCCTTGNCTCCCAACAGCTGGCATTACAGGCGTGACACCNGT 
GC CCGGCT GAGTT ATGTAAACTTCAAAAGCCCACGGCCCGGGGCACACATTCACACACCCA 
CAOCTTTrATAA GGAA GCATGCGTGTTCAGGACACACATCAAACACATCACAGTGGGTATCT 
GTGAG GAGGCA I MUGGAGCTAAAGACCAATAAAGGGGAAAAAATAAATCAGCATGTTTGTG 
ATTCTA nTTTTGTl CAAAAAAAAGTCTT ACACACGTGTTATGTGTAGAGAAAAATGTTAGGAA 
GGATGTGTGCCAACATGTTCAGAGTGATATTTCTGGATGGTGGTAGTATGAGTAATTGACAT 
CI 1 1 1 CT ATTTACT AATGAGCATTTTGTAACTTTTCT ACCAAGAATGTTATATACAATATATACT 
TTTAAAAAATGCT AAAAGTT AACAAAACAAAAATGGATTTCATTAACTGT AT AAAACACCTGTT 
CAACt^ATGCAATGTCACTCGATCTACATCCAGCCACTCAAGGT gtoctta 
rCTTI CAGCTAGCTGCCATCCTCTCATXnCTGACTGCTTCTCT AAACCGCTCATGAGCATTAA 
TGCACTGCATCAACCTATCGTGTTCAjCCTTGTCCTT actcttgctgtttcgtggcctgcaaag 
CCCATGTGCTCCTGCTCAGAGGACATCXiAGACT^ 

^CAATC^TTCACTGGGAACATTAGCCAATGATGACTCCATCCCTTCT aaaataoccagcact 
GATGCTCT AGCATAT AT<^ACT ATAATTTCATC^TTT ACACCATTT AATAGAATTCT GAAGTTT 
CCTCCT ACAAGCC II MUM CCAGAAAGTCTGT CAGT ACTT CAGGTT AGAGATGACTTCAAT A 
TATGTCGCAGACCTCCCAAGGTGAGCATCACACAGCACTTATCAT^ 

ACAGAGGCT AAGATGAAAACAAAAATCTCAGGAAATTTATGTTTATAAAAATGATACTTGCAA 
AAAAATGAATGGAACCATCTCCATTGCTTATTTAGAGTGT^^ 

TTAGTCAATAGTATTGGATGCCTCTATATCTGCATATCAATAGGCT CATAAACAAGGTTGCTC 
AAAGAACT GCGC^TCAACCACTTGGTTTCATCTCTGGACACCACACTGTTA I C » I CC I I I GGC 
CTCTGTCCAT AACGGGT OCAGGCT ACGTGCACCAAAGGAAAAGAATTGGGTCCTTCTCCCGT 
CCCCTGGTTT GGTT AGGAGGGCCCAGAAAG AAG TCAGGACAGAGCAG GTG TGACT GTCCCT 
AAC<XSAAAGGAGGCTACCGTGCAGAACCCAAGCCAGGACATAATCACCA 
CAT GGTTA GCATGACCAGCACTCATCACAACGATCCCAGCCTT^ 
ATAGTTTTCAAAAGCAATGTTCTACCATTCCCCAOCTTCTAACAGTAATGA 
TT ACT AAT AGAAGTCCCATGGACTTAAGATCTGAGCAAATAAAGCTTT AATCTTCTTAAGCATA

TAA7TACTGGCATTTGAAAAGGCATGACATT 

>742 

ACAGGTTrCCCTTGCCTCAACTTCTCATCCTGGGTGATGAGACTGTTACTTTCCTTCT 
TGTATAAAGAGGGCAACTTTCATGTAGAAATTTrACCTCCTACTTTT AAGAAAAAGGAAAATC 
AGAGTGCTTTAAAGGAAAATCAGAGTGCl HI CI I GCATCTGCTATTTTTCAAGTGTCTTTAAC 
TCAAAAAAATCAATATGCCAAAGTGGGATGTTTGGGGGTATCTGGTTCTGAATTCCTTCAGGA 
AAGAT AGAAAGCAAAAGCAAAATAAT AGGTTTAAAACTAAAAAT ATCCAGGTGCGGT GGCTC 

acgoctataatcccagcaccttgggagactgaggtgggcagatcatgaggtcaggagttcg 
a&acgaogctgggcaagatagtgaaagcctgtctcrtacnunknhniinnnnhbi 

>743 

CCACGCGTCCGAGAAAACT AACAT7TACTGTATAAATGTTTGAAGGGCTTTT GTGTG 
TTGTCTT ATTTAATCTTC^CTCATTTTAGAGATGAGGAAACTGAGGCTCACAGAG G TTAAATC 

ac i m i gcaagatcgcattcagtagtgaagccataactgcgtagcacggttccacctccatg 

aggctgtcttgggttcttgtttctcaggtgtggtgagxxjgtctcttggatctgtgtggtgat 

ccgatcttgcactccgtcactgtggctgactgcattgtcagattc^ct^ 

ttcctacaggtgctttcaggatcaggtcactc 

tgctctcttgtctrtaggagccgggtgtgggctgagccctgcctgat^ 

aggagtacaacggggtgatagaagaatttttggcaagaggagagaagcttttt ggacctt at 

gtttggggaaggtatgacttgctcttca 

ccc rrorci gacctttgtcaccccctgcctgct agctggggaccgctccttggcagatgtc 

atcatccatgagatctcccacagttggtttgggaacctggtcag 

aattct ggctcaatgaaggtttcaccatgtacgcccagaggaggatct ccaccatcctuttt 

ggcgctgcgt acacctgcttggaggctgcaacggggcgggctctgctgcgtcaacac^ 



acgcggacgacacctataatgagaccccctacgagaaaggtttctgcttgtct gg 
cccacttggtgggtgatcaggatcagtttgacagttttctcaaggcctatg^ 

AAATTCOOAAGCATCTTaGCCGATGACTTTCTGGA 
AAGAA^AAGAGAGJTGGA 

ATGTGGG AGCG AAGAGCT GGACTGAGCATTGAAGCGTGGC rrCTCCI GGAAGACTACCGGT 

GGCTATTCTSGAAAAACCCAGAATCCCTCCCCGGGTGTGAAATGGG^ 

GGAAGGGACCCGGGGGGGGATTCTTTAAAACCGTTTGGAAGCTTGCG^ 

AGGGGGGGGCCCCATTTCCCOCTTTTACTCN 

>744 



. JTAAI 1 1 ICI I GAGTT AGAAATGT AGTTAGAACTGTGA 
CT AACGGCATTGCCTGGAATGTGCTACAAACACGATT AGATATTCAT^ 
AGACTGCTTGTATAGAGACTCAGTGTTTAGACATTC^ 
GTATAAGACTCGGTGTTCATTTATGTTTTTAAATTAAAGCAGAAG 

CATTGCAATGT GCAAT AAATAAATATATCTGAAGT AGCATTAGCCTTCT AGTTTTAAATAAT^ 

ACC] II I 1 1 1 1 II I H 1 1 1 1 1 1 1 1 CGTCAAAGTCACT ATTTGGGCOCT AACATAATCCCT 
GCTCAGAGCGACXiGAAAAAAGGCAAGCCTTTTCAAACATAA ACAAGCCAGCTAT 
^^CGCAAGGGAAAAAAGAAAGCATCTAGATAAATATCTATCAAAATTAACTTT AAGAGAAAT 

A CTC ronj ccTi AAAAGCOCTT attttttaagacact agaaaataagtt actataaaaagtgg 

TGGI GiGGGGGCTAAAAACAAAACAAAAAAAATCC IC I 1 1 IC W A CATTTTTTA GI I 1 1 C I G 
>745

»<NCACCACGCGTCCGGGGGCGGGGCGGAGCA^ 
GTCCGCrCTTCOCGCAGCCAAGGGTGGGCGCCGGTCCT 
AGACAA AAAGA ACTGGGGTGCCCGGAGTGOCAGGTGGC^ 
CGGGGTCTTTAGGATTTGCAGCTOCAGtaAAGCGAGATGTOG^ 

CAAACCAGGGCAAGTT AAAGTCTTCAGAGCCCT GT AT ACGTTTGAACCCAGAACTCCAGATG 
AATTATACI 1 1 GAGGAAGGTGATATTATCT ACATTACT GACATGAGCGATACCAATTGGTGGA - 

aattccaagcaactatgtggctgagcaggcaga 
atocajtgacaatccattgcatgaagcag^ 

gtttggacaacagagt gggtgttaatggctt agacaaagctggaagcactgccttatactgg 
GCTTGCCACGGGGGCCACAAAGATATAGTGGAAATGCTATrrACTCAACCAAATATTGAACT

G>ACt^GCAGAACAAGTTGGGAGATACAGCTTTGCATGCTGCTGCCTGGAAGGGTTATGCA 

GATATCGTCCAGTTGCTTCTGGCAAAAGGTGCTAGAACAGACTrAAGAAACATTGAGAAGAA 

GCTGGCCrrCGACATGGCTACCAATGCTGCCTGTGCATCTCTCCTGAAAAAGAAACAGGGA 

ACAGATGCAGTTCGAACATTAAGCAATGCCGAGGACT ATCTCGATGATGAAGACTCAGATTA 

ATTCCI 1 1 C I CiGAGCTTTGAGATCTAAAACrnr CTGTTGCTTTTGCCATTCCAAAACTTTGTCTT 

TGCCAGAAAAGTGTTGGTAACTATAAAGAAAATTATATATGAACACGGCAGTGTTGCACTGTG 

TTTGAGTAGAACGTGTAAATGAATTGTTCCCACCTTTGGTTTGCCAGTAAGTGACTGGATTCT 

T GGCAC ATT TGTGTTCACC AAAGTAGAACAAGAAGATATTATTTCTATTTATCAAGCAAAAGG 

AATTTTAAGAI I II 1 1 II MCI 1 1 AAAAACAAATTAGGA I M I I I II I 1 1 1 1 I 1 1 I I I I II IAGTTA 

AAATGCTTTACCT CAATGGTTGAGATATTITGAATGGATTTTTCAAGGGG GGGAAATGCTT AT 

TATAAT AATAA AOC AAAAT ACTTAACAGAAAATTGTCAGCTATTCTGACAAAAATAAACATTTT 

GAGA GACTTT A 1 1 ICI 1 1 I GTCCGTTTCTGTGGTATCACTCATTGTCGTTAAGTAAGTAAAGCT 

TTTTATATTTAGGTAAGAACT GATTTTATTTTTT AAATT ATATTTTATATTTATTAGCACAGAAGA 

ATAATGAGAGCCACATTTTAGTTCAACTT 

>747 

GTCGAOCCACGCGTCCGACTTGGCC<MGGACXKX3CTGATGGGAGGATGGACATAC 
TGGTGTCTGAGTGCTCCGCGCGGCTGCTGCAGCAGGAAGAAGAGATTAAATCrrCTGACTGC 
TGAAATTGACCGGTTGAAAAACTGTGGCTGTTTAGGAGCT^ 
AAGAAGAAAATTTAAAATTAAAGTATCGACTGAATATTCTT(X5AAAGAGTCTTC^ 
GGAACAAAGGAACTAAAAATATGATTAACATTATTAGOCGCCT ACAAGAGGTCTTT GGTCATG 
CAATTAAGGCTGCATATCXAGATTTGGAAAATCXrrCCTCTGCT AGTGACACCAAGTCAGCAG 
GCCAAGTTTGGGGACTATCAGTGTAATAGTGCT ATGGGTATTTCTCAGGTGATGTATTGTCAT 
GACTCTTGGCTGTTTGA I UNI I AAGTATTATTATCATCATTGCCATTTACAGAAATAATACTA 
TTACAAGTTGT ATCCTTAGTGAAAAGGACATTTGCGACAGTTTGAAAAACTTGAG 
TGGGGGG^TAT ATGTTTT AACTTTTTT AGGCACAATTTTT AAGG TTTGG TT AAATTTT ATATGT 
ATTCTCAATATTTAAGGGCAATCATTGGTACT CI HIGH I AGGTATTTCCCTCCTGCTGTGTC 
CAGGATTGCTGTGTGGTGGTGATGAGTGCTGGGAGGTGAAAAATTAAAATAAGCCAriTACC 
AGTCAGCATCCCAATTAAATATTTGATGTAACTGTGATCTTTGAGCCAGGCTTATATATTC^ 
TTCAAGCAGAGGAGTTCCCCATTTTAAATAGAGGCATTGTCTGATGTGTTTATGGTTAACT GC 
ATCTGGCTTGGGTC I HCIGTTTTCCI II CM I GCTGAATTAGAAGGGGTT ACTCTGAAGAGT 
CCAGGTCTTACAGTGTGGTTT 
>748

ac 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 n 1 1 1 1 n 1 1 m n n I cattcaagaaagataattttacactt 
A 'HH!IT GAAAGAAA ^^ CTATGGAA ' nTCTTCI ICI aattaaattccaaaatacattctctc 

AACCCT ATGCCCTCAT ACT AGTAACTTGATGGTT AGCG GGT AAGTAGGT AGTAGT AAAAGAG 

CAAAAGGGGAAATTTGGGGAGCAAAAAAGGGAGAAAAAGAAAAAAAGGG^ 

CCT AAT AGAAAAGCT AGAGAATTCCATTCCTGAAAATTAAANN 

>749 

N CTAATT T AGAAGG 1 1 G 1 II ACAAACCCTTTGGAAAGGlAATCTTTTATTTCCCAATTTA 
AAAGGGG 1 1 luGAI I fCCI l AAGAAG AGGCAAAAl Mill I GTGTGGGAOCCATGCATAAAAA 
TTAAATGGATTTCCT AGT GGAGTTTT AT AAATTAAAAGGTCAAAACTGAAGCTTTTT ATTTAGG 
CI 1 1 rCCAAATCAAGGAT AAATATCTT ACGCACGAT ATCTAGT GACAGAGGAGAAAGTGGAAA 
ATGACCTCAAGGAAGCT ACGGGCACAACTGGAAGCTTTGT AGAAATTAAACATATTCAAGGA 
GCAAGAAATGAACTTT CAGCAG TCT AAATTTTCT AAAT AACCAAT AGTTGCCAATCT AAAAT G 
GCA GAGAA GATGAAATTTG ATAAACT GAA llll l lll AAAAATCCATTT ACCCT ACAGGTTTGC 
ATTTGTTT GCT GAAATTTACCI Mill I AGTTATATAT ATGTGTGTGTGTGTGTGTAATTTCTGC 
CAATAAATTCT AAATTACAAAGGT AAGAGAAAACCT AGT ACATTACT AAAT ATATAAAGTATAT 
G 1 1 C I GATTATCTAT ACTTGTT CTAGTQTCAAGTCTTTTTAAGTGG^^^ rTTGTTA 
TTGGACrTGAATG<^TTTTTGAGACT AGGTT AATTATTTTTGAGGTCTT ATCCT AAAAGGCAT C 
TAAGGT ACATGA AT GGAGT ATGGTGATTTTAT AACATTTTTTATCAGAATGGAAAAAGAACT G 
TTTAAAAGTTTGAT ACTTTT AAATAGTTGG 1 1 1 1 II IGC1 I ACTCTGGT AATGATTTTCTACAAA 
TACATAATAA ATTGI 1 1 1 1 1 4 GATTCTATATTCTGTATGCAGTTGAATATCCATT ACTT ATTCTG 
CTGTGCTTTAAT AGAATGGAATGTTT ACAGGCCCTT AAAAT ATT ATTTTTAAAAAACCTTCTGA 
AGATA CATACCAAAGTTTTTCCAACAAGATTTTATAATW 

ATTCT ATAAT AGTATAGTT ATT AAGGCAATTrTATGTTAGAGACTATTTTGTAATGTAGTGAGT 
ggta ccttta ta*.gaa.m.gt(l4.ctgccaatatat^

ttgag lllll aatgattattttaaatgtttatatagtttagtaaaatttgcatctcaaagtatca 
1 1 1 1 1 atattatgggacgttttcagattggct aatatrtgcattgtaaattttgtatgcagttta 
tctaaaattcaaaaatactgtcagtacaccagcgtttaacatctatattccaatttgtatacag 

TTTAAAATTGTACTGCAAAACTATTGTGTGCTCTTACACAGTATGCATCATATTGTTGTCTGTG 

aaattaaaggacatttgatagtctactgaatgtaaaatataatgcttggtatatgaatgan 

>750 

acatttgattgtggcatattcaactatgattttagacaagatgtgtgtgtgtgtgtgt 
gtgtgtgt agaca aaataaaattcagaaagagaaaatctattctacaatgaaattcaatctctt 

ACrTAGCTATTTTGAAATTGTGTCCCAATACCACATTAACAGAGCCAAAATGAAATTTA 

ATGGTTATACTATTATTCACACTAGGTAGGGTCAGGn Mil FGTCTGAATTAAATGGCTCCTT 

TACGCT AGCTACTTAGGAACGACTTCCCATACCCTCAAGCTAGAGTAATAGATACCTGAGCC 

ATTTTAAAAAGGAAGAGGGTTCCAATCAAGGTCrrCTATGAAGTTTAGAAATGGGTTCAAAGCC 

TAGTTGT 

>751 

ACATTTGATTGTGGCATATTCAACTATGATTTTAGACAAGATGTGTGTGTGTGTGTGT 
GTGTGTGTAGACAAAATAAAATTCAGAAAGAGAAAATCTATTCTACAATGAAATTCAATCTCTT 

acttagctattttgaaattgtgtcccaataccacattaacagagccaaaatgaaatttaaaatt 
atggttatactattattcacactaggtagggtcaggtttttttgtctgaattaaatggctcctt 

TACGCTAQCTACTTAGGAACCACTTCCCATACCCTCAAGCTAGAGTAATAGATACC^nAr^ 

ATTTTAAAAAGGAAGAGGGTTCCAATCAAGGTCTCTATGAAGTrTAGAAATGGGTTCAAAGCC 

TAGTTGT 

>752

•cgCGTCCGCGglcGCGTGGGTGAGCCATGTACTTCAAACAGAAGGCAGCGAATTACT 
AACTTCTGGTTGCT AGGTGTGGCTTCCTTTAAAATCCTATAAAATCAGAAGCOCAAGTCTCCA 
CTGC CAGTGTG AAATCTTCAGAGAAGAATTTCTCTTTAGTTCTTTGCAAGAAGGTAGAGATAA 
AGACACTTTTT CAAAAATGGCAATGGTATCAGAATTCCTCAAGCAGGCCTGGTTTATTGAAAA 
TGAAGAGCAGGAATATGTTCAAACTGTGAAGTCATCCAAAGGTGGTCCCGGATCAGCGGTG 
AGCCCCTATCCTACCTTCAATCCATCCTCGGATGTCGCTGCCTTGCATAAGGCCATAATGGT 

taaaggtgtggatgaagcaaccatcattgacattctaactaagcgaaacaatgcacagcgtc 

AACAGATCAAAGCAGCATATCTCCAGGAAACAGGAAAG CCCCTGGATGAAACACTGAAGAAA 

GCCCTTACAGGTCACCTTGAGGAGGTTGTTTTAGCTCTGCTA^ 

TGCTGATGAACTTC«KKrrGCX^TGAAGGGCCTTGGAACTGATGAAGATACT 

TTTTGGCATCAAGAACTAACAAAGAAATCAGAGACATTAACAGGGTCTACAGAGAGGAACTG 

AAGAGAGATCTGGCCAAAGACATAACCTCAGACACATCT GGAGATTTTCGGAACGCTTTGCT 

TTCTCTTGCTAAGGGTGACCGATXTrGAGGACTTTGGTGTGAATGAAGACTTGGCTGA 

ATGCCAGGGCCTTGTATGAAGCAGGAGAAAGGAGAAAGGGGACAGACGTAAACGTGTTCAA 

TACCATCCTTACCACCAGAAGCTATCCACAACTTCGCAGAGTGTTTC^ 

CAGTAAGCATGACATG^ACAAAGTTCTGGACCT^GGAGTTGAAAGGTGACATTGAGAAATGC^ 

tcacagctatcgtgaagtgcgccacaagcaaaccag c i ncn ig cagagaagcttcatcaa 

GCCATGAAAGGTGTTGGAACTCGCCATAAGGCATTGATCAGGATTATGGTTTCCX^ 

aattgacatgaatgatatcaaagg^ttctatcagaagatgtatggtatctccctttgocaagc 
catcct ggatgaaaccaaaggagattatgagaaaatcctggtggctctttgtggaggaaact 

AAACATTCCXrrrGATGGTCTCAAGCTATGATCAGAAGACTTT AATTATATATTTTCATCCTATA 

AGC TTAAA TAGGAAAGTTTCTTCAACAGGATTACAGTGTAGCTACCTACATGCTG 

AGCCTTTAAATCATTTTTATATTATAACTrCTGT 

TTTCCCCAAACCATAAAACGCTATACAAGTTGTTCT AGTAACAATACATGAGAAAGATGTCTA 
TGTAGCTGAAAATAAAATGACGTCACAAGGCAAAAAAAAAAAAAAAAAAAAAAG* 
aicaccacagcgocagccacag,



>753

TTTTTCACAGTTTAATATATCTGAAATCATGATGCATATGCAATTAATTTCAGAA7TTA 
GCTGCCTTTGITI 1 1 1 [ 1 ! II I II I I GGTATTATATAAAATAATAATGCATCTTACAGGGGAAGT 
CATAAATCCAATGAAATAAAGTATTTACCTGACATATTTTTTCCCATCTTCT^ 
TGAC TGGT TGTCCAGCCCCAAATTGTTGGACTTTTTT AAACAATTCACACTGACTGGCAGTCT 
TCACCTTTAAATAGTT G AGTTCCATCCCTTTAAAATCATTTAAAAAC ATGATTrTTAAATTTATC 
TCCATTACCTTATTTTGTGTTTAC I Mil I ACTTTTATTTATTTCCTCCTTCTATTAGATGAATAA 
AATGTTCAATAGCTCCTCTTGTTCCCAATTCCAGGTTTTTAAACTCT ATACATTTTGTAGTAAG 
TTTTAAG ATCTCAGTTT ACCTACTT ACTT ATAAATATTCAT AAG AATGT ATAAATTTATTTT ATG 
TTTCTrACTAATACTCTAAAACAAAAAATGATCCCCGCGTTAATTAAThfCTAGAAGTATCTCCG 
AGGGGG AGACCCG GTATCCCAN 
>754 

AC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GGTG G GGAGCTGTATTTATTTCCCAGGGCTGTCAAAACA 
AATATCCATAAATTGGGTGGATTAGAACAACAAAAATTTATTCTCTCTAGAGAAGAACGTTTTC 
TTGCCATTCCCTGGCTGCTGGTCATTGCTGGCAGTCCTTGTCCTTCCCTGACTAGTA^ 
ATCATTCTCATTT CTGCCT CTGTCTTCATATGGCTGTCATTTCACTGTGTGCTTGTCTCTGGG 
TCTTCAAGTGGCCTTTTTATAAGGACACTGGTCATTGGATGTAGGGCCTACCCCAAT 
>755 

CAGGACATGGGTTTTCAACTTCTGGATGGTATGATGGGGTGATAGTAGGGTATAAAAGTATC 
CTGAGAAGTTGAAAGCAGTGTGTGAATGGGGTGTTCTTTTCT CCCCACAATCCTTTCCCATC 
TGCTGACAGTAGAGTTAGCACCTCACAGATGCTTGGGCCTGGAAATGAAGCCATGAAAATGA 

agccctcagccttcttggagatcagagccatggtcctcacccacagcacatggg 

>756 

>757 

ACI IUI 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 I T 1 aaatgagtaggaagagatggtatcacaaacacaa 
agcacaggttactgtctttaaaaatttgcgttcttctattctccaatggaagtgggaacaaag 
agaaaacccctgtgtgtcctagcacaatatgggcatttgtgtggatttaataaatgggcattt 
ggattgttgggaaaatgtgatcaatcagcaggctatagaaacacagtttgatacgatggtga 
aaacttgtctagaatgatgtttttrcagaaatgttggtgtgattagaacaagtcagcaatgat 
gatgacaaaatatttacataatgttatagatgtggcttgctaatggaaatacctatctgaggc 

TGTTTAGGAATACACAAATTGAGAACCGTTTAGTTCAGTTTGCTTTAA^ 

AACCCTTTTTATGTTC»TGATCCTATGATTAGTAACATCTTACX^TT^ 

>758 

AGAAGCTTTCCAAG G AGTT ATGAAATCTTT GT AGAGCAG AGGGCAACT ATTATCACC 
TCAAGTCTXKTrCTCAGCAACAGCCCTTGT ATTTITCAGGGAGAAATCTCTAGGAAAAAAG T CA 
GACACCAGTGTAGTCCCCGCGTACCTGAGGGAGGAGAGACATAAATCCCTTCATCCCTAAG 
ACT GAACTATGT AACTAGCAGCCTCTGGl 1 1 G I 1 1 I C TACTCCCTGTCCCTCAGGATAAAATG 
TTGATATTGCTCATTTTCCTCATTTCCAACATTGTTTTAAAAC 
>759

ACTCCGATTGCCTCTCCCATGCTTCTCTGCTTTCCAAAGAAAAAACTGACCTTGTATA 
GATGCTGTCAGCTGATTGCAGTGCTCTTAACTrCTCCATTGTGAGTTGTTCAGTCTGAGGAG 
TTAGGTATAAACCCAGAGTGGTATTCTCTT7TCTGTTGTGTTTGGTTTTGCTT ACATATTCAGG 
AGCTGCTCTTTACCC<X^GAAC^TC<^TATATATGTTrTTTTCTGTTTCT AGATTTAAAAATAT 
TCCAGAAGCCTGGCCTCAAGATAGATAATATTTTACTTTT AATTTGGTGCCCTTGTTTTAGAG 
ATTTGAACT GGATTGCATTTAGT AAACCAAACCCCATCTTT AAAAAAATCATCTT AAAAAAAAA 
TAGTGGGCCGGACATGATGGCTCACATCTGTAAATCCGAGCACTTrGGGAGACTGAGGTGG 

TACANNNhftlNNNNNNNNNNNNNN 

>760 

>761 

ACAGATATAAAAAGGCT ACTATTCCAAGAACAAAATCCTGGAAACAAATGTCTATCAA 
GAAAGCAAAGATAATCTAAACAGCAGCATATTCATAGGATGACAAACTATTCAACCATTATAA 
AGAAAACCGAATCAAAAGCACTGGCTTATTAGACAAG AGTTTCCCAAACT ATCATGCT AAAAC 
AGTAACAGCGAGCTTCCAAATTAATGTTGC CI I 1 1 1 1 1 1 1 1 1 1 1 I I CCAAACTGAAAGGAGGGT

GGGGAAAACAAACGCATCATATGTAAAGCACTGAGTCCAGCCTG 

>762 

ACGCGGGTATGGTTTTACGAACAAATTTTTAAGGAAAAAAATTATCATGGTTCT AATC 
TTACATGTTAACATTTCCTTGTTATGTAGGGATCAGACTTGTTATAACATAATTCCACTTTATA 
ATTCAATGAAGAAGAAAGTTTTGTCTGATTCTGAGGTATGTAATATTTCATTATTATTACCATA 
TTG AT ATTCTCT AT AT AAAAAAATTT ACATATTGT AGTTTTCAG GT AAAAG CT GTTG T G AACATT 
ATTTTTTGTCTAGTGTAGTTAATTTAAAAAAAAAAAACAACTNN 
>763 
>764 

>765 

ACAGAAGCAATGTTTTTTGAAAGTTTTCTATCTGTGGTTTGTTGAATCCACAGATGCA 
GAACTCATGGAAACAGTGCCCACTGTATGTCACAATTTCAGAAAATCAGTATTTCATACAATC 
AGCTAATAGCCTAATTTGTTGAGCACAGAAAAATACACTGAACCAATTCTGATTATTGCAGAG 
AAATGATTGGCAGGATATTGGGAAATAGAATGAAGGG CGGAAAGAATTTCACATGGATTCAG 
TATACTCTCCGTCAGGAATTTTTGTTCGCTTGATCTTTTTGTGTTTATTGC 



>766 

ACAGAAGCAATGTTTTTTGAAAGTTTTCTATCTGTGGTTTGTTGAATCCACAGATGCA 
G AACTCATGGAAAGAGTG CCCACTG TATGTCACAATTTCAG AAAATCAGT ATTTCAT ACAAT C 
AGCT AAT AGCCT AATTTGTTGAG CACAGAAAAAT ACACTGAACCAATTCT GATT ATTGCAGAG 
AAATGATTGGCAGGATATTGGGAAATAGAATGAAGGG CGGAAAGAATTTCACATGGATTCAG 
TATACTCTCCGTCAGGAATTTTTGTTCCCTTGATCT 



>767 

NNGAAAATAACCAGAGTTGGTTTCTGTCATGTAGAATCAAAGGAGTCT AATGGAACC 
AAGTAGCAATGTTCCCQAAAACAAACAAACAAAAAACXXX^AAACATTTTGCT GTTTCTTTCCC 
CTCT GT ATTTG CT AACTTT ATGATGACTTTATTCTTAAAGCCT ATCACT GGTCTGCTTTT ATTAA 
TAGATTAGTGGAAATTTTCACCTGGCCTATTAGCAGCTTATAAAGAAATAGATrAAGAGTAGG 
AAATATATAGATGAAGATGTACTGTATAGAAGTTGTGT AAAATCAGT ATGAAAGTTCAATGTT 
GCTGTTCTTGCTCAGTGATTTTAAAGAAATTGAGTAGTTCCT ATGTGA 1 1 1 11 1 II IA ACAACT 
GCATGGTAAACTTAGATGACTCTTCCCCCTGGATTTTACCTGGGAGTGGCCTT^ 
TATTTAAAAGAGGGCAGGTTTGGCACTTTTATACTGATGTCACCAATGTTA^ 
ATCTCAGGAAGATTCATATTCTTTACAGCTGATACAGCACGGGCTGGAGCTCCTGCTAAGC^ 
AGCCTGTATACAGAAGCAAATAAAAGAAAAAGAAAATGTACAACTCAGGTGTAGCTCATCATA 
AAACACTTACAATGTCN 
>768 

>769 

ACTT ATTTTTTTACT AAGG I II I G 1 1 1 1 GGAGACTTGTTTGAAATAAAGTGATCCTCAT 
TCAGGATTTAGAAAGAAAAGTTATACTCCACATGCT AGGGATTAGGAAGGCT AATGTGAACT 
GATCAAAAGTATGAATTATGGAATGCCTTTAGAATAATCAACTTTTAGGT AATTTGATACTGCT 
ATAATTTCAAGCTTAGAGAAAAGTTGTAAGAATGGCATAAGGAACTCCT ATATATCCTTTATCT 
AGATTCACTAAATGTTCATTTTCTGCCATTTGTGTTATTCT 
CCTAACACCACCAGGGATAAACCAGTAGTCTGAN 
>770 

ACCTCTCA IMUI CACTTTTCAACACTTCCTGGCAGGCAGGCAGCATAACTGGTCCT 
GCTGGGTGATCCAGACCACACTCTGCAACTC I 1 1 CI I C I GAGCX^AGGCTCCCCTACTGTCTT 
TTCATTTATGT CAAGGCAGGGGAAGAGCTCAAAGGGCTCTTGCATCCCAGTCTCACT^ 
GAGAGGCACGAGGCCCTCCAGGATGTGGGGACAGGAACTTTGGG 

CCAGAAGATCACCAGGAGGGCT AAAT AGTAGAAAGGAGAGT CTT ATTGGTGATATGTTTGCA 
AACrrGGGAAA AGAT AGCCTCCAGTGTGGAGCAAAGATGC r CCI I CI I CAAAGAGGGCAAGG 
GCAGCTT GGATTTTGTGCCTT ACAGGGTCGGTATTATATAATAGAGTCATGCATATTCAGTAG 
G TTTGGGGGAAAAGCTATAT ATATTT ATGAGGGGAGCCAACT ACATGGGCAATGGATAAACA 
TAC^TGTAACACATCCCATGTTCACTT AGGGGCAGGATTTTAGCATTAAAATGAGGTGGAATT 
TGGCTCTTT ACATCAAAAGGTGAGCTA — 
AAGGGACrn'GAGGGCTACAGCrrGCTCATCTGGAAAGAATCCTTGTAAGACCAGTCCTCTGTC

CAACCAGAGTTAGGAGGCATCTGACAATTTGCCTGATTAGCTGT 

>771 

ACAAATAAAGTATTCCAAGGGTTCAGAATAGAAAATGATTTCTTCCAGCTTGGGGAC 
ATTTGGGAAATTGGGATATCCTTTGGGGAAATGTAGTAATCAGTATATTCTGGGAAAACATAG 
TAGAGAATGAATAAATAAATrCCATTGAATTTGGAATATGTTGTCCATTCTCCCTGTAACTAAT 
GCTATCAAGATAAAGTAGAAATACCACATTTCAGAAACAGCTGGAGTAGACAGGTCTTCATA 
GGCTAGCTTGGAAACCTAATAGCTATTAATAATGAAATTTTAATTATACTCTGGATTCTAAACA 
ATGAACACACAGTGATCTTTTTGACTTGCTGCTrGTTTATAAGTTGGGGATGGTCTGAATTCA 
TTGTTGATTCAAAAGAAAACGAATAGTGTTTTAAACCTTTTATTTACAGAGACT^ 
ACCTTCCTACCAACATATCTTAAATACACCAAGACCTTTTACAGAAGCTTAAACAAGGGTGTA 
CCTGCCCGGGCN 
>772 
>773 
>774 
>775 
>776 

GGGGGAAGAACNAGAGCTTTGTTTAGAGGCTGTTGTAGTAAT<X^AGGTAAAGGCTITTTAATC 
ATGTCCTGAACAATGATCA GCAATG GCAATGG AG ATG AC AGAACAG AATTT AAG AAGG AAT A 
AAAAAGGCTTGCTGACTACTTGGATGTGGGTGATGCTATCCTTTGAGACAAAGGANNNNN 
>777 

NNNNCCGCGGCGGCTGCTGCTGCTGCAGTGGGACAGGTGGCGGCGACCGGCGGC 
GTCCGAGGAGATTTAATCCAGAGACTGACTrTCACTATAGAACCCACAGTTC 
GGGGAAAGATAGTGGCAACAGGCAAAGGAGAAACAGCTCTGACATACAAAGAAAATGAGTA 

GAAGACACCTACGGCTGTTGTAGCTCCAGTAGAAAAAACGATATCCAGTGAAAAAGCATCAA 

GCACTCCATCATCTGAGACTCAGGAGGAATTTGTGGATGACTTTCGAGTTGGGGAGCGAGT 

TTGGGTGAATGGAAATAAGCCTGGATrTATCC^GTTTCTTGGAGAAACCCAGTTTGCACCAG 

GCCAGTGGGCTGGAATTGTTTTAGATGAACCCATAGGCAAGAACGATGGTTCGGTGGCAGG 

AGTTCGGTATTTCCAGTGTGAACCTTTAAAG^GCATATTTACCCGACC^ 

GAAGGTGCAAGCAGAAGATGAAGCTAATGGCCTGCAGACAACGCCGGCCTCGCGAGCTACT 

TCACCGCTGTGCACTTCTACG^CCAGCATGGTGTCTC^ 

CTTACAAAAACT GCCAGTGAATCTATCTCCAACCTTTCAGAGGCT GGCT CAATCAAGAAAGG 

AGAAAGAGAGCTCAAAATCGGAGACAGAGTATTGGTTGGTGGCACTAAGGCTGGTGTAGTC 

CGGT TICT 1 GGGGAGACCGACTTTGCCAAGGGGGAGTGGTGTGGCGTGGAGTTAGATGAG 

CCACTTGGGAAGAATGATGGCGCTGTTTGCTGGAACAAGGTATTTTCAGTGT^ 

TGGCTTGTTCGCTCCTGTCCACAAAGTTACCAAGATTGGCTTCCCTTCCA 

AAG CCAAGG CCAACG CAGT G AGG CGAG TGAT G G CGACCACGTCCGCCAGCCT GAAGCGCA 

GCCCTTCTGCCTCTTCCCTCAGCT CCATGAGCTCAGTGGCCTCCTCTGTGAGCAGCAGGCC 

CAGTCGGACAGGACTATTGACT GAAACCTCCTCXXX3TTACGOCAGGAAGATCTCCGGTACC 

ACTG^CCCTCCAGGAGGCCCTGAAG^AGAAGGAGGAGCACATTGAGCAGCTGCTG 

CXiGGATCTGGAGAGGGCGGAGGTGGCCAAGGCCACGAGCCACGT G GGG GAGATAGAGCA 

GGAGCTAGCTCTGGCCCGGGACGGACATGACCAGGATGTCCTGGAAT^ 

GAOCAGCTGCGAACAATGGTGGAAGCTGCTGACAGGGAGAAGGTGGAGCTTXrrC^ 

CTT GAAGAGGAGAAAAGGAAGGTT GAGGACCTTCAGTTCCGGGTTGAAGAAGAATCAATTAC 

CAAAGGTGATCTTGAGACGCAGACCAAACTGGAGCATGCCCGCATT 

AGCCTGCTCTTTGAAAAGACCAAAGCTGACAAACTCCAGAGGGA^ 

TGGCTACAGTTTCAGAAAAGTCACGTATAATGGAACTGGAGAAAGACCTAGCATTGAGAGTA 
CAGGAAGTAG CTGAG CTCCGAAGAAGGCTAGAGT(XAAT AAGCCTGCTGGGGATGT GGACA 
TGTCACTTTCCCTTTTGCAAGAGAT AAGCT CTTT GCAAGAAAAGTT AGAAGTCACCCGTACTG 
ACCACCAGAGAGAAATAACTTCTCTGAAGGAGCATTTTGGA 
AAGGAGATAAAGGCTCTGTATACCGCCACGGAAAAGCTTTC 

AAGCAAGCTGGAGCATGCCAACAAAGAGAACTCAG^TGTGATAGCTCT atggaagtccaaac 
TGGAGACTGCCATCGCATCGCACCAGCAGGCGATGGAAGAACT GAAGGTATCTTTCAGCAA 
AGGGCTTGGAACAGAGACGGCAGAATTTGCTGAACTAAAAACACAAATAGAGAAAATGAGAC

GCTAAAGAGATGGAAGCCTTGAGGGCTAAACTGATGAAAGTTATTAAAGAAAAGGAAAACAG 

TCTGGAAGCCATCAGGTCGAAACTGGACAAAGCAGAAGACCAGGATCTCGTAGAAATGGAA 

GACACGTTAAACAAATTACAGGAAGCTGAAATAAAGGTAAAGGAGCTAGAGGTACTGCAAGC 

CAAATGCAATGAACAAACCAAGGTTATTGATAATTTTACATCACAGCTGAAGGCT ACTGAAGA 

AAAGCTCTTGGATCTTGATGCACTrCGGAAAGCCAGTTCCGAAGGTAAATCGGAAATGAAGA 

AACTTAGACAGCAGCTTGAGGCAGCTGAGAAACAGATTAAACATTTAGAGATTGAAAAGAAT 

GCTGAAAGTAGCAAGGCTAGT AGCATTACCAGAGAG CTCCAGGGGAGAGAG CTAAAGCTT A 

CTAACCTTCAGGAAAATTTGAGTGAAGTCAGTCAAGTGAAAGAGACTTTGGAAAAAGAACTT 

CAGATTTTGAAAGAAAAGTTTGCTGAAGCTTCAGAGGAGGCAGTCTCTGTTCAGAGAAGTAT 

GCAAGAAACT GTAAATAAGTTACACCAAAAGG AGGAAC AGTTTAACATGCT gtcttct gactt 

GGAGAAGCTGAGAGAAAACTTAGCAGATATGGAGGCAAAATTTAGAGAGAAAGATGAGAGA 

gaagagcagctgataaaggcaaaggaaaaactggaaaatgacattgcagaaataatgaagat 
gtcaggagataactcttctcagctgacaaaaatgaacgatgaattacgtctgaaagaaagag 
atgtagaagaattacagctaaaactt acaaaggctaatgaaaatgcaagttttctgcaaaaaa 
gtattgaggacatgactgtcaaagctgaacagagccagcaagaagcagctaaaaagcatga 
ggaagaaaagaaagaattggagaggaaattgtcggacctggaaaagaaaatggaaacaagc 
cacaaccagtgtcag gagct gaaa gccag gtatgagagagccacttct gagacaaaaacca 
agcatgaagaaatcctacagaacctccagaagacgctgctggacacagaggagaagctgaa 
gggcgcacgggaggagaacagtggctitgctgcaggagctggaggagctgagaaagcaag 
ccgacaaagccaaagctgctcaaacagcggaagatgccatgcagataatggaacagatgac 
caaagagaagactgagactctggcctccttggaggacaccaagcaaacaaatgcaaaacta 
cagaatgaattggacacacttaaagaaaacaacttgaaaaatgtggaagagctcuvacaaatc 
aaaagaactcctgactgtagagaatcaaaaaatggaagaatttaggaaagaaatagaaacc^ 

cttgctgaggagctggggagaagcagggacgaagtcacaagtcatcaaaagctggaagaa 

gaaagatctgtgctcaataatcagttgttagaaatgaaaaaaagagaatccaagttc^taaaa 

gacgcagatgaagagaaagcttccttgcagaaatccatcagtataactagtgccttactcac 

agaaaaggatgccgagctggagaaact gagaaatgaggtcacagtgctcaggggagaaaac 

gcctctgccaagtccttgcattgagttgttcagactctagagtctgataaggtgaagctcga 

gctcaaggtaaagaacttggagcttcaactcaaagaaaacaagaggcagctcagcagctc 

cct aaattcagtaatagtggaccttcaaaggaagaatcaagacctcaagatgwukggtggaga 
tgatgtcagaagcagccctgaatgggaacggggatgacctaaacaattatgacagtgatgat 



ccacgacacagaggattgtoctacccaggcagagatgtcaqaggaccctccocattccaca 

caccatggcagtcggggtgaggaacgcc^tactgtgaaatctgtgagatgtttggacact 

gggccaccaactgcaatgacgacgaaaccttctgatgaagcctccagtggagaactgggct 

tgctcagacgcactcgcattgacacaacgtaacagcagcattgtct 

gaactcatgttattttttaaccccgtcaacaaatgtaggaaaatattttgatcttc 

gccctttagtxrrccccgt atgagtt agaataat aaat atttagt aggtgagttttcacctcga 

Al 1 1 IUI I I ICI I gatttttacgtttgaagacattgcaccagatcccattagatttattggccc 

CCCGAOCTTGTAGAAAAACCCCTACCCTGACAATACCTT ATTTAAGTAACTTTAAATTATGCC 
GTTACTTTTCATA7TTGCACTAAGATATTTCCAGGCTGCATTTGTATATTT 
AGCTTTGACACTGGAATGAGTn3AAAAAATGTGCCATTTTGCATTTTCA 
TATrTTATTCTTATTCAAAGAAAT ATCTGAG CTCTTTGCACT ACCTGTTATCAGT AGTGCCTTT 
ACTrCAGGCTTGATAATACTTAGGTGTGATTATAAAATCATGAAGCAGGTAAAGGGAGGGGC 
AAGCCCCCAAACT GCTG TGGG GACATTTT AT AAT CT AT ATGCTGCACCCACTT AATCT ACTG T 
GGTGTTTTGTTT ATT AGTTTTG CAT AATTTCAG CTTCT ATAT ATTGT ATGTATATATTTTTTAAA 
AATCT AT ATTTTGGGAAAAAAACATACACAATGTGT CT n CT II H GGACATTTACCTTTTTGA 
AAAAQ AAAACACTT AAAATGATCATTAGGACATAAC^GACTAGGCCAGACATAGGATCnTGTG 
QCTTTGCAACCATTTTCAI rTO TTTGTl I ICCTTTTAI tTCl 1 UACCAGATTTAAAT AACTGGAG 
GAATTTTCTCCAAI 1 1 1 1 1 1 1 ICCTTCTCTGGGAGGTATCCOCAGCAGTCAATTAACAATAAGC 
CAGTATAAAACACCTAAATAACCAATCTACAATCTCCCrTCACAA G HUM I AC IG1 1 1 1 1 AG 
ATGAATGTACGATGAGAAATTCAACGTTAATAATTTTGG Al I 1 I U H ATCACAAAAAAG AAAAT 
GAAGGACCTCAAAGCACCTGAACAGTTTATCGACCAGTTTGAATCT atttatcttcatttgaa
TGTCTTCTAGAATATGTAAAAAGTCATAAAATGTATCTTCCATGCT ACATGTACAATAAGAACT 
TCTATAATTGTATATATGCCTTTGATGTATTTTCCCCTCAAGATTATCAACTGTGTGTTCXjACA 

gtgaatattcaatctggtaccagttgaaatttttggttataaatgtaatacgaattgtt^ 
aacagaaaacatgtaaagcagtattaaaatttggccaaacaagtgttctgt atctactttaata 

AATGGTTATTCTTTTAAATGAAAAAGGAATTAGAGGAATATTGCGGCCGC 
ATAGGTAAAATCATTAAAAAAAATATTTAATTGT AAAAAATA 1 1 M I I II I GGGGGGGCCTCTC 
AANNNNNNNNNNNNNNNNNNN 
>778 

AGTCTTTATCTGT7AGAGATATACATTGAAGTATTTAAAGGTGAATGATATGTTTGGG 
ATTTACTTTAAAATAATTTCAGCCCCCGCGAAAAGAGTAAGGTGTrGGGGAGGGGAAGT ATA 
AGT AAAAT AAGAATGGTGGACTATTGATAATTATTGAAGTTAGGCGGGAATACACACAGGAA 

crrc^tgatactattcttactattttttggtatgttaaaaaatttc^ntaat 
aataccagttcttcacatagtc^rrccaaaaaaatagaagaggagggaatactrcgcvvgcttct 

ctccgg^jcg^^ 

cccggcx5c«cx^tgaccgtcxicgcxjgcggagcgtgccc^^ 
gagctgcccxxm^gctgctgctggtgctgttgtgcctgccggccgtgtggggtgactgt 
ggcctt cccccagatgt acctaatgcccagccagctttggaaggccgtacuuvgttttcccg 
aggatactgtaataacgtacaaatgtgaagaaagctttgtgaattcctggcgagaag^ 

AGTGATCTGCXrrTAAGGGCAGTCAATGGTCAGATATTGACACGAGTTCTGGAATC^TAGCT 
C^lAGGTGCCAACAAGGCTAAATTCTGGATCX!XrrCAAACAGCX!rrT ATATCACTCAGAATTATTT 
TCCAGTCXjGTACTGTTGTGGAATATGAGTGCCGTCCJAGGTTACSAGAAGAGAACCTTCTCT AT 
C^CCAAAACTAACrTTGCCTTCAGAATTC^^ 

CCGCACAATCATGCCCTAATCCGGCGAGAAATACGAAATGGTCAGCATCGCATGTACCAGG 

TGCCATATTCATCCGGTGCAACATCTCCTTCTCMTGTAACACAGGGTACAAATTATTTGGCTC 

GACTTCCTAGTCAACGTCnTACTTCCAGGC^AGCrrACTGTCCAAGTGGAGCTGA 

AGAGTGCAGAAAACTTCAATTGTGCAAGAAACAGCACAACTCGACCATGCGAATAATCCACG 

GCGAATCTC^CACATGGCITTCXiACTACCTTGTAACGCTATGTCCG 

TTACACTGCCACCCCTCTATAGCAGCC*5CCACCCCC^ 

CAAGCACCAACCACAATTCTCCiACAACMCTCCCCCCTrrTOGTACACACT ACAAAACCCAOC 
CX^CCAAATGCTCAGCCAACACGGAGTAC^XTTG^ 

AACAACCCCAAAT AAAGGAAGTGGAACCACTTCAGGTACT ACCCGTCTT CT ATCTGGGTTCT 
CGTGCCTGTCACCCCAGGCTGGT ATG CG GTG G GTGTGATCGT AG CTCACT GCAG TCTCGAA 
CTCGTOGflTrCAAQCOATOCrroCACTTCA OCCro 
CCACGACACCCGGCTAAGTTTTTGAAATTT ATTTTTTGTAGAGACA^ 

CCAGGCTGGTTTCAAAjCTCCTGGCCGT AAGCGATTTTTCGGGCCTCCCAAAACGTTGCXaATT 
AT AAGTGTGAGCCACTGCACCTGGCCCCAC A I HIGH I ATCCATTTGTACATTGATGGACAC 
TTAAGATGATTCCATATCTTTGCTATTCT 

>779 

GGGGGTTTTACCATGTTGGCCAGGCTGGTCTTGAACTCrrTG 
CCTGCCT CCCAAAGTGCTGGGACTACAGGCGTGAGCCACCATGCATGGCCATTGATTTCTT 
AATTATAAAGATACATTAGATAATTCATTGCTCTTTATCATTC 

AGTGGATCTGAAAATAA1TTCAGTATATATAAATTATATATCATTATCACATAAAGTTGATGAA 
AT AGT AT AT AGAATGTTTGT ATGCTCTAAGACT AACTTTT AGATTTTCTTTTT ATTCTCTCAATG 
GTGTGCAAAGTATAAATCAAAGCCATTCTrGT GAAAAACTGCTTT CCTGGGTGAAAAGCAG T G 
GCTGTGCCAGAGTGA ITGTTCTTI CAAG CAGTC ATTCATATCAGGGTAATGATCTGCAGCTTC 
GTAGGT ATGTTTCTGCCTCTGGAAAGTTATTTTTGGTATGGTTTATACT ATGATAATTTCTCTA 
TTAAAAAGTTAAACTTT AAAAGCCAGOTAGATGCTT ACCTC1 I C I I A CTGGAGTTTAGAAAGA 
GGGGGGTAAGANN 
>780

CACGCGTCCGGTCGAjOGCCTCCTiCATGCG 



GCCCGGGGGCGCTCGAGCTAGGGT GACAGGCCCGGCCTCTAG 
GGGAGGCCCGAGCCGGCGGGCGCCCCGGCCCCGCGTCT AGTTGTTGATGAAGCATGTCG
G CCACCAGCGTG GACACCCAGAGAACAAAAGGACAAG AT AAT AAAGT ACAAAATGG TTCGTT 
ACATCAGAAGGATACAGTTC*TGAC*ATGAC^ 

AGAGTAACAGTT ACXXXTTCAATG AGCGACCCCT ACCTG T CCAG CT ATT AGCCGCCGTCCATT 
GGATTTCCTTACTCCCTCAATGAGGCTCCGTGGTCTACT GCAGGGGACCCTCOGATTOCATA 
CCTCAOCAOCTACGGACAGCTCAGTAAjOGGAGACCATCATTTTATGC ^ 

CCCTGGGTTCTC^CATGGGGGACAAGTGGGTCTCAAGGTCAGC^^ 
GTATGG GAGC AGCTACACCT ACCCCCCX3AGCTCCCTGGGTGGCACGGTGGTTGATGGGCA 
GCCAGGCTTTCACAGCttACAC^TCAGC^AGGttXXCT^ 

C^TTGGCACCTGGGATAACAAGGGGCCTGTGCCGAAGGC«X^GTC«XXAGCAGGCACC 

CTCTCCACAGGCTGCCCCACaGCCCCAGCAGCTGGCTCAGCCTCT cccagcacagccccc 

AGCTTTGGCTrCAACCGCAGTATCAGAGCraTCAGCAGCCACCXX^ 

CCACXJCAACAGAAACGCGGCGTTTGGGCAGAGCGGAGGGGCTGGC^^ 

CCTGGAAAGGTCCAGCCTAATTCTGCCCCCAGCGTCGAATCCCyVCCC^ 

GAAG GCTGCTCACAGCT AGAACCCGAAAGAG TTTGAG TGGAATCT GAAAAGCGGGCGTGTG 

TTCATGATCAAGAGCTACTCTGAGGACGACATCCACCGCTCCATTAAGT ACTCCATCTGGTG 

tagcacagagcaccgcaacaagcgcctggacagcgccttccgctgcatgagcagcaaggg 

gcccgtctacctgctcxtcagcctcaatgggagtgggcatttttgtggg^ 

aagtcccccgtggactacggcacc^tgccggggtctggttjtcaggacaa^ 

aagtttgatgtccagtggatttttgttaaggatgtacgcaataacca^ 

gctggagaat aacciacaagaaaccggtcacaaact cccg g gacaccc ag gag g tgccctt a 

gaaaaagccaagcaagtgct gaaaattatcagttcctacaagcacacaacctccatcttcga 

cgaci 1 1 gc i cactacgagaagcgccaggaggaggaggagctggtgggcaag&aacggga 

ga gtcg aaacaaacaatgagggcgaaccagtttcttacatgttct aacgtttgactttgaaaa 

cag tttaaa ac acct gtgcttggtcagctccagtgtgtcgtcc03tg 

TGCAICIIIGCCI I ICI I G I CG 7 7 GATTTTTGCCCJAGATGGATCTGCATrrATTTGTACTTTTT 
CTATGTATTATAATCCTGTAGAAGTCACTAATAAAGGAGT A I HIII I I G TCAGCTT ATCAATC 
AGACTGATCTAATGTGAAATGTAAGTATCCTTAAAAACAAAGCATCT ATTTTGGCAGAAATTG 
TGTTCTTAAATTCAGTCATTTGATATTCTGTGAGACTTCATATTTCTCATC^ 
TAGCAAACATAAGAAACCATGAGTCATTTTGTCATTTAGAGTATTCTGATAAAATCT 
ATACTGAAATCAAAAGGTTAATGATTTT TTGTTC ATrCTGATTrGTCA 
GGTCT AAAG TCCTAATTTACCCATTr GATTTTTCTGCrAGACAGATAACT^ 
TTTGGCAGAGACI 1 II 1 1 1 1 1 1 1 1 1 1 IGAAAATCI I ICCI I OCAGATCTrGTTGCCCACTGAACA 
GCCACCCGTCCCTCACTQTCCT GG TGTCCGATTGGGCT GQATGGTGTTGG GGCATGATGTG 
T GGAGGA A CTGGA AGG TGCTT TAGGTCTGGTTCAGGGTCGGGCATT C 1 1 I UI I U I I IG CACA 
TCTTTTT AAATTTT ACACG I 1 1 ICI I AAGAATTCT AATGCCGTCTT AAGTTTTT ATACCAATAAT 
GCTGAGCTTT AAGTGTAGGATCrTGGTAGTACAGACAGTCrrGATGGATGATGCTGCTGGTTGT 
AAATTTCATCGTGTGTG7 CT AA I TI IT I ITCCTGTTGAATGGGTAAAAACAAAACAAAACTTTT 
TTTA GAAGAT GAATTTGCTGTCA rGTTTTGT G GAATGAGGGACOGTTGAGCT CACTACCACC 
TGGAGTTTGAGTTGAAGCATGAAAATGGTGCCCATGGCTGACGCT^ 

CAOGTGO0C TTGTAQAO GATOCT TAO0G TOCT AGAGAGCAGACG C 1 1 I C I GAAAACTACTTC 
C TCCAAAA GAOCCTCTGAGTTAACGTTTCAGCnT^ 

ACI ICCICI GAACTGTrACTGC*rrGAATGGAGTGCTGGAC*»ACATTGGG 1 1 1 1 rOCTCTAGG 

AGAATACAAGCCTTAATAAACAATACTATTTAGCAAAAAAAAAAAAAACT 

CCAGGGGGGGOCCGGTACO^AATTCGCCCTTAT AGTGAGTCGT ATTACAATTCACTGGCCGT 

CGTTTTACAAOGTCGTGACNNNN 

>781

•°0 c *** 3C GGTGCGAAAGCCOCXjGACTCGTGGAGTTGTGAA 
GCCGCACAAACCAGGGCTCGCCATGAAGCCAGGATTCAGT^ 



_ _ GAGGOGGCGGC 

" GGOQGTGGAGGAGQ»GGAAGAATGTGAIGflTGGAGOOGCATCGCCATQAGGG I G I C 1 1 C AT 
I i U < tAaAGGAAAGGAAGATGCACTGGTCACCAAGAACCTGGTCGC I GGGGAATCAGTTTAT 
GGAGAGAAGAGAGTCTCGATTTOGGAAGGAGATGACAAAATTGAGT 
GCnnrCCGCTCCAAGCTAGCAGCAGCAATCCTGGGTGGTGTGG 
GGGGGCTAAGGTTCTCTACCTC^GGGCTGTCTCGGGCACCACGGTCTC

ATCGTTGG TGCGGATGG TCT AGTCT ATGCAGT CGAG TTCT CCCACCGCT CT GGCCGTG ACC 

TCATTAACTTGGCOVAGAAGAGGACCAACATC^TTCCTGTGAT 

C^CAAATACCGCATGCT(^TCGCAATGGTGGATGTGATCnTTGCrr^ 

CX^GACCCGGATTGTGGCCCTGAATGCCCACACCTT 

ATTTCCATTAAGGCCAACTGCATTGACTCCACAGC^ 

AGTGAAAAAGATGCAACAGG^GAACATGAAGCCGCAGGAGCAGTTGACCCrrTGAGCCATAT 
GAAAGAGACCATGCCGT6GTCX5TGGGAGTGTACAGGCCACX3CCCCAAGGT 
AGTTCAGCGCTGTCAGGATTGCGAGAGATGTGTGTTGATACT Gc*OWtt©aatutlcxctixaacactt: 
atgugagacttaatccctaatpogcaatactga^ 

cttcattcatgggtt aatggattaatgggttatciacaggaatgggactggtggcttt ataag 
aagaggaaaagagaactgagctagc^tgcccagcccacagagaggctccactagAgtgatg 
gt aagtggaaatgtgaggtgcagctgccacagagggcccc^accagggaaatgtct agtgt 

CTAGTGGATCCAGGC»kOW3GAAAG^TGanTGTGGAGCG 

ACCACCAGGAGCCX>GAACTGTGGAGTCAGTGGCAGCATGCAG fGGGAAAGC 

GGCACGGGGCT ACCTGAGGCX7TTGGGGGCCCAATCCCT GCTTCXAGTGTGTCCGTGAGGCA 
GCACACGAAGTCAAAAGAGATTATTCTCTrCCCACAGATACCTTTTCT . 
CAGCATCT GCTTCATTCCXXXrCACCTTCCCAGGCTGATCTGAGGT AAAcfltGaagaaaaTAAAAG 
CTgtgtnpapo 
>782

rWACAGAGAGTGGTACATGAATGACTAGTTTTCTAAGATG I GG 1 1 1 1 1 ATTGTGAAT A 
AAATATAAAAGTTAAAGGCXXn-CTGCTAAGTCACATAAAGTACAGCATATAAGTTCATATAGG 
TACAAATAAATGAGTTTGCAGTGAATTGGGCCTTCAAATTACCTCAAGTGACAGATAGT 
AAAGGMCI I UAGCAGGTGGAGGTCACTGAATCCCCTACT ATGCACTT ATCAAGATTTTACTT 
ACTTT AATTTACTGGAAATTGATTTTTTAAAAAATGACT ACACTGT AACAAGGGAAGGGATCT 
GGGTTTTTTTGTTGTTT1 ATTCTTGI III IT I TAAGTAGTTCAAATTCTGAAACTGTGATTTAAA 
AATTTTTT ACAGTCAAGCA TTCTT^ TTTTGAACATAACTCCCrrrCC 

TCTTCTCTGTTATCTCTT AAA 1 1 1 IGTT ACATCTCCCTCAGCCI C I II G 1 1 I G 1 CCGTCTCCCTT 
CTGTCATTGTCT ATGGATGTTT ACCTCTCTGTTCT(Xrr AAAAG TTTGU^AGATTAGGTCAACTCT 
TATTTCT AGTTCATTGGT AATTTAATCTTAAI I T T 1 TIT T03TGATTTTTGTTGGTTGT AT AATCT 
GCTGACGT ATTTTTAT ACTCAAGTGTAGTTTTCT ATT AAAAAGAAAAG TGGTTGGATT AAAAAT 
AGT AAGCTATGTAACCCTCATGTTACTTIXACTTTCAAATATTGGGTACCT 
AGAGATTATGTAATCCT ATTATAGT ATGTTTGG 1 1 ICCI 1 1 ATTGTTGGATTTT ACATTCT GATT 
"^^^^^CClCCMAAAATG^ATATCKTGMMiACTM3^CI^CnYAITnSCM/G^GTTT, AGAAA 
GGTGTT AAAAATGTAAAGCAAAQAGTCTTAAC IT TCTOC1 AATTGGGAGAAAAATGCTTTAAC 
ATTAC TATAAT AATATTOCAGGT1TGGAGGGGGTCTCCAGGCCOCAT A 1 1 I G C IG I I AATAGT 
TGGACXnTTTAGACCATGTGTTATTTGCAAT«X>GAATGATTG 

GATACTATTC 1 1 1 IC I 1 1 CTGTACAAG TGCA ATAC I OOCC 1 1 G AAGTCTTAAAAACTATGGTGA 
tttt nm ICIMICI^^CT^XT^TCCTT 

GTGGAAACCACT AGTCTGATCTCATGGT ATCTTTATTTAAGCTAAATTTGCATGGAAATTAGTA 
ATCTTTTGCTTGAAAAATGTGTCCT AAAGTTGAACTTTTT ACAGATTGAATTJTTCTT AGACCCT 
CGOC CAATGC TCTAAATTAAGAACCT AATACTT AATATTTTTATTTTACTTCTOCOCTTTTAGAA 
ATAAA CTTTTAAA TAAAAGCAAAGCACTTAGCTGAGTTTTAAACAC^ 

AGAAAI II II Ml AAAAAT ATTTGGAGCAGTCC I G 1 1 1 1 tJATACAAATTTAAGTAAGAGGTATT 
TTTCTTATACATATTT ATATCTAGTGTGCTAA 1 1 1 ICI 1 1 1 1 1 1 AT ACCTGTGTCCCTGT AGTAA 
AACTGCTGT AAT AT AAAT AC A I G 1 1 1 IGI I AAAAGATAAC AI MCI I IG GC AI I ICI 1 1 I AAAG 
GCAGTTACTXXATTTCTGCATTTGT ACAGT ATGTGTCTTGGOCATTTTAGATA1 rCTTTCTTTA 
ACAATACt^AAGGTAATTAGAGTATTTTAAAGACT AATTGCTTGACAGTTTCTAGGGTATTTTG 
TGTTTTAGAANNNNNNNNKNNNNN 
>783 

ATTATTTTATATGTT ATGTAT ATATTTAGGAGACAAATGAACATTTGTCTGT ATCTGAA 
TCAACTGTACCAOCCAAAAAGTGTOTGCCACiACTGCAl I faUCC I GGGACATAAAACTGGAAG 
C TAAAOCT AGTAT A 1 1 1 ICCI I AC^TCCTCTGGGT AGCAGGAGTCATGTTTGACAGCTTCTCT 
GCTTTTCTGACGCATGGCI ICCI 1 1 IGI I CACAATTTTCTTACTGT AAGTCTTTCAGAGGTGG 
T0AGGC^GAG*^0C^GTTGATAGA0AAGTTTTCAAGATCT ATTTTCCCTTnATTCT AGGCT

ggggt gtcat agaaagtttact>attt act agagtg gacactggagtacact agaccagtt a 
gtactcttcactgtcrrttgccatgaaactttataacatggctctccaggtgttgaatctggtg 
ccctgtcaccctgtgctcagggaacacatg6cggcaatcagcatgtgaggcgcagaggqa 
gggcaagctccccttgtgatatttgaggtatcagctgactcaagtctctctcccttctctcc 
ttattctcatgctacctctcccaaccattgtcttaacttccctggccaggatgcctgccatat 
tagatggagaggaggcagtttctaaatggcttgactttggtgaagtctcaactcaagaagct 
ctgaaattaatccacccaacagagaacattaccttccann 

>784 
>785 

acaagaggatatgtgtgcattacatgcaaccactacaccatttaatatctggggtgt 
gagtatccgtgggttttgggtatccgtgggggtcctggaaccaatttctcctggatactga 
gggatgactggattactgtgtgtttgtgtgcttgtttttaagcttcaaaagattatgtgatct 
aggagttgttagattttattattggtcttaaaagataagctt agatgtgtta c i rm I GGAGT 
tttagtttacagtgattcatgaatcgggcagcttcagaccacaggagacatgaagcaggtag 
aagtttaagaaagcttgacaagcaaaatatttgatttggn 

>786 

actaaaactaaaactgagc^gtttaa>ac^ttcatttaaju3ggatatctaatgtgttta 
ttattaacataaataatgttttatgaaaaatgtaaccttagttttccaaaacaaaa^tgtttagg 

GCAAGAGT AACATT ATTTTACATT ATT GCATCTCAG TGAAAAAT AAATGG CAA CAAAATTCTT A 

tatctgcttctgcagttaattctgttcattttgttttggttgaaatatatgaaggaaatctgtc 
ctcacacagttgtgtagtggaaaaagggggactattgtaacaggctgtgcacataattgtgg 
atgai ii ici i igatacaacaacaaaacttn 

>787

NNNCGTCCTGGTTCTTTGAATCGATGGCTCACTCCCAGCCACTtTTTGACTGACTGCC 
AGCTTAGCAGGGGGTAGCTTACCAACAAATC 1 1 1 II 1 1 1 I 1 1 1 1 AAAGCCAGACTGATTCATA 

gaaactcctttacactctccagcctct caccgcaaaattacacaccccagtacacgagcaga 
ggaaacttataacctcgggaggcaggtccttcccctcagtgcggtcacatacttccagaag 
agcggaccagggctgctgccagcacctgccactcagagcgcctctgtcgctgggaccctt 
CAGAACTCT CTTTGCTCAGAAGTT ACCAAAAAAAAAAG AGCCAAC atgttggt attgctggct 
GGTATCTTTGTGGTCCACATCGCTACTGTTATTATGC7ATTTGTTAGCACCATTGCX1AATGTC 
TGGTTGGTTTCCAATACGGTAGATGCATCAGTAGGTCTTTGGAAAAACTGTACCAACATTAG 
CTGGAGTGACAGCCTGTCATATGCCAGTGAAGATGCCCTCAAGACAGTGCAGGCCTTCATG 
ATTCTCTCT ATCATCTTCTGTGTCATTGGCCTCCTGGTCTTCGTGTTGCAG CTCTT CACCATG 
GAGAAGCGAAAGCGGTTCTTCCTCTCAGGGGCCACCACACTGGTCTGCTGGCTGTGCATTC 
TTGTGGGGGTGTCCATCT ACACTAGTCATTATGCGAATCGTGATGGAACGCAGTATCACCAC 
GGCT ATTCCT ACATCCTGGGCTGGATCT GC rTCTGCTTCAGCTTCATCATCGGCGTTCT CT A 
TCTGGTCCTGAGAAAGAAATAAGGCCGGACGAGTTCATGGGGATCTGGGGGGTGGGGAGG 



GGAGGGGGGAGGGGGAAGGAAAGGGGGGAGGTCAAATCXXIAAACCATTAGTGAGGGGA^ 
CTCTACTGCCAAG CCCCTGCGCTGGGGAG AAAGT AG TTGGCT AGT AC7T7 G ATGCTCCCTTG 
ATGGGGTGGAGAGAGCCTGCCTGGAGCCACCAGAGTTGGCtjrCCAGCTG I IC I 1AGTGACA 
CACAXJTGTCTGGGGCXXX^TCAGCTGCCACAAC^CCAGCCCCACTTCTGGGTCATGCACTG 
AGGTCCACAGACCTACTGCACTGAGTT AAAATAGCGGTACAAGTTCTGGCAAGAGCAGATAC 
TGTCTTTGTGCTGAATACGCTAAGCCTGGAAGCCATCCTGCCCT^ 

ATCACATTOCAGTCTGAAGTGCCT ACTGGGGGGCTTTGGCCTGTGAGCCATTGTCCCTCTTT 
GGAACAGATATTTAGCTCTGTGGAATTCAGTGACAAAATGGGAGGAGGAAAGAGAGTTTGTA 
AGGTCATGCTGGTGGGTT AGCTAAACCAAGAAGGAGACCTTTTCACAATGGAAAACCTGGG 
GGATGGTCAGAGCCCAGTCGAGACCTCACACACGGCTGTCCCTCATGGAGACCT CATGCCA 
TGGTCTTTGCT AGGCCTCTTGCTGAAAGCCAAGGCAGCTCTTCTGGAGTTTCTCT 
CTAGTGAACAATTCGtGTGGTAAAAGTACCACACAAACT ATGGGATCGAAGGGGCAGTCTTGC 
AACAGTGCCATGTT AGGGTTAT G 1 1 1 1 I A GGATTCCCCTCAATGCAGTCAGTG I ITCTTTTA A 
GTATACAACAGG AGAG AGATGGACATGGCTCATTGTAGCACAATCGTATTACT CTT CCTCT AA 
CA M I MG AGGAAGI 1 1 1 G I C I AATTATCAATATTGAGGATCAGGGCTCCTAGGCTCAGTGGT 
AfiCTCTGGCTTAGACACCACCTGGAGTGATCAGCTCTTGGGGAGCCreCCT 
GAGGTGAGGCATGGCAATTCTGGAAGCTGATTAAAACACACATAAACGAAAACCAAACAACA 
G GCGCTTG GGTG AAAGG 7G CT AT AT AATTGT G AAGT A i ■ AAGCC i ACCGTA i 1 1 CAGCCATG

ataagaacagagtgcctgcattcccaggaaaatacgaaaatcccatgagataaataaaaatat 

AGGTGATGGGCAGATCTTTTCTTTAAAATAAAAAAGCAAAAACTCTTGTGGT acctagtcaga 

tggtagac<^ctgtctgctgcc<k^ggagcacctctatacaggacttagaagtagtatgtt 
attcctggttaagcaggcattgctttgccctggagcagct^ 

gtctaaaggggttttttgggaagacg i men tatcgccctgagaagatctaccccaggga 
ga atctg agacatcttgccta c i men i attagctttctcctcatcca i rrcm i atacctt 
tcctttttggggagttgttatgccatgatttttggtatttatgtaaaaggattattactaatt^ 



gaggttagagagattggccagcaaaaactgtgggaagatgaactttgtcattatgatttcat 

tatcacatgattatagaaggctgtcrragtgcaaaaaacatacttacatttcagacatatccaa 

agggaatactcacattttgttaagaagttgajvctatgactggagtaaaccatgtattccctta 

TCTTTTACI 1 1 1 1 1 ICTGTGACATTTATGTCTCATGTAATTTGCATTACTCTGGTGGATTGTTCT 
AGTACTGTATTGGGCTTCTTCGTTAATAGATrATTTCAT AT ACT AT AATTGT AAATATTTTGATA 
CAAATGTTTATAACTCT AGGGATATAAAAACAGATTCTGATTCCCTTGAATGTGTGAATGTTTT 
TTTCTAAAAAAAATGTGGAGAAATATGGATAATTATGACATTrATCCCTCATTAAGCTGCCTAT 
CAGTTTGATTTGGAGAACTTGACATTTATTTGAGACATTAAGCTACTTTCTGGTAATATATTAG 
GCATTTCTGCAATAGCTCTTTCAGGTAACTGAATATTATTAAGCATAGTTTTATCTTGCTTTG^ 
1TAAACCTCTTAGGCAAAAAATGGAACTTCATAAGGTAATACATTAGAAAGGGGTTATGATTA 
TAAATCAGA^TGCTTGTGACATTAAGAAATGAGGCACTTGTGAA AI IICH I GAAATAGCCA 
GCTCCTCTAATGTGTCTTCAAAATATAAAGTGATfCACAAAGGCATGCATCACACCTATTTGT 
AGCAGCCCATTCATTACATAAAGCAGGGCATACCTGTGTGGGCTCTGTGAGTGAAGGGAGG 
CTTCACTACTTTCTGTGAGCAGTAAGGACTGGT ATCTTT CTGTGAGCAAT AAGGACTGG ATAA 
AGACTGCATATCCTTGTGTCGTGTCAGCACCAATACAATAAGGAGGGTTTTAATGTGAAGCA 
GGCAATCTTCCAGGCCCTTCTGGTCTTGGATGAAATAGTTGCACAGAGTATTGCACCAAAAA 
TACACAATGGAGGCTGAAAAGTTCAACATATTTTAAGTCAATTAATCAAATTGCATTGATTCTT 
gatgctttcttagagg cct ACATGATTTCTT AGATTG CTCTGATAAACTATCATAAGGG GTCC 
ACCTCCCCTCATTTAGCTCCCCCAGGGA U 1 G 1 t t tc^cccatgtcatacacccagtcctaaa 
TCAACCCCCAAGGCTATCGTTCGATGCCTTCTGCAG AGGGAAC I I 1 1 G I GAG ACTCT GCAAC 
AAACTCCTAGCTCTATCCAGAGTGTCCTCTGGTGCTAAGATTGGTAT C H I CI CCTCAAAAGC 
CTGGATGGTGAATGGGGGTGCATTAGTCAGAATTCTCCAGAGAAACAGAAAAAATAAGATTC 

>788 
>789 

ACTTTAA J I ICI I IATAATTTGTTCAGCTATTTAAAAAGATAATCCACAATCTCCTACC 
GCGATTAGAGCACAGGAAAAAAAAATTCAAAAAT AAAGG AAAAACATGGCTCAT AT ATCT ACA 
GAAGTCACAAAAATACTATAGGGCACATATACCCAGGCCTCAGCGGTGGGAAGAAAAGATAC 
AACCACCGGGGAAAATGTTTGAACACTrGAAGACGGGAATTTTTTAGGG 
TGTTGAAGGTAACT GGGAAAGTCCTGGAT AGAAATAGANNN NN 
>790 

nnnaaattttaattgttaaaataacaaactttgcagtgacagagtctctttttggcag 
ccatgaataactgtactcaagtcgccctt atggagcccttgattcaggcttcaatagtgtgg 
acagtggtg at aagagatggtagggaatgaagtaagtgtttttt atgttccgtgtgtt ataac 
acctgattaagagaaaacagaatgatgaaaatgaaaagcgtctt aactggattcagtttctca 

CTACATAAAATAGAGAAAAGTCAAGGTGGAGGCAAGATTCCX^CCCTCTCCAGCAGAATTGG 
CATTCTGCGTCCTTACCGGCTTTCTGTCACGTGGATTTCCGGCTGTTTCCT 
GAAATAGTTTCATATCAT AGAAAGGCAAACAGGAGCTGAGCCAGTTGAAACrrGAAGCCT ACA 
ATCTGAGGTGGGGGGTAATCTCGAGCAGAGGTGCTAGATGGTGAGAAAGCAAGTATGACTT 
TCGGCTGATGGGT ATAGACAAGGACCTTACT ATAGAGT ATTCATGTGCTTCAG AGGAATAAC 
N 

>791 



actaatt c 1 1 h cctctttcctagaccgattct agtttgttgccttccctttcctogg 
aaaccccaagtttgtggatgctgcagacactct gtgcccccctgcatgctgggtgcctggc 
cagctgccagggcataaagacagagacgatgtggcctttgtccttaagaatgaggtttgam 
gccccagttcttccatgttaggtg a i I I CI i gcagctcttggtatctgcagaattagtgtgaa 
tgcttaaaaaatattaacagctttatatcatgaaagttttaacatgn 
>792

agGGCCGgttagaGAACTGCGCATCAGATGCGTGGccagaAAGTTCTTATATTTATGGAT 

tcgaatgacttttggaagagtatttccctctaaagccaggtagtattagctc^gaacaacata 

ATTGTGTTGAGTTCATTCTAGGG I M ICI 1 1 1 ATATTAAGCCTGTATAT AAACTTCATTATGGC 

ttgtgctgttacattcctccccagcctccagtaggtagaagcagtccctaatataaccatgt 
aatacagtggtaacttcatatctgttagttgactgcagctgcattcatggtgagctgtgaatg 
ttatctacaaagtagccagtaaaggttaagttttagtgttcagtggtgtggcctttagctagc 
tactgtggccttcagcttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaagt 

>793 
>794 

ACGAACT TAAATTTATGATGAATATCTTTGATAATGAGAAATCCTGAGAGATTTTACTT 
TCA>TTTTATrrrAATTTGAAAGAGC^TATGACATCTGGAATATTTTTAACATATAGCCATACT 
GTTTATTTAAATTTGTAATAATAGAAATAGAGTAATTCTACTGTTGGATTTTAATTTTTAATCAT 
ATTAAAGTTTAACTGGATTTTATTTTAGGACTAAAATATTTAGGACTAAATAAAATTTTATTAAT 
TAATTTAGGACTTrTGGGAAAAGATATTTCAGAAGTTCAGTGCATATCAAAAAAGCGAACAAC 
AGLAGGCTTCATCTTTTGAAAACTTCATTGGCTAAAAGTGTCTTCTGTAATACTGATAGTGAAG 
AAACTGTTrrrACATCCGAGATGTGTTTGATGAAAGAAGATATGAAAGTGCTGGAAGACAGG 
CTTCTTTAGGACATGCTAGAAGAGGaGCTTCTTAANN 
>795 

CC^CGCGTO^SAAAG^GAGTAC^TGTGATATATAGCCAAAGCTCTCTTCTCT 
ATTAGCTCTCACAGCCATTGGX^TCTTCTACCATTCAGlAGCTGATAAGCAAAGATATCAGCG 
TACTGGAACACAGAACAGTGCCTTGCATACATACGTAGGACAATATCTGCCACAAGGTAGGC 
ACTCAAATATTCATTGXAGGAGTGGCAAGATGGTAACTATTCACATCAACCGCGACAGACAC 
CTTTTTGCAATGACTATAACGCGTCCTGACCAAGCCTAAAACCAATATGTGTGGTTGATTTGT 
ACCCT AGGTGATCTTTG GCTTCtTTCAA G TTTTTG CACCACTCAG AATCATTTCAT AT ACCACC 
TTTGGCAAACATGCCAGACCTGCAGTAGACTGAAGGAAGCTCTC^CAAGCT CTAAATTG ATT 
AATTTATTAGTTCCT AGAAGAAAGAGATT ACATGTTTATCTTTTTGTTACAGAAGAAACTTTGA 
ATAGCAGTTGAAAATTTGGCAGGGTGGACCACGTAACTTGACAGTGTATTATTGTGTCTGTTT 
TGAAGGAATAAAATGGAATTATTTATAAAGTTTTCATTTGTATTAGAGAGAGCCATTTTTAA^ 
TTTCACATTCAATTTGTAACATTCTAGAAGACAGATTATCTTCTTTCTGTATACTGTCT 
AGGTA7TTAATAAAATATTAACAGAAATGTAAAACAGTGATTTGTCATAGGGCT AGTT ATACCT 
TTGTATTTTGGTTAATGATGATAAGACTTGTCAAAGGGCCCTAGGATTAA C I I I I rCCCGTAA 
ACGGCCTTATTTTGTGTTTTTAATAAGTT AAAGG GCCATAA I I I CI 1 1 1 
>796 
>797 
>798

ACAATTTTT ATGTTTACAGCTGTAACCCCTGAG TT ATCAAGAGATGGAACATTAGATA 
TGATTTATTCCTATTTAAGATAATAGGACATTGCTT^ 

AGAAAI I 1 1 1 1 1 1 1 1 1 AATCTAAAGGAAAGGTTTTGATTCTT ATGAGAAAAGAATGAGATTTCTT 
TAACTGGAAAATTGATTTATGTCCT ACAGTCCATTGTGTAGTGATGTT GGATCAATCAGGTAT 
CGCT AGGGTGTCTGTAGAAGT ATCT ATATATTGC 1 1 1 1 1 AAGTTCTTAT 
>799 

ACCATGT AGCTCTACTTTTCCAT AT ACAGAGTTGTTTCCT AG CTTT CT GCT AATCTAA 
CTGGATTCXJTCTTCCCCATTTCCTCATTTACTAGATTATAATGCACATCAC^ 
AAAAATGGGCTTTCAGAGTTACTGI n I CI 1 1 1 IAAATAATTGTGAGAGAGCTTTTGCATCATT 
TATTATCTAATCATGATTCAAGTGACTAGGCTGTAGCACCCAAGAACCT aaaacagt 
TTATTTTACCCAATAATACT ACTTTGCCTTCTTACTTAAAAATGTCCCGTGCTT AACCCTTTTG 
CT CTTTATTTTGATTT AAGCACTTGACCCTAAAGTCCrTGCTTCT 

CATTTAAAAAATAAAACAATCAGT ACT AGAACTAGCATAAATAGTAAT ATCT AACATGATCATT 
TGTATTATATAGGCATATTTAAi^AAAATGTGTTTCATTCCATTCAGTATrT 
GGCCTCTGGAAAATT AGTTTT ACTCCTT AG G I I I I C I C ATOCANN 
>800 

WWCTCT ATTTTT AACAAGGCTCCCTCAAGAT ATT AATGTGACAAACTT ACAT AGCCA 
GCTGTAAGATATCTTTCAAATGCGCAAGT AACCTAACAGATTTGTGCATGTCAGCCAGT AATT 
TCAACAT ACATTAT AAAT ATGGCCAATTTTCCCAAATTCT AAATGAATGGAGAT AAAATGCT AT 
ATAATAAATATGTTAGAGCACCTTTCTrTGAGAAACTTCTAAA 
TATACTCACACXJACCAGTAAAACCTCTGGTCA<^TGTTTTGGGTTGTGGAATGCCCCCAGCA

GCCGAGAGACCTATATTAATATCAACAGAGAAATATCACACACAGAATTAAACCACATACAGT 

AAACAAGAGCGAGGAAGTCCTGATGGATGGTAATGCTGCAACTTGGCACAGATATATTCAGT 

AGCrrCCCAGGAATACAAATCTCATGTATTAGCTCAATGTGGCAAGCTATCTCAGATTTGAAA 

GCCTAAATACTCAGATTTTTACTTTAGATTGAGT 

>801

TGCATTCCAACTTGCTAATTCAACATAGATCCTAATTCCTTAAATGCTrGTAATTAGAAAT^ 

CGTGAACTGTATTGGTTTTTGTCAAGCAATCTGTTTGGGGAACTTGAGCAACTGGGGCACT G 

CTGGCTAGGGTGAAGTTTATTTAATTTGTTTTTATGACATTCTTCATCTTGGAAATGGGGTTTT 

CAAATATTGCnTTCCCAGGCATCATTACTTATTTGCTGGTTTTTAT^ 

>802 

CCCTTTGAGCGGCGCCCGGGCAGGT ACGATAGGCATG CAATT AAAGAAGACCTG C 
CTCAAACA TTTTCT GTGTGACCTGAGGCAAGTCCTTTTATAGCTATAAACT AGGGACAATATT 
TGCTGTCATTTTTTCTACAAATGTCACAAAGAACAAATTTGAGCCTGTCGCTGTGAAAGAACT 
TAGCAAATGAAAGGATCXrrAGGGAGTGTTTTAGATATCGATATTTTTATCGAATTAACTTTTCA 
AAATGAAGTTATTTGCTCACTGAAACTGAAGT 
>803 

ACCTTGCACAGTGCCTTTTAAATTC^TTTTGCTGG ACAG TTGGCAGGCTCTTTCACTT 
GAGAGGCTATATCTTAACGATTTAGAATGGAGAGTTTGGCTCAAGCTCCCT GTGTGTGGTCT 
GTGCTTTCTATA C 1 1 1 I ATTCTTGGTATTCCAGAGTCTGGAGGCTTCT C I 1 1 1 1 AAAAATTGCT 
AGG CTCCTG CCAAATGTT ATAATTTGGGGATGTGAGTTCACT AAGAAATCAACTGACAAGAG 
GCAGATTAATAGGAGAAATGACATCGAAATTTATTAGCATGCAGGGGGAAAAAATTGATTAC 
CA AATAT CCCAGTAGGGTAGAGATGCTTATATACCCCAGGTCTTAAGAGAQAGGGAAGTGGA 
tgattttaggggaatagtaaatactttnt ATGGGAACTCACTGGGCTTGAAGAATATAACAAA 
AGCCTGGGACAAAGTCTGTTGGGCCCACAGAACAGACAGTGGTTTATGACAAAAGNN 
>805 

AGCGGGTGGCXjGCGGCCCGGGCAGGTACTATTACTAGGTTCATTGTTTCGAGAGG 

ggtgaaacggggctttggagaggttaaataacttgccgagggtcacacagct attaagtgg 

taaagctgggatttagatgagcccagacaaagaacggaagaagctaagctattctcttgtaa 

tacctccaacataggaggcaagaagtgaggtattatacaggttgaggagataaag^ggaga 

gaggcctgcagtgctaacaggaggag ctgg gattcatcct ggcttgttct gataggtcagt 

tagtcttagagatacccatgaggtcj^cctactcaaaatggggctcagagtagcgttgtcc^ 

ttcttgtccagtgggcgcagctacagtcttcctggcct ggagtgactggaggctgtcccca 

cgtcccacttcagtgaggcattcatgtgcacccagcagactt^ 

ggggaagattctccagaaocttgttaagatgcacagtgtggtgcttggactggcagtgtgg 

cctosgcagtccctgggagcttgttaggaatgcagaatctc^ 

aaagctgcattttaacaagatcjccagctgtttcgt atgcacatgaaagtttgagcagcactg 

CTCT AGAAAGTTACTTTTATCTTT ACT GTGCCCACCAAAGTAATCAAACTTT GT GAAGT 

>806

NNCTGTTTCXrrGTGTGTAAAAGGACGGCAGAGAAACATGGGCCATCGGTGTTTC^ 
NNfMNf^KNNrmNN^a^lNNNN^MNNN^•m 

ACACATATATAGATATATACACCCACATATATATTTGCTGACATTTTAATGTGAAGTTTTAGTC 
TGGGATAT AAAATGGAATGTATGACATCCTCAAATGTCTGAATACT GTTCACTCCT ATGTTTTA 

catttaattttoc^aagcaaaacattrcagltgaggattitatta 

cx>tatctagaaaccagaataaacaatgccataaagcctataggaaaatgcagg^ 

taaatattcatgtgtttactttcagtacagggaggaatttgaagtagatag^ 

att actccggtctgaactcagatcacgt agg>cttt aatcgt^ aa7 a 

gcggctgcagcatcgggatgtcctgatccaacatcgaggtcgt aaaccctattgttgatatg 

gact CT aaataggattgcgcn 

>807

AATTCCCATGATGTCAGACCACTGGAGTTTC^GGG 
CGCTGCAGAAGAGCATCAGACGTTCAGT AAGAATGCAAAGGGT ATCTCAGTGGGAACCGCG 
GACCAGGAGAGCTCCCAAACCAACACATGGCTAGGGCTCTCT 
CTTGACGAGAGAAGAGTAAAGATCTTTCTGAGGTTGGTGCAACTGAGGAAAC 
GCCTCT GCTGTCAGATCTATGAAAGGAAAGAACTGTGAACTTGTCCCC 1 1 T TGTTTTCTTTGA

cttaaaacaaaagaaaatcactggaacaaagtcttaaagtaataacagaaatgtcagaaaagt 
tgaacatctt atgggcacatgcggtgag tt acgct aacttat agcatccactgagattagcc 

GCATAGGATTCTTCOCATSTTAjGAGCTAAAAGGACCTACTGTCOGCCAGCTGCATTGCAGN 
>808

act atccoctacctataaggcatttataatgtgctgggca ttgtg acacttttcatat 
attatctcatgaaatcctcacnaataattct^ 

a>ttctgaggcttacaga>gttaattcagtggcccagggtcacacagtttai^ 

ttgg tgaat at aaagt agcaacttct aagttt cactctcccacttccct ag tt attttcctaag 

GCATGAATGTCT GGGAAATAGCATGCATCAGATWTTCCACCTCTTTAAAACTCTTCAGTTCAT 

ataatntagggtgtgactattcatagatacctttgagctaatcttctgggaggcaatgtaacc 

gcaatgcacactgcaaaacaatgcacgctttctctgtaaat^ 

ggaaaagcccatcrtttgatatgaaccaatagggcagttta^ 

ACTGTTOCTGCNNNNNNNNNNKNNNN 
>809



ACTrrmcTTTTCTTTmTTrm-n i ggaaqaatattgcatacct attagaaaagtc 

TTTTAACAATTAAAATrGAAAATGACTGACAAACTTACACTATTTGA 

TGGTCACATGATAACAATCTCCTGATTGATATGCTTTATTTAACCAGGTTCT 

ATGTGAAAACCAAATTTTACAATGCAGAGGTAAGTGTTGAGTGTTTAATGGGATTTXIATATTA 

AACATTAAGATCGTATTTGACTAAAAATCTCTTATATAC^TTTCT AATACTGAAGCAAATCGCC 

AACGTGACTGTAAATTATTTGAAAAAATCAGAAATTTCAGTTAAAATTGA^ 

GTCTGATA A I C 1 1 1 1 > CAGCTTAGATGGAATGAATGTGTCTTGATTTrTArf CnTDGTTAATTTTA 

TAAGGGCTTCATCTC Cl 1 IL^ TAAATGATTGCCCT CTCA TTCCATTTAATGGTG G TT G TTAC 

ACT AG^CAATCTGTGGAATTTTACATGTGGTTCGGGATTTTAC^AAAANNNNN 

>810

ACTCCA I rTCTTTI I ATTCATATTATTrCACCAAATAATATTCCACTGTGTAGATCTATC 
ACATTTarrTTAGCAGTTTATCAGCTGGTGGACAATTrGGCTGTT^ 
GAATAATGCTGCTATGAGTCATAGAAACCATTCCTCTTACT 

CTAAGCT AAACTTGTTTGAAATGTAAATTCTCAGGTATTCTCAGTATAGACCTATAGATTCACT 
TAGCTGGTGGGGTCCACCCAACI I CI 1 1 I AACAAGTCCTCCAGTGGATTCTGATGCAATGCT 
AACATITGTGAACACTGTCAAAATCAAAATGGAGTCACT^^ 

AAGCCAGGGACAGCTATGAAGAGAGGGTTCTCATGCATGAATGCCT^^ AT 

CCCAAATGACTCTGCANAAACCACAATCCTGCACAAAGGTCATCA 

ATCTTCACAAGGACANN 

>811 

TATTCAGAG G1 ITTCTn ICCIC I I A GTTCTGTGCCTGCTGCACCAGTCAAAT ACTTCCTTCAT 

TAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTCn^G 

TGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTA^ 

TTATATTGGTATAAAACAGGTATATCAAATATTAGTGAAGTGTATGGAGACACGCCAGAATGG 

AT ACAAAGACAAACAGTATTACAACATAG TTTTAATGATTGT 

>812

ACCT AAGAGTTATTAATACTATTTCAGTAAAAAAAAAAATTTAATAAACCCTGTGTGAT 
CCCATTGTAACAGAAAGGCTGATGTTTTCT^TTGTGAAATACAAAT 

TCTTTGTTTCAAAGGATGCA I MCI I CCATAAAGAATAATTTGTATTTA 1 1 1 1 I AAGGGTTTATT 

TTAACTTATACATCAGCCTATATAAAATACATTTCAAAATGATCTGTGCT 

AGCAAATGTTAAI II 1 1 1 II I CCCTCT AACAGATAACAAGTTTTACTCCTATGCTGATTTTTCTG 

GTGCCACTGAAGNNNNWNNNN 

>813 

ACATGTGCATAAGAGGGAATG^rrTCCCTACATTACTCCAGAATACAAAG C M U 1 1 1 CT 
GCCTTTCTCATCCACATAATGGAAGACACTTCTTGGGTGAAATACT 
CTCACTGGrrGAGTCTGAATATAAGCTCTATGAGAGC^ 

TATCCCCAGCCTCT AGAACAAGGCTGGCACATAGTAGATGCACAAAAGGTGTTTGCTGAATG 
AATGGATGACTGAGTCTGTGTGGGGT AATGATAGGGCTAAGGATGGGACTCTAAACTCAGG 
TTTCCTCTGTGGGTTTCACAGTTrACTGGTCTT AAGAGGAGAGTTTCCT AAACTTGCCTTATG 
ATAAAAACCACCTTCAGCATTTGGTAAAAATTACCCATTCCTGTA^ 
TGAAGTGGAGCTGATGAATCTGTTTTTTGTGATACTGCTGCTGCTGC^

TT CAGGTG GTTCT AAGCTT AGGAAACCTTGCCCAAGGAT ACCATCCTG T CTCNNNNNN 

>814

>815

ACAAGTATTATGTATCCATAAAAATTAAAAAATCTTTAAAAATGCATATGGGGGTCAG 
TAGGTAAAAG AAAAGAGAACCAAGAGAGCTGCAGCGGGGAGCACAGCTTGCTTT AAACATG 



AAAAAGATGATGGAAACTTGAAATTTTATATTTAATCTTCT CATTTTTAAGTGTTGGCAATGT A 
TTGAAGACTTTGAAGCCTCTCT GCTGGTCAAACAAGATGTATCTGTAGGCTGGATTT AGTCC 
ACAGCrrGGCCAGTTTGAAAACTGAATCCTGCTAGCCTTAATTTAAATTTT^ 

gctttgattcctgcactcctgctcaaaaaaatcttcaatggctccccactgtctgg 
agtccaaactttgtcacoagtccttcaaagcaacgcatgactatatccaagaccccaaaccat 
atttct acctt at ag ccagtctccat cttcgaccx5caacca g aat gat ag ttgaattgt actc 

TAGGAAGGAAAATATTCAGAAAGGCACCAGTCCTGGCCATGAGGGCTGCTTCTGGGTCCCT 
AAGCI 1 1 I rCCI 1 1 CCTGCAGTGCCCTACACTGTGCATGCCTATCAATGAAACCCTGTCCATC 
ATTAACCATCCAGCTCAAATACCACCTCTCCACAAAACATCCCTGATGGCCt^GCCAAATGC 
CCCTTTGCTCTGAATTTCCATGGGACTTTATATCACTCACATGACACTTACAAC^ 

tgttttggcattacctacatagctcattttccgaaacacctaataatattcacagaaggaaga 

GTTTGGGTTTTGCACTATATTGAATATCCCTGTAGTGCTrrGTACATAGCANNNNN 
>816

nncgcgttgaggcggctgcagcagttgcgcgctgggattgttgcggtgcgctgga 
gccgaatacaaaatacagttaaaataaaatgtcaacct cct ggagtgatcggtt acagaatg 
cagcagatatgcctgctaacatggataagcatgccctgaaaaagtatcctcgagaagcctat 
catcgggtgttrgtgaaccgaagtttagcaatggaaaagataaagtgttrtggttttgatatg 
GATT ATACCCTTG ctgtgtacaagtccccagagtatgagtccctt ggttttgagcttactgt 

GGAGAGATTAGTTTCTATTGGCTATCCCCAGGAGTTGCTCAGU I fTGCI 1 ATGATTCT ACATT 
CCCTAOCAGGGGACTTGTCI I rGACACACTGT ATGGAAATC1 1 1 1 GAAAGTCGATGCCTATG 
GAAACCTCTTGGTCTGTGCACATGGATTTAACTTTATAAGGGGACCAGAAAGT AGAGAACAG 
TATCCAAATAAATTTATCCAGCGAGATGATACTGAAAGATTTTACATTCTGAACACACT attca 
ACCT ACCAGAGACCTAGCTGTIIGGCCTGCCTAGT AGATTTTTTTACT AATTGTCCCAGATATA 
CCAGTTGTGAAACAGGATTTAAAGATGGGGACCTCTTCATGTCCTACCGGAGTATGTTCCAG 
GATGTAAGAGATGCTGTTGACTGGGTTGATTACAAGGGCTCCCTT AAGGAAAAGACAGTTGA 
AAATCTTGAGAAGTATGTAGTCAAAGATGGAAAACTGCCTTTGCTTCTG^ 
AAGTAGGGAAAGTATTTCTTGCTACCAACAGTGACTATAAATATA^ a 

TTTGACTTGATCTTG GTGGATGCACGGAAACCAU ICI I I I 1 1 GGAGAAGGCACAGTACTGCG 

TCAG gtggatactaaaactggcaagctgaaaattggtacct acacagggcccct acagcatg 
gtatogtctactcaqgaq g i ic i i c i gatacgatctgtgacctgttgggagccaagggaaaa 
qacattttot at attggagatcacatttttggggacattttaaaatcaaagaaa^^ 
tggogaaci 1 1 1 1 ! ggtgatrcctgaactcgcacaggagct acatgtct ggactgacaagag 
ttgacttttggaagaacttcagagcttggata1 1 1 ici i ggctgaactctacaagcatcttga 
cagcagtagcaatgagcgtccagacatcagttccatcc^ 

atgacatggacatgtgctatgggatgatgggaagou i u ii i cggagtggctcccggcagac 
ccntrttgccagtcaagtgatgcgttatgctgaoctct atgcagcatctttcatcaaoctgct 
gtattaccctttcagct acctcttcagggctg cccatgtcttgatgcctcatgaatcaaogg 
tggagcacacacacgt agat atcaatgagatggagtct cctctt gccacccg gaaccgcac 
atcagtggatttcaaagacactglactacaagcggcaccagctgacacggtcaattagtgaga 

TTAAACXTTCCC^ACCTCTTCCCACTGGCCCXXCAGGAAA 

GATGATGAAGAGGAGGAGGAGGAGGAAGAATAAGGAGGAAAACCAAAA 

ATTAAACAAGTCCTGGGAGGACTCAGAGGAACAAACGAGGTOC^ 

GGGGAGGGAGGG GGCT CCATGAAAGGTACGTCTGAAAAGTTTCTGAA 

QATACTTG1 1 1 1 GGTTTTGTGTATCTGTACTCTCTGCAGATGCTCC^AAATTGT AATGGAGTC 

TGTATTAGAAGAAAATAAGGGTAAAATCAGGCTGAACTGCATGTATATGGCT CCACTGTGGC 

TTGTGACACTITTAAAATCATCXXrrATGTCAGTGTATCTGGATACACGAGGAA 

GTCTCAGAGTGGAACAAAGAGTGGGAAGAGGTGATCT GT AATGTTACAAATTGTGCT ATT AC 

TCCAAGGTCCAACTTTTCCAGTGCATTACATGGTATTGTATATCAGTGGAGAAATGTATTATT 
TCCATGATCAAATGTAGTCTCTGTTAAGGTCAAG 1 1 I I C I I 1 I AT AAGCCTTT AATTGATCCTC
AGTGACTCTGGCAAGGCTGCTTCTCT ATCACTGGCTTTGCACAGAAGTATGCTCTACTTGCG 
ttgctttagggcaggattct ATTTTGAGGGAAAAGACAGTATCCTT ATT ACCTTTTGTTTGTTT 
AAT AGCACAAATGCTTATTTGTT ATCCAAAAACAACCT CCTTCTT ATCTGTGATAAATCTATAG 
AAAGAATTT AG CTGCAAGTGGACAAAGGAACAAGCCCCCAGAAAAGAAAGGGAAGAACTGC 
CTTCrrTATACTACAGAACATGCATTAGTGTGGGCTATATAGCTGTGGCTCATGCT AOCCAATT 
CCAGAI MOW IGTCCTCTAAGAGTTGATTGCTGTATATTAAAATTGAACATCAGAGGATGGG 
AAGAGGGCTCTGTAAGCCAGAACCTTACTAAAGTAGAGGGCACAATCAGTGTGAATAAATTC 
ACTTCAGAATCTCAAGTCAAGG<X>GGCACGGCGGCTCACGCCT 

GGAGGGCGAGAGAGGCGGATCACCTGAGGTCGG GAGTTCGAGACCAGCCTTACCAACATG 
GAGAAACC<X^TCTCTACTAAAAATACAAAATTAGCTGGGCGTGGTGGTGCATGCCTGTAAT 
CCt^TCATCTACTTiAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTG 
CAGTGAGCCAGGArTGTGCCATTGCACTCCAGCCTGGG 

AAAAAATAAAAATGCCAATCOCAAGTCGAAATCACC I C 1 1 G 11 1 1 AAACAAGAATGAATCATTA 
CTGTGT ATGTT AGGGT ATT AAAACTGTTTCACCAGT AGAGTGAAAG TTG TTTCAAC^TTTTAAA 
CAAACAGTGGTTATAGACTCI NCI I I AACCATTGTATAI 1 1 ICI I CCATTCTTGTCATTGGTC 
AATAG^GGAGGGTAGATTAGCTGCTCCAGAATTGAATAAAGTGTAATATTTCTAACGGTGAC 
TTTGACCTATTCTGTAG T ACAACTG T AAT AGCT ATTGGT CTT CAAGT GG GTTT AGATTTGGTG 
ACATCAGTTTGATATTCTCrTAAAGGAAATAAATATTCAAGAACTGATTATGTTCTAACATGAT 
TATATTCATGGTGTTACATAGGCCTCAATTrrrrCACAGAA 

AAGTGAGGCTrTTTAAAAAATTATTTTATAAGCAGAGAACACAGCtrrGATAACTT AGTCAAGG 
ATATACTtrrCTGTCTCACTACTTTGGACTTATATGGCTTCAGATTAACT 

CATACATTCTAAATGGTATATATTGGGAATATATGCCCCTTTAAAAGAATCAGGTCAGAAATG 
CAATAACAATTAG^CTAGACTGTTGOCCGTGTTAGGAGMATGTGTGGTCATCCTAGTTACT aa 
TTACTCTCACTCAAGATGGAGATGTT GTCCAGTTrAACATAGTGTTAAGTTTTCTTAAACCCAA 
AT AATTTATGAGT AGCTTATT ACATCTGCAGAGCT ACCTT ATTATAAT AGT ACCTGCOCNNNN 

>817

ACATGTAATAGACACTATGCTACAGCAAAAG CI 1 1 I C I I ATTGTCTTTAAA A I I I I C CI 
GGGTGCATAAAACTATGTMGGTAACrrCTTTCCCAATTTTTAA CI 1 1 1 ACATT ACAAGTCATTTT 
CAGAGTAAAAAGTCATTTAACAAAGGCAGATAGAAAGGCCTCAAATCCCTG^GGACCAAAAA 
TCCCAACACATTTTCAAAAGGGAGAAAATTTCTITAAACTTCATGGGAAAAGTAT^ 
AATAGAGAGGCTTTATGCAG TCTTTGAGAAGATGATACTITTGGAATAGAACAAGAAGAGGA 
AAATATTTCAT ATTATAAT ATT AGTGTGTCAAATTCTATGTrrGTCAAAAGAGTGAXirrATCTCTG 
ATGANNNNNNNN 
>818 

>819

NNCGCGTTGAGGCGGCTGCAGCAGTTGCGCGCTGGGATTGTTGC^ 
GCCG^AT AGAAAATACAGTT AAAAT AAAATGTCAACCTCCTGGAGT GATCGGTTACAGAATG 
CAGCAGATATGCCTGCTAACATGGATAAGCATGCCCTGAAAAAGT ATCGTCGAGAAGCCTAT 
CATCGGGTGTTTGTGAACCGAAGTTTAGCAATGGAAAAGATAAAGTGTTTT 
"GATT ATACCCTTGGTGTGTACAAGTTXXX^AGAGTATGAGTCCCT^ T 1 1 I GAGCTTACTGT 
GGAGAGATT AGTTTCT ATTGGCT ATCCCCAGGAGTTGCTCAGCTTTGCTTATGATTCT ACATT 
CCCTACCAGGGQA C II U I C I I I G ACACACTGTATGGAAATCTTTTQAAAGTOGATGCCTATG 
GAAACCT CTT GGTCTGTGCACATGGATTT AACTTT ATAAGGGGACCAGAAACT AGAGAACAG 
TATCCAAATAAATTTATCCAGCGAGATGATACTGAAAGATTTTACATTCTG 
ACC TACCAG AGACCTACCTGTTCGCCTGCCTAGTAGATTTTTT^ AATTGTCCCAGATATA 

ccagttgtgaaacacgatttaaagatggggaoc rem c atgtoctacoggagt ATGTTCCAG 

GATGT AAGAGATGCTGTrGACTGGGTTGATTACAAGGGCTCCCTT AAGGAAAAGACAGTTGA 
AAATCTTGAGAAGT ATQTAGTCAAAGATGGAAAACTGCC I ITQC1 I C I GAGCCGGATQAAGG 
AAGT AGGGAAAGTATTTCTTGCTACCAACAGTGACT ATAAATAT ACAGATAAAATTATGACTTA 
CCTGTTTGACTTCCCACATGGCCCCAAGCCTGGGAGCTCCCATCGACCA AC 
TTTGACTTGATCTTGGTGGATGCACGGAAAOCAC TC 1 1 1 1 1 1 GGAGAAGGCACAGTACTGCG 
TCAGGTGGAT ACT AAAACT GGCAAGCTGAAAATTGGTACCTACACAGGGCCOCTACAGCATG 
GTAT CGTC TaCTCAGGAGG I ICI I CI GATACGATCTGTGACCTGTTGGGAGCCAAGGGAAAA 

gacattt tgtata ttggagatcacatttttggggacattttaaaatcaaaga 

TGGCGAACTTTTTTGGTGATTCCTGAACTCGCACAGGAGCT ACATGTCT GGACTGACAAGAG 
TTCACTTTTCGAAGAACTTCAGAGCTTGGATATTTTCTTGGCTGAACTCTA
CAGCAGTAGCAATGAGCGTCCAGACATCAGTTCCATCCAGAGACGTATTAAGAAAGTAACTC 
ATGACATGGACATQTQCTATGGGATQATGGGAAGCC I G M I CGCAGTGGCTOCCGGCAGAC 
CCTTTTTGCCAGTCAAGTGATGCGTTATGCTGACCTCTATGCAGCATC^ gct 
GTATTACCCTTTCAGCTACC I C I I CAGGGCTGCCCAT GTGTTGAT GCCTCATGAATCAACGG 
TGGAGCACACACACGTAGATATCAATGAGATGGAGTCTCCTCTTGCCACCCG GAACCGCAC 
ATCAGTGGATTTCAAAGACACTGACTACAAGCGGCACCAGCTGACACGGTCAATTA 
TTAAACCTCCCAACC I C 1 1 CCCACTGGCCCCCCAGGAAATTACACACTGCCATGACGAAGAT 



ATTAAACAAGTCCTGGCAGGACTCACAGGAACAAACGAGGTCCCTGTTAGGGTTCT ACT CG 
GGGGAGGGAGGGGGCTOCATGAAAGGTACGTCTGAAAAGTTTCT GAAGATTTTATTATCATA 
GATACTTGTTTTGGTTTTGTGTATCTGTACTCTCTGCAGATGGTCCAAAATTGT AATGGAGTC 
TGTATTAG AAGAA AATAAGCGTAAAATCAGGCTGAACT 

GTCTCAGAGTGG AACAA AGAGTGGGAAGAGGTGATCTGTAATGTTACAAATTGTGCTATTAC 
TCCAAGGTCCAAC I I I I CCAGTGCATTACATGGTATTGTATATCAGTGGAGAAATGTATTATT 

tccatgatcaaatgtagtctctgtt aaggtcaag mTcrm at aagccttt aattcatcctc 

AGTGACTCTGGCAAGGCTGCTTCTCTATCACTGGCTTTGCAC^ 
TTGCTTTAG^GCAGGATTCTATTTTGAGGGAAAAGACAGTATCCTTATT 

AAT AGCACAAATGCTT ATTTGTT ATCCAAAAACAACCTCCTTCTT ATCTGTGAT AAATCT AT AG 
AAAGAATTT AGCTGCAAGTGGACAAAGGAACAAGCCCCCAGAAAAGAAA C 
CTTCTT ATACTACAGAACATGCATTAGTGTGGGCT ATATAGCTGTGGCTCATGCT AOCCAATT 
CCAGATTTCTTrGTCCTCTAAGAGTTGATTGCTGTATATTAAAATTGAACA 
AAGAGGGCTCTGT AAGGCAGAACCTTACTAAAGTAGAGGGCACAATCAGTGTGAATAAATTC 
ACTTCAGAATCTCAAGTCAAGGCCAGGCACGGCG^ 

GGAGGCCGAGACAGGCGGATCACCTGAGGTCGGGAGTTCGAGACCAGCCTTACC^ 
GAGAAACCCCATCTCT ACT AAAAATACAAAATT ACCT GGGCGTGGTGGTGCATGCCTGT AAT 
CCCATCATCTACTCAGK5AGGCTGAGGCAGGAGAATTGCTTG 
CAGTGAGCCAGGATTGTGCCATTGCACTCCAGCCTGGGCAACAAGAACAAAACT 
AAAAAATAAAAATCCCAATCCCAAGTCGAAATCACCT C II G I I 1 1 AAACAAGAATGAATCATTA 
CTCTGTATGTTAGGGTATTAAAACTGTTTCACCAGTACAGTGAAAGTTGT^ 
CAAACAGTGGTTATAGAC I CI I ICI 1 1 AACCATTGTATATTTTCTTCCATTCTTGTCATTGGTC 
AATAGGGGAGGGTAGATTAGCTGCTCCAGAATTCAATAAAGTGTAATATTTCTA 
TTTGACCTATTCTGTAGT ACAACTGT AATAGCT ATTGGTCTTCAAGTGGGTTTAGATTTGGTG 
ACATCAGTTTGAT A I ICIC1 1 AAAGGAAAT AAATATTCAAGAACTGATT ATGTTCTAACATGAT 
TATATTCATGGTGTTACATAGGCCTCAATTTTTTCACAG 

AAGTGAGGCTTTTTAAAAAATTATTTTATAAGCAGAGAACACAGCCTGATAACTT AGTCAAGG 
ATATACTGTCTGTC1 CACT ACTTTGGACTTATATGGCTTCAGATTAAGTCATCCAAGAAACAT A 
CATACATTCTAAATGGTATATATTGGGAATATATGCCCCTTTAAAAGAATCA6GTCA6AAATG 
CAAT AACAATTAGACT AGACTGTTGCCOGTGTT AGGAGAATGTGTGGTCATCCTAGTT ACTAA 
TTACTCTCACTCAAGATGGAGATGTTGTCCAGTTTAACATAGTCTT AA G 1 1 1 1 C I I AAACCCAA 
AT AATTTATGAGT AGCTTATT ACATCTGCAGAGCT ACCTT ATT AT AAT AGT ACCTGCCCNNNN 

>820

ACT AGAATT AGTTCCAACT ACTGCTGGTGATAAACTGACCATCTACCTTGACTTGTTT 
TCTCTTAATTCTCCAAGAAGTAATCAGGTGAATAAAGAATCATCATCAGATAA 
ATTCTTTAAGAAATTAAT7TTTATCTACTCTTAAATGATTGCACAATTA 

ATCTTGTGCTCTAATTCAAA 1 1GCICI IAATGATCCTAGAGAGAAATGAATTACTAGAGATAAA 

AGATAA AI 1 1 1 GC TGTGGTTTGCGAT C I MUN I CH ICCI lAAAACTTAAC 

>821

NNNKtNr**AGTAGAGACGAGGTCTCGCCATG^ 
GAGCTCAAATGATCCACCCACCTTGGCCTOCCAAAGTGCTGGGATTA^ 
CACACCTG^CCTGATTGTTTTTAAATGGCAGCAAGA 

CACACAGTATGTGGCATATTCAGAATT G GTT G I G AGTTTCCAGTAGAAAGCACTGAGAATAT 
CCATAGGGCAAAATGGAATACTAATAATCCTCATTTGCCTTTGCCTTTGTACTGGA 
ccttactt AAGCCC A CC AAAGGCAAGGTTTGGGCCTGCCACAGCGGATTTCAAAAAGACAAA 
GCAATGCAAGCCACGTGTTCAAAATGCCCT AAGTGGCTATTCAGGT AATATATAAAAGT AAG 






ACCaGGCTAATTAGTATACAAI GGGG I AAACCAGAGAGCAGAAAGCCCI I CTTT AAAATGAG 
CCTACCACTGCTTGGCCTCAGTGTGAATTTAGACCCCATCTTCT GATATTTCAGGAGAAAGTA 
AAAATCT AGATTTTTATCTAAAATCTTTTTAATTTTTAAACAGTCAC^TGATTTTAACGAAATAC 
T ATGTGTGCACAATGGAACT GTCT ACGGGAATGAT CT AGTCCAG AG GAT ACCAGGTTCCTCA 
CAATGTCTTATAAAGTTGTTTAACCCATAGTACCTGCCCGGGCG GCCGCTGCCACCGCGGT 
GGA 
>822 

ACAGAGCATCTTAAGGTTGGAAGGACTCTTAGAGACCATAGTCCAGCCTCOCACTTG 

atactgaaacacgtttgtgaattc^tggccgatgtctaacttccct 

gcatataatcagcatctttcccactaagtgaagggcccagactcgagcacaggaggacagc 
accccttaaactcacgag g ggctg cattcacaccatcag cagg g ag attacacttgtgtcat 

TTG 

>823 

aacacattttgatatatcaagaaaataaatggaaatgggggtgataatttatatttgt 
aatttacaaatcaaacatctatcatacacttactatgtacgcggggagagaaagaggcaacta 
cattgcctggaggaagcctaaggaacccaggcatccagctgcccacgcctgagtccaagat 
tcttcccaggaacacaaacgtaggagacccacgctcctg 

ACCTTCAAGTCCCCTTTCTCAAGAATCCTCT GTTCTTTG CCCTCTAAAGTCTTGGT ACATCTA 

TCTCCTTATGTrTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCAC 
CTCTGCCAACACTGGATCCAGTGTG ATCTCCAGTG gagccagcacagccaccaactctggg 

tccagtgtgacctct^gtggggtcagcagagccaccatctcagggtccagcgtgacctcca 
atggggtcagcatagtcaccaactctgagttcx^tacaacctccagtgggatcagcacagc 
caccaactctgagttcagcacagcgtccagtgggatcagcatagccaccaactctgagtcc 

gggccagcacagtgaccaactctgactccagcacaacctccagtggggct agcacagccac 



ac^cctccagtggggcxx;gx>cagccaccaactctgagtcc^cacagtgtccagtggg 
atcagcacagtcaccaattctgagtccagcacaccctccagtggggccaacacagccacca 

ACTCTGACTCCAGTACGACCTCCAGTGGGGCCAACACAGCGACCAACT CTGACTGCAGCAC 

aacctccagtg^ggcgagcacagccaccaactctgagtccagcacgacctccagtggggc 
cagcacagccaccaactctgagtccagcacaacct ccagtggggccagcacagccaccaac 



tgtccagtggggccagcacagccaccacctctgagtccagcacgacctccagtggggcga 
gcacagcc^ccaactctgagtcx^gcacagtgtccagtggggcx^ 
tg^agtccagcacaacctcgagtggggccaacacagccaccaactctgggtgcagtgtgacc 
tctgcaggctctggaacagcagctctgactggaatgcacacaacttxx^ 

TGCAGTGAGTGAGGCGAAGC^GGTGGGTCCCTGGTGCCGTGGGAAATCTTCCTCATCACC 

CTGGTCTCN 

>824

ACCCCCATTATAGTAGGGAGACTGAATCTTCAAAGTTACAGGGTGAATCAATGATAA 
TGATCTTTGCAGCTTTCTGGAGTTAAAAAGCATCAAAATTGGGAGATATTAGATGATGACATC 
TAAGTA7TAAAATAAGGAGATATTAAATGATGACTCCTAGAAATGAACCTGAATAAGGACTAC 
CGCAATGTGTGTGGTGTGGGAAAGGACAGTTC 1 1 1 1 AATGGCTGGCTG ACCCAGCCTCAATT 
TTCTTGCAiSCTrOGCCGAGACGAGGTGACCATCTGCAATTACGAAGCATCTGCCAACCCAG 
CAGACCATAGGGTCCATCAGGTTACCCCACAGACACACTTT ATTTCCTAAGGGCTGGCCAAG 
GCTCCCATAGAGGCGCTGTGTCAGTGAAGATGTACGACT ACCTGTTGGGAAGGACAAAGGG 
GATGAGGCTCCAGAGAGAGTTGGCTGCCACAGCCT CTGCCAAGC I 1 I G TCTTTGGGGGCTT 
GCTGCAGAAACCTGGGCCTACGGAAGGTACGACACCACTGGGwAGGGTTGTGTAGGTGCCA 
GGGGACCATCGTGGTTCTCTAGGGCGCTGTGGAAATTGGGTGTTGGGCTGGGTGGCATCT 
GGCAGTCATGGATAACACTTGC 1 1 1 ICCAGTTAATGTGGCCATGTGATTCCAAGTGTCATGTT 
GCTTTGTGGCAAGATTGTTGTGTGACTTG I 1 1 II I I GTTTTTGTTTTTGGT ?T T T 1 AAAGGAAA 
CTATTTGTGGGCTATAGGAAA C I I I C I GATGCCTCCGGATTGTGTTAATAATAN 
>825 

GTCCGGTTGAATAGATTAGTTTTTCTCAGTAACTTACT ATCCAGCAGACTGGCTTTCC 
TG AGAC7TG AGG7TGTGGCTTA i ACTijGAaTGAGACCACTGTaCGTGTAGGTGGT I CAUA 1 C 

CTGCGTAATGGCAGCATGAGGACTTAAAAGGTGGTTTTCATTTTGAAGATGGCTATGTAGCT 

TGTAAGGTGTATCACAGCAGTAC^CTCATGGCTTTTTGGTTCCAGCAGTGAGGGCATTGGT 

GAGATCAATGGTAAACT GTGGAAGC I MCI 1 1 1 I ATCATTAGGAAATGTGAAACGTTGGACAA 

ATTrrGAGTTTTAACAAGGACAAAAAGTTGAAAGAAAAGGCACAGTTAACAAAAAAGGGTGG 

CTAGATTTATCTTGGGTGATGGAGGAAATGAGAGAGGAATGCTCTTGAAAGGTGGTCTGTGG 

ATCTGTCTGAAT AGAAAGAGCACAGT AAGTATG CATTGCCGGAGAAAACGTCCTTGAAGCTG 

cttgtctcatgtgtatgatgtgctttttaaatcatgcccctcgttgcctgcctaatctgtgac 
tccctaaaaact aactgggcccatgt agatggggctgcaaccagagctgaataacatgttag 
gctcacacatgcatcagcactgcacactggaatcattgctcttoctggactttgtagaaatx; 

AGTCTCAAGTGCTTCAAGAGTCTGGCTGC7GCT ACTTTTATCTGTCAGGTAGCACATAAGGTT 

tgcagggtttatattttgtatagaatcacagttgtggagaaaaagtaataatttctcaatgaat 
tttaaaaatgggcctattttctatc^ccgtggttcatctgatataattagtgttccctgtgaat 
tccccccctctatgggaaggatgccttractctttatcagtaataaattatgactgttttcata 

TTGCCTrAGGGTTATTTCCCTGTGTAAACCATTGTCTTTrGTTTTG G T 1 MC I I IAGCATTATG 

aagctttggtattgtacaaggtcagtagtaagatgctcactagtctcagggcttgtgtaatat 
tctgggaggtcatttaaatgccagaaatggtcaagcaattatacacagt attta tgactctgt 

tcatttaaataatttctaaaagggtcttttcaagctct aacaaagtcactaacaaatgcattat 
tttctacagaattagatgttagtagtacagtactgcatattcagggaaaaagtgtgaggaatt 
gatttcaaaatagttcgttcttgtgtttgacctaagaatgattgtcgcatgaagtgtttgtt^ 
tacagtttagcatatataaacaaacatgataggattccttaagatgttaccacccagggggcc 
acaagccagcctgctgtctcaggaagctgtagaaggagtgtttgtcaatttcttgtcactgg 

TTTGCTGACTTACTGAGGATTAATTGTTGCCTTACAATGTTACTGAAATAAACTGT^ 
AAAAAAAAAAAAAAGTCGACGCGGCCGCGAATTTAGTAGTAGTAGGCGGCCGCTCTAGAGG 
ATCCAAGCTTACGTAC«CGTGCATGCGACGTC^TAGCTCTTCTATAGTGTCACCTAAANNNN 
N 

>826

ACTCAACAAGCAGCTGACTTATGTTTTATTGGACATTGTGATACAGGAACTGTTTCCA 
GAGCTC^ATAAGGTACGCGGGAAAGTCAACTCAGTTACCTCTGTTTGGTGTGTGTATCACTT 
GCAGATGCTGTCTACCACCTTTTCAGTGACATC^n'AGAAGCTTCTCTATTACCACAGTAACTG 
GCTAACTAGATATGATCTTTCCCTAATTTTCATGAGCATCTTT 
GGAAAATAACAAAGTTGCTTCACTCT GAAGGAGTATTCTCCTCT AGTACCNN 
>827 

AGAT ATATGAAAAGCCAAC ATTCT AAAGT AGAGGTTCACTT AATTTTTTTTTTTTTCAA 
QAGAGGCTTCTTGGTAGTTTCATCAGAGAGTGGTTTTATTAGGGGATGTAAGGATTACAGAA 
ACATCXjTATTTTTTAACATATAGTATTTTTTGA 

GAAGGGGAAATAAAAATTCCAGAAAGATGAAAAATTGTTGATGTAAGATGGAGGCANN 
>828 

NAG CTGTTTCCTGTGTG T AAAACGACGG CACAG ATG CCG TCTCCGTGG CCG CT CCC 



CTrrGTTAAACTAACCCTTGCCATCCTGGCT ACTTT ACCCAATTAACCACCCT AQCCCAGG AC 
GTrrrGCTTTATCACATGTTCACAGTTTGCTATTCTrrGTTCAATCTTGTAACT 
GCTTCTGTGGGTCTCTG 1 1 (CI I 1 ATGAAGTTTCCCAGGGCATACAAAACTTGTGTTAGCCTA 
TCTTCTGTCAGTTT AATTGTGGAACTCAGCGAGG CCCTT AAGAGGATG GAGGG GAG TTTTTC 
CCACAGCAGTTCTGAATGGGATGAAGTGAAAAATAAAATCTCCCCATTG CCACT ACACCACC 
TCCAG^TGAGTCTTGCAGCAGAAATACCGTTTAACTGTTTCTG CI I t IAN I I II I CTGATTAT 
CATCCAGTTTTATATATTCATATCTGTGTGCTTTGATAATTATATATACATACl M il I GAT ATT 
ATTTACTT A TTCTT TACATTCAAAAGGAACTTGCTCTG 

M 1 1 M 1 1 AGTTTTCTTCATTTGGTTTCTAATTGAAACTAGAGGTAGACTGACTGTAAATAGAA 
AGTAGTTTCAGCTrCAGGAGCTTT AATrTCTCAGGCT CCTTCCAATGTTCTGGACCT AATTTC 
GAATTTGCAGTATGTN 

>829

NNNNCGAGGTCGACGGTATCATAAG I IIIII I III I IIIUI I IIUI I I AAAATCATTT 
ATTATTATCAGGAGTGCCTTTTAGGrrGG^CCGCTCTGTATGAGTCTCATGCTTC 
TTTTATTCAAGTGACTTACAATGGCCCTAGGAAACAAGTTCTGTTATTATCCCCCATmAAAA 

TGATGAAAATGGACAAAG CAAAAGCAAGCAACTTAACCAAT ACCCCATG GCCTCACAGCCTT 

TAGAATAGTCATATTATATAAATATGGCAATAACAATGCACTGAAAATGTCTCCAAAACAAACT 

CTACATTTTAAAAAATGTATAACAGGAATCTAAGGAAGGGGTCTT ACTTCTCTGATTCAGGGA 

GTGCGAAATCCCTT AAACTC AT ATCAG ACCT GTG ATG AACAAACT CACACT AAGTTTT AAAAA 

CTGCTTAATTT ACTTT ATCATGACTAGTAAT AT AT AAGAATTTTGTAT AT ACTTT ATT AAAAATA 

ACTTTGGAAAACTATTTTTGCCTGATCAGCAATAAAACTACTGATAAGATAAGCTGGTTATCA 

AATAATTCCCTAGTGTTAATCTGGCAAATAATTGTTATAACCCAATCTTGTGAATTGAAGACAG 

GCACATTATAGATAATCAAAAATATTAGAAACACATTTAAAATGTCCATATGTTTAGTATAAAT 

AGAAAATTCATTGACTAAAATAAGTCATGAAACATGGACTGGAATTTTGTGAGCTATTAAGTTT 

CATTTTGTTTTACTGTTTTAAAATTTAAATAGTTTGGCTCATTTCTACAm 

TCTTAAAATATTAAAAAAATAACTTTACAAACACACAGATATGACTAGCCATTTTATCTAAGTTA 

TAAAATAGTTACCCCAGGGAGCCTTCAAGATCCCTATATITTAAAGCTTATTTCAAAAACCAT 

C^CTACTT(^CATTTTTAAACTGCATCCTrGC^TTACAGAACTGTAAATGTGTTGTCATACAGT 

AAATATCCACATCCCAATTCTAATTACACCATTCTTGTTGTTAACAGAGAGCACT 

AAAAGCATGGCATAGTTCTACAG TTTCAATG G CAAAATT AAAGGTTTTTTTTGTTG AAG AGAT 

AATGAAGCAATCACCAAGATATTTGAAACAACAAAGTTAACATTACAATGATTTCAAGAAAAA 

G AG AGACAG ATTTrrAAAATATTTGGATTCTTT AGCCCATATG GAAATTAAAAAAAAAAAAAAA 
ACTGGAGAGAGAAACAGAAAGAGACAGAGAGAAAGAGAGAGATGCTACTTGACATTTTAAAG 
ACCAAAAACTTGCAGTAGTAAAAATACATTCTGAAAGTATTTTAAAACTTAAAT^ 

CCACATTCCTGTCCTGGAGACCAGCCAATGTCAACTGATCTrATGAATCTGTTATTGCTTGTGA 

GT 

>830 

NNKNCGCGGCCGCGTCGACACGGCTGCGAGAAGACGAGAGAAGGGGGAACGCTC 
GGCGCTGCCGGGTGAAATCGTAGGACAGTGAAGATGCTGCTGGAATTGTCCGAGGAGCATA 
AGGAACACCTGGCXTTTCCTGCXrrCAAGTGGACAGCGCGGTGGTCGCCGAGTTTGGGCGGA 
TTGCTGTGGAATTCCTGAGACGCGGCGCAAACCCAAAAATCTACGAAGGCGCCGCCAGAAA 
ACTCAATGTGAGTAGTGACACTGTCCAGCATGGTGTGGAAGGATTAACGTATCTCCTCACTG 
AGAGCTCAAAGCTCATGATTTCTGAACTGGATTTCCAAGACTCTG MM I GTTCTGGGATTCT 
CTGAAGAATTAAACAAATTGTTGCTTCAGCTTTATCTGGACAACAGAAAAGAGATCAGAACGA 
TTCTGAGTGAATTGGCACCAAGCtrrrCCCAGTTATCATAACCTTGAATGGCGACTAGATGTA 
C^AGCTTGCAAGTAGAAGTCTCAGGCAACAGATTAAACCAGCAGTGACT ATAAAGCT ACACCT 
TAATCAAAATGGAGATCACAACACCAAAGTTCT GCAGACAGACCCAGCCACCCTGCTCCATT 



CGCAACATCAAGTAGTACCAGTrrTAAGGTTTTAATTCATTTGAATCACTTATGAAT^ 

ATACAGCAATTACTmCAAAATTAATTTTTTATTAATTCATGATGATAAATACATAGTAT^ 

CAGTATCTATTCCAAGATACTGAGGTCATAATCAGAAGCTAAGCTGGGTGCAGTGGCTCATG 

CCAGTTATCCCAGCACTrTGGGAG^GCCGAGGTGGGCAAATCATGAGGTCA 

CXnTCCTGGCTAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAAATTAGCCAGGTGTG 

GTGGCACGCATCTATX^GAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAAT^ 

CCTGGGAGGTGGAGGTTGGAGTGAGCTGAGATTGTGCCACTGCACTCCAGCCT 

AGAGTGAGACTCCATCTCAAAAATAATAATAATAATAATAAAGTAAAAATAAAAATAAAAAAGT 

AATC AGAA GCTAAAGTAAAGTTCCTTTCCTGGTGCTAACTGTGGTCTrCTT GACACATTAAGA 

TCT ATTT TGTATTTTAAGAGTCTCATGCTCTACCGTTQGAACT AGCCAGATGGCCATTATTTT 

GTATTTTAAATACATAAATAGGATTGAATCAACTAGAAATGAATCTATATGTTCTGTATATATG 

AATGACTATCTTGTTTTTGCTACI 1 CM I 1 GACTGCTTAATTTTATTATTTTCATCTTTATTGATC 

AAATTTGAAAATAAAATTCACAATGTAATACTACTACTATGCAGAATTTTCT aaacagttcagt 

atttt tgacttttaaaaacaccccacagtgttaatagccacagaatattg 

TTTTAATGCT ATATTGTTATAGGCAGTTTATTC A 1 1 1 I IC I 1 1 GT ATATGAAGATGAT AAGTATC 
ATATTGCCTAAGTTrGAGTGATCATGGTTAATT AATTGGCTT AAATAGTACTCAAATTTGTGTG 
GTCX5TAT ATTGT ATTTT ATCAAGGACTAATTCTT CCACCATACCCAAAGCATCT AGGAGACAC 
TCTGTCATTT ACATTTACXAATAATGGATGCAGAGAAATATAATCAATTCTT GATTGTCCTGGA 
TACAAATTATAGTCACTTTTTATAATTTGAATAAA^ 

TTACATTATAGGCAACTATCCCTTCTCAAAACATAACTTGATTT AIUI rTTGTTl I ATTTTTTAG 
tagggaaccaaaagcttttggtctgaaaaactcaggtgctgatttgtcaggtcagcctgaga 
gagtgcagagatactgaggtcagcttagggagaagaattgtatggaagggtggaaatgagg 
gttcagtcttaactatgtgtccaagaccttatcctgagagtcttcatggaataatctagaaag 
ggctgtagaagcaatgaaactctaaactgcgctctagactttgtcaatactttgctrtcgtaag 
aattagggattttcatatcagagtgataccatgttgcaatctcnrttagagaagagagagccta 
gcagggaataggctagtggaagaaaaacatacttgagatggttacccattggtccctgcag 

GCATCTGGGCCACCCTTTTCCTGTAGGTATATATTCTGCCGTTGTGAAACAACTATATTATTA 
GATGTC^GCAAACTATAATCAGAAATAATTTTTAATTATAGTGGCAACTTGATTAATGAAGCTC 

GTTT AAAAG CAAC ATGCCTGCAAAACAT ATG AT AAAAAAT ACCT CTGT aagg GTTTTTATTTGT 

cx:a<xttcgatggaaggtaaacatggctgatcccagccaagatgaaacaatagccctttgtc 

TCCTCAGAACAACCTATCTCCAACTC1TTGTCCACTGTTCTTACCCCAATTAA 
C ATGTGCT AATCC I ITTTT A ATm I ATCAACCTGCCTTGGCCTCCTTrGACTGTACTCCATTA 
TTCCTTCTGTGATGTTCTACC! I CI ICACCI ! 1 1 GAGACCAGGGTGAGTTGGGAGCAGTTTG 
CCTAOCI re m IAGAATCTGGGGCTTAT C I I I I AGTACAAGCCATTGAATAAGCCTCTTCCTT 
IM I M GCTCAAACATTCCACATCCTTGTGGATTCCCCT GCATTG 1 1 I GI 1 1 1 ATATAACATTT 
GATA HI GH GTAGCTTGT ATATGAACATA A I I I I C I I I A GAGGTAGTCACTGTTCTCTCCAGT 
ATGACCCAGGTTTCTT GACTCTGAGTAATGCACCTTCT ATAACTATCT AAATTTCTACTGAAG 

CH I I I ggattatgagtatgctgacttttcacgattggctggtgcatgtttagacttaaatgt 

CATATCCTTCATGTCrCAAAGCCAAAATAGTAACATCTCATCTCAGAACAGAGCTGTGACCAC 
ATGCCAATATATGTGTCACAAAGTCTrACATATGTTACATTCCTTGGAAGTCTCCTTAAATGTTT 
CACAAAATGTCAACAAGCTTGTTTrGTTATTGATATTTCCGAGATTG 

TAAACGGGAAAGGTG GTGAAGATG CT ATAAGAAGATGCTGTATCTTGAG AATTGAAAAATGA 

GAATCTGACATGGTTTGAAAAATCATGAAGGGTATATATAAAGGATGCATGTGTAGGAGCCA 

TTAAAATTCATAACAGTATGTGCCCCTTCAGCGTTTTAATCTTATGAAGTGGTTAAGAG^ 

GTCTTCGGAGTTGG^CAAAAGGATTTGAATTTAGGTTCTGTGGATTATTATGGGTCCrTGATA 

AATTCTCAAAGCC I I M I II I ? GTTCATCTCT AAAATAGGAATATAGGATTACCTACCACATTG 

ACTCGTAAGGACTATATGAGACCATGTATATAGAATGTTTAACATAGTACCAGGCTTTACAAA 



NNNNNNNNNNNNNNNTiMGGGGGGCCCTT AAATTrrTTAAAAAAAAAAACCCCCCCCCCCTC 
CCCGGGACGNNNNNNNNNNN 

ACATGTTACTGGGTATTAAATGCGTTCATAGTAGGGTATTAAATCAGCAAGGTCCCC 
ATCCCAGAAAAATGTGCAGTTTGTTCAATGGGAAAGATGCAGAGACAGTTTCAGTTAATATAC 
TAAGTGCTAAGATTGGGATGTGCACAAGAAGCT GGAG GTAAAAATTCTGG AAAACTGAACGT 
GAAGTCACCACTAGGCAAGCTGCCTCTAATTGAGCTTGCTTGTATATGACCAATCA^ 
GCTTGTTGAAGGGTTAGTTATCT AGTTTCCTC C I I II CI I H 1 1 GGAATTTGGT C 1 1 1 IAAGGT 
CTTGATAATCTTTCT AGTCTAGAGCATGTGAACAGAACAGAAGGAAAATCAGGACTCAGTTTA 

CTCTTATTAGCATGGATGCTTAAGAACTTCAGGGTTTGGAGCT 

GCACTCTN 

>832 

ACCCTAGGCAGGGACAGTCAAGAAAACTT CATGGATCT GT AGTGTAAAGCT AG GGA 
GAAAGAGGAAGAGATCCTTGTTTGAATTTCTGTAACTAGCGTATCT 

GCCAG T AAA GAT GAACG CAGATT AT T GATGGAAAGAACACACATGGAGAAGAGAAAAAGCAA 
GTCCACAGAGCTTTTT AACAT ACACTCCCTCACCCCT ACC^ CTT AGAAGGGCAGGAAC 
CTGCTCTCCAAAACAGGAAATATAGGAAATACCAGCTG^GAAACTATCCACT 
GAGCCCAGCTGCCCCTCTCACCTCACTCCTATTTTAAGTCAG 

CTTTTCTGCACCTGAAG^AGTGATGTCACTCCAGACTGAGTCCTT attagaggtgatgatag 
AGTTATTTTTAGACCTGGGAATGGTCT AAAACATTTTGCCTCAGCCTAATCATTGGATCCTTC 
AAGGAAATGGAT ATTCTGAT GCACAT CCCACCCG^TGTCCTATCAAATGGAACCTT AGCT CT . 
CAGGCAACTTGTTAGTTrGAAGGCAGTGACAATGAGGCCAAGAATGACT 
CTCTGCCATCACAGGCTGAGCCAGAGCCTCCCACATCAGAATGCC^ 
AAGTGATCTCT AAGT AAAACT ATTCCTGOCCTCTCT AGOCT Al 
AC! n 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 GGGTCAAGT AGAAATCAAACAGTCCT AATGGA 
GTTCaTaTCTTATGGCATTATAGAAAGGCTTAGTTATGAAACTATCTTGTTATTCTTACTATTA 
CATTGCCTGGCTCATATATATAAAGCATTTAGAGAGACTGTTCCAATAACTCTCATTTAATTG 
GTGAAAAAATTAAATATTGGTTTAGATACTTACCTAAATATTACTAGTT AAATT CAAAGTAAAT 
GAGTCrrGTATCTTTAAAACTACTTGGCAGTAATAATTTTTAAAAGTAGATTTTTATTG C't 1 1 f CT 
TGAACTAACTAGTGTTCATACAACACAGGTAGTTTTATTTGTGCCTGGAATTAAGGAGTGAGA 
CACATTTGTAAAATGTTCACAATCAACGCCTGTCCCATTTTAAAATCTCACAAG nTTTCTTCA 
TGATTAACACAATTCAC^AAATAAGAAATGGTATTTGGTCATTCTCTGAGTTCAATCTGTGC^^ 
TAGTAAATATAACTTGTGAGGAAAAAGTAAAAAGGTCAAGAGTCTAATTCA7TTTCAGTTTTTA 
AAACTATANNNNNNN 
>834 

actttttttttttttrrttttttttn igg1 1 1 1 1 1 1 atctgaccacttccaggaacaaa 
gccagggctctctgggcacctgagtatccattctctttgtatcatccattccatgtccagaa 
cacattcacatgcatgcttatagttoctcattgcctgaagcctgctgggtggggcatagtat 
gaatacttgccctc^twtccccatttcacagatgcataaacagaggccagtcag 
gcagactgtggatagagcccgaagcctcaggttaggcagcttgcatixiagctgtgagtccc 
agctaggggaactgagtcagcctocatcactccgtgtctcggttttctgacctctcaggtg 
ggtatcatgatgctggctttggagggtagctgtgagtattaaattacgctgatgcagggca 
ggtgagccccccaaattggggtttagcttgogagagttcttggctnnn 



A O I I I 1 1 1 1 H I 1 1 1 1 I III I 1 1 1 1 1 I I I 1 1 1 1 i a attcaatggaagaaaagtccagctta 
ataactttaatggagaaagaag gaag gagtataaatttgtggagactccaatcacatgtcctc 
cactctgctaccctgggcccaaaataagggaggagacact(^gagccaggtg 
atgggaatgtgatcaggtgcgacatgggctcacagoctcactgaggctggat u i inl i n e 

TGTTCCCTCTGAGTCATGGAAGTG7TCAAAAX5GAATCATGAGGGTATTTCGTTACTTT ACTT A 

CNNNNNN 

>836

NKN NNNWATTGTAAAAACATGCCAAAGCANAGCAGAGGCTTrTATTGCAGATAAGGC 
TGTGTTTTCGCTCAGaGACCAATTGTGT AGATG CCT AGGACAT AAATGGT G GG GAT CGCT AT 
TGAAATTAAATT AATTATTGTAAGTAG GACTCAGTTCTGT AACACATCT AATGAT ATGCTGCT C 
AGTTCTGTAACACATCTAATGGTATGTTTTGATAGAGATGCATCCAAGAAGTCAGATTCAAAT 
CCGCTGACTGAAAT ACTT AAGTGTCCT ACT AAAGTGGTCTT ACT AAGGGTAAGAAGCTGAGG 
AAAGCAAGCrrCTCTGCCTGCCTAGATTTGTGTGTGTTJACCCCAG 
AAAGT TA AAACTO AAAATGATACTTTTGCAAATAATACTTTGGGCAGATCCCACGT 
ATTTTAGATTTTTCAGTAATGTGTAAGTCTCTTTGCTAAGTACAGTGTTTAAGACT 
TAATTAAATATTTGAAATTGCTAGAG^TTTCAGATAGTTTATACATTTTGGT 
CTnTGTAACTCTTT ACAGGCTTAATATATGTTrrAGTGTTTTAAAGAAATACT AACTTTTGGGT 
GCTTTGGG^TGAAATTTITCATTATGTCGTTTGGGATTGAGAACATT^ 
ATATTTATATACCTCTGCTAGGTGTTTGTAACTTATACTTGGTACTT 1 1 HG I 

GTTTAAAGAACATGGTTGGTGOGGGAGAGGTGGATGAAGACTTGGAAGTTGAA 
AGAATGrrGAAAAATATGGCAAAGTTGGACAATGTGTGATATTTGAAGTAAGAGCG 1 iTTCTTl 
TGATGTTTATAACACAAGTTGTAATTGGCACATTACAAAACATTTTCT 

T ATGCT AT AAGT AACATTCTT ACTGCAGAAGATGACAGGCGTT GGTT ATTTGT ACTTCT CT^ 
CTCAATTATTACAGTTTTAAACATAAAGACAATGATTTCAAGTTTT^^ 

AATGCTTCATGATTGAGGATCAGTATGATGACTGAAGACCTTGATTCT AGCGTGCTCAGT AG 
TTTAGTTCCTTAGACATGCCTTTGGTTTCAGTCATTTGGTGAGTATTTAGT GCCTCTCACATG 
CACTGCACTGTGCGGAGAGCACTrGGATTACAGGAAGCATGCTCTGT ACTCTCAGTGGTGT 
GTTCAGTTGAAGAAACACAT AAAATAACTGAAAGAT ATTCAGTTACTAGACT AGGTAGT ATTG 
ACT AAGTTCAGGAGTTGAGAAAAGGATTAG^TCAGTGAAAAAATAGACTGCCTTGTGAGCAA 
TAGCAGATATGGGCTGCGCTTTGAAACGAACAGGTGGGATTTGCAGGGA^GGTATTCTGA 
TTGGGATAATGGCTCGT AGGAGACAGTGAAGATCAGGACT ACGAAAATGGAATGGATGGGC 
CTGTTGGGGATT AGCAGAAGTGCTT AGGTGGGTGGAATTACAGAGGGCT AGGAAAG CT AGA 
CAGACT AGTCAAAATGAGAGCTGATGTGAAAGGCCACT AGGGCACCAGCCTTTTT AATCTGA 
AAGCTCttaCTTCCTCTGTI fCI 1 1 A TCCTGTTGCTG G I I ti l I GACCACAT AATT A 1 rTGTTTAC 
GTCATCAGCTGGCACAGGGCCTTrATGAGGTCAGAGACTGCCTAGCCCTGTC 
cctaacaatgcctgtcataaaaacaggcattgagtacgtttttgatgatgaacaaattatact 

tatttccttggtatttgacaactacttgactgtatatgatx^taattagaatatcc^ 

aaatcatttaaaagagaaaactcattcaatgcaatctcaaatgccttt actcatcacgctctga 

ttttcttccagattcctggtgcccctgatgatgaagcagtacggatat^ 

gagttgaatcagcaattaaagc^ttgttgacttgaatgggaggtattttggtggacgggtg 

gtaaaagcatgtttctacaatttggacaaattcagggtcttggatttggcagaac^ 

GGCtUATTCGAAGCTTACGTAGAACAAAAACTCATCTCAGAAGAGGA^ 

cgaccatcatcatcatcatcattgagtttaaacggtctcc 

>837 

ACI 1 1 M 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 IGCAAACTTTAATAGUI 1 1 ICI I AGCTTGACAACT 
CATTCTCTATATTCACGAACATCTCCTGACTT gttccttcagtggagat acccttttct agcc 
AGAGTrGGCAAAAGTAGCAATAGCATGCATTGGCTTGTTTGAGAGGCCCTGGGTGAGCCTTT 
GTrG CATAAAGTAGGAGGTCTGTTATTGTCTTGGTAGCATATGCCTTCATTATAAGTTTGCCT 
CTrTGAAAGAATATTCAAAGACCAACACAAAAGAGAACATTTCCAGATCC^ 

GAGGAAAAATTCATAGCAATTTATAACAGCTGAGAAAAAGAG GGAGGATGCGGGAAGGT AG 

ATTTTGTCAGAACTT ACT AGACTAAGGAThTTATTGCATATTTTTT ACT AATT AAATGTTG 

TGNNN 

>838

>839

ACAGACAAGGGGGCGACT GGCATGTGG 1 1 lb 1 1 I CTGGTCTTGTAGTCGGTTTGGA 
ATTTTCTAAGTC^GGGTGGGGTGGGGGCIACTGTGCACGAGTCATGTGCAGACT GGAACCCA 
TCTCCCXXTTCXSGTCTGCAAGTTAAAACAATTGGGTTGTCCT^ 

CTTACTCAATCTTGGATCAAAAGGGCGTTGGAGG AGGAGGCTG GGAGGGAAATCCAGACAG 
TTCrrCCGCCTCTGACATCAGGTCCAGCTGTTAGCATCGTG CT GTGGGTCCCT GAACAAGAA 

gcaaagtwggactggtttggccaggtaggtgaggatccagtgttgggtgattctgatcca 
TG(^GCCCTTAGAGGCX5ACAC^GACGTGAACTGGACATTCT aggaagaaagagccg actgc 
cgggtgacctgtct a gttca catccactcaccatttccctcctcgttcctattcttagaaata 

AGACTCTGACGCIUICI 1 1 1 ATACAGGCTAGTCCCCTATAGGCATGTCATGGTGATTATTTGC 

AATCCTCCTGACTTTCCTAAGAAGAGATCAGACTTANNN 

>840

>841 

NlWhIGATAACAGAGTTT ATATACCTCCTGTCXXXATCTGT ACACTTAAAAATGT ATGT 
GCTGTTCTAATGCTACTTATTATTATTCCX7TTCCTTTGTAGAATGTATCACACT 
AATCCTGACTATAACAATTATTTGTTAAiCTATTAAAGGGGTAATTATACTCT AAGCTTCCAGTT 
TTCAGTTAAAACAAAAATGATTAATATGCCT ATACAGAACrTTTCTCCAGCACTTGGTAAGT ATT 
TTTTAAAGTGAAGTCT ATTCAGACTGCAACCAGTAAACTATTTATGCTTATAA 1 1 1 1 I C ICACG 
ATGGATTTCTGTTCCTTTGTTGCATTGTTTGTGTTT ATTTTATGTG A I C 1 1 1 1 1 1 AGCTACAAG 
GTGGGAAAATGACAGTGGTTTAGAGATAAGAAGCACATGAATGGAAAGTAAATATGTGGAGA 
TTTTTGGCCACT CTTGTAACTACTATCTGAAGTAGTTTTAAATANNNN 
>842

. ACTAGCTTJTTTGGGAAACTGAGGATCATAGCTCACTGGAACCT 
CAAGTGATCCTCCCACCACAGtrrrCCAAATAGCTGGGACCAC^ 

T ATT AATGT AG A I HUUII I G TAGATGTAG A I I IC I 1 1 I ACAAAGTGACAGCTTTTCAGAGCTA 
GTCCT ATGTCTGCAGTTTCTCAGAATAACCAGCrcAAAATATGCCAGAGAAGTATATTTTGG 
GTGGCATATTCTAGTCTCCTCCAGTCATATTTTGGGGTGGTGTGT CCTGAGCCCCAACAAGA 



TAGGTTTCATTTTTGAAAATTGCTCTTCCAGTCCCACTGTTCA 
CCACCTGTTGATTTCCTAGGCATCTTCTTGCTCAGGGTAGT AGATGTTTGGTGGACTAGAAA 
TGCAGGGAGAAGAAAAGGAAGGCTTGGTGATGTCAAGGATTTTTAAAAGCAAACT ATCTCAC 
TGTGTTCTCTCAATAGTC^CCCTCTGTGCTGCTCATTCCATGAGGCTTAAAGCTGATAACTG^ 
GGGACAAAAGGGTTAGGGTAACAATTTATTTTTGTCT<XTGACAATACACACAT 
GCTGTGGTTGGGGAAAATATAACTGTGGATTAATTCTGACTCATTGTTTTGG^ 
>843

ACI 1 1 1 1 1 II n 1 1 UN 1 1 1 GCCT ATTAATTGATT AGGAAAAATAGGT AGACCCT GAG 
TGAAAGT AGAAAAGAACCATTCTGGT AAAAATTCTGAAAGT AGAAAAG AACCTT^ 
AGGTATGTCTTAATAGAGCAGTGCTAAGACAGGTGGTTAGGTATGTGAATGCATGCCACTTA 
GAAAAGAATATGAAGGAGAAGGGACCAAGAAGGCAGATACATTGCCCCTGATAAAGAAGTC 

ATTTTTCTCTCACCTTTACATAAATATCAGCCACTAAAAATCTAGGAGCACAAATAATG^ 

GAACCCTGTTCGCTCTGNNNNNNNN 

>844 

ACAAGAGAACG<5ACGGCACTTACTGAGCCCATCGCAAATGTCAG<*CTCTGTGCT 
ACTTACATATCCCATAATCTTCAAGACCCCTCAAGACCCCAC 

ACT AACTCAGATTT ACTTGCCAAAG GTCACACAGTT AAT ACAT G G TGG AATCAG GACTCAAAA 
TCAGGCCTGTGTGACTCCAAAGTCCAG TGCTCTCTCCACTTTACCAGGT AACCTT CAT AATA 
CCGGATTGGAAATCAAACCTGTCACTTACTTTCT ATGTCCCT GAGTGAGTCACAACTTCTCTC 
AACCAGCNNNNNNNNNN . 
>845 

ACCAGGAAATTGGTTTGATTGCCATAGGCTAACCTTGGACCAATCACTGTGGCCAAA 
TACATGAGGATCCTTATTGGCTCCTTCT ACTAGCAACAGATG GTTTAGAGAACAGTGTATCAC 
AGAGAAATGGGGATCACTATTATAGGCAGATTGAATAATAAATGTTCACTCTACT ACTCAAT a 
AATATTTGTTGAACAAATCAAAGCTGATCCC 1 1 1 1 1 1 CAAAATTTTT AATGTGACTCTT AGGGG 
ATG GTG^TC t^GGAGAGAAQATTAGTGCCACACTGAAAAGAGAATTTGGTGAGGAAGCTC 
TCAACTCXrTTACAGAAAACCAGTGCTGAGAAGAGAGAAATAGAGGAAAAGTTGCACAAACTC 
TTCAACCAAGACX^ACCTAGTGATATATAAGGGATATGTTGATGATCCTCGAAACACTrGA 
GCCTGGATGGAAACAGAAGCTGGGAACTACCATGACGAAAOVGGTGAGAT AATGGA TAATC 
TTATGCTAGAAGCTGGAGATGCACACTGGAGCGAGGACT CT GAAGCTGACTGCCATGCGTT 
GTAGCT GATGGTUTCCXJTGTAAGCCAAAGGCCCACAGA^GGAGCATA 

AGTATCACAGAATTTATACTATAAAAAGGGCAGGGTAGGCXACTTGGCCT ATTTACT7TCAAA 
ACAA TTT GCATTT AGAGTGTTTCGCATCAGAATAACATGAGTAAGATGAACTGG 
TTTTCAGCTTJTTTGGTCAAAAGGAATATAAGTAATCAT ATTTTGT ATGT ATTCGATTT AAGCAT 
GGCTTAAATTAAATTTAAACAACTAATGCrCTTrGAAGAATCATAATCAG^ 
CTTGATCAGCTAflNNNNNNNNNNNNNNNNN 

CCOCCAAI rC COOCI rA OGGGGQGTTTTTTCCCa»0CCCOCGGGGCCGGNWWWM 
>846

ACI 1 t ACTTATTTATTTATTTATTTATTGTTTTACTATTTCACAAAA 
TCTTAAAATTGTT AGTTAAAATU I II ICI I IGI 1 1 1 CCCAAT AAAATGTAAAGTTTAATATGTGA 

TGGCT aaactcctaaggggataaggaggcgctaggagaataggcaggttggaaaagggtag 

TCGGGACTTGTWIAGATTTJTTGTGTGGTAGTCTGGGT AGTCTT5T ATATTTACCATATGGGCT 
ACAAGACACACACACACACACACACAGACAC^^ 

TGTGAGCATTTATTAATTC<K^AGTTGATGGTGCATAGTTTGCGGAGTGGGT AAAGGAT ATGT 

T ACTTTTGTAAGTANM 

>847 

>848 

GGATGI 1 1 r& I CAGCTGT ATGATTCT ATCATACTTGAAACTGATGAAAAATTTAGTAT 
CCTATGTCTTTTTT AT AGATCTT ATGATTT ATCAAGGGGATTTAT ATGTCTT ATTCTCCAT AT AA 
GTCT AAAAAATCACATTCAAATTCTTAGTGATAGGTTGACT ACACATCATTTGTTGCCAAAGT 
CATT ACCTCAAGCTTTGTAAATTCTT CTAGATAAAAGGAGACTT G T GATT AGAGAACT GAATTT 
CAGCCATTT AAACCTGTAGACATAGAAAACAACCCAGTGGTCAA GAGCTTCAAGAGGA 
AGOCCAAGTCCTCCI G 1 1 1 CCI GAATGGCTTTTCCACT CCAATTCATCACATATCT gct aaga 
CACATTTGCAGGTAGAATCCATAATAATATCTrGTTATTTATGATCTACCT AATATTTAAAGGAG 
AAGAAAAGAAGT AGCCTTCGTTGCAGCCAACATCCAACAAT AAGTCCCACCCTT AATGACAT 
TTCAAAGCCTTCAGATCACTTCTGACATGTCTGTCCA^ 

TTA AATAGoi lul l l AACTOGGTGGTATTGAGACTGT AACCTT AATGGCTCAGGT CTTTTAGC 
1 1 IG I ICI 1 1 G I CATAATCT GCACTCCTGGCCAGACTGACCAAACCTGCiGCT 
GAGATTTGAATTGATCACCAATGT ATGAAAAAT ACAGAACCCATG T CTG GCT CCTAAACTCCT 
GGTAAT AATTC CCGCCATCGTAGCAAGTGTCCCATGCATTCAGATAAAGCAGTCATAAT^ 
GGT GT ATTTTAOOGAATTGG I CCAGCATATTGGTCCT ACAAGGCTCAGCCAAGAAATGT ATC 
TGGTGCCTTAAACT ACAGA 1 1 1U I UG I GACTCTT AAAACTGGTTTACAAAGATTGAGAAAACA 

. attagattt^aagagttttgcttcaagtagacctgcattcatagct 
agcactgtagaatt tcaca caggscacaagcataacacca 

tgcatctattaaaccttttgagaaatgacattgactt atgcacattt aagcaatcaacaatgtt 
ttagaagtccaggt aacatataaaatatattcagacctagttttactttccatattcttgctcc 
aaagtaaggtaaataggagttttagttgtactggcctgtaagattttttaaattacacagtgc 
atgacaacaactaaagccaactcagtccttctcttggcattgtgagttgaggacctttag 

AACT GTTAGCT CA TCTG ACT ATCTGTCTAGACCAAGGGTCGGCAAAGTTTTCAGTAAAQACC 
CACAGAGTACAT ATTTTGG AGTTT GTGTTGTCT GT ACCAACAGACAGACATG G TGTCT GTCC 
CAACTACCCAACTCTGCCATTGTAGC^CTAAAGCAGCCATAGACA^^ 

CATGGCTGTGTTCCAGTAAAACTTrATTTACAGGCCAGGCATGGTGGCTCATGCCTGTAATC 
CCAGCACTCTGGGAGGCTGAGGCTGACAGATCATCTGAGGT^^ 
GGTCAACATAGTGAAACCCCATCTCTACT AAAAGTACAAAAATTAGCCAGGCATGAN 
>849 

CGGCCGAGGTACAAAAGTTCTGAAATAACACTATAGGCTTAAGGAATAAGGGACCA 

gaag tagcctggtagccagtgtatttctggcttt atacattccttaggaaaaaaaaaaacttt 

atagacgtatttaagtagaattaaggtttacagaaatgattttttgagaga 

gaoct aaacattcgttctacogagatagggtcaacacgcjigatatttatttagcagc^ 

CTGCAGAAGTAGGAGGAGGTGACCAGATGTGATGGATTATGCCTGTAATTCCAGC^^ 
GAGGCTGAGGTAGAAGGATT ACTT GAGCCCAGGAGTGTGAGACCAGCCTGGACAAAATAGC . 
AAGACATCATj^TCTCCA^ 

TTTTACACACAGGAAAC^CIW TCACAGGAAAt ^ 
>850

NNNACATTATCCACATTTAACACCTTTTCAGTTGATGGCTGATTATCTGA 
ATGGTGCCTCACTGTGGGGATCCCCTCCTCT ATTCTCAGATGTGGTGATGGT(DCCCTCGTTT 
TTCGACGTCCCTCCCGCTGTGGGCTGCAGTGTCATATGAATCTCGGAAGC^ 
GTCTTTCTA I C I ITTG1 1 CATAATCTTTGAGT AGTCTCTTCAC t IC I I G I 1 GACT CTTCTGTGA 
AGGCAGCTCTTC^CACAGCTCCCTCAGGACTTCCATGTGGTGATTAAGCTGG 
CCTTC CTCAGAAAAG CAGGCCT CATGTT CTTTGATGAGACCCTTGG I UC I I CC I CTACGGAT 
AAGTTTCTTTJCTTT ATTTATTTGCTTTT CAAGTTTTAAAATATTGTCACT AAGCTTTCCTTTTTC 
AAT ATC TA 1 1 1 1 1 AGTTGTT GAACATAT AAAGACAGTTCATGATGAAGAGACTCCCATTTGGGA 
CAGACATCTCT ACT C 1 1 1 ICI I CTAAAGACATCTTTATGTCCTCAGTTACATTATTTTTTAGTTC 
TTCAACAGCTTTTJAAATACTrTGGCCAGACTTTTTGTATTAGAAATTAT^ 
TTTGAAATGAGTTCACCGGGGCTCTCTCTTTGCCCCAGTA^ 
TGATTCAAGTTCCTTCTGGGATTCTTCCATCTTCAGCCTGATGTCO 
AGTT A AGTT CTTCAACCCTGCAATGAGTTTTACCACATTG^ 

TGAATCTTTGCCT GAAGCCCAGTTGTGAGAACATGTTGAAAGGACTCCT G^JCTTGAAAT ATTT 
TGTATATCCATTTGTAACATATCTTCACTTCTTG^ 

TGGCAATAAGATQCTTC^AAAGATTCTTCCATTGAGGTGTCC^CATCATC^ GATCCAAAACAG 
ATTTGGC I ICI UCCTCC CACATTTCCACTTG TCX^TGAGTTTTTCA^ 
TA GTTG CCCTGTGAATrCTTCAl nTCTiTI I CATGTTTCTCTQCAGCCTTT ATATTTTGATTAT 
AATTTTCAGGAAGTTCTACTGACATATCTGTTGAAAACTCAACAGTT ICI T I AGA 

TA GAAT ATTTCCA GAATTGT CAAAAGT GGGC TGATCCT G I M I IT 1 ATCATCAGTGGCAGGTT 
CATTTCAAGTTGAG 1 1 1 1 1 GAAACCAACTTTCTCCATCTTTT ATTCAG C C I I CIC AGI ICI I IA 
GAAATAGATGATCCAACCACATCATTGCTGACTTCGACTAAGAAA rrrccTGCTi CATTTAAA 
GTAGCGTGTTCTAGATTCCACTGGGCTAG 1G1CI CAAAGGGTACCTCTTTAA 1 1 I CI I CCI IC 
TTTGTCTCCTCAAAGCAAGCCCTTAGTAAGCGA 

CAGCACI 1 1 1 Hal AG AfiTGG ACTTCACATTATATATATTTTTTCTAT ACATACAAACATCAGAT 
TTCACCATCATATACTGTTTATTAATATTTJTGACATTCTCCAGCCAAAT^ AT AAA 1 1 IC I UC 
ACAI 1 1 1 1 GAAAAGAAGTATCAAGTCGAGCTAQGAATT C 1 1 1 I I CI I C AATAAATTT ATGCCAG 
TCTTCCAGCAATAATTCCACAGATTCTCTGCT CCCATATTTAATGTTCCAAATATCCAATTTTG 
ATTT CACTTC ATCTACCAAACCAAGAACT AAGCACTTGT AGT AATGAAATTCTAGAAGTAGAA 
TAAATTTTTTCTCCAAAATGTTGTTGATTCGTCTTTTCATT^ AGGTGGTAC 
CAATGGCAAGTGATTTTCATCCrrTATTrTCAAAGGTAAGGAGAATGTTCGAATGATGCT 
TCTATCCATCAGGCTCTTGAATAAAGTCA 1 1 1 ICIC I I GIA TCAGAGTCACGGCTTGAGAGTG 
ATCCTGGGAGGCTGACAAA7 CTTCATCCATAAGCTCTTCTACCT CCTGGAGCCAAGCTTCAG 
TrTGATGGAGGGGTT^GGC«CAAG<iCATAATTTAGCTTTA 

TCGA GGCCATOCCAGGCT I CI CT CAACTGTAAATQATCCTTGTCCAGCTCATCCAGATCCCG 
TTTTATTGACAGGACATCCAAAAAQGAC 1 1 1 1 1 1 IC I I CATTGAATGACTCCAT AAAGGACAQ 
CAGGCT ATTATACTTTTTAAAGTAGGTATCATTCTCTGAATCCTTTAGCAAC I fCTCI AGTTTT 
TCCTT7TGCAGAG77AACCAGCCCA7AGCATCT77CACCT77CCCTGAGCCTCCTCTCCaGTC 

ccaggggcatctttggaatactgcagaaactgtgccacataggtcatgatggacttttcatc 

aggatcaacaacatccacatcttctggttccagcaatctggggattmaattcttgttctgc 

aattctgaagg cctctctcagattgtctttgttggatct atgctt cacactcttcatgtcaatt 

aggtctggtcgcaaggcatgaatgatggccaaaaaagccatcccatttct ccaacttgactt 

aaaatcggtcacattgacagactgataggtggcgcattgttcctgagccgacaaaagaagg 

gcctttcttgcagacatttgccatcttgcttgcacttt agagcatttctt agctggaggactt 

gaggcaggagatgagtcaaccacactcacatcatccagggaaggctgattgtaattgcaag 

aaagagtctgggcaagcttctcaatatgaaagtgcaggataattgtccaaattaggccaagg 

ataatggatgggtttccatcaatg^tatcagtaacatgaatatttattagcttaattgatcggt 

ttcnrtaggaatgtcaaggcatgttctatattgattctacactggaaggtattagatc cttta tc 

tgtctgtgaataggtcggatataactgagggagaagtgtgcctggccaactgtgagtttatc 
cagcacgtgaaggci i i ci ictgggtgtcttcctgttcagcttgcaatgaaatatggagatc 

GTC^TGCCCCAGGAACCCTGTTCATCrrCGGTGGGAAGCTCAGGACrrAGATGCCATTCTTT 

GGGCAj^GrfcG^c^G^ 

ATCGCGCCTTTTGCTCGCTCTCCCTCTCGCTTTTTCAGGTCCGGCCGAAGGCTCTGGGCGC 

AGCTGAGCTCGTGCC 

>851

ACCTATATTCTATGCAAAATTTATAAAATAATCCTTGAACATGAAAACTCATCTT AAAA 
TTACACGAATTAAGTAAGCATGCAATACAG ACACTTGCAGGATGCCTGGCCTCTGGGAACT G 
CTCCTGTCT CTGTGTGAATGTAGAAGTGAGGCTCAAACTCTCTCTT AGGAAAATTTTCCCTTC 
CCACTGCCCATCCATTTCTGCTGACTCAACAATTCCCACAGAGGAAATGGGAATAGTATCAT 
CAACTAGCAGTCCTCCCATGCCAAGAGATTTGGGGTCCTTATCTAAGTGTTTCTGCAGCCGG 
TCTTCCCTT CCTGACTTCCCGTATTGGCTCGTTAAAATGATTAGCTGGCAATACAG GTATGTT 
TG G ACT GCT ATTG GTG GTGAGTTT AAT CTTCT AACTGTGTTTTGTGAAAGG AAAT ATTCCCT A . 
AAAGCTTTGGTGTCACTTAAAAAAAAACAACTATATATGATTGAAAGAAATTTGAGATATTTTT 
GTTTCAACAAAAAGCACTGAGTTTATGTCTAAGAAGAAAATTCAATAAGCATTTATCAAGTGCT 
TAGGATATGCTGCAATGTATGT 
>852 

ACTAGCAGATGATGGCACAGTGACAGCTGGGAGGGATGGGATGTGCTTGCTTCATG 
TCCCCTCCCCT CTGCCTrGCCTCAACCTACACAGTCCTGTCTGGTGACGTGCCAAAGTCCTTC 
CTG CCTTGCAG AGAG G CCTCTCTTCGTCGAACATG GG CCTCAGG AAAGACAGCCT G AATGC 
CACTACCCCAGGCTTGTGGAAGGTTCTGCATCAGTGTGGCATTGTTGCG ATAGCCCT CAGTT 
GATGCTTGTTTGTGGTGtTGGGAGGCAGGAACT ACTTT AGGAGGGTGGAGGGGTGAG AATG 
AGAGAGGACTTGCCCTGAGCCACCCAGCTGTGGTCACCTGATGGCCCGGATGGCTACATAA 
ATCCTGGGAGATCtXSTTGTCCTCATAACCAGAGTGAGCTGGGCTCCAGACCAGCCCTATGG 



NNNCACATACATACAGACTAACGCTCAGCATAAACTTTCCATTACACT^ 
CTTGTGGAGG AAAAACA AGGATAAACAAGAGTCTCAAGAACTT AAGAAAAACATCAGAGTTG 
ATTAtTTAGCACTTTCTCAGGATTCTAAGGCAATAAGCCT aattcaaaacgtgaaattgttctc 
T ATTTCCCATTAGTCATTAAATGAGATAAATGACAAGCTATTGCT GCTTCTCCATTCT GTTTTC 
AAAGAACATTACAAAAATAAACCAGTGTGTTCTCTAACAGTTCT AAAAACAGTTTGAN 

accagaagcaaggcagtttagggacaaagggcatgagctt agagtcagatttccta 

GGTTCAGATCCAAGGATCACTACTT A 1 1 II CI 1 1 AAGAACTTGGGCATCTGTAAACCAGGGAT 
AATATC 11 C 1 1 CAAAGGGCTTGTGTGAAGATTCAACAAGGTAATACATATAAACGTCACAGAN 



>856

NCCTCAAGTGATCCAGCCACT ACAGCCT CCCT AAATGCT GAGATT ACAGG CAGGTA 
CTTGCACGGCGAGCCCACTAAATTCCrrCTTAACCTCTCAAGTATTTTTAT^ 
TTCAGAAATGT AAGTAAAATAGTGT AATTCAGGGTGTATAAGGAAGCtTGCAATGTAACAAGG 
C ATATTG AAAAAACCCTTT GTACAGAATACAGCATAATG AATACAACAACTAGCATCAAACTC 
AGTGTATATAAGAATGG^ i AA GTGA CCA i i AGTCA i uTGAAAAljC i i AACAAC I A I J AaGC » C 

TTATTTTCTTACT AAAAAACAATTTTAAGTTCTTTCAAGGCTATAGTTACGCTTTACATAAGAG 

GCCCTATTACCCACTAATTCTTAAAATTTCTACCTACTTAAAATTTCTrrAGACATTrCCAAAG 

GTTAGTAAAGGAAGACATAAGAT ATGCTT ACTT AAAT GCTT GCT GGTTCCATGCCTGGCCATA 

CATTCCATATGCAGGAACTTACCCACCANN 

>857

ACCATGAAATAGG ACCTTCT ACGG7TT AAAATAAATGTTTGTTTTTTTCT AGCCCTGT 
AGGTCAATGAATGCCTGACTCCAGTGAC^GACCATAATTATCCAAATCTCTCATTTATGAATA 
TGGAATATAAATATGCrAAATTGATTATGTCATGAATAGACTTCTTTTTTGC^TAACAATGTTT 
GGAGTTTCTCACCTTTCTCCTAGCCTTC I I H I CI 1 CCTT AAATGTAGCCTGG AGGATTCCT A 
TCT ATTCCATAN N 
>858

NNNNAGTCGAC<>(^CGTCCCGTTTTGTAGCAAAGTTTTCCACACAGGTCTTATACA 
TGTCCTGTAAAAGGTATTCCTGGATTTATATATTTTTAGATTGCTATTGGTTTCXJCATCT^ 
TAACTAATTG^TTATTGTTGGAATATAAACATATTTGAATCTTTTGTTGTTACATACCCT 
G ATGATAATTCTGCTA I MCM J CrCTTAACTGAGAATTCCTCCAATTTAAAACAAAGCCrrTTC 
AATGAATCCCTCAGCAAATGGAGACTGAGCAAACACATATAAGCTACAGCCAATAGGTCAGG 
AGCCA GGTGGT AGAATAGACTCTTTTTTCAAAAGATGGCTCTGGCGTCTCCTTTAATAGTAGA 

catccatttttctggagtggtacatatgcagctcttgctagaattaggtgcctggtctagag 
ctactacctctgccagagtcttctgaccctgtgacagaagatttctgtgtgtgctccatggg 
gaagactctgcagcccaatctaagctgttatttccatttggtcaaacagtatttcagtggcat 
ctggtctggcatgagaaatcaattcagtcaggtctcttctggaagaatattctaattgattac 

AGGTCCCTATAGTTGTTCCTTTTGGGTTTTGATTTTTCAAATGAC^AAGT ATATGGGA 

aggggctggggcctgacttttgcactgggcctgagttgtggtatgcattg^tc^ctcttat 

TGTTTCTGAACTTTCTTGACCACATTCAGATTTCACTCTGGGATGTrGTAAC 

GTTGCAAC^TGGGGTAAGGTTATTACGAAGGAGAGACACAGCGATCAGTGTATAAATCCAAT 

CTATATTGATGGTCTGGAGAAGAACTCACATTrGTACCACAAGAATCATCTrCCTTGTACTAT 

AACCCTTGAGT AGATG ACACCTG CCTTTGGC AAGACTT G G A GG AG AGACACATG CTGGTCA 

TTTGACCAATTTCCAGTCTCCTACCTGGGACCCTGGGGCT ATTCCCTrACTTTTTGTCATAGG 

CATTAATAGTCAAATAGTGGGATAGAGCCAACAGGTATGATTATAATATATATGTTGTATTTGA 

AAATTAGGCTCTGAAAAGGAGATATTAATGTGGTGTACTTATGTATTTCAT ACTTT AAT AATGA 

AGTGACCTTATCTCTCCAAAATGTAGCCACTGTTAACTGCTA7TTATGTATTATTCAGTGAGTA 

TATTACTATCTrTGTATTTTCTACAAAGATTTATCAGCACAATTGTGAGAAATCT^ 

TTAGGGTTCX^ATTAAGTTGGTAATAATGATGCACTTGGCAATTGTTTATAAAGGGATCATTT^ 

AGAAAAAGGAGGAAAGTATTTTTTCCCTCTTCAAAGTGTGACTTGTAGTCT^ 

1* 1111 1 rTCTGTTTA ACCGA GTAGGGCTTTGTTCTCAATGCAGAATCTCCTGTGACCCTTGA 

CATTITAAGGGATTAATTTTACAAGTCCiCATTAATAAAACT ATACT AAA 

TACCI J CI 1 1 I CCTT ATTATACAGCTCAT ACATACCCATATCT AAAATT ATTTTTGTTCAAACTC 

CAATTATTJTTATAGGATTTAAGATGCTATTrAATTGGGTAAGTCTGGCCCACACC^ 

TGATAAAATGGTGAAACCTCAGAGTTAAACAGGCTACCCT GTTGAGGGTGATCT ATTTGGAA 

AGGAAAAGAAGGTT ATGAT AGG CTGTGAAAACTTG GAAG G AAGTT AAGTCAGATGGGATGTT 

TGAGGGTGAGGGAAAAGAAAGAATATGTTTGGTAAGAGC^AAGGAGAGCXX^TTCTATAACT 

TTAGGTGAGAGCTGATATCTATCAGGGTTTGAATATAGCTCCTACAAATCATTCTGCAATAAT 

TCATCTACACGTTTGAGCACTOCAACTAAATTTTAATTACCTCAAAGGTA^ 

ACTCAATATTTTTATCCCAAGACTTGCTGGGGGCACn"GGCTCACAACTGTAA 

TAGGAGGCCGAGGTGGGAGGATCAGTTGAGCCCAGGAATTCAAGACCAGCCTGGGCAACA 

TAGTGAGACCCATCTGTACCAAAAAAAAAAAAAAANNN 

>859

actggctggacttgaggtggtttaagttggcagctacatcgaaggacttctgaaaa 
gctcaagtgacagttacacctttgc^ctctccac^ttcagct 
gataatcttcaaacctccctgaacaggtggaaatgcgtcrrrtcctctaag 
ccacattagtc^tacttggctacagaattgacg 
ctttgggtaatatccttatcttgttg^tctagttagggtcaagtaaa 

ATGAAATACTAAGAAGCTGACAAAATGGAATTTAGGTGAAAACACACCAAAGTAGTATTAAAG 
TTCAAATTCCAAAAGAAATACTTGCATTAAAAAAG AAAAGCGATGCATGGTGGCT ATGC 
CTGTAGTCCCAACAATTCGAG^TGGGAGGATTGCCTGAGCCTGGGAGGTCGAGATTGCAGT 

GAGCCGAGATCGTGCCACTGT 

>860 

ACTTT ATGCAGAAGGAAAGCAATTGCAGATGGAAAAAGCTGAGATGCT AT AAGGAAT 
TACGGATTTTATAAAGAGATCACCATGTGGGTGAATGTAAATATAGATGAACAATGAAGCATA 
AACAAAATTTTAATATCTTACAGGCTAAAATATTTAGAAATGAAAGACAACAATAGCATATAAG 
TTAAGAAAGGGGGTAAAAAGAATCAAGAGCATTCTAAGGTCCTTATATTACCTGGAAGGAGA 
GTAAAGATAATGACTATCTTCAGGCTGATAAATTAACAATGTATGCTGCCATTN 
>861

CGTCCGAAGAGGTTGATAGTTTATGGAAAGAATTGGTCCGCTCTCCCAAAAGGTGCT 
GGGATTACAGGCGTGAGCCACAGCACCCGGCCTCCTTCAAGAATTTTAATAGTAAAATCTTA 
TAAGTGGATGAGGAGAGTAATTAATATATAGAACATTACCAATTATAAATACATATTTAAGTAT 
AATTTAGTGTTTTCT GG AAAG AAATCAAATGTGTTTCTGTGGGCATTC ACA I rTTCTTTTTTG T 
GAATTTTCCAAGCTTTCTACXATAGCCATACATTCTGCAATATTTATAGTCTATAACTTATT 
TCT AG TTGACAAACATG GTTCTTTGI 1 1 C T T AAATT ATAAATGTCTTT AACCT AAAAAAATT AAA 
TAGTC ATTCAGCTTACAGGTCTGCTr AAAAA ACAGAACATTTTTTAGCTT GTTTTG AACCAATT 
M IT I AA TGTTCCTGTQCAATTAGTACTTTTT ACTTTAAACATATGCACTTAAAGCATCCATTCA 
CATTTT AGGCA CTCA TTTCAGTA 1 II I CI 1 1 CTGCTTACATTAGAACAAAAGCAAATGTTGGGT 
TCTATGGAATCTTTTCX^TGTG(^TGTATTTGTATCCTTTCTTGTTCrrT GTGATCTT GTG ATGA 
AGLA CTTCT AT ACT AAGCrTTGGTTTAGATTTTTTGTTCTGC I III 1 1 1 1 1 1 1 1 M 1 1 1 C CCCCATT 
GAGCAGTGTTCTGAAAGTTGGATGGTTCAGTTTTCACTTTCT^ 
CTCCTTTCATTTCCCCCATGCTGATTGAGTGTTTTCXrrGTACCATAG 

AAATCAGAATTCATTTAGCTCACX^CATCTCTTGAATGTGATTGACCT ACTTG AAAAATATTAA 
TACCTTT AAC IT T T GGAAACTATTAGTCTTC^TAAACTACCCCAAAGCCACT AAAATGAAACAA 
CATTGTGTGCTTATGTAAGTATTATCTAATGGCATTTTCTTTATCTCTTGAGAATTATT^ 
CCAGATACTTGTTAGAGATTTTCTAAAACTTTTTGGGTCnT^ 

TCCTT CTGGTGACTCCAC^CTTCX^AACX^TGTTCTAAAAAACGCAAAATAGAAC AG GT AGATAA 
AGAGGATGAAATTACTGACAAGAAAGCCAAAAAAGCCAAGATTAAAGTTAAAGTTGAAGAAG 
AGGAAGAAG AAAAAGTGGCAGAAGAAGAAGAAACATCTGTGAAG AAGAAG AAGAAAAG GGG 



TTG CTGT ATQ i IIG1 I I 1 1 AATTATCGGTTTTCACTTGGAGGGGCCAGTTCTCT ATATTTCAAT 

CTATTTTCTATATCAGAAATGAGCAGGCATrTTAAAAAATGGTTTTCATTGATGGAGAGGTAA 

AAGTGAAATGGCTTrraTTGTATTTATATTATAAAAGGCCATTTCCCAAATCT 

TAAA AATCAA GT TTGC ATTGAGGGGAGGAGTATGA7TTGCTCAAGCTT A C 1 1 II 11 1 ATAGGT 

G GG GTTTTT ATATTTTCAATGTGATT ACTCACCCAT ACAATTTCAG CTCTTTCAAAGCATTCTT 

GTTGGCTATTTTC

>862

NNNNNNNNNNNCGGGGTACTTCTAGAATTAATTAACGra 
CCATGTACATTAAATCTGATACCATGCAAGGTTATATGAAAATGTCTATCTAACAAGTAGCTTT 
TTATGTTGCCTATCATT ATAGTAAAGATTTAAAACT ATTTCT GT GTTCT AGTGATTCAAGAAGA 
CT GCTCAAATTTGCAGATATTrrCAATTTACTTAAAATAGCCAAGTTATGC 
AAGAATTCACATCAAAGAGCTTTGATTTACAAAAAGTGTACTT^ 
CAACTGGATTCCCAGATACGAATGCTAGAGGTAATAGTTTTGATTTTAAGATGGT 
ACTTCCCTTTCAGGACATGTCAACTCAAACTrAATCTGGAATC^ 

ATACCTGAGTGCCAGTGGAATATAATTTGGAAGGAATAAOGTTGTTGAAAAACATCCTCTACA 
GACAATATGAACAATGCCTTAGTCATCTATTGATTATGACAATATACTC^ 
TCGGTTCTGGTTTCTGTGGTACT AATANNNNNN 

AC TACACCTCACCAGCTGGGTGTCTCTCAGACGTTA^CAAGAGACAGAGTAAACCC 
ATGCTTTCT CCT ATCCAAACCAGTCTCTCCTGTTCXXTGCTTT GTCCAAACCCAGTTGCAGGA 
ATTTATGTCTTAAAGTAAACCtATCGT ATGAT AATTTCCCCTGAAAATGTGCCT ATTAAAAAAAA 
ATAGGATATGATGGGAGGCAGACATAAACATTCTGGTCAATTTATTGGTGTTAT^ 
AGTTAATAAACTGCOCTrrCGCTATGOTItAXJtTTCGACGTGTTT AGGCAGT 

-NAGTCCCCATTATCAAGATTTAAAAGAACACT aattaatttttjacactgcatatctcat 
TCCAACTTCT CTGAG GG TT ACAACTGATTGCAAAAAGTACTCCAT GCT CT GAAATGTAAGGAT
TCATTTATGAGAGAGTGAA CAT ACTGCTTGTAGCT AAAA CATTACAGGAGACCTTAAAAAGGG
GT AT A ATTGGT CCCT ATGTG AAATG AA CCTG ACAT ATTTTT AT AAATT ATTTGTG CATGACT AT 
CTTTTGTTGATAGCACTAGGAAG ACTTCT AACGTTTAAATACTTTATTTGCCCT CAATTACT AT 
TT AAAAGTCCT ATAATTTTAAGT AATTTT ACAGCTG ACAAAGAT AAATA I I I 1 1 H C I 1 1 1 AGTT 
TTTCTAATGTCTTGGAGGTAAAGTGGAAATGGCCTGTTTTGAC^CATAATTTCTAGAACT^ 
AGTTAATTTGATCAGTT ACAATTGGGG 1 I I I Ml I CAGTTTCAAGTTTTGTGGTGGATACCTTT 
TCCTGCNNN 
>865 

ACATGTTACTGGGTATTAAATGCGTTCATAGTAGGGTATTAAATCAGCAAGGTCCCC 
ATCCCAGAAAAATGTGCAGTTTGTTCAATGGGAAAGATGCAGAGACAGTTTCAGTTAATATAC 
TAAGTGCTAAGATTGGGATGTGCACAAGAAGCTGGAGGTAAAAATTCTGGAAAACTGAACGT 
GAAGTC^C^ACTAGGCAAGCTGCXTGTAATTGAGCTTGCTTGTATATGACCAATCAAC^TTT 
GCTTGTTGAAGGGTT AGTT ATCTAGTTT CCT CC 1 1 1 ICI MM T GG AATTTGGTCTTTTAAGGT 
CTTG AT AATCTTTCTAGTCTAG AG CATGTGAACAG AAC AGAAGG AAAATCAGGACT CAGTTTA 
CTTAATTTAAGCAAGCATTGGTTGCT G CAGTTCAGGGGAGGTTAAAGTTGCTGGGCTCCACT 
CTCTTATTAGCATGGATGCTTAAGAACTTCAGGGTTrGGAGGTCAGCTGAACAGCTGTTT^ 
GCACTCTN 
>866 

NNNNN NNNNN NNNNNNNNNNAGGTACTATGGTATGCCCTAACTAAAAAATAGATATA 
GGATAGTGATACTTTGATGAGGACTATGAAAAGGGACAGTACGGCTTAGTGGAAAAAGTTTT 
AAQTTTTCT ACT GTTATTGAAT AAAATT ACAT AT AGTGTGATTCTn" ATT ACTTGAAATTAGGAG 
GAGAAAGAA1 TT7 7TGAGGTAAATTTGAAAAGACATAAAATAGACTACCCTGACAAAAATCTT 
C ACAGATTAAAAA TACTAATATTTGCATTGTCATGTATATTACAAACAGTATTCCTTGTCTnT 
GCTTTTTT GTTTTGT GTTTAGTGTTTCTTGCTATATGAAATATAAC ( J CT TTATG CAGCCT AGA 
CI I II M ICI 1 1 GTATTTCCTGACCrrTGTTGTCATGTTTAGAAAGCCATACTCT AAGAAAACT A 
AGT
>867

NNNN f<WNC ATGCAGCCAG G CT AGACCG GCTCAGCXXX2ACTTCAAGACAAAATCTCA 
GCAttXATTACTCACCATACATATTTATGWGTG^ 

CTCATAG 1 1 1 II I ICT CTGGT AGAGATAATTAACTTATTATGCTTGATCAGTTAACTCTTGCTT 
AGAAATTTAAAAAAT ATTTTT AAGTGACAAATTCTTTGTAGAAATTTTTGAAAAT AGAAAT ATTT 
GAAGTAGAAAGTTAAAATCACCCACAATTCTGCTTTTGTTAACATTTGAA 
TGATATATAACAAAATTTGTCTGGGTATTGCATATGTCGTCCTTTCCTTC^ 
TGAGCATTTAACCAGAACACTAAATATTCT<XXrTAGAAC>TATGGATTTr 



GGTATGAACACnTAAGG^CTCTTGACGACATACTGCCATACTGCCATACTGGC 

AAAAAATAATTAAGCN

>868

GCGTCCGGGCTAAGCATCCGCCGCTCACCTTXCAAAGTGCTGTGTTACACAGGCCT 
GAGCCACCCAGACCAACCTG 1 1 1 1 ICI 1 1 1 MCI 1 1 1 AGCTCACAAGAAGGAATNNNNNNNNN 



AATTATGATAGTTCATTGAAAAAGGTTGAGAATTGAGAACTTCTCA g 
CAAAGATACAAT ATGACAAAGTTCACTrTGCTTAAATGAATCT AGGAATGAAGAATCT AGAAAT 
T ATAATGGAGAGGTGATTAGGAGTTT AAAATGGTTTATTGATTGGAGATCCrrrTATCT GGATT 
ATATAGGGCACACTTTGCTTTAGGAATTCCXXX7rTTGATCTTGGA^ AATTGTA 
CATTAATACTGTATTGGTAANNMNN 

NGTACATT AAATT AAGCATACTAAAGAAAAAAGGAAT G I M 1CT l AGCAATTTAAGAA 
CTTGCTTAAAAAGAAAAAAAGATCAACCACTCCCTCT AGT GACAAAAATTAG OCACAAGATGA 

aattcagttaaaattccaaacactgtggagatggaaagccttgt ATTTTAGATGAAAGGATTT 
atggctggaattaaaagaaattaaaaggcagaaaagtgggtgaa 

GTTTTTAAGTGTT AATAGCCAC HIM GTOCAGTCTGTATC I CC 1 1 1 CATTaGTCTTTATATATA 

tatatacacacacacacacacgtatgttatatatacatataatggtttatgtattatatatggt 
at atatacactt atatgtt atatatatgggnnmnnnn 

cccttggccgcccgggcaggtactaatattcttcaacagaat
CTACAT AAATCCAAACTTGGTTCAAAG GT AGCTATG TTTTTTT AAAAAAGTTATTAT AACAG AC
AAAGCANATGCAAACTTATCCTTCCAAACCCTGATAATTGGTAATAC^WVTAACTGGTATCT 
AATAAATATACAAATCAAGAGAATACCTrGCTAGCTAAATTAAAAAAAAAAAAAAAAAACT ATC 
CATACTTAACAACCAAGTGCAACWTGTAACCAAAGTGTTCCTT AGCTCCCGCGN
>871

HATGTACAAGGGCTTCTTTGGTGATAGTTTCTACTCTCTTTAAATACTGTTCTGTTATT 
TTTGAAATCTGATCAAGAATTGACACAATAAATCTCTTTGATATTTATACTTATGCCTACTTTTA 
ACCTTTTAGGAAAACTTTATGAATTGGAATATTCTrAAAATCXrrGAAATAATTTGGAATATTCTA 
AAATTCTGAAGAGAAT ATGAACGG ATTG TTGGAATGGAACTTTT ACCCGATTCCCTCA GACT A 
GAGTGTTCATACGACATTTTGCCAAGAAGTTCCTATAGAGGCAATATCACTTTTAGGATGGAT 
GGGTCTAAAA^aGATCATATTTAGTTTCTGGTTATTCATGGTTGCACTCACTTTAGAGGATGTG 
TTCCTATTAGCTTGCTGCTACTATMTGTCTCTCCTAAATAACAGTATGGAATTATAGAAAGAAA 
GGTTGGGAGAATAGTCGTGTGANN 
>872 

ACAGTTCTGTG Mill CAATTGAT ACAT ACT ACTT ATGT AAGAAAAAT GAGT AAAAAT A 
GAGGGOCACACAGGCAACAGCCATTAGGTTATGCACAGAGAAGGAAAAACTTCAGAGGTTG 
TGCTGCCATCTTCTGGAACAAACAAGAATCTACAGGAACAGAAACATGATGGAAGAACAAGG 

gttagttactgcaacgaaaaaacatggcaggaaaaaaaaccattttgaagccaag(^^ 
ttaaccatgaatgaaaacaaatgggaaaacaacaacaac 
aagaat gagcaaat acagaaatt att aatgtttt agan n 

>873 

gtggaaacaaggactttgcacagc^ccttccag cccaacatttcccaggt aaaggg 
ccttogcrrgccttgaagaccattatccctggcccaa ih1i g 

CGCCTTTTA I C 1 1 C 1 1 1 G AATACTT TGAGATGCACAGAAATAGAAATGAGACTCAAAATAAGC 
gtcatatcaaaagggaa i inn IGTAAGAATTTTCAGGTAACTCTCTAAATCTAGATAGGAGA 
TTGGCGATGACTCAGAACAAACCAGAACX^GGGTCTCTCT GTGTCTCATCTGTTTCTCTAAG 
TCTGCTTCATCTGCUJ I C 1 1 GCrrGTGATCACCTTCCTATGAGTCCAGATATTACAGAGAGTCC 
TTGAGTGCTTGTGTGTTGTAATGCAGCCCCATCTACXTCTAAATGCATA 
AAACTTGGC I UC I 1 1 UCCCACCATGGGACCGCTGAGCTCCAGCGAGGTGGGCAGGGTCTG 
GGAAAGTGATTGAGCTGGGCCCAGTTCAGGAGGTG CCATCTGAGACT ACAGT ATCAGGGAG 
GCGAGGACCATGTGGATAGCAATGGGGGACATTCCACT CCAGAAGATGGAGAGTAGTTTCC 

acagaagcaaagcaagacatggccactgacaaaataatctatgtggaagcttgttataatga 
agatgtccaggccatgtctgcagagatttggatccagtaggtct agggtgggtccaggaat 

CTGAATTTTAACAAGTACtrTGAAGAGGTTCTGATCACACTTTAAGAAATATTAATCT 

ggtgcg^gtagcttaagggagtcgtctagagtcnn 

>874 

f^NNNGCGTGGTCXSCTTTTCGAGGTACTGAGGATGACT 
AAAAAATG<»CATTGACTITGTATAGAACTTAATAATCAG^TTTTTAAAGAGG^ AGTCTATTCT 
CTT ATTTGAGAGAT ATGGAAACTATCTAGGCCT AAAGACTGTAAATCTGCCTGG AATCAGATA 
GTTGGCAGCAAAATCAGAAATAGAAAGCAGTTACTCAACAACCAACA 

ATTTGACAAG<^TCTCCTGTGGATAAGACCCTATGCAAG^TGTCATGAATATAAATATGCACA 

GTAGT

>875

NNNNNCXTTGTGTGTAAAACGACGCGCCAGT AGCCTTGGCGATCGCT GTTTCCTGTG 
ATGGATACGACCGCAATTCGCCCTTATCGTGGCCGCG GCCTATGTTCTTT AAAAAT AACAGA 
GTGTGATTTAAGAAT ACTCAG ACT AGAGCCTT CAGTGAGTT GTCTGAGGGAAAGGAGTGAAG 
TCAGGACTTAG^TAGAAAGATTACAAAGAAAGTCAAAGTAAGCAGAG^ 
ATGACAGGTTCAGAATAAGCAGTAAGGGAATAAAGAAAAGAAAGTTGTGTGT GTGTGCATGT 
ATTACATGATAAATCGATGGAAAAAGAACTCGCAATTTACT AAAGGAATAATTCATGGTCATA 
CCAA rTTCTGTI GTCGAAAACTAACTTGATTAGT ATCAGAAGGAAAGTCAATGTTTAAACAGTC 
CTTCCCACATCTGCTACTTCCATAATGCCT ATGCAACTGTCAT AAATTAAGAGT AGAGAAGGG

>876

GTGGAAACAAGGACTTTGCACAGCACCTTCCAGCCCAA 
CCTTCCCTCCCtTGAAGACCATTATCCCTGCCtXiAAATG . 
CGCCTTTTAICI ICI 1 1 GAAT ACTTT GAG ATGCACAGAAAT AGAAATGAGACTCAAAAT AAGC
GTCATATCAAAAGGGAA? 1 1 1 1 J IGTAAGAATTTTCAGGTAACTCTCTAAATCTAGATAGGAGA
TTG^CCATGACTCAGAACAAACCAGAACCAGGGTCTCTCrGTGTCTCATCTGTTTCTCT AAG 
TCTGCTTCATCTGCCCTCTrGCTGTGATCACC^ 

TTGAGTGCTTGTGTGTTGTAATGCAGCCOCATCTACTTCTAAATGCATAGGGATTTGTTGCCC 
AAACTTGGCTC^rTTTGCCCACCATGGGACCGCTGAGCTO 

GGAAAGTGATTGAGCTCGGCCCAGTTCAGGAGGTGCCATCTGAGACTACAGTATCAGGGAG 
GCGAGGACCATGTGGATAGCAATGGGGGACATTCCACTCCAGAAGATGGAGAGTAGTTTCC 

acagaagcaaagcaagac>tggcx^tgacaaaataatctatgtggaagcttgttataatga 
agatgtccaggccatgtctgcagagatttggatccagtaggtct agggtgggtccaggaat 

CTGAATmAACAAGTACCTGAAGAGGTTCTGATCACACTTTAAGAAATATTAATCT 

ggtgcgggtagcttaagggagtcgtctagagtcnn 

>877 
>878 

ACATGTTTGTAAAATTCCTT aaaatattt atgctcaaaccaacatttccatttt ATCT AT 
CTTAAATATATCTTCGTCTTCrrTACGCCTAATTTCTTAAACTCCCAGAGTTT^ 
TCTAGTCATCTGTAGCACrrTCTCACAAATTAAGCTCrrCTTATGCCC^ . 
GGTCTCTTAGTTGGACAATAAGCAGTGAAAGATATTTCTTATGGGACAAGAAATTAACATTAT 
TAGTCAAATGTTG ATGCCGGT AGGCTGAG AAATGATTCTCACTTAAAAGCCCCTGGGTTTTA 
AACCTCTCTT AGAAAAACATTAGTT AGATG AAAANNNNNNNNNN NNNNNNNNNNNNN 
>879 

NNNNNNNCAGGTACAAGGAGCT AGATCATCAAGGAAGGTCAGGGCAGGGTTCACA 
GGATGAG GGCA CTTTGCCATTCTTTTGTGATATTGGTCAACAAATGACACAOGTTATT^ 
TCTTGACCTTTTGGAAAAGATACAGCAGGTAATAGCCTACAGGAAAGAGGAG^TAGAAAACA 
AGTGCCACAGTAGAAACACTrTGATAGCTAAG^TGCTGTCTATCCCTTGTGGTTATTCTGTGC 
AGTTGTCTGCCTGGGTTCTTGGAAAGTCCCAATCT AAGGGTGCTTGATTGCGCCCAAGGATG 
TCTGCATTCATTCACTGGGAACTTACAAGCCCTCTTGC^ 

CCAAACTCT AATCCTTACT AGACTGCG AAAAATT ATTCTCCTTTTT ACAGCAATGGAAAAAGG 

CCTTTGTGGTATCAATCTGTGTTATGTGTATATTATTAACCATGTATTTTTATATAGTTGT

>880

ACATACAATAG^GTATTATTCAGCCTTAAAAAGGATGAAAAAATCCTGACATGCTAAA 
ATATAAATGAATGTTGAGAACATTATGCTAAGTGAAATGAGCCCATCTAAAAAGGCAAATACT 
GTATGATTTCACTTAACTGTGATATCCAGAGTAAACAAATTCATAAAAACAGAAAGTAGAATA 

gaggtttccagggactgggagttacttgatatagagtttcaattttgtaagataaaaaagttc 
tggatattggttg cacag caatatgaatatacttaacactactgaactgcacacttaaagatg 
gttaagatggtaaattttgttaggtgtttctraccacattttacaaaaaa^ 
aattacaaaatgtacaaaatact attc^tc^ttgtgtttccagtttatattgaacacaggagta 

TTTCAGGT ATAGTAATTAACTTACT ATCATTTGAAAAGATGTCTATAGCTTAGTAAATATCCAA 

CTCTTATTCATACATTTTGTGATTATCTAAN 

>881 

ACCACTGCACTCCCACCTG^TGACAGATCAAGACCCTGOCTAAA 
AAAAAT AAAAAATTTAAGAATATTTCTATGCCCTTT ACCAGGCCAGCTT AATCAGAC IT CTC1 A 
G GCCTA GG ACAG GCTT AAGA TCAGT TAATTTAAAACACTTCTGATGTTTCTTGAGCATTGAAA 
AQTTTTATTCI I I C IU C I I U TGTTTTCAATCTTTTTGTQ I I IUIC I H I ACTAAGGCTAGAAACA 
CGTATTTGGTTTGGTTATCTCAAGT1TAATTGCATTCATTGTGTTTATAGT 
GTGTTGGAATTACCAGTCACTTACATTCATATTTTAGTTTTTTGCCT^
>882

AAAATAGCTAAGACATAAGCAGTGT1TCACAGAACCATCCATCAG IUI 1 1 1 1 1 AGGATGTAGC 
AGTCTT CCATGTATCACTTAACCAATCATTATTCTT ACCCCA I C 1 1 1 1 1 GGGCAGGGGGT GGT 
AGAATTTAAAATTTAOCATTACT AAGACAGGGTGATAGT aagcatagaattttgggatgtctt 

CCI I GCCCTAAACCTTCAGAGTTCTGCCAGGTGATTCAAATGTTAAGATCCCATAAT 

CTCGCCTGTGTGCTCAAGCGAACACTAACACTTTAAAAAGT 
TTGAATrAGACACAGTATTTGGGCCCCATCrn-CAATTTCAGAA^ 
GGCCATTGCGGCCTTCTGTAGTCATACN '
>883
ACTTTGATACATGTAAAGTGCAAGGCACCTTGCTAGAGAGCATAGGAGCTATACTAA
GATATAGAGTCCTGCCACAAATACACACAAAATAACATGAATACAAAGTGTCCTAAAAGTCAT 
GCCAAATAAAACAGAGCATATAACTGGGCAGAGGGATGGAGAGTCACATGCTGGAGGAGGT 
GAGCGTTGA£^TGGTCrrATGGGATATGAACTTGAGATGTTGAAGTAGAACTGAGACATTTC 
TGGAAAACTAGATGTATGAACAGAAGCAGGAGG AAT AGGAGAAGGTTTGG AAAACAG CAAG 
CAGCTCAGTTT CTTGG GTGGTCCAGGAGAAGAAGCTCAAACAACAGTCAGTGATAACACTAA 
AAAAATCAAAAATTrTAAAAGTCTGGAATCACAGCATAAAGAACCCGTATGCAGGANNNN 
>885 

^P^ ^TAAAC AAG^CAGTGCCTGCTTGTGACCAGGGGCTGGGCCTCTTCATAGCTCT 
TTTCCCTGCCi r I 1 GTCrTCAGAGTTGATCTGCTTCTT ACACATTCACTTTTTCAG AGTTTGCT 
ATCTT AGAAGCAAGGATCATTTTTAATTGGTTTGTTTACTTCAAAGTCCCACTCATCA^ 
AGGGTTTCGCTTATATTTGGCT CAACT ACTTTCTCTTGCTTG GTTT AGT AACACT AATGTTTAC 
T A ^^ AAAATGAA(VC <^' n ^GCAGCT^ 

^f^^^^^^^CAAATTXA>AfiCATfiXXCAK\GCVJMrJQP>ACfiT\CT AACATAATAGCTT ATGAT 

AAAGGAAAATATAACATCTGGTTTTGGATCTGAAGCACAACCACTGCN

>886

acatatggctcggcaaagggggactggattaataaattctggtaatatagtaaggac 
^^^^^^^Q^^ ^^^ ^t ag^a gtaaatgtgagaacatcaacatgaacgcgtgctcctttg 
A G^^^ TAA li I _' ' ctgctttgtcactcaaatagct ggcagacct gacatcaccctgcct 
ctgcttccatgctctaaaactttcctgggcct 
agtg^taagagctccctggagaagggctcattcttggatggacaacaga^ 
agtct agagct aataaaacaaagacaaagaagggatcacgcagaaagcttggtaaagactgt 
cctggccaatctgattacagtcagttggtan

>887

^^aocgatgaaagtttaaatctaatcaacagtattatgcactggt^ 
attaagacggaggatagtcagcatggaatctaagaaaggaaaagtccggtaactatatot 

NNNMNNNNMNNN 



^t aaaca ggccagatat attctctcattaacttattgcct agcagagaagaccaaca 

HJTJff^^H 7 AT WT AT ^^^^CT ATT ATGATT ATATG>T ACAAATGGAAAGTGCT A 

TGAAAATGTO&AACAAAAGAGAATAATCTGT^^ 

A^CTGAGCTXAAGGTO 

^^•°ctcacaggccacgttaaa^ 

CTCAAG^TJMAGCTTGAiG^^ 
CAAGGAGTTOGGTCTCTGACC^^ 
TAT ATCCTGT CTTTAANNN

>890

J^^^^^^CAAAAXfKTKTT^CAAGAAJ^AAGCACACTTGVT ATAGAAGTGCTGAAT 
TCTATGGAACCT AAATCTGTCAAGTTACCXTrGTCTTTCAGGGTDCXSTCT CCCCAOCTCCCAG 

CGGACT AGGGG<5CAT(^TCTGCTGTT AAGAGGGT GATGACTCGCTAAAAATGAGGGCCTGA 

T AATAGGGX»AGCGCt>GACAGGTAGOGOGGAGCTCACCATG04 

AC CACTTCATGGCTAAGtUTGTCCGGGATGGAACCGGT LI lia.n^f^TrTAr^Tr-r 
JTGCTTTGCCTrniCTTJCCCT 

J^JICACT^ACCC^ 

GCTCAGCAAATCTGAGCTATTATTTITATCACGACTGTCAGA<3GTCA^
TCAGACACACCTGGGTTCAAATCCCAGGAGGGCCACTTACTGTTGGAGCCGGGGCAAGTCA

GTTATTCrrrCCCTGAGGGTCAGTTTTCTCATCCCTAAAATTCCAACTAATAATACT 

AATGAT GCCG G GA G G TCTT AAAAA T AAT AT AAG TT CAGAATG AT AAAACAG G CTG G CACAAA 

TTGiGATGGCAGCAAATGTCCTTGCAGCCCTGTGTCTCCTGCCTTAGTTTGTGTTGAGGATTA 

GGTCAGATAGTGT

>892

act acagaacaggaacaatctgccatgtgtgtttacaacttcagaaagccct gg aat 
g acagttgocagggcagttcttctgaatttgcaggtcagaattagtggatgatgaa i 1 1 1 i i i 
cacacatggtcaactctgtgccacctgctacaagatgttggaacaggtatatttatttattta 
atgatgatcaatgattcttccaacatcagggaacatcagggaaatcagctagtatatgctctt 
tttgaggattttcagctccaaatcctgaaagcattcatgaaactacataaattact 

GCAAATC^TC^TAAGTAAATCCAGTCATATGAATCTGGAAGGATTTGCTGGTGGGCACTAAC 
ACT GACCACATGTTT CAG T GTGGG CAAG TTTACCATCX^TC^CGGATTTTGTGCTTGGTGAA 
TTGTAGGGAGTGAAAGAGAGAAGGATGTTTGGCCCAGTTGTC I Mil iacctatatctgaaat 
TCTCACTT AGTCAAGAACAAAACATTTAGACATTTAA 1 1 fOCI 1 1 1 G GGGTTNT AAGTGAT ACA 
TGTTTAAAANN 
>893 

ACTAGCATTAAAAAAGTCCTACAAATTATTAGAGAGAAAATACAGGTTGCACGCAAA 
GCATAAAGAATGAGAATGGCATAGACATCTT AACAGTG^CACAGAAACTAAAAAGTAGTT CT 

gagtaaaaatgaactatttacccagccaaaccgttaattaggtataaaggtagagttaagaca 

tttatagacatacaagatattaagattactgagtcaattgatattcaacaggggtgcaaatgg 

agaaaaagtcttttcaacaaatagtggtgggacaaatggatagccacatgcaaaagaacata 

t at ataagagctaaaaccat aat gcttttagaagaaaatatagggttt atcttcatgaccttga 

atttgacaaaggattcttggacatgacaccaaaagcacatgc^^ 

gatatgattaatan 

>894

acaggtcacacagcacatcagtggct ACATGTGAiaCTCAGACCTGGGTCT gctgct 
gtctgtcttcccaatatccatgaccttgactgatgc^^ 
gtcctgctggagcccagagcacggaagcctgg^xxh'ccgaggagac 
gacaccatgacgagagcttggcagaataaataacttctttaaacaattttacg 
atctggaccagtttattaaatgggatttctgcx>caaaccttggaagaatcac^

>895

ACAGGTCACACAGCACATCAGTGGCTACATGTGAGOTCAGAC^ 
GTC FGTCT I CCCAATATCCATGACCTTGACTGATGCAGGTGTCCAGGGATACGTCCATGCCC 
GTCCTGCTGGAGCCCAGAGCAOGGAAGOCTGGCCCTCCGAGGAGACAGAAGGGAGT GTCG 
gacaccatgacgagagcttggcagaataaataa c i I CI I IAAACAATTTTAOGGCAT6AAGAA 

ATCTGXiACCAGTTTATTAAATGGGATTTCTGCCACAAACCTTGGAAGAATCACAT 
>896

ACCTTGAGCT GCCT CAG CACTCTTTl G CCATTCGT GCT AGAAACAGCCAAAGCCAGA 
CAAOCAAATT AC AGATGCTTAAATXaTT AATG{X?A/^ <^C CA>SGGCT CP GTGAACTTCCCT G I f 
GAACATCTGACCXXXiACTACTTGAGGACATGAAAOT 

AAG^^CAATGACATCGGGTCCTATGATTTTATTCAGGAAAGCAATAAGGCAATCGGG 
TGTGAACATCATTTGAAGGGAAGTAACTTGTTAGCTTTATTCCACAAATC

>897



ACrrGA^AGATGCCTGGGTAACCATGTCX^AATGTTCXATTT ACTTT CTGCTGGACAGA 
AGAAAGCACTGACATTTGGAGCCTCATCAGGTTCAGA 

ATCT I IU I C I IAI 1 1 I CI rrCI I T I AATGTTTCOCAACTT ACGTTAGGACAATGTCAACAAAGA 

CAGATGTCCCT AATAGT AATTGCAGGACA I GTGTTI I C I CA TTCCTATCA

>900

acct atgagatgcatttgaaaacttaccttgtttatat g t 1 1 c 1 1 c ig 1 1 gcaatttct 
tccattacctggaatagctgcrrrbgacggcaaaccaagcaa tgocc 1 1 1 cacagctgtggg 
atgaatggggaaagaagtcttggtaaggaagcaattcagagaacatgggagcatctcatgg
CAG ^GTCAC AATTTTGTGTTG CXiT aat atttcag GAACTTGCAACCCT GATAAC7TGTG OCT

GCCTGTCTGTAGGCCTTT AATGATGTTTTATTGAATTTTGGN 

>902 

nnnnw4ngagtttttaaaactgtacttctatacaaggcaaaatgaactctaagtaaaa 
aag aaaatcacacttctaaacacaaattaaccatttcagtatttaattgctcct aaaaggtgt a 
ttctacttcattaaatgtaagagaaaaggttacct acatt acgcagtttaag aaacaggataa 
actttaggat at aaac agtct gatt acattttcacactttcaaccatcttatttat act ct acat 
tagataatctttaaattccatcataaggtttcccatgttaactccatata>aattttgtaatcct 
g ccca cgccatg tcaact cag tgt at act act act aagcttca g act caaattt atttccaaa 
ccaaagaacgctcaagggtcttttcgcatg ctgcaact gctcaattcacagaatcaccccta 
agggtgaaaaaacagagcatacaaagcagac i rm i icaaccactctttaacctaatattca 
taataatttaagagtaacaaataatatacacttac 

>903 

actgggtgacaggagagagctcatgtgacccgagtctgggtggtctcaggcatgg 
t ataaagaact ag gccaa ccaact g cact agacat a gaaact ag ct gaat aaactcatccact 

CCGATTTT^TTTC^GGTATCTCATGAGAAACTAGAGGACAAAAACAAT^ 

ACAAAGTTTACTCTAGCCATCAGTGCCAATGAACATAAATGACTGCCTGAGAGTTATATTAAC 

AAAATAATTAATTCAG ACGAATTAAGGAATT AAACCAGCT ATGGGAAATATACACT CT ATACTT 

AGATGCACATTT 

>904 

ACTT AAAT AAAATAAAATTAAAACAAATCATTTTAGAGATAAAG AGTGAAGTTACTGAA 
AAAGGTGACTAGGAGTCTGTTTATGAAGAAAGGTTAGTATTTAAATCATGAAAAAAAAGTAAG 
AATACTTTATTATTCAAGTAACTTAAAATTGTAATTGAGATTGGCTTTTATGGTAT 
TCTGGGCTGCTATAAAAATrCAGTCAACTTCTAAACTTCCAAACACAAAATAGTTATACTCAGT 
CTAAGAATATCCGACCTACCGTGCAGGACCAGAGGGCTCATCTCTTGCCGAGCTTATTACAG 



ACTT AAAT AAAATAAAATTAAAACAAATCATTTTAGAGATAAAG AGTGAAGTTACTGAA 
AAAGGTGACTAGGACTCTGTTTATGAAGAAAGGTTAGTATTTAAATCATGAAAAAAAAGTAAG 
AATACTTTATTATTCAAGTAACTT AAAATTGTAATTCAGATTGGCTTTT ATGGTATCT AAAACAA 
TCTGGGCTGCTATAAAAATTGAGTCAACTTCTAAACTTCCAAACACAAAATAGTTATACTCAGT 
CTAAGAATATCCGACCT ACCGTGCAGGACCAGAGGGCTCATCTCTTGCCG AGCTTT ATTACAG 



ACCTTTGCTTTAAATG^TACTAAGCTGTGAATGACTGATATCAGAGAC I MCI I GGA 
AGTAGGTTTCATAGGATGGAGGACAAATGAAACTTTATGGGCXaAAGAAAGAAGGGTCAGTTG 
GGTGGTGCATTGAAATAAGTGGTTCCAAAAGCAAACT AGGTCAACTTTTTAACTGGCTAGTG 
AAAATGAGATTCCTCAGGATACAAAAGCAAGGAGAAGACAGGAATAAATCAGGACTCX^ACA 
GGCAGAACAGGATTTATTTAGGGCATGCAATGTGGAGGGCC^AATGGGAACATGACAGTG 
TT 
>907 

ACAAATTGGATTGTCAATTTATATTTGTTTCCCCAGT AAAGCCTCCAAACCTTGCTTG 
1 1 1 IGI 1 1 AAGTATOOCTGGGGCTCATCACAGGG^XrrGTTGAAGTrCTTTTGAAATGAATTGA 
AGAATGTGAATAATAGTTCTAGTTCTTCGGGATAATGGAAAG CT AATAAGG TTTATGCTAGAG 
GCTCTT ACTGCTGGGACTCTC I ICI IG III I 1 GGTTTTTAGGAAAAAAGCTAGAAAATCCAAC 
TTCAG CTAGAGTAACAGT AGTAACTGACTTGAAAGTATGTCAAAAC

>908

ACCT ATGAGATGCATTTGAAAACTT ACCTTGTTTATATGTT T CI T CTGTTGCAATTTCT 
TCCATTACCTGGAATAGCTGCTTTGGACGGCAAACCAAGCAATGCCCTTTC 
ATGAATGGGGA AAGAA GTCTTGGT AAGGAAGCAATTCAGAGAACATGGGAGCATCTCATGG 
CAGCAGT CAGAATTTTGTGTTQCGTAATATTTCAGGAACTTGCAACCCT GATAACTTGTGOCT 
GCCTGTCTGTAGGCCTTTAATGATGTTTTATTGAATTTTGGN . . .

>909

>910

TGGCTTTCCTCACTGCTAGTTATAATTCAGTTTTCTCAGGTCTAAGTCAT^
Tv, i GCi 1 I tCAGCl TC CAAAAAH Ca 1 1 GCT ATTATCTCCT CTCCTGTTTTCCCT ATG GTGTGT
TTGTGTCI I II ICU lAAAAAAATTCCTrrGTGGTGGTTTTAGGGGAGTTTTTGGGAATATATA 
NN 
>911 

acaacctagccagctgcacagcagctctccaagaaaaaggtgtatattagacagatt 
caattattcatcttgtgattatgagtagtaaccaaattgtctatgt aattttcttatggtgaac 
tacccaaagcaaggcxttcaarrraggctaccagcttgactcttaag 
aaggctaaaaggtttgtgagaaacctcatgagcactgagtgttctagttccagatgaaaacc 
ggtttcaggtatgaagc^agagggagtgctaattggtagaagtaattacatcttt 

>912 

acaacagagcacaatgcttagatttgggtggatttgaataagatgaaagataaattat 

GATTTTGTTCAAGTGTTAAAATAAAACT AAGACACTTAAGGACCACAAAAATTTAGACCAAAG 
TATCTTGTAAATTCTACCTGGTGAAAGTTTGAT ATAGCAGACAT ATGACTTTT CT AT attatttt 

ctgttttgagtttagtagtaagcag atggtttgta I men iagttgcaactaagtgatcagt 
ttcatgatttctcttactatgaaacai 1 1 I 1 1 I m m I ici I aacagttatcttaaaagcaactc 
tgggtctactgggctcaaaaaatg agaaatct aaagaaaaact ggaa i mm iccccaaat 
taaatgagtcctggtataatttaaagcctattctttgtagtttcaccn 

>913 

accacaaagttattgcctacatccaggtcaagaagatcttct actgtattttcttcta 
agagcttrtacatataggtcaatgatcaatctaaaattaagagttgtgcaatcattaactctag 
ctttagactggtatactaattggtttgtatacgaactgggttaaaggcataggacacatgca 

GGCTGTGTTCATTTCACAGCAGGGCTCTGTAATTAGGCAATAATTACTTACCATCATACCTAG 

TGAGG GAAT ATGGGAGAAACAAAACAGGCCATACAGCTTCACT ATT ATTCCT ACT 

>914 

naccactgnnncacgca^x«:aatttcataaaatgtatcagtaaaaaa 

gttttttaaagtaaagaaaaattatttatgacataggtaacttggtgtcaactaggtaact 

ctat7ttaatttaggaagttagtgtcttccttcctcaattttc 

CAAAAGGCCCGAG AGG CTCTCT ACAAGGAGAAAGCAAG CCAG AGAATCTGA
>915

accagaaatggtaaatatatgagtaaatataacaca ci muc i n i aaaattttattt 
aaaaggtaacactttgcagcaaaataattaacaatgtattgtgggttatatagtagtaagatg 
tttg^cataaattacataaataattggagcagggaaatagaagtgtgttgttguuatggtttg 
atattatatatgaagtggtatattattatttcaaggtagccttgataagttaaaggttacatat 
tgtaaaocctacaataatcattacaaaataaagagatataacagtaagn 

>916 

acttcatagaggtccagaccccttgcgtctggcattcctttggtct ataattcagta 
aactctgctaaaaaggaaacgagactagcttgctgtggccccttaagcgacccaggg 
ttgtgatggttcagattatgatttgttctagagcttttccag^ 

TCCT ATTTGT CCCCTTCCCTTT AAACAAACAAAAGTGCCGGCTGGACG CAGTGGCTCATGCT 

GGTAATCCCAGCATTCTGAGAGGCTGA<3GCAGGCGGATCACCTGAG 

>917 

ACTGCCTGGCATGCATCTTCTCGATGGTCnGTTATCTTGTGGGAATG^ 
AGTTGTTTTC1 GTGTGCATCCCACCCAAAT AAAGAATGTTTCATCAGCAAAGTGAATTGCCGT 
ATAGTCATCAGACTCTAGAAATAAATTATCAACGATGACT GCAGTGGGTGAGGCTGTTTGTTT 
ATCACATCACTTGAGAACAGAGT AAAGTGAGTTTCAT ATTTTCCT GiAGTCTTGAATTCTCATTT 
TAGACATCTGTTCAGAAGCTTTCTAAGCCATGGAGTATTCTAAATGAG 
>918 

NATAGG GAGTC GACCCACGCGTCCGGGGGAGTAAGGTGGTCAACTATAAACAAGC 
ATTTTCAGTGTATTTTTAAGGTTCATAAATAAAAATAAATGTGC^GGACAGAATTT^ 
TTAAAATGATTAAAATGAAAAAGTATTACAATGCTAAAAGATTTATGAGATGACTATTTTGCAT 
TGACT AGACTAAAAGTTGATTT AAAATATTAATATGTGTATT ACAGTGCTGAAAATTCTT AGGA . 
CATTTAAATGACATGGTTACA GACTC TGTCTTTCCAAGtACTTAACAATTTjCA 
TGATGTCACTTGAGAGGCATATTTT ATTGAATCXSCCrGTATCAtCTGG^ 
CGCCAGAATATCTACAAAGTGGTAATGCATTAAATATAGGGCTTTTG " 
TAGTAGGCAAACTTGTTGATAACCCCACGCTCTGGTTCTATTATTGTATTAGCACCGGCTATG
CTGCTTGACATCTTAGAGCATACTGTGCTACTTGCTGTCCCAGCAATCAATACTGAGTCACAT

CTGGATGGCTTGAGCAAGCCACCTATGGOGAACCACGCAAGAACCTTTTCTGCTGCCATGT 

ATTT AGGATATGG TTGTG TGTG ACTTGAGACTGCTTG G GAG CCATTCT GAACATTTTCAT AAT 

GACTTCCTCCATAATCAGTCTGGGAGACCGTAGGGTATATGATAGTTCTTAACATGATTTGCC 

TTCrrGAACTAGTTCTAAAGAAGAC<XATGTTGATCTGAGTCTCATACTTTAGCT 

CAAGGGAAACACCATTTATTTGAGGGAATTATGTCATTATTTTTTGCTAAC 

AAATGCATTAAAAAAAGACAAGTGACCACTGCTGAAAGCTGTTCCCAAAGCCATATGCATAC 

CATCTGGTATTT CT T CCCACGGCCACTGTTTCAGTATTCTGTGGCAGGTCGGAGGGAATCAG 

GGAGACATTTCATACTG<X^TGAATAAAAGAGCAGGGAAAGTGAGGACATACTTTGTrTTCA 

AG G GG AAGAAAATGATTTTCTT AATATAAAGTTATTGGTAACTTTATAATTGTAGTACT AATGT 

>919 

ACTT ATCAGGATGAAATCAGAATCACAGTTGGCCTTTTG CCATAAGGGAAGGGTATT 
T GGAG AAGAGTCAACCACCACTCATGCCTCTCCCCTGCCCAGCAGCACCTTGGATTTTGCTG 
GCTTTATGCCTCCTGTTTCCCCTGGCTGAGTAACTGCAGGCATTAGGTTCCTCTACACACGA 
TATATTACAGGGAAATGGCAGCGATGGTCTGGAAGGGCAACACTGGCCTTCTTTCCTCCTGA 
GCACTAAA^TCCTAAACATGCAACTTAAAAAAAAAATTCTAAATGTGAACACCACCTTTCAGTA 
ATTT ATATT AATGT ATCAT CT C^CCCTCTTTTCCTCCTCTT C^AACG(XJCTTCTTTCT ACCCAA 
ACTCCAATATACCAATTTGTTTGAACAGTTTTACATTCTAAGTGTCCAACT ATTGCTAAAGGAA 
TGGATAAATTGTTGT 
>920 

ACT GCT ATTTCT AGTTCAAAATGACAGATTTTCAGATTGAAAAAATTTCAATCCACTT A 

i M ( I CAAATGAGATAACTGGG^CAAAGAGAAATTC<^TGACTTGCCCAAGATTACCT ACAGT 

TTAACT GTCAGCGGGGCTTAAAACCACAATCGACATCTCCTGACTCCCAAT<XTTrCACTTAA 

AACAAACAAGCAGACAAACAAAAAAG ATTTCT AAT AAAG T GGAAT AATTTT AAG AAAGGCAAG 

TATCACT ATTTTACAAGGAAAAAATTAAATCATTTTAACAG^TTG^CAAAACATG AACTAGTTC 

TTGGGGGGAAAAAAGAGAAGTCTTACAAGAAAAAATGTAATCAAGAGAGTGCCAAATTCGGT 

AAAATGCTT 

>921 

ACTCACATGTAAACTTCTACTTTOCX^TTCAGATTAGAGCAACCATCATGCCAAAGCT 
ATACACTCTCAGGGAATCCCTGTGGATTTCACTGATG ACCACTT GACCAACT ATCAT AAAGAT 
CAAGGCXiAGGGGTTCTCAAACTCTCAACATTTGTGTGCTCATCTCCCX^ 



AAATTTCCAAAAGGTGTTAATAGACACATAAAGATTTTTAAATATTAAAAAAAAGAAAAAG 



NNHA CTTT ACAGAAATA CGGAA AGTGACCATGCAAGATGTCAGGGACTCTTCATATT 
TTATCTCTTTCTGAAATAAAATCATTTTGAAGTCTGGK3GGTACCTGGCAA 
GGAACTATGTCATATGGGTTGGCCACAAGAACCTGGCGTCTTTCrCGCAT ATGGGATCT ATT 
TTTAAAGACATATGGCCATAAATGTCCCAAGAAATACCGAGGT AGAGATGAAAG CAAACATG 
TACATATATATAAAACACACACACAAACATGTTCATGTACATACAGTATGCACTCCCTTCTCTG 
TG 1 1 1 1 1 I GTCTGAGTTGATGATTTGGAGCTCAAAGAGCT AGOGGAGGGAAAAGCTGAAGCC 
ATTCAAACACAT AATGAGAATTGGAGATGT AAAAGAAGGCTGAGTTCT AGGAGTTGCAACAA 
CTT AGGAGATAACAGAACCAATTCGGAATGAGCAGG^ATTGTAGGAATGCAG<^GAGGACT 
AGAAGAATCAGCTACATGCT GTTT ACTGGCAAAGCAGGAGAAATGTGACT GAGGACAGT ATG 
CCACT GAAAACTGATGAAAGAGGAGGGAGACAGGAGGACAGGGCTCTTGTGGGTAGCAGG 
AAGACAGA^TGGAAATAAACCATGfTTAACAAGATGGCTTTTGGCAC 
GCCAGCTGTCTCTCTCCTCCAATTCAAGTTACCACCACAAAACCCATTTGGA^ 
CTGCTGCATATOCGAGTGGATTAAGGTT ATTTATTCACATGT GGACCATT CATTT AATTTTTGT 
TTTCT GTGTTTCCTAAATGATTATCATTGGA6GTGCAGCACTGGTGT ACTT ATGATTTTTTAAA 
AAGTGCTTCAAAGTTTAAGCATTTATTTCAACCAATAGTGAGTT AACCTCTGTGTTAT AGAAAT 
TATCTGAG<iMAGGTCGTCCAGCAGGTTCrGTTTATCA 
AATGGCAAAGAAAAACTAAAAATAA^AAACTCCTGGTTTTGTCCCAAT 
TAGTCAAA CCCCC AACAGAAAAQATGTTAAAAAATCAAGTACTGTAQC^ 
CAATTTGCATTCTACAGCTAAGTGTAGAACTCAGAAGTAGCATTT^ 

GTGGCAGACTGACATTATCTAAATTAAGCCTGGGATTTT ATCAAGACAGACT ATCTTCATAAfl
>923

ACT GTTGTCT CATG CTCTCTTTCTGTT AATAG CACCTCAATTCTACTCTGGGGG ACAT 
TCCTCCTCTCn 1 1 T GGTCTGGAATGTCCCCTGGCTTCAGGGACAGCTCAACATGGGCCTGG 
ACAGTCAAATTCCATCCCCAAGCTTGGGACTCAGGGAGACCATCCAGTGACTGTTCCTGAAG 
TGCTGGGAAGGCAGAGCTCCCTTTCTGCGGGGTGCTGAGTGATGGGACGACAGTGTGGAG 
CTACTGNGCTCTCCAAGCCGGTGCCCAGGACCAGCCT GCCTG AGAACGAAGCCAN 
>924 

ACTTGCCTTGCAAAATTATATTACAAGAAGAAGCACACTTGTTATAGAAGTGCTGAAT 

tgtatggaacxjtaaatctgtcaagttaccctgtctttcag^ 

ACX^TCATTATATTATC^XXaAAAAGAACACGATCTCTTT AAGGCT AGGCAAGTATTGCGCTGAT 

gagcc^gggact gcccac caattggcaggcxx^ttgggtgataaatgtccaaggacctcta 
ggctgacgacacai ) i i jcat(^ttaatccagcctattgtaaccagggccactcacattgatt 
cggactagggggc^tcatctgctgttaagagggtgatgactcgctaaaaatgagggcctga 
aactaatcaaat atatttag agcctt ccctggcaacttg ctgg g agag cag cagtag ac agc 
taataggggagccccagacaggtagcgcggagctcaccatgcn 

>925 

ACATATAAAAATAACrrTTAAAATATAGTAACACTTTACAAAATATGTATCTAATTAAAAATACAT 
TAACATAGC^TCC^TCAAACTATACAAATATAGAATATATATTCATGAAATTCTTTAGAAATATA 
ACATCT ATTCTTT GAATAAAGCTT AAAATTTGTTTATAATTTTCAAACT AAGAAAAGAAGTAGT 
GAAT AATAG CTCCATCCAATTTATAATTGTCTTAAAGAGAATGATTATGT ATCATTTCTTG CTT 
GTCTTrTCTAATACCCAGTCAATCACCTGTACAGCATTGTTGTTTGCT GTTTTCTTCATTTCTT 
CAAATAGACCCCTTGAAGTTTTAAGATCXTTTTAGATAGAACrrTAGAGATTTCAAAGAG . 
GGCTGCATG CAGTGAAACATTCAT G AGTCTCGGT AAT ACTGTGTTTCTT CAG GGTTTCAGT A 
ACTACTTCTTTCAG TATCCGT GTGTGTTTCTGTGATCTTGATTGCTTAATTrG 
TTGATTCATGCATACI II I I I GACCTGAACTGGTCTTTGTAGCAAGTTTTGGAGTTTCACAGC 
T AGGAATTG GAACCAATGGACAAGTCACTGGTAAGGACCGCTCTGTTTGTGG CTTTCTGGAT 
GTCACATGATGATG ATAGTGA TTACTATTACCAGTTGTCTGAGAAAGAAGAGACTCTGAACTT 
TCAGATTTCACAAGTCTTTTTCTGGG ATAAAG GCAATTCAG ATCTTTAGATCTCCGTTTCAATC 
G GGATACTT CAGTTCG AAGTTCATTTTGT AAAACTTCTCCGTCAGGGACACTTCCAGGATCT 
GACGAAACTACAGGAGATGGAAGAGGGCTCAACACAGTAGGTACTGGAAAACTTTCTCTGG 

tgcaggtagtttgagtttcataacgaataagacgagactgaagttcanaaaatcctccatctc 
tttccaaagcttttcggtcatccaag caatattcaattccatg at aatgacat actgaggctt 
tctcaaaagctaagacctgaagtttcccaggactaagttctggtgtcaaa^ 
cccccttttgaatcggtaaaggttgaagatctccaann NNN 

>926 

acccaaacagaagattgctaatagactgctaataatagaacttaataaatgaaataat 
ttatttcatttattgttgcttggaatacagaaagtgcitagtaaatattgaatgaatcaacaaa 
gtacctccc aata tagagaaatcacttctgaaaaggataaaaccaagttgatcct 
gaaggcatcttttggggctgttacagttatttccttt atttgaagaaggaatatgatatacct 
actttgttccaagtcactgcttataatgtgctaatggtacct 

>927 

acgcggggattgctgatggatcagtgagcctgtgttgatgccagtgagctgct 
gctcagatactgatal 1 1 ICI 1 1 ccaaacagcataagaagtgattgagccacaagtatactga 

AG GAAGGG CTCCCT CGAGTTGTGGTGTGAAGAGAT AAAT CACCAG TCACAG ACT ATGCACC 

cgactgctgctgttcagtccagggaaaatgaaagttggagtgctgtggctc ai I I C I MCI I 

CACCTTCACTGACGGCCACGGTGGCTTCCT GGGGAAAAATGATGGCATCAAAACAAAAAAA 
GAACTCATTGTGAATAAGAAAAAACATCTAGGCCCAGTCGAAGAATATCAGCTGCTGCTT^ 
GGTGACCTATAGAGATTCCAAGGAGAAAAGAGATTTGAGAAATTTTCTGAAGCTC^ 
CTCCATTATTATGGTCACATGGGCT AATTAGAATTATCAGAGCAAAGGCTACCACAGACTGC 
AACAGCCTGAATGGAGTCCTGCAGTGTAOCTGTGAAGACAGCT acacctggtttcctccctc 
. ATGCCTTGATCCCGAGAACTGCTACCTTCACACGGCT GGAGCAirrcCCAAGCTGTGAATGTC 
ATCTCAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGA^ 

^^^^^TGAAAGGTTTACAAATGACCTTTTGAATTCATCTTCTGCT AT AT ACT CCAAAT ATG 
CAAATG GAATTGAAATTCAACTT AAAAAAGCAT ATGAAAGAATTCAAG^TrTTT GAGTCGG T CA 
GGTCACCCAATTTCGCAATGCTGTOCTT o
GTGCTATAAI I ICI II I ATTGAGAAATGGAAGCATOGTTGCTGGGTATGAAGTTGTTGGCTCC
agcagtgcatctgaactg CTGTCAG CCATTGAACATGTTGCCGAGAAGGCT AAGACAGCCC 

ttcacaagctgtttccattagaagacggctctttcagagtgttcggaaaagcccagtgtaat 

GACATTGTCTTTGGATTTGGGTCCAAGGATGATGAATATACCCTGCCCTGCAGCAGTGGCTA 
CAGGGGAAACATCACAGCCAAGTGTGAGTCCTCTGGGTGGCAGGTCATCAGGGAGACTTGT 

GGCAGCTGTGTCATCCTTCGTGCAAAATCTTTCTGTCATCATTCGGCAAAACCOCATCACCA 

cagtgggaatctggcttc»gtggtgtcgattctgagcaatatttcatctctgtcactggcca 

gccatttcagggtgtccaattcaacaatggaggatgtcatcagtatagctgacaatatcctta 

attcagcctcagtaaccaactggacagtcttactgcgggaagaaaagtatgccagctcacg 

gttactagagacattagaaaacatcagcactctggtgcctccgac^gctcttcctctgaattt 

ttctosgaaattcattgactggaaagggattccagtgaacaaaagccaactcaaaaggggtt 

acagctatcagattaaaatgtgtctxxlaaaatacatctattcccatcagaggccgtgtgttaa 

ttgggtcagaccaattccagagatccctrccagaaactattatcagcatggcctcgttgact 

ctggggaacattctacccgtttccaaaaatggaaatgctcaggtcaatggacctgtgatatc 

CAGGGTTATTCAAAACT ATTCCATAAATGAAGTTTTCCTATTrnTTCCAAGATAGAGTCAAAC 
CTGAGCCAGCCTCATTGTGTGTTTTGGGAT7TCAGTCATTTGCAGTGGAACGATGCAGGCTG 
CCACCTAGTGAATGAAACTCAAG ACATCGTGACGTGCCAATGTACTCACTTGACCT CCTTCT 
CCATATTGATGTCACCTTTTGTCCCCTCTACAATCTTCCCC^TTGTAAAATGGATCAOCTATG 
TG GGACTGGGT ATCT CCATTGGAAG TCT CATTTT ATG CCTGATCATCGAGGCTTTGTTTTGGA 

agcagatt aaaaaaagccaaa cctct cacacacgtcgt atttg catggtgaacat agccctg 

tccctcttgattgtgatgtctggtctatgtggtgccagagtggacaccactmtgaaccttct 

gggagtctgcacagorgctgtgtttrrttacacactrcttct actctcttggtcttctggatg 

gtctgctgggtccctggggtacggatcatctcggttcatccatggcagcattgagaggggg 

gggttgcgggagggggcccctttttggttcctgtggtcgccctgcactcaagagggtggg 

tgctgccgggacccccgtttgccccggcggggtcggggcgccacgagggcggggagga 

ggcccgggcgctcctgcgtgggaggtggaacnnnnnn 



GGAGGATGOCAGTT CCAATGCTTTGTAAAGTCAAAAATAGCCACATTGCAAAACAAACAAAA 
AAAAACGAGAACGTTCCCGAGTGTGOCTCCAAAACATAAAGGAGAAAATCATACAGAAAAAC 

ctcatgtaagggttgg^acttgagcaaccagctatccaaatac^gaggggaatc^cgctta 
gctagggcatggcgtgagagaagcgcgttcctgxtttcagagcctacaagt^ 

acttaagcaat aaatctgagcaattatx^aggttattttattgcatttct aatgagttct 
tct aaaaaaagtcaatcaattatcactgct atatatgttctgtgtgt aaggagtgcttgagag 

TCTTTAATTGTAACATTTATTAAATAAGAATAAGAGGACATTmAAAGGAATTAAAGG^ 
TAATTCCTTCATAAATGTATAGTGXnTAAGCTCTGCTTTAAAAGGTCTTTC 
GTAACCACTTAGGGCTGAATTCATAGT ATAAATATGAATAAATGTTGGAATCACAAT 
>930 

NNAGGCATGCGCCACCACACCCAGCT AATTTTTGTATTTTTTAGTAGCGATGGGGTT 
TTACTGTGTTGCCCAGGCTGTTCTTGAACTCCTGACCTCATGATCC^ 

CAAAGTGCTGGGTGGGCGTGAGCCAACGGCACCCGGCCCCCA AI I I I I 1 1 I I I C CAAAAAA 
GT AACCAGAAAAGTCATCTCAAGACAGTGTTAGAGAAAGCGTTGTGTTTCCTCCTGTGVk 
TTAAGTGGGGGCTTCATGGGAAAGGGAGGATGACTCACTT ACTCTGAAATCTGGGCCCAGG 
AAGGACCTCTCGCATCCTTGGAGCGTGCTCATTCTCCTGTCTCTC 
AOGAJGATGTCCTX>TTCTGGAAACCCCGAGCAGG 

C I III CTCCCCACACTTGCTCCGTTCT AACATTrTCTCCCTCATCTGACATGGAAGGGGCAAT 
GGTT AACOCaG G^ GGAGG^CAGAGAACAAGGGOOCCAjGATCCTGGCTCTGCCTCTGACA 
AGCT GTGTGACCTT GGGT A T CAGCTGACT CAT CGGAGAAG TGAG G AGGACGGACTT GGGCC 
GTGTCTGGATATTAAACTTTGTTGGGTCtXTGACCTCTTTGAGAAACT GATGCATGCTTCTCA 
AGAGGCCGCAAGTGCATTTTACACACTATTTTJAGG 
TCCACAGGCTTCATGTrGAAAACTCCTTAAGTGACAGT 
TTTAACCTTGACAGTTTGCAAAGKNN
>931
ACAGTATGTTTCCACTTATGGACAGATAATTAGGTAGTAAACATAGAAACACACCAAC
TGAAAGGACACACACCAGTATGAGAACTAAGTCACCCATGGGGAGGGACAGAAGGAAATAG 
GATGGAAAGGGGTTGAGGGACTTCAACTGTATTTGTGATGTTrrAGTTCTTTAAAACAAAAAT 

ctaaatgacatttgaaatatgaaacaaaogcagaaaacatcaaaatgtcaacaatacttaaac 
ct gagtgttgggtg cctgaatgtt atattggtctctg

>934

acccagtatatgagcaattgctcagcagtgtttggatatagggagtggatagctatt 
attaattgcagattattrrggaaggaaaaacacacagagaattatgtatctttcagtgtaaatg 

TTAGTTCT AAAAACAATCATATTATTTACAAAGCTGCAGTTATAGAACACAATTCTGATTTCTG 
CCTCACCC CCACG GTrAATACTGTAAAACATTrCCTACGTTTCATCTGATAGTGTTATTAAAAA 
TAGCTGTTATTnTAATAGCTATACT AAAACATAAAAATGTTTAGGCCAGGCGTG
>935

ACCTAATTCATAAGATAAGGATT AAATGAATTAAAAT ATAT AAATCCCTT AG ATAACAA 
TGCTAGGC^TATGTTAAGCACTATGTTAGTATCATCAAATGTTGTTGTTACTGTTATGGAATTT 
ATCACyUVATATGTAATTATATGTTTCGTAGTGATTATTCATCACCCCT ACT GGACTCTAAGGTC 
TGTGAGGATATGTCTATTTGGTTTACCACTGTATCCTCAACAACTGaTTC 

AGGTGTTAGGTATTAAGTGCATGATAGTGAATACATAAAGGTTTACTTTTTrAAAAAAATTCAG 
GAAAGCAGATAATCAAAAAGAAAGAAATTAATCACTTAATAAGTTTCATCTCCCAGGGATAAG 
AAAACATAGGTAAAGAGAGATTAAACTACTCCTTCAAGTTCAG 

AAAGTGTTGT GTTTC Xl 1 1 1 I AAGTCTAGTTTTGCTTTTGTGTTTATATGTCATAATTAATTGTG 
TTAAAACATAATTTTAGAAACCG^TCTTTCTATATCXXII CI 1 1 1 CTATACCCCCCAATTTTACTT 
CAui i i^t I AAACAACAATAAAAGTCTCCTGTAACATAAGAAAGC ' I 1 1 IC I I CCTAATTATCTT 
CTTTAGGT 
>936 

ACT ACAGATTAAGTATTAATATGCTGTGAGTGCAGATAGAGAACAGAAACAGGCTGT 
TTG ATTT GACCATGGTCAATGCT CTGATGTG CCAAACACAGGAG G TTGTGG GAACAT AT AGA 
GAGTGACXAAACTTTTAATGAATACAGGAAGATTTTCTGGAAAAGATGA 
GCTGACAGACXSAGTTTACCAGGTTCAGAACTTAAGTGATAATAATCrrrTTT ATCATAAAATTTT 
AAGTGTGGTAGAGAATAAAAGTTTTGAATTAAATGTTGAATGAAATGTGTTATG 
>937

GTCGACXX^CGCGTCCGAAG AGGTCCTATGACA G^TCJWSGTAAGTATGTAGATA . 
AAGATGCGTACATTTCTTCTT CTTCCTC J II I M I 1 1 J 1 1 1 1 AAGGTGTTACTTGGCTGGATCA 
ATTCXAGGATCTAATTTAGTTAAGAGACTTTAAAAAGGGATTATATATTGGAGAAAAAG 
AAATTAAAAGTGTA 1 1 1 1 CAGTCTT AATATCTCACATAAATGACCTT AGAATTGGCTATGTTAG 
TAGTTA GTTT ATGTGGTACATGTTAAACACCAGTAGAGAAACAACTATGGTTGTGATTAAA 
ACTTGACTTTCCTGCCAGAGCTAGAATCTTAACTCXrTTTAAAAG^ . 
TGTTTGT ATGT AAAAAT AAAAGGT AAGTTAATTCTAGATTGAGGGGCAGAGGCTATTTCTTAA 
TCTCCAATCTCCTTGGGAAGGGAAAGTATTAGGAGGCAGT AATGGAGT AGAAAGGTGGGGA 
TGGCAAATAAGAGAAAGATTTAATGTAACAAAACTGTTTTGTGCC1 C I I C I I AAGTAAATAATT 
ATTGGAATAATTAGTGTAA(^TCACATAGTAATGTGTATTTTGTCTTGACTAAGTT G I G 1 AAAQ 
' LLiHi }JJ}^ C ^^^*^^]^^CA(^TnTGGT ATTTATTT 
ACTT AGAGCATATGTT ATGAAGCT GGTTTGAAAA I I CUI N I AGATATATCTGCAAGTTTACTA 
CTTTGACT GTAAAAAAAAAAATGAAAAAGTAGTTGACATCTGTCCTCAGAAG AAGTTTGCAGG 
TTGCATATTTGTGTGTAAATACACAGGCT AAAAGGTAATTTATGTTCC 1 1 G GGAATTGAAATG 
GTCAGTGGCCCGJ I ACAGAAACTT ATCAGTCATAT ATCAGCAOCAGTTC A I rCTTl IGCACCT 
TAGG GAXXATCTGTCCCCTGAGGTGACCTGAGAAACAACCAGTTGCCCA 
CTTCAAG TGAGCCAG^TTTGATTTCACTGCCTTATATTCT 
mm i i GGAAAAACTTAAATTTTAAACATATTTGAAAAATGTTATAAGJ«rrTG 
GTTGATAGOCAAAGTCAGTTT ACCAAAGTAAAACAAATAAATTCT ATG CTTCTTCATTGTCAAA 
GAGCAGTCTG CCATCATGTGGATAT AAATGGACT ATGT AAAGTGACATGGT GCTTACTCTCT 
ACCT AATAATAGCCTCCCTCCT GTTCCAAACAAGJkTAACCAACAGGTATATTTAATTTAOCAG 
TT AATAT GTTTrGGATAATTGGCTGCCTTGAAATGCT ATATGTTTTATAGTACATCATAGCTTT 
AGTTTTC1 1 GATAAGGAAATT ACAGTT ACATCCT AATCGATTATT AAACT ATCACTGTGTCT AA 
GAATGGTGGAAGAAGATAGGGAATAGGTAGGGAAGTCATTATAAATATATTTTCACrrGGCXIA 
GGCGTCGTGACCCATGCCTGTAATCCCAGCACTTTGGGAG GCCG AGGCGGG CGGATCACG 
AGGTCAG^aAGATGGAGACCATCCTGGCCAACACAGTGAAACCCCGTCTCTACTAAAAATACA

AAAAGTTGGCCGGACGTGGTGGCAGGCGCCTGTAGTCGT AGTCCGAACT ACTCAGGAGGC 

TGAGGCAGGAGAATGTTGTGAACCCGGGAGGCGGAGCATGCAGTGAGCCGAGATCGTGCC 

ACTGC ACTCC A GOCTG GGCGACAGAGCAAGACTC^TCTCAAAAAA^ 

AATGTATTTTCATTTGTGACTCCATCTCAAAAAAAAATATATTCTTT AAAAAAGAGAGAGAGAG 

ACCXGGAGTAGAGATTCTGTCAAAGAACNNNNNNNNNN 

>938

NNNNNNNNNNNNCGNGGGTAOT 
TCAGAATAGCAAAAAGAACT AGAAAACTTGAACAACACTAAATACCAAGTATACTTCACCAGA 

tatct atagaacattccactcagcaacagcagaatccagcaga^tatatattcttctgaagtg 

tatgtggaacattctccgggatagaccatatgttaagtc^taaaacgagt^ 

aaaggactgatatcataccaagtatgctctctgaccagjvatggaatgaaattagaaatca^ 

acagaaoaaaatttgggaaattc*caaatatgtagaaattaaaaaacac^ 

cagtgggtcagaaaagaaatcacaagggaaactagaaaatactttgagctg 

AAATGTAATATACCAAAACTTATTGAATGCAGCTAAAGCAGTGCTTAGATGGAAATTGATAGC 

tggcaaatgtct gcattcagaagoaagaatgatctcaaatc>^taacattgccoctacctt a 

AGACTCTAGAAAAAGATCAAACTAAGTCCAAAACTTAG 

>939 

^WN^K^^CTTCTTCCATACTCT^TTAATTGGATATGCCAGTGT GTCTCAGTAATTTCCA 
GTGGCTGT AAAACTTTGAGAAATTTTGTAGCTTTTAGAAACCACATACCT GTATTGCCTGATT 

gcttattaagtgatct ctt agaggtttcc^aagttatgagtttgagtttacaagtgcagttttt 
ttccatgaaaatttcagtggtgacaaattatagaatttatcattcaattcagtcttaact 
ataattgcatataataaaacaggttcttgac 1u1iciiiiiui cagtotttaagaatagagaca 
aaataaagtt agatttgagtgcctcagaagatattagaaaatagagataaggtttatggcctt 
t aaaaaatt aaagacagt attgggggaaagaaatgaaaattgggaccgggcgtggtg gctc 
actcctgtaatcccagcagtctgggaggccaaggtgggtgaatcacctgn 

>940 

ACT GCCACTTCCATTTTGTAAGTGAAGCCCAGAGAAGCAAAGA^ 
CACATAGCTAGT(»OTGGCAGAGCTGTGATTGGCAGGTrGGTCGAATG<Xr^ 
CGACC TTCCC ACT AT ACTTCACG CATCT CT AGAGAAGAGACAGAAG T AGCCAGGATGAAGGT 
CTT CAGGTTTAAGAAGAACTATGAAAAAGCAAAAGA I nTTGTn rCGTGGTI 1 1 1 1 1 ACTATA 
AAGGAAAACTTTAAATAATAGCAAGAGTGCTATAGGTAAGATATCAGAA 
>941

ACCTCGTGGTTGAACTTATTTGGGGACAGAATTGAGA^ . 
GGAAGTATCAAAACCCTTGATGTGGTTAAGAGCATGGATAGTGAAACT AACCTCT GATGTAT 
GGTQAGAGAGCAAAAOAGAAAGGATTGCAAAGAAACTGGAATGTAGAGGATGAACATATTG 
GT AAT AAT AAT ACTGGTGGAATTGTT ATTCA GG AAAAAAT AGCAATT ATTCCT GTTCATATCTC 
AAATCATTGTATGI IGI I IATTTAAAGGGAGACATGGTAGAAGATATCAAATATAAAAATGTTA 
TCCTTCTGGTTTTCAAATCAAAACCAGGAATAA 
GCCAATTTTCCATAACATT 

ACATG^ AAATGGCTGTTTTTCCCCACATT AGT CAGCTCTGGATTTTGCATGTGTGGG 
Go 1 1 1 1 1 1 1 1 iGATAGTTAI I IGI 1 1 1 1 1 ATTTTAAAAATTTATTTTGCCAACCCAGT AGAGAAC 
AGCTGAGCA TCI I CT CATGTATTTATTGGCCATCTGCA 1 1 rCTGCTGCl I A TTGGCCATGT AT 
^ T T! GG *?^T TTC CC GTCT GCTGTGW 

AACACTGAAfiCACATTTTTAAAGACTTCTGATCATTTITATTGTC 
>943

. *^"TC****»**^TACATTCTGCT AGTTT GGGGTGGTGTGTTCT ATAAATGTCAATTT AA 
TCCAGTCGGCTTA TGAI I T I CAGTTCT AT ATTCTT ACT QATT AATGTGT ATAT ACT AGTTCT GT 
T^CTAAGG^CGGATGTT AAATTA ATCCCTAGCTGTAATTGTGCATTA b 1 1 I G I C 'I C I I II C AG 
CTGTTCTAGCTCCATAAATjrTTG 

CTTGGTGAACTAGACCT 1 1 J ATCATTAGGAAACTGTCCATATAACCA 



M ' *OTGCAAGTtt>AAGAGGACCAGG* 

GGACACCAAAGTT ATCAGTCAAGT AAGGCAGAAATGCTTGAATGAAT AAATGT AT A T AGAT AG 
AAAGTAGAGACCTTGATAAAGTCAAACTCCTTGCCTTTACAAGTGTGTGTTCAGCAGCCATG

CAAGGGAGATGCCCATCTGGCAGTGGCCCAGGGCAAGGTGTCAGAGCCCTAGTGGCAGGG 

AGATGGC^TCCACATATGAGGGAGGGTGACJVTGGTGCTAACTGGGCATCTACATAGGGCAG 

GGGGACAGTGGTGATGTGAAATTGATTACATCAGGGTNGACGGAGTACCTGCCCGGGCGG 

CCGGCCACCGCGGTGGA 

>946 

nnnnnccccccctcggaagtcttctagwattaattaacgcgggatcctgaagttga 
actggtgcagcagt agtatcgttatg cttgttag ccttcataatcctt cct aagcaataaaat 
acccatgtggtcxiaatcccaggacacgttgaatt acaagct ccaagtttttccgctgcagcg 
tatcc aagt gtgctt ggaaagaagaacaaaattaaac^tgctatttagagctttcagggct aa 
ctag^attttgatgttgtcattgtagcaaatagttct AGAGTGTGG AAGAAGTTGAAAATGTTT 
TTATGATACAGAGATTTTTATTGTACTGCATATTTAATGAATTATTTTATAAATTGCTGTTGTGA 
AGC ATTTG TGAATGAC CTGCX rrCCTAGCmCAATGCTATTGCCC^ 
ACTGTTTTATGATACAGTTTTGCATTGTATGTGTTrACTTTTTAAAGAAG 

rrrcTTTi I ctggttatgaaaataatatatgcttatggggaaaaattggaaaatagaaacaagt 

ATCT AGAAGAAAAATCACTCAT AATTCCAGCACCCT G TT AAT ACTTTGTC 1 1 1 I C I I ACAGTTT 

ctaatatgtgcatgcatagtatatcaatgtggttttacaaagagtgtgcaaattatgattctct 

TTTTTACJVTCATTCATGCCATTCTGC^TTTTC^CTTAATACTATACT 

CCCTT^ AGTA T TCTCCT ACATAGCATTTAAAGGTGAAATCTACCAQCTCCT ATTTTTAATATTT 
ATG rTGTTTI GACTTTT CAGTATAATAAATCATGTTTAT ATGTAAAGGTTTTTATCTCCGGTT AT 
TACTGTAGAATAGATTTCTGGGAAGTATAAGAACAGGAGACATAAATATTTTTAGGTCATTGA 
TACATAATTTGAAAATGACTOCTAGAAAGATrTTAACAATTTGTGTTCT ACCAACGGTGTTTG A 
GGGTGTCTTTTTCT CATTGT CT CACCAGTAAATGACAATTGTAATTTGTTTATTTGCAAGGCAA 
AAAAAAAAAAAAATTGCAATTTGATATTTAAAGAGATTAAAC > 1 1 I ICIC AGATTTTTATTTGGT 
AATTTGTAI MCI 1 1 1 1 AGTGAAGCTrCTTGCTTTT ACCCTGGCTACTACCTATGATTGTGTTAT 
GTCCTGGAGGAGAGGGAACTTGGCT GAGGGGGACNNNNN 
>947 

ACCAGT AGATGAGAACT ACTT ATTTAGAGTGGCAGAGCATGCTATAGAAACAAAAT A 
TGAGTAATTCTAACTGTAGTTATGTTATATTAGCATAGTGAGATAGTAACATTAATAGAATTCC 
TTAGTGGAATTTCTT AATGCTTCAGTTCAATCTAAATTAGTATTAATACTTT AAGGCAGGAAAT 
CT GTCCGAAAGCATTTGTAAATTTAAAAAGCATTGAAATGAGAAGOU5AAACAAAAAATATTC 
ATTTCTATGTATTGCTCTATCTATATTATATAACTGATTTACTACCATTAAATTATAAAATATTAC 
ATGTTCAGCGTATTGTCCTTCTGCAGTTACTGATTTATAACTTT AAT AGT AACAGATG T AG CTT 
T ATT ACT AG 
>948

KNWNNCOCCCCCTCGGAAGTCTTCTAGN^ 
GGTGCAGCAGT AGTATCGTT ATGCTTGTTAGCCTTCAT AATCCTT CCT AAGCAATAAAAT 

aoccatgtggtccaatcccaggacaogttgaattacaagctccaa g i nn ccgctgcagcg 

TATCC AAGT GTGCTT GGAAAGAAGAACAAAATT AAACA7 GCTATTTAGAGCTTTCAGGGCTAA 
CT AGATTTTGATGTTGT CATTGT AGCAAATAGTT CT AGAGTGTGGAAGAAGTTGAAAATGTTT 
TTATGATACAGAGA 1 1 1 1 IATTGTACTGCATATTTAATG>ATTATTTTATAAATTGCTGTTGTGA 
AGCATTTGTGAATGACCTGCCTCXrrAGCTTTCAATGCT ATTGCCCAGGCTGACTTTTATTGCA 
ACTGTTTTATGATACAGTTrrGCATTGTATGTGTTT ACTTTTT AAAGAAGCATTTCCTGGGAGG 
TTTCTTTTT CTGGTT ATGAAAAT AATATATGCTT ATGX3GGAAAAATTGGAAAAT AGAAACAAGT 
ATCT AGAAGAAAAATCACTCAT AATTCCAGCACCC T G 7 I AAT AC I FTGTCTTTTCI I ACAGTTT 
CTAATATGTGCATGCAT AGT ATATCAATGTGGTTTT ACAAAGAGT GTGCAAATTATGATTCTCT 
TnTTACATCATTCATGCCATTCTGCATTTTCCACTrAATACTATACT ATTGGTACTTT ACCAAT 
C C ? TT AAGTATTCT CCT ACATAGCATTTAAAGGTGAAATCTACCACCTCCT ATTTTTAATATTT 
ATG 1 1 G 1 1 1 1 GACTTTT CAGTATAATAAATCATGTTTATATGTAAAGGTTTTTA f CTC OGG 1 1 AT 
TACTGT AGAATAGATTTCTGGGAAGTATAAGAACAGGAGACATAAATATTTTTAGGTCATTGA 
TACATA ATTrGAA AATGACTCCTAGAAAGATTTTAACAATTTGTGTTCTACCA 

GGo« un, CTCATTGTCTCACCAGTAAATGACAATTGTAA U I G 1 1 1 ATTTGCAAGGCAA 

AAAAAAAAAAAAATTGCAATTTGATATTTAAAGAGATTAAAC I I 1 1 1 C I CAGATTTTTATTTGGT 
^JTTGTAI nrCTTTI I AGTGAAGCICI TGCI 1 1 1 ACCCTGGCTACTACCTATGATTGTGTTAT 
ACCAAGAACTAAATTGTGATACGATAGGTGACTTATGAGTAGCACAGAATGTAATAG
GCCCATCTCTACXTrAGTTCrrGGTCACCACACTTCTGTCAAGGTAGCTCGGAGAGACGGTGTC 
TACTTATTCACCACATCATGA^ATGACCTCAAACTGAGCAGGCAGOCA^ 
CTTTCTTT ACATTAACTTTCTGAAAGTCA HUNCH ATTCCACTTTGTGCC l l l l ll l AAAAG 
CTGCAGCTTCATGGAATTTAATCCTGGTATTTAAAACACTN 
>950 

ACTTGGTAGGTTGATCTCTTTCATTCTCATGGTTTAATTACCATCTATTCACTGATTAC 
TCCCAAAACTGTATCTATAGTCCAAGACTGTTTCTAAAAGGTCTGCACCCACATATGCAAATA 
AATACCAGATATCTCTCTTGGTTATATTGCACATA 
>951 

ACTCTTAGGAAAGAGTAATGGGGTTGAGGATGGTTAATTTAGCCCATCCTAACTTCT 
GTGAGAI 1 1 1 1 1 1 CAGAATATTTTGGATGGTTCTCTCACTTTTGTTATTAAGCATTTGGGAAGA 
AGATTCTGCAGCCTACTCAGGTGAGCCAATCTCATGGCATTGAAGAGAGAAGATATGTTTTC 
ACGTCTCTAACCAGTGTTTTTGATAGTGTAAGTGAGGCCTTTCTCCTTTGATCTAAGTGGAAC 
CAAGAGGTTAGATACTCCC I I I I CM I AGTTATATAATGGGCTTCATGTAAC 
>952 

ACACTCTGTAGGTCTACAGGTAAAAAGCTATTACGTTGCAAACATTATAACGTAATGT 
AAGGTCTGGATTACATGCCTAAAAATCCAATGATTCTTGGAACCATCAAATCTGTTAAGACTG 
AAAAGAATACCAATGTTTAAATATATCTATAAAATGCAGGTCAAGGGGCTAAGAAAATTGCAA 
CACTAGAAAACCAACAAACTT AGGTTGTTCTAACATACATAGACAAATACAGGAGGGAGGTTT 
ATGGGTCACATCTGCG AAACA TTTmCCCAAAAAGCTGAATTTTrAGGCTTGCGTGTAAGTA 
GATATAGAAGAGTGCACTTTTGGGGATCCTCAN 
>953 

ACCAGCAATAATTATGCCCACAATTrTATCCTAAATAAGAGTGATTCCCTGTTCCTTTT 
CCTACAGAACATGTTTCTGTCCGCAAAGAGAATAAGAAAACATGACGCCTCCATCCAGAACC 
AAACTAAACTCAGGAGTGATTAGAATCACCTGTGGGCATTTTCCCCCAAACCACCGATACTC 
TGTAGATTCTG ATAAG CGCTCTTAAAGAAGCTACAGGTCTTCtXXATTCCCTATCTGAAAGCA 
AGGAACCACTGCTTTGGTCAGGAAACAGGCATACAACATCAGATGTGANNNNNNN 
>954 

ACCAGATGTTGTAAAATTTACTATAATTAATAGGAATTAATTAATGAATGCCAAGGGG 
CAGAGCCACACTTOCTATGATAGTTCCTTGCTATAAGGTGCT ATTTANNGTTCTCTACATTTA 
CTCCATAGT AAG CTGTTGTTTGAGAAAAAAAATGCCAGTTTG GTG CGTAGT AGANN 
>955 

ACCTTT AAGCXVVGATTCATGGTATGAAGGCAGCAGCATAGCACCTCCATTGACCCAC 
ATGGGGGCCTGCCrrTGGGCTTCATGAGCCCTTTGGAGTCTCAGATCCCTCACCTGTTAAAG 
GAGAGTAATACTACCCACTTACCTTTTTGGGTTGTTGTGAAACACACATAAGACAGTATTAGG 
AGAAGTAAGGTCTGAGGGCTGGGCTTTGGACCCAGOGGCCCCT AGGTAG AGGCCTGTTGA 
AT TGGAT GACAGTGAACTTTGCAGCATTTCCT AACCTCAGAAGTTCAAGAGCAGGAGCCTG A 
GTGTTTTAGGTCCCTGGTATGGCTGTGGATTTCCAGGCATGCAGCAGCTCTGGGGC^^ 
CTTCCT ACGCGCCAGTGGTTCCAGCTCTTTGATTAACTGAA^ 
>956 

ACTTCTGCTTTATTCAGTCTAGGTAAGAAATGTAATGGATGTGTGCAGGTGACATAAT 
TTCAGGGGAT AAGGT AAAAATT AGATGAAGCCCAAGCAAATATTCTT AAAAAGAAAAACTT AG 
GATTTrTTTTTACAAAAGTTAACTTAAAATC^^ 

AATGTCGTGATGCCTATAAA 1 1 1 FTCl ACAAACAAGAGTATGN 
>957

AGTTCAGGAG ATACA TTCTGCTAGTTTGGGGTGGTGTGTTCTATAAATGTCAATTTAA 

TCCAGTCGGCTT ATGATTTTCAGTTCTATATTCTTACTGATTAATGTGTATATACTAGTTCTGT 
TACTAAGGAGGGATGTTAAATTAATCCCTAGCT GTAATTGTGCATTAGTTTGTCTCTTTTCAG 
C I G M C I AGCTCCATAAATTTTTGGAGCTGTTAGG 

CTOGTGAACTAGACCTTTTATCATTAGGAAACTGTCCATATAACCA . 

TAATGCTT ATAGGGTAAAAGAAAATTTTTGAAGCACT GAATTCAGT AACCTGG GTCATGGTCC 
AATTTTGCTCACTACTTCATATCJJ 1 I T ATGTAGATTA7TCCTATAAACATGTTCCCTAAATTCCA
CAT^TTTGTAAAGTCAATGGATTAAATTATTCAAATGTAGCTATrTAACGGTCACT 
GCCTAGAAACCTATTrATTCATCTGTAATATTAAAAAGCTGAATTTGATGATCTTGAAAAATCC 
TTTCCAGATTTACAACNNNNN 



ACTCCAGCCTGGGTGACAGAGTGAGAATATGTCTCAAAAAAAATTATCAGCAGAAGA 
TAAT AT AGACCCCAAGG CTAAAGG G AACCATT ATCATCTCT AGGCCTGAAAGCCT AGGAGAG 
GGTGCTGTATGGAGAGGACTGCTTCTGACAGAGGGATATAGCCAACCTTGGTGGCCTAATA 
GAGAGGAAAGTAGGGAATAGCTTCACCTTCCTTCTCTAATCTTCTGCTAGTATCCCTATTAAT 
TTAGCCTAATTAGAAGCTGGAAGGTAGGAGAGOCTCCATGGGCAAAAAGCTGTGTAGAGAA 
CATGGATCCTGAGGGGGTAAATGGCAGATAATCTAGCACAGATTGGTATGATTATCTATACT 
TTTCAGATGAGAACACTGAGAGTCAAAATTAAGTAGATTTGCCCAAGGCCATATAGCTGGTA 
GGAGCTATAAATAATTATCTCAAGAAGTCATTATTACGTGGATCATTCAAGAAATTTCTGGATT 

tagaaaatagccttaaaatatgaaacaaatattagcatttgttaatttgagatgttatgttcac 
agatattggtaagattatcatttttaatttatgtgttttaaaannnnnnnnnnn 

>961 

actccagcctgggtgacagagtgagaatatgtctcaaaaaaaattatcagcagaaga 
taat at agaccccaaggctaaagggaaccatt atcatctct aggcctgaaagcct aggagag 
g gtgctgtatggagaggactgcttctg acagagg gatat agccaacctt ggtgg cctaata 

GAGAGGAAAGTAGGGAATAGCTTCACCTTCGTTCTCTAATCTTCTGCTAGTATCCCTATTAAT 
TTAGCCT AATTAGAAGCTGGAAGGTAGGAGAGCCTCCATGGGCAAAAAGCTGTGTAGAGAA 

catggatcctgagggggtaaatggcagataatctagcacagattggtatgattatctatact 

TTTCAGATGAGAACACTGAGAGTCAAAATTAAGTAGATTT6CCCAAGGCCATATAGCTGGTA 
GGAGGTATAAATAATTATCTCAAGAAGTCATTATTACGTGGATCATTCAAGAAATITCTGGATT 
TAGAAAATAGCCTTAAAATATGAAACAAATATTAGCATTTGTTAATTTGAGATGTTATGTTCAC 
AGATATTGGTAAGATTATCATTTTTAATTTATGTGTTTTAAAANNNNNNNNNNN 

>962 

ACTTGAGAATATGATTGTAAATTTGAT(^GCAGCTACAACATTTCAATGATGCATATT 

I i T T 1 1 1 CAG ATGCATTCCTTTG ATTGAATTTAAAGTCAAGCTTGTGCTTCT GGATGGTTGCTT 

TGTC AGTG AACACTTGGATTTGGAAAATACAGCAC^GGGTTGGTTTTGAGAGAAAATGGTT 

TCAACTTT ATAATTACAGTTTTAACCACCACAACAACAAAATTAGGATGGTAGTGAAATGGAA 

CTAAATCAAATGCAAGGTTTTAGTTTAATAGAACAATGTCATCCTTTAATAATCTTTAAAGAAG 

AACAACTTAATAACCAATAACAAAATTGAAATAGGTCAAC 

>963 

A CTTGAGAATATGATTGTAAATTTGATCAGGAGCTACAACATTTCAATGATGCATATr 
1 1 1 1 1 1 1 CAG ATGCATTCXTTTTGATTGAATTT AAAGTCAAG CTT GTGCTT CT G GATGGTTG CTT 
TGTCAGTGAACACTTGGATTTGGAAAATACAGC^CCTGGGTTGGTTTTGAGAGAAAATGGTT 
TCAACTTT ATAATT ACAGTTTTAACCACCACAACAACAAAATTAG GATGGT AGTGAAATGGAA 
CT AAATCAAATGCAAGGTTTTAGTTT AATAGAACAATGTCATCCTTT AAT AATCTTTAAAGAAG 
AACAACTTAATAACCAATAACAAAATTGAAATAGGTCAAC 
>964 

CCGGGCAGGTACACT GCATAAAGCCAGAGTTAAAACTT CACTGCCAGCCTCTGAAC 
AGAAGGCTGTTCT ATCCACACTATCAC^AGACCTGGTGGAGTTGAGGCAACTGCTGAATTAC 
CATACAGGGAAGAATGAATTCAAGAAAATTCCCATG CAAGATAGGCT CTT AAAAAAT AAATTT 
ACACAAGAAAATCAGCACTGTAAAGGTAATTGATAAGCCCAATAGAAGGGAAACCTATACAA 

agaaatagaaataact aagcaatctgaaatggacttt aaataatgatgtttacaattctctaag 
aggaaaaggagcatta gcatca gtgaaacaaaagtagggct atagaaaaaacaatacttatg 
aaaaaaccaattgg aaa 1 1 1 1 1 agatggaaaagcgtgaaataaaaaattcaacacatggtcta 
aagaataaactgcacacagctggagggaaaattaattaattttacgaaaaaacaattaa 
acagaatggtaagagannnnnn 

>965

CCGGGCAGGTACACT GCATAAAGCCAGAGTTAAAACTTCACTGCCAGCCTCTGAAC 
AGAAGG CT GTTCT ATCCACACTATCACAAGACCTGGTGGAGTTGAGGCAACTGCTGAATTAC 
CAJACAGGGAAGAATGA^CAAGAAAATTO^^ 

ACACAAGAAAATCAGCACTGTAAAGGTAATTGATAAGCCCAATAGAAGGGAAACCTATAGAA 






AGAAATAGAAATAACT AAGCAATCTGAAATGGACTTT AAATAATG ATGTTTACAATTCTCT AAG 

AGGAAAAGGAGCATTAGCATCAGTGAAACAAAAGTAGGGCT ATAGAAAAAACAATACTTATG 

AAAAAACCAATTGGAAATTTTTAGATGGAAAAG(^TGAAATAAAAAATTCAACAGATGGTCTA 

AAGAATAAACrrGCACAGAGCTGGAGGGAAAATTAATTAATTTTACGAAAAAACAATTAATCTT 

ACAGAATGGTAAGAGANNNNNN 

>966 

ACGCGGGTCAAAAGGATGAAAATGTTTTCTGTCAGAATGAAATTCAAGAAAACTTAA 
AGGAAATAAAAACTATTTAGCACCCAGTGAGGTAAAAATCGCAATGTCTGGTGTCCAGTCAG 
TTA(XAGGCATGGAAAGAGACAGAAAAACATGAGCCATCATGAGGAGAACAATTAGCAGAAA 
CCAAACCAGAACTGACAT ACAT ACC AG AATTGGC ACACAAAA G GAT ATT AAAACAAT AACAAC 
TGCGTTCCATATGTT CAAAAAGTTAGAAACATGAAAGATACAAAAATAAAATCAAACTTCTAAA 
GATGAGAAACTGTAGTGTTTGAGGTGAAAAATATGCTAAATGGCATTA 
>967 

ACGCGGGTCAAAAGGATGAAAATGTTTTCTGTCAGAATGAAATTCAAGAAAACTTAA 
AGGAAATAAAAACTATTTAGCACCCAGTGAGGTAAAAATCGCAATGTCTGGTGTCCAGTCAG 
TTACCAGGCATGGAAAGAGACAGAAAAACATGAGCCATCATGAGGAGAACAATTAGCAGAAA 
CCAAACCAG AACTGACAT ACATACCAGAATTGGCACACAAAAG GATATTAAAACAAT AAC AAC 

^g^^ttccatatgttcaaaaagttagaaacatgaaagatacaaaaataaaatcaaacttctaaa 

gatgagaaactgtagtgtttgaggtgaaaaatatgctaaatggcatta 

>968

acgcgggcggtctgtgccccatcaccatttctaaagcaccct accctcatggcagt 
gtcxx>aaggaaggggtttccatggtaacctcaatggatagagtc^ 
cgcctgtgctggxgtgcgctagcctactcact^ 

tatttattagi rctci 1 1 aatggaaagtatataatccctt aatgtcagaccttg agtggcactc 
agcttt a ttaat ttatttaggtaataaatttaccttcctaattaattctcagtagtcx^gggag 
ct gtattattttaaacatcttgcacaatgtttatagttctgcgnn 

>969 

ACGCGGGCGGTCT GTGCCCCATCACCATTTCTAAAGCACCCT ACCCTCATGGCAGT 
GTCCCAAAGGAAGGGGTTTCCATGGTAACCTCAATGGATACAGTCAGCT G ACGTCTGGCAC 
CGCCTGTGCTGGTGTGCGCTAGCCTACTCACTCCCTCGGCCCTCCCT 
TATTTATTAGTTCTCTTTAATGGAAAGTATATAATCXXriT AATGTCAGACCTTG 
AGCTTT A TT AAT TTATTTAGGT AAT AAATTT ACCTT (XT AATTAATTCTCAGTAGTCCT GGGAG 
CT GTATTATTTTAAACATCTTGCACAATGTTTATAGTTCTGCGNN 
>970 
>971 

ACCAAGATTATGATAGCCTCTTAAAACAAATTGGAGGTTATAACCTTTT^ 
GCAACAGTGGATATAGGATTGGAGTTA > II H 1 1 C I 1 AAGTTTTTGGTAGAAACTAGCCAGTG 
AAGTCATGTGGGTTTGGA J II I CI 1 1 GTAGGAAGGTTCCTAATTACT AATTAGCTTTTCAAAAT 
AGTTATGAGAATATTCAGGTTTTCTATTTCTT CCTGTGTCAATTTTGTTGTC 1 1 1 1 t C l A TAAAT 
TTGTTCATCT ATAATTTTAATATTTTTGGTAT AA 1 1 i 1 1 1 1 CAAAATAATCTTGTATTTATTTACA 
AGACAGGATCTTAATGTTTAATGACAGGATCT ACAGTGT 
>972 

NNNNA AGGAA ATTf 11 T 1 CCCCCAAGGGGGGGGGGCCAATTGGAAATTTTGGGGGC 
AAAAAAT AACCCTTTT AAATGGT ITTTAGCCCT AGGGGAAAATTTTTAAAAAAAAAAGTTTTTA 
GGGGAAWTAGGGTCAAA ATTTG GGGATTTGGGGGTAAAAAAAAA I Mil l T ATTGACAQAT 
TGAGACCCTGGCTCAAAAAAI 1 1 1 1 1 I GATTATGAGNNNGANGAAGGAAAAGAAAAGAAAAG 
AAAAAACAAGAAATTAGCTCATGAATAGCCAGCCTT ATATTATAATTATGTGACACTTTGGATA 
TTTCAAAGCACATTCACAAAGGGT ATGTCACTTAAAT ACCT CAAAATTTCCCTGTTATACATGC 
AGATCATTCCCCATTCAGCCCTGGTATGGACTGAACTGT GTACCTGCCCGGGCGGGCCGCT 
CGAAAGGGCCGAATTCCAGC^CACTGGGCGGCCGTTACTAGNN 
>973 

NNNNA AGGAA AI HIM »CCCCCAAGGGGGGGGGGCCAATTGGAAATTTTGGGGGC 
AAAAAATAACtX f I • lAAATGGTTTTTAGCCCTAGGGGAAAATTTTTAAAAAAAAAAGTTTT^ 
GGGGAA^TAGGGTCAAAATTTGGGGATTTGG^ 

TGAGACSCTGGCTCAAAAAA M 1 1 1 t TGATTATGAGNNNGANGAAGGAAAAGAAAAGAAAAG 
AAAAAACAAGAAATTAGCTCATGAATAGCCAGCCTT ATATTATAATTATGTGACACTTTGGATA 
TTTCAAAGCACATTCACAAAGGGT ATGTCACTTAAATACCTCAAAATTTCCCTGTTATACATGC

AG ATCATTCCCCATTCAG CCCTGGT ATGGACTGAACTGTGT ACCT GCCCGG G CGGG CCGCT 

CGAAAGG GCCG AATTCCAGCACACTG GGCG G CCGTTACT AGNN 

>974 

>975 

>976 

ACCTCTCATTTGTC^CTTTTC AACACTTCCTGG CAG GCAGGCAGCATAACTGGTCCT 
GCTG GGTG ATCCAG ACCACACTCTG CAACTCTTTCTTCTGAGCCAG GCTCCCCT ACTGTCTT 
TTCATTTATGTCAAGGCAGGGGAAGACCTCAAAGGGCTCTT G CATCCCAGTCTCACTTCCCA 

gagaggcacgaggccctccaggatgtggggacaggaactttggggcaagccggggctgt 
ccagaagatcaccaggagggctaaatagtagaaaggagagtcttattggtgatatgtttgca 
aactgggaaa agat agcctccagtgtggagcaaagatgctccttcttcaaagagggcaagg 
gcagcttggattttgtgccttacagggtcggtattatataatagagtcatgcatattcagtag 
gttrgggggaaaagctatatatatttatgaggggagccaactacatgggcaatggataaaca 

TACATGTAACACATCCCATGTTCACTTAGGGGCAGGATTTTAGCATTAAAATGAGGTGGAATT 

tggctcttt acatc^aaaggtgagctatcagacac^uaggcagtttgtgcac^agctctcca 
aagggacttgagggctacagctgctcatctggaaagaatccttgtaagaccagtcctctgtc 
caaccagagtraggaggcatctgacaatttgcctgattagctgt 

>977 

ACTTT AAAAAG TAAAC AAATTT AACTGAAGCATGGCTATTAGTT AGTGATTCTTTGTA 
GATTTTCTGGAAAGTCTTGTTTGTTTGTATTAAACATTAACTCTGCTGTATGCTGTAAATAC^ 
TGCTAAGATCAATATTGAAAAACGAAC^TAATACCAATTCATATGGACCTTCAA^ 
ATAAAATTTTATGATATGGTATTATCCAGCCAACTGACTTTGAGACTGA 
TTAACCAGGTGATTCTTGCATTCTTTGGTTTAAAA(XTCAAGTTTAAAAATATCNNN 
>978 

ACGACTTCACAACACCAACCACAGGTCTCAAGGTCAAAAAATGAGCTAGGAGTAAAG 
TATCTGCTCCAG AATCT ACCCCCATCCCAGAAAG AG CAACCCAACTGTGTCCT GAGTGGCTC 
TTAGAGTTTAAGACTCTGAATGAATGOCTAAATTTAGAAAGGGTGTGG ACCAAG G GATTTTTG 
GTTAATGTTCT CT AAAGCAGGCTG ACT GCCAGGATTTCAAGTC^GTGATAAATTTTTAATTTTT 
ATT ATTTTTTTTCCCCCG CGTANN 
>979 

acctggcagca gagta ggcactaatatgtgttgaatgagtaggtgaaataaacaaaa 

ACCT AATGGCGATGGAATTTTATGGAAATAAGTAAACTTCATTATTGCTGAAAATACCGCAGA 

TAAATAGAGGGAGGCAGTGTAATAGAGTGGAAAGAGCAGTAGACCAGGAGTCAGACAGTCG 

AGGATCTCATTCrrAAATTTGAAGGTGAATAGCCATGTGGCTTT AGACAGGACTCTGAACCAC 

CTTGTTTTCTT ATCTGTAAAAGGGGGAAGTCATAATAGCTACTCCTGCCTAACT CATAGGTTG 

TTGAGAAAATGAAGTGN 

>980 

ACATTACCTTTT ATGTATGCTGGAATAAGAACTTGTGTCT ACATGCATGT AGAAACAA 
TGGAAGGATAGGCAAGAAAATGAAAAAAAAATGATAACCT ATGGGGAGTGATGGCCACTAGA 
TGACTGGGGACAGGGGCTGGTGAGTGAGCGCAATTATCT ATTTAAACAATCAGAAATGCTCC 
CTAAATTACAAGTTTCTAGTTAAATGCAGTAAGAAA1TCCCCACAAGCTCT GCAAAATAAGTT 
CT GTCAATCAAATCTT ACATGATGCATTAACTGAGCTATTTTAAAATACT ACCATGAATTCATC 
TTTAAAGTGTGACTTTGTAAAGCAGATAATCCTCXrTGTTmCATACA 

TGATTACACTACACAAAGAATACAATATTAAAGGACX^TTCAAAATTAAATTAA ) HIGH ICTA 

ATTAAAGGTTAAATCAATGTGTATTACCAACTCAAGAAGATAAAGACCGACNN 

>981 

CCTTTGTTTATCTTT GCAGTTTGGTGGTTTTCTGTAATGATAACATGTGGTTCCATTCT 
CTTTCTT A TTTGT GTGTCT GCTTT ACCAATGAGTTTTATACnTTTGGGTTTTCATGATGGTGAA 
TATAGCCCTTTTGCTT CCAGA TGTAGGACTCTCTTAAGCATTTATTGT AGGGTCAGTCTAGTG 
ATAATAAATTCCATCAGTTTTTGCTTATCrrAGGAAAGACTTT ATTTCTCCTGCATTTCTCAAGG 
AGATCTGTGCTGGGTACAGT ATTGTT GACTGGCT AACAGAGGACCAATT AAT AAG CCAAAGA 
AATGGCTCTTTAACAATGAACATTTCrrGCCATCAACTGACAGATCCCAG^ 
CAGTGAGGAGACTTCTCTGGTTTTCAGAACACCTCTG GCTGCCCCTGCCCACCCCAT AGAA 
GGGCTATOCCTCCAGGTCAGGTTAGCATCATCACCT AGAGCCAACAAGTCAAGGAGGTGAT 
GGTTTGCCTTTGACATCTCTACCCAGACCAGACTCCACT 
CTGCCCTACCT agttagttggtcctgccctggggccagagtttcact aggggctgaatagt

ATACT GTTAGCTCAGGCAACAGA TGAA CTCTGCCT CCATGGGCAAAT ACAAAAAGGTATCTC 
TGCI ICI 1 1 1 1 CACTCACTT AAGATTTTGGAAGAATGGCTCCTAAAGCGGGTGATCATCTGCC 
CTCACGAOCTACAGTGCCTCAGCATCACATGCTATTCATTTTGCAGAGCAAAACCAGGAAGT 
GAATATGACTGTTATCCCTACTTCACAAGTAGAGAACTCTGAGGCCCTGAGAGGTTAAGAGG 
TGCAGGTAAGATTTGAACCTACGGGCT GTGTGCGGTGGCTT ATGCCT GTAATCCCTGCACTC 
TGGGATTACAGGCGTGGGCCACCACACCCGGCCTACTGCCTACCATTTTGCCCAAGCTTCC 

cag t act agaagaaccc 

>982

aotagatcag^tggattgaaacatgacagccccatttcatctggccggttaaggtc 

CTCATGGAATGAAAAACACTTTCGGGCACTCTCCTATGAGAGAGAGAATGGGI } ten 1 AATT 

gccagattgtctgaacacagcctcagctacttctaggaataagacgaagcagtgaggaagtt 
gccagttgagtgattcttggggaaaaaaattagcattcagtgccagctctctaaagtgtgga 

TTCTGGATTCTGGTAGAAGCCAGTAAAGAAACGTTTTCTCTGGAGTGGAAGCTAGTAAGATT 

TATTCTGTGGTGATGAAGCCATCTGVU^ACCTTACAAGCAGTGTGGTTGTATCAGCATATGGG 

AGCTGAGTGCCTCAGGACTTTGGAAGCCTGCTTCTCTGTGCCTCAGCCGGAACTCAGGTTA 

CTCAGTAGTCATTTGCTAATTTCTGAGVVAOGCAACACTCCTGAAGGGATAGAA^ 

AATACCCAAACTTTTTAGACTAGTAC TGTGT GTCAGGTATTGTAACATTCATTCAGTCCCTGA 

AATGT ACAAACT AGTTATTG CTCCCCTTTTTCATTTGAGG AT ACTGAT GCCCAG AT AAGAT AA 



ACAGTGACATTTCAAGACATGGCCCAATGCACAAGCAACTTCCCAAAGCT GTAATTC 
ACGAG ATTCCTCAG GGTCCTCT AAG CTCCTTGAGG GCAGAAACTTATCTTT GT ATT ACAGCT 
AGCCTTCAATCAGTAGGTGTTGAGCTGA I I I I C I 1 1 1 I CCTTTTT AAACT CAGAAGTTAAGTTC 
CAGCTTCAGTGGCTATGCCCAGATGGTCTGATTCTGAAGGACAAGAGAATTCAGTGGCATAA 
GCCCTGTGCTTGGCANN 
>984

GGTTCTAGCTITTTAAAAAAGCAAATCCCTAGAATCTrAAAACAATTGATATCT AGTTT 
GCTAGTATGTGATATT CTAG TC^GAATTCCTCTTGGGAGTGAGTTGGAAGAGGGTTATGGTG 
GTCGCTAGCATTCTCATTTTCAAGACCACTCTGTTCC I F ICI f AGACCATGTTTCTCATCTTAA 
GCAGTCATGCCCGAGCTGGAGACAGGAAACCAGACAGCA GTCT GCCCT GAT AAGG GACAG 
CATTGTCTGAAGGCGGGGAATAGCCCTGCTTTTACCCXXiAACT CAGCTCTCTTTGCTTGAAA 
TTrAGTAGTCATAGAAGAGGCACAATGCTGTTGACAGTTTCTGAACGAAAACTAGGATTG 
TCTTCCAAAGCAAAAAAATAGTGTCCTCGGCCGGGCGCGGTGGCTCACACCTGTAATTTCAG 
CACTTTGGGAGGCCGAAGTGGGTGGATCACCTGAGGTGAGGAGTTCGAGACCAG<Xrr AGC 
CAAGATGGGAAAACCCCATCTTTACTAAAAGT 
>985
>986

ACATGGAATACATAATnTGAAATGGAGTCAGGGCTTTCCT AATGATCCATTTTGT AA 
TTCACCTAACAGCTGAGGAAAGGTCCAGAGAAGGAAGAACTCAAGGTT 
ATATTGAGTTGCACTGGCT GCCTT CTCTTTTTGGT CCCCTAAAGAGTATTT ATCATCTTAGATT 
CAGCTTAAGTTGTGQACAAATATCAAGGGGAAAAGTATTT ACAGTT AACGTT GGAATCACAC 
GGTTITCGNGGGTTGTGCCTCTTTACCCTTCAACTTTGGT GGTTCTAAAGAGGGACGATTAT 
TAGTTGCTTTCACTAAGGAGGGGAAGTTCATGATGGAGCAACN 



ACCTGGCCTAGAAAAT A I IIT1TITIIT GAAATGGAGTCTCACTGTGTCGCCCAGGCT 
GGAGTGCAATTGGCGGAAATCTTCGTCTCAAAAAAAAAAAAAACAAAACAA 
CTCAAATATCAUI I ICIUI IAAAIUI ICI I AATTCCTTCAATCATCCCCCTCTTCTAACTCTCA 
CAGCAC1 1 1 CI I CCACTACGGCACGCATCACAOGCCAACTACTCACCAGTTCACGTTTTCCG 
CCCTCTCTGOGACTTGCCGAATCACAGAGTTCXTAAAGAACCAGGACT ATGTTCTACTAGTCT 
TTGT AGCCACTGCACTCGGAATGGT GTCT AGTACCT GCCCGGGCGGN 

ACTCCTGTTTC TACAAA TTTATCTTATAATAATTTGTCAAATGTTG^ 
ATTCATTGCAGCATTTGTTTTTCATATCAAAAGATGGGAAACATTGTGCAAAC 
AGTAGTGGATTGATTAAATAAATTAGGTATATCCAATAATTGAATATTATGCAAGTATATAAAA 
AATAAGAATCATGAATATGGAAAGATITCGAAAATATATTGCTAAGATTAAAAAAAAGGAAGG 
GGCAGAAGAAAATAAGTTGGGTAAAAAAAACCCCAGAAATGTTTACTAATAATTATATTTAAA

AACTCATAGGATAAACAAGAAGGTAATGAAATAATTAAT 

>989 

ACATCTCTGGAAGGTGATCCTTCTTATACAGATACCAAAAAGCACTTAGTGAT^ 
TTTTGTCCAGGACAGGTTTCT GAACATTTCTGCTTGT AATCAAAATGGCCTGGTAAATGTTT A 
TAAACTTATTTTAGCTAATTGATGAAACTACTGTAGI I I I CI 1 1 1 AAGTTTATATCAGGATACCA 
AAGCAACATATTT AGCCCCCACGTCTGTTGAGTCTTTAAAGGACCT ACTAAGATTATTAGTAT 
AGATTCTCAAATATTArrrmCAGAAGGATGTATTCCTCTrrrrrGCTGTG 
GTAATTAGAGTTTATGGAGAAAAAGGACTAAGGGGAAGGGTGGAGGTCACAAATTTATAAAA 
GCCC TGAAC TATTTCTCATTT AACACGT ATTCTCTG ATC A 1 1 1 1 1 1 CTAGTGTGTT A 1 1 1 1 I G TT 
TGTTGTTTTGAACATCTG AAAT AAACAT AGGCTGGGCAGG GTGGCT CACACCT AT AACCCT A 
GGAATTTGGGAGGCTAACACX»GGAGGATTGCTTGAGTCCC6GGAATTCGAGAOCAGCX3CTG 
GCAACAAAGTGTGAGACCCTGTCTCTACCAAAAAAAAAAAAAAAAAAAAAAAAAAGT 
>990 
>991 

CGGTACCCTAAAAC1TAAAGTATAATAATAATAAAATTAAAAAAACCAAAAACAAAGAT 
TAACAGAAAACAAAACAACAAAAAAACTCCCAG(^TATACATTGAGTCATTTGCAGGTTTGGG 
AGGGGGGGAAATGCTTTTTTGTATTAGGAGAA^GGAAGTTTTC^TTTTAAATGTCTATATTA 
CTTAAAGTTTGCAATAAATATTTATTACTTTCAACAGTGAAAAAAAITACAAGAGAGAATATGA 
AATAGCACAGTAATAGAGACTATAGTAATGCTAGAGAACATAATGAAAAACAAAGAAAATATG 
GGGGGGAAAATAGCNNNNN 
>992 
>993 
>994

ACCAAGTTGTTCTCAAACTTTCATGTTTGTGTATAC5AAATGAGCTG A 
AACTACAGATTCCATGGCCTGGCXXrrCAGAGATTTTGACTCAACAGGTCTGAGTTGGG^ A 
GAAATATGCATTGCTAATAGGCACCCTGACAATTC<X^TGTAGGTGGTCCTTAGAACATATTT 
TGAGAAATATATTCTGTAGTCN 
>995 

TACCATCATCTGTTTCOCTCTGGTTATAAATCTTT AATGAAAACGGATTTAAAAAGTCA 
CATTATGATG CTCGAAGCTCTGACCTCT CATCACAATGAGAAG CAAAAGACATG CCATAAAG 
ATGATATTTCCCACAGGAACGATATTAGAATTATGTGATGCAATCTCATOCAAGGTCATGGTA 
TCAAACCAGAGACAG ct AAAAATGT ATCAT AAT AGCAAGGAT ACAGT AGCAAGGATGG GCCT 
CAAT AAACATTTAAAGTGGAAAAATTCTTCT CT AACTCATATCAAGTACCT GCCCNNN 
>996 
>997 

ACTTGGCAACAATAGCTACAAAGGATAGGATACTCAATTGCAAGTAGACTTTTAAAAT 
AAATTCACTTACTTCTATTCCCAACTCAATCTAGAATATTATTGGTGATAGTGAAAAGA 
CAGATGACATT ACTT CCAAATTTTACCAATCT AATTGTTTTT ACT CACACCT GT AGATGTCACT 
TTAAAAATGTGAATATT AATTTCTTCAAAACTACTCCAATTT AAGTAATGAG GG 
CAACCATTAAAGCTCTCTGTrCCCAACTCTAACAATATCT 

TTT ATGTTTACACAAAATCAAAGG TT AT ATTT AAGGTTTTCT ACATTTTTTTG GAT ATTT ACCT C 
CTTGTAATTTAGTTTTATATGTCTGTATTACAAAAC^TATTATATTCAAGAATTTTTAA 
GAGT AGAAGTGAAATTACAGGTTGAAGATTATTAAATTAN 

ACGTGTTTTACI I GG I GCTGTAGGTAATGCTAATTCATGATAAATTTTGAGAACCACT 
CTAGGGTAGTATGTTTCCAACAGTTTAGGTCATGAGCAACCTTGAGAAATAC^^ 
TGACTCAGC ACACA CACTCACATG CAGGT GTGACTTAGACGTTTC 

TTACAGTGTG UNCI UCTCTGGT ATTTTT ACTT AT ATTCTATTAAATAGATATGTGTGTAT AAA 
CTTATTGATATAAAAATGTGGTCATGATCCACTAAAGTGA7TTTACAAGCCACT AATGGG 

ACI H 1 1 1 1 1 1 1 1 1 1 1 1 1 I I II 1 1 1 I C I M T 1 1 M 1 1 1 1 1 1 H 1 1 1 I I M 1 1 1 II I I 1 1 1 1 I I I 

CCACTTGGGTTCTCCTTTTTTATTATTCGGCAAAATGATAAAAACX^ ATATAG 

GGTTTTC ATGG CTAGAGTTCTATAAAACTGCttTTre 

AGTCAAATTTTTTAAAAGCCTCGAGATCCAACAAAGCTGGAAA 

AATGGTTCATTTGAGATGTTGGCCTTCAGTACCATGAGAGGGAAAGCAGAACAAANNN 
>1000

ACT AACTGAAT ATTT ATTT AAAAAAGCATTAATTTATCTATCT ATATAACTAAATCTATC 
AAATATTCTTTAAAACACGAACCAAAGTTAATCTGAAACTCTTCXrrG 

TATATGCCTTCAACACAGAATTTGTCATTATTTCCTGTGGCATTATACT ATGCCCTTTGTCATA 
T GCI H ill t UCX^TAGAGC^TTTTTTCCCATAGAACrrTGTATTCCTCCACTT CT ACCACCTT 
TCTTTGAAGAACTCTTATTTACCATTTCTTGGACTAAATTAGGAAA 
>1001 

ACCCAGAATATGGTATATCT CTTCATTT ATTT AGCTCTTTTTAAATTTGTTTTGGT AAT 
ATTCTGTGAI 1 1 1 1 1 I I II 1 1 1 1 H I GGTATGGAGGTCTTACATCTTTTGTAAAATTT ATTCCT A 
ATACTTTGGATTTrGACAT T ATCAT A AAAGAAAATTATTTCACTGACTTT^ 
GGCCTAAACATATCAGT AATTTTTTAT ATTTT AATCTTGT ATCCTATGACTTT GCT AAATTCATA 
TATTAAATAGTTGCTCCATAGATTCCTTAAGATNN 
>1002

CGCGTCCGCTGTGGTG(X^TCTCAGTTCACCATAACCTCTGCCTCCTGGAT^ 
CAA GTCTCC T GCCTC AGTTTCGCAAGTAGCTGGGATTACAGGCATGCGCCACCA 
CT AATTTTG T ATTTTT AGT AGAT ATGGGG TTTCTCT ATGTT AGTTAGGCT GGTCTT GAACT CCC 
AACC TCAGG TGATCTGCCT GCCTTG GTCTCCCAAA^ 

CGTGTCCGGCCT ATCTCTT CI 1 1 1 1 GTAGCATAATGTAAATTAGTCTTT ATCTCTT AGAGAAG C 
ATCATTCTCATATCACATATACm^TGTm 

CTrGGCATTAATTAGATTGTGATCATAAGT CAAA ATGTCATTGGTT ATAAAGTGGTCATCAGA 
CCATGCAGACT AtTACTAATATTGGTTATGTTTT AGTTTATTGCAGTGAAAATACAAAATTTAA 

aagtt attqt agagaattattjat accccccaaaatgt gtcatt g gt cct ccaggactctgt ag 
tcc ccatcc aagaaag act gtgataattgtcaaggggtt agt atggtctgagcatggttgat 
ggtgctctgtcattctggtattaaacaaoctgccaa a i b i c i i gattagatgtcct aaaaaag 

TGAGGGGAAGAGTGTAGGACAAATGCAAAATAAAATAACACATTT AGCT ATACTTTT AGT ATT 

T^ CATTA TGGAGACTTCCATATTTAAGTGACCCGATTTCAAGATTC^^ 

GGACrTTGTGCCTTAACGTAACCTGGTGCATAAGAACGTGGGGGTGGGATAAGGAGCCCT^ 

TCTGCATGTACTGGCGGCCATAATCCAGAGACATGTGTGTCCGGCCCGAACCGGGGGTGAT 

CTCKN 

>1003 

^CATCTGTTTCTGAAAGCATTTTTCACTGAACCAATTTTCT ATACC i 1 1 i I CI IGTATT 

CI 1 1 IOCTTAGCI rrron I ATATGGTTGCTATATTTTTCAAGCCTCATACCAGTCATATAAAA 
CCATGATAAAACTTCATCAAAGCATACTTGGGCAAAT^ 

AAAAI 1 1 1 1 1 ACT AGTTTGGAAATAGATCTACATGTTTG AI I M C 1 1 I CCTTCCTCCCTCCTTTG 
TTTCT T GTCTrTCTCTCCCCTTTCCT AAAAAGTTAATGGCTATCATTATCTTCAOCAAATTAGT 
GTTTGTATACCCATAAAAATGNN 
>1004 

nnaagatcatgcactccagcatgggtgacagagtgagact^ 
aaagaccaaacaccacatgctctcacttataagcaggagctgaatggt gagaacacatggat 
atatggaggggaacaacccacactggggcctgccaaag 

ATCAGGAAGAACAGCTGATGGATGCTCGGCTTAATAC^ 
GCAAACCACCATGGCACACGTTTACTTATTJTAACTGCATAT^^ 

TAAAAGTTGAACAACAAAAAAAGAAGGAAAATGCGTTAATACCTTATTGT AATTATTATTTTTT 
GGAAGACT ATTTTITATATTCAGAAGAAGTGTCAGAGTCAGCAGAAAGG 
TTACCT ACA ACAATGG TTTTAAATGACT GOAT AGAT AGAAATCTCTTTCAACTTAACTGCTTAG 
CACATT6CAI 1 1 1 1 C I CTQTTTCAAGTTAGTTTTCCAAAGGATTACT 

GCT AAGGGATGTCAGGCCTT AATGACATA 1 1 1 1 1 CC I CAAATAAAGATACAACATGCTTTT AC 
TGTGTGAN 

>1005

MCTTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCA^ 
TATA 1 1 rTGAACTACAAAGGTGCTGTGGT ACTTT AAAGAAAAACTATTT1TACTACTTTATCTG 
AATGGT CTGTGGAC TTTATTTAGAAA C Hi 1 1 1 1 1 CAGTTT AGTTTTTTGGACATATCCTTTGCT 
CAGTGT ATTTTOI I ACTTCTCTAQTAAAGGT AGAAGTGAAGCAGATGCCATTGTAGGTTTT AC 
CAGCATTT^TATATTATGAATTGCTrAGCAATGAAATGCA^ 

ATACT ATTT ATGTATTCAGCTACAGAGATGAAATAGCATTTTATGTGTT AA 1 1 G I T 1 1 G GCTAT 
AAAATTTAAGTCCTTACAGCATTTGGGGATTATACACTTGGATTTC^ 
GTGAACAT ATCTAAT AGATATTCTGTTTACTTGTA6AAATAATATAATGTGTCTGGTTCTGTTT
TGTTAGACA \ 1 1 1 1 1 ATGTTCTACTGAATCAGAATAGTCAGTGTTATATTCCTTATTTCAAGAAT 
TCATCTATTTTATGGCTACATCTCATATGTGCTATGAGGACTTCTAGCCAGTGGGTGGCGCT 
GTAATACCGAAGTACTGCACTGCTTTGTT(XrrGTTT<XnT 

G GTAACATT ACCGGAT GGCATTCXXTTT AAATT(X^CCJVATCTT AATCTT ATTTT AAAATGG CT a 
CTTTTAGGGGGTGGGGTACGGTAAAACTTAATAGTaCGTGTTTATCTCAGTTAATGTCTGGAT 
ATCT A TATGCA CTGGTT AAC>GTCCCGACCGTATATGATGGCGTATTTAGGAAAGATATAATA 
CTGGCCTTTT AAT AT AAAAATCAAAAAGAG CGAAAAAGAGGGGGGGAAATCCCGGGGGCCT 
AAGCCAATATTAAAGGAGAAAGAGGATAAAAAGCGGTAAAAGAGCGAAGATGTAGAAACCC 

gtgagtagacaaagaaaagtatagctgtatanmnnnnnnnnn 

>1006

ACATAGTTCTGCTTGCATTGGTCCCATT ACAATCXTTGTCTAAATCXrr GAAGTAAAAAT 
GAAT ACCAT AG TGAAGAAATT ACTTGTGCATGTGAAAGAGG CTGGTCCAACTCCTT AATTGC 
AACAGXSGATTTG ATTCTTC TACTAGTAGTTAGGAAAGGTTGCATTAATATTCAGTAGTTAAAAT 
GTGCGATTCT AAA I I I I 1 1 GTAATTTCCC^TGAGAGAATAAATTTTTTCAAAAATATTCCCAGT 
AGGTGAATGGCTTTAATACATGGTATCTGTGAAGATGGCAAATAAAATGACT 
>1007 
>1008 

ACACTGGCTCACCTCTCAGGGCTTTGCTCCTTGGGAGGCT ATTCAAGCTCAGCATC 

acctgtctcacatctgtctgggatcctcaaacctgacctttgtaa^ 

TTGTAGAGGAAAAAAAAAAACATCTTATCGAATTCCTGCTCTT ATAGCTGATTTTAGCT ATT AG 
GAAA ACATCCC AAGTTGAGCTrTTT JTATr C CTAGAATTTC AG A I I I CI I fOCI 1 1 1 1 AAAAATTT 
TATCTCCTTTT ATAGTAGTAAAAATATTTTCCT I T T 7 TT TTGGAATGGAGGTCTTAAGCTCAGT 
GTCAAAAATAAAATCATTTTA 
>1009 

NNNCTTCTCT AT ATTTTT AAACTTCCTTGATTCT GAL 1 1 1 1 C I CATATAT AAAATGAAAT 
CAATGTTACAGTCAAAATTGCCTTTCAAAAATTCCTT AAGTCT ACCCATTTTCAGACTTAGTGA 
AAAATATATAGTCT AGGACCTATAGTGTATAAT AATGCACACTTAAGCGAAGGAAATGGTGAG 

ctaggctagagaaagact aagggagcatttatatacatttgcctaaaatatt^aaactttcag 

GGTAACTCGGTT GAGA AGGGATGAGTGAAGAATTCTAGAGAGTACCTT CTT GCCT ACAGCGT 

ttagctccctttgttttgcataaagatctgttt^ 

GCTT ATTCTCACTATGTAAATTACTT AGT AAAT AAT AGGAAGAGATGTTGAAATACAAACTTTC 

t^gccaccagac cjtcac tctattg^ 

ATGCAAACATGCTCAAGTCTGTCTTATAAAAAATAAAAAAAAAATNN^1^^^^ 
>1010 

NN NCGAGGTACICI 1 1 1 CAGATGAAAGTGTTCGGTCACCTGGAACCTGTGAGT ATGT 

g g i iii i gatc tgtgact aaac tgtcacccatttccca g 1 1 i c i ci gctocgtcaaatatcaa 
cattttaccagg imciciuflgtt gccaaacctgtcatttttatttggtgtggcttcttggg 
aaacttccatggcccatttgatggcaatcaaacagtgaaaacaaggacag 
tggcatcaggaacaaatgggtcataagaacttacctt ggcagcagccccagaatggtcagg 
agg aaaggcact ct aag^t atcagaaggtagaaaggagaggttggatgat agaatggggaa 
tggattcctcctcgcgttcacagaaatgaattaatgggagacan 

>1011 

ACTGAGACACTGGATCCTAAGAAAATCAGAGTTATAGCTAGTGGCAGTT ATCAAGGG 
AATGCAGAGGI 1 1 C I G I ATTCTGAGCA1 G U CC t G I AAT AGGATAGATAGGCGATGTGGCAG 
CAACAACTCCCAATTCGTAATGTCTTAAAACAAAAACAAGTTTTATTTCC 
TTCCAGCACAGTTTCTCAGAGGGCTGTGCTCCATGCATTTACTCAAGGT 
TGGCTACACTATCTTGCAGCCACCATATTrGGAACCTGTTGCCACTCTGA 
ACAAAAGAAACOCAAAGATCATGT ATGAGCT ATTCACT 

>1012 

- • ACGGGL mTTrBTTCn G I GCAGTAACAGTGAGGGCATGATTAGCCATCTTTGCCA 
GCTGATGTCTTGTGGACACCTGCCTTGTr ACCACTCT AACAGGCCCGT GTCAGCAGCTCCG 
CI I CC I CC I GACAAGCTGCGAGX^CAGGGGAC^GCACAATCTGAAACTCTT ACAGAT ACCAA 
CAGCAACAAAAATOAAAGCAGTTATGGTGGGCAAGCATTAATCTAAAA 1 1 III 1 1 AAAGGAC 
AACiCTT ITTTATATTTGACTCCTACTAATAATAAGGATTTGGTGGGGGGAAAGAGAAAATAAA

TTTCAATAAGATTTTTCCTCAAGGGAAGCAAACNN 

>1013 

acgcggggggtctcaccatgttggccag gccggtct caaattcct gacctcaagtg 
atc ctcc cccgtcagcctccccaaagtgccaggattataagcaggagccacggggcocagc 
ctattttgtttcttaaa t f i f t f i gttjttc agtcaccacaatttcaccatgcat aaat cacaac 

GGTTAACAAT TTAG CATCTTTGCCI ICI 1 1 I CCTGTGCACTTACGTTTTTATGTAGCCAAGATC 
acacgttgcattttgctgctttcctt AACAGCGTCTAAGTCATCAGCACTCTATTGTG atgat 
ttatcttaaaaatattccaagcgatcatttttagtaactgtgtaatattatatcataaagttaaa 
ac^taatttgtcattcaattgttgaaatttttaggttacgtatattttctcttataaatatgtaa 
atatgtttataaaaagttatatacagttttttataaatctttgtgcatactttatactgtttcct 
tagcatagagactgtggaataggatttcttgaaaaaaggtaaaagtgtgagtatgcatatata 
ctggtacatatatgnnn 

>1014 

acttattcagacaagagttctgactctcatgcttgaggataagattatacatttcagt 
attacattgaagatatttcatttttaaccagact aactt agtatattgttatttttaatgtgacc 
aaagaaatattt tcat agaagctaatgctgagtctittgataatttgccgtatcttagtgaatc 
ccaaaaaatttaf i 1 1 ctactatttacat attatcctagtggatattacattacttactgaagcc 
tttggttctatgtttcatctactcagacttaattcaggaagagcttcatccagatgttttgtrt 
atttgtttctcgattacatgtatgagatttcagaatttatgagatcataggtcaagtgaaagg 
tcacagttgagaggtcaagtaagaagctaaaatttgtgaaaccaaagaaatgacaggacagt 

GCCAAATGAAAGGTCAAAAGTCAAGTGACAGACTCAGT 

>1015 

>1016 

ACTATTATAATAAGTTAACATATTTCCCCTATATGCGG AAAATGCTGACT ATATCTTTT 
GGTTGCTTTGGAACACTATCTCCTCACAACAGTCCTTGTCT ACAGAAATGGGAAAGGGAAGG 
ACACATTTTG^TTTCTGCAACATGGGAACATTCGTAAAACCAGAAATGATGTGTGACAAGAAA 
CT AAA6AACT GGACGAAATTCACTTCCATTCACCCTGGTTAAAGCTT CCTTGAATCAGAGATA 
AGAAACAACATGAAAAATCTATTCCTTTTAGAAAACAAGTCTTTAACCCAG AG GTTG GTTTATT 
TTGAAAAGGAATTAGACTCTGGGCCCACATACCGCTCGTTCAAAATATAATGCTGTGGTTTCA 
ACTCCTGCTAAATGTTGCTGTGACTTTTAAGCAGAGAACTTCT AAAAGGAAGTAACCTAGGG 
AG GGGCT GATATAACTCAGAGATCAATAATTCATTTTATTGGAAATAGGAGTAGTAGTATGAA 
ATGCTAGCAGACTGTTTCANNNNNN 
>1017 

ACAAGTATTATGTATCCATAAAAATTAAAAAATCTTTAAAAATGCATATGGGGGTGAG 
TAGGTAAAAGAAAAGAGAACCAAGAGAGCTGCAGCGGGGAGCACAGCTTGCTTT AAACATG 
AGATCCAGCTCAGTGATCATGCGGGGGAAAAGGCCCGGCATTGCTGGAACTGCTAATATTT 
AAAAAGATGATGGAAACTTGAAATTTTATATTTAATCTrCTCATTTTTAAGTGTTGGCAATGTA 
TTGAAGACTTTGAAGCCTCTCTGCTGGTCAAACAAGATGT ATCTGTAGGCTGGATTTAGTCC 
ACAGCTGGCCAGTTTGAAAACTGAATCCTGCTAGCCTTAATTTAAATTTTTTTA^ 
GCTTT GATTCCTGCACTCCTGCTCAAAAAAATCTT CAATGGCT COCCACTGTCTGCAAGGT AA 
AGTOCAAACTTTGTCACCAGTCCTTCAAAGCAACCCATGACT ATATCCAAGACCCCAAACCAT 
ATTTCTACCTTATAGCCAGTCTCCATC7TCCACCGCAACCAGAATGATAGTTGAATTGTACTC 



rrgccctacactgtgcatgcctatcaatgaaaccctgtccatc 
at taacc atccagctcaaataccacctctccacaaaacatccctgatggcccagccaaatgc 
cccttt gctctgaatttccatgggactttatatcactcacatgacactt acaacatactgcct 
tgtttt g gcat tagctacatagctcattttcccaaacacctaataatattcacagaaggaaga 
gtttggg 1 1 1 1 gcactatattgaatatccctgtagtgctttgtacatagcannnnn 

>1018 

acgcgggtcccttattttctggtgtttacttgggatgcatcagtgaacaaaacaaag 
gtatctgtcctt atgaaatttatatcatagcagaggaagact g gaaatgaat aaat aaat aaa

GAATGGAGTTTGTTGGAAGGTAATAAGTTCTGTGGAAACAAGGAAAACCAAGGCATGGAGGT 
TTGGAGTGCTAAAGTGAAGGTGTGAGAAGAGATTGCTCTtGCTC^GTTrrCTGTC 1 I CI 1 1 IG 
TTTAGGAAATTQTCATTCTCTGT ATGCTTCATTATAATATATACAATAAATATGAATTGTTATAA 
TTTAAGATAAATTATATAAATATAAATTATAAATGTGTGT 
>1019

acttagttactccttgcccatagacgtgtttgacctagaaaaatttcttatacgcaac 
agatattcatgaaatatatattaaaataaagcttgaagggtgaattaaataaatatttacttgg 

AAGCTACAGTGGGTGAATTAAACAAATATTTACrrGGAAGCTACTTTATAGCCACTGGGCTG 
GAmCATATAGAGAGTTCTTGCCCTTGGGAGTTTTACAACTGCTTAACACTTTGTCTATGCT 

agaatacataaaaaaaaaaaaaaaaaaaaaagtaccnn 

>1020 

nnnnacaacacacatactttagagataatcagaagcaaatatttttttaaatgccaga 
atatacacataagcatagttaaataaaaaaaaatgttcactctgttgatgttaaagtccatgga 
ctcaaatctcttt gg acaattgtgttcaaaatttttcacactg AG atat attt acaggtgg g at 
ttatgttaaggccaatatacttactcacagagacacaagagaataccaaagtacctaatgctt 
tcagcccaggagcagaaagagaagtgggctctttgctttgagagtctctgaaaattattcaa 
taccctgggacaaattaaatgaggtagatccttctttgaatttgttaataaagcatgcttgttt 
tgtctccataaaacaggctttgac^ttaaggtttatattttaaatgggtaaattttatt^ 
tacactaa tttta agaaaagaattaactcatggcttaaaagcaaaaacgagaccttggatttc 
acccataactttaaggctggtcattttaaccctgatttgacacactcttattatggtgtct^ 
ctcctratttggctaaatatttctgaccatcatagcaatcttttctataaaggaagcaggcaa 

GAGAGCTAGAGTGAAAATGTTAAAAACAAAACAAAAAAGACAGCATACTGGCTACCAGTTTTT 

cttaattaagatgatctgttttcgcaattgcgtaaattagaataaaatgttatttaactcaagg 
gcaatg^tcSSg^^ 

>1021 

actt acagtcttaagatatccatacacccccacatccgtcctttgtgcgagaagatt 
actgaaaatttaattccatttatgtgattg gatttgt aaaaaaccccttct ggatt caaag atg 

AAGGCCTCACTT ACTTT All 1 I 1 GTCATTTTCACAGACCCCTTATGTAAATGCCT CAAGAGTAA 
GAATCTTGCTCAAGTGATTTTTGTATCTCCAATGGCTAACAAGGAGCCTGACATAGAGTAGCT 
GCTT GGTAAATATGTGTTCATTCATTCAACAAATACCCCCAAGGNN 
>1022 

ACCGTGTGGGCCACTAATACATAAGCATCTGTGTTGGCTGGGGGTAGGTGTAGGGG 
GTGCTTGGGGAGAG ATTTAAACAAAGGCTTTCCTCTACTTG CAACATCTCTT AAAAGCTTGTC 
AT CATGTT ACTTCCT A I MC I 1 1 AGAGTTCATTTGTTTAAGACGGAAACGTGCTT CATCTTGTT 
CGCTTTTTCT GCATTCTTTGTAAACTTAATATTCTAATTAGCCCCAACACGGAAAAGAATGTAA 
CACAACTGTCTTAGTTGTGCCATAGAGTTAGAATCTATCTATTAACATGTTTTAGTAATAACAA 

gagaaataataaaaacacacctattatgagac^ctgccc^tgcjcaataaatttgaaacattac 
caggaatataaaggaaggaaggaaacaggtggagacactcan 

>1023 

NNNN N ACT ATGGGGAGCT ATTATAGAGATTTTAAGTATGGAAGTTGACAATGATCAG 

aatatgcagtttgggaacaattacctctggtgtttttgaagaaaatagaatgcagaagagtaa 
gcct acagatagaaagcatttaggaagctctagtaatagtcctggtaaaaagtaaggaagcc 
ctaagtattgccactgcaaatagaagagtggattcaaagataagtaggtaggattagaagtt 

^TITATGTG<3AATGGGAGGGAAATATGGTTAGTTTCAGGTTCCT GGTTTGGGCCGTTGGGT 

ggttgatgggattatttcxxx^aatagagaatatagaagtgggaagaacacataattatgagt 
gaaaacattaattaatttag^aaatgttgtttga^tatatgtactttctct 
tagaactgcctttggt acactagagtatgttggtgaaaatttgttctacttatggcatgcttg 
tccaacctgcg gcccaggatggc tttgaatgcagcccgacacaaactttgtt aaagcattat 
GGATTTCGAI MU 1 1 I I I J I M I I ggctgatcagctattgttcatgtannnnnn 
>1024 

nnnacccacaatggaaagatggtcttcctgcattgtgaaggttgttctcatcaacca 

AGCCTGCAATGACT AGACATTCTAAAGAGAAGAGTGATGGCAATGGAAAGAGGACACATCCA 

CTTGCCAGGTCACTT ct atcagttgatgacatgccatatgttatggctaggtcagctttccac 
aagtatgcacatgcaaaatagaacttgggaaaaaaatctittgatttggccctctaccaagtto 
atcagtgtgtcagagttcagttgagcaaaggtc^gagtttaagttgaatctcgagtcactctt 

TTGAAGATAjrTGGTGATGCCAAATTTAATTTAGGATATTATC^ 

cattt atagTgt GaatttacaggatttagccttagtccgagaaAaactggcccctggcccag 

ACT ATACACC^GCAGTTCTAAACATATACAATTCATGTTAAGGAAAAAACAGATT^^ 
ACATTACTCCTAAATAACATTTATTTCCATGTGACTTGCATGTCTAAATAAAATGAATTTO 
TAGTAAGGCTTTCATTTATCTTCATCTAAA«3CTT1TCCACCAGTACTCA^

CAAGGTT ATGAAGCAGATTGATATAGAAACTCCATTTCT AGGACAAAAACGAGGCACCTTGA 

GAATGGACCCAAGCCATAATAAACTAGTAATGCATTTrCXXACACAACsCATATAAAATACAGT 

G AGGCCTTGATCAAGCTTTGCACAAAAAG AATTGCTCATTAAGTCTTT GATTTTTTAAAAAATG 

GCACTrGGTTTTCTAGAAAGAGGATCTGCACACTAACTATrCGATTTGTTTGAATATAGTCAG 

ATTATTATGTCTGTGTTTATTAGGCrrAATTrGAAAAGATTTTTGGrrT 

gatcaaacactatagagttcaagaccagcctgggcaacatagtgagannnn 

>1025

acttgtttctccccttcggacx^ctctccccact agacagctgtatggccggctcc 
ctcactctc^c^ggtctatcagagggtggccactgacxitcattgtctcyo^acattatatag 
aacacac^cgcacccatgcacgcacaoxstcottctt^ 

ccagga<xn"acagcagtgcctagaacacagaacatccattagcaacatttgtttaatgaattt 
atagtgcctaaacctgcacaactctgactttgccttgctattagaaaatgc^ 
cggtggctcacacct gtaatcc<^gcactttgagaggccgaggtgggcggatcacttgag 
gtcaggagttcaagacaagcctggccaacatggcgaaacxrrattctttactaaaaatacaaa 
aataacctagggctgatggcatgtgcctataan 

>1026 

nnnncganagtcttctagnattaattaacttagttgaaagtgttctgaaacttgccag 
actcaaagtgtagagcacaacagcaactgtg ctrga ccgtactcctrggtttacatctattgt 
cttgacaaaaatattaatagcatttcctttcacttttagaaatactogagtttgtatg 
ttaggttgactttaccctcatgctataaattggtggtgttctggttg 
agattgtataatcaacaattacccaaagcagagattctgatttagtag g ccaagggtggcct 
actaaatgtgcattttaagaaggcatgtctaaggattcaggtagtagtgtgagaagcagtac 
tctgtttctggtactgaggctaatggtcttagttgggataaggagagtggggaaggggcag 
ggggagatgatgaaattc^tttatcctctgtgatgctatggaagaacaattaagatcatgttt 
cctacttgattttagttgctagtcatttcttaatcta^^ 

TCATG<^AAATCACCATCX5GTAATAATGTGGGGGCGGGGGAAGTCTATACAAGAATATTAAG 

gccctgtgcgtgagcatgtctatagttaaagacttaatgagaaagcat^ 
aacagctgaaagtagaagt aaatcacaacgtaataagatg caactttggagg agctcaaagc 
aacagatacg till i atcxiaaaaaggagt aaaacaaaaaaatcgtcaac^gcagttccttcag 
ataatcaactgatgatttc^tttgaaaaccataattaactagcgttgtttgta 
tccatttataui 1 1 1 1 aatgtttattaaattacttttctct atagatatgcagataagatgtttt 

AAATGTGTAAGTGGTATAAATGTCCCATGTGTCTrrTATTCTAGAGCATAAGAAAAGATGGGA 
AGCTACCTCAAACTTGTTTTGAGGCAAATGT AGTATGAATCCCT AAACATAACACCAGTACTG 
CTAAATGACAG7TATCTAATCTCACTGATGAACAAGAATAWNNNNNN 
>1027 

ACT AATTCTTTT CCTCTTTCCT AGACCGATTCT AGTTTGTT GCGTTCCCTTTCCTCGG 
AAACCCCAAGTTTGTGGATGCTGCAGACACTCTGTGCCCCC^^ 

CAGCTGCCAGGGCATAAAGACAGAGACGATGTGGCCTTTGTCCTT AAGAATGAGGTTTGAAA 

GCCCCAGTTCTTCCATGTTAGGrTGATTTCTTGCAGCTCTTGGT ATCTGCAGAATTAGTGTGAA 

TGCTTAAAAAATATTAACAGCTTT ATATCATGAAAGTTTTAACATGN 

>1028 

>1029 

NNWNAAACATTTflA GACT CACTGTGT AGCCTTCTTGGAATCGGGAATTCGCTTAATG 
CTGTCCGTGAAAAAATAGCCTTTAACATCTGTTTGATTGAGATTTGTGATACATAGA 
GAGGAAGATGTGGAAAGCCCTAAGAGAGCT ACTTGCCAACCCCACCATCAGGTCTGCCTCA 
GTGTTCCTAGTCAGGACAGACGAGGCCGAGTCTGATATTAGAT AGT CTTTGAATGCAACATA 
AACAGACCACAGGGACTGGTATGTAGCAAATGGTCAATATATAATGTACATAGGAANNNN 
>1030

ACTTTGACCTGTATGTAAACTCTAGTT ACTTT GGT CTT CTCAGGCTCTTGACTCTTTC 
ACAATTAAAGTAGTCTTTGAGGCrrCAGCCTGCTTTCCTCATAGCT ATGCTATGGCCTGGACA 
CTCAGGGGAGTATAAGCTGAGGCAAACATQGACTCA 1 1 IGI I N CTAACTTTCAGGGATTATT 

gtccatcattgcctgatgtccagtgtcttgaaaagcaatt 
>1031

accattg Hugh caaaatcacaatttaaat acttcgtgatttt agaaat aatggaGc 
cacgttttcaccattaaggtgagtgattgttcagatacatttggcact 
GCTTCCAACTTGTTTAAGACCATTCtDCAGAGTGAGAGCTGATTTGCCATGGTTATGAAGCTTT

caggatataaactataagaatgacaaactacagcagttgaaaatgtgtcttcagatactcact 
tgcaactccxxatttatgtctctagggattgagaaatgaggatcgagggaccaaatctggctt 
ggtcagtaagagtgtaggtaacatataaatattaatgttcgttgcagttagtgtggt 

>1032 

NNNNNNNNNNwSS^ 

cagctcactgcagcctcaacctcccx3ggcx:caagc^tcctcccacctcagcctccccagt 
agctgtgttccaaagaaattt attt ataaaac aggtgttg ggct ggactt g acccgtgggcc 
ac^gtttgtcaactgccattctgtaagcrrtaacatgtgttaattact 
gctatgatatagac^ctgtgttccttttaatagacaaaggaacccaggcacagaaggattga 
ctaatatgaccaaagtcacactgccagtgagtagcaagcctgagctctgaaccatgacagtt 

CACATCTTCCACGACAGCAGCTTCTCAATGCTCTTTGGAGGGACCAGAGCCCAGGCAGTAG 
CAACGGCTATGAGGTGGTGAGACATGACCAGCAGATAAGCCCTGGGCAATTGGCCAGAGCT 
GGAGGGAGTGGAGAACTAGCCATNTGTGACrrTTGTGAACATCCCTGGGGAGTCTGGAATTA 



>1033 

ACTAGATTGGGTGTGTGTATTAAGAGAAAGACAGGAGTCAAAGATAGTTCCAAAACT 
TTTG AACAGAACACTGGATG AATACTGTTTACTGAG ATGGGGAACACTT AG AGAAAAATGCA 
T7TGGAAAGCAGAAATACGATCAAGACTTCCATTTTTGATACATTAAGCTTGGTATGTTTAATT 
CATAGCTATATAGAGGTATTAAATTGGCAGGACAAAATCATAGCTAGAGATAAAAATTTAGAG 
TTCACCAGTGTAAAGATGATATTTGATGGCACAGGATGGA O I ICI I CTGGGATTTGAGTATA 
CATAGAGGAAAGATGTGAG6ATTGAGCACXAGGGGACTTCAACATTGACAGGCTCAACAGA 
GGAGAATT CCCAAGAG GATG AG G TTCCACCTTT AG G ACCG CCAAAGAAGACTTCCCAGACA 
AGN 
>1034 
>1035 

ACCATTTAACTGAGTGAAAG CTTTACAATTG AGGGGTT ACT CATTAG CAGG ACCTGG 
GTTTTGu I I lAATCT CATTAACXXXJTTGTTA(XX^ATTTGATAACAAAGACTTCAAGGAAGAAT 
TTGCTCAAAAATCTCTGGGAGACAGTAATAGCTTCrrTGGGCCTGACTGATAA^ 
CX^GCAATGGAAATGTGGGAAAATTCCJAGATGCTAAATGATCTGGCTTGGACCCAGCAGGTT 
G AGGTAGTG GAGCCTTTCGATTGAGG CACAGCCCAGGACTGCTGCAAGGGAGAGGCACAA 
CAGATA 
>1036 

CCCACGCGTCCGATCAAACCAGGAGCAGGTG CAGCCCGGAGGCCCCCCCCGTGG 
GGAGAAGCXaGCTGGGGGGTCCCTGCATCTrGTACTGCGGCC^CTCCAACAAGTCXa 
TGTCCTGGCAGGACTGCTCCTGAGAAGGATGGAGCTGAAGCCCCTCCGT 
CAGGCTGAGGCCAGCGAGGTTCCCAGTGCCGCGTGTGGGCAGCAGGAAGCTGCTCAGGW^A 
GAGCCCCCGGGAGGGGAGGCCGAACCAGAGCCTCAGGAGCATCACCCTGCAC^ 
TCCG GCAG GCCCXXiAACCCTT ACT(X3TCGGAAATCAAGGCCTT^ GCAGAG 
AGGGGAGCTCTTCTCCAGGGAGGACCTGGTCrrGGTACAAGAAGGTCTTGTGGGAGGCTCG 
GAAGTTCGAGCTGGACCGGCATGAGGTCAT CCT CTGCACCT G CTCCTGTG CAGCCTCT GCC 
AGCCTCAAAATCCTGGACGTGAGGCAGATCCTTGTTGACGAGGCAGGW 
CTGAAACXXTTCATCCCtXrrGGTGCAGTTCCCACAGGCCGAG 

ccacaagcagct GCGGCCTGTGGTCAAGAATGAGCGGCTGCAAAACCTGGGTCTGGACCG 
GTCT CT GTT (XSAGCGGTACCACGAGGACGCACATATGCTGGACACTCAGTACCGCATGCAT 
GAGGGCATCTGTGCCTTCCCCTCTGTGGCGTTCTACAAGAGCAAGCT^^ 
GCCTGAGGAGGCCGCCCAGTGTCCTGGGCCACGCTGGCAAGGAGAGCT 

ttggccacgtgcagggccacgagcggagcctgct ggtgtccacggacgaagggaatgaga 

ACTCCAAGGCCAACCTGGAGGAGGT GGCTGAGGTGGTCCGT ATGACXAAGCAGCTGACCC 
TGGGGAGGACCGT AGAGCCCCAGG ACATCGCCGT GCT CACGCCCTAI 



ccaagagccaagggagcgagtggcgct atgtgctggt gagcaccgtccgcacctgtgcca 

AGAGCGACCTGGACCAGCGGCCCACCAAGAGCT GGCTCAAGAAGTTTCTGX5GCTTCGTT GT 

GGACCCCAACCAAGTGAATGTGGCTGTCACGCGGGCCCAGGAGGGGCTCT 

AG ACCACCTCCTTCTGCGCTGCTG CCCCCTCrrGGCGTAGCCTCXTrGGACTTCTG CGAGGCT 
CAGCAGACCCTCGTGCCTGCCGGCCA6GTGCGCGTCTGCAGGAGGCCAACT ATGCCTTCC
TGAAGAGCCCTCTCCACCTGCAAGGTGCCAGGACTGGGCAGGGAAAGTCCACGGGGCCCC 
ACGAAGGGACGACGCCCTGCGACATCAOCOGCCOGGGCCCAGCCTGCCTCCTGGCCTCAG 

gctcccctgggccaggacactgagtgaaaaagaaagaagaacggaaacagcaacgagacc 
cgcagokcaccagaccacacctggcccacc^ 

caaagcagcaacctgcgaccagcgacagtgaacaccagaaagatcaaggaccagacaccag 
caaacatgtaaagagaatccgtgtgaacggaacaacaacaaaacaatagaacaacacaacag 

acccggtatctatagaaagagagagtcagtcgctacatcagaacatcactaactgaggacac 
gcgcacgagcagocacacagcgaaaacgaacgaccaaagacacagcagaccgggagcgat 

AGCAGCGCACAGCACAGACATCACAGCAGCCCCTACGCGGCCCGAGAACCCGAACAGGAG 
AGACGAGCACGCTGCCAGaCGAAAATGAACGAGCAGCCAGAAGAGGGCGAGGAGCGACCG 

>1037 

ACCATTTAACTGAGTGAAAGCTTTACAATTGAGGGGTTACTCATTAGCAGGACCTGG 

gttttgtttttaatctcattaaccccttgttacccatttgataacaaagac^ 
ttgctcaaaaatctctgggagacagtaatagcttcttgggcctgactgataaactttt^ 
ccagcaatggaaatgtg ggaaaattccagatgct aaatgatctg gcttggacccagcaggtt 

cagata
>1038

actttgactattttttagcaacaaattacttttgacacacagcac^ 
ttccaattttggaactattggataaataatgatgggat^ 

tt acagcatagggtctcttgt agtcctcttagt aaaaactattgtgacactt ccttctttctcc 

aaatattcggcctggaaagacctaaatacaatgcagggattgaatcaaattcacacattttt^ 

ttcctacggaaacaacaacxrrttcttgcttatatttaacaaaaactag 

attaatagtratatggtatttttttctcagagtagaaatcaggtttataggctaagaatatagg 

ctaatttggagcataacactaaccagcatgaaoctaagtgagtacn 

>1039

ACTTAGATCAGATGGATTGAAACATGACAGCCGCATTTCATCTGGCCGGTTAAGGTC 
CTCATGGAATGAAAAACACTTTCGGGCACTCTCCTATGAQAGAGAGAATGG G 1 1 1 U 1 1 1 AATT 
GCCAGATTGTCTGAACACAGCCTCAGCTACTTCTAGGAATAAGACGAAGCAGTGAGGAAGTT 
GCCAGTTGAGTGATTCTTGGKiAAAAAAATTAGCATTCAGTGCCAGCT CTCT AAAGTGTGGA 
TTCTGGATTCTGGTAGAAGCCAGT AAAGAAACGTTTTCrrCTGGAGTGGAAGCT AGTAAGATT . 
TATTCTGTGGTGATGAAGCCATCTGAAACCTT ACAAGCAGTGTGGTTGTATCAGCATATGGG 
AGCTGACTGCCTCAGGACTTTGGAAGCCT GCTTCT CTGTGCCTCAGGCGGAACTCAGGTTA 
CTCAGTAGTCATTTGCTAATTTCTGAGAACXaCAACACTCCTGAAGGGATAGAAAGCATGAAC 
AATACCCAAACTTTTTAGACTAGTACTGTGTGTCAGGTATTGTAACATTCATTCAGTCCCTGA 
AATGTACAAACT AGTTATTGCTCCCCTTTTTCATITGAGGATACTGATGCCCAGATAAGATAA 
GTGGCTTGCCCTGGGTCACATAGCTTGTAAATGCCGAGCTAGCATTTGAAN 
>1040 

ACTCTTATCAACTGTTTTATAGATGAGAAAAC^TTAGCCAGAGCTT AGCTT ATTTGAA 
GTCACAAT AAT ATTAACT AAGT AAGAGCAAAAGCCAAGATrCAAATGTAGATTATTTTACTACA 
GACTGAGAAAGGAATTAAACTAGGAGCCTAAGATACTTTCTGGAATTGAAATGATACATTATA 
TATACCTATAAAGATAATTGGCT ATAGCTrCCTAAACTACAAATTGTCATAAAAATGACTTCTG 
TCCTATATCAATTAGAAACTGGTATTAAAATTGAGTATTATAAGACAATAGAATGN 
>1041

NNNACTGCAGGGCCCAAGAGCAT ACAAAGCTAGTTATTTGGATCCAAAGTTGGT CAA 
GTGTGCAGTGTTTAGACATCATGATCTAGGCAAACAGAATTCCT GGCCT GAAATATGTCACT 
AGTTAGAAACATTAGAAGCTXTCAGGTAAATAAATATAAAAAACCAGTCAACCGTATTCTTATT 
TCTTCGTCAGAGAATCATGTGTCGTTTGGTTTAACTTCCT 

TGAACATATTAATCTCATTATTTTCTGT AGAGGAGAGGTTGTCCCCCCCTTCCTGATTAGCGC 
OCTOAfcTOCTraTTAOOOC TCTCIO CC TC l GGCCCTQTGACCAGCACGGTTGCTCCAGCAG . 

gcagcagtgcGtgggoctgctci ccatggcagagacagggctgtgaagcttgggt ' 

>1042

accctgctttgattatttccgaatccagtgggtagagaaggt a 
actggatatttttaaattgtagggatgtcctttgctctgggtcaattttaggatcaaatataaa
agcacctatagctcagagtatcttctaacataaaacttctgagataccagaaat7ttccaaaa 

CATGGTATAAACAGTATGAAACACTGGGTAGATAAAAGCTTTCTCTAAATCTTAAAGTGCTCA 

aatatcatgaoctgattttttagttttagaaatcagatatttttctattdcatatcttaaac^ 
catgttaaattctagttctgacaatgtagggttctai rrm ic aggtgattgttgggagcgt 
ATAGAAGCATATATAAATATGGAATATGTGI MC I I I 1 1 1 ccccttctgaaagaaagtcaagcc 
tctaatcaaatagattgatgcttcagaaacttaacagaatattatctgcaatttggcataaatg 
catni i iciiggggaagtttccatggtcaaaattattagtcartgcaaaacagaaaagtttga 
caactggaaann 

>1043 

AaXGTTTGTCCATGGCTATTCCAAATACCCCCATGTTTATTTAAAATGTATATATAAT 

cagttacataaaaagaggtatgcttaaattctcatgactctatggttggacctctgtggttgg 
agcaggcaatagaaatgtctgtaattcatttaaaaaaaaaagtgactttcctacctttagata 
gtgaggacaatctgttaactctttgtgttgataaaagcaaacatttcagggcacggtgaaag 
aaatctctaocatgtataaggttatatatataocagaagcagtggagttaggaccaaattaag 

ATTTGAC 

>1044 

>1045 

^^CGTCCGGTTCTGACCTATTCCAAGAGTACAGCCATCAACAGTTAGTCCCAAGAGT 
TGGAGGCA7TGTTGGGGAAAGAGATTGCAATAGACTGGTGCAGATCAGTTACCTGAAGCTC 

ctggcttatctgaggcctggtgatgctgctgtgtgcccagaagccagtcatatggatgtctt 
gacctgtccagcacacacctcag tag tag gagat atgggt ctttg gaatgctt7 atgtgtga 

TGaGGATGAAACAGTT AAGTGCT ACTTTCT CATCCTCGGCCTTATG ACATTAGTT AGTTGTGT 
AAGATTTTGCAGGGAGGTATTCCCTCCTCTACCCTTTCGTCACAAGCOCCTCTCATTCCTCT 

gaaaaggttccatactccagtccct accctcaaagaaagattttactaagcaaaagtatctat 
ggctct ct ct gttctcctt gcttt agctt agc^cagctt aaacttgg<^tctcaccagtctgaca 
gggcaacctccaagattcacaacccaggaaaagtactgctttgcgtaagtttaatcaagacc 
agagcagagacaggacacaaagtcaaaagaactcagtcagcnn 

>1046 

acagcactttcaaagt agtg gaat at aaatctttccattt aacagcaacattcaaat a 
tttcccattctgctt attattcctctct gaaggtgatacatag aaat ataggagcaaacacag 
caatgcaggcgctct atgatctggtttgctcacatagatctt aaaaggagaagaatgaggga 

tccttgaggtagaaatggtaaactag catgacctcgaatcataatttaatatcattct an 

>1047 

ACATT ATTGGTAGT ATCT CAGAATCCTGCTT AGCTTTTGAGATAAACCAAGTCATGAT 
ATTTTG GGTAA TATGGCCATAQGTATCATGCAAGATTGAACTGCCCAGTATTTG CCI 1 1 MC A 
AT ATT TACT TT GTAAGAACCTGACAGTGTAGGTCCTCACCACACCAAAACCTGCAACATAAAC 

TTCAAt 1 1 T gggcaactcatagaccaaaaaagctaaacaaaacaaaaaggaaaaaaccctcta 

TATACAATCACCCT GCTTGTCT ACATTTAATTTGCTT CATTCANAATAAGGAGT CACN 

>1048 

>1049 

ggggacatttagttcgggcatgaaaaagaagttaacaagcaaaggt ACCTATAAAGA 
aaggcatcataaatagatataaagcgagaagaaaagggatctaaagtagacagaqaagatag 

GCTGACTCTCCAGTTGCAGAt 1 1 r CATTATCAGCTCATCACACCACCGAAACTCTCTGGTGAT 

ttg^atccacatccatggcxjtttggtggccctaaagattgtaacggcccccatcct 
ttaaaatggcaggtgtgttgacaagaact gtctt aggtaccccctgoctgctgggcatcaca 

TTCI I CI I GGTATATATTAAAAGAACACAAGTTTGGGCCAGGCACGATGGCTCATGCCTGTAA 

TCCCAGCACTTGGGGAGGCTGAGACAGTGGATCATTGGCGGTCAGGAGTGCAAGAGCAGC 

CTGGCCAACATGGCAAAANNNN 

>1050 

>1051

ACCCATCTCTTCCATTCTGGG-AATCTGGGAAACTAAGCCTGTAACTTGTAGCTTGTA " 
GAATGAATGATGGAGTAGAATAAATAAGAAAG^AATATATCATTAAATGCACAGGTTAAATAA 
AT AAAAATCTATT AATAAAGAG CXTrAAAGAAAGAAAGATGACATTTCAGCACATATTGGGT GA V 
AATAAGTrGTTTAGTCCAGGACTrCTCAATTTTT AGTGGATATGTG AATTGCCT ATTAAAATGC 
AAATTTTAAATTAGTTAATCTGGGTTGGACCTGAGTCTGCGTTTCCAACAAGCTCCCAGGTGA
TGTCAATGCTATTGGTCCAAAGACTATGTTTTGTGTAGCAAGGGTTCTAGATACAATT ACATT 
AGAAAAGATCAGAGAAAAGTGGAGTGATTGT 
>1052

acgcgggtatagctatatactcatatttttatttttatgtaaaatttccaaaatgctta 
atatggcagtataataattataactagatttacttcaaaacatagacataaagaagattacatg 

CCTGTAGAAGTTCATTGAATTAGGAATCACATGCTATTTATTTTAGCAGATATCTTCTrAAT^ 

AATGTTTGACCCATGTGAAGTCATTTAAGAGATCTGTTACGCATrATrCAGATATCCAAAATAA 

TCTATATGATCTGAATACCATTTCCATCTTTAAAATTACATATTCCT 

>1053 

ACAATCAAAAAAAGACAAAAAAGAAAtG GTGTT AAAAGCCACAGT AAACAT AAACCTC 
ATATCAAGTATAAAACCACACACACTTTGCTCTT^ 

GGTATTGGGGTGGGCTGATACCT TCAAACAGGG AGAGAGGGACCATGTTCAGGAGGTGTAT 
TCCTCGATTTAGGTGGTGACTGAA 1 1 I 1 1 1 1 1 1 1 1 AAGACAGGGTCTCACTCTGTCACCCAGG 
CTGGAATGCAGTGACGTGATCTCGGCTCACTGCAGCATCAACCT CCTGGGCTCAAGCGATC 
CTCOCAOCTCANN 
>1054

ACAATGAAAATTACAAAATACTGTTGAGAGAAATTAAAGAAGACAAATAAATGAAAAG 
AGAOGGAACATGTrrrCGCTTGT AAAACTCAGT AGGATT AAGATCT CTTCTCTCCCAOGACTC 
TATAGCTTTAAAGCAATCAAAATCAGACTGGTTTTGTtTrGAACGTTTTT GAATAAGTCAATGG 
CTT ATTTCAAAATTCATATGAAATTTCAAATGCCAAAGAATAGGCAAAATATTTCAGA^ 
GAAAGATTGAGGATTTGCAAT AACTGACTTCAAAACTCACT AGAAN 
>1055

NNNKNCAGGCTAGAGAGATGTTGGAAATAGTTGTTAAATTGGCTT AACTTTCTCAGG 
ACACCTTGT ACOCAOCAGGTTCATGTCTCCTCT AGCCAACTATAAAGTTATTAACACAAGAAC 
CCTGTCTT ATTCATCAGAGTATCACCCACAGGGGCTGAGACAGTGCTT ACACAGAAATGGCC 
CTTGATAAAATATGG GCTGAA TGAATGAACATATGAATTTGACACTTTGAGAACTAAAT^ 
GTTA TTTCT ACT AGCATTTTTAACACAAGAAGTATTGAGATT ACTT ATATATTAGT AGTAAAATG 
TTTGCTTrATTGATTTGATTGGCAAACTTATAATGAACT CAGTGAAACTTGTCCACC' I I II I L I 
ACATGTTGAAATTTTCAAAAATCCATAAGATTACTCCTCACACA 

GAGATGGACCT ACTTCAT ACCATTATATTCAT AATCCAATT ATTTCT AGAAATCCCATTGATTT 

CAGGGAACTGAATTTGAT AGCCAGGAGGCATTCCACTGGCTT CTT AAAGCN 

>1056

AGATTAACTCACTGACTT ACTCTGGGTTGCTATTGT ATT AAAATTCTGTATAGACATTA 
OGT AGCCTC AGAGT TQAATTTGGACTGCCCTTAAAATAAAAAATTCTT AAATCTTTAGTGTGG 
TGTCT ATTAAI 1 1 1 1 ATGATGATTTACAAGTTGGAAATGATTACTTTGCAAGTCATAGTTTACTT 
TGAAGTT AATAAGAGTGATTACAGTAAAGGAAAAATGCCATAT ATGGCATT GT"I CTTAACAGC 
TTATGAAATTTGGAAAACGATATTTTAGAAAGC 1 1 ICI C1 TGI I G GCTGGAATGAAGN 
>1057 

. _ *C*C CT mi i lAGGATATTTCTTCTA 1 1 1 1 ICCI I IGAG1 fC I I G 1 1 CATATTCT AGTT 
AATTTCTAGT AGTTCTT AATGT ATTTT AACCAATAGAC nTTGTCTTOCTTCTGCn A TGTATTC 
CTCGTA AATGC \ 1 1 1 J GTGACTTGTCTAAGTAT AAACAACTTT ACT ATTAGCTGTAAAATTTTC 
AMU lAGTATGTCATCAAT LI 1 1 1 1 1 1 1 AGT ATGATT AAAT G I « \ I > C ACTTGGAAAGA 
TATGAATAGTCTACTTCATTGA 1 1 1 1 1 1 1 1 AAAGTCATTTCATTTTTTATTTTTGTAGCTACAAAA 
TCAN 
>1058
>1059

T^ATTTAATGAAGGGCTGCTTTTAAGAAAACTTTAAATCCT CACGTAAAiCCACCACCACCTG 
CAAAGTATTAATATTCAACTTTTTCAACAAAATGCCTGCT atgtataagct actgaaagaagac 
AAAAATTAATAAAATGTGTCAX FCCTC1 1 AGATATCT ATAATCT AGGAAAATGAACaCATTCTT 

ttcagacactaaactccataagaacaggcatcagatctatcttattta 
aatggagcacagtgcctgacacataatagatgctcataat 

GTTJAATAAGTAAAGAAATGAGTGAGCAAATATCTCT^ . 
AAOCAGJTGATpTGTFATTCA^ST AGCAAATAAGATTGTTTCCT AATGTCAT AATTCAATTWTCCC 
TGCTTCCTaCT ATGACT AGATGTTGGTT GGT 6A T AGTTT AT ATGANW* 
>1060

CCCTTC G^GC^GCCGCCCGGGCAGGTACAGTTACCAAAACCCATCCAACTAAAAAT 
TTAAG CTTTTTG CATTTT AGTG GATG CAAATTG TG TCTT AGT AAGAAGAACATACAAAAACT AA 
GAAAGATAATGTTGAAGAAAAT AACAAAGCTT AAGGACTT AAACT ATT ACCATCAAGACATGT 
ATAAGTACAGT AATTTT AAAAACTG I ITTC1 1 GCAT AAGTATAGAGAAATGTACCTCGGCCGC 

>1061 

ACTTACGCTTT ATGATCTTGAAT ATTTTCAGTGTTT AAGGAATCTCTTCCTT CTTTGAT 
CTCCACTGCATGNAAGAACTCrrGTTGCAGGTGTTAACAAGGAAGTTTGAAATAGAAAGCCAG 
AACCT GCCCCCCAAAG ATCTGACAGTAGT AGAAG GAGATCCATTATT AAGAAGGTAT AATGG 
CAACANAAGAATAATCAGAAA7TATCTGTGTGTGTAATATGTGTTGTGTGGTGTGGGTCAAGG 
AGATGAGGAAAG TGG TT AGGGAAN 
>1062 

ACTTT AACAAATT A AAAAC AAATTTTAATTTAAAATATTTTAGAAATTTTACTT AATACA 

TTTATTTAATGAAGGGCT GCTTTT AAGAAAACTTTAAATCCTCACGTAAACCACCACCACCTG 
CAAAGTATTAATATTCAACTTTTTCAACAAAATGCCTG 

AAAAATT AAT AAAATGTGTCCCTCCTCTT AGATATCT ATAATCT AGGAAAATGAACACATTCTT 
TTCAGACACT AAACT (XJVTAAGAACAGGCATCAGATCTATCTTATTTACCACCACATCCTGAG 
AATGGAGCACAG TGCCT GACACATAAT AGATGCTCATAATAC^TGCTCAGGGTTTAT AGTCA 
GTGAATAAGTAAAGAAATO^TGAGCAAATATCrCTTAAAAAG 

AAGCAGTGATGTGTTATTCAGTAGCAAATAAGATTGTTTCCTAATGTCATAATTCAATTNTC 

TGCTTCCTACT ATGACT AGATGTTGGTTGGTGATAGTTTATATGANNN 

>1063 

>1064 

ACTT ACT ACAAGC^GCAAAAGGAAGCTCTAGAACAAGGAATTAAACACAGTGTTTGT 
TTCCAATCGCAGAAGAGGCCATOAGCACCATATGTGTGTCAGGCT^ 

AAAGGCCAATCCTT CACCI I ICT J ATGACTCTTATAGGCTGCAATATTTCACTTGGCCATAAA 

CAACTT AATATCT CACACCTAGTAGTATTCAGTGACACAGAAAGGGAAAGAGAAAGGATGAA 

GAACAGAGGAAAGAGAAAT AATTTCCCAAGAT ACAAATTT AAT ATTCTTT CCAAAGCAT AAGA 

GCAATTAAAAAATANNNNNNN 

>1065

>1066 

ACCCACATGATCCCAAAGAGGAGGG<^CCTGTATAAACAAGAACCAACCAACATAA 
AGCAGTGACT AGAGGCACCATGACAAGAAAAGGAGTTTTAAAGTGCATCTTCAAATAGCACA 
CAATTTTCCAATTT AAATAGTTT GGAATGAATCAAAGGGAAAAAAGCATT AATTAGAT ACAACT 
GAATTTCTCAAAAGT AT ATT AACACAGCCT ACAAATAAATCCTCAAATGT ACCN 
>1067 

ACCCTCCGTG A CTTTTC AGGGTCT C C I GG 1 1 GAATGAATTTGCAGAAGGATTAAAAT 



AAGCTGAGAACGGTAAT AACAT AAACACATACCGT AGGGAGCOCTGACATTAAACACATHAG 

GTAGGAGCCNNN 

>1068 

act atattagtgt agcaattttcxiaaaagccattc^tcttagagggct aaatgatttt 
acctt atcaattcccctg tgaaaaaat atctct aaagaggttttct gctggaaaat attgttg 
• ctgtcacattgat atgccaacaaaagct aaccagggaagtcaggccaagaaatatcrocctq 
caagagaaggcatcgcacatgtatctctccatgct atttaaaattgcattctgcaacatagaa 

AGGATAG GCCATGCTGCAGAAGCCAGGTOCAGGAAAACTG C 1 1 1 CI 1 1 GGCCTTTACACATrr 

VJ^^^ 7 ^^ 0 ^^^ 0 *^*^^ 707 ^^ ' 'Cioi IGACTTAGTGTCA 
gcaggtggagggaggaaggagggcatcccagac^^ 
cgga t cg a aaggccaagaafittgctgctccatgcoctcag 

CTGAAA.TGLAj^^^S^CTTT'GtAAAGGGCTXCJT GCTCTGGGTGAAACOSGCTCGTGGCCCGGGCCAA 
rTCTBCTOGCl i i^CG I C I G I CAGTGTGTOCTAATCACTGTT ATAAGTGTGGTTTJTGOGGAA . 
CATCTTGT AAAATA 1 1 1 1 WIATTGCTCCACCAACAICICCIGI C I AGACAATCTAATTATGAA - 
CACAGAGCAAATAGCTGAAGTGTATGCCGCCCCCAAG^ . 

ctaggaagacaggggagggaggtgi mm gatgtttattac i rmrsi i g acctgaccaga • 
aaattgagtgctccaaaagaatctggctaacttttaattaagaagaaatgatctgg 
TGTT Gl I I iiCCAAAG I Cag I GGAGGAl T AAAGGTACTGATGTGTTTCCCTCTAA
NNwSnNNNNN^ ' ' GGG ^ GAAAGG ^ GG1 ^ GTGGTGTTT GCGGT AAAATAAGNNNNNNNN 
>1069 

ACCCT GCTTTGATTATTTCCGAATCCAGTGGGTAGAGAAGGTAAAGGCAAGGGCTC 
ACTGGATAI 1 1 1 ! AAATTGTAGGGATGTCCT7TGCTCTGGGTCAATT7TAGGATCAAATATAAA 
AGCACCTATAGCT CAGAGTATC7TCTAACATAAAACTTCTGAGAT ACCAGAAATTTTCCAAAA 
CATG GT ATAAACAGTATGAAAC ACTGGGTAG ATAAAAGCTTTCTCT AAATCTTAAAGTG CTCA 
AATATCATG ACCT G ATTTTTTAGTTTT AGAAATCAGAT ATTTTTCT ATTCCAT ATCTT AAACTTT 
CATGTTAAATTCTAGTTCTGACAATGTAGGGTTCTAI Mill I CAGGTGATTGTTGGGAGCGT 
ATAGAAGCATATATAAATATGGAATATGTG I I I CI 1 1 1 1 1 CCGCTTCTGAAAGAAAGTCAAGGG 
TCTAATC AAATAG ATTGATGCTTCAG AAACTTAAC AG AAT ATTATCTG CAATTTG G CAT AAATG 
CATS i mui iGGGGAAGTTTCCATGGTCAAAATTATTAGTCATTGCAAAACAGAAAAGTTTGA 
CAACTGGAAANN 
>1070 

tagtgaacct agtc ggoctaaaaagaagtattagcaaacccagaagct aatactgaa 
aattragcgagtcagctttacaaagacacgatctt atcacatttct gttaacaacctaaaagg 
octtca attttccaaaatggctogatcttc^ttgtgctatttttaataagtta 
tctttcacaagcaatacaagtggattttaaatcaattgatacttatagaaagagt^ 
aatct acctcctaagttcattat tcaca agtgttgtgtacattatattaatgaaatttatctagt 
ccttgcaaacttgtgcctattgattttcattagtgtaaactaaagagagaaacttcacactga 
cattrataattgtaagaactaagaaccaaccatcagcttttctatgccaatccatgcccttca 
ggaagttcttgaggccttgaggttgct agtttagtaaattgctt actgggacattaaagcag 

CTACATTTTGGAAAGAGGGAGAATTAAGTTTTTTGTTGTTGAATTTA^ 

AAAGCTCi I I I AGATTCCAAAAGG AG GAAAAATTG CAG G TCCATT AATCAAAATGCTG CAAAC 

TAAGACNN 

>1071 

ACCAAAACTGAAAAAAGATTGTGTATCCAAACATTATTTCACATAAAATGTATTTTGAT 
AAAGTAAATTCCCAAACCATGGTGCTCAGAGGTTGTAACAGTCCATGTAAGTTGAAGAAAAA 
GAGTTATCAATCAATACGTGACTATCAATCATTTATTTAATCATTATTTAGTT^ 
>1072 

A^l i ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 N 1 1 IGAGACGGAGTTTCACTCTTGTTGOCCAGGCTGGAGT 
GCAATGGCGCAATGTCAGGTCAGCAGAACXTCTGCCTCXX^GGTTCAAG AGATTCTCGCGC 
CTCAGCCTCTTGAGTAGCTGGGATTACAGGCATGTGCGACCATG<Xn^GGTTAA7TTTGTAT7 
TTTAGTAGAGAGAGGGTTTCTCCATGTTGGTCCGGCTGGTCT CGAACTCCCGACTTCAGGTG 
ATCCTCCTGCCTTGGCCTCCAAAAGTGTCAGGATTACAGGCGTGAGCCACCACGCCCTGCT 
TAAGTTTT AATAAGATCTCTTGGCAACTTTTTACGACTGGCAACTT AGGTCTCACAAACAC AG 
AAAAGCTT GTCTT T AAGT ATATTGTCTTTGAAAAGTTAATACACTCTCTAAATGCTrCGATTTAA 
AATGATTTACTTT ATAAATGGATGCACT GAGAGAAAAG ATATTTGAATGATATACANGCACAT 
G7TAAATTAACTGTGATTGTTTCTAAGTATTGGCACT ATGGTCAANNNNNNNNNNN 
>1073 
>1074 

AGTGGGTCACTCTGCCCCAGCTCTGCAAAGGCATCAAGATCCGACTGCT AGGAGCC 

TAACTGGTGTTCCACATTCCTCTAACGTGCACAACACAGTCCTGCCCCCGTGCTTT^ 
C^roTCCATTCCTCTTATAACGCTCTTCCCCAAATCGCTTTGCCCATGG 

>1075 

NNA CAGCCAT GTGT ACT TTATA GAACTTAGAGGAAAAATCTAQTATTTTGTGTTTTCC 
TATATATTTA iiu mCT AT TGCTC MICH CATTCCTGAAGATGCGGCATTTCCTTTCAGCCT 
AATGTTGACCTT AGCTT TTCTAGCAGATCAGTCTGCTGGG GATAGATTCTCTTTGT7TTTCTTT 
GAGAATGTATTT ATT7TACTTTCATTCCTAAAGGAT ATTTTAGGTGG AT ATATAAT7CC TA 
GATGCTTGCTTCCTTGAG<>CCTCAATGATGCCATTTAGCTGTCTTTTCA 
TG GTGAA AAAT CTTTGTAATGT AAACCTT ATTCCCCTCTGTGTGGTGTGTAATTTTTTTCT AGO 
T^?T rCAAA ^ A 'J ntn TTTTGGTTTTCAGC^GCTTAATTTGAYGTGTATCTAGTCAT 
TTTCTTT AAGTTTATCCTCTTTGAAGAGTACTGAGCTTCTAAATCGGTAAATTTTTG GCNN 
ACTTCACTGATTTATGGCAAGTCAGCCAATCCATCAGTGCTCAAAGCTCCTTGTATT
G I CAGG ^ G JCTAACATTATTTGra^ 

TTTTGTGTCTGATAGATTCTTCATGCAGAAAGAATAAGTAAAATGAGATGGGACACAAATCTG 
AGTATAGCATTGTGATTACTTTTTGCTGCACAGATTAGTTGCAAGAAATATTCTAGTCTGGGG 
^J^AGA^C^CAAATTC^ 

>1077 

. ACAG ^^CCATGACTTACTAGGTGTTATGATGAAGGTGTATGTGTGTGTATATGT 
GTGX^TGCATGTT^TAAGTGTCTGCATTTGCACACATAAGAGTTTTAAGCT 
TATTGATGGT CAAAGG 1 1 ICI f! I GGCTATTGCTGGACTCTTAAGATTGTCTTGT AATTGTCTT 
Zr^l^^^^LT^^^"^*^*^* 1 ^^ - ^ AAGG G TC5T ATAT^ A_AAG GT AGTTXTT^ ACOCAGATCT~r AXATGTGT 
^T^^^^^C^TCAGAAACCTACTGTTT^ 

^l^JJ^ GA ^^<>^^P^X^C^AAAGrCTGCTGCACl M CI 1 1 1 GTAGGCC 

TAACATTCATTTCATATGCATTGAATATTATTGGTG AACTT GCATT AATT AN 

>1078 

m "^^"wn^nnnnn^ttgatatg^ 

?7^I AAA 5 ATTCTT AAAAC ' ' I O HnATGGGATTATC^GAGTAACAAAATAATGTAGTCOCTT 

T A Z 6GA i^™^ TAAC ^ AA T G i^^ 

ATTTAAJTCACTT ACTT CATTATTTTGTTTTGCATTG TATTTT ACACCTACATTT ACACTATT AAC 
TT A ZII G J ACA X AC ^ AAATGG 

AGGCGGTGGCGTTCTCCTGCTGAA(>TTTAACAATTCTCAAATCTCTAACATCAGATGAGGT 

C ^ C I G T AA T GCGGATAAAATGA ^ 

cttY£2£c^^ 

>1079 
>1080 
>1081 

GA CGCAG A CCTG TAAC ACGTG TC^ 

TTGH \ I I CCTTTCTGGATTTTATATGGCACGCTGATCTT GCCTATGTATCACCTCGAGCCTTT 
CTTTTCATACATCTTCCTCAACCT ACAGCTCATGATCTTGCAGGTCCTT CACCTGTACTGGGG 

TGATGACGAGGATTATGAAGAGGAAG AGGAAGAGGAAGAAG AAGAGGCT ACCAAAGGGAAA 
GA^TG^TTGTTTAAAGAACGGCCTaSGGGCT^ 

CATGGCCATTAGCTGGAAGCCTACAGG^CTCCCATGGGACAGCATGCTGCAAGTACTGTTG 
GCAGCCTGGCTTCCAGGCCCCACACt^ACCtX^ 

AT AAGCATGCCCATGGATTTTT ACT GCAGTT AGGACT CAGACTGGTCAAAGATTTCAAAGAN 



^^ rt 9 GAGGCG I CCG ^ 

GC^CCGAGACCCTC*GGGGCGTGCGGGCCTTTGGTC^ 

H^™59^9 G ^?^ CCCAGCCAGCGGACGGGCCGGG 

pCCGGGGGGCGTCCACGGGGGTGTCXXra*GGGGTCTCCGGAAGGCG 
CCCGCGCTGCGCTTGAAAATCGCGCGTCSGCCCraC^ 

GOAGCXlAATGGGAAGGGTCt^GGCATGGCACAG^AATGGGAAGGGa^GGGCACCAA 
AGCGAATGGGAAGGGCCGGGAGCGCGCGGCGCGGGAGATTTAAAGCCTGCTGGAGTGAG 
GGGTCGCCCGTGCACCCTGTCCCAGCCGTCXjrGTCCTGGCTGCTCGCT 
GCCTC<^CTATGCTCTCCCTCCGTGTCCa^C<3CGCCCATC^ 

C^AAAACTAAAGCAG<JTGCCCCCGGCGTGGAGGATGAG 

" ' ~ ™ ^^ATCTTCCCCATCGAGTACCATGATATCTGGCAGATGTATAAGAAGGCAGA 
GGCTTCCTTTTGGACCGCCGAGGAGGTGGACCTCTCCAAGGACATTCAGCACTGGGAATCC

CTGAAACCCGAGGAGAGATATTTTATATCCCATGTTCTGG C I I I CI 1 1 GCAGCAAGCGATGG 

CATAGTAAATGAAAACTTGGTGGAGCX5ATTTAGCCAAGAAGTTCAGATTACAGAAGCCCGCT 

GTTTCTATGGCTTCCAAATTGCCATGGAAAACATACATTCTGAAATGTATAGTCTTC^ 

CACTT AC^TAAAAGATCCCAAAGAAAGGGAATTTCTCTTCAATGCCATTGAAACG ATGCCTTG 

TGTCAAGAAGAAGGCAGACTG G G CCTTGCGCTGG ATTG GG GACAAAG AGGCTACCTATGGT 

GAACGTGTTGTAGCCTTTGCTGCAGTGGAAGGCA I I I I C I I I 1 CCGGTTCTTTTGCGTCGATA 

TTCTGGCTCAAGAAACG.AGG^CTGATGCCTGGCCTCACATTTTCTAATG^CTTATTAGCAG 

AGATGAGGGTTTACACTGTGATTTTGCTTGCCTGATGTTCAAACACCTGGTACACAAACCATC 

GGAGGAGAGAGTAAGAGAAATAATTATCAATGCTGTTCGGATAGAACAGGAGTTCCTCACTG 

AGGCCTTGCCTGTGAAGCTCATTGGGATGAATTGCACTCTAATGAAGCAATACATTGAGTTT 

gt ggcag acagacttatgctggaactgggttttagcaaggttttcagagtagagaacccatt 
tgactttatggagaatatttcactggaaggaaagactaacttctttgagaagagagtaggcg 
agtatcagaggatgggagtgatgtcaagtccaacag agaattctttt acctt ggatgctgac 
ttcraaatgaactgaagatgtgcccttacttggctga i n 1 1 i i 1 1 \ ccatctcataagaaaaa 

TCAGCTGAAGTGTTACCAACTAGCCACAC^TGAATTGTCCGTAATGTTCATTAA^^ 
TTAAAACTGTGTAGCTACXTCACAACCAGTCCTGTCTGTTTATAGTGCTGGTAGTATCACCTT 



AAGTGAGGGGTGACCCTTTAGTGAGCTTAGCACAGCGGGATTAAACAGTCCTTTAACCAGCA 
CAGCCAGTTAAAAGATGCAGGCTC^CTGCTTCAACXjCAGATTTTAATGTTTACTT AAATATAA 
ACCTGGCACTTTACAAACAAATAAACATTGTTTGTAC^ 

TTTGGTTTCT ACACCAAATACATTCTCCTGACCACTAATGGGAGCCAATTCACAATTCACTAA 
GTGACTAAAGTAAGTTAAACTTGTGTAGACT AAGCATGTAATTTTTAAGTTTT ATTTTAATGAA 
TTAAAATATTTGTTAACX^AACTTTAAAGTCAGTCXrrGTGTAT ACCT AGAT ATT AGTCAGTTGGT 
GCCAGATAGAA6ACAGGTTGTGTTTTTATCCTGTGGCTTGTGTAGTGTCCTGGGATTCTCTG 
CCCCCTCTGAGTAGAGTGTTGTGGGATAAAGGAATCn"CTCAGGGCAAGGAGCTTCTTAAGTT 
AAATCACTAGAAATTTAGGGGTGATCTGGGCCTTCATATGTGTGAGAAGCCGTTTC^TTTTAT 
TTCTCACT GTATTTTCtrrCAACGTCTGGTTGATGAGAAAAAATTCTrGAAGAGTTTTCATATGT 
GGGAGCTAAGGTAGTATTGTAAAATTTCAAGTCATCCTTAAACAAAATGATCCACCrrAA 
TTGCX iCtn'GTTAAGTGGTGAAATCAACTAGAGGTGGTTCCTACAAGTTGTT CATTCT AGTTTT 
GTTTGG TGT AAGTAGGTTGTGTGAGTTAATTCATTTATATTTACTATGTCTGTTAAATCaGAAA 
1 1 1 1 1 1 ATT AT CT ATGTTCTTCT AGATTTTACCTGTAGTTCAT ACTT CAGTCACCCAGTGTCTTA 
TTCTGGCATTGT CTAAATCTGAGCATTGTCTAGGGGGATCTT AAACTTT AGTAGGAAACCATG 
AGCTGTT AATACAGTTTCCATTCAAATATTAATTTCAGAATGAAACATAA I I I M I I I H I I I H I 
GAGATGGAGTCTCGCTCTGTTGCCXIAGGCTGGAGTGCAGTGGCGCGATTT^ 
AACCTCCATCTCXrTGGGTTCAAGCAATTCTCCT GTCTCAG CCTCCCT AGT AGCTG GGACTGC 
AGGTATGTGCTACCACACCTGGCTAATTTTTGTATTTTTAGTAG^ 



CTGGGATTGCCGGGTGTT AACAATTTTCTTATAGGGGACCTTGAATTAACTGC G I 111! I GGG 

CGAGAAGCTCGGACTTGCN 

>1083 

>1084 

NCCGAGGAAI I 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 GCATGACTAACAATTCATTCAATGTGTGTGTA 

CTGCT ATGTGT CCTCATGCATGAGCT ATGTCAACAAGGACAATCTACTGATAAGAGAAAATG 

AAAATGTACAAGTTGT AGGAGACTAAACATGGTTTT AACTT AGTACACATTTTTCTGAAATGTC 

CXXXGTGATTAAGTTGTGAACAAATGAACATGCCACATGTCAACAAACTGAACA^ 

TTGTTAGTOACTTAGAGGTGGAGGGAGGGCTAGAGAGAGGCTAGCTGTGT^ 

TCTCCTGTGTCCCACACTGGCTACAAAAATACAACCACTGGGTAGGTTAGGGCTCATCT 

CCAGAATTAGGAATAAGGATTGAGAAGAAAACTCAG^>AGGGTGATGAATGAGTTTCAGC^ 

ATTGCTGGAGTT AGCT GAAGAATGAAT AGGACACAGTGGAT GAAGGAACAAGCT ATTCCGG 

GGACCrTTTGAAGCCCTpGGACCTCA^ AAAGAAGGATAAAATGGATGTA 

AGCCAGTGCTAATGGTTOCCGCGN 

>1085 

TTCGACGTGCCTCCOGCT GTGGGCTGC^GTGTCATATGAATCTCGGAAGCACATTTCAGAA 
?ZSI^? A I2!^ G ^ CATAATC ^ GAGTAGTGTG ^

AGGCAGCTCTTCACACAGCTCCCTCAG gacttccatgtg GTGATTAAGCTGGTACAGGCAG 

CCTTC CTCAGAAAAG CAGGCCTCATGTTCITTGATGAGACCCTTGGTCCTTC^^ 

AAGTTTC TTTTCT TT ATTrATTTGCTTTTCAAGTTTTAAAATATTGTCACT AAGCTTTCCTTTTTC 

AATATCTAi 1 1 1 1 A GTTGTT GAACATATAAAGACAGTTCATGATGAAGAGACTCCCATTTGGCA 

CA6ACATCTCTACTC I 1 1 ICI I CTAAAG ACATCTTT ATGTCCT CAGTT ACATTATTTTTTAGTTC 

TTCAACAGCTTTCAAATACTTGGCCAGACTTTTTGTATTAGAAAT^^ 

TTTGAAATGAGTTC^CCGGGGCTCTCTCTTTGC(^AGTAACrrGCrrGAGCC<rr 

TGATTCAAGTTCXrrTCTGGGATTCTTCXIATCTTCAGCCTGATGTC^AAATC 

AGTTAAGTTCTTC>AC^CTGCAATGAGTTTTACCACATTGATCTGGACTTT 

TGAATCTTTGCXrrGAAGCCCAGTTGTGAGtAACATGTTGAAAGGACTCCTGGCTTGAAATATTT 

TGTATATC^TTTGTAACATATCTTCACTTCTTGCCATAAGCTTTOVTCAAACATAG^ 

TGGCAATAAGATGCTTCAAAGATTCTTCU^TTGAGGTGTCCACATCATCTTGATCCAAAACAG 

ATTTGGui i i> I GCCTCXX^GA TTTCCACTTG TCCAATGAGTTTTTCAACATCTTT AGCCACTTT 

T AG II GCCCTGTGAATTCTTCAt 1 I ICI II 1 1 CATGTTTCTCTCCAGCCTTTATATTTTGATTAT 

AATTTTCAGGAAGTTCTACTGACATATCTGTTGAAAACTC^ 

TAGAATATTTCCAG AATTG TCAAAAGTGGGCTGATCCTG I Mill I ATCATCAGTGGCAGGTT 
CATTTCAAGTTGAG Mill GAAACCAACTTTCTCCATCTTTTATTCAGCCTTCTCAGTTCTTTA 
GAAATAGATGATCCAACCACATCATTGCTGACTTCGACT AAGAAATTTCCTGCTTCATTTAAA 
GTAGCGTGTTCT AGATTCCACTGGGCT AGTGTCTCAAAGGGTACCTCTTTAATTTCTTCCTTC 

^ GT CTCCTC AAAGCAAGCC ^ 

CAGCACTTTTT GT AGAGTGG ACTTCACATTATATATATTTTTTCT ATACATACAAACATCAGAT 
TTC^CCAJCATATACTGTTTATTAATATTCTGACAT^ 

ACATTTTTGAAAAGAAGTATCAAGTCGAGCTAGGAATT C I 1 1 1 I C I I CAATAAATTTATGCCAG 
TCTTCCAGCAATAATTC^ACAGATTCTtrrGCTCCCATATTTAATGTTCCAAATA 
ATTT CACTTC ATCT ACCAAACCAAG AACTAAGCACTTGTAGTAATG AAATTCT AG AAGTAGAA 
TAAAi 1 1 1 1 1 CTGCAAAATGTTGTTGATTCGTCTTTTCATTTCCrrCCAATTTGTTAGGTGGTAC 
CAATGGCAAGTGATTTTCATCCTTATTTTCAAAGGTAAGGAGAATGTTCGAATGATGCTCAAA 
TCTATCCATCAGGCTCTTGAATAAAGTCATTTTCTCTTGTATCAGAGTCACGGCTTGAGAGTG 
ATCCT GGGAGGCTGACAAATCTTCATCCATAAG CTCTTCTACCTCCTGGAGCCAAG CTTCAG 
TTTGATGGAGGGGTGGGGGCAAGGCATAATTTAGCTTTATTTTCCATGCATTAATCTGGTGA 
JCGAGGCC*TCCCAGGCTTCTCTCAACTGTAAATGATC^ 

TTTTATTGACAG GACATC CAAAAAGGA C 1 1 1 1 1 1 1 C M GATTGAATGACT CCAT AAAGGACAG 
CAGGCTATTATAuu i 1 1 AAAGTAGGTATCATTCTCTG AATCCTTTAGCAACTTCTGTAGTTTT 
TCCTTTTGCAGAGTT AACCAGCCCATAGCATCTTTCACCTTT CCCT GAGCCTCCTCTCCAGTC 
CCAGGG GCATC7TTGGAATACTGCAGAAACTGTGCCACATAGGT CATGATGGACTTTTCATC 
AGGATCAACAACATCCACATCTTCTGGTTCCAG<^TCT 

^^CTGAAGGCCTCTCTCAG^TTGTGTTTGTTGGATCTATGCTT CACACTCTTCATGTCAATT 
AGGTCTGGTWiCAAGGCATGAATGATGGCCAAAAAAGCCATCCCATTT^ 
AAAATCGGTCACATTG ACAG ACTCATAGGTGGCGCATTGTT CCT GAGCCCACAAAAGAAGG 
GCvi hum GCAGACA7TTGCCATCTTGCTTGCACTTTAGAGCATTTCTT 
GAGGCAG^GATGAGTCAACCACACTCACATCATCCAGGGAAGGCTGATT 
AAAGAGTCTGGGCAAGCTTCTCAATATGAAAGTGCAGGATAATTGT CCAAATTAGGCCAAGG 
ATAATGGATGGGTTTC^TCAATGATATCAGTAACATGAATATTTATTAGCT^ 
TTCTT AGGAATGTCAAGGCATGTTCTATATrGATTCTACACTGGAAGGT ATTAGATCCTTTATC 
CCGAGGCAACT GTTGCCCAGAAAGTACTTCTAGCAGATGCAGGAGGACATGCCXXrrTTTTAA 
TGTCTGTGAATAGGTCGGATATAACTGAGGGAGAAGT GT GCCTGGCCAACTGTGAGTTTATC 
CAGCACGTGAAGGCTTT CTTCTGGGTGTCTTCCTGTTCAGCTTGCAATGAAATATGGAGATC 
GTCGATGCCCCAGGAACCCTGTTCATCTTCGGT GGGAAGCTCAGGACT AG^TGCCATTCTTT 
GACrCAAT«3AGATrATATC^ 

" " ^ " " " ~ *" " "* ' " " ~ ~ ' " 'GCCTGGCTCTCTGG 



ATCGCGCXi 1 1 1 GCTCGCTCTCCCT CTCGCTTTTTCAGGTCCGGCCGAAGGCTCTGGGCGC 
AGCTGAGCTCGTGCC 
>1086

ACTGGAGTGCCTGGTGCAATCTCGGCTCACTGCAACCTTCACACCCCA^ 
atc ctocc gcctgagtagctggaaccacacgtgcgcaccactaaacccagctgtttaataca
ccatttttaaoccaaaacattaagaaaaatataggaacagtaagtagattacattttgtaaaca 
gacaagctracaagttttctcaaatatgaaagtcatactaaactgggagactgttaacttctt 
gatggggttaatctctaatatgaagccacagtcatagctaactacaaattacatatacaatgc 
caaaaat attcaaaaat aacatttttt g cacctt aatgattacaaatgct aaccagcat aaaga 
cactggaaagtttcagaatct cctcatcacatactttcaaatatcttccctttacn 

>1087 
>1088 

nncacgaggattccttccttci 1 1 1 ggtcggttctgagtgtggggtgtctactggg 
gatctgctaag gctaagaggcaaagataggcaagtcactcccctg acctcaagaaactccc 
agtctacaggcgaagatacaccacccaccggtagagtcgctggaccagaatattaggtgtt 
ccagtcaaagtcacccagatttgccaaaagacctggcacaaatgtcacttccact atgaagt 
ccca ctgac ttccatatacaagacaatctgctgggaatttcttgggttgacagctctt ggat 
cc^attttgaacagtggtagtgtcctggattacttttcagaaagaagtaatcctttttatga 
cagaacatgtaataatgaagtggtcaaaatgcagaggctaac^ttagaacacttgaatcagat 
ggttggaatcgagtacatccttttgcatgctcaagagcccattcttttcatcattcggaagca 
acagcggcagtcccctgcccaagttatcccact agctgattgctatatcattgctggagtga 
tctatcaggcaccagacttgggatcagttataaactcrragagtggtaagtgtcttcacattct 
TTAAGCACTAAAGAAAACTTTT AATTAG^T accttgcttcgagtaatcaaactagagct cctct 
GCCTTGTGTAAGTTGCTATAAAGTATTGACTATTAGAATGTCTTGAACTTTGGTTACTGTGAG 
GCAAGTCGG T G CTCAAAG TAT ATTTCAT A GTCTCAATT AT AT AG T AATTT AG GTTCT GAAAAAT 
AGGTTCTGT CTTTG CATATGT AAT ATTTTG TGAG T ATTT ACTTTGGAAAGTTTGGTCGACCT AA 
TGATAAATTTAGAGTTTATTTTCCTTTT ACAAGCTTACTGCATTGCATGGTATTCAGTCAGCTT 
TTGATGAAGCTATGTCATACTGGTCGATATCAT 
>1089 

NNGAGTCGACCCACGCGTCCGCTTGTCT ATTTCAAGTTTGAGTAGGCTGAGGAAGA 
GGCGGAAGGAGGGTTGGTCTTGCTGTCTTACAGGTGGCAGAGGTGAAAGAGGTGAAAGTC 
CACAGGTACACGGACTCATGT AGTTCAAATCTGTGTTGTTCAAGGG CCAACTGTATTTCTCTT 
GTAGTTGGCTTGCCTTTTCATGTTmAGTTTTGATTAATGCATGGATTTTACX^ 
CTAACAA GAAAGG AATGTAAGTTTACTCTAGCATATGATAAACAGGCAGTCTGAGATTTTACA 
GAGCTT ATT 1 1 CI GAG GAGTTCATTGTATTACATCATTTCATTTGCC 1 1 1 1 I GTCTTT ACAT AG 
TAGATAGGGATATGTACCTCXXnTOGCC^TCATGTAACTGLAAATAACTGAGGAATTGTTATTG 
TG CTACAAAACT G AGAACAG ATGAAGATTCT GT AATG AAT ACTT AGATCATCT ATTTTTTGTTG 
ACATTTTGCCTAGATGATGTGAAATTATAACTATTGATTCTGTTGAAAG^ 



>1090 
>1091 
>1092 
>1093 



GCXSTGTTCAAGAACGGCCGCGTGGAG^ 
GCTCAOCTOCAAOCCCtMGAACACGGTCTTTGACGGCAAGO 

GAATGACCCGTCTGTGCAGCAGGACATCAAGTTCrTGCCGTTC^AGGTGGTTGAAAAG 
CT AAACCAT ACATTCAAGTTGATATTGGAGGTG GGCAAACAAAGACATTTGCT OCT GAAGAA 
ATTTCTGCCATGGTTCTCACTAAAATGAAAGAAACCortGAGGCTT ATTTGGGAAAGAAGGf^ac 
ccatocagttgnacctGTACCAGCCTATTrTA^ 

ACT ATTGCTGGCCTAAATGTTATGAGGATCATCAACGAGCCT ACGGCA6CTGCTATT6CTTA
TGGCCTGGAT>C\GAGGGAGGGGGAGAAGAACATCOTGGTGT^ 

CTTOQATGTG ICfCTTCI CACCATTGACAATGGTG 1 CT 1 CGAAGTTGTGGCCACTAATGGAG ' 
ATACTCATCTGGGTGGAGAAGACTTTGACCAGCGTGTCATGGAACACTTCATC^ 
TCTCAG CAT WAGCAAGAATTGAAATTG AGTCCTTCT ATGAAGGAGAAG ACTTTT CTGAGACC
CTGACTCGGGCCAAATTTGAAGAGCTCAACATGG ATCTGTTCCGGTCT ACT ATGAAGCOCGT 
CCAGAAAGTGTTGGAAGATTCTGATTTGAAGAAGTCTGATATT^ 
TGGCTCGACTCGAATTCCAAAGATTCAGCccacactagcBttra^ 




itagacadaQcacaccca c a c 



>1094 

KttWNNNGTAGAGAGAGGGTGTCACTATATTGCTCATGCTGGTCAAATATTATTTGTA 
ATCATCTTT CATCT AAAGAATGAATTAAATTT ACT GAAAGCAAATAATGGAAAACT ATATGGCT 
TACAAAAAGAGTTGTGATTCACATCATAAAAGTCACT C 1 1 1 I I C I CAATTTCTTGGGTACATGC 
CCCAGACTTCGCCATAACTTTTCAAGTTAATTACACCTGCTACT GTTTCACTTAGGGGCACTT 
TGCTT AACCTGT AAT ACACAGAAGGGGTTGAGAAGACAAAACACT GTT AACTT CATTAT ACCT 
TTGACAAAGTAATATTATGTGACATGATGTGTTTTCXrTX^AAATATTAGAGCT GCAGATTTAGC 
T^ATTCAATTTATGGGACAATTTGTTATGTGATCTAACAATTTG 

CTTTTATGATCAAAAATTGATTTTATATATATACATATAAATCTATATGGCTGGGAGCTATGGCT 
CACGC^^ AT AAT GCCAG CACTTTGGGAGGCT GAGGCACAAGGATC^CTTGAGCTCAGGAGT

ttgagaccagcgtggacaacatgqcaaaacctcctctctaca 
nnnnnnnnnnnnnnnnnnnnn 

>1095 

aegCGtCX^tCttgaatBCA AAAA ATalatatagATACAAGTCTATCATT^ 

cacaaatcaattaaaagttaaattttatgaaaacatacagactgtacgtgtaagc^ 

atgcagtattaaatcataactctatagccgccactgtccggccacagccta^ 

tgtcgtttgcx3gtctcgcg<>gggcggcccxtc 

aocaccatgt cgagcaaaaagggaaagaccaagaccacgaagaaatgccct cagcgtgcaa 
catccaatgtgtttgccatgtttgaccagtcacagattcaggagttcaaagaggc 

ATGATTGATCAGAACAGAGATGGCTTt^TCGACAAGGAAGATTTGC^TGATATGC » rG CTT CT 
CTAGGGAAGAAT cccactgatg CATACCTTG ATG CCATGATGAAT GAGGCCCCAGGGCCCA 
TCAATTTCACCATGTTCCTGACCATGTTTGGTGAGAAGTT 

T CATCAGAAACGCCTTT GCTT GCTTTGATGAAGAAGCAACAjGGCACCATTCAGGAAGATTAC 
CTAAGAGAGCTGCTGACAACCATGGGGGATCXSGTTTACAGATG^ 

ACAGAGAAGCACCT ATTGACAAAAAG^SGGAATTTCAATT AGATTJGAGTTCACACGCATCCTG 
AAACATGGAGCCAAAGACAAAGATGACTGAAAGAACTTT AOCT AAAATCTTCCAGTTACATTG 
TCTTACT CT CrTTTTACTTCTCAGACACTTTJCC^ 

AGTTTCACAGCTTTGCC I C I TCI 1 1 1 1 UATGT ATTrATTCCAGACCTTTCTQOCACTTAGCACT 
TGTATAATCAGACT GGAAATGGGGATGAGGGTGGGGGAAGTGTCTGAGAAGTAAAAGAGAG 
TAAGacaatGTAACTGGAAoairrcaogaTGCGTGTGAAclcGaTGtAATTGAAAT^ 

aatagg tgctt ctctgtacagctcatccacttcctca 

cagcag ctctcttaggtaatctt cctgaatggtgcctgttg cttcttcatcaaagcaagcaaa 
ggcg rttcl uatgacatcttcaggattjtgtgccatttaacttct caccaaacatggtcagga 
acatggtgaaattgatgggccctgggggctcattca^ 

ggajtctt ccctagagaagcaagcatatcatgcaaatcttcctt gtggatgaagccatctct 
ctcgacatggtggttgttt aatttctggccaagcactgttatgaaggcatagggcttcctca 
AAG AGAAAGTCTCCAAAGTTGATTTCTTACTCTGACAAAGACTGTCCCCATTTAATCCT CACA
ACAGTTGCCTATGGTAGGTACTCTTATTOCCATTTrACTACCAGTATAACATAACCTCTCTGA 
GCCTCAATCCTCTTATTCCCATTTTACTACCAGTATAACATAACC 

CATCAGTAAAACATGATATCTACTTCAATGCAAAGAACTAATTGAGATAACATATGTAAAAAGA 

CTAGtcasttcAAAAAAGCQS 

>1096 

>1097 

caTBa^TTmattaCTTTTTAAAAATCCAAATTCagaTAAATTaacacACT AGGTT AGAACCT 
CAATTGTATTtoa taca oaAGGTATTGATGA CTTTA TTATTCTGCAGGTACATGTCCAGGGGCCC 
AGCCTCTGGGCCCAGTAACTCAGCT ACTCTTTGTGGC 1 1 1 CI I CATGG C III 1 1 T TGTGGGCT 
G CCACG CCCATCTTT ATCACCAGAAT GAG G AACTCCT GGAA GTTAACT GCACCATCAGTGTT 
GATATCCAACTCrrTTGAACCAGACGTCTGCACCCTTTTTCCTGAT AT ACT GAGGACACTCGGT 
CTCTAGCAATTTCTTC^GGTCATCCCTGTAGACGGC^^ 

ACTTGTGGTAGACGTCGATGATAGAGTTCAAGGCTTTCTCCAG CT CGGTCAACATGATGCCC 
ACG GACTT GCCCCACCATGTCT CTT GTCAGCT GTCTTTCAGAAGACCTGGTGGGGCAAG TC . 
CGTGGGCATCATGTTGACOIAGCTGGAGAAAGCCTTGAACTCTATCATCGACGTCT ACCACA 

agtactccctgataaaggggaatttccatgocgtctacagggatgacctgaagaaattgcta 
gagaccgagtgtcctcagtatatcaggaaaaagggtgcagacgtctggttcaaagagttgg 
atatcaacactgatggtggagttaacttccaggagttcctcattct ggtgataaagatgggc 
gtggcagcccacaaaaaaagccatgaagaaagccacaaagagtagctgagttactgggccc 
agaggctgggcccctggacatgtacctgcagaataataaagt catcaatacctcaaaaaaaa 




>1098 
>1099 

acg<x3ggggaggtctccattc>gtaggtgg<xx^ggatgaaggcx«tgttggggc 
taaaccacactctggaattctgtcagcaaattcctcgctgtgtgaa 

ACci I ICI IAAGCCAVI 1 1 CI J GATATTTCACAGAGCCTCACCAAGTATTCAACGAGAACATG 

taagtgaaatgcttcacaaaatgcctggtaaataatagatgctitagaaaatggtagagagaga 
aaagagcagtctctgcgctttaatgtacctcggccgcgaccacgctaagg 

>1100
>1101

atagggagtcgaccacgcgtccgcgctctacatt^ 
tgaactgcgcagcagaggactttt attctcgtctccttcagaaatttaatgaagaaaagaaag 
gaatccgtaaagacccatttctctatgaggctgatgtcxlaacrrgca 
caacc aaagcci t i gaaaaatattct aaat6aaaatqacatagtattcatagtggaaaaagtg 
CCTTT agaaaaggaagaaacaagtcatattgaagaacttcaatctgaagaaactgccatatct 

GATTTCTCT ACTGGCGAAAATGTTGGACXSACTTGCTTT ACCAGTTGGGAAGGCAAGGCAGTT 
AATTGGACTTTACACCATGGCTCACAATCCTAATATGACCC^ 

TACTGCCC 1 1 LCI CCCC I rTGGGTAAGATGTGACAGTTCAGATCCTGAAGGT ACTTGTTGGC 
TAGGAGCTGAGCTT ATCACAACAAACAACAGCATT ACAGGLAATTGT CTT AT ATGTGGTCAGTT 



TCACTTGTCrACTGTAACATCCAAAGGCTTTGCCCAGTATGAGCTC^

GGATGATACAATCACAGCATCACAAACTGCGA t OGC 1 1 1 G GAT A 1 1 1 C O G GAGTCCTGTGG 
ATGAGXTTCrrCAAATCCCTCCACTCTCTTCAACTGCAACT 

GAGAGCCCAGAGGTCCTrrGAATCATCTCT ACAGAGAACTGAAA 1 1 fCI ICHGI 1 1 I U GCTG 
ATGGTTTGAGGACTGGTGTC^CTGAATGGGTraiAXXXXCT GGAAGCAAAATCTGCTGTTGA 
Ac 1 1 to 1 1 CAGGAATTTCTGAAT1GACTT AAAT AAGCTGGATGGATTTGGTGATTCT ACAAAAAA 
AGACACT GAGGTTGAGACCTTGAAGGAT GACACTGCTGCAGTCGATCG I I C XAal C AAGCGT 
Ci Hi CAAAQ n i>oGAGTGATCTTQATTTTGCTGAGCAACT GTGGTGCAAAATGAGCAGT AG - 
TCTGATTTCATAOCAAGACITGGTGAAGTGTTTCA 

TGATATACAGCCATGGCTCCATAGTGGAAGTAACAGTTTACT AAGTAAGCTCATTCATCAGTC 
TTATC ATGG AACCATGGACACAG 1 1 ICICT CAGTGGGACTATTCCAGTTCAAATGCTTTTGGA 
AATTGGTTTGGACAAACT AAAGAAAGATTATATCAGTTTTTTCATAGGTCAGGAACTTGCATC 
TTTGAATCATTTGGAATACrrrCATTGCT(XATCAGTAGATATACAAGAACAGGTTTATCGTGT

CCAAAAA CTCCAC CATATTCTAGAAATATTAGTCAGTTGCATGGCTTT^ 

GAACT CCTCl M ICH I AAGACAGATCTGCATAAAGTATTACAAACAAAATCCTCTTGATGAGC 

AACACAf I i 1 I CAGCTGCCAGTCAGACCAACTGCTGTAAAGAACTTATATCAAAGTGAGAAGC 

CACAGAAATGGAGAGTGGAAATATATAGTGGTCAAAAGAAGATTAAGACAGTTTG gcaactg 

AGTOACAGCrcACCCATAGACCATCTGAATTTTCACAAACCTGATTT^ 

AACGGT AGCCT GGAAGAAAGGATATTCTTTACT AACATGGTTACCTGCAGCCAGGTGCATTT 

caagtgaagtgtgctgatgaagtcctctataagcacaatgggaggcgggaggcggccagc 

CTG<3TTCATCAGCATCCAGTCXn"CTAGTGCTGCCACCT AATGGACGCTTAATAATCTCTGGA . 
TACTTGGAAATTCTGAGATGCATTTGTGGO^GTCTCCA^ 

TGCATTTGTGTGTGACGTCTTACTCTCCCTCCCAGCTGCTCC^ AAG AC 

CTGTCTGAGGTATGCATACACTTCTGCAAGCTGGNNNNNNN 

>1102 

ACGCGGGA TTCC CC^TG TTTTC TTCTAGAAGTTTTACAg 
TGGT CT ATGACCCATTTTG<»TTAAATTTTGTGTAAGGGATGTTATACATGTGGAGGTTCATTT 
■""^GCATGTAAATATCCAATTGTTTOAACA^ G 
AATG CTTCrroCG CCTCAATrAAAATCAGTTT ACTCTATCTGCATAAGTCT CT GGGCTGT 
CTACT CTC1 1 ■ i^TTGATCTGTATGTCTGTCCATTTTCCAATACCACTGTCTTT ATTACTGTAG 
TTTCATAGTAAACCTTGAAATCATAATTCTATAGTAAGTCTAAAAAAATCACACAGGT^ 
TGCACAATTAGTATGCTAAATCAGAGCAATCTTGTTGTTTCAAATGGTTT atgggagaaatat 
T AGCCAGTGTCTTGACATGCTCATTGATGATAACTGTAGCTTATGT GAAATCATTGTTTCTCA 
GCCTTGCCAGCTATTTCAAAACCACTGCCTGTGCTCGGT AAGTGTTGCCTGTGTCATCATT A 
TTGTCTGGACCAAGGTGATCATAATTTGATATCTGAGAATGGTTTCTGCTG^ATAGTAGATGG 
CAATGACAGGAAACATCCTGATCATTOGTCCTGAAGACTACTTTCT 

ATAGT ATTATCATCCTGCCTCCTAAATTTTTACATCAAAATAATTGTT AAGCACAGGACTTAGA 

AAATAATTTGAAAGGCGATGTCATTTAAAATAATTTTTAA 

TACACTTT ATAAATACCTCATCTGAAGAAACCTTTGTGTTAGACXSTTCX^ 

CTGCCCATGAAATGATCTCAGTTACAAACN 

>1103 

^^TGTTGTTAAGATGTGTATAGTAAACAGAGTTGTrGTCTAGTGCTTGTTTTCATGA 

AC 1 1 1 1 GGCAGATCTTTCrTAGTGACTTTTTATGTTCCTCGGGTGT GTCGGTGTGTTACCCQC 
ATTTQ iGTAI 1 1 1 1 1 1 1 GTAGAATGGACATGCCCGAACGATGTTTGCATTOAAAAGTGCACGG 
AACTGGTTTjACGCTT CAGAGAGGAAAAGnrGTGTCCCGCCCCCCA^ 

1 1 I rOCI 1 1 MIM 1 1 GGCATGCAGCATATAT AAAATGTGATCT AGAATTGAAATTGT AAAACA 
GAATAATCU l lll AA AGAAAATCAGOCTATGAACAGTCAGACTTGT CCTTCCATGGAAAACAT 
AATCT AAA CI I CI 1 1 AAAAACTGGCCATATTTTTTCAGTCTGT AATTCGAATAATGAAGATGGC 
ATAACAAAl 1 1 1 1 1 1 GCTTACTAG<y^TGOGCAAAAGGATGAGATGGATACAGAGGTTTAAAA 
AAAAAAAAAAAAGGC CACTG GGAT ATTTAGTA III G I CCAGATGCAAAACAATTTCAGAATAA 
CAAAAT ACACACC 1 1GCI 1 1 AGGCCACGTATATGATACACTTCATAGATTCTCAAACAAAAGA 
TGAAAATTTCAGTCCACATTAAACATTTTAAAGCATAACT 

GGG GCCCOGT GCCCCAAGT AAAAGACAAGGAGTT AAATTTTACAATGAAATCCACGTATACT 
CAGGc I C n i ACACT AAT AATCAAAGTGGAGACCAACCAGTTOCT AAAGCATAAAAGAGTGAT 
TTTAAGAAAATATCACATTTTGAGTTACGGT ATATTCATCAC A 1 1 1 1 1 CCC I GTAATTTTCACTT 
TAACAGTGATAGTCTTAGCAQCAAAAAT TTGCTG GGAATGTTAGGAAAAAAGG^ 
AT ACCT ACT AATTT AAAAAT A T AT AAA CTATT TTTT A CAGAAfiGGCAGGTTTAAAAGAAAATOC 
AATCATGATCI 1 1 ICATGCCAI I III f G T CAACTTTGCATACACAQGCAGACGCTAACAAAGT A 
CT GT GTCAT AT ACACTGAAGTGT CCT GAAGATGCCTGTGAGCTGCTGATTOCAGCTGGT ACC 
AGGGAAGCATGTGTGGCT AAGTT GCAACAAACTT AAATTTGTTCCCCTTAGTTCTTCATCTTT 
TT AAAAGCTGAATAAACTTAATGTCAATCGGGTGGGGCCGGT ATTAACAGCATCAACCGCTT 
GCTT ACT ATGAATCATCCGCAACAGAAACCCACGTGCATTCTCCA 

AAAACT AACGGCCACTTCCTCCCT AGGTGCCTACT ATCX5CTATGTCTGACA6CATCTGTAAC 
AGGACAAGCTCACCTCT CT CT ACACGGATTCATOCAAAGCCACACACTAGGCCAAATTCTCC 
CAGT GGAATCAAAACACTGGAAGTAGAGCAAGAACCTCAT G<X>ACATTGGCCCAATCACAC ' 
CTCAGTGACTT AGGAGTTCATCATArTTGATCATAAAATTGCTCCCGGC 

CT AAAGACTGTAGTTTT AAAAAGCCACT GTACCGTCCACAATTATTCTGACCACTCTTTTCT AA 
ATGGAGAATGTGCTACGCACTGAwAGATCAGAACACACACGCGGGAGCCw i i GGGCACTGAC

cctgagcctgtacacaagatcagctggtactagtctcaaatgattcatgaggactcaaagtg 
gaaatttaaacattcaattctgtgtatcattcatgtgcccagagtcttcaaagaatgctcgtg 
ttagttacctgacacctctggagaagtgacagggatattcgtgtgaataac^tggctccct 
tc^gacagac^tatggaatcttactcttaaaaaagtaaagatatttcagatatcaaatgtatt 
ataatatactgcggtctaaagttaaaggc^aaatttacac^cgattttctcttcx>tgacttta 
ttacat gtttc ctagtgaattgacacggctcgtgtgtcaaagctcccaggcacgtcacttgg 
tcacagttttcatgtgcgtggggttgatcatggtagaaatgcctatgcgcat c i 1 1 I ICTCG 
gatggtctcatttgactttcatttcgtaagggttcactctccatggtgtagatgataagctac 
cacatttgtgcttttaaacccattcctggcaatcccccttt atcttcattttgtcttt agggg 



gttcatttcttrttagaatgaaaatgcaggtcctgaattctcgtgaagcagcccacagggtta 
caggccaatgattogaacgctgaggtgaacacttagtactcgcatttgagcttcaaagtaaa 
aataaagcagaggacaaacgcctgtcacagcgtggcagcccggttcatttggtaagagaac 



acagccaggcacctgttcaaaagcctctct ggtggtggcggtggaccccaaaacccacg g 
gttggaagagcctttgggaaagcaaattgataccctgattgcagtccttgaagaaatgactt 
agggttagtttaataaacgaacattaaaaaatcttccaaaaattgcattcaaatgtattttaca 
ctttaggcagttcttaagaagggagagtgcagtttcttacactaacattaaaggtttannnnn 

>1104 

gcacataccaagcattgtctgtaacagaataatgtcatggacacacatgcagataaa 
tatacctactaaaattctttgaatgtatgaaatgctattgtggttaatgaaatgcacattcact 
taaacaagcgttttttcagttcctatag gi IHIII IA TTCATGTGTATCAGTCCTCCATCTGT 
AGAAGCTAAAAATACAATAACTCTCTTATACTGGTACTTGCAATGGTTTGACA"rTAAGAGAGA 
GACTATACATTCACAGAGGTTGGGAGCTTCTGTCT AGCCTGTTGTCCAAAACTGCTTATATAA 
TTTAGCAACTAATTTTCACTTTTGACAACTATTTTAATTCTAGAGAATACGTT^ 

cttacagtgttatctatccttccaatgacttattataaattttagaatgtatttctatagggtg 
gaaaaatctcctttagtcagaattgaacagttttcatgaagaacatgttacaccatgtagaaa 
catgggt 

>1105

ccacgcgtccgctgactgtctctctctgtcttgtaatattgccactgcgcatatgtt 
actaggaactgatacccacaaacacacatacgcacggacgatctcacttcgatctatacact 

GGAGGATT AAAACAAACAAACAAAAAAAACATTTCCTTCG CTCCCCCT CCCTCT CCACT CT GA 

G aagcagaggagccgcacggggaggggccgcagaccgtctggaaatgcgaagcct AAAG 

CGTTTCCTCXiCTTGCATTCAGCTCCTCTGTGTTTGCCGCCTGGATTGGGCrrAATGGATACT A 
CAGACAACAGAGAAAACTTGTTGAAGAGATTGGCTGGTCCTATACAGGAGCACTGAATCAAA 
AAAATTGGGGAAAGAAATATCCAAGATGTAATAGCCCAAAACAATCTCCTATCAATATTGATG 
AAGATCTTACACAAGTAAATGTGAATCTTAAGAAACTT AAATTTCAGGGTTGGGATAAAACAT 
CATTGGAAAACACATTCATTCATAACACTGGGAAAACAGTGGAAATTAATCTCACTAATGACT 
ACCGTGTCAGCGGAGGAGT7TCAG AAATG G TGTTTAAAGCAAGCAAGATAACTTTT CACTGG 
G^aAAAATGCAATATGTCATCTGATGGATCAGAGCATAGTTTAGAAGGACAAAAATTTCCACTT 
GAGATGCAAATCTACTGCTTTGATGCGGACCGATTTTCAAGTTTTGAGGAAGCAGTCAAAGG 
AAAAGGGAAGTTAAGAGCTTTATCCA I HIGH I GAGGTTGGGACAGAAGAAAATTTGGATTT 
CAAAGOGATTATTGATGGAGTCGAAAGTGTTAGTCGTTTTGGGAAGCAGGCTGCTTT AGATC 
CATTCATACTGTTGAACCTTCTGCCAAACTCAACTGACAAGTATTACATTTACAATGGCTCATT 
GACATCTCCTCCCTGCACAGACACAGTTGACTGGATTGTTTTTAAAGATACAGTTAGCATCTC 
TGAAAGCCAGTTGGCTG l llll l GTGAAGTTCTT ACAATGCAACAATCTGGTTATGTCATGCT 
GATGGACT ACTT ACAAAACAATTTTCGAGAGCAACAGTACAAGTTCT CT AGACAGGTGTTTTC 
CTCATACACTGGAAAGGAAGAGATTCATGAAGCAGTTTGTAGTTCAGAACCAGAAAATGTTC 
AGGCTGACCCAGAGAATTATAGGAGCGTTG71 6 1 1 ACATGGG AAAGACCTCGAGTCGTTT AT 
GATACCATGATTGAGAAGTTTGCAGTTTTGT ACCAGCAGTTGGATGGAGAGGACCAAACCAA 
GCATGAATTTTTGACAGATGGCT ATCAAGACTTGGGTGCT ATTCTCAATAATTTGCTACCCAA 
TATGAGrrTATGTTCTTCAGATAGTAGCCATATGCACTAATGGCTTATATGGAAAATACAGCGA 
CCAACTGATTGT(X3ACATGCCTACTGATAATCCTGAACTTG^TCTTTTCCCT GAATT AATTGG 
AACTGAAGAAATAATCAAGGAGGAGGAAGAGGGAAAAGACATTGAAGAAGGCGCTATTGTG ' 
AATCCTG6T AGAGACAGTGCTACAAACCAAATCAGGAAAAAGGAAGCCCAGATTTCTACCAC 
AACACAu i ACAATCGCA i AGGGACGAAATACAaTG aAGCCaaGaC I AACCGaTCCCCAACAA

gaggaagtgaattctctggaaagggtgatgttcccaatacatctttaaattccacttcccaac 

CAGTCACT AAATTAGCCACAG AAAAAGAT ATTTCCTT GACTTCTCAGACTGTGACT G AACTGC 
CACCTCAGACTGTGGAAGGTACTTCAGCXn"CTrTAAATGATGGCTCTAAAACTGTTCTTAGAT 
CTCCACATATGAACTTGTCGGGGACTGCAGAATCCTTAAATACAG7TTCTATAACAGAATATG 
AGGAGGAGAGTTTATTGACCAGTTTCAAGCTTGATACTG^AGCTGAAGATTCTTCAGGCTCC 
AGTCCCGCAACTTCTGCT ATCCCATTCATCTCTGAGAACATATCCCAAGGGTATATATTTTCC 
TCCGAAAACCCAGAGACAATAACATATGATGTCCTTATACCAGAATCTGCT AGAAATGCTTCC 
GAAG ATTC AACTTCATCAGGTTCAGAAGAATCACTAAAGGATCCTTCTATGGAGGGAAATGT 

gtggtttcctagctctacagacataacagcacagcccgatgttggatcaggcagagagagc 
ttrctccagactaattacactgagataogtgttgatgaatctgagaagacaaccaagtccttt 
tctgcaggcccagtgatgtcacagggtccctcagttacagatctggaaatgccacattattc 
tagctttgcctacttcccaactgaggtaacacctcatgcttttaccccatoctccagacaaca 
ggatttggtctogacggtcaacgtggtatactcgcagacaacccaaccggtatacaatggt 
gagacacctctt caaccttcct acagt agtgaagtcttt cctct agtcacccctttgttgctt 
gacaatcagatcctcaacactacccctgcrrgcttcaagtagtgatrcggccttgcatgctac 
gcctgtatttcccagtgtcgatgtgtcatttgaatccatcctgtctt cct atgatggtgcacc 
tttgctt ccattttcctctgcrrrccttc agtagtg aattgtttcgccatctg catacagtttct 
caaatccttccacaagttacttcagctaccgagagtg ataaggtgccctt gcatgcttctct 

GCGAGTGGCTGGGGGTGATTTGCT ATTAGAGCCCAGCCTTGCT CAGT ATTCT GATGTGCTGT 
CCACT ACTCATGCTGCTTCAGAGACG CT GGAATTTG GT AG TGAATCT G GTGTTCTTT AT AAAA 
CGCTT ATGTTTTCTCAAGTTGAACCACCCAGCAGTGATGCCATG^TGCATGCACGTTCTTCA 
GGGCCTGAACCTTCTTATGCCTTOTCTGATAATGAGGGGrCCGAACACATCTTCA^ 
TACAGTTCTGCAATACCTGTGCATGATTCTGTGGGTGTAACTT ATCAG GGTTCCTTATTTAGC 
GGCCCTAGCCATATACCAATACCTAAGTCTTCGTrAATAACCCCAACTGCATCATTACTGCAG 
CCT ACTCATGCCCTCTCTGG7GATGGGGAATGGTCTGGAGCCTCTTCTGATAGTGAATTTCT 
TTTACCTGACACAGATGGGCT GACAGCCCTT AACATTTCTT CACCT GTTTCT GTAGCTGAATT 
TACATATACAAC^TCTGTGTTTGGTGATGATAATAAGGCGCTTTCTAAAAGTGAAATAATATAT 
GGAAATGAGACTGAACTGCAAATTCCT 1 C I T7CAATGAGATGGTTTACCCTTCTGAAAGCACA 
GTCATGCCCAACATGTATGATAATGTAAATAAGTTGAATGCGTCTTTACAAGAAACCTCTGTT 
TCCATTTCT AGCACCAAGGGCATGTTTCCAGGGTCCCTTG CTCATACCACCACT AAGGTTTTT 
GATCATGAG^TTAGTCAAGTTCXiAGAAAATAACTTTTGAGTTCAACCT ACACATACTGTCT CT 
CAAGCATCTGGTGACACTTCGCTTAAACCTGTGCTT AGTGCAAACTCAGAGCCAGCATCCTC 
TGACCCTGCTTCTAGTGAAATGTTATCTCXriTCAACTC^AGCTCTT ATTTTATGAGACCTCAGC 
I ICT 1 1 1 AGTACT6AAGTATTGCTACAAC<rrrCCTTTCAGGCTTCTGATGTTGACAC C I fGC l I 
AAAACTGTT CTT CCAGGTGTGCCCAGTGATCJCAATATTGGTTGAAACGGCGAAAGTTGATAAA 
ATTAGTTCT ACAATGTTGCATCTCATTGT ATCAAATTCTGCTT CAAGTGAAAAGATGCTGCACT 
CTACATCT GTACCAGTTTTTGATGTGTCGCCTACTT CTCATATGCACTCTGCTTCACTTCAAG 
GTTTGACCATTTCCTATGCAAGTGAGAAATATGAACCAGTTTTGTTAAAAA^ 
ACCAAGTGGTACC » I CI 1 1 GTACAGTAATGATGAGTTGTTCCAAACGGCCAATTTGGAGATTA 
ACCAGGCCCATCCCCCAAAAGGAAGGCATGTATTTGGTACACCTGTTTTATCAATrG^ 
CCATTAAATACACTAAT AAATAAGCTT AT ACATTCCGATGAAATTTTAACCT CCACCAAAAGTT 
CrrGTTACTGGTAAGGTATTTGCTGGTATT(XIAACAGTTGCTTCT 
ATCATTCTGTTCCTATAGGAAATGGGCATGTTGCCATTACAGCTGTTTCTCCC^ 
GTTCT GTAACCTCAACAAAGTTGCTG 1 1 1 CC I I CTAAGGCAACTTCTGAGCTGAGTCATAGTG 
CCAAATCTGATGCCGGTTTAGTGGGTGGTGGTGAAGATGGTGACACTGATGATGATGGTGA 
TGATGATGATGACAGAGAT^GTGATGGCTTATOCATTCATAAGTGTATGTCATGCTCATCCTA 
TAGAGAATCACAGGAAAAGGTAATGAATGATTCAGACACCCACGAAAACAGT CTTATGGATC 
AGAATAATCCAATCTCATACTCACT ATCTG AGAATTCT GAAGAAG ATAAT AGAGTCACAAGTG 
TATCCTCAGACAGTCAAACTGGTATGGACAGAAGTCCTGGTAAATCACCATCAGCAAATGGG 
CTATCCCAAAAGCACAATGATGGAAAAG AGGAAAATG ACATTCAG ACTGGT AGTGCTCTGCT 
TCCTCTCAGCCCTGAATCTAAAGCATGGGCAGTTCTGACAAGTGATGAAGAAAGTGGATCAG 
GGCAAGGTACCTCAGATAGCCTT AATGAGAATGAG ACTTCCACAG ATTTCAGTTTTGCAGAC 
ACTAATGAAAAAGATGCTGATGGGATCCTGGCAGCAGGTGACTCAGAAATAACTCC^ 
OCCACAGTCGCCAACATCATCTGrrTACTAGCGAGAACTCAGAAGTGTTCCAC^ G 
CAGAGGCCAGTAATAGT AGCCATGAGTCTCGTATTGGTCTAGCTGAGGGGTTGGAATCCGA 
GAAGAAGGCAGTTATACCCCTrGTGATCGTGTCAGCCCTGACTrrTATCTGTCTAGTGGTTC
TTGTGGGTATTCTC^TCTACTGGAGGAAATGCTTCCAGACTGCACACTmACTTAGAGGACA 
GT ACATCCCCT AG AGTT AT ATCCaCACCT CCAACACCT ATCTTTOCAATTTCAGATG ATGTCG 
GAGCAATTCCAATAAAGCACTTTCCAAAGCATGTTGCAGATTTACATGCAAGTAGTGGGTTTA 
CTGAAGAATTTGAGACACTGAAAGAGTTTTACCAG GAAGTGCAGAGCTGTACTGTTG ACTTA 
GGT ATT AGAGCAGACAGCTCCAACCACCCAG ACAACAAG C^CAAGAATCGATACATAAATAT 
CGTTGCCTATGATCATAGCAGGGTTAAGCTAGCACAGCTTGCTGAAAAGGATGGCAAACTGA 
CTGATTATATCAATGCGAATTATGTTGATGGCTACAACAGACCAAAAGCTTATATTGCTGCCC 
AAGGCCCACTGAAATCGACAGCTGAAGATTTCTGGAGAATGATATGGGAACATAATGTGGAA 
GTTATTGTCATGATAACAAACCTCGTGGAGAAAGGAAGGAGAAAATGTGATCAGTACTGGCC 
TGCX3GATGGGAGTGAGGAGTACGGGAACTTrCTGGTCACTCAGAAGAGTGTGCAAGTGCTT 
GCCT ATTATACTGTGAGGAATTTTACTCT AAGAAACACAAAAATAAAAAAGGGCTCCCAGAAA 
GGAAGACGCAGTGG^CGTGTGGTCACACAGTATCACTACACGCAGTGGCCTGACATGGGAG 

TACCAG AGTACTCCCTGCCAGTG ctg accttt gtg agaaag gcagcctatgcc aagcgcca 

TGCAGTGGGGCCTGTrGTCGTCCACTGCAGTGCTGGAGTTGGAAGAACAGGCACATATATT 
GTGCTAGAC^TATGTTGCAGCAGATTCAACACGAAGGAACTGTCAACATATTTGGCTTCTT 
AAAACACATCCGTTCACAAAGAAArrATTrGGTACAAACTGAGGAGCAATATGTCTTCATTCA 
TGATACACTGGTTGAGGCCATACTTAGTAAAG^AACTGAGGTGCTGGACAGTCATATTCATG 
CCT ATGTTAATGCACTCCTCATTCCTGGACCAGCAGGCAAAAC AAAGCTAGAGAAACAATTC 
CAGCTCCTGAGCCAGTCAAATATACAGGAGAGTGACTATTCTGCAGCGCTAAAGCAATGCAA 
CAGGGAAAAGAATCGAACTTCTTCTATCATCCCTGTGGAAAGATCAAGGGTTGGCATTTC^^ 
CCCTGAGTGGAGAAGGCACAGACTACATCAATGCCTCCTATATCATGGGCT ATT ACCAGAGC 
AATGAATTCATCATTACCCAGCACCCTCTCCTTC^^ 

TGGGACCATAATGCCCAACTGGTGGTTATGATTCCTG^TGGCCAAAACATGGCAGAAGATGA 
ATTTGTTTACTGGCCAAATAAAGATGAGCCTATAAArrGTGAGAGCTTTAAGGTC^CTC^ 
GGCTG AAG AACACAAATGTCT ATCTAATG AGGAAAAACTTATAATrCAGGACTTTATCTTAGA 
AGCTACACAG^ATGATTATGTACTTGAAGTGAGGCACTTTCAGTGTCCTAAATGGCCAAATC 
CAGATAGCCGCATTAGTAAAACTTTTGAACTTATAAGTGTTATAAAAGAAGAAGCTGCCAA 
GGGATGGGCCTATGATTGTTCATGATGAGCATGGAGGAGTGACGGCAGGAACTTTCTGTGC 
TCTGACAACCCTTATGCACCAACTAGAAA^AGAAAATT(XXBTGGATGTTTACCAGGTAGCCA 
AGATGATCAATCTGATGAGGCCAGGAGTCTTTGCTGACATTGAGCAGTATCAGTTTCTCT AC 
AAAGTGATCCTCAGCCTTGTGAGCACAAG GCAGG AAGAG AATCCATCCACCT CTCTGGACA 
GTAATGGTGCAGCATTGCCTGATGGAAATATAGCTGAGAGCTT AGAGTCTTTAGTTTAACAC 
AGAAAGGGGTGGGGGAACTCACATCTGAGCATTGTTTTCXrrcrrTCCT AAAATTAGGCAGGAA 
AATCAGTCT AGTTCTGTTATCTGTTGATTTCCCATCACCT GACAGTAACTTTCATGACATAGG 
ATTCrrGCCGCCAAATTTATATCATTAACAATGTGTGCCTTTTTGCAAGACT^ 
TTATGTTTG AACT AAAATGATTGAATTTTACAGTATTTCTAAGAATGGAATTGTGGT A 1 1 1 I II I 
CTGT ATTGATTTTAACAGAAAATTTGAATTTATAGAGGTT AGGAATTGGAAACTACAGAAAATG 
TTTGi 1 1 1 1 AGTGTCAAATTTrTAGCTGTATTTGTAGCAATTATCAGGTTTGCTAGAAATATAA 
CI 1 1 1 AATACAGTAGCCTGTAAATAAAACACTCTTCCATATGATATTC^ACATTrrACAACTGC 
AGT ATTCACCTAAAGT AGAAATAATCTGTT ACTT ATTG T AAAT ACT GCCCT AGTGTCTCCATG 
GACCAAATTT ATA TTTAT AATTGT AGATTTTT ATATTTTACT ACTGAGTCAA G 1 1 1 ICIAGTTCT 
GT GTAATTGTTTAGTTT AATGACGT AGTTCATTAGCTGGTCTT ACTCTACCAGTTTTCTGACAT 
TGT ATTGTGTTA CCTA AGTCATTA ACTT TGTTTCAGGATGTAATTTTAACT^ 

gaaat acctt c^ttttgaaagaagttttt atgagaat aacaccttaccaaacattgttc^ 
gttttt atccaaggaattgcaaaaataaatataaatattgccattaaaaaaaaattcactgqcc 

GTCGTTTTACAACXSTCGTGA^ 
CCCCCTTTCGCCAGC^ 
TGCGCAGCCTGATGGCGNN 
>1106 

nnknngt acca ctt catcaaaatcatggatctt ggct gggtgcggtgcctcatgcc 
tgtaatcct agcacttt gggaggctgaggcaggcaatcactt gaggtcaggagttcgagac 
cagcct ggccatcatggcgaaacctcatctctactaaaaatacaaaagttaaccaggtgtgt 
ggtgcacgtttgtaatcccagttactcaggaggctgagg cacaagaattgagtatcacttta 

ACTCAGGAGGCAGAGGTTGCAGTGAGCCGAGATTGCA^ 

AAAATAAAATAAAATAGTCGTGAATCTTGTTCAAAATGCAGATTCXJTCAGATTCAATAAT 
GCTCAGACTGGGAACAGGGCCCAGGAATCTGTGTGGTACAAACCTGCATGGTGTTTATGCA
CACAGAGATTTGAGAACCATTGTTCTGAATGCTGCTTCCATTTG^CAAAGTGCCGTGATAATT 
TTTGAAAAGAGAAGCAAACAATGGTGTCTCrrTTTATGTTCAGCTT ATAATGAAATCTGTTTGTT 
GACTTATTAGGACTTTGAATTATTTCTTTATTAACCCTCTGAGTTTTTGTAT 
AGAAAAATGCAATCAGGATTTTAAACATGTAAATACAAATTTTC^ 

GTG AAATTTTCAG GTAGTCTGAGTAATAG ATTGTTTTGCCACTTAGAAT AGCATTGGCACTTA 

GTTTTGGAGGCAAAGTGGGGATTATTGTCAGGTTGTCCTGTTCTATAAAATTTAAGGTCAGGT 

TGGCCCTTTTTGGGAAAGTTGCCTTTAAGATTGTGGCATGGTTCACGGGTTTGCT^ 

TGGTATAAAATTCTAAAAAAGGGGATTGGGGTCATAGGGGAAAGGGGAAGAGAGGTAAATG 

N 

>1107 

NGTCTGTGAGATGTTACCACrrAGTATTTGGAAAAAGAATAAAAATGTGGCCGGGCGT 
GGTGACACATGCCTGTAATCTAGCCACTTGGGAGGCCAAGGCAGGAGAATCGCTTGAACCT 

GTGAACTCTGTTCAGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNr4^NNNNNNNNNNNN 

NNNNNNNNGGTCATTAGTTATTAATTTTACACAGTTAACACTGAAAAATGAATGATATTTAATC 

ATTGTCACTTACTGAGAAGCAAGAACAATGAGTGAGCCCAAAGGAGTCTACTACCATACCTA 

TTAAGTGTAGGGAAGGGTTTAAGTATTTTTTACATAC I IIIUI I CTGTCATTGGAAAAACACCA 

CCCATCTGAAATG G ACAG AAGAAAATTTTCCCAGG TGTTCT ACTCTCATCAG AACAGCTTGG 

TGCCAGTGACCT(X^ACACTGTTAGCTGGCTCCCATACTGCTTGAAGGGCACCAGTCAAGAG 

CTGTCAAAG^GAGTCTCCTGAGAAATCAGCATrTGTGGGAGATCCCACTTCCAAAAAGGTAG 

TG CAG CAGTGCT GTTTCCGAGCAGCAGTGAATGTG G G GG CTGAAGACGACG CTTT ACAGGG 

CCTTGCTTAGGCCCTGGGGATGGGAGTAAACAGAGTCATGATATGAGGATGTAGTGAAATG 

AGTTTCT GCANN 

>1108 

NACGAAATTTAATACGACTCACTATAGGGAATTTGGCCCTCGAGGCCAAGAATTCGG 
CACGAGACAAAAATTAGCTGGGCATAGTGGCGGGTACCTATAATCCCAGCTACTCGGGAGG 
CTGAGGCACGAGAATCACTTGAAGCCGGGAGGTGGGGGTTGCAGTGAGCCGAGATCACAC 
CACTGCACTCCAG CCTGGGCAACAAGAGCAAAACTCCGTCTCAAAAAAGATGAAAATAAAAT 
AAAATATATATTTACAGGCCTACAACTrrTGCCTCAGACTGTTCCGCTTTTCT 
AGTTTTCACCTTTTAAGCTTCATATCCTCAGTGCTTGTAG 

GTCATTGCAGTTGTTTGCTT AAAGA CT TATTGA AATGGTT ACTGGCGTAAATACTTGGCCAAC 
TCAACTTT ATTGCCCCTGATC 1 1 1 1 CCA I 1 1 1 i GTTTCCACCTTAACCT ATAGCAGCTCCTCCA 
AATGAGGAATGCTGTAACTAAG ACTCATCA AACAGATTrTAACCATriTATTATCCTGTGTGT 
CCTTACATTQCTTCTGTGAGATG I ITTTTTC1 1 ATCTGAGATGAACTTTCAGGAGCCT ATTTGA 
ACTCCAGACTGGTGTrCTGGGGCAAAGAGCTATTAGCCAAACTGATTCT ATGCAGGTGAAGG 
ATGCACTAAAGTTCTCACTTTAGTGAGAACT1TTCTAGCTATTCCAATACA A 
TAGGG CT ATTGATATTGACACCAAATGGAGTGG CTTCTCAGCCTCTTAATGTCTTAAGTAAGT 
GCTTAATTTGGAATAGAGAAACCAGTATATTTTAAAAAAGAAAAATATTCTTT 
AAATTCTCCGATTATAACAGTGAACAGAGCTCCAGXSTAATAACGCATAGGCATGTCAGGT^ 
CATCTGTATATTTGACTACATTAGT ATTAGTGACATCAGGTGGATATAAAAGAAAACCCTTGG 
AAAGAGAACTGCCTTAGCCATGATTTCGTTAGTAGACCT ATTTATGATTCAATTGCAATTTTCA 
GAT AGGATGTGAACATG GAATTTC^TTGAAAAT AGTTTAATTTTTT AT ATAAAAG GTTTTGT AT 
ATAATGTGTGTCAGTGACTATTTTCAAAATC^TTTTCAT 

CATTGCATACACATATGCACACXjTATGTGCACGTGCCAGACATTTTTTGTATAATGTTGGG^ 

tgattataaaagtgttgtcaaatgttttatttatctgcatatagcagtggttgg 
ttgaaatttttgcgcattgatgcattgaaat aag gaaaatt atttatct ctgagcactaaactt 
a t tt ttg cat atttct gt aat attgcagt ccccag at ccagaacatg ggaag tt agggaaaat 

GTGTGATTTTGTGTTTTGAATTACTGTCAGAATTAGATACACAATTACAACAAAC 1 1 1 1 1 1 1 A AA 

agacatttcatrgtactgcaaaaatctgaatatttat a imciig i iuh i cim a tatgtttt 
gcatttt aat atgttgagccactggaaatttgt aacagatt aatttgtta aaatg 
tgttgtcattgtctccattgtctttgtccagagcctattatt 

t gt cagttgatt agaaaact aaaaaagaaaaaaaaaaaagaagaaaaaaaaaaaaaaaaaagg 
aaaaaggaaaaAcacaagaaa a aaagaaacaaaaacagagag^agaaaaacagaacagaaac 

AGGAGCAAAGAACCAGACGACACTAAATACTGAGTGGGAGAGGGAGGGTAAGCAGCAATGA 
ACT AGG AT AGGTCAT AGG GAGGCGGAAG GACCAAACAACG W5ACGCCAGTCAAGA CATGG 
TCOACAAGGAAAAGAGGGAGGACGACAAGCCGTGACGGATTGTGATAGATACAAGAGGAAG

CAGATAGAGAGGAGAACCCACATAGTGGGGGGGGGAGlAAGCGGAAAAACACACAAAGAGA 

GCGGGCACAGT6ATGCCAACAAAGCAAACGGTGGGA6GGAACAAGCANN 

>1109 

NNNACACAAGATTTACAOCACTAAAACACAACATAAACT AGGCACAGTGGGACACAC 
TTGTAATCC00CCTACTCAAGAAGCTGAGGCAGTAGAATTACCT6AGTTTTAGTCCAACGGT 
GAGTATACACACGTACATACATATACATACACACACACACTCTCTCTCTATATATATAGGTGTT 
CATCCATG<5GGTCCACAAAGATAAC7AGAGTACATTTTGGGCCTTTAATCCCATCT AAACAAT 
TTGCTGTTAACGAAACTCAAAAACAGAAATACCTATATTTTCTCGCTAAATCCAATTGTTACCT 
ATGATGAGTAAAGACACTAGATCTGCAGGTCCTAGTACAATCTATACATAAAAGGCCTTCAGA 

tttgaggcacaaaaaanaaagggcaaaaaaagaaaaaaaagaataaaaaaccttctacacatt 

TCCTTCTTTT ATCTCCAATATGAGAAGGAATCCTTTCTAACTCTAATAACATATTAACAAGAAT 

TAAGAACACGATTGTCGGGGAACTCAGATGTTGGCAAAGCTTAAAAATAAAAAAACAAGGGC 

TGGGTGCAGTGGCTCAGGCCTATAATCCCAACACTTTGTGAGGCCGAGGCAGGAGGATTGC 

TTAAGCCCAGGAGTTTGGGATCAN 

>1110 

>1111 

>1112 
>1113
>1114 
>1115 
>1116 
>1117 

>1118

>1119 
>1120 
>1121 

NNW<NNNNNNNNNhK^GACAGAGTCTGACCCTGTCGCCGAGGCTGTAGTGCA 
GTGCCATCTCAGCTCACTGCAAGCTCCGCCCACTGGGTTCACGCCA7TCTCCTGCCTCAGC 
CTCTCX3AGTAGCTGGG>ATACAGGCACCCACCACTATGCCCGGCGAATTTTTTCT ATATTTA 
GT AGAGAGG GGGTTTCACCGTGGTAG CCAG GATGGTCTTGAT CT CCTGACCT CG T GAT CCA 
CXX^CCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACXXrr GCCGGGCTGAAAA 
ATAACCCTTTAGATATCTACAGCTTTAAACTGTGTGCAGTCATGAA 

TCATTGGCATTTAATAAATTGGAGAAAAATTATAGAGTAAATACATTACAA7 CATTAAT AATAG 
GCTCTAATGAGAAGAATTTAATAAATAATCATTAAAAAGACAGCAGJVATTTT ATCTGTTCT 
TATGTTGCTGCTCTTCTTATCAAATACTATAATAAAACTATATGACT ATAATATAGATTTCAGG 
AGCTAAAAAAAGCCTT ATATTTTCAAATAAAAG^AACAATATAAATTTTGCAAAATATGACGAGC 
ATTACTGCAGTATAAAGTAAATATCTGGAATTAAAATATACCATCATTTAGATATAGACTAAAA 
AAAAAGAATATAAATQTTAATGA I FO CI 1 1 C I QCCTGCAGTGAGCTTAAAATTACAACCAAAAA 
TTTTAATAAATATGT AGCACCT ACAAGACATTTTATTAATAGCTT ACATAATGTGGAAATTTGA 
GCAATTTATTTTA.GAATTTTTGAATCTAAAAATCACCAGCTT GACATTCATTTGAGAAAGTGAA 
ACATAAAGCGGAATTCCGACGACCACAACCACCACCTCGCGCTG 

CGAGAACTATGACTTTAGCTCCGCCX^TCXX^CTCCTCGCTGCGCTACTACAGCGAGGGT 
GAQAGCCGICGGCGGCGGCGGCAGCTCCTTGTCGCTGC^TCCGCCGCAGCAGCCT 



GTACCCGGCTTGGCGGCCCGGCGGCCCGGCACCGCT ATGAGGTAGTGACGGAGCTGGGC 

ccggaggaggtacgctggttctacaaggaggagaagaagacctggaagccgttcatcggct 
a<»actcgctccgcatcgagctcg£ctra» 

occagggcggggaccgggacggcgaccatgtgtgctcccccacgggcgcagcctccagt 

tccggagaagatgacgatgaggaccgcgcctgcggcttctgcx^gagtacgacggggcac 

gagccggagatggtggagcttgtgaacatcgagcctgtgt Gcgtgcggggcggcctctac 

gaggtg<mtgtgacccaaggagagtgct acccg gtgt actg gaaccaggctgat aaaatac 

cagtaatx30gtggacagttmtttattgacggcacttggcagxxtct agaagaggaagaaag 

taattraattgagcaagaacatctgaattgttttaggggocagcaga 

t attgaag tgtcaaaatccatagatg gaaaagatgctgttcatagtttcaagttgagtcgaaa 
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ccatgtggactggcacagtgtggatgaagtatatctrratagtgatgcaacaacatctaaaat 

tgcaagaacagttacccaaaaactgggattttctaaagcatcaagtagtggtac<>gacttc 

atagaggttatgtagaagaagccacattagaagacaagccatcacagactacccatattgta 

tttgttgtgcatggcattgggcagaaaatggaccaaggaagaattatcaaaaatacagctat 

gatgagagaagctgcaagaaaaatagaagaaaggcatttttccaaccatg 

aatttctgcctgttgagtggcggtcaaaacttactcttgatggagacactgt^ 

ctcctgacaaagtacgaggtttaagggatatgctgaacagcagtgcaatggacataatgtat 

TATACTAGTCCACTTTATAG^GATGAACTAGTTAAAGGCCTTCAGCAAGAGCTGAATCGATTG 
TATTCCCTTTTCTGTTCTCGGAATCCAGaCTTTGAAGAAAAAGGGGGTAAAGTCTCAATAGTA 

tcacattocttgggatgtgtaattacttatgacataatgactggctggaatccagttcggctg 
tatgaac^ttgctgcaaaaggaagaagagttgcctgatgaacgatggatgagctatgaaga 
acgacatcttcttgatgaactctatataacaaaacgacggctgaaggaaatagaagaacggc 
ttcacggattgaaagcatcatctatgacacaaacacctgccttaaaatttaaggt^ 

TCTTCTGTATGGGATCCCCATTAGCAG 1 1 T I CI FGGCGTTGCGTGGCATCCGCCCAGGAAAT 
ACT GGAAGTCAAG>CCATAT7TTGCCTAGAGAGATTTGTAACCGGTTACT aaatatttttcat 
CCTACAGATCCAGTGGCTTATA£MTTAGAACCATTAATACTGAAACACTACAGCAACATTTCA 
CCTGTCCAGATCCACTGGTACAATACTTCAAATCCTTT ACCTTATGAACATATGAAGCCAAGC 

titctcaacccagctaaagaacctacctc^gtttcagagaatgaaggcatttcaaccatacc 
agccctgtgacctcaccagttttgtcccgccgacactatggagaatctataacaaatata^ 

CAAAGCAAGCAT ATT AGGGGCTGCTAGCATTGGAAAGGGACTT GGAGGAATG n GTTCTC AA 
GATTTGGACGTTCATCT ACAACACAGT(^TCTGAAAC*TCAAAAG>CTCAATGGAAGATGAG 
AAGAAGCCAGTTtsCCTCACCTrCTGCTACCAGCGTAGGGACACAGAC 

gttctggcttcctoqattctg^cattggagttggatcagaggat^ 
ggccttgtgg agagccgcta ttggtcagctgtcaogtcgcatactgcctattggtcatcctt 
GGATGTTGCCCI 1 1 1 1 CI 1 1 iaaccttcatgtataaacatgagcacgatgatgatgcaaaacc 

CAATTrAGATCCAATCTGAACTCTrGAAGGACATGAATGGOCTAAAACTGA I I I I H I II M I C 

cgttaaaatgtgtgtgtcaagatacggagatttcagggttaaagtatatttcag 1 1 1 ici 1 1 a 
ggggaacatatatttgaatttaaaagcactttatttaaaaaaaaaaaaaaaaatccgactaaa 
aaaaaaaaaaaaggaagg^cgctagagtagtctagagaaaaaacctcccwcacctcccocaa 
aacctgaaacatagnaatgaataaagann 

>1122 

tWNNNCGGTTTTCCGGGCAGGTACGCGGGGGCGGCCCGTTCCAGAGGGCGGCGC 
TCGACCAGTTGCCTGACGAGAGCTCTTCAGCAAAAGCCCTTGTC^ 

rrATCTAACACGTGGAATGAAAAGTACAGTrCTTTACAGAAAACACCTGTTTGGAAAGGCAG 
GAATACAAGCTCT GCTGTGGAAATGCCTTTCAGAAATTCAAAACGAAGTCGAC I mTTCTGA 
TGAAGATGAT AGGCAAAT AAAT ACAAGG TCACCT AAAAGAAACCAGAG G GTTGCAATGGTTC 
CACAGAAATTTACAGCAACAATGTCAACACCAGATAAGAAAGCTTCACAGAAGAT^ 
GATTACGTAATCTGCTC AAGCTT CCTAAAGCACATAAATGGTGTATATACGAGTGGTTCTATT . 
CAAATATAGATAAACC^CTTTTTG>AGGTGATAATGACTTCTGTGTATGTCT AAAGGAATCTTT 
TCCT AATTTGAAAACAAGAAAGTTAACAAGAGTAGAATGGGGAAAAA1TCGGCGGCTTATGG 
GAAAACCACGGAGATG I ICI ICI GCATTTTTTGAGGAAGAGAGJkTCAGCATTAAAACAGAAA 
CGGCAGAAAATAA GGCTC TTAGAACAAAGGAAAOTTGCAGATGTTTCACAATTC 
CCCAGATGAAATTCCTTTGC^rrCTGGTTATTGGAACGAAAGTTACAGCACQATTACGTGGTG . 
TTC ATGAT GGTTTGTTCACTGGACAAATAG^ATGCTGTGGATACTCTT AATGCTACTTATAGAG 
TAACTTTTG^TAGGACAGGGCTTGXaAACCCATACCATCXXrrGACTATG 
GAACCTCATGAGACAATG CCAATTGCTGCCTT GGG ACAAAAACAGCGG ccttctcgattttt 
TATGACCOCACCACGGTTACATTATAjCTCCTCCTCTCCAGTCAGCAATTATAG^ 
TTTATT AGGAClAGTCGCCGTGGAGAAGTAAAATTTCTGGCTCTGACACTGAAACATT AGGTG 
GTTTTCCAGTAGAATTTCTTATCCAAGTGACCAGATTATCAAAAATTCT 

GAAGATATCAAGAACATCAGGGACATGAACACAGAAGCAGAAACACTGGCAATCATATTGCA . 
TGGGAGTCAGCATTGG ATTC^CGCGGAGATTTGCGACAATCGGT CTCGGGGACTT GAACAG 
GTGAACACAGGGCCTAAACACAGTTTGGGTAACGGTCACCGGATTGCT ATGAGATTGTCCC 
AACACGGGGGTCCAGCTGAGAGATGAGCGAAGGTATTAACCCGTGTGTAGCGGCGAAACAC 
CCGGCACAGTGTCCGCGCGACATCCTCAGOGGAGAACCCGTCTTTAjC^ 
GACAGCAATTC>GATCGCGACTGGTGACCAATTGTGCCAGC^ 
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CTCAGAGTAACGAAACATGATCACGAGCACTAACAAACGATAAGTGAAGCGAACGGCCCAC 

GGCTGCAACGACTGCTTCCTCAT CAT ACATCTTTCATT AT GAN 

>1123 



laggTGGATGCAAGATCAAGGAAATACGAGAGAGTACAGGGGCT 



caggtccaggtggcaggggatatgct acccaact caactgagcgggccatcactattgctg 
gcattccacaatcc^tcattgagtgtgtcaaacagatctgcgtggtcatgttggagtcxxxx^ 
ccgaagggcgtgaccatc»^tacc«ggccaaggc^ 
gtggtc flgga caggtacagcacaggcaccgacagtc^ 
catgtgcctcaaccctgacctggagggaccacctctagagttgaccaagct 
gcaatgcaacagtctcattttcccatgaggcatggcaacaccggattcagtggcattgaatc 
gagctctccagaggtgaaaggctattgggcaggtttggatgcatctgctcagact acttctc 
atgaactcaccattccaaacgatttgattggctgcataatcgggcgtcaaggcgccaam 

AATGAGATCCGTCAGATGTCTGGGGCGCAGATCAAAATTGCGAACCCAGTGGAAGGATCT a 

ctgataggcaggttaccatcactggatctgctgccagcattagcctggctcaatatctaatc 
aatgtcaggctttcctcxsgagacgggtgccatg^ggagcagct agaacaatgcagattcat 
ocataatocc i 1 1 c i g ctgttcaccaccacccatgatccatctgtgtagtttctgaacagtca 

GCGATTC CAGGT t tt AAATAGTTTGTAAATTTTCAGTTTCT ACACACTTTATCATCCACTCGTG 
Allll 1 1 AATTAAAGCGTTTTAATTCXrrrTCrCTGTTCAGCTGTTGATGCT GAGATCCATATTTA 
GTTTTATAAGCTTCTCCCTGG 1 1 1 1 1 1 1 H U 1 1 1 1 IGGI I H I I 1 1 1 1 1 IGGCTCATGAATTTTT 
C1 GM IGT C^TGGAAATGTAAGAGTGGAATATTAATAC^TTTCAGTTTAGTTCTGTAATGTCAG 
GAATTTTTCAAAAAAATTAAAAGATGGACTGGAGC I 1 1 MC I I I GTGAAT AGAAACT GGATGC 
^^^^^^' r ^^^^]^^^^CCTUrrCTCJTGCTQrVJ ATTrrTGTACCTTTT ATCCCT C 

aaaggaccci i ctt gggttttgaatggaagcctttattccggtt aagatg 1 tt ic i ictatttt 
agca cttccaiu i ml 1 1 b t gg ccctcgatcct atttttccctgact ccat gctt ggttggc 
ccttataaaacttgtgcccaaaagattgaggattagacttt ccgagg acttacctgtcctag 
gggagt aggcaagcacttccactagggagggggtgggggaaaggaatgacacatgacata 
catgg catac acattaagcagttg atca tatgtctgactgggttccagtttcttgggaatgtt 
ggtccccttgttgaggcttgcatattttaaactaaaaatt^ 

CATTTATAGTCCTCCATAACAAGTTAGAAGGATGTATCTTJCT^ 

GAAAGT TGGGG CTTGACATTATACTCATTTAGTGAGAGTAGATGCAAAAAAGTGGAGGGGCA 
GGAGAACI I C T CCAGACACCTCAGAT AAAGTCCG^AGCCCAAGGCTTTATCTTAACCATGT A 
TGGT 
>1124 

W><NNNNNNNNNNKCGAAATACTTCT AGAATTAATTAACGCGGGGAAGATTTCGAAAA 
CTAGATATCCATATGCAAAAGAATAATACAGGACTCTTATGGTACT AGAT ACACAATAATCAA 
CTCAAi^TGATTAAAGATTATACACAAAACTTGGAACCATAAAGCTCXrr ATAAAGACAACCT C 
AGGTGTGOGT ATATTTAGGGCAACTTGGATGT ATGCTT AGGGTTCGCAAAAAGTAAACAAAA 
ATACAAGGQAAAAAAATTATTGACAATGAAL nSCI 1 1 G OT AGTGATTTGTGATTrrGTTTTTTC 
TTGATTAGTAACCAACAGCACAGCCA^ 

GAAGO j mm uCCCAACAAAGGAAACAATAAAGAAAATAAAAAGGCACACT AAAAATTACAA 
' G 1 1 1 uGGATAAGGGATTATTrrTGJ^AAAGGTACAAGGGAGTCAT 
AATAACT AAACAACAGAACCATTGCCAGACAAAAACTGGGGATATAAI^CG^TT AGAT ATTTT 

tccaaagaagacaggaaactgaccgaatgtct ACTAAAAGGTGGTTAACATTACT ACGTCAT 
. gagaaaaatggagatcaaaaccacactcagatcttatctcactcocatagaatgacattagca 

AAAGATCCAAAATATTTAAAAAGACAATCCTGGTAGTGAA iiui c i i c gggtcgnh 
>1125

ACAGAAAAAGACACATTTAGATAAACTGAAGCAGATTAAAGTGACTT^ 
ATCTTTC i TTTrATGTTTAATTTCAAGTATGGTT AAGCACT AATTT AATTCAGTGC 1 1 TGI GCTT 
ATTCTGT1 iCTAGT AACTCTTACAGAAACAAGTGTAGTCAGTAGCCAACATACATOGATGTCA 

GCCTATATATGACTTACT aggagggcttagttttttaaaagagatgaaaaataaagagaaggt 
ctagt attttcctcccacattccaacagatcatttt atgt GCCCOCTTT GGGTGAGCACATTC 
QTGTTGT AGACCATTGATCATAGTAGTCAGAGX^TGGaGCTCTGGAGTT cagaaaaattatt 

ttattattgttgtt atgacaaaaat aattaccatgt aaaggaaaagaaaagaaactcagcgaaa 
>1126



cettx^|cccaCGCGtecggB>CXSGAACCCCGTCAGGAAAGCGCAG 
cattgcagagctaccgcttcggttagcx^gccacgaagttctcgcgagagtto 
ataccaaagctggtgtgagacgagacaatcctgccccgccgcgggataatcaaagntggcgg 
gacctttgagcatacaccgagagagtgaggagccagacg acaagcacacact atggcgctg 
aaaoggattaataagttgcagcttttc acccg aagctatttatccgagggacaccgg 
gatgtaaaaggcattct ggaagctccc i nrocrn iggctggagagtgggggtggataag 
gggtgcgatggcagtaagggaaagcrraatgtacctatttttggctcattct atttaataacac 
cct tgttt cgttacttgaaatgtggtttgatagtatattatcgatttaagttgaa^ a . 

ggatttttcggagctctggga gtga gcggaacctccacctccggtggtttagtctc 

TTGCTCTAACTTCTATCXXXy^TTTTAAGATGGCGG<rrGCTTT AACTGGTTC^GGCTCTTTCC 
GGCATCTCCTTrCGTAATAATGTGAATTAATATGATTTGTTAAATGTTAACTTTTATATCGCGT 
T GTGCACT GCTCTAAGATCCTCTGCTCAAAACTT ACCTAAATTCAAACTCTT AAACTGT ATTTC 
CCTTTTTI GTGACTTGTATTTGTGTGATGTGTTTGGTTGTTGCACAATCGAAGAGGGTGGAGT 
GTTTACAGCTAACTTGAAAAAAATGTCATGGAATAAATTCAGCCTT ACTAGAAAAATTGTGGA 
GTGTCGGAACTTAGTGATTTGGCCCGTGACCCTCCAGCACAATGTTCTGCAGGTCCAGTTG 
GGGATG ATATGTT TCATTCGC AAGC CACAATTATGGGACcTAATGACAGCCCATATcaaagCG . 
GTGTATTCTTTTrGACAATTCATTrrCCTACAGACTACCCCTTCAAAC 

TACAACAAGAATTTATC ATCXS AAATATTAACAGTAATGGCAGCATTTGTCTCGATATTCT aaga 
TCACAGTGGTCGCCTGCTTTAACAATTTCTAAAGTTCTTTTATCCATTTGT^ 
ATCXiAAACCCAGATGACCCCCTAGTGCCAGAGATTGCACGGATCT ATAAAACAGACAGAGAT 

aagtacaacaagaatttatcatccaaatattaacagtaatggcagca 1 1 iu ici c gatattcta 
agatcacagtggtcx3cctgctttaacaatttctaaagttctt^ 
gtgatccaaaccgagatgaccccctagtgccagagattgcacggatctataaaacagacaga 
gataagtacaacagaatatctcgggaatggactcagaagt atgccatgtgatgct acctt aa 
agtcagaataacctgcattatagctggaataaacttt aaattactgttc c 1 11 1 1 i g attttct 

TATCCGGCTGCTCCCCTATCAGACCTCATCTTTTTTAATrrTATTTTT^ 

TTCATTCACATGCT CATCTGAGAAGACTT AAGTTCTT CCAGCTTTGGACAATAACTGCTTTT A 

GAAACTGTAAAGTAGTTACAAGAGAACAGTTGCCCAAGACTCAGAATTTTTAAAAAAAAAAAA 

TGGAGCATGTGTATTATGTGGCCAATGTCTTCACTCTAACTTGGTTATGAGACT aaaaccatt 

CCTCACTGCTCTAACATGCTGAAGAAATC*TCTGAGGGGGAG^ 

TCACATCAAAGGATACAGCATTATTCTAGCAGCATCCATT^ 

GAGATTTGAG«TTACATGATATGC7TTATGCTCATAAC 

AATTTATAGCATCAGCAGAACAGAAAATGTGATGTATTITATGCATGTCAATAAAGGAATGAC 
CTG ITCTTGTTCJ ACA GAGAA TGGAAATTGGAAGTCAAACACCCTTTGTATTCCAAAATAGGG 
TCTCAAACATTTTGT AATTTTCATTT AAATTGTTAGGAGGCTT GGAGCT ATT AGTT AATCT ATC 
TTCX^AAT ACACTGTTTAAT AT AGCACTGAAT AAATGATGCAAGTT GT CAATGGATGAGTGATC 
AACT AAT AGCTCT GCT AGTAATTGATTTA I M I ICI I CAAT AAAGTTGCAT AAACCAATGAGTT 
AGCTGCCTGGATTAATCAGTATGGGAAACAATCTTTTGT AAATGCAAAGCTGTTTTTTGT ATA 
TACTGTT GGGATTTGC l I UATTGTTTGACATCAAATGATGATGT AAAGTTCGAAAGAGTGAAT 
ATTTTGCCATGTTCAGTTAAAGTGCACAGTCTGTTACAGGTT 

TATGCAGAATTAATAAGCT ATTTGGAT AOTGT AQCTTT AATGTGCTGCACATQATACTOGCAG 
CCCT AGA GTTCA T AGATGGACTrTTGGGACCCAGCAGTTTTGAAATGT GTTT ATGGAGTTTAA 
GAAATTTATTTTCCAGGTGCAGCCCCTGTCTAACTGAAATTTCTCrrC^ 

GAGCT GAAAA AAAACA ACATGGGAGTAAT AATGGGTCAAAATTTGCAAAATAAAGTACTGTTT 
TGGTGTGGGAG 1 T G 1 CATGAGGCTGTGTTGAAGTGACTT ATCT ATGTGGGATATTGAGT ATC 
CATTGAAATGGAI IIGI ICAGCCATTTACATTAATGAGCATTTAAATGCAACAGATATCATTTC 
AGGTGACTTAACATGAATOAAT AAAAGTCAATGCT ATTGGATT G I ITTTT61 HU ACAAGTGC 
TATCTGTGOCAC TGATT T A ACTTCT GT AGT AACAAG6GCATTACCATTCITCACCTTTCCT AAT 
TCTGATCCCATAGTTTT ACA 1 1 1 1 I CCIGI 1 1 ATTTTG A1 1 1 I GI I C A CIG CI 1 1 ATTTCTTAAA 
GTTCT AGCACATCTGTGACTCCTCCACTTCCACATTTTTG GCTTACACTT ACGTGCAAT 
CTT ATTCCTTGTCTGCACACACAT GTGGAAAGCT AGAAAT AAATGTT AAAACTT ACTTTTT AT A 
AACATTTT AAT ATGT AGTTTGGACATGATTTATTGACTTAAGG I I C I I C I CI AAACTGGAAGTG 
AAATGCATGCCI I CIGAAGA TGTTCTGGCI I IGI I AATTCTGTAATCATTTCATTGGGGAAAA 
AACC AGCTAOGCAGI 1 1 i \ CGAATGAGTGAA I MM ICAI 1 1 I bid 1 1 IUCI lAAAAOGGCTG 
CTTCAGGGT AGATGTCATACTGCATAACTTTTTTGGATTCAAATTATGAATGAGAAATTAGT^^ 
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ACATTC7GCTCCACAAGGTAAGAAAAACTGCTCTTTGGCTCT 

ATGCATATAGTCTCAAAATAACAGCTTrAGTAGGCATATCACTTCTTGAAAGCCAAACATGAG 
TGTAAGACACTTTT ATGAAAC ACGGT GG ATCCCTAACTGGCTTTCAAATTG ACCTTTATAGCC 
TTAGACAACCCTTAGGTATTTACGGAGATGACTrCTTTGATTGTCATAACAATTAGTGGATGT 
GTCCAGTTCTCTGTATCTTTGACTrGATGCTTTATACATCATTTCATTrGTTGCTTCTAAGGGA 
ATAAGCCATAGAGGCTTCTCGAGGTTTAAAAGAACAGTAAAGTACCTGGAAAACCAACATTTT 
TGAATGTATGGACACTGGACATGAGATATGTACAATGAAATCTTAAAAGAATCTAAGAATTTG 

ccctctttgccccactccacccagtaatttgacattactagtgccatgtataggacccaact 
gagtattagaatcagttttgactatgtctttgtatttcctaaatctrrtaatgcataaaccgaa 
ttagggtccagttggcctgttaatggtaaatttacattttaaatgactcagtttgi i 1 1 i cctg 
ggcgagtttgcaatgtgataatcagattttttaaaactgattaatttg c 1 1 1 ci igtgtgggt 

GTACTCACAT TTTAA AGTATGAAC CACAGTTAACTAGTGGTCTCAGGGGTAGTGAAACACTCA 

CI 1 1 1 1 1 1 1 1 IGTTTGI 1 1 1 1 1 1 1 IU 1 1 1 GTTTGTTGAAATGGCTTAGTTGAAGTATACTTAAGG 

TACTGATCATGCTGrTGTTAGTAATTTGGGC^GGGAGGGGGGTAACTCAGCCATGTTTTGTGT 

TGG CATAACAAAACTGTTAATG ATTGTTGATTACACTTTTAAGTGAATTTGTCTTTT ATG AGGA 

ACCCAGTGCAAGTCACTAAATATTGTC7AATAGTGACATCTGCATAAGACTTGTAATAGCTGA 

AGTTAATTGAGCTTAAAGGAATTGTTAOCATTAAAGTCTGTG 

>1127 

GCGTCCGGAATGGTAAGTATATACCTCTTATAAAATCTTCGAATGGCTTCCTCTTTCT 
TGGACTGAAAAATGTCCATTTTCCTAOGACCTGCAACATTACTCTTCTCTTGCTTACTCTAGTA 
CTGCAAACATGG ATTCTTTATTGTTCTTT AAACAGACAAGCT CCTGGCTCAGGGCCTTTGCTT 
TCTTGGTTCCTG II I H I CCCACATACCTACATGGATCCATCTiTGTCTTTCACCTC CI I N CA 
TTTCTGCACAATTGAGTTTTCAATCAGAACTTTCCGACAGCCCGTTTTAACATTGTAAATTC^ 
TCCATCCCTGGC C I I 1 1 GAATTCCTGGATGCTCTGATTTrCTCCATCGCGCATTTTATCATCT 
GATATACCAAATATTTTACTTATTTTCTCTACTITGCTCATGCATTACAATGACATCTOC^ 
AGGAAGATGI I I I IGGAI 1 1 1 1 1 1 1 1 1 AACTGCTGTTATCCTCAGTGCTCAATAGTAGGT ACG 
CGGATGAAATTCATTGTATAGGTGATTCGTGCAACAGCTTCTGAA7TTTTTCCTOCCTCACOC 
ACXTTT ACAGCCATTAGTCXTTAT AAT AATTT AATTTGGGTTTGT G GGTATAGAAACTGTATTTGC 
ACAATAATGATAAAGCCACTCAGGCTCATCTAGTCATTTCCTGAATTTATGTGTGTAGGAGTA 
CCACAGAGCACAAAAGATGGGCACCCCCAGCTGCAGCCGMCTGTCAGAGGTGCAGGCCTC 
C CTGCAT GGATAQTGCTGGGCCAGCCAGACCCCCGAGTTCCACCCT ATCAACTCCCTGGAG 
ACTrrTTGGTCACATGCACTGCGCCCAGGTGATGCTTCTGTATATTTGAGGAAAGAGACCCT A 
ACCAAGACACATAATTCTCCATGGTATTG GAG G CCG ACCGCTGCT CAGCTGCCCT CTCTCTC 
ACCCAGTAAATGCAGAAAGCTCTTAGGAGAGACAGAAAACCTGCCATGGGGCTGCT7CCCT 
CCCTCGCAGG GCTGGCGGAGGCTCCGCATCGCCTCTCTCCTAGGATACAGAAACCATGGC 

ACCK^GCTGCCTTTCCTCCCTCACAGGAGCCACCAC^ 

GGCCCTGGCCAAAGATAGGAGGAAGAAAGAGAACT CTCCATAGAATGT AATTT ATAGATGCG 

CGTATATGTATATATATCTATTTATATGTAAATAGGCATATATGAAGATATATATATATAGTCTA 

CTTTTTAAACTCNNNN 

>1128 

>1129 

>1130 

>1131 

>1132 

>1133

>1134

>1135 

>1136 

NNACCACGCGTCCGCAAGGACAGGTGCGTGTCCATCCTGTGTTGCAAATTCTGTAA . 
ACAAGTGCTCAGCTCTAGGGGAATGAAGGCTGTTTTGCTGGC^ 

TCTCTACAGACATCCCTCGTACCAACGCAGTGGACTTCACTGGAAGATGCT ATTTCACCAAA 
ATGTGGAAA7GTAAACTGAAGGACATCGCATGTTTAAAATGTGGGAACATTGTAGTTTATCAT 
GTGATTGTTCCATGT AGTTCCTGT C I I C I I I CCTGCAACAACAGACACTTCTGGATGTTTCAC 
AGCCAGGCAGTTTATGATATTAACAGACT AGACTCCACAGGTGTAAACGTCCTACTTCGGGG 
CAACTTGCCAGAGATAGAAGAGAGTACAGATGAAGATGTGTTAAATATCTCAGCAGAGGAGT 
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GTATTAG ATAAATGC AATT ATGATATAT ATG AT ATACAAAC T T T ITT CTATTTAAAAATATATTA 
ATGGATCAACTTTAAAATTGTTAGTTGCCAGTGATCTTTTTTGGAAAACAAAAATGGGGCATT 
TGTTGATTTATTTATTTTCTGTCTCTAATTAGTTACCTCAGTTTGATTGAAGCCAGTGGAGTTG 
TGCTTTTCCTCTACTTCTACTTCCTCTCCCCCAC CI 1 1 1 1 CTGCCCAGTGTAGGTGTATTCTTA 

aattcagacgggaagattcttrcacatatcactcagttacctcccaatctgggggagtttttc 
ttacaacttgataccagataccattaattttacattcctgaataaaggcctagtacccacgca 
tatttcaaccatgcatatatcaagttcaaccgagttrraataggggattaaaaaaacaagctg 
ttaggtttccatgggcactggttctcataggttctattggtgataactgctttaacatggagc 
aagagtttgtgaatcaggaaatag^ataaattaaaatttaaaatatatagaggaatcctcttga 
ttgctcagcatgatgttagataaatgagtttgtcagaaaatatcagtatacgctgtttaccaa 
tgttatttatttacattcttctaaaggcattatggatattgtattatgagagctaaacctaaata 
agttatcctgttccctaggaccttctctgtaaatagtgaattttagacgagtagtctgtccta 
aatcttaaatagaaaaaaaactaaagcgatttgctt aagccattgtacattataaagagct gtt 
ttgttrtgctttgctttgctttgttttg 1 1 1 1 it i aaag ct gcattca gag ccacaaag g aat a 
ggaaagtagggtagtgttggattctggtttratgtaactct aaaataaatgtatctctttaata 

TCTCAGTTGTAGGGATTTTGTCAATACCAAAGCAGACTGAGTTGTGGTTTTGTAAATAAAGTT 
TTTTCTAAAAATGACCATTCTTCCTTTAATTTTTTGTTATGCCCACATATTGTATC 

aaataaatagtacttaaagtattgggtgggggggaaacacccccccccccaaaagggggcc 

AAATTTTTGTTGTGGTG GGGGCCCCCCCCAAAAAAG GGGGGGGGGGGGCCCCCCCCCCNN 

NNNNNNNNNNNNNNNNNNNNNNNNN 

>1137 

>1138 

>1139 

tct aaaacatgctaacctgaccttttccttctaa i norm i a acttcattcgagata 

GTACCTCTCACTCACAGATATTTATTTGGTTATCAAGTGAAGATAGGTGTGTCTAAAAGTGAT 

CTTCTGAATCCTGTCTCCCTAGAGGTACTAGTATCTAGAGTTTACCCAGAAAATTTTATGATT 

GTAACAAAAGG AAGTAGTGACTT ATGAAGGTTTTGTTTGTTG AATTTTA C 1 1 I I GCTACTTGTC 

CAATAGTGGCTAGTTTACATTTATGGGAAATTTTATAGTTTGCATAAACAAAGTATAAACTTAA 

GGCACAGTGAATAACTATATTGATAAAGATAAACTAGCCACTATTGGACAAGTAGCAAAAGTA 

AAATTCAAGAAACAAAACCTTCATAAGTCACTACTTCCTTTTGTTACAATC^^ 

GGTAAACTCT AGATACTAGTACCTCTAGGGAGACAGGATTCAGAAGATCACTTTTAGACACA 

CCTATCTTCACTTGATAACCAAATAAATATCrGTGAGTGAGAGGTACTATCTCGAATGAAGTT 

AAAAACAAATTAGAAGGAAAAGGTCAGGTTAGCATGTTTTAGAACTATTGGTAAACTATAATT 

CATGGGACATTATATAATCAAAAGATTAATATTTTAAGCACT AAGTTATAAAGGGTTTACACCC 

ATGAATAAAAAGATTACCATCACTTACTATGAACCACCATTCCATGAATGCATGTAGCTGAAC 

ACTCCTAATGAAAAGTTTAATTATCCTTCAACCTGTAGTTGAAGAACTCAGTTCATGTT CATTG 

ACAGATTTCCATTACAGACCCACTATATTGATGTTAC I MC I 1 1 G ACACTATATTTTATATAGAT 

ATATTAAAATTGAAAACTT AATGCTGTTT AGAAGGCT ATT AAT AT AACT ATT AATTTCTGAAAG 

CTTTGAGTTTCTGAAAAGGCrrrTAAGATCAAA^ 

CCCACATTTAATTATAAATCAATGTTATACTGATAAAAGGTTCTATACACACATTTAGAGATAT 

ATGTGTGTGTGTGTGTGTCTGTGTGTGTGTATATATATATAC I J 1 1 1 ICCCCCCAAGATGGAG 

TCTCACTCTGTCGCCCAGGCTGGAGTGCAGT GGTGCAATCTTGGCTCACTGCAACCTCGAC 

CTCCCTGGTTCAAGTGATTCT CCT GGGAGGCAGAGGTCGCAGTGAGCCGAGATCATGCCAC 

TGCACTCCAGGCTGGACAACAGAGCAAGACTCCGCATCAAAAAAAAAANAAAGATATTTTAA 

TATATTCAN 

>1140 

ACCAGATTATGGACTCTGG 1 1 C I GGTGTGGGTAGT AGGTGGAGGGT AGCCAGGAGG 
GCTTGGGGTGGGTCATCACCTCACATTTGAGATGGGG 1 1 1 1 ATTTTGCAGATTCATGCATTG 
ATCACAGGCCCATTTGACACTCCTTATGAAOGGG G I M C I I CCTGTTCGTGTTTCGGTGTCC 
GCCCGACTATCCCATCCACCGACXTrCGGGTCAAACTGATGAGAACGGGCAATAACACAGTG 
AGGTTTAACCCCAACTTCTACCGCAATGGGAAAGTCTGCTTGAGTATTCTAGGGTAAGAGGA 
GACTTTT AAGT AGCCAAG T CGGTTGTT AGGAGAT AATTACTCTAG GTCAG CCTTT ATCAACCG 
GAGTCCCTCATCTGAACT ACAGAACACAGAAAATGATTGAGTGACTCTTCTGAAATCTCXTCA 
GGA7 GGT AT GTGACT AGTA1 CATTCTAGATGCAGAGGGGAGAAGTT AATTT ATT ACAGTGGT 
AACCTTTG AGAAGTGG I I C I C 1 1 AAGAGTGTGGCCCTGAGCATCTGGGAACTTGTAGCCGAG 
CAGTrrCTGGGGCCCTATCTTAGACCTACAAAAGAAACTCCTGGGTTGGGGCCAAAAATCTG 
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ATTTCTTAGTGTTCCT AG 
TACCTAGAAAACAATGTTGGG 



G 1 1 1 1 1 1 iGGTTGTTG 
TTGACTGTATTAGCTTTATCAGCCATGT 
CTGACTTACNN 
>1141 
>1142 

CCGATATAGCAG AAAGCATGAGTTTTGTTGTGAGACAGACTT AGCTCTACAT C I MI A 
TGTGAGACTTAACCTGTTTnTCTATGGGCAAATGACCTGAACTCCTGAATCAGTTTCCCCAT 
CTGTAAAGTGAGG ATG AAAAT AGTATCTACTTTATTG AG A I M MC I I ATTCATCTGTAGCATC 
TTTGTGT AAT GTCA TATGGATGTAACTTGGTTGGTACTATT AGCAACT GTG ATGATGATGATT 
GTGAATCTT ATTTTCATATCTTGGG M MCI I ACAGTGAAATATTTGTTGCTGTTA I 1 MCM I G 
TAAAAATAAACCATGTTTG CATCT TGGTCTTCTTTCCATTTGGATTCAAAAGTTTTATAGTGAT 
TCCTGCTAGTAAAATTGCATTTTCTCCCTAGGAGTACTTCTTTGATTCCAGAGTATTAACAGAT 
GGAGAACTGGCTCCCAATGATAGATGTTCCCGTGTGTGTGGAAAAATAGGGCACTACATGAA 

agactgcacctaacaaggaaaaggttggcaaatgaatatatatcaacatttgaggaaaataaa 

TGAGAGGCCAATTTGTAGCTATAATTATGTTCAGAAATGTTTAACTGAGAAAGTTTAGGGTAT 
TTAATTATACCTAAGGTAAATGTTC^TTTGAGTGGAATTTTGTAATAGGGTAAGATTTGCATGT 
GGTATAATAGTATCTT CTAAQATTCATTTGTTCATTTATTCATTAATTCACTCAG I M 1 1 Ml I G 
rTTTTGTTTn GTTTTTATTTTTG I M M M I GAGACGGCATCTCGCCCTGTCGCCCAGGCTGG 

GCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGCACACCACAATGCCCAGCTAAATTrTTG 

TATCTTTAGTAGAGATGGAGTTrCACCAACATTGGCCAGGCTGGTCTCGAACGCCN 

>1143 

>1144 

>1145 

>1146 

ACCAAGGTGAAATTTGAATGTGTGAACGCATTGTTCTGTGGAGTTCTTTTCAAAGAG 
ATTTCAAAGCCACAAGTTAGATAAGCCCAAGAAGTAAGGCCAGAGTGAGATCGAAGTAGGC 
CM I CM M AAAAAATAATAGCTTTTATTTTATGTCAGTATCI I CI 1 1 ACAAATCT AACCTTCCC 
TTlTCACGCTTmGAAAAGATAGCTAAAATTCAGTGTGTTCCTCTTATTATAAAGGATTGGGC 
TAATAGTTAAGCArrrCAAAACATTTCAGTTrCGTTAATCAGAAGCTGCAGTGGGTT^ 
TAGCCAGTTTGCTTTTAAATTTGGCCATGTGGGCTTTAAGTTCAACGTATTTGTGTTCTCTTTA 
TTGTTACTCTCTCCAGAGTATTACCCAAACTGTGAAGTCTGTGTTTATGGGGATGGCAAACAT 
TCATTCTATTCGGAGGAGTTTTCAGTCTCTGGGGTTGCTGTGCACTCAGATGCCAGATCCGG 
GAAAGTAAGN 

>1147 

CCAGGGGGGGGAATTTCCGCCCAAAGGGGTTCCCCCCAAAAAACC C 1 1 M M I M M M M M 
TTMTfTCGCTATGTAATACTGTCTAATAATTGAGACTTTTAGAATCTA 

TCTnTGGGCTGGGAATTCATCAGGAACTGGGT CT GTAGGAACCT ATTCTCGGGCTTCCCAT 
GGCCTTTGATCTCCTGTTACGGTCtXTTCCACAAACATAACATGAGGCGACTT GTAGAGACT G 
GGCTACGTGTTCGGCTAATTGCAAAAACAAATTTTTAGTTTTTCCTGGAA 
CACATTTAGTTCATCATAGAAAGTCTGAAATACTGGTTCTGGAGAGCGTTTTT 
TTTTATTAGGATGCTTACACT AGGATCT AGTCCTTTTCCATCAATGCCT AATGATACAT ATTTT 
CXTTTTATTACATTTGGGTCTGAGGGGTTTGTGATTATCAATTCTA 

TCGT GCAGGAGGGGCTGACTTTTCCTTTTT GGAGCCAAACAGGATCTTTTTT ATCTT CT GTTC 
AAGTAGCCCAAATGACAGAAG^CX>GTATTGACACATCTCACATAAATATGATTCTTGACAGA 
TATACTTAI 1 1 1 M 1 1 CTGTGTAAC 1 1 1 II I T CCAATTTAGAGAACGGCATCCT ATCCATGCTG 
CTTACT ATCAATAGTGGCACAAGCGTCAAATTTTAAGGTTAiCATTTTTG&GAOGCC' I MUCH 



AGACTGTGATCATGGGAGG1TCAAAGGGGTCATAGCACACATCGGGCTC 
ATTACATACTTTATACTGAGTGTTATTATACAAACAGG I rocm 1 GGAGTTCCT AGGCATTCA 

taataactataaaataataggactgtagcaatcttttgtcct ACCTCAGTGACTTGATGTATAT 

ACTGGAAACAGTCTCAGTCTGAGGAAGGTCAGTTGAAGTCCTTACTGT AGAAGTCCAAATTT 
TAAGGAAAATGAGTCCCGCGATGAGTTTCCTCATGCTTCAGCTGTGCCT 
TCOGGGTGTGACTGGAGCAGGGCTTGTOGT C MCM CAGAGTCACTTTGCAGGGGTTGGTG 
AAGCTGCTCCCATCCACATACAGCrrCGCAGTCTACTGATGTTTAAGGTTGGTCTC^ 



GGGCCTACT AGGGTAG ACTGAGTCT AAGACCTCTACACAGTTTATGTTCAACTGGGCTCTCT 

GATACGAGGAGTAAGGTGTGCAGGGTTAGGGTGTCGCAAACTTCAGTGGTTATGCGGGAAT 

TTTCACAGAG(^AAGCTTTGGTATCTAGTTAGTCTAGGATTCGTTAGCrrAGGATGCGGTTTGGT 

ATTTATTAAAATCATCACAGCATGGGGAGACTTTAGGTITTGCCCAAGAGTrAGCTTATCTGC 

TTCTTATG CTAACAGGGCC ATTGCTGCCAGGGCCCTTAGAGATGGGGGCCAGCCTTTGGAA 

ACCCCATCTAGTTGTTTTGAGAGATAGGCCACTGGCCTTGGCCAGGGCCCTACAGTCTGGG 

TTAAAACTCCAACTGCCA I I M M MCM M I GACACATGGAGTGTAAAGAGTTTTGTCAGGTC 

AGGTAGCCCCAGGGCTGGGGCCGACATGAG I I I > I CI II 1 1 AACTCATGAAAAGCTCGTTGC 

TGTTGGTTGT AAT AGATGTAGTGTATCT AATTT ACATTTTTATTGACTGTCATCTACCAAAATA 

TTGACTTCAGTTCCTGTAACTATTTGATTTCAAGCTTTAAA 

CTCCAATTGCATCTAAATATACCCGCGT 

>1148 

NNNATGTTAACAGGTAATGTAAGAAGTTCTCTGAAATGTCAGCAAGTAAGTTCTGAA 
ACACATCATGCATGAGTAGGAATAAAACCCAAGTTCCCCATAACGTAGATAACTTAATGCTGC 
ATAAAAATATG AAAGTGT AACCCATGAAGGACA C I I I MCI 1 1 CCACT GCAAAGTT AGCCACT 
TTGCTGI M I ICCICI 1 1 II i AAACTTTGAAAATAGACTCTTTCCAGAAATTGGAGCAATAATG 
GTGTT ACCACACA CAG ATT AAAT AATTT GTAGAT ATTTT AAGTGACTTTT GGGCAAAACTGGA 
ATGTATACTTTTACCTTGTTTCAAACAGCTAAG^CCAGTAATTTAAAAATTACTAAAAGGTTTA 
CTTTGTTCATTAATAAAACATTTAACAATTCAAATTATATGCAC^rrTTTACCT 
TACACATTCCTGTTTTCACATTATAGCAACTGATTAAGCTGAAGCTGTAAGT 
ATGAGTGATCCG CATCT CCATCAATT AGAACACT GGAAAAGATGTTTTATAAAAGAGGTATTT 
AATTTTGTTTGTAGGATTAACTCATGCAAATAATAAAAAAGATATCCTGTTGGTTCAATAGTAC 
ACTTGCNNN 
>1149 

NNAAAATGGAATCTGTTCGGGCATATTTTTA I 1 1 II 1 1 1 1 1 1 1 1 1 ACTGCCTCAGGTAT 
AATTTTG CAAT GG CG GTTTCGCAACTTG CACTG TCCT GACT ACCTCTGTT ATCATAACTAATC 
TAGGTAAACAATTAAAATAGGTAAATGTAATGCGATAAATACTTGGGGACAAACTGGTCATAA 
TTTAGAATCTCAAGCTATATTAAATAATAGATATTTCATTATTTGGGTATTTTCCAATAAAAATG 
TATTGGAGGAAAACCTTTCCCAAAAAAAAGTGTAACCTTTTTAAAAAGGTGGAA 
CCT AATTCAAAGCTTATTTAAAGGTTTTGTGTAAAACAAGGTAAAAG AACCATGAAAT AAGAA 
AGATGTAAATAAAG GTACAAAAATAAAGTGAGGATTTTTGG 1 1 1 1 1 1 1 1 II I GGTAAGAAGGCT 
TAAAGAAAAATAANNN 
>1150 
>1151 
>1152 

ACAAG CAAGACTTTCXrTTTAATATTGAT AAAG AATTGAGT ATCATGT ATGCATTCCCT 
TTTATGATATACAATTAATTGTAAGTTATTTCCCCTT GTATGCAACCATCCACA I 1 1 I ICMCTG 
ACCTTTTGCTCAAGTCTT ACAACT A C 1 1 1 1 AATGACTGCATTTTGGAGGTGGTCCCAGGAGAA 
CAGATGTTQCCTT ATAATGGTG Mill CC A I M H II C M I GATTGGGCAAGGGGGTTGGAAG 
TATTATTTAGTCATTATATGGATTCCTCTAAAAATTGTTCAATAGAATATCT 
CTT ACTTATTGTTTATTTATTGCCT AGAGTATACXXAAGACTGAGGATAEAATAATGATCAAGA 
CAGGTCT AATTTCrGTCCCATAGAGCTT AAATTGT AATTAGAAAGAA 1 1 1 1 1 1 IAAATAANNNN 
N 

>1153 

AGTTAGTGTCATGTTCATC^TTAGCACCATCATCGTTTTT ATCACATTCACCCATGGA 
TCACCGGT ACTACATAGAAAGGGCTTGGAAGTCTGATTCAGGAAAGGAAATCAGGAAAGAAC 
AAAG GAAATGAAGGAAG AAT AAAAAAGAAGAGAAGTCATTGAAAAAGTATGAAAAAATATGAA 
ACAGATAACAAGAAAGTAGAGGAGATTCCAAAAAATAGAACCCAGGTTTT 
ATAGAGTCTTGAGAATTGTAGGGTGTAAGAAATAAAGAATCAAGTCTGAGAGATCXX^ 
TTCI M C I I G I C I CA CTGATCTGGA ACCCAGG T TGOCAGC TGGGT C ^ 

GTTT CATGCTGAGACCAGGTAGACAAATGTGTAAATGCATGCATTTCCTCXJCT . 

>1154 

>1155

>1156
>1157

^CAGGCTCCTGCCTTTAAGAGCACTGTTTTGCTTTTGGGGCAGAAAGCATGQACTTT 
TAAAGGGGGACTTGGCATGAATGGCATTCAGAGGAGGGAGTGAGCAGTTGGGGGTCTGCG 
TGACTCGCTTTCGTGCTTAATCTACTGGTGGTCGAGCTGGCTGCATCACAAGCAGAGCTAG 
GTTGTATAGTGG CCTTTGTCTCAAGACACTCTCCAGGTGGGAGAGCCTTCCATCAGGGACAT 
ACTTTAGGTTGCAAATTGACTGTTGTCTCrrGAGGCAATCTCCrrGTGGGAGAGAGTTTCTG 
CCGTGGAGCTTCAAAGTAAGCaCGTAGTTAGATAAGCTTCCAGTGTAGTGAGT GTCTGGTGA 
AGGGAAGGTAAAGGTTATGATTGCATTTCTGAAGAGCTAGGTAGGAAATGGGAACNNNNNN 



>1158 

GGAGTCGACCCACGCGTCCGCGGACGCGTCGTT7CTGCTGACTCCAGTGTCCCGA 
GAGGCGCCGCTrCTTCCGCTTTCTCGTCAGGCTCCTGCGCCCCAGGCATGAACCAAGGTTT 

ctgaactactgggcgggagccaacgtct c i ici i i ci cccgctctggcggaggctttgtcg 

ctgcgggctgggccccagggtgtcccccatggcggggccgcgggtggaggtcgatggca 

gcatcatggaagggggcggocagatcctgagagtxrrctacggccttgagctgtctcctagg 

cctccgcttgggggtgcagaagatccgagcgggccggagcacgccaggcctgaggcctca 

acatttatctggactggaaatgattcgagatttgtgtgatgggcaactggagggggcagaaa 

ttggctcaacagaaataacctttacaccagagaagatcaaaggtggaatccacacagcagat 

accaagacagcagggagtgtgtgcctcttgatgcaggtctcaatgccgtgtgttctctttgc 

tgcttctccatcagaacttcatttgaaaggtggaactaatgctgaaatggcaocacagatcg 

attatacagtgatggtcttcaagccaattgttgaaaaatttggtrrcatatttaattgtgacat 

taaaacaaggggatattacccaaaagggggtggtgaagtgattgttc^^ 

aacaatt gaaccctat aaattt aactgagcg tg gctgtgtgact aagat atatggaagagctt 

tcgttgctggtgttttg<!catttaaagtagcaaaagatatggcagcggcagcagttagatgc 

atcagaaaggagatccgggatttgtatgttaacatccagcctgttcaagaacctaaagacca 

agcatttggcaatggaaatggaataataattattgctgag^cctccactggctgtttgtttgc 

tggatcatcgcttggtaaacgaggtgtaaatgcagacaaagttggaattgaagctgccgaaa 

t gcta ttagcaaatcttagacatggtggtactgtggatgagtatctgcaagaccagctgatt 

gttttcatggcattagccaatggagtrrccagaataaaaacaggaccagttacactccatac 

GCAAACCGCGATACATTTTGCTGAACAAATAGCAAAGGCT AAATTT ATTGTGAAGAAATCAGA 
AGATGAAGAAGACGCCGCTAAAGATACTTATATTATTGAATGCCAAGGAATTGGGATGACAA . 
ATCCAAATCTATAGAGTATrrGCCTCTTAAATGATACCTCATTGATATATTGCACTATTTCATA 

aatactataaaataatgactaggaagtaacttattaaaggctatgacttaaatttgaagatgaa 
ctacagtgttctaggtttgctgagaaggcttcattaaattaatct 
agagatggacaatgaaatatc^gttggtggatatgtgtgatagctgatttcaatattgaagta 
ttgaaat aaaat attctttacacctgaagtaaaaaaaaaaaaaaaagggcg gt cg tt agact a 
gtttagagaaagaaccttccacann 

>1159

>1160

>1161

. NNHAGATCOCAGAGGCTGAACACCTCGACC U O J C I GCACAGCAGATGATOOCTGA 

gcagctgaagaccagaaaagccact atccggaggctgaacacctcgaccttctctgcacag 

caggtccagcatoctttgaagcatgagttcttaocagcagaagcagaoctt^ 

ctcagctt caacagcagcaggtgaaacaacccagccagcctccacctcaggaaatatttgt^ 

CCCACAACCAAGGAGCCATGCCACTCAAAGGTTCCACAACCT 

AGCCAGGCTGTACCAAGGTCCCTQAGCCAG GCTGTACCAAGGTCCCT GAGCCAQ GCTGT A 
CCAAGGTCCCTGAGCCAGGTTGTACCAAGGTCCCTGAGCCAGGCTGT ACCAAGGTCOCTGA 
GCCAGGTT GT ACCAAG GTCCCTGAGCCAG GCT ACACCAAGGT CCCT GAACCAGGCAGCATC 
AAGGTCCCTGACGAAGGCTTCATCAAGTTTCCT GAGCCAGGTGCCATCAAAGTTCCTGAGCA 
AGGATACACCAAAGTTCCTGT G CCAGGCTACACAAAGCTACCAGAGCCATGTCCTTCAACG 
GTCACTCGAGGCCCAGCTCAGCAGAAGACGAAGGAGAAGTAATrTG^ 
CTTGAGAAGOCAAGCAOCAGATGCTGGACAGOCTCTTGOCATCTG1 1 rCTGTGTCl IAATTGT 
CTGTAGACCTTGT, AATGAGTACATTCTCACCCCAAGCCATAGTCTCTCTCTTATTTO 
AAAAATACGTACT ATAAAGC 1 1 1 IU1 1 CACACACACTCTGAAQAATCCTGT AAQCCCCTGAAT 
TAAGCAGAAAGTCTTCATGGCTTTTCTG^ 

AATGAAAAGAAATGCATGTTTCCTGCTCTGCCCTCATTAAATTGCTTTTAATTCC^ 






AAGTCGACGCGGCCGCGAATTTAGTAGTATCACATTTGG<XJCAAACCTCAGGATTCTCCGTC 
TGCCTGTCTTACTTCATGGT 

>1162

>1163 

>1164 

>1165 

>1166 

>1167 

ACTTTTCTGTCTTCTAATTTTTAAAATTATTAATGTCTTCTATTTTTCT AAGGCTGATTT 
TTTCT AATGTCTGTATTTTTCC I 1 1 1 1 1 1 CACATCTTGACATAAGTAGAGTTCATTTATTTTCAT 
TTATTCTTGTATAATAAAATTACTTAAGGTTAGGAATAATTAAGTTTTGCTCCCATGTrTTTATG 
TGTAACAATCTCAATGTTGTATGTCATCTACTrCAAAATTTCAAGCTTCCCCTTT AAAAT ACTG 
TTTAAAAAACTTTATGAAACCAGTATTTCTCTCAACCTTTGTGTAATACCTGGTTTTACT^ 

gtggtcagataatttaacctgtactggatcggcagtgccttcggactgtctatttgacctgc 
agtccaacct atgg cctttct ccttttgtctctagttcattctctaaccaccaaccatgaattt 
cagggaactu i r i icicattctctttgttttgtggccactttcacaatgtagaaggaaaaaac 
caaatgaccccactgtga 

>1168

ecGGGTCCGCCTGTAGCAGAlTGGGTTTcaaaaagttcGGATGTAAACATTGACATCCAG 

tttccagccattatgtctcagccagatgtcctctt acttgttcaagaatgtttaaagaacagt 
gactcctttattgatgttgatgcagacttccatgctagggtgccagtggtggtgtgcagaga 

AAAGCAAAGTG GTCTTCT GTGTAAAGTGAGCGCAGGAAATGAAAATGCTTGTCTGACAACAA 

agcatttaactgccctt ggaaaactagaaccaaagctggttcctttggtgattgcatttaggt 

act gggcaaagctttgcagtatagatc^ccctgaagaaggaggtctgccacctt atgtgttt 

gccctgatggccattttcttt ctt cagcagag gaaagaaccccttttg cctgt atatctagg 

atcatggattgaaggattctcattaagcaaactagggaatttcaacctrcaagac^ 

agatgttgtgatctgggaacatactgacagtgctgcaggggacacaggcataacaaaagaa 

gagqcaccaaqagaaacggcgattaaaaggggacaggtgtcattaatattggatgtgaaac 

agcagoctrcagtaccagttgggcagctctgggtggaattgctgcggttctatgcttt agaa 

tttaatttggctgatttagtgataagtattcgtgtgaaagaattggtatctcgggaattgaag 

gattggcccaaaaagcgcattgccattgaagatccctactctgttaaaagaaatgtggcaag 

aaccct aaatagtcaacctgtgtttgaatatatacttcattgtttaaggacaacatacaagtat 

t^ggtcttccacacaaaattagaaaatgcagccttctaaaggctct gaatgcaattagatgt 

atttcagaacatt ctaaa gaagtaataaatcatcatccagatgtacaaacaaaagatgataag 

ct caaaaactcagtt^ggcccaaggtcctggtg ct accagttcagctgcaaat acct gt aa 

ggtacagccacttactctt aaagagactgctgaaagttttggaagcc^agcaaaagaagaaa 

tgggaaatgaacacatcagtgtccaccctgaaaactcagactgtatccaagcagatgt^ 

tctgatgattacaagggtgataaagtataccatccagaaacaggaaggaaaaacgag 

gaaagttggaaggaagggcaagcatctgttgactgttgatcagaaacgtggagagcatgtt 

gtctgtg gcagcacacgt aat aatgag tcagagagcactttggattt agaaggctt ccaaaa 

tcccacagctaaagagtgtgagggacttgccactttagataacaaggct 

aaagtacagaaggtactgaggaactagaagactctctaaac<>cttta 

ggccagacatcagaaatgattcgctctgatgaagaggagg*gga 

gaggaagaaxiaacctaggctcaccattaaccaaagggaagatgaagatggcatggctaatg 

aagatgagttagacaacacctacactgggtcaggggatgaggacgccctatctgaag^ 

tgatgagtt aggcgaagctgct aagtatgaagacgtgaaagaatgtggaaaacatgtagaaa 

gagctctcct agtggaactt aataaaataagtctcaaggaagaaaatgtatgtgaagaaaatt . 

CACCTGTGGATCAGTCTGAI 1 1 1 1 1 1 1 atgaattcagt aaacttatcttcaccaaaggcaagt 

ctcct acggtagt gtgcagctt atgcaaacgagagggtcatctaaagaaggactgtcctgaa 

GACTTCAAAAGAATCCAGCTAGAACtXTCTGCCACCAT^ 

gatcaagtctgt atccagtgttataaggatttttctccaacaattatagaagatcaggctcgt . 
gaacatattcx^gcaaaacct agaaagtttcat aagacaggactttccaggaactaaattgag . . 
cctgtttggct cctccaaaaatggatttgggttcaaacagagtgaccttgacgtctgt atga 
c^ttaatgg^t tgaaactgctgagggattggactgtgt cagaactattgaagaattagca . 
agagtcctcagaaaacattcaggtctgagaaacatcttacctattacaac^ 

^^GTGAAGTTCTTCCATTTGAGAAGTGGTCTGGAAGTAGATATCAGTrrGTATAACAGATT 
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ggcccttcataacacaaggctttt atctgcttattcx^ccattgatcccagagtgaagtattt 

gtgctataccatgaaagtatttacaaagatgtgtgatattggtgatgcatctagaggcagctt 

atcatc^tatggatatactcttatggtgcrratattttctccagcagaggaatccaccagtcat 

tcctgtccttcaagagatatacaaaggtgaaaagaaacctgaaatatt^ 

tatttai 1 1 ii i tgatcaaatagatgaactgcctacctattggtcagaatgtggaaaaaataca 

gaatctgttggggagttatggttgggcctt cttcg tttct acacagaggaatttgattt^ 

gaacatgtt attagcat(^ggagaaaaagtctgcttacaactttt aagaaacagtggagctca 

aaatacattgttattgaagatccctttgatttgaatcataatcttggagctggattat(^wjga 

aaatgacaaattttataatgaaggcttttatcaatggtagaagagtatttggtattcctgt caa 

gggatttccaaaggactacccctcaaaaatggaatacttttttgatc^ 

ag6agagctggccccaaatgatagatgttgtcgaatttgtggaaaaatcggacacttcatga 

AGGACTGTCCTATGAGGAGAAAAGTAAGACGGCGGCGAGATCAGGAAGATGGCCT gaacca 
AAGATACCCTGAGAACAAXaGAAAAAAGAAGCAAAGAGGACAAAGAAAITCACAACAAGTACA 
CAGAAAGGGAGGTGTCAACAAAAGAAGATAAGCCCATACAGTGCACACCTCAGJVAAGCCAA 

gccaatgcgggcagctgctgacxrrggggagggagaagatcctcaggcgaccagtagaaaa 

atggaagagacaggatgacaaagacttaagagaaaaacgttgttttatttgtggaagagaag 

ggcacattaaaaaggaatgcccacagtttaaaggctcttcaggtagcc^ 

tgactcagggaaaagc^cagcgaagaggacccagcaggaatcatgagggaaggaaaatg 

cagcactctaaatggccactcaggcgttcctattc^ctcggaaaattaggt^ 

GACACAG<XAGTGT"AGATCAGG<rrTCAACTTAACATTTAAGGGAAATGTCAGA 1 1 1 1 1 1 H I A A 
TTTAATGAAATTGTTAATGAGGAAAAATTTTTAATATAGTCTrA 

ATTTAAGGATTTTAATAGAAAGTCATGATGTATGTATTTAAG(XACGTTAAAAGAAAAAATATA 
ACT ATGGACCGGT ATTCAGTGAATACAGTTTCATGGTTTTTAATTCTTTCAAAG(^CATTAA^ 
ATGGTGTGCT GATAAACCCCAAGTAAATT AACCCTTTTT CCGTAT AAATCCATTTTTTGTTTTG 
AAGAGGGGAAATTAT ATTTATTGTTGTTT ACT GAAT CCT GGTGTGAAAGCAT ATCAGAT ATGT 
ATGAACTCCTACTGCTGTACTTCCGATTTACGGACATCATTTTATTGCT ATTTGT AGACGTGA 
TAACATGAACATGAGTACCT ATTT ATGTGGGCCTTCAGTGGATGGGCAGTGCCACTCAGGTC 
TCTGGGGTTTCCCTCTC 1 AATTTTAAGTAAATTGACATATAACT ACT ATGCTTATAAAAATGAA 
GTA AGGAAAACAAGT AGTCCTGTITG>CCACTAAAAACArmCAAA^ 

GTACI 1 1 1 1 ACTTTTT ATGATAjCTCAGAAATT AGGATGAAGAACTTTT AAAATTGCT GAAGATC 
AAAGAGGTTATCTCTGCCAGTCACAAGTGTGGCT 

CTC CTGG ACTGI 1 1 C fTT AATTTCAAAAGCOCTGCAGACATAGTACCTGGTCAGAACTATGCC 
TCGGTTT ATTTATCATTTTGAAATAAAATCAAAATTTCAACCTGTAAAAAAAAAAA^UUSttafl 
>1169 

GCGTCCGAAAAAAAAAAGTCCTAAAAATGGAAAGCCTTCGAATGGTTT ATAAAGAAG 
GCAGTGOCTTGATCTGATTTGTGTTATAAAAATTTCAGTCTGGC^ 

TGGAATGGGGAAGAGAAGATGTTGGGAAGCTAACTCI CC I AGCAAGAATGATAGAATAGCTT 
ACAGT AGGGT AGTGGC^GTAGTGGAGGAGATGGAATAATGGAGATTCAGGGGXTrTGAATTT 

gttaaccatttagtt CAACAATATTT ATTGATTGGC I IC IC1 g ttgtgagcatatctcgqaga 

GGAAGAAAAACAAGCCAGGAT AATCC<XIAGGTTTCTGGT GT AT AAATGTTGGT ACTACTCTG 
CC^TGGAATGGAAAAAGACTT GAGGCCCTGAGAACTGAACX^TGATTAGGAGCCTTGGAAC 
AGTAGCAG iiui C ATC fGTS I AAAT AAGGTTATTGATTTCTAAAAGGTCT ATGTTT AATCAGAT 
• GGAT TAGTCATICIO IUI | rCACACCTGTGGTGACT AATTCAAAGGAAAAGOCCTATCATTTT 
GCCAGCT GGTGT TTAACAGTTGCI I fGI 1 1 I C I G I UAAACCAGGTGT ACTGCATT AGAAAAGT 
TTATTCAAm I 1 T AATCATTAAGCATTGTGATAAATTGT AAACTT AQAAATTGTTTGGTTT ACTG 
TAT TAAAA ATGTTGGATTCCTGTAATTC<>GCATTTTG^ 
TGAGTTT^AGAGTTCAAGACCAGTCTGGGCAACATGGT 

AAAAACAATTTGGCCTGGGAGTGCAOGC 1 1 G 1 GGTCCCAGCTACCCAGGAAGTTAAGGTGG 

gaggtttacttgax5cccagaaggttgaggctgtga 
agcctgggcaacagagtgagaccctgtcnhnknn 

>1170 
>1171 

GT6GCGGCCGCCCGGC*>GCTACG<23GGGGG 

aaagggaccatggtgcggcct gtgagacat AAGAAACCAGTCAATTACT cacagtttgacca 
ctctgacagtgatgatga 1 1 1 1 G 1 1 1 C ! gcaactgtaccttt aaacaagaaatccagaacagc 
accaaag gagttaaaacaag at aaaccaaaacctaacttgaacaatct ccggaaagaagaaat

cccagt acjvagagaaaacccct aaaaaaaggatg gcttt agatgacaagctct accagagag 

acttagaagttg(^actagctttatcagtgaaggaacttccaacagtgaccactaatct 

aactctcaagataaaagcattgaaaaacatggcagtagtaaaatagaaacaatgaataagtct 

cctcatatctctaattgcagtgtagccagtgattatttagatrrggataagatractgtggaa 

gatgatgttggtggtgttcaagggaaaagaaaagcagcatctaaagctgcagcacagcaga 

GGAAGATTCTTCTGGAAGGCAGTGATGGTGATAGTGCT AATGACACrGAACCAGACTTTGCA 
CCTGGTGAAGATTCTGAGGATGATTCTGATTTTTGTGAGAGTGAGGATAATGACGAAGACrrT 
CTCT ATGAGAAAAAGTAAAGTTAAAGAAATTAAAAAGAAAGAAGTGAAGGTAAAATCCCCAGT 
AGAAAAGAAAGAGAAGAAATCTAAATCCAAATGTAATGCrrrTG^TGACTTCGGTGGACT 
CTCCAGCTGCCGTCAAATCAGAATCTCAGTCCTTGCCAAAAAAGGTT^ 

CCACT AGGAAACCATTAGAAAT ACGCAGTC^TIT^GCTGAAAGGAAGAAACXrrAAATGGGTC 
GCACCAGCGGCATCTGGAGGTAGCAGAAGTAGCAGCAGCCCACTGGTGGTAGTGTCTGTG 

aagtctcccaatcagagtctccgccttggcttgtccagattagcaogagttaaacctttgc 
tccaaatgocactagcacctgagtgtggtacaggaggaatgtttggttgggagaatcagag 

CTTTACAAGGGTGTTTATATTTG^TTTGTGTTTATATT^ 

atccattaccatgtcctataaatgacctctagccattttato 

tcaagacttcaatgagaagtttgtttataagaattatcttct catacctttccttgtgaagagc 
qtattct gtrrrr ctat cagt tcgacatgaagtccacatcacatgct b \ i ci i i ic tagttaca 
tgatgtgcctttctagctttgtcrragtttatagcacctt aacttt aactgttcagttttatctg 

GCAGAGGAAAACATTCTTATTTCTTTCAGAAGACATTTCTT^ ataagctacttaagct 

ACGTTGTCAGTTTTATCGCAAAGATGTTrTGTATT^^ 

AGAATTATTTTACACACTAAAATGGTTGCAGITTTA^ 

TTCTCT AGAAAATAGTA7TTAAAGACATTTT ATGAAATCTT CA7TGTCAAAACCTTTAATAAAAG 
TGGAAATATTTGAAATGCCCI 1 1 1 ICI I GATACCACTCATCCACGTGTTOCTGATTTGTOCAC 
ATTTCATG ATAAAATGA GAG CTCTG CAGAGAATGTTAGC C I I rCTGI GTAAATGTAATCTTCA 
AGTAGTCACI II IGI [AAGTTCTTTAGAAAGTAGTTGTCAAGTACTTAGTCATCCCTATTATGA 
T ATGAG ATAGTACAGTTTTCAGGAAGCTTAGATCTGAATTATTGTGAAAAACAATt^ATf^ 
TATTTTATATTTACATGAGAATTAATAGTTCTGATCAATTTTATAAAAATT^ 
GTGTT AAAATGTTATAACT CAG TTTCT GGTTATGACTATCTTTTT AGGGAACTATATTAGAATG 
GGT AGTGGCTT G TT A* - * * 



GCCGAGAAAGGACGGCGGAACGCATCAGGGTCACAAAGCGGCGCGGAACAGGGGGTCGT 
TAGCAGCAGCCN 
>1172 
>1173 
>1174 
>1175 

gaGCT ctaoAATCATATTTATTTGATTTGCATTACCATTATTGATGAC^ 
^C^^A^^TTTrATAATAAATGCAACCACTCTT ACCTGCTTAAGCAAATATAAACAAA 
CACTCCCAGGCCT ATTGTTTTTAAGGCTGTATATTTCACTTGATGGAAAAAGTGATGAACAG 
AATOACTTCCCAAGAAAAAAGAAAGAAAAGTGGGTGTT ACAT ATTCCTGyuVAGCCAGACTCT 
AATCAC^CAGAGGAATTTTArTCTAAGTTACATGCAAAGCA^ 

AT AAAT AAATGGATCAACT AAGAT AT ACT AAACCTCAAAAAACCCT GAATAAAGAT AAAGGT A 
AAAATCTATTAATAAAAAAAACOTTAAAATAC^ 

TTCTCT ACAGTCAAAGTTrCATTCACAGGGAGCTGTAGGTCAGTTGG I TGI I GAAAACTTATT 

GT AGO I 111 ATCTGAAAGAAGCAT AT AAAAACTT AAAAGTCAGAGCAGGGAAAGGGGAGT AA 
T AAATTA TTAAGAACAAAC T CAGTGAGACTTGACTTCACTTATGATACATGGGT ACJUU^fiAfiC 
AAACCAGOC1 ICI ICTTTCTGI I rCTCAATCl I IGCICI I GGATTAAGAGGTGTGAATCTCCC 
ACi rOCJGAGGTGGCATTTAGTGGAGAACAAGAATTACC^ 

TATTTATCCCGCTGAjCGGGAGAGACAGTACTA^ . 
GGAATAGATACCTCAA ITGTCC rCTTTGCTCCTI UAGTCCATGCTCTGAGAGGTAACTGAACA 
GGTGACATTGGTocacgiior f k » aim. frig li ^nn nwm y . m^t ^^m f u ttn flrrCTGCTTTTG 

CTTT CATTTGCTT AAAATTTTTTGGCATATGCAGATAATGTTCTCATCAGT AGTAAGAATCTCA 



WO 01/42467 PCT/US00/33312
Table 4 



G G GTT ATGCTT ATTCCCCAATG C^GGT AT^^T A T A.* TCTTTTCTGCCTTTACTTATCAATTC 

ACCAAGGAGCTGTTTTCTCTGCATCTA6GCCATCATACTGCCAGGCTGGTTATGACTCAGAA 

GATGTTATCTGAAAAAAGTTTGATAATACAAAAGCTAOCATCAGAAGAAATCCCTTCAGGATC 

ATTAAGCCACTTCCTTTGCTCTGCAGTTTCTATAGTAGTTTTAAATTATTATTAAATCAGCTGA 

AAAAAATTCCAAAAGAGAACCACACACTACCATATCCAAACAACTTTTGCATTTCCCATAATT 

GTAGTTAATGTCAGCGCAGTAGGCCAG^CCAACtXXX^AGTTCAATACTTTCCTTCCGCAA^ 

gctctatactrrgaaggaaaacagatacagtatcaaattatgacactttcgtrgcccaaatta 
atgcactggtagacccagtggctcatatttaacttcccccagcttcccaattcaaactgggg 
g g aaaaaaact aaatcattgggagttacttg ccaacttggaagttgat a > hum i a c i i im c 
cattctaagactttaagttctctggcatgagtttatctgcaatcataaactaaacaattaccta 

AACCCACCCCACGAATCCCAACCGTAACAGGCCACTGCGAACTAATTGCCAATATTTGCCCC 

TCCCCTTTAATAAAACTTTT AAGAAGTCACATTATTGGAAAACTTAACTTCAACATTTAGCCTA 

CTCAAGCTCTTCTGAAGTTCTCCTGAGATGACTGAATATGAACCAAAGCTGCACTGTGCTGT 

>1176

>1177 

>1178 

>1179 

>1180
>1181

NCTGCAGAATTCGCCGTTGAGC<5GCCGCCCGGGCAGGTACTT aggctttcataaaa 

atacagcagggcaagaggaccaagiatggaggcagtgatcagggaatctcaatgagggtga 

gactgcgacaaagacttgaaaaaggtggagaagcaagccttgtgggtatttagggtagcag 

tagtccaggcaaggggaacaactagtgcaaaggctctaggaggcaatgtgtttgaagtgtt 

ttaagaacagtaaggaggctagtatggttagaacagaatgagcaaagggggcaaagtggta 

gaaggtgagatcaaagaggtaatgaggccattgtggaggcccatatggactattggaaggg 

ctttggctrttactctaaatgaggcaaaaaccattttaagcagagaggagtgatatgacttga 

tttcttgttaaaaggattattctagttgctgttac^gaaaaagattacagg<5gtggaaagaaa 

cagggagacaaaagaatataagattttgactgtaactt atatct agtatgctt gctt atacttg 

aaaatgcatatccagataattgtagtaaattcaaatattatgtttatttaatagtactaacattg' 

atatgctggttaattatgattaggagcactaataaagcacaaatcagggattcccaaaaagaa 

TGTTAAAAGGGCAGTCAGCTTTTCCTGTGCCAGAAATCAAAGTCATAGCAGATTTGGGGCAA 

atatgtcaaagtcaaacttacgcacatcactactg^gaagacaaagatgaatgtgtgacagtt 
tcctgcccccaagaatctttaagcattgtgaagg^aagattaatatagccaaataactagykgtg 
atcagttctaocag agaggaccagttttg gaagccagaggaaaaaaaaaaaaaaacag aaac 
aaaatgatgtttcaattaaatctttaaaa g i i i c i c m a taaatttaccaagccacatattggg 
aatggtaccccaggcagaaggagtagagtaagcaagccagaaaggaaatactatggtgctt 
ttgagtaactgcagtgtggctgaagaatgtggaaaatgatgaggataaagaggtggacagg 
gaactaggtaagggagggcttccttttaaataattagaccttgtcctgtgtacatttaatggg 
attttaatcaggccataatgccaaatttcttt acttcggaaggatctttatgg7gatggtttca 

>1182 
>1183 
>1184 

cttaTTTTTCcca<3lGCGTTATGatGgtATC^ 
TTAAAATCAACTrTCACATAAAAATATGCATGTGCCTATATTTATACACATCTGTATTTACGTA 
CAATTGTCAAAGTCTTAAAGGCTGGGGTTTATATCTCTGCC<^TCCCCTCOCAACCCCA 
CCCTAATAGAAAGGTTTTCATCTTGCTTAAAAAGACAAAACAACTAAAAAGACT ATAGCTCTTA 
GGAGTCAGGCTGGTGGATCAGTGAATGAGAGAGGACAAATTGGCGAATCAGGGCTGCCGAT 
C^QOOGCAOGGGGCGCGGABGATGGGGTTGGGGGGATTAGCTGOCTTGGGCTGA Gfl > » U 
CTGGTCCTGAAGATTACAGTTTTGGTTAGAGAGAGAAGCCTGTCTAGCATGCTGACTCGGTC 
TACCTCAAGCTrTGAAAGTAATGTCTAACCCT GCTGCCAGTTT ATCACAAGTGCATTAAAAAT 
ATAGAGAACCATCAAGTTCCACATTTATATGCAATCAAACTGAAAAGGACCAGACAAACAGTG 
GAAAGGTCAAATAATTACTGTTTTATATTGGGACAGTACATTTCAGATTTCAACCAAA^ 
AAATGCAGTTTAACT CAAAGGCACAACAATTGAAACACAGAAAAT AATGGCATCTCTTGG ATG 
CCTTATTTCTAAATTAAACAAAGAAGTACACAAATTTTGTTCCCTT CCCT CCCCCAGTG CAAG 
CCCACAAG CI 1 1 1 1 CCAAAGTCTTGAGCTTTTGATGGTGTCGCGGATTGTG GACTTGTGAGG 
CGC • t GGATG7AGCAA i vaAAACAAAATGC i iGAGAGGTviAGTGAATGGCAi iCAAAAGGGA

cctcaaagtgcagacatatcnttttcaaatatgttacaggctgaactggctctttgaacacta 
tcactgctgcttaaatccaggaagcaggctttaaaaatgcaaaaggcatacaagttgtatttc 
agtgcaaatcttt ag ctggtt attg gaaaag a ttccaaaaatt aaaaaaaaaaaaaaaat cctt 
tgtgtcacctgcccct ccccccacccaaagccaaaatgaag aaaagacagaacaaaacaaaa 
acagaagaattacagatgactttggaatcatttaggaatccactacttccaatgagaatgttt 
agaaaaatacattaaagaatcattttttaaaaaacacaaaagtaatcttgctttctgagactgc 
tcttcgggtatagaatgagcttccgtgtggagtgcgtttgaactttgaccttagctgtgtgc 

GAGGTGCAGGCGCTGGGGGAGCTTCAGGCTGTGGTGTGCGAGTGACCGCAGGGGAGCGT 

GGGGTGGCTGCCCGCTGCCGAGTCAGCCGTTGAGCCCTTAGGTCGCAGTGCGTCCGTGGA 

TTTTAGT AATGTTCTCATCACTTTGTGACAC ACTCT G CTTCAGCATCACaG ATTTT ACAAATG G 

TTCAAATTC AATGAAAAAG ATTCGCCAGAGGCTGAAAACATCAAGACCTCAAAAAT ATGAATG 

CAAAAATATTCAGGGGAATGAACTAATTTTGTAGATACCTACATACAAAGTTTCTAGAACATCT 

ATAAAACCACAAAGTCATAAGAATTrGTTTTCCATTACATTAGTTTATATAAAATGAATAAATAG 

TATTTATAGATTTAATGTGTCTTATGTACATATACATGTGCTCTGTTTTAGCTTAAATAAAAATG 

CT ACAAAGTCG GAGCGGACGCflTgg 

>1185 

caAAATGTAAAATAGCCTTTAGGACTGAattagaGtGAAGTTTTTTTGGTTCCAAAGocGT 
AGACAAAAAGTCTGCCTGATAAAGTTGGTCTCGAAAGATAAATTCCAAAGAAATTTATCCATC 
TTTAGACACGACACCAGCACATTTTTCT AjCTGATACGGTTTTACATAGTTAAGGACTAAATAA 
AAATTGATT ATTCAGAATG ATT ATGTTT ATAATTTGTTCT AAGAAACT AGACT AAAAAGATTTTT 
TAAAAAAGCTTCTATATTCTTTAAAA I M I C I 1 1 GTG ACCCCCACAACTTTT CAGAGAATATTG 
TTT ACAGCACTG CCTATCACTGTTTCCAT ACT AAAAT AATT ATTTTCTCTTT ATG CT ATAATG AA 
AGTTAAATTATGCTTAATCAAAATGACTATG GCTATG gggtggcactcattt aaaatttttcag 
caatgaaacttaaatttattacctttcttctaatgaatatagaataaaagtattgatacc^ 
gcatactaacacaagaaactaaaagcaaaaattacatatttaatcttcag 
tatatattaatattccaggrrtrraaggaattaagttacattaacaataaaaaatattccatatat 
tcccttataaaaggttct attaattct aagagttctcaaatgggcttcaataagaatgatggg 
tttaaagatgaaggacatgacacgct caaaattaagaaaaatgctgtatttgaacttgcaagg 
ataattgaacactttggttataacatacttaaaagatttctgttaaaatttaatttcct^ 
caaattaaaaggtagaacagtctcctgaaggtaatgtaggtgggtcccttgcttttcacttac 
tataaactgtgctaaccaatacatggaggtgaagggggatataaaacacaaattatttcatca 
CTGAAATAATTAGCTGCTAAjCATGACTTCT aaag aacaggtataatataaaact aagaacttat 
ttcaaaat atgaataaaggcact aattacaaatgtacattttttagacagtatcttagagttca 
ctgatttacaatgaatgtcagaaccaatg ctct agtatt agatacttgcttgggttaaaacat 
tctcttggagtaaagaataatcagttttgacgttactgtga 
taaaaaatactcttaaaaagtcttacatgaatttcgtaatcatgaacttttcccaatc 
gtctacactgatgccaatttcatcttcaacacatggct aacataaatcatttctaatccaaaac 
taaccaaatattttttaaaggatttacaaaataaattgt atttt aaaaattaaat 
agtctatactagtagttggcaaaatacaatcaactattaaaaacacttttaatataatgctgat 
gtggtaatataaaaaataaccatataatttgacagaagaaaattgagccagccaatcttacta 
gatacaagaaggcagatagtcatttcataagaacaaatacaggagtactaagtatgcttntc 
tcaggacagactt agggaaactaccctcagtacttac^gtaaacaacacaaccaagcagaaa 

TTCAGGCAAACTGTGCTTAGGTAAATGGCAI 1 1 T CI lAATTaX^XACATCCCACACACACCA 
CAAACCATCCATATCGATTTATGTCTGCI 1 1 I CGCTCATG I ICI \ GATTATGGGTTATATTAAA 
AACTAAGTTAAACTACACTTATCCTCTACTTGATACAGAATCACTAA^ 
AAAGTTAAACAACTACTTTCCAAAGCCCAGAAAGCTTTCCTTTC 

GTGTGCTCCAACT AAAGATAAATCAGAAAAATGTTACCACATTCGACT AAATTAACTTGACT A 

AATT AAAAT ACCAATTTTGAACACATATTTTGATATTATGAACTTTGTATC 1 1 M 1 1 ATCAAAATT 
AAAAAGTACTGCAAAAATGAAGTATTATTCTCTAAGTATTCATTTTATC 
AAATCACACATTTGAATAAACAGGATCGAAATACXa^CACTTGTCTTTC 
TATATTGTTTAGATTATTGTTCATATTAGACAACTGCCTCAAAAA^ 

AAAT AAACTTTTGATAGATT AT GACTTTTTTTAATTTT AAGTTGTT AAGAAT ATTAACTTTGAGT 
CTCXTTATTAATATTCTAAAAGCTrAGGATTGAATTCAGCAG 

AGGCATAACTACAAAGATGGCAATTGTTT CAAGTCTATTACATAATACCGTCAAATAAATTCAA 
GGAAAG CCATG TAGTTTTTTAAGTAACATACTGCTGGTAGTAAAAATGCTATGACAGACACGA
CTTCAAATTTTAAGAAACAAAATGGACTGTTCATACTTCGGCTT CAGCTACTGCCGAAAGCAG 
CTGTGATATATCTTGCAAAGAAAAGAGCTAAAATTTCTGTTGATAATAAAGTTGTTTGTAAACT 
GATGCTAAAATATATTCCAATTGTCTTATTTTTAGTGCTGGTATGATTTACT 

ttttatatgcattgaataatgtgatattcttaaaataacagatttattcaagaatatttaacatc 
aaagttgggctttcaaataaatattttaccttggttagtttttt3aacaaatgttg 
agagtggacttgattcxtittagataaataaacagcctactt aatttccatgcaaataacaactt 
aaacattattcx>tgactgaacagtaatgtgggggaaagttaaaaggcaagtactcatgatgt 
ggaatcttccttacatatatcctctaggcagai 1 1 cm ici i 1 acagt aagtt ctgtccgaaa 
gatgaatggtttatagagggaagaaatatatttagtgctggagaagtctgcaggcagtacct 
gatggtttccaagact actct atcaaaccagtaattctctggttgggagggagtgctggtca 
agaattgaaagcaattggtcattaaggtt at aagaat atttcagaagacatcacgcatatgac 
aaacatttcttacaaggcagtcactgattaattagaataaacag aagaattgttgctgaaaac 
atagttatacattacattcatcgaactgaagattagtgtgttaaaaaaaaaaagttaagaaaaa 
atcttttatgttccagttcagctgcacttgattcaagtgttgct^ 
ataaattaaacatgctgctgccttagcagccaaaggtactaggaaaaaatgttcatgtc^ 
aataaatagagcagtctcttrcaagcaaaaaagttatctcacaccaatccggg 

CTCT AGACCrn'GGCATAACCCTTCTCCTGTGAGAGCACAGCAGGATT 

GATCCTTAATTGAACTAAGGGTGAGGTATTTCGAGATTTCAGCTGCT GTCAT ACACCCTTTCA 
TATCCTGTTT AlTTG^^AAGATAAGGACTGCAGCCrrTCCGTAAATCCTCATGAGCCAAGATTC 

GAACTCTGTATTTGAGTAATATGTGTTCCAGGATGATC<5CAGAGACT 

C<X2ACATAAGAAAATGAGTGTTCTTCACAACTATTTCTTCAACATTGCTTC^ 

AAGTATGAACCACTTC^TTCATTAAGAATTGGTAAAGAATGGTGGTTrrCCCT 

GTCCCACTATAATrACTTTGTGTTCTTGGTTACAGAAGAGGCTCCACAGTIT 

G<XX^ATCACGAGCACC<MGGCGGGCCGCGGGGTCCCTCGGCAGCAGGTGCCGCT 



GAATCCOCACCGGACAGC 
CCTCGGTCTGT ACCCACCTACCCGGACGC 
>1186 

WvKaAGT(XMCXX^CGCGTCCGCTTGTCT ATTTCAAGTTTGAGTAGGCT GAGGAAGA 
GGCG<5AAGGAGGGTTGGTCTTGCTGTCT7 ACAGGTG GCAGAGGTGAAAGAGGTGAAAGTC 
CACAGGTACACGGACTCATGTAGTTCAAATCTGTGTTGTTCAAGGGCCAACTGTATTTCT 
GTAGTTGGCTTGC C 1 1 1 1 1>TGTTTTTAGTTTTGATTAATGCATGGATTTTACCATCA I 1 1 i 1CT 

GAGCTTATTTTCTGAGGAGTTCATTGTATTACATCATTTCATTTGC^ \ 1 1 1 1 GTCTTT ACAT AG 
TAGATAGGGATATGTACCTCCCT7 CGCCATCATGTAACT GAAATAACTGAGGAATTGTTATTG 
TGCT ACAAAACTGAGAACAGATGAAGATTCTGTAATGAATACTT AGATCATCT ATTTTTTGTTG 
ACATTTTGCCTAGATGATGTGAAATTATAACTATTG^TTCTGTTGAAAGAGANNNN 



>1187 
>1188 
>1189 

ht^CATTTATTCAGACCTITGCTATGTTCCAGGCATG 
NNNNNNNNNNNNNNNNNNNN^^ 

ATTGCTGTGTATGATAGATTCTGATAGTTCGTTGTCTATTGTAGGAATTA^ 
TAAAGGGATAATAAATATTrTAG^rrrCAGCTAGGTCCTGTGGTGACTACT 
TTGTACCATGAAAACAGCCATAAATAAACATTAAGTGATT^AAATGGGCACTGCT AGGTTCCAA 



GCACTTTOGGAGGCTGAGGCGGGCGGATCATGAGGTrCAGGAGATGAAGA 
>1190 

GCGTCCGAAAAAAAAAAGTCCTAAAAATGGAAAGCCTT CGAATGGTTT ATAAAGAAG 
GCAGTGCCTTGATCTGATTTGTGTTATAAAAATTTCAGTCTGGCTGCAATGTGGATAATAGTT 



AC^GTAGGGTAGTGGCAGTAGTGGAGGAGATGGAATAATGGAGATrCAGGGGCTTGAATTT

GTTAACCATTTAGTTCAACAATATTTATTGATTGGCTTTJTCTGTTGTGAGCATATCT 

GGAAGAAAAACAAGCCAGGATAATCCCCAGGTTTCTGGTGTATAAATGTTGGTACTACTCTG 

CGATGGAATGGAAAAAGACTTGAGGCCCTGAGAACTGAACCATGATTAGGAGCCTTGGAAC 

AGTAGCAGTTGTCATCTGTGT AAAT AAGGTT ATT GATTTCTAAAAGGTCT ATGTTT AATCA GAT 

GGATTAGTCATTCTGTCTTTCACACCTGTGGTGACTAATTCAAAGGAAAAGCCGTATCATTTT 

G CCAGCT G GTGTTT AAGAGTTGCTTTGTTTTCTGTGAAACCAG GTGT ACTGCATT AGAAAAGT 

TTATTCAACTTTTAATCATTAAGCATTGTGATAAATTGTAAACTTAGAAATTGTTTG 

TATTAAAAATGTTGGATTCCTGTAATTCCAGCATTTrGGGAGGCTGAGGTGGGAGGATTGCT 

TGAGTTCAAGAGTTCAAGACCAGTCTGGGCAAC^TGGTAGGACCCCATCTCTACAAAA^ 

AAAAACAATTTGG CCTG GG AGTGCACGCTTGTGGTCCCAGCTACCCAGG AAGTTAAGGTGG 

GAGGTrTACTTGAGCCCAGAAGGTTGAGGCTGTGATGAGCX^TGATCGTrrCATTGCACTCT 

AGCCTGGGCAACAGAGTGAGACCCTGTCIsWNNNN 

>1191 

>1192

>1193 

AC I I I M 1 1 I II I I 1 1 1 I I II I I T M 1 1 1 1! M 1 1 1 CATTCAAGAAAG AT AATTTT ACACTT 
ATTCTTTGAAAGAAAAATTCTATGGAA I [TTCTTCl I CTAATTAAATTCCAAAATACATTCTCTC 
AACCCTATGCCCTCATACTAGTAACTTGATGGTTAGCGGGTAAGTAGGTAGTAGTAAAAGAG 
CAAAAGGGGAAATTTGGGteAGCAAAAAAGGGAGAAAAAGAAAAAAAGGGACCOTTCTAGTTT 
CCT AATAGAAAAGCTAGAGAATT(DCATTCCTGAAAATTAAANN 
>1194 
>1195 
>1196 
>1197 
>1198
>1199 

aalatttGATGAAGlTtcicAaAcattocnoaGaC^Tggcag^ac^ 

gaaactagattttacattggtactaattcaatgaaaatatttaaaacatttttaatatg^ 
gaggctttttcactcn'aagaatctgcxrrccaaattcaagctgaagatttg 
tctgaataaacagtcaagaaacacaacatcaaacaataaaagcttttagccaaatgtacagta 
tccagtaaaaaaaggcatattgagcnttaactgcatcaatcarrtgctgttctct 

CTGCATTATAACAAGATGAAAAATAAATACTTGGTTAATCTGCTT ATTTCATGCAAATTTGTCA 
TGT AAAGGACCT CT CTT ATTTGTTCTCTATTT AAT ATGTT ATCAATCAACAT ATAATCAAAT AG 
GTAGGTGACCGTTATACCTTGCCTTCAGCTGAATTTAGAATTCT f 1 1 T AAGATGTC 

TTAAGCATACTCAGAAATGAAGGACTCAAGAAAATGTtCAGrTC^ 
AGTCACCAAAGTAAATAAAGCCATTCTATAACATAAACTGTTAGGTCTATATTTTTTACT 
ATCCTAAGGACACAGCAGAAATGGTGGTTGGGAXSGCCTTCCACATTTTTG^ AATAGA 
ACAGGCAATAGGCAGTTATAAATGGATAGATTTCAC^GrGGGGGAAAAAAGAC^ 
AAGTGAGCAGTTTCTGAGCAGGAATGTGGT ACAGT ATT AAGAATGGAAGAAT AAT ACAATAA 
AATTCCACACTATATTAAGATAGAAAAAGTAGTGAA^ 

TATATAACACAGGAGAAAACCTGT ATAAAATTCX^TGTATTTAAACCAATTTACAAATACAAAA 

AATTCTGTC<^GCTCTGAG<xJotTACACGACAAACGTTTAC 

ACCa K aaaa t ao cn caaaiagttmcttc a aaaaaa tg aca ^ 



>1200 
>1201 
>1202 
>1203 
>1204 
>1205 

NNNNNCTGGCCT GTCTCCTTCTT ATTGT ATTGGCGAGTCTGGGCATTTATGGAAOGC 
ATCTGTGGGGGTGGTCATATGGCAGTGAGCT ATGATCAT ACCACT GCACTCCAGCOCGGGT 
GACAGAGCAAGACCCCATCT CTTAAAAAGAAAAAAAAAGAGAT CCCATTTCATATTCAT CAGA 
TTGGT AAAAATAAAAAAAAATCT GACAAAGCCAAGTCTTGATGAGATATGGAGTACCAGAAGC 
TAATCCCCACCGGGGTTGGTTTAAATAGGGACT AACTACTTTGGAGGACATGGAAGATAGCT 
caagtttaaatgcttataacccaaggctcagcaatattctagttaatactctagaggaatgct
tgcacagtgcccaagaaggtattaaaagaatgtttattcaggtgttatttgtcatagtgaaat 
actggaagcactgtaactgtccattacagaagaacggataaaaactattgtgactaatttata 
taacagtatagcatacggcagagaaatgagtgaactagagct agatgaataaatctcaaaac 
gaatgttaagacaaacaaggccggttgcggtggctcactcctgtaatcccagtactttggga 

GGCTGAGGTGGGTGGATCACTTGAGGTCAGGAGTTTGTGACCAGCCTGGCCAATGTGGTGA 

AACCCTGTCTCTACTAAAAATACAAAAAATTAGCCAGGTGTGGTTCATCNNNNNNNN 

>1206 

>1207 

NNNNCCATTGCT ATTATCTAAAATAAAAAAACCTAGATTTATGGTC I 1 I I 1 1 IGAAAAT 

cagaaggatctaacaacactagactcacattcctacacatatcaactagacaactagacaaag 

t gatt acctt acaataggc aaacactttctagttttct acttccc acagttttccct aagaccg 

tgagtgaatattagttgccatttattaccacacgtacagtgttgttttttgacagtagggaca 

cagttccatgtaccttgatctctagcaacgagggaaaataagaaagatcaagattattgtgtc 

t aaagaaaactgggaat atat at acttgacccgctt cacttgctt acattgt ctgtctgattc 

ttccaagcattaattagaatttgcaactcctagctgggcacagtggctcatgcctgtaattcc 

agcactttgggaggccgaggctggtagattacttgaggtcaggagttcaagacaagcctgg 

ccaacatggcaaaaccgcatctctactaaaggtacaataaccagcct agcatggtgttgtgc 

gccggtggtcccagctactcaggagactaaggcagagaatc^tttgaatccaggaggcag 

gggttgcagtgagcagagatcgtgc^ttgcactccagcctgggcaacagagcaagactct 

gtctcaaaaaaaaaaaaaagtcgacxscg^x^cgaatttagtagtagtagggggccgctct 

agaggatccaagcttacgtacgcgtgcatgcgacgtcatagctcttctatagtgtcacctaa 

ANNNNN 
>1208 

acgc«gggatagttgaaatggagggcggggaagtc^ggcagtggtttctgaaagc 

CAAGAACTT AGT AG CACTGTG CCATT CTCTTGCCT GATCCAG TGCCATTCCCTTCA CTTGATA 

tctgtttacttt agaggaggcagtttttgagaaaggatcataaat atcct ggcccagtgccc 

tccgtggatacagaaatctctgcaggcaagttgctcc^gagcatattgcaggacaagcctgt 

aacgaatagttaaattcacggcatctggattcctaatcgttttccgaaatggcaggtgtgagt 

gcctgtataaaatattct atgttt accttcaacttcttgttctggctatgtggtatcttgatcc 

tagcattagcaatatgggtacgagtaagcaatgactctcaagcaatttttggttctgaagatg 

taggctctagctcctacgttgctgtggacatattgattgctgtaggtgccatcatcatgatt 

ctgggcttcctgggatgctgctmtgctataaacgacagtcgctgaatgcttcatgtacgtaa 

tatcgtaggctcgcttctgctcctgctcctgcaggtgcg^gacaggtatcctaggacctga 

aaacaaatctaagtctgatcgcattgtgaatgaaactctctatgaaacgagaaggctctgaag 



ACTCCAACAACCCACT ATCCTCGAACATCACTGAGCCAGACCAACCACT ACAGCAGGAAAAA 

GAATGCACAACGCOCAATACAAGGCAAAAACACAGCAACACCACVVGAGACACTGCAAGCA^ 

CCATCCCAAATACAGAAAAACTACCTACGCCAjCACCACCG^X^kAGCCAAAGGACGACAACG 

GCGAACACCCACAGACAGCAACCGGCGACAAAGACTGGAGAACAAAGAGC^ 

GAAAAAACG CGCCCAG AAAATGG CCCAGAACAGCG AACCCAOG AAACAGAAGAAAGACACA 

NNNN 

>1209

ACGGGGGGGAGGTCTCCATTGAGTAGXSTGGCGCGGG^ATGAAGGCCGTGTTGGGGC 
TAAACCACACTGTGGAATTCTGTCAGCAAATTCCTCGCTGTGTGAACTTGAGCAAGOCATTC 
AUCJ I ICI 1AAGCCAI 1 1 ICI I GATATTTCACAGAGCCTCACCAAGTATTCAACGAGAACATG 
T AAGTGAAATGCTTC^AAAATGCCTGGTAAATAATAGATGCTT AGAAAATGGTAGAGAGAGA 

>1210 

>1211

agcctggctgcxntcctctcctractgggtctg^ctctgccacctggtctgccaca 
gatcx>tgatgtgcagttctctggagcaggcgctggc^^ 

gtActcctgccaag agggcgacaagttcaagctgaGtaagggggaaatgaaggaacttct g

GACAAGGAGCTGCCCAGCTTTGTGGGX3CATTCCAGAGAACCATGTGCTGTGAGGGCCTTCC 
gagtccatctgtttaatgctgtcattggagacttgagaaaccagagcccagaagggaaaagt
gattgtcccaagatcacacagcactggagaaagtggatgaggaggggctgaagaagctgat 
gggcagcctggatgagaacagtgaccagcaggtggacttccaggagtatgctgttttcct g 
gcactcateactgtcatgtgcaatgacttcttccagggct 
gaactcttgacttcctgccatggatctcttg<igccc^^ 

tattcaataaac i ii i ii i gtctgttaaaaaaaaaaaaataaaaataaatgggcggccgccgg 
gcccccttt aatttttt aaaaaaaaaaccccgccccccccccgccggg ccggaaaaaaaaaa 
aaaaagccca gggg gggggaaaac i i i i i i i \ ggcccctttaagggggacaaaaaaaaaaaa 
tcccccccaaattttcaaaaaaaaagc 1 1 ii 1 1 1 ccccggctttt aagggggggggggccca 
accccaaaaggggtttttaaaggggggggcgccccgggcccccccccaaaaataaattttt 
ttcccttttccaaaaaaaaaaaccccggccccnnnnnnnnnnnnnnnnnnnnnnn 

>1212 
>1213 
>1214 

NNNACAGATGAAATTGGC^GCAGTrTATTAGTCACAACTGCTCACAGGGAGGGAGG 
TCACCACATGCCATGCGGGGTCACAGGAGAGTTGCATTTGGGAATAGAGTGAACCAGTAGG 
GGCTGTGGAAG GCAGGCTTTGCAGT AACAAG AGGAAGAGG CGATTCCTGGCTCCTCCAGAT 
GTGACAGGCTrGTTTGAATAATTTTCCAGGCTGGAGGGTTGTCAGCCACXaTTGAGACCCAAG 
GAGGGTACCTCGGCCGOGANNN 
>1215

nntaatncaccgnggtggcgggcgctctagaactagtggatcxxxxcggctgcagg 
ctgtagtgctattccctaatgagcaactaagagaagacacaacaataaaccatggaagtgtt 
gggaggccatggaaatgatttagaggtataaaatattcccaagtttttattataaatttttatt 

CATAGGTTAAAACATATAAGGATAAGTGATTGTACAGGTGTGACTCCCTTTT ct aggagagaa 
AAT AGCTTT ATTTGAAAACTCCAAAAG CTACTT AATCTT ACCAGAGGTTTTTAAT AG GGGAACT 
GGGTTT^nTACAATGAAAGCATATTTTCATTCTGCTATAGAAAAACATATCCCTGCTCATTCA 
GT AAATCT AAGATTGGAATGCTTTCT AGAACT ATGCAATATTTGAAT AATTTCTAGAAATGTCT 
AAGAAACAACCI 1 1 1 1 1 ATTTTAAATCATCACTT ACAAACGT AGTCT GAG ACTGCAAACT AGTT 
TTCACrTTGACTAATCCATCATTATAGATTAGTTACCATTTGCT CTATATAATCT ATTGAAGAAA 
CTATTGTTATTCATATTTTGATTAATTTTACTAGCAACAAATTTTTATTTTAGTAG 
ACCTT AAATATGTAAAAATGAAGTTGTAACAGGATCAAATATGGAAATGGCAATGGTAACTCT 
GCCT AGAACAT AGTTGAATGTCCTTG GTTGTCGGT AAGATGCAAAGATCT ATAGAAATTTT AT 
CAT AGGTCAAAATTATTTCT AAAT AT ACAAACAG TTTCAACT ATTTCTTCCT CCCT AAAACTAA 
AACCAAACCAAACCCAAATGAAGAAACCT AAA C I IC I I I I CATCT AAT AGTTATACCCAACTA 
GATAATAGTAGCTTC TAGA CATCAATAAAAAGATTTCATTTTAACTGTCTTTT^ 
TAT ATGA GTATGGAATTTrGTATCTAACTTGAAATTTCAGATGTTTTAATAATOCAAATA 
CHC 1I I AG CATGTCAAAQT ACTAATAGTTAAAAAA I 1 1 CM TAGATCTAAGCATATACATTACT 
TGACTTTTTCAAAAGTATCAAAAAATTGATAGCTGCAGAAATG<3CATTATTTGA 
AACATTG ATTCA TTCTTAAATQTGTATTCTTTAAAAAATAGTAAATACCTCAAGAGTGACA 
AACTTTCCTTTTCTGTTCTCTAATGCT AATTTAAACTTTGAGTCTTAGTTTACATGGTCATAGG 
GGTT AAATAAAATGTATAACTGTACAATTAATTGTGTTCTTGTGACCTGATGA 1 1 1 1 1 IG AAAA 
TTTGCTTTT CT CTTTAAGAAATTTAAGTTTTCAAGGGCXX5T ATT AGTTATCT AAAT ATT^ 
CTAATGTTGAC TTATA AAT AAATAAA AATTTAGAAATATATTCATGATGACAATTTTGTTACT 
CACTGCCT ATTCTTTA 1 1 ICI I I 1 1 1 AGTTC AAAGGTGAAATTTTGACCTTTGTATTAACAAAG 
CCTCAAGAAAAGAGAAATTCTGCCTTTT AAACATTGQ \ I UC C I I GCATTTTATGGTGGTGTG 
ATTTTGCCTCAGCAATAATAAAACATAAAATATGTTGfrTGCATC GCTGA 
GATTATTTTTAAGAACCAGGTTGAAGCCTCTATGCCTAAAAAAGC^ 

TATAAAGCAAGATATATGGCT ACAAATAAGTCTTGCAACCAAAATTG GTATATTAAAATTTACT 
GCACACTTTGAGATTGCTGCCATTTTTTGGTGXTTTTTGG 

GTATATTAGGGAAAATTACTCAAGTTACACATTCTACATTAACTGCGCTGTATTATCT 
ACAATCAATGGAC XAATA GTTCCTAAATTAGTAATAAGGCTT ACCTT ACAAAAGT AATGAAATA 
ACACT GATATTTCTCTTrCAAAGGCTAACAGCAATTCTTGGTCCTG^ 

AATAACTACTGTACATGCTACAACATCTGAATATTAATGGI I 1 1 GC I G CAGATTCTGGGTATG 
AGGATTAGCTTGAAAGGAATAGATTTGCTAACATGAATCATG^CTC^ 

TTT ACT ACCTGTATATGAATGCAAATGTTGTT ACAGAAAATATGCTTGAAATG TCAAT ACAGTG 
TGCAATTAGCGTTGCATTGTCATGCTCCAGCATGTGATATCTGTATGGTGAAGTTATTTCAAT

TACTCTGATACATTACTATAATAAAAAAGTTTAACCNNNNNNNNNNNNNN 

>1216 

>1217

NAATTTATTAAAAATTTAAAAAAAATGAAAAAAAGCTGCATGCrrTGGTTTTTC 
GTTATTCTACATTGTTGCCATTATTACCAAATATTGGGGAAAATACAACTTAC^ 
AGGAGTTAAATGTTACT ACGAAGGCAAATGAACTATGOGTAATG AACCTGGTAGGCATTATTT 
ATTGA ATTAT CAGCATTCC AGAT GTCCAGCACATTTTTAATAGGAAAGTATTGGGAACAGATG 
TCATTATTTTCAGCCTAGGTTTTAGAACATTTTAGTATGTCATGAATTATCT^ 
T AAAT CTTTTTT AAAGGT CCATTTT ATTT AAAAT AT AT AAAAATAATCACTGCACT GCAACCTG 
GGTGACAGAGAGTCTGNN 
>1218

NNACAATGTT AAATATCTGACTTTTCTATGATTTGGCTTTT CTGCCTTGAGTAACTATT 
TAAATATCTGCGTGATCI ICI 1 1 1 ATTTGGGCTACTTCTAGAACAAAACAGAGGTATTTACAAC 
AAACCACTTCCCACAGC<5CCTTTGAACXX»TITACCTAAGTCA 

AAATGCACCATGGGGTTTATTGTTAGATAATAAAAGGCTTAAAAAGCCCCT AGACCCT AAAGA 
TGCCTGGGATGGATG ATTGATGCTCATATGCT ACTTGAGCATGT A 

>1219

ACCI I I 1 1 1 1 1 1 1 I 1 1 1 II 1 1 U IC GTCAA AGTCACTATrrGGGCCCTAACATAATCCCT 
GCTCAGAGCGACGGAAAAAAGGCAAGCCTTTrCAAACATAACTCTCTCT ACAAGCCAGCTAT 
TATGGCAAG^SGAAAAAAGAAAGCATCTAGATAAATATCTATC^AAATTAACTTTAAG^GAAAT 
ACTCTCTTTCCTTAAAAGCCCTT ATTTTTTAAGACACTAGAAAATAAGTTACT ATAAAAAGTGG 
TGGTCTGGGGGCTAAAAACAAAACAAAAAAAATCCT CTTTT CTACATTTTTTAGTTTTCTG 
>1220 

ACAGAATTATCAACTGATTTGGTCAGTTGCTT CCAATGCTGGTTGATTTCCCTCATTG 
TGTAAACATTGACAGGTATGTGACAAATGGGAAAAAAAATCCAAATAATAAAGTGACATATTG 
GTGTTCATIWNMMMHH KNNU HNNHMHHNNNNNHNHN C I I I I M I I UNI I 1 1 1 1 1 1 1 1 GTT 
ACTT AAT AAAAAAACT GAGTTTTATTTC^CATGTATTTTGTTGGNrCCCCACCTTTT^ 
TGACCACCGCTACT ACTTAGTCCTATCATAACATTCCATACATACTT AAAANN 
>1221 
>1222 
>1223 

ACACT GAACAATTTGTTAAGAT AGATCTCACCTTGTGTTCTTACT GAAAAAAAAAAAG 
AAAGAAAATAGAACAGAAAAGCAATTGGATTTT AATTCTGGAAACT CCTTT CTC I IC C I I A CAT 
CCAGGAAATTTGCTGTTT ATTTTGAAAAGCAAATTTAAACCTATTTAAGGGAGAGAGAGCTCT 
TGTAAAAATTCATTTATTAGTTCTGGACCAATGTTATTTATAAGCT ATTATTTCAAATGATAAAA 
AATAAATGCATAATACATTTGATGATAGAACA \ nTTCTTTI I AAAT AAGGCCTGATTTTTCAAA 
AAG ^ 1 TOS A ™ TG ™ 5TT TOAATTT^^ 
AGAAGAATTGTTATGTATGCATGTTCATATCACTTTTG 

GCAATCTGTGGTT ATCATGGACCATGAACTAGATAAACATTCC 1 GO 1 1 1 I C I F AACATTCCTTT 
CCTGTGCC1 CATTAACT AATGTACATAATAAAACATCTTTATTTCATANfWNNNNNNKfMNIIM 
>1224 

>1225 

OCACGCGTCCGGCTGTTATCACAGTCT^ 
ATGAi rCl i GCAAATT AAATTTTATTTGGAAAGTTCAGT AACAGCCTTATGGCTTCCATGCTAT 
AATCAAAGGGATGTATTTGATTCAGTTAAGATAGAATCrrGTGTTTCTA 1 1 IC I I 1 I A TTTCATCT 
TATTAGAAAAGTAATGCTCTGTATAGAAACTAGAAGAAAATTAAAATCATCTACT 
ACCAGACAACCAGCTTTGATGTACATCATTT6ATGTATG 1 1 1 I I U I I 1 1 1 1 1 A ACAT AAAAGGA 
TTAT ATpg 1 1 1 tXXCCAGCTGTTTTCACTCAAT ACATTGT GAAAATATTTTCACATATGTTGC 
ATGGGTT TCTATAA CATTTGAAATGACTTjCCAAATATTTCACTGT atgatcatcatttaatatt 
ATTATCAAi 1 1 1 S I AT AT TTAAGTTAGAAL I 1 1 1 CCATTACCATAAACATCATTATGAATGAGCT 
TTCTTG AACTGT ATTTTAATATACTTCCTT AGGATAAATGCTT AAAGTAATAATTATTGAATTGC 
i 1 1 w-AAAAACATTX5CACT0CATCCCAATrGTTTTGGAAT ACCCTTT ACCATATAN 
>1226

>1227

>1228



>1229 

NNATACCACCTTCCTAATAGTATTATGACTTCTTATTTAAAATAAATAACAATTGCCGG 
TTTTCTGTTAATCAGI M ICI J AAAGCCAGTGTTTCACTATAGTGTATGCATCAGTGTGAATTG 
TGACrrTAGGAAGGGACACCTCAACAGTTCACAAAGTACCAGAATTCAAGGAGGAATGCAGA 
GAG AAAT AG AAG GAAG CTGCTCCATTTTTCATCATCT ACGCATCT ATTT GGAAAGCACTG G AA 
TTC^GATGCAAGAGAACAATGTTTCTTCAGTGGCAAATGTAGCCCTGCATCCrrCCAGTGTTA 
CCTGGTGT AGA 1 II I 1 1 1 I C TGTG 
>1230 
>1231 

ACTCCATAATATAATCTTTTAAATGGGCAACTTCTAAATATTGATACAACCATTAATAA 
TA ATGC TTATAGGGTAAAAGAAAATTTTTGAAGCACTGAATTCAGTAACCTGGGTCATGGTCC 
AATTTTGCTCACTACTTCATATCTTTTATGTAGATTATTCCTATAAACATC 

catcagtttgtaaagtcaatggattaaattattcaaatgtagctatttaacggtcagtaacaat 
g cct agaaacct attt attcatctgt aat att aaaaag ctgaatttgatgatctt gaaaaatcc 
tttccagatttacaacnnnnn 

>1232 
>1233 

ACrrCCATAATATAATCTTTTAAATGGGCAACTTCTAAATATTGATACAACCATTAA^ 
TAATGCTTATAGGGTAAAAGAAAATTTTTGAAGCACTGAATTC^GTAACXrrGGGTC^TGGTCC 
AATTTTGCTCACTACTTCATATCTTTT ATGTAGATTATTCCT AT AAACATGTTCCCTAAATTCCA 
CATCAGTTTGTAAAGTCAATGGATTAAATTATTCAAATGTAGCTATTTAACGGTCAGTA/WIAAT 
GCCTAGAAACCTATTTATTCATCTGTAATATTAAAAAGCTGAATTTGA 
TTTCCAGATTTACAACNNNNN 
>1234 
>1235 

NATGTAGAGAGGG^GTTTCACCATGTTNCCCAGG^^^GGTCTCAAGCCTGGTTTCCT 
GGGCTCGAGCAGTCTNCTGCCT CAGCCACCCAAAGTACTGGGATTACAGGCATGAGCACCC 
AACCCATCCTACTCTGTGT ATTTCAAAAAT A 1 1 1 U 1 CTGCCCCTCTTTTT GGGAAATAATAAT 
TACTGCTTCTAACAGTTATGTAACAGAGTGTAC^TATCATGCCTTACAAATCTTT 
TGTAGCATCCAAAGATGATAGGGTGAGTACTCTGTAAGTCTGGAAGAACAGGTCACATTTAT 
TCA6ACTTCTCCCCX>CAATTTTTAATCAAGCACCT 

GATTTTAAGTTGACAACAGATGTATCAGATGAGGAAAAAATTGAGCATGTGT GGTGTGATTAT 
ATAATAGAAI fGGTI I CTATAAACCATTTATAGTATTC>ACTTTTATAGTATTACTTTTTCAGAT 
GTATGGATATATAGACTATTATTT^TAATTGAGGCTCTGOGAAGTGTAGTGT ATTTTACAAAC 
ATAATTAAATACT ATCATTTAATATGCCTGATACT AACAT ATATT AAT AT ATTTTAGCATGGTTT 
CAAAAATTTT AATTAGGAATTTTTGCCrrTTT ATTATATCAGAAAGT ATATTTTGCAGTCAGGCC 
TCCAATAAATCATTATGAATCTTGTAGTTGGTCCTGAACTTGGAGT^ 
CAMNMN 
>1236

NNNCGGATCTATGATAGAGTAGAATAAAACC I C > 1 1 C GTCT ACAAGCCACACTT ATC 
CAAAATTCTGTTGACAACT CACACTTGCT ATT AT ACCTGCTTCT ATTCT CCT AGTT AGTCCCT G 
TGGGTTTATACCTTTTTATTCTTTCATTGTTATTTTAATGAGGTTTCTGT 

AATGCCTATGTrTACTCCATCATGGTT ATCTGTAAGTCTGAGGTGAATTTCAAAAACTCACGA 
CAT GAT CAAACAAGAAGGGT ACTTJXT AAAGAAAAAGGAACAGTT AGAGAAAAAGT ACAGGGG 
AATGAAAGAGCATGGTATAAATAAGTAACAGGAAGT ATTTGCAGT AGAAGAACCTCAGTGAA 
GTGGCCTGAGCACT AGGGCAAAACAAGCCAGAGTGTGAATCCAGGCT CTGCT ATTTAGTCG 
TCAT ATGACCX^AGGATATGTTTCTTCTCTTCTGAGGCATCTCACT GCT GGGCATGCACACTT C 
CACTCCCTCCCTGTTTTCACN 
>1237

CCCCTCGCOCGTCACGCACCGCACGTTCGTGGGGAACCT GGCGCGCGTGCGGCG 
GCTGGGCACCGCCATTTTGGjCCGGTGGCCGTGAGAA 

TTTCTCCAGAAGAG 



ctggat attctttcgcocagttatggcagacaagtt AACGAQAATTGCT ATTGTGAACCATGA 
caaatgtaaacctaagaaatgtcgacaggaatgcaaaaagagtt ct 

ttggttgtggtatctgtattaagaaatgoccctttggcgcctt atcaattgtcaatctaccaa 
GCaaC 1 1 CCi AAAAAGAAACCaCaCaT CGaTa 1 1 G I GCOaa I GCCi 1 CAAAC 1 1 CACAGG I I G

cctatccctcgtccaggtgaagtntgggattagttggaactaatggtattggaaagtcaac 
tgctttaaaaattttagcaggaaaacaamgccaaaccttggaaagtacgatgatcctcctga 

CTGGCAGGAGATTTTGACTTATTTCGGTGGATCTGAATTACAAAATTACTTTACAAAGATTCT 

gaagatgacctaaaagccatcatcaaacctcaatatgtagaccagattoctaaggctgcaaa 
ggggacagtgggatctattttggaccgaaaagatgaaacaaagacacaggcaattgtatgtc 
agcagcttgatttaagccacctaaaagaacgaaatgttgaagatctn'caggaggagagttg 
cagagatttgcttgtgctgtcgtttgcatacagaaagctgatattttcatgtttgatgagcct 
tctagttacctagatgtcaagcagcgtttaaaggctgctattactatacgatctctaataaat 
ccagatagatatatcattgtggtggaacatgatctaagtgtattagactatctctccgacttc 
atctgctgtttatatggtgtaccaagcgcctatggagttgtgactatgccttttagtgtaaga 
gaaggcataaacatttttttggatggctatgttccaacagaaaacttgagattgagagatgca 
tcacttgtttttaaagtggctgagacagcaaatgaagaagaagttaaaaagatgtgtatgtat 
aaatatccaggaatgaagaaaaaaatgggagaatttgagctagcaattgtagctggagagtt 
7acagattctgaaattatggtgatgctgggggaaaatggaacgggtaaaacgacatttatca 
gaatgcttgctggaagacttaaacctgatgaaggaggagaagtaccagttctaaatgtcagt 
tataagccacagaaaattagtcccaaatcaactggaagtgttcgccagttactacatgaaaag 
ataagagatgcttatactcacccacaatttgtgaccgatgtaatgaagcctctgcaaattgaa 

AACATCATTGATCAAGAGGTGCAGACATTATCTGGTGGTGAACTACAGCGAGTAGCTTTAGC 
CCTTTGCTTGGGCAAACCTGCTGATGTCTATTTAATTGATGAACCATCTGCATATTTGGATTC 

CXrrTTGTTGTGGAACATGACTTCATCATGGCCACCTATCT AGCG GATCGCGTCATCG 1 1 1 1 1 
GATGGTGTTCCATCT AAGAACACAGTTGCAAACAGTCCTGAAAGCCTTTTGGCTGGCATGAA 
TAAATTTTTGTCTCAGCTTGAAATTACATTCAGAAGAGATCCAAACAACTA TAGG 

TGATTAGACTGACTCTGAGAATATTGATAAGCCATTTATTAAAAGGAGTATTTACTAGAATTTT 
7TGTCATATAAAACTTGAATCAGG ATTTTATGCCCCACATACTCTGGAACTTGAAGT ATAATAT 
ACTTAATATAACATAAAAAGCCAGTTGGGTTCTAAATTGTAGTTGAAACACAGAAAATGCCAC 
TTTTCTGTTCCTGAAGAGGCTCTTTTGTGCATAATATTCTAAAATGAAGACATTTCAAGCTATA 
CAAATT ACTTCCAAGTTTTCATGAT GTATG GGAAG ATTTTCAGT AG GTGT ATT ATATTC ACGG T 
ACCAuAATGCTGAGCAGTGTTGCTCCATTTTTTAAATCTTGAAAAGGGTTTCTGTACTTACCTG 
GTrTGCCAAGTATGCCAGTGTAATGAAACTGOCCTTATTrrAAAAGCCAGTCAAAGATTCCAC 
TGATTGACATTTGATAAATAAACATCAGGATTATGTTTATTGTTTGTTTTCAGTCTTTGG 
A7TACCAGTATATGGTTTCCGAGGAAGATTATCTAGTGCAAAACACCACTGTTGGAAAAATAG 
GTATTTTTAAATT G Hill AATCTTTTTTGGTGCTTTT AAACATGTTTAGGAAAAACCAATTCAG 
TTCCATTCCCCGCAAAAAAGCCCTAACTTT ACTCTGAAC 1 1 1 I 1 1 IG I I 1 1 1 GCATTCCATGAG
GTTCTGTATTGAGTCATTCTCTAGGTAATGTCA1 1 1 I I GTACACATATATTTATATAATCACTGA 
TTGAGATTTAGGAAAAAGCATTTCT AAAGAATATTTGCTTCCCTTAGAACTACAGACTCGAAA 
TCTTT AAAGATGGTGCCT AAGCATCT ATGT A 1 1 II 1 1 1 1 AAGTTCCACAGA 1 1 1 I ICTGTTGGG 
CAGCGAAGGATTATAAACCACTTCGCTAAAGGGAACATTAATGGAAAAGTCCCGAGATGGCA 
ATACAAAGTATCCCCTGGTACCACATATATTCATTTGTGAGTTT GGATATAGAGCAGATTATC 
TAAACCATTTTGTAGTTCCAAAAACCCATCTAAATT^ 

TACCTGGAGCCTGGAGCGACTTTAAGTTGTACTTCTGACTAAACTGGAATTA7GAGTGAGGA 

GACAAACT7TAAGAGCT AjCTTGAAATAACAGAAGTCTTGATTAATATGCAAATAATGGCTAGA 
aagtatggttt aactggagcct ATTATGOCTTTTAAAAATAATTTCAGTAACCCATAAATACAT 
GTTGTAAAAAATTCAAATATACAGAATGGAATAAAAAAATGATCTCC<nTTATTACCCTCCCAA 
AGGTTACC^GCGTTTGAATTTAATAATGTATATTCTTTCATGCTTTT^ 

AGTGTGAATATGT AAAGGG I UGH 1 1 GTATACAAATG GGATTAT ACT AAAATAAGTAATGCCT 
ATTrTTAAGGATAGGTTAAATTTGTGAATGATCATTTGAAATATATTGAATAAAATAAGCAAAA 
GCTATTGTT ATTTACTGATCCTGGATTGTCTTAAAGAAAGTACTTGTGATATAAGAGGACAGT 
TCTACTTACTGCTAATGACAGAATATGT ATTAAAOGTATTTTCATX^AAATGAAATCTTTAAAAT 



ATAAAATCAGTATAAAATCAATGCAGATGTTAGCAAAAGAATGAGACTAXjrTCATATGTCCTA 
ATTGGAAAAAG^CTCAAATAdTTAAAAAAACAAAAAACrrGGAACAGTTTATTATACT 
TTGTGAAAATATACAAAATATTGAAATAAAGCAATAC^



>1238 

acaaagctagaagcagcctg gtccagatggct atac aaacccgaaactgtct acac 
ccagactttattcttct acaaccaaattoctcaaacacacaa tctgaa cagtagcagtgaaag 
ggagtttaaggtgggggtgaaggagaaaggagtaatatggttttttagtaatatagtaattta 

CATTTTCAAAAATCTT CAGTGTTTCACT ATCTT AAATGAATATTTATTCTGGTGGGCCTCCCAT 

TTCTCTTCTCCTGTTAAAAAAATGACTGTTATTAACTCCAANN 

>1239 

NTCCCCGCGGTGGCGGCCGCGGGAACGCGGGGCCCATTTCCCATC 1 1 I CI II ICA 



TGTCCTCATTCTCTGC»CCTTrCGCAGAGCTTCCAGCAGCGGTATGTTGGGCCAGAGCATC 

C<5GAGGTTCACAACCTCTGTGGTCCGTAGGAGCCACTATGAGGAGGGCCCTGGGAAGAATT 

TG CCATTTTCAGTGGAAAAC AAGTGGTCGTT ACTAG CTAAGATGTGTTTGT ACTTTGGATCTG 

CATTTGCTACACCCTTCCTTGTAGTAAGACAGCAACTGCTTAAAACATAAGGATGTTTCAGTT 

CCTCCATTTAACAGATATGAAGAGCATTTTAAGAGGTGCAGCCTCTGGAAGTGGATC^AACT 

AGAACTCATATGC^TACTAGATATGTTTGTCAATAAACTTAAAACATAAGGATGTTTCAGTTC 

CTCCATTTAACAGATATGAAGAGCATTTT AAG AGGTGCAGTCT ATTGGCATTTCG CCCTGTAG 

tgagtcgtattaatggacaggatactctctgtgaattagcttattgtggctccggggccagc 
CAT C I 1 1 i ctggtcaacggggtgaagtctctgcgogggtcccccatatgaggggcgtggta 

TTAATTCCC<^TAAGCCCGGGTACAAC^GGTTTATGTATGCGGCAAAATACAAAAAACCTA 
CCTCTCTATTCTATCTCCCT ACCATCACCCTT ATTATAACATTGACAGCCAGACTAATCATATA 

atatctcttcgacaccacaatatccgaacttggattc^tcaggcgatggaggcacaagccag 
aacgctaagcgggacattcttccattctacacccagtaggtcx^ttcctaaatgatgagtgat 
taatccaacaccaggtc^taacattatatacctcagcaggaattcaaccctagcagataatta 
tagttcagagttatgaagac^tatcggactgtatgacctagcxstttaccacggaatgggga 
taaaggggagtagacaatacgaaaagcxttctgatcgatagattmaaaagtatacatgcaca 
gcgcagtacacacacan 

>1240 
>1241 
>1242 
>1243 
>1244 

agaaataaagtattccaagggttcagaatagaaaatgatttcttgcagcttggggac 
atttgggaaattgggatatcctttggggaaatgtagtaatcagtatattctgggaaaacatag 
tagagaatgaataaataaattgcattg^atttggaatatgttgtccattctccctgtaactaat 
gctatcaagataaagtagaaatac^catttcagaaacagctggagtagac^ggtcttcata 
ggctagcttggaaacctaatagctattaataatgaaattttaattatactctggatt 
ATGAACACACAGTGATC 1 1 T 1 1 gacttgctgcttgtttataagttggggatggtctgaattca 
ttgttgattcaaaagaaaacgaatagtgttrtaaaccttttattt acagagact ataatttgtc 
accttcctaccaacatatcttaaatacao^gac^ 
cctgcccgggcn 

>1245 
>1246 
>1247 
>1248
>1249 

NNAACTTATTTAAAACAAAACAATTTTGTAGGTATTATTATAC^ 
GATAAATGAGACCAATAGAAGTTAAATAACTT GCCAAAGGCCACACAGCTGGTGAGTGATGG 
AGAAGGAATT AAAACTCAAGTGJVGCAT AATTCT AAAAGCCATCTTCT CGTT AGTGTTTCTCAC 
T ATCCAGGTCT GCGTTTGCCTT ATTT AACTGAAGTTAAGCCATCGTTACCT GTGATCACCT AG 
CGTCTCAGTTTGGGGGGATCATTAGAGCGGG I I I 1 1 AACTCCCAATGTTCTGGTCCAGTTTG 
CTTT ACATGTTCTT AtTTATACATTGTCAAGGATGACCT CAGGACAGTACAGCAAGGACACAG 
TGGCACTTCACA I I 1 l (il 1 LX^ACGAAATGACTGGGGCATAATCTCAGATCATCTT CCTTTAG 
AATGTGGAAACATCAGCAGAAGAATATTAGTCTTTATACAAGTCAAATCCAAAATGACAGATG 
TGAAAACTAATAGAGCTGACTTTGAGCCATGATAGCTTTGGCACACCTCA 
AACCTCTCTTCCGTCAACGGAGAGCTGCATTCCTGGGAATTTCTGTTGTGCA CI 1 1 ICCCACT
TGCCCT GCT GTCATTTAAAGGTGAACATTCTAGTTTTGCTrAAGAAAACCGTTTCCTTCATTTG 
GAATGAACAGCAATTTTATTACTrTTGACCTTAAAATGAGTTTGCTGC 

CGCGTTCATCACGCTCTGCTTCGGGGCGATCT 1 C 1 1 GCTGCCAGACTCCTCCAAGCTGCTCA 

GCGGGGTCCTGTTCCACTCCAGCCGCGCGTTGCAGCCGGCX^GCCGACCACAAGCCCGGGC 

CCGGGGCGCGCGCCGAGGACGCGGCCGAGGGGCGAGCCCGGCGCCGCGAGGAGGGGG 

C^CCCGGGGACGGGGAGGCCXiC^XrrGGAGGACAACTTGGCCAGGATCCGCGAAAACCAC 

GAGCGGGCTCTCAGGGAAGCCAAGGAGACCCTGCAGAAGCTGCCCGAGGAGATCCAAAGA 

GACATCCTACTGGAGAAGAAGAAGGTGGCCCAGGACCAGCTGCGTGACAAGGCGCCGTTC 

AGAGGCCTGGCCCCGGTGGACTTCGTGCCCCCAATCGGGGTGGAGAGCCGGGAGCCCGC 

C^CtSCCGCCATCCGCGAGAAAAGGGCAAAGATCAAAGAGATGATGAAACATGCTTGGAAT 

AATTATAAAGGTTATGCCTGGGGATTAAATGAACTCAAACCT ATATCAAAAGGAGGCCATTGA 

AGCAGTTTGTTTGGTAACATCAAAGGAGCAACTATAGTAGATGCC^TGGATACACTTTTTATT 

ATGGAAATGAAACATGAATTTGAAGAAGCAAAATCATGGGTTGAAGAAAATTTAGATTTTAAT 

GTGAATGCTGAAATTTCTGTCTTTGAAGTAAATATACGCTTTGTTGGTGGACTACTCTCAGCC 

TACTATCTGTCTGGAGAAGAGATTTTTCXaAAAGAAAGCAGTGGAACTTGGGGTAAAATTGCT 

ACCTGCATTTCATACTCCCrCTGGAATACCTTGGGCATTGCTGAATATGAAAAGTGGTATTGG 

AAGGAACTGGCCCTGGGCCTCTGGAGGCAGCAGTATTCTGGCAGAATTTGGAACCCTGCAT 

TTGGAGTTTATGCACTTGAGCCACTTATCAGGAAACCCCATCTTTGCTGAAAAGGTAATGAAT 

ATTCGAACAGTACT GAACAAACTGGAAAAACCACAAG GCCTTTATCCTAACTATCTGAATCCC 

AGTAGTGGACAGTGGGGTCAACATGATGTATCAGTTGGAGGVVCTTGGAGACAGCTTCTATGA 

GTATTTGCTGAAGGCCTGGTTAATGTCTGAGAAGACAGATCTGGAAGCTAAGAAGATGTATT 

TTGATGCTGTTCAGGCTATGGAGACTCATTTGATCCGCAAGTCT AGCAGCGGACTAACTTAT 

ATCGCAGAGTGG AAAGGGGGCCTCCTGG AGGAC AAG ATGG GCCACCTGACCTGCTTCGCT 

GGGK3GCATGTTCGCACTCGGGGCTGATGCAGCTCCCGAAGGCATGGCCCAACACTACCTT 

GAACTCGGGGCTGAAATTGCCCGTACTTGTGATGAATCATATAATCGAAGATTTATGAAACTG 

GGACCAGAAGCTTTCAGATTTGATGGTGGTGTrGAAGCCATCGGTACAAGACAAAATGAAAA 

ATACTACATCTTACGGCCAGAAGTTATGGAGACTTACATGTATATGTGGAGACTGACTCATGA 

TCCAAAGTAGAG GAAATGG GCCTGG GAAGCCGT AG AG GCCTTG GAAAA CC ATTG CAGAGTG 

AATGGAGGCTATTCAGGCCTAAGGGATGTrrACCTTCTTCATGAGAGTTATGATG^TGTGCA 

GCAGAGTTTCTTCCTGGCAGAGACATTGAAATATTTGTACCT AATATTTTCTGACGACGATCT 

TCTTCCACTGGAGCATTGGATCTTCAATAGCGAGGCACATCTTCT 

ATAAAAAGGAAGTTGAAATCAGAGAGG^ATAAAAAGACATTTTATATTTTATTCTGCTCCATTC 
GCTTCACTGTATACCTT AATAATTCCTTTTCTG<^AATCAGGC^CATGATGAACTrTGATTAGT 
AGGTCTGTGATTAAG 1 1 C 1 1 AAATTGTTTTGCAGTC 1 1 1 1 ATGTTTATTATCATA6GTATAGGT 
GGACCTAAATTCCTT ATCAT ATCCTTTATT AATT CAGCCAGTGT ATCCACCAG I Mil IGTTTA 
TGTTTTTAAGTAACCTATTATCTCTGGATTTCATGAAGGTGTAATATC 

ATAGAATTGTATAGCX^TGACCTCTTAATTATAATTTGATTTGACT GCAAAACTTTTTCCTCCT 

CTAAGAGGAGATGATGTCTGCTTTAAGCTGTAATGTTTTGCCATGTTGCAAAAAGCCATAA 

ATAAGTATAAAAAAGCTTTTTCCTTTACAATTTCATGTTAATCT 

ACAGATGTTCTGTGACAGCXrrCGTTATXK^GGTCTATGATTATTTGATAGAATGTCTTCT 
AT ACTTCACTCACATTGTAATTCAAATTAGAAAGTCATTCCAAAAGGATCATGT CATGTTGACC 
TCATTTCATCGGAACTGCAGTAT AI ITTTGI I GGTTAATTATATTAGT G 1 1 1 1 CT ATTTTGT AAA 
TGTGT CCI 1 1 AATTTrACTTTAAATGCCCTGTGTCATTTCTGGATTATATACTAGTTAA 1 1 I C I I 
CCATTCCCTACTACACAGAGAGGTGAGCTrTCAAATTTTGCAGAGCTCT 
ACATTTATCTGAAGAAAATAGTACAACTT AATGGATTAGCTTTTGGGTTTAACT GAATATATGA 
AGAAATTGGGTCTGTCTAAAGAGAGGGT ATTTCAT ATGG L 1 1 1 1 AGTTCACTTGTTTGT ATTT 
CATCTTGAI rTTTTTCTI I GGAAAAT AAAGCATTCTATTTGGTTCAGATTTCT CAGATTTGAAAA 
AGGCTCTATCTCAGATGTAGTAAATTATTTCCTTTCAGTTTGTGAAAGCAGGAT^ 
AAAGAAGCTTTGCCAATTTTACTTATTCGTGATCAATCAAGGAAAATCT 

AAAT AAGAATAT AGCAT ATTT AGTATGGTT AT AGTCAACaCAGAGATCACAACTT AGAAGAAA 

TATAAAGAAATGGCCACTCXXXIATCCCCCACAGTCCTGGAGT AAATCAAAATCAATAT ATGAT 

TCTTTTAAACATTAAGTTTGAAATAGGAATGG unci CAAGAATAGATTTGGTGTGAT acctt . 

GTGTTTGCTTACATTGGOCCACTATATATACATATATATTTATGTAGATAT^ 

GGCTAAT ACt^TGCAT ATACTGAAGGGCAAGGACTTTGACCATGTCAATTTTCA6CC - 

TtSGTCAGAAAGATCAGTACAACCOCATGGATTAGGCTGAAACATATGAAATTGCTGCATTTG 
TAGTTTAAAAACTGTCAGCAGTTTCATATGGTTCCACCTAATATTATTGAAGACAATTATTTTC

TTAGCT ATCAATAGGCTTAATAGTTTTAGTTATTTTAGCTTTTGAAAGTG 1 1 I IAAAAG ATTTCC 

TTTATCGG^CAGGACCATCTTT AT GACCTGCTTTCTGTTTTTCAAT ATCAT ACATTGGTGTATG 

TCAAAGAATAAATTAGTAAAATTAGTAAATGAAAAAGACTCTTCCGTACATCATTATTTCCATG 

CTAATGTGTGTCTGTGATCCAGAATAACTrCTCCCACTCATATCTTT^GTTCACCTAATGAAA 

TGAATGGATAGCAAGAGCJCCTrTGTTCCCAGGACTrTAAGGCAAAATATTAAAAATrATTGCC 

AAAATTAAGAATATAATGTTTGTATAAATGTCCTTGAATTTGCCATTTAAAT^ 

TTTCACTTTGATTTGAAAGCTGATAAGTATrrCTGCAGCAGATAGAATATTAAAATCAGGTTGT 

GTGTAGACACTGCACTATGAGGTACCTTGGTGTCCTGGTGTGAATAGACAAGAAGCTGTACT 

AT ATGTTGCTCT CT CAGTGGCAACAATG AAGTTTTTG CAATTCTAGAACTTG GA J 1 J I 1 1 1 I I A 



TTAATTAAAGTTATATTTTGGGTTAATTTTAACAACTGAAGTCTTATTGTTGAAACTTATTTTCA 
ACAAAACTGTGCAGTTAAATTTGTATACGTATTCACATACTGAAAGATGAACCGTTAAAATAG 
CACTTAATTTTGTGTTTCTTCAATATGTCTTGATATACTTTGTGCAATTAATATTACACATGTAA 
GTTGTATGGCAGTrrACAGAACTCAATGACTTGTCATGAGGTTTTCATATGAGCTACACATTG 
TGTACATTGATTG 1 1 M I IAI 1 1 1 I AC^TAAATCGATTCrrGTCATTTTCAACTTTATATATAAATC 
TCCAATGTTATGGGAAACAATAGATTGACACATAATTTTTAAAAATTATATTTGTAAAATTTCTC 
TATTGTGAATAAAGTCTTTTAATATATCTCX^AAAAAAAAAACCACTCCACCTT ACT ACCAGACA 
ACCTT AGCCAAACX^TTTACCCAAATAAAGTATAGGCGATAGAAATTGAGACCTGTCGGAATA 
GATATAGTACGGCAAGGGAAAGATGAAAAATTATAACCAAGCATAATATAGCAAGGACTAAC 

ccctataccttctgcataatgaattaactagaaataactttgcaaggagagccaaagctaaga 
cccccgaaaccagacgagctacctaagaacagctgagcacacccgtctatgtagcanaata 
gtgggaagatttataggtagaggcgacaaacctaccgagcctggtgatagctggttgtcca 
agatag aatttttgtcccnnnn nnnnn 

>1250
>1251 
>1252 
>1253 

tatagggagtcx^ccacgcgtccgtaatttcatcgtgtatgtcacgggctatttcat 
gatttc^tgtaccacatcgtatatttcgccttgaagattcctgaattaaggaattctataataa 
cttcagatagcctacatgtcccovtttagtggagacctcttgaatgct^ 
tatcattmatgaccattctcctttgaggaatactatgcccaggtacatg 
gccgggagagtggatt ci I I ic i I cactgcagagtcatcacactgttggaatagtctgctct 

TTrACATGCTCAGGTAGGG^AAATAGGACGAAATATATTrCCACAGTGCCT^^ACTGTGTCA 

TGTTTACAGTGAGAGTTTAAATATTGTTGATGTCCTGACTCrGTGAGCT 

TCATAGTAATGAC*TTTGATCAGCCATAAAATTTACATT^^ 

AGTCAGGTAATGAATACCCTTGAAGTGAATAGCAATTTTGATTTAGGCAGTGTGTTAGGCCAT 

CCTTGCATTGCTATAAAGAAATATTTAGGCTGGG<X3TGGTGGCTCACGCCT 

ACTTTGGGAGGCTGAAGCGTGTGGATCACCTGAGGTCAGGAGTTCAAGACC^ 

>1254
>1255 

h«^CG(>ACCCCAATACCCCCCATCGACW 
TACGCCCGCGGGAGAGCCCGGGACTTCCACTCGTGCGT6AGGCGAGAGGAGCO 
GAGACCAGAGGCXXsAACTCGGGTTTJTGACAAGATGGCCGGGCTGCC 
AjGGAAACCCAGCGTTTGCTGGGAGAACCAGTTCCTGGCATCAAAGCCGAACCAGAT 
CAACGCOCGTTATITTCATCTGGTCATTGCTGGCCCTCAGGAT^ 

CTTTT AAACTTGAACT ATTC<rrrCCAGAAGAATACCCAATGGCAGCCCCTAAAGTACGTTTCA 
TGAGCAAAATTTATCATCCTAATGTAGACAAGTrGGGAAGAATATGTTTAGATATT^ 
TAAGTGGTCCCCAXSCACTGCAGATCCGCACAGTTCTGCTATC^ 
CTCCCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTX3GAA 
COUSGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCATGMTAATA 
TACGATCATCAAGTGTGCATCACTrCTCCTGTTCTGCCAAGACI I (JC 1 CCTCTTTGTTTGCAT 
TTAATGGACACAGTCTTAGAAACATTACAGAATAAAAAAGCCCAGACATCTTCA . 
TGATTAAAtGCACATTAGCAAATCTATGTOT^ 

AGGCTAGAAGTATCATCTGGATTGTTGTGAAACX5TTTAAAAGCAGTGGCCCCTCC^ 
TATTCATTTCCCCCATCCT GGTTTAAGTATAAAGCACT GTGAATGAAGGTAGTTGTCAGGTTA
GCTG CAGG GGTGTGGGTGT T T T » ATTTTAriTT ATTTT ATTTT ATTTTTGAGGGGGGAGGT AG 
TTT AATTTT ATG GG CT CCTTTCCCCCTTTTTTGGTGATCT AATTG CATTGGTT AAAAGCAG CT A 
ACCAGGTCTTTAGAATATGCTCTAGCCAAGTCT AACTTTATTTAGACGCT GTAGATGGACAAG 
CTT GATTGTTGGAACCAAAATG GGAACATT AAACAAACATCACAGCCCTCACT AATAACATTG 
CTGTCAAGTGTAGATTCCCCCCTTCAAAAAAAGCTTGTGaGCATTTTGTATGGCTTGTCTGGA 
AACTTCTGTAAATCTTATGTTTTAGTAAAATATTTTTTC 

T ATTTT ACT GTGTTTATTTCATTTTCCCAATTTGACAATCG T ATTTTAAAATTGAAACTGATGGA 
ACATTCTTTCTTGGTCTTCACCATCTGACAAATTG AATGG CAAGAGGTGGATTTTGCCAGTTT 
CTTTTCACTGATGCAGATTTGTGTTAAGATAGTACTGAATGGAGTATTTATAAACTGGCCCTG 
AGCATGCATAAAGCATCAGTATCTGACCI MM 1 1 aaccttctaggaatttgaaataaatgtgt 
ttgtgttgtctgattagatgatcattggtgtcttgcx^caatgtttaaaaattactgtacagg 
aaagtcacagcaaagatagcagttgtgactgacatgtaggactttcacagttgtgccacatt 

TTTGCCTAAAATTTGGGTTATGAC AI 1 1 1 > C 1 1 GGTTCTTATCTGAAAATTTCATCTGT AACCTT 

tcatgtgtgtttagaaac^tggatctgatcatttgggattatgctgaggatttgtgagtcttc 
cttattatgctgatttagggaatttcaatattatactgggtggttatcagaaacgggttcctt 
ggagaatagttccacgtaaatttggtaaacccaggattgtggggaggcgaannn 
>1256

NAGGAGACT GAGTTTATTCAACAAGTATTT ACTGAGCCCTTACT ATGTTCCCGGCAC 
TATTTT AGGTGCTGAGAGT ATGTTGCATTGGTTAAAAATATACAAAATTCCT GTCTTCATTGTG 
CTT ACACTCCAGAGAAAG GAT ACAAG ACAAT AAACAT AAT AAGTGAG T AAATT ACAT AGT AAT 
TTAGAAGGTGATAACTATTACCGTACAACAAAATAGAGCAGTCGTTGAGAATTAATGGTGACA 
TG TCTTATCTGGAAATTTTGCAAAACCAGAAGAGGAAAAA G TAAACTT CT ATCTTGGAAGACT 

atatatatatatatocgtacat aacaaa ggaaaatatttctggctctrraatgttcatgggatt 
ttcaagcaagaaattaaagggatttttaaaacatgaaagtaatactgaaaactaggaaaagta 

GCTTTTGCAAAATACGTGCTCAATACCAAGTTATTGAG^aCAGTGCGATCTGTGCTTACTGCA 
GGAGTAGAAGCTTCCACAGTTCTAGCAAATTCCATGATT 

GGATCT AGAGAGGCAAGTGAGCTGAAGAAAGACTGCTAGGGACAGACGlAGAACTCAAATAG 
CTTTGAGATTTTTTTATTTGCACTTC O I M i l 1 AAAATACATCAGCGTG C I II 1 1 > I G TGTTAAA 
GGGTGAGCT CATTAAAAAGGAAAACTAAAAAAAAAAAAAAAAACACGGGAACACCCAGGTAC 
CCAACTCCACCT CCCCTTTAAAANNNNNNNN 
>1257 

N^^ITTCAAACCTCAGTTTGAAAATGAGAGGAAAACAAAATAAAATGATTTACATAATC 
AAAGGATT AACTGAT AGAGACTTTT ATTCT AAATG CTCACAAGCACAGAAACCAACAAGAAAT 
CAGATCTTGAACGAATTTATAATGATTCrrCCAGGAAGCACCGCGGO 

TGTrCACACCTGGCTGGGTCT GCCAAGTTAGTCCTGAAAGAGAAAACAAGGAGGAAAAAGAC 

AAAAAAAAAAACAAACGACCAAACCCAGTGTGCTT AAAACACAGATCAN 

>1258 

>1259 

. AAAQC O GAfiGT CTCGCTAAAATCATCATGGATTCACTrGGCGCCGTCAGCACTCC^ 
CTT GGGrnTGATCTTTTCAAAGAGCTGAAGAAAACAAATGATGGCAACATC rtCTTTI OCOCT 
G 7 GGGCAT ATTGACTGGAATT GGCAT GGTCCTCCT GGGGACCCXJ AGGAGCCACCGCTTCCC 
AGTTGGAGGAGGTTGGGCGCAGTGAGGGGGCTTCC I lb I 1 1 GG 1 ATGCACAAATTCATTTG 
GCGGGGGGGTTGTCAGCXXrrCrrGCATTATCTTAAAAATACGCT 

ACGAAACAGAGACTTT CAAaACAAGGAGCAATTTCAGGTGTTTCACTCTGAAAAAGAGACGA 

AGAGCTCAAGAATAAAGGCTGACGAAAAAGAGGTGGTAATAATAAAGGCTGAAGGAAAAGA 

GATTGAGAACACAGAAGCAGTACATCAACAATTCCAAAAGT^^ 

CACTAATGATTATGAACTGAACATAACCAACAGGCTGTTTGGAGAAAAAACATAC^ 

TCAAGTAAGTrTGCCATGCCTACCATATCTGTGA^ 

TGGTAGGACT ATGGGTCCTGAAGTCGTGCTGCCT GGCTCTGGCCACATCCCTGTGGTGCTT 
TTCCA'TCCTGATCTACAGATATTCAGAACTGCAGGGAGTT<XTTTT AGTCCTGGCAATCTGAA 
CCTGA 1 1 1 1 1 GOG 1 CATCOCCAGAATAGCTGCATAAAAATGTGGAGCANN 
>1260

ACTGGTGGGATTGTT AGACCATCCCAAAAAGGAAQTGCACCTTGGAGTCTGTGGAG 
CTCT caagaatatct ctttt ggacgtgaccag GAT AACAAG ATTGCCGTAAAAAACTGTGATG 
GTGTGCCTGCOCTTGTGCGATTGCTTCGAAAGGCTCXITG^TATGGAC^ 
accggtgagttctaggcctaaggaaaattgctaagtcagtgttactcrrctagtgatgttgag
aactagagggatttccagaccttttacttttgatgaaaggttgtgaactggtggctgtgggt 
caaatccatctcacagatttgtttggatcacacagcn 

>1261 
>1262 

acac tccatcaagcctggttcctaggatgctggacttctagcttagtgagaatgcag 
tatactttttgaaaacttcgtgcaggaatgcctcaaatgct gtaactaggaatgggtcagtga 
agttcaaaggacttttccttgagggagtattttaatc^gacaagggaactc i 1 1 1 1 CI 1 1 1 GG 
gcartgg(xaacagx5actgagaagccagagagcttgca(xrtga 
gtaacagtcctaggaaaatagatgggggctgggggtaaggaaatgtgctgaagacagagct 
attctggatggattttggtttgcaaaaattctactttaaaacaa 

TTTTTTTG CAATTGGAGTG T AAACATTCTGTGTG GCAACAGTTAAAAGCTGTT AT AACAATTTG 

cttgggtgtgtgtgcatgtgtgtgtgtgtgtgtgtgtgtatacannknnnn 

>1263 
>1264 
>1265 

NNNNNNNNNhWNhKaCCCGTCCCGGTCCGGAATTCCC^ 
GCCGCTTACAATGAGAGC^CAAG^GATGAAGAAATTCAAAGGATTGAAGAGGTT^ 
TCTGCTAAAGTGGTAGATGATGAAATTTACTACTTCAGAAAACCAATTGTTCCTCAGAAGGAG 
CCATCACCTTTGCTGGAAAAGAAGATCCAGTTGCTAGAAGCT AAATTTGCCGAGTTAGAAGG 

TACCTCAGCTTCAGACCCXXCTGGGCAGTGAG^ 

GTCT ACCCCAAAGTCTGCCAAAGGCAGTGCAAAGAAGGAAGGCTCCAAACGGAAAATCAAC 
ATGAGTGGCrACATCCTGTTCAGCAGTGAGATGAGGGCTGTGATTAAGGGCCAACACCCAG 

CAAGAAAGCAGAATATGAAGGCATGATGGGTGGCT ATCCGCCAGGCCTTCCACCTTTGCAG 

ggcccagttgatggccttgttagcatgggcagcatgcagcx^cttca<xx;tggggggcct 

CCACCCCACCATCTTOX3CCAGGTGTGCCTG<MXrrC^ 

ATGAACCAAGGAGTGGCCCCrrATGGTAGGGACTCCAGCACCAGGTGGAAGTCCATATGGAC 
AACAGGT6GGAGTTTTGGGG<X^C<>GGGCAGGAGGCACCAC^ 
ATCCAGCTGGAOCCCCTGTCATACAGCAGCCAACAACACCCATGTT^ 
AAGACCCAGCGGCTTCTTCACTCAGAGGCCTACCTGAAATAGATTGAA 
AGTCC^ACAGCATT AGCAAGTGGGATCAGACACTGGCAG CT CGAAGACGCGACGTCCATTT 
GTCGAAAGAACAGGAGAGCCGCCTACCCTCTCACTGGCTGAAAAGCAA^ 
C>C(^TGGCAGATGCXXrrCTGGCGCCTTCGAGATTTGATGCT CCGGGACACCCT CAACATT 
CGCCAAGCATACAACCTAGAAA ATGT TTAATCACATCATTACG 1 1 1 C I 1 1 I ATATAGAAGCATA 
AAG AGTTGTGGATCAGTAGCCATTTTAGTTACTGGGGGTG 
TAATTTTTATTGCATTTTACTGTACATCACAAGGCCATTTTTATATAC^ 
CTATTTCAATTTGTTTGTTATATTAAGTTGACrTTATCAAATACACA^ 
TTTCXJTTCGTTTAAAACCAGTTTCATAATTGGTTGTATATGTAGACTTG 
CT TGTTG CC ATGGAAC TGAAACCATTAGAGG nTTT G TCTI GGCTTGGGGTTTTTGTTTTCTT 
GGTTTTGGG 1 1 1 1 M I ATAT ATATAT ATAAAAGAACAAAATGAAAAAAAACACACACACACAAG 
AGTTTACAGATTAGTTT AAATTGATAATGAAATGTGAAGTTTGTCCT AGTTT ACATCTTAGAGA 
GGGGAGT AT ACTT GTGTTTGTTTCATGTGCCTGAAT ATCTTTAAGCCACTTT CTGCAAAAGCTG 
M ICI I ACAGATGAAGTGC 1 1 I CI 1 1 GAAAGGTGGTTATTTAGGTTTTAGATGTTTAATAGACA 
CAGCACATTTGCTCTATTAAGTCAGAGGCTCACTACAGAAATATGT AATCAGTGCTGTGCATC 
TGTCTGCAGCTAATGTACCrTCCTGGACACCAGGAGGGGAAAAAGCACTTTTTCAAT^ 
GAGrrTAGACATCTGTGAGTTAGACTATGGTGTCAGTGATTTTTGCAGAACACGTG^ 
CTGAGGTATGTTTAAT CTAGG CAGGTACGTTTAAGGATATTTTGATCT^ 

CAATTTATGCCTATAAATTTCAGATGATTTAAAATTTTAAACCT GTT ACATTGAAAAACATTGAA 

GTTCGTCTTGAAGAAAGCATTAAGGT ATGCATCGAGGTGATTTATTTTTAAACATAACACCTA 

ACCTAACATGGGTAAGAGAGTATGGAACT AGATATGAGCTGTAT AAGAAGCATAATTGTGAA * 

CAAGTAGATTGATTGCCTTCATATACAAGTATGTTTTAGTATTCC^ 

G TAI ITTT TCn 1 1 AAGTTTCAA TGTTQ TTATAATTCTCAACC^SAAATTTAATACTTTCT 

TATTTITTAAATTTAGCTTGTGCTTTTGAATTACAGGAGAAGGISAA 

CTTACTAGAAAGACCATTACAGATCCCAAACACTTGGGTTTGGTGACC 
GACCCT ACAAT AAACATTTGAAG GCAGCAT AGGATGGCAGACAGT AGGAACATTGTTTCACT
TGGCGGCATGTTTTTGAAACCT GCTTTATAGTAACTGGX3TGATTGCCATTGTGGTAGAGCTT 
CCACTGCT GTTT AT AATCTGAGAGAG TT AATCTCAGAG GATGCTTTTTT CCTTTT AATCTGCT A 
TGAATCAGT ACCCAGATGTTT AATTACTGT ACTT ATT AAATCATGAGGGCAAAAGAGTGTAGA 
ATGGAAAAAAGTCTCTTGTATCTAGATACTTTAAATATGGGAGGCCCTTT AACTTAATTG CCTT 
TAGTCAACCACTGGATTTGAATTrGCATCAAGTATTTTAAATAATATTGAATTTAAAAAAATGT 
ATTGCAGTAGTGTGTCAGTACCTTATTGTT AAAGTGAGTCAGATAAATCTT CAATTCCT GGCT 
ATTTGGGCAATr GAATCATCATGGACTG T AT AATGCAAT CAGATT ATTTTG TTTCTAGACATCC 
TrGAATTACACCAAAGAuACATGAAATTTAGTTGTGGTTAAATTATTTA 

TTTTATTTCXXrrTAAGGTCTGGATGAGAC I ICI I J GGGGAGCCTCT AAAAAAATTTTTCACTG 
GGGGCCACGTG GGTCATT AGAAGCCAGAGCrrCTCCTCCAG GCTCCTTCCCAGTGCCT AGAG 
GTGCTATAGGAAACATAGATCCAGCCAGGGGCTTCCCTAAAGCAGTGCAGCACCGGCCCAG 
GGCATCACTAGACAGGCCCTAATTAAG 1 1 1 1 1 1 1 I AAAAAGCCTGTGTATTT ATTTT AGAATCA 
TGTTTTTCTGT ATATTAACTTGG GGGAT ATCG TT AAT ATTT AGGAT AT AAGATTTGAG GTCAGC 
GATCTTCAAAAAAGAAAAAAAAATTGACTCAAGAAAGTACAAGTAAACTATACACXJ mTTTTC 
ATAAGTTTTAGGAACTGTAGTAATGTGGCTTAGAAAGTATAATGGCCT AAATGTTTTCAAAAT . 
GTAAGTTCCTGTGGAGAAGAATTGTTTATATTGCAAACGGGGGGACT GAGGGGAACCTGT A 
GGTTTAAAAC AGTATGTTTGTCAGCGAACT 

AAAATCCCCAAATTGGTGTTGATGATTTTGAGCTTGAATGTTTTCA 

TTTATTCrrAATTTCTGTATCATATCATCTGAGGTTTACGTGGTAACT AGTCTT ATAACATGTAT 

GTATCI 1 1 1 1 1 1 1 G 1 1 GTTCATCrAAAGCTTTTTAATCCAAATAAATACAGAGTTTGCAAAGTG 

ATTTGGATTAAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 

GGCGGCCGCTTCTAGAGGATCGGAGATTAC 

>1266 

>1267

>1268

>1269 

>1270 

>1271

NATCCCGGlTGCTTGTCTGCAAATTATGAAAGTCATTCTrAA 
TGAAGGGATGTTGATACTCTTTAAGTTCACTGCCAGGCCTA 

GGAGAGTTCCTGTAATGAGAOGTTTTTAAGACGTC l l IIUI I CTGGGATGAATCAGAGGGAA 
TGACTGOCTTGGAGCTCAGGAT ATT AAACTGAGTGGTGT CAAAT ATTCCCAGGATCAAATCG 
A CAATGOC ATGTGTACCATC rC CTOCl ACAGGGATTAAAGGCCTGACATTTAGATATGATTCT 
AGTTTTCTCTCAAATCCATACTTTAAGAGCTTAra 

ATCOCAGCACTTGGGGAAGCCAAGGTGGGGAGATAAGTAGAGGCCAGGAGTTCGAG 

GCCTAGACAAANNNNNNKN 

>1272 

ACCjCGGGGGGTCGGGCTGAGACACGCGGAGCAATGGCGACCTT^ 
TGGAGGCGGCGAAGAAGAACTT AAGCGAGGCCCT GGGGGACAACGTGAAACAAT ACTGGG 
CT A ACCTAAA GCTGTGG 1 1 CAAGCAGAAGATCAGCAAAGAGGAGTTTGACCTTGAAGCTCAT 
AGAC TTC TCACACAGG ATAATGTOCATTCTCACAATG A 1 1 IC t ; i C CI G GCCATTCTCACGOGT 
TGTCAGATTTTGGTTTCTACACCAGATGGTGCTGGATCTTTG 

- - ATTTGATC 



ATAGATTCCAG CCTCA AAATCC I CTCrCAQGAQCCCAGCAATTTGTGGCAAAGGATCCCCAA 
GATGATGACOACTTGAAACl MGI I CCCACACAATGATGCTrCOCACTCGAGGCCAGCTTGA 
AGGGAGAATGATAGT GACTGCTT ATGAGCATGGGCTGGACAATGT CACCGAGGAGGCTGTT 
TCAGCTm I G I C I ATGCTGTGGAGAATCACCTTAAAGAT AT ACTGACGTCA G I I U I G I C AAGA 
AGGAAAGCTT ATCGGTTACGAGATGGTCATTTTAAATATGCCTTTGGCAGT AACGTGACCCC 

gcagccata cctgaa gaat agtgt agtagcttacaacaactt aat agaaagccctccagctt 
ttactgctcqctgtgctggtcagaatccagct^ 

caggctgcac i cc i gctggcatgctocggagacactct acct qcatctttgcctt3cggtga 
acatgtadga i u 1 1 1 1 1 gaagctttgcaggt gcacagggaagtcatccct acacatactgtc 

TATGCTXTT AACATTOAAAGGATCATCACQAAACTCT GGCATCCAAATCATGAAGAGCTGCA" 
GCAAGACAAAGTTCACCGCCAGCGCTTGGCAGCCAAGG 
aggatttgagggtgtgggacxxhtjacxgaattcattgattactgaaaattgaatgt^^

GTCCAC ATTrC AAGGCTGAAGTGTATAGTGTATATATAACCTTTCCTATGGAAATGTGACATT 
GAGTACATTTTGTGTTGCTGTTGTGAAGCCATTAATATAAATCTTTG^ 

TATATGTATGTGTTWX2AGTTGTGGGAGCAGGGACTAATGAAATCCTGTGCCTGGAATGGAG 
AT ATTT AGGTACCTGAGGCTT AGTGT CCTGTGGTCTGCATGTAA GAT AGATGACATCCTaGA 
ACAAAGAAGCTGTTTTAACTTAATCCCCCTGATCVW3GAGGATATCTGTGTGTTCA 
ATACATTCTGTATCrrAGAAGTCTAAAATTTCrrGCCTTTCTCCrrAAAGAATGTG G CATTT 

TG GTTGAAATAACCTACACAGTGTrAAAAATCAGATACCTCCTTTAGTGACCAGTTCAAATTrT 
^TAGCGATAGGTAGOCCCTGAGAAATTTATCACTATAACTCCACAGGAAATATGACTTGGAA 
GTGCTCTGTGTACT AAACAAAATAAAGCCCCTCT1TGCATTTAAAACCAAAGTCAAAACAAAA 
CT CTTGTAATGCAATTAATTAACTTT ATGTCTTCCCATGACTGA^ 

AAAACTTTGATTGGCAGTTrCTTCGGTTAATTATTCCTA ATA 
CAAATTT3GATATATGCTTGGTAATTCTCCAGTTTCT AGGAGGT ACCT ATTTCTACCGTTTCAA 
GTGA TGAAGT SAAAATAATTTACATTCGATAGTGTTACT GATAACAAAOCT ACTTAAGAGATAT 
GTTGCTTTTTACrrTAAGGGATAGTGTTGATAGAT AAATT AG AATG TAT AGATAGGTTTGTGAAA 
GTCTAAATAATGGCTGTATAGATATGTATATATGGTTC>CATATCTGGATCrrGTGTATTTGATT 
TTGTACTTT AAATGTGACAAAT AAACC 1 1 1 1 GGGAGAAAAAAAAAAAAAAANNNNNNNNNNNN 



>1273
>1274 
>1275 
>1276 

^<NNN^9tf>CTAGGA^GGGTGTCGTGTGCATGCGGGTTGAAGAGGATCTGTATTGCC 

gtgacctctgttcatggatgagtgcatrgtaatttgttctcaggct 

cttaacccttgctcccttcccttctagagct 

aggatatcttgcctggaagggatatcttgccctgtttctcaaggtt^ 

ggatgtg gccctgcatgaccctccttctcctgt ac itcct c tt i cctttocaaatgggaatta 

gaactgtggggcagcaacagtctcagagcxiagtgagaggccagctt agagaatgcttctga 

gttagtgggactctgtgtcacaagtaagcaaatgaatatatgaaagaaattatggagataagt 

tagattcttggtaatactt aaatgtgttgctttct actaacc 1 ittgl i act aaaggtaaaggg 

tataactcaaac 1 1 1 1 1 gtggacattcttttcaaaatttttt aagaaccctgt act at aaaag g 

TTGAGTAAAAACAGGAAAGCGTGCTATAAGTTCAAATCTGTTGTATTACCCT AAATT AGATAA 
ACCAACCT GAATTATAGT AGATTTCTCAAT AGATGAGGAACT GAAAAAT ACT ATGT AAAATATC 
TTCCAAAATGCTTTTTATAC I Mill F ATTTGTAATTTGGTCT ATCT AAAATGTTOGTTAGCTTA 
ACTT AATGGGCGrTTATTGGATTCATATGAXrr AACGTTTCCTCAG T ATTGT AATG CTTGAAAT AT 
TTGA AAGAAA AAAT G1 IUI 1 1 1 1 1 AGTTGAAACTGGTATATATAATTCAGTGCTTGGCAGGTTA 
GTATAi I 1 1 I ATGCATTTITCAGAGTCAGCAGTTTCAAATCTTATTGTTATCATGTTA^ 
TTAGCCCACATTTCAGGCTCCGTAAATCATTTGAGOCATTA I MM I CCC AACAAATGGTGAA 
TTTTTTCTTT AAATGTG GATATATA TGTTGT AATTTATGATTCCTGGTT ATGT ATTTTTGTGGGA 
TCCTGCAGT AAAATTGAC I II 1 1 RSTSTCl 1 1 GGGAGATTTAAATTGCGCTAACAGTGTT GCG 
CAAAAATGAGTTCATGCCATTTAACAT ATTGTATTTTAATT ATTAACT GT ATTAATTT ACT ATGA 
AATGGACATCCTTTT AACT AAAATGGAATTGAACATTGCCAGTTTCAAATA rTTTTCCTTGTTG 
GGTCN 
>1277 
>1278 

ACT AAAACTAAAACT GAGCAGTTTAAAACATTCATTTAAAGGGAT ATCT AATGT GTTT A 
TTATTAACATAAATAATGTTTTATGAAAAATGT AACCTT AGTTTT CCAAAACAAAAATGTTTAGG 
GCAAGAGTAACATTATTTTACATTATTGCATCTCAGT A 
TATCIGCMCIGCAGTTAATTCTGTTCA1 1 II GM 1 1 GG M GAAATATATGAAGGAAATCTGTC 
CTC^ CACAGTTG TGTAGTGGAAAAAQSGGGACT ATTGT AACaGGCTGTGCACATAATTGTGG 
ATGA 1 1 1 IUMIGATACAACAACAAAACTTN 
>1279

* N^<CTCT^>AGGCAGAACTTGCTGAATrTGAAGACT ATTAATGATTJCAGCTTJTCACTG 
GT ^CAAtaTGAAAAACCTCriCA CCCC^ ACACACCATCTTfAiAGAAAGCTTTA A 1 1 1 rSGI 1 I I I 
CAGAGCAAT ATTCACATGACATC1TTCGGATCTGGAATCTT AA C 
TACCTGAACT AAAAATAAGTGGTCTCT ATT^ACGTTGAAATTACTTT AATATTTTCATTTATGTTT 
atactatttatttaaatgcggagaacatgoctaaccattcaggaccatttaattatcaaattat
i aatgattcttccat aataaacaagc aaactgtcatactt gttgactt atatttcattccatgg 

AATTCAAA TTTAATT AATTGATAAATCACATTGTACTTTATAAGAGTrTATTCCTTAAGTGTTTG 
AATTTACACI 1111 1 CTCCTGTGACAGCATCTATTCCCAAGGCAATCAACATTTTAATGGAACG 
AACTGAGGTTTGCAATTGCTAATGTTATTCTCCCATCCCAGTTTCTACTTGTTTCTTGTAA«AA 
A GGAGG TTTCT AAATCTTGACTGTTGATTGACAATTAATGGGCT AAATTTCCCATTAGCAAAC 
ACTTTTCAGATATACACTAATATTAGTATCTAAATTATATTAAAATCTAATTTGGTAGCAATTGA 
CTTG AAATTTGTGATACTTCAACTTGTGCTAGGTTGTAGTTTT g CTTGTATCACT CATAGTTAC 
TAAAAATGCAAAAGGCAAATTTGAAATAACTATAAAAATAAACATTTAATGAACTTTTTAATATG 

caatactc^ttgtaaataaccattctctttactgaatcagatgtttcattaactgttagcagt 
tatttcgagttatttatatatcacagaattttgaactaaagatcatctagctgaatttcctrrat 
attaaag aag ag gag aaaatctt aatactaagt atgtgtgagcctg gagctgccat g g agca 
tgta gagagagtctcctagttgataccaactcaggaaagcagaatcaagcaataaagagaca 

TTT TGGGC CTGATGACATCATTTCAGCACCTGGATCCAGCGATGCCTAAAGATGTGACTGGT 
AGACTTTTCACTTACATGAGCCAAAACGGTTCCTTTGCATCAATAAATCGCTCTGTGATGCAT 
TTTATAATGTACAACCAAGAATCTCAATATTTATTATTTTGCATTAAATATGAGTAAGTTTGATA 
TAAAAAGCTCAGTAAAATTCATTAATTTGAGAATGTGTCT atatcatagtaaagttttaaacct 
AGATAATTTTCAATATTTATACCTTTTACTGCAAAAATAAAAATG AGTTTATCCTT G CCCTAAAG 
CTGTAj^rTAAACT 

>1280

kaaggcaatttaataagatttgagcatagatattaaacttagcatggacagagaaact 

TAi i ivi iGGGGGACTGGAGTAGTGAAAGAACAGAATCAATATGACTAGAAAGAGCAGAAAA 

acttacaacagct aatact acttgctacattgctgtagctttaagattgagggaggagg acta 
gagcgagcctgagatcttctgggtcagtttgatctaggcgtt c 1 1 c 1 1 c 1 1 cm ct agatctg 

CACTTT AAATACTTCTGGTGTCTTTATGACTTAAACGCAAATAGCTT AGGCTTAGCTTTTTCTC 
TTAAGGGTTT AAGGAGTGAGAGCAGAGCCAAGTCCTGGAGACGCTGAACTTGCTGTCACAC 
CGGAAAATGAGATGTGCGGGGTAGGGGGCAGGGACGAGGCGGAAAAGGACTACTCGGATC 
A i 1 1 1 1 AAGATGGGAGAGTAGCCACAGAGGAACAGAGTAGGAATCTAAACGAAGTAAAGCAG 

tacgggggtaggtttccctacacagtgttttacttaagggcacaggaaaagttacagaatga 
caagagaggtgagcaaggaaatctgcag g gtggctgtttt g aactcact act g gttt attt a 
gaggaggtctaatcacttggacgaggagtatggcgatctaaatacttacaactttcatggtg 
ct agaaatcttaatcaggcaaatgtttttcacacttgttcttgtaaca^ 
tggcagaaaagacaggactgtggtggtcagcctgacctcggggtcagaagtcgj^sgactc 
aga ggtg gaatttttatgggcacccacacggtagtcgatccgctctcctctggaagacggt 
cacctttcgaggacctgaaaai hum i a ggtggcaccccccaacaagccggccgtccnn 

>1281 
>1282 
>1283 
>1284 

AGGAGAAACATAATGAAGAATCTGTGAGTAAAAAGAATATTCAGGCAACCtrTTCATC 

aaaaagacctagattgtcaacagcttcagtc^agattgtctgcatctgaaacc^ 
gaatacatgtagaactaagtg^aaaag^gaagctactcaaaagc^^ 
gaggt agagaco^tacaaagaagacgactgttttccacttccatt^ 
cttcatttctatttattgaac^gtcaaatttgtcttgttatttgtgagt 

>1285 

NNNAGGCAAACCTCCCI 1 1 1 1 1 MCI I CCTITTCTGCTGATCTTCAACAATTTACTTAA 

CTCAGTTTCXTrGACTTGTAAAATACAAATGATATGCCTATCT 

AATGAAATGACTTAAAGTATATGTATGCTATCTGGCATGTAGTAGCAGTTTAATAAATTATAGT 
TTACATACACCAGAAGAGTACTTGOCTCGC I I HUC I 1 G T ATGGTACTTTTT AATCTT ATT ATT 
AAACT AACCCCTGTGGTGGTGTGGCTACATTCTTTGAGTTTAGAAAACGAGATAAAGAATTG 
CTCATATCTTCCCAAATTGTGT AGTATAAAAAGAATGCTGTCCTGGTTGTTTTTTGTAGAATAT 
GGAAGT CCCTGC AGTA AGTAGGCAACATGCTACC^rrrCTATTCAACACCAGCACT AGAACAA 
GGCAAGTGGGAOCTTTGTCGACACATG^ 

TGAGACACCTGCATCCACACCCACAGCTCAGGTTAGCTGCAAAAGTT ACACATCTTCTCTAG 
GCCATACACCCACGTAGCATCTTTCTCTAATGGT 
GCGTCCGGCTTGAGCCCGGGAGGAGAAGGTTGCGGTGGGCCGAGATCGCCCCAC
TGCCCTCCAG(XTGGGCGACAGTCTCAAAAAAAAAAAAAAAAAAAAGCATACATCTCTG 
GTAAAGJATAAATCAGTGGGCTC^GATTTGCAGGTACAGCCTTGTTA^ 
CTTA<^TTCCTTGTCTTATGGGTTATAATTTTCACATCCTCTTC^GATACAATCTGAGAACTTG 
TTGACTACCTTTGTTACATGCAAAAATTTTCTATTTTAATATTGCATTA 

GTACATT(^AGTTCTTTATCTGAATACAAGCGTTTTGCTTTT ATTTCCAGTTTCTTGGACCAGA 

acaataaaatacataagacatcgtttctatatggtcatatactatatagaataaagaattgtta 
tgtaaattattaaatgagtatacagacctttacataaaaactaaggtacctcagtggaatctg 

CTACAGTGCTTCCCCCTOXTACCCCTCC^TTTTGTT^ 

C«TGTTCCATACTCAGGATAGCTGGTTAGCTAGCyVAAAGAATTAACATCTGTGAAGCCATATT 
CATTATCTTCCTTGTCACCAAGGCrGTTGACCTTAAATAAACATTAAGTTGATTTTGCACAACA 
CTGTATTTGTGTGTGTGCATGTGCCTGTTTTTGTGTGTGTATGTTTGTGGGAAATAATTATGT 
^Q^^C^C ATATATTC^T TTTTAATGCATTCTGTAACTTTT CT CGAGTGGTGGTCATTGAGG 
GTAGGGAAGArmATTTTTTAAGTTGTCGTrAAGGTATTTC^TTAGTGTTC^f^AnTnTAAAA 
CAGI 1 1 1 1 1 CCCAAATA CTTATG CCAG^TAAGAGCATTTTTGTAAATAATAAACTAGCACCGTT 
T GGGT AAATTTGC^TTATTTTTTGGACAGGTTCATTGTCATGTAAAAGAAATATCTCAAAATTr 
ATTTTACAmAGCAAAGGTGCAACATTTGTTTTTGGAGmGAGAGATATT^ 

GA J CA ^^ TAAATACTAA ^^ 



NNCACGCCATTCTCCTGCCTTiAGCCTCCCX^GTAGCTGGCACT ATAG CGCCCGCCA 
CCACGCCCAGCT AAM M IT GTATTTTTAGTAG AGATGGGGTTTCACCGTGGTCTCAATCTCC 
TGACCTAGTCATCCACCCGCCTCCGCCTCCCAAAGTGCT GGG ATTAGAAGCGTGAGCCACT 
ACGCCCGGCCAGGTGAGACTATCTTTTAACCTGCTAATGAATTATAATTAGCATATAA^ 
AGTGAAGATGACCAGGGGTC*CTTTamGCCATCTT6«TTTTGGTA^ 
TTTACCTCATGCTGTTTTATCAACAAGGTATGTGTGATCTGTACCTTGTGCAG 
I G ^7P G T G ^ GA C^ AG J^ CAGCCTT AT 
TTTACCCAGCCCTTG CTACATTCAAGAAGGAATCACTCT GGTTCTAATGCCTCCGACAGAAT 
GGTC^GATTCTCAGACTCTAAAGCAAAGAAGACTATGTTCAGTGACAGCAAGACTGTTGAAG 
AA ^I A ^ CTC ^ T GGCCTrGAGGAGCTATTATCAATAAAA^ 
TGTTGTGTTACAATGAAGTATATCATCACTGN 
>1289 
>1290 

NCCTTAAGCGTTGAAACCCGGTACT CAGTTGATACTCGAAGACGGGCCCCCAAAAG 

GGCTCTGTATTCCACAG<X^TGAATTCCATCCGGGAGAAGGCTCGACGGGGTGGTACCATG 

GAGACTGATGACXttTATGGGTTGCATCCCTGCCCGT^^ 

TTATATTGGCATC^TTGACATTCTACAGTCTTACAGGTTTGTTAAG^GTTGGA 

GAAAGCCCTGGTACATGACGGAGAGACTGTCTCAGT GCATCGCCCAGGCTTCTACGCTGAA 

CGGTTCX^GCGCTTCJVTGTGCAACACAGTATTTAAGAAGATTCCXTrGTAAGTGGTTTCTAC^ 

AATTGACTGCCTACTCCTGCCCAG^ 

CrCTGGTAGGGAGCTGCCAATGCCAGAGGCCTCTCCTTCCACCTACATCCX^TGAG 
TTTCTTGTCCTTTGTGTTAGTCTGTCAT AGTCATTTCGAAGTTGCT GTTTCCCCCTTGGCTCTT 
CCTGCATATGTGAATGAGGTGAGGACCATTTCTGGGGAATTGGGATTTGGA>^ 
AATAAATAATGTGTAGTTGTGGATGTAGGATCC^TGCCGCAGAGATAGGCAGAGCCTATGT 
>1291 

NCCTTAAGCGTTGAAACCCGGTACTCAGTTGATACT CGAAGACGG GCCCCCAAAAG 
GGCTCTGTATTCCACAGCC^TGAATTCCATCCGGGAGAAGGCTCGACGGGGTGGTACCATG 
GAGACTGATGACCATATGGGTTGCATCCCTGCXXXSTAATTGTAA^GGGGAAAGGCTTCTGCT 
TTATATTGGCATCATTGACATTCTACAGTCTT ACAGGTTTGTTAAGAAGTTGGAGCACTCTTG 
GAAAGCCCTGGTACATGACGGAGACACTGTCTCACTGCATCG . 
CGGTTCCAGCGCTTCATGTGCAACACAGTATTTAAGAAGAT^ 

AATTGACT GCCT ACTCCTGCCCAGTGGCTCCCTTACCCCAAGAGAACAGAGGGCAGGACAC 
CTCTGGTAGGGAGCTGCCAATGCCAGAG^CCTCTCCTTCCA 

i i iui I GTCCTTTGTGTTAGTCT GTC^TAGTCATTTCXy^GTTGCTGTTrCCCCCTTGGCTCTT 
^ctgcatatgtcaatgaggtgaggaccatttctggggaattgggatttgcaaaaaataaata

AAT^AATAATGTGTAGTTGTGGATGTAGGATCCATGCCGCAGAGATAGGCAGAGCCTATGT 

^ — COTACT GGGCCATTTT'GCTrGOAGOCTGOGACCG AGT"G 

GGAGTGGAGTGGAGCGGCTGTGGTTGCCGACTCTTTCCTCTTaX^ 

GG jr AA 7 rAGGCCATCGGGCCTG G^^ 

C J CG '^ GATG ^ TGATGTGGAGGAAA ^^ 

ca ^i c i ccaa !^ gaactcg 2 g ct^ 

ggtcaaggagacattctcatagacgttacactcg^tccagatcccactctcactctcatagg 
^acgatctcgaagtagatc^tatacaccagaata(^gcggcgaaggagccgaagcx;att 
ctccaatgtct aaccg gagaag acatactggcagcaggg caaatccag atcccaacacttg 

CCTT«^TGTTTGGCCT<>GTTTGTA(^CAACAGAGAGGGATCTTCGT 

ga J^I ggac C£ttgag™ 

A T^?77T G T GTA '^ ctatggaaagggcaaatg 

^pi^C^TG^GGTGGAGGTGGTGGCAGACGTC^AG^ 

CTTATTATAGTCXiATATAGATCACGATGAAGATCrCGTTCCTACAGCCCAAGACGCTATTGAT 

^^ GG >7^Z TAAGG ^ TTTT ^ CCTC ' ' m mm u m^ ttctga 
^ G T CAT T^ GAACA G TT AAACA ^ 

TAAAi 1 1 1 1 1 1 I MlGui 1 1 1 agct agataaacttctagctaatcaaataaggaaagaaactgtc 

TTTTTAAAGCI I CI 1 1 1 GTGTTAGATACTGTATTAGAGATCTGCATTTATCATGAGTTC^TTTTTT 
iii 1 1 1 AACTTTAI 1 1 1 IGGGAAAGTAACACATGAAGTAGTTCAGTCATGTCAGGTTTGTCTGG 

^^ AA ^ A ^7^ C X G J A J^ 

*CACTGTTCGT(^TTTACAACGTTGTGACTGGGC^ 
GGCTTGCAGAGTTOCCNNNN 
>1293 

^ TO ,~^ ACC T G in AAGGA ^^ 

G TG CTGTGAT AACTTTTGTATCACTTGGAGAATGCT CCT GAAATT ATGCAACACTACT AGAT A 
AC^CC^TO^AAGA^G^ 

irr[ A T G /? cA ^ T ^ G i^ 

TTAAAGAAGAAAGACAATTAAGAAATCTGACACTAATCAGAAGAAATTAGAAAACG^ 
ffff G ffI^ GAAA ^ GGAGA AA ^^ 

TAAACAAACCCAGATGCTGGTTC! I T T ATTAATAAAAAGAAAACAAACAGTGAACAAGGTAAT 

TAATAAACCGTAATGCATTAAGTTAAAAATACTAGCGAAAATATACAAGATGAGAAATGAGAA 

AGGAGAAATAAAACACAAATACAGGATATACT ACAAGAATAATAAGAGAATCTTATATGGCTA 

TCTATGGCAAAG^GAALAAGAAAAAATTAGAGTAAATTGATGTTTTC^ 

AG^NGTGGGANAAATTTGGATTAATTGA^ 

TTTCACCCTTGGAAAAGGACACCCNN 

>1294 

>1295

NNGGCCAC6GTGCTTCCGGGTTCTGGTGTAGGAGCGGCGTCTTCT 

G J G ^ GC J GAA J^ GG I ACG ^^ 

G^CCTGACTTCTAGCACTCCTGAGTTTGAGCCCT ATGAGK1AGTGGAACCTAGG GACCACCA 
AGGATCCAGATATAATTCTTTCCTCAGTAATAGAAAAAAGAATCTGGGATCTAAG 
TACTCATTTTCT AAGTGACTGGCCATGTC>TTGTCCCACTTATACCGGGATGGGGAAGGCCG 
C^TTGATGATGATGATGACGAGCGGGAGAACTTTGAGATCySCTGACTGGGATCT 
agttcaaccccaaccgacagcgccact GGCAGACX^AAGGAAGAAGCCACCTACGGGGTGT 
GGGCAGAGOG^^ 

ACTCT GCGCCAGTCAACTTCATCAGCGCAGGGCTCAAGAAAGGGG 
AGTTGGAAGATTCTGATGACGAAGAGAAACCTGTTAAGCAGGACGACTTTC^ 
GGACX^AAGGAAGCTAAAAACGGGTGGCAATTTTAAGCCCAGCCAGAAAGGTTTTGCAGGAG

gaaccaaatctttcatggacttcggcagctgggaaagacacacaaaaggaattggacagaa 

gcttcttcagaagatgggctacgtccctggacggggcctcgggaagaatgcacaaggtatc 

attaacccaattgaagccaagcagagaaagggaaaaggtgctgtgggggctitatggatccg 

agcgcaccactcagtccatgcaagacttccctgtggttgactcagaggaagaagctgaaga 

ggagtttcagaaggagctgagccagtggaggaaagacccaagtggaagcaagaagaagcc 

caaatactcttacaagaccgtggaagagttgaaggccaagggcaggattagcaagaagctc 

actgctccccagaagg^actttctcaagtcaaggtcatagacatgacaggccgggagcaga 

aggtctactacagctacagtcagatc^gcx^caagcacaacgttcccgatgatgggctgcc 

gctacagtcccaacagctgccacagtctggcaaagaggccaag^cccccggc^ 

gcccgagctggagcacaacctgcagctgctcatcgacctcacggagcaggagatcatcca 

gaatgaccggcagctacagtatgagcgggacatggtggtcaacctcttccacgagctggag 

aagatgaccgaggtcctggaccacxsaggagcgggtcatctcgaacctcagcaaggtcctg 

gagatggtggaggagtgcgaggggcggatgcagcc^gactgcagcaaccccctcaccctg 

gacgagtgtgcccgcatcttcgaaaccctgcaggacaagtactatgaggagtacaggatgt 

ccgaccgtgtggaccttgctgtggccatcgtctatccactcatgaaggagtacttcaagga 

gtgggatcccctcaaagactgcacttatggcaccgagatcatctctaagtggaaaagcctcc 

tagagaatgacgagctcttgtcocatggcggacaggacx!rrcrrcagcagatgcctttca 

GTTGATATGGGAAGTCTGGATGCCTTTTGTTCXjAAATATTGTC^(XX^GTGGC^GCCAAGGA 
ACTGTGACCCGATGGTGGACi r 1 1 IGGATAGTTGGGTGCACATTATTCCTGTGTGGATCTTA 

gataac^tactggaccaactcatotccccaagctgcaaaaggaggtggaaaactggaa 

GCTCACAG ACACTGTTCCCATCXiACTCTTGGATCCACCCATG GCTGCCCCTT ATGC AGGC AC 
GGCTGGAGCX*CTCTATTCXXXCATCCGTAGTAAGCTC^ 

CCCCAGCGACTCCTCTGCCAAGCTCATCCTCCAGCCCTGGAAGGATGTCTTCAC^ 

TCCTGGGAAGCATTCATGGTCAAAAACATAGTGCCCAAGCTGGGGATGTGTCTTGGTGAGC 

TAGTCATTAACCCCCACCAGCAGCACATGGATGCATTCTATTGGGTGATTGACTGGGAAGGG 

atgatctctgtctctagcctggtgggacttcttgaaaagcacttcttccccaagtggcttca 

ggtgctgtgctcttggctcagtaacagcu^aaattatgaggagatca<xaagtggtacctg 

ggttggaagtcgatgttctcagaccaagtgctggcacatccatctgtcaaggacaaatttaa 

tgaggcacttgatat(^tgaaccgggcx3gtgt(xrtccaacgt^ 

ggagcacgggagaacattgcctatctcacccacac»gagcggaggaag 

gaggccatgcaggagaggcgggaggctgagaacatggctcagaggggcattggcgtggc 

cgct agctctgtgcccatgaactttaaggacctcattgagacx^aggctgaggagcacaac 

attgtcttcatgcccgtcattgggaagcxsacacg^ 

gc^ttgtgatctac^tcgac^gggagtggtctttgtccagggcgagaagacgtgggtgcc 
cacctcactgcagagcctgatcgacatggccaagtgaactgtggcagct 
agagacttgcccct aagagggatgtactgtaaataaacagtatttttagatcaccttcaaaag 

GGTAACI ICMIACT(^GAGCTACT(X1ACAT(>^GCCAGGTCATGAGGG^GGATGTAAAA 

CGGGACAATGCCACGTAGGGCAGG^TCTTTATACCAAGTCCTCATGTGGGGTATCGTC^ 

CAAAACAGGACAGCCAAGTTGATGTTCTTTCCT GGCAAAGGAGTTCTCTGTAGTTCCTCCAT 

AGCAG(XrrrCCCTTTAGTCTCTGGTGGCAGCTTCTACCTCAG CCAACACT GAGTACCTCACA 

ACGAGTTCAGTCAGTAGCAGAAGGATCTTCTCTCTTGTTCCTGATGATT^ 

GTCCTGATAATCTGGTTCTTCCCGAAACTCCCAAATATCT ATGGAGAGCTGTTCTAGCTTTTG 

CACAGGG^ACCAGTGGA CAGAG GTATC^TTAAATATGTCCATGTATTGTGAAGTCTGAGGAA 

ACTCAAGCTCCTCCAGTCCI T TT AAAATCT GAGCAGAAAAATTCAAGT ATAAAAACTTCAGAT 

AGTCACAGT AATCCCCACTCrTCTGGGATAAATTTCCACTTTACTTGTC^ 

TTTAAAAAAATAACAAAT AGGGT ATTTTGATTT AAATG CTTTTG CACAATGAATTGGGCGCCC 

CAAGG ACAGTGGATATCATGGACTGTGATACT AAGAATAAAGGAAATAAGGAAAAAACTAAG 

AGATGCATTAAAGAGTCATAGCTTTCTG«AACACAGACCCATGGAGAGAGCTG 

TAGCG CCACTTAG CCCATGAGTTTCCTCCTACGCCTGTAAAGATAGATA^ 

TTCACACCUI 1 1 f CCATCCCAGCCCCTTTATTCAGATAGATACCTTTGCAATGTAGGGATAAT 

TTTTTTGCAGAATCCTTGCCAACACCCT^ 

>1296

>1297

>1298 

>1299 
>1300
>1301
>1302

NNNGCCATCGCGATCCTGCAGTATCCAGCGTCCCTTGG^AGACTCTGCCACTGTT 
GCCACAGCCACrrCTGCCGTCCCCAA(^CX^CCTCCTC>GGGC^ 

GATGCACGACAACGTGAAAGGCAAAGCTATGTCCTACCCTGTGACCAGTCAGCCCCAGTGC 

gccaccaccagctgctaccagacocagctcagtgactggcacacaggtctcacggactgct 



TCCGACGACTTTGGCGAGTGCTGCTGCGCGCCCT 
>1303 
>1304 
>1305

agggagtcgacccacgggtccgcccgaatcgcggagcggcgggtcccgtctoga 
caggtcttctctgttggttgaaatgtctatgattttatctgcctgagtcattcgtgtcagaga 
tgg actgc cactttctgcttctactgattatgaacaaaggacaggaatgcaggagtgcagaa 
agtattttaaaatgct ttcg aggaaacttgctcaacttc^tgatagatgta 
gagattataacattaattttattagctctctgggagtgagctacatgatgttgtgcactgaaa 
attacccaaatgttctcgccttctctttcctggatgagcttcagaaggagttcattactactt 

ATAAC^TGATGAAGACAAATACTGCTGTCAGACCATACTGTTTCATTGAATTTGATAACTrCAT

tcagaggaocaagcagogatataataatoocaggtctctttcaacaaagataaatctttctga 
catgcagacggaaatcaag ctgag gcctccttatcaaatttccatgtgcgaactggg^ 
ccaatggagtcacatcagcattttctgttg actgtaaaggtg ctggt aagatttcttctgctc 
accag<*actggaaccagcaactctgtcagggattgtaggatt^^^ 
gg agctc tgaatttaattcga ggctt tcatgctatagaaactctcctgcagagtgatggtg^ 
TGATTTTAATTACATCATTGCAI ! 1 1 ICC! I ggaacagcagcctgcctttaccagtgttattta 
CTrTGTCTACTACACCGGCTGGCGGAATGTCAAATC f 1 1 T 1 1 GACTTTT G GCTT AATCTGTCT A 
TGCAACATGTATCTCTATGAACTGCGCAACCTCTGGCAGCTTTTCTTTCATGTGA 
AGCATTTGTTACACTACAGATCTGGCTAAGGCAAGCCCAG 

TCT GACACCATCCTTCAGATCTATTGCCTTGGCTT CAGGGGGATAAGGAGGGAACATATCAT 
AACTGG ACTQTQ ATGAAGAAGCTGTTCCCCACAGAGOAGAAGCTCTQ C I MC I 1 1 CTCTCCA 

ACTTT0CTTTT1 I aaaatcagcatgatgtgcctgtgagcatggaagagtcctctcagaagaat 

GTTGGCC^TGAGACT ATCATTCAGAGGAGGAG GGGATTTCTCT CTTCAAGGCCATAACAGTG 
GAAGAACAGTCATATGCCATTGGAAGTCTT GGCCAGCAGTCCTG AATCXnTCCTGAAGAGTT 
CAGAAAATAGATGTGGT A 1 I GC I C I GAGGACCAGGCAGGAGGAACTCT ACAACCTGAGTTTG 
CCTTTGTGAGGCATTAGTATAGACCAAATAAAAAGCT GCAGAAATTGGAAAGTTTATGTTTTA 
AATAAATGA£TGTGATAAATTTCAQATTATTTGCACATTTATQGTAXrrACGAGTTTATAAAGTC 
CAAGATGGTGTQAAATTGG MCmilACITTT ATAT TTTT GCTTGAATCTTAACTCTGGAAAT 
CACCTGATGTAGAAGAAGACTGTGATGAGCTCGT CTGTGGAACATCACAAGTATCGAAAATA 
CAGTAATGQATG1 1 I C CI 1 1 C 1 AATCCACATTT ATTG 1 MC I I 1 1 GAAATCACGTCTAAAAAAT 
ATGACTCAGA CT ATAG CCG 1 1 CI 1 1 C CCAAACT TCAGTCTCTTTAGTACT ACTTGTATTATT7T 
CTT AATATTTA I CUT T AAATTTTAAAG TTT TTTTAAAAAAATGAATCCTGCTCCAAGTGGAAAA 
CCAGTATCAGTrTGCCATAAATAAAAGGTAANNNNNNNNNNNNN 
>1306 
>1307

ACTTCCATTTGAAAGCTT AAAAAAGTGAGTGT AACAACAAGCACTCAGGAAACTTT AG 
AATTAAGACATTAATAGCTT A I ! IUI IT I AAGTTTTAAAATTATAGAAAAATAGTAAGTGGCATG 
GTATTGTTTGACTTAATTAGAATGTGAAAACTCTAGTGCTTrrG 

CAAATATTTGAAATAAGACATTATTATTGGTCATTTTAAGATACTTCAT G I 1 1 MCI IAGATGCC 
TTTGAGTAATGGAGAGATGGGCGAGCCTCTCAGGCCTCAGGCAAATTATAGTX^AAATAG 
AGCCCCCTCAGGCATCTGTGGCAAGGCATCCCTCTAGAGAACAACT AATTGATTACTTGATG 
CTGAAAGTGGCCCACCAGCCTCCATATACACAGCCCCATTGTTCTOCTAGACAAG 
ACTGGCAAAAGAAGAGATTCGAGTGAGGGTTGAAAAGGATCCAGAACTT GGATTTAGCATAT 
CAGGTGGTGTCGGGGGTAGAGGAAACCCATTCAGACCTGATGATGATGGTATATTTGTAACA 
AGGGTACAACCTGAAGGACCAGCATCAAAATTACTGCAGCCAGGTGATAAAATTATTCAAGC 
T AATGGCTACAGTTT^ATAAATATTGAACATGGACAAGCAGTGTGCTTGCT AAAAACTTTCCA 
GAATACAGTTGAACTCATCATTGTACGAGAAGTTTOCTCATAAGCACrrGTGGACAAAAAAAGC 
GGGGAAGACAGCAAGATTTATGGGAAGATACTTACAGGGGAAATTAATATT7TGACTATTTTT
ATTGCATCACCAAGGATATTCTTAGGTGAAATTGGCATTAATATATATTTATTGTGAGTGTGG 
GGTAGCAGTTGGGCACCACTGCCTTC ATTCATGCT AAAGTGTGG ATAATTAT ACCTGCCATT 
TC7TTGGGACCATGGATGCGGGACGACGGGTANN 

>1308

>1309 
>1310
>1311 

NNNNNNNNGCT ACTGTGCATATCTTCCCAGCTCCCTTTTCACTGTTATCACAGTTGA 

ctatggtggtagttgttggcatttcctgaacatatcacactgattgtgggggcggggaggg 
agaatcaaagttggagaggtcatgagaaaccatgtacaaactaaaattatgggagaagaaac 
tatgagtgaaacgatgagaaaaacxtaatgcatgatgtagaactgagtggtgttaatagcag 
agcactggagggaagggccacaaaactcttcaccccaaggtctagaatcattctagaatcat 
cctacaagcctagttttcatgagattcagccctattttatttcttgctct^ 

ATT ACGAATTTCTGTGTG TTGCCAGCT GT AAT AGAATACCCTG GAATTTTATTT ACTTTT AATT 

ttgtttatttattt atgcttatgtgc<>tcttctc^tgaaaaagaggcagtatgttaaaagttt 
gagttcagattttctgatgtagataaataagctaaagaaggcagggtgaagtgtgatatatg 
a gaatt tcx^gagcagggtattcgtaacttgtaagtatttagtccaagttccctctccc^ 
catttt acactagaataagattgaaaggcgagatgtggtggct cacgcctgaaatcactttg 
ggaggccaaggcagtggattgcttgagcccggagnnn 

>1312 

CtcaAGAtooacTTTTAAATGGGTGTGATGTGTA I 1 1 1 1 1 1 1 AAGTAGCAGGTTTGATTTTA 
AAACAAAAAAGGTTAGTGAAGACTCTGTCTTTCAAAACATAAAAATCTGCGATAAAACCAATT 
ATTCCATACAGTGACT ACGGTGAGTTCTrGAGAAATGACACCCAGGTTGGCGATGTGTCTCAT 
GGTTGGCCTTCGATGGGGACAGTTCCAGGG GTGGTCCATCT COCCGATGTGGGTGATGAGT 
TTCTTCATCTCGCTTGTGTTAAGAGCAGTCCCAATCATCACCGACTTCCGGCAGGCTCTG 
GGCAAACATCTGCTT GACT CGGGAAGGCCGGCACATGACCCCAGGGCT GTCGCTCAGCAT 
GAAGATCAGTT CATC GACGTCCTGGGGT CCGA AGGTCCAGCTTTT ACATTTTCATCGAtaacaa 
aatcgaagCCATTCTTT CT AAATATTTC^GATTTTCTATCAGAACAGctlcaTTAACTGCTGTTAaon 
gaeagtdgoogtBctateagoctCT GCCOCT GGAGCACGGTGTG CTGCTG CAGCATCTCGAAGTT AT A 
CTTCTCGTCCGTGGCATGCTGGTCCACTATGAAGATATCCTCATTCAGTTTGGTTATTATAAA 
TCCCAGGTTAAACTGACGAATGATTTGCATTTCTGCAAACATCGTTTTACTT ATCTC 1 1 MC I I 
AGTTCATCTTCGGCTGCTTGATTTTCTCCAGGACAAATC<nTGCCCT AAACTT CCT GTAATTC 
TGTTCCCCTTCACTTT GCTGTGCTTCATGATGTAACTGCTTTATTCGTTTAGCT AAAGAACTCA 
GAAAAGTTCAGGGGCACAAC I MCI I ATTAATTTTCACAGCTACATCAACCTGAGAAGCTGAC 
AGQTTCTGAGTATTTACTAAc Btlp ac gMlB Ecagaal^^ 

ATGTGAQaTTAGttggctGAGgcaAaacCTGAAatttaCATccgGT ATCTTCCTGGTTTGAaTggcAGTCC 
ACATCT GAAAAAGAGTCGTCAGTTTCAGGCGCTTT CTCCTGAGAGTCCACATGTTOCTGCGA 
GCCCCTGTCCCCTGGGGTGCTG<>CCACAGACTCGCTGCTGCAGTGACTGCCCGTGTCTG G 
GATGCTGAACCCCTCAGAATCCACGGAAGTGCTGCCATGCCCC 

GCTtXrG TCCGT AGGGTCACTGGGTCCCTGACTGGAACTCACTGCC I CI 1 1 C I GAGGTCTCA 
GGACGCC1 1 1 G I CAGAGATGG CATCTGAAGTGCTAGAAGAOGACAT ACCCC 1 1 1 ICI GTCCT 
AGAGGGC 1 OC 1 1 C I I GGTTCTGGAGTCTTTGGGGTGTGAGGGTT G ITC I C I GTTGTGTGACG 
AAGAGAAAAGGCCTCTGGCAGTCTGGAAATGGACACG I CCC I 1 1 I I ICI rCTCCAGTOCTTA 
ATGAAGGGGATTGATCCTGCTTrTCTACGATGGGCTTTTCCAAAT< 




_ ateCCAGCTActoQaoaGGCTBaflgcAGGAGAATC 
GCTToaaoceawaogtogaagglACAGTAAGCCAAGATTGTGCCA 

GAGCGAGACTCTGTCT AAAAAAAAT AAATAAATAAT AGAGGTGAATGTCTGCATT AGGATCAA 
GACAAGAAGAAGAGAGACAATCACTTT GGAATTCTGAGACTA£XrrCCAAGAATCATGCACGG 
AAGGATGTCAGCGATTTAACGAGGGCTAGGGATCAAAAAGGAAAAAATAGAGTCAGTGGACA 
AGTAGAAGAGTCTCCTGAAAAATATCCGTATTTGAAAAGGCAGCAGGAGTTGATA^ 
TAACTAAAAAAGTAGAAGACACTGTTAAATTTGAATCTGGATCCTATATAGC I I C I ICI CI GG 
GATCTACTGAGGAGTGAAATCT AAATGAAGATTTAGCTTAGAAAGCATGAAGATAGTATGTTC 
CAATTTTAAATAAAAATTATATTGTCTGAAAGACAATACAATTTTAGTACTGGGGGAAAAA
CTAGATTGTTATTTGAAGAAQAAAACATCATGGTTCrrCTCCAGGTTTAAAATGGGTGCT 
TGT AGGCTGATTTTCCCGGCATTTGATCTT AACTTGAAG gccatctg cccagcccagcaatc 
AAAGGGCT GCTAAGC TGTocTTGGTAGCCTCTACTGGGCAAAGCTGTGCT ATAACTCTGAAG 
AAAGGGAGArCI \ \ T AATGTT GATTACTGAAAGGCTTAGAAAGATGTCTTGAAAGAGGACTAA 
ACACAAGAAAAATAAATTTTT AG CATTTGAGACTT AGGTG TTT AAG AG TGCTT AGCT CTT AACT 
ATTAACTCTTCTGAGGGGAGAAAGCTCX^TGAAACAGGTTTCTGTTT 

TAGGGCAAGAACATCTACTCAACACTAACAAATGCTGGAAGCAGGCATCTGATTrCCTTTGT 
6CTAAGGGCTGAAATCAATACTGGCCTCTGGCTCGCTACATTCCTATAGAACCAGCTGCAGG 

tagggaggaagagaacaaaagttgttttggcggcactcagtacaataaaccgaagacaaaaa 
agacagagt a aagtct attaacagatggtaagtagtagatgatactggggttatttcggtaaa 
agaatatcagatttttatgaatgtcttaaatattacctataaccacagcagccacatgcccca 
g gtttc tcagaaccaatctcagtaccaaatattgtcat g i ic i 1 1 gtccagatact aggttttt 
gui it i aa aaataaggtog gtgc agttracatgcccactgctactgggaggcatacactggg 
ccacttggataggct gtcattttgtgcaaatcaagag acxzxxatatcagctcct caggagctc
tggcct atttttgtcct attcttgtctaccctaattgatatccttggct acatttatgataaat 
gin cctgaocatcc1 1 1cc i > ctttt agat accgatcagatt aag t acaagcaattgtcaatt 
atccagcacttgatct aggcctttaattcctt cagagagagaaggaatatagagataaaactc 
ccacccacttgaggtatggtttatgagctcttgcccaatcacattaccattcagaaggatttt 

CCTGGCTGACTT AAACCAAAAGCTTGATTG TTCCAG TCT GTGATGT CACTGCAGGAACAGCT 
GTCACCTCAATTATGCAGTATAA! T 1 1 IGAC^TAATTTATATAAGOACACAGAATTCAATCAGC 
AGCTGAGCAGGGAAGCAATGTT AAGAAATGCTATGAATGGGTATATGATTTGCATATTATGTG 

gctcatctgtggactggagcaatccatccattcccctcattactctgaat 

TAGATGC I OCT GGCCAGACAGCTGT ATGTrrTAAAAATCAAAAACAACAAGAAAAGCAGCCT 

catggctggtgcgct ct agaatt aaggct AT ATTAAGCATTTATTCTTA 1 1 iccic agctttg 

G GGCTG GGACAAGGATOCTCAAAACCCTGCTATTTCCTTACTTTr AATTGTGATTCTCTAGCC 
CTCCTCAACT ACAAAAAGAAAAGGATAAAGTAGGGCATTTCnTGGGGGvACAAAGGCAGAAG 
GATATGATCAGAGAAATTTCTCiAATAGCAACACAAGAAGGAAACCAAAACGA 
**^TAAGGAAGGAAAAGGGGTGTGTCAACATGCTAATGCCAGGXK3CCCTGT CACTTTCCCA 
ogggloAiX»GTCTn(>ggggacCATGCCCTTTATTTT^ 
CTcaoctsCATTGCCCaGGCT c^""" - — 




>1313 
>1314

*PP*CCG GNATT A CACAAATCCTCAACAATCGTCCACATATTTTCT G 
^JLS^^F 0001 ^ 1 * WCAGCAOCTG^TGT ACCTCCTTAGAAACCT AGACTCCACAGAACA 
CTGTTTG ACAAOCACTGCAGTAGAACATAATATA^ I ICI I I C I I 

CATTTTT ACATGT TGT AGAAT AACATGCAT AATCAAAGCTAATAATACT G I U 1 1 M C 1 1 1 ACTCT 
TTTAI I TGCCTCTAAAGACATCCACACATAGTGGTGAACTGATTm 

AG^ATTGAAAAATATTAATAATTGTAATTACTAAAAGTA \ I ICIC I 1 1 G CGATTCTCTT ATCTG 
TGTTTCCAGACCGG 1 1 GGAGGGTGACAGATCAGAAGGCTCTGGTCAAGAGAATG 
GGATGA GGAAT AAT AAA C I C 1 1 1 1 1 GGCAAGCACTT AAATGTTCTGAAATTTGTATAAGACATT 
TATTATMTrrTTTCmACAGAGCr^ 

CTTTTTTTGGGATAACCT AACATTGGTTTGGAAT GATTGTGTGCATGAATTTGGGAGATTGT A 
TAAAACAAAACT AGCAGMTGTTTTTT AAAACTTTTTGCCGTCT ATGAGGAGTGCTAGAAAAT 
GCAAAGTGCAATATTTTOXnAAOTTCAAATGTG^ 

TTTCATCAT AGTGAAAA » G I IGUT I CAAATAAATTTCTACACTTGCCATTTGGAT G I TTOTT 6C ' 

TTrcTAATTAAAGAAACTGGNHNMNNNNNNNNN 

>1315 

>1316

'• 000 ****GCT AAGCAAAAGATTTT AT AATGAAGjAAAGATGAGT AACT AGGCTCTCA 
ATCTTC^TGACTGGCAGGCTGGGCCAGGAGAGAAGATGGAACATGCG^CCAGGCAGGAA
TTCACATGACCCTAAAGCTGCCGGTGGAGCTCTCAAAAG TCCCT AGATCCACTGOGTCCGTC 
TCTAGGTGGAGCCTGGACAAGGGCATCCTTCACAC^CTTCCCAGGGGCTCATATAC 
AGATGAGAACTCTGCTGCCXXiGAAGGACTGACATTCTT CGGATATMrTAGAAACCCTTCCTG 
AAGCTCACTTTTGGATAGATATGGGCTGTACGAAAGACACTTATTATTCTAACATGCATCAAG 
TAAAGTAAAACAAGGAGAGAGGCTGCGGTGTGTGGGTAGGGGATGCAGGAGAAGCTGTGT 
AAGGTAGTGGACAGCTGTGTGGCTCTGGGGATGAGACAGACT AGACCAGGCAAGTGCTTCA 
GGCAGGTGCCCCGTCGGGAGG^CTCTGGAGTT ACTCATC1TGCAGCCT CGGGCT ACTCAC 
CATC AGGGAGCCCCGGGAGCCACCTGTG 1 1 II 1 1 GTTTTGTTTTTATTTATTTAT AATTTAT AT 
TTTTGAGACGGAGTCTTGCTCXGTCACCAGGCTGCAGTGCAGTGGCACCATC^ 
GCAACCT CCACCTtXrTGGGTTCAAGCGATTCTCCTCCCTCAGCTT ANNTilNNN 
>1317 

>1318 

ACTACTTTTG1 MINIM rTTTTTGGATCAATAAGTTTATTTATGTTGCATCACACAAT 
AGTTACACAAGCTTTT AAAAACACATGCACACGTGTTTATTATAGCATACATACAAACACACAT 
ACAACTTAATTTTTACAAGCACATACAAGC ACATT CAAACTTTTAAACAAC 
TCAACATACATACAATACTTACAGCTTACGTTTmGCGTTTAGCAGTTGTA^ 
GGCGGTGGGTGTAGCI nTCGTTI I CC I AATGTAAATTTTGGTTTGGCCTTCAATCCTGCTTG 
GAGTAAAAATTTGCGTCCT AAAGGAAACTGATCTAGGTCTGCAGAAAACTTTTCCTTT AAATA 
TACTGCCCAAAAAGAGTATTTTTTAAGGGGAN 
>1319 
>1320
>1321 
>1322 

NNNGTGAAAAAGTT G I 1 I C I I I I AAAAATATTTTAATCCAAAGTATGOCTCCCT C CGG 
GAATTTAGGTGCCCCAGGGCCTTCAGCGCCCCCAAGATCACACACACACACGCAGACAAGT 
TCTCTCTCTCACACACACACAGACACACACAGCTCCCySAA 

AACCATGTAATTATAAAGG^GCCCTACCTCTCAAGAGTGAAGTAAGGGTGAGAAAGACTATG 
TCATTTCAG^CCTGAAGAAGCCAGAGACAGGAGAATGTGTGTCACTTAAAGAAAAATAGGGAT 
TAGCAACAATG1TTGGTTGAAA AGCCA GAAATAATAAAAGATAGTAAAATT1TGATTACCAAA 
GGGGAAAACAAAAA GCAAAA CACTTTCWAGATTTCTGT ACAGAGCTTCTTCCTATTAAGTGCC 
TAAACT AT AGGCAAACTTTGGT GTTCCCACT AAAACACAAGAGCCTCAC^GAATT AGGAAAAA 

aaaaatcaaaagaaacaaggaaact gagaatggaagttagtgtaaatctctgcatttgggga 

GTTGTCATTAACTCCAGAGCCCAGCATAGTTTCCATGGAGCCCTGAAG 
TGCCACAAAGAGTTTCGTTCCAGACGAGTCGTAGCAGTGGGT gtaaacagcattggggaag 
AAGTCAATGTCTGAAAAGTAATTCXTrCCAGGTTTCATCATGATTCTACG^ 
ACT ACAATTAGCACXTCTAGCCATGGGGCAG 

>1323

>1324 

ACGATAGGAGCTGGTTGTt>ACTGATTTAATCAGTGGGT ATTTCAGTATGAGGCATA 
GTTGGTGACTGAGCTTTCATGTGGTAATCTGAGGT GAATATTGACAT AGAATT AGAATT AT AA 
CTCACATAAATTCACACAAAAACATTmACATTGCCT 

AATAAAGTATATGGGGGCCACCTTrCTCCAGGGATTTATAGGGT AATTAGAOGAACTCTTCC 
AAT AATT AT AAAAAT AAGAACTGAAAT AGCAAGT GCAT ATTCTGAATT ACAAAAATCATTCAAA 
GT AACAAAAAGCCT ATATCTTTGATCCCAAT AACCT ATTTTA A 1 1 I ICI HI I AAAAT AAAAGTG 
TTAGAGGCAAAACAAAAGTSCCATAACTAAACCAAGT ACAGACA AAGGAAATAGGAATGAGA 
AACAGTGTATGTGTGTTTGTATGTGTGGGCATGCCTATAGGT atagttgtgtgtgaattaaag 
tgaacacactcgctaattgaat aaaat acagtaagtgtttgtgtttgt aaaaaaaatataaata 

ANNNN 
>1325

>1326

GCGTCCGGCT ATT ATAGT CATTTCACA^TGATGAAACT AAGATTCAGAGCAGCTGA 
TCTTGTGAGGCATCTGGAATTGGAACTCAGATTTGTCGAACTCTAGAACTA^ 
ATGTCTTGCAAT ATATTCATTT ACAAAACACTTCATTATTTATAAAGAATTTA^ AACAGTTTAT 
CTT ATTTATACCCATACATCTGCTACTTrGGGAGGOCCTTT ACATA 
GCC^TATGACCAAATTACTTTTATTTATAATTTTrGATTTATAT^
CATCTGAAGGAATTrACAATQAAAGATACCTATGCAATAACATTTAGGATAATCTT^ 
TGGAAAAATAAGAATTGAGGAAAAAAAGTGTATCTTTCAAGTAG^TGCAAAGCATTATAATGA 
£I GAGAG I[ GTA TCTAACnXX^GTOT 

GTAACCTCTAACATTTGGTAAAAG<iAAGTATACTGGTCTGTTAGCAGAGACAAACTTTTTTTA 

GAATTGAAGTCTGAAACAAACAAAAGCAATTCAATGTCAATAGACATTAAGCAACATAATAGA 

CAAACATCTCCTAAGGGAACATTTGTTACAGCTGCTCCTTCCCTGAACTGTGCTTTGGAAGAT 

AAGCTCTGTCCTGAGTCCAAACCAAGCCCTTCC AAGAG AGAACAAA GG T CA G A G ATGTTGAA 

GAJTCCAGCAAATTTCTCCrCTTAmCTACCAAGCCTTTGTGAACATTGCT 

CTGTACjrCTaX^CAGGGACCTAGAACAATGGAATGTCAGTCAGTCTCT^ 

TTTCTI I AAAATTCAATTAAGGTACTTCTCCCTCAGGG ACGT AG AACAATGG AATGTCAGTCA 

GTCTCTGTA GTTAAAAC 1 1 1 T T CTTTAAAATTCAATTAAGTT AC ACCAG AATTT ACAGG C AA n. A 

1 1 1 1 1 1 1 i i i CATTGCTCCCATAAGCAAATTTGTTTTAAAATAATTGTAAATGAGGTATATACTT 

AGTTCTTGGTTAAAAAATATATTGCTTTGTTAAGTATTAAAGATTATTTGTAAGTCATTGTATTA 

A T AATACTAATAAAA '^ ATGAAGGG ^A^^ 

GCCAAATCT AG GTCACTATCTGTrTTTGTAAATAAAATTTTATAATAGTACACAGCCACACTCA 

TTCATTTATTTTCTGTGGTTGCTTTCAAGCTACAATTGTAGAGNNNNN 

>1327 

TTTCCAGGGAGTAAAAAGGCCTTTTGGGTGGCACATAAATCTACTGCGGTGCTG CCT 
GCTGTGGCGCGGGACAGCTGTTGAATTGTTGTAATTGGCTGACTCCCTGAAGTGGTGGTAT 
TTTCTGT AGGGGTTGTTGTCCTTGAAAACCCTGAGGAACAAACAGTTTAACTGGGAATGCCC 
CAGI I IUI IGGGGTCCCTGAGGCTGGCCCCTCTTCCTGmCCAGACAATGGTTGGCCATC 
TTTATCAA ATTTAGAATGA CATTGATTGGCCCAATGTTTTGCTTTT CCACATTTTGGACACAGG 
™^I£2hLLL!. f ATTTTTTGCrrGTTAATAGCTTGATTrATTATATTCTGTTTATTTAAGCCTG 
GGC A A 5J CCTmGAGATGAGGGA ^GAC^^ 

CTCCAATCCCATCACAAGCCTTCACAT ATTCTGT AATT ACATCAACT CCT GCTGGAACTTTTC 
CIH I AATGG CTTTATGGCCGACTGAX^TTCTGGATTTCCATTTTCATAGGCCATTAATTCTAC 
AATAAUI 1 1 1 CGGGCATTGTCATCTGTAATAGACTrrrGAGCAGCATCTTGTAATCTTGCCAC 
AAAGTCAGCATATGGCTCTTTAGAGCCTTGTCTAATTGAATTAATAGGGAAAGCTG 

ST c P™TirT CGCCAGGGccTQAGGcAM 

CATCACTGATTGTTGGTTAATGGTGCTC^AATTTGGACCTGTTCCT AACAATTGGTCTGCQTC 

TATATTAACAGTGGGCTTAGTAGCCTGATTTTTTCX3TACCTGTTCTTGTACTC 

CCAGGTTTTAAACTGTAGATACTGAGAGGATGAAAGGG^AGATTTGGCCAAAATTTCCCAGT 

CATAAGGAGT AAGTCT ATTTCCATG AGCAATGGAAT CT AATAATGTTCTT AT ATAAG GGGAGT 

TGGATGCATATTGTTrAACTCCTTCCTTTATATCTTTT AACATTTTCATGGTGAAAGGTTCACA 

TCTAGTCTCAGCTCGGGCAGGCXJCTCCTACTTGA 

AAATTACCAGGAACCOOCATGCCTCAAGATCGCCCTOTT^ 

TGCAGTGCACCACCCTGTCCACTACGTGATGCTGTAGGATTAAGTCTCACAGTGGGCGGCT 

GTATGGTGCCGTGCTATTTGGCACAGGACAGACCGCTTGAGGTCTGTATTGAACCTCT 
GAGGCCGATACTGAAGCTCGGTTGGCAGCCGGTATTGATAAACTAC 
*ttttctaccggctgatattgtggatactgtatct GAGTTGGCATTGCCGGGATAG AGACTC 
TGTCCCTTTCT ACTTGAI 1 1 ICICI I GGGGTTTGTGCTTGTCTAACCTGCGTTTGAGGTTGTA 

A ™j A £ A G9j^ie^^ 

TGJODOCAATGAT^ 

TAATTGA GTGT AGT CAACATTTTGCGTTGACTGAGCCATTACAGACTCTGCTACATATTTACA 

£3£I° AA i?IJ3 c i^ 

GCTTTTAGGGGCATCAGAAACTGAAACAATATCTTCT CCTGTTTGAAATGGTTCTAAAGTTGC 
TTTA ATAATGG CCCA ATCATTC CATACTGTAAGTGGGATGATTTTACCTTOX^ 
TTTAATTCTTTGGCAAI 1 1 1 1 1 UCCAATCTTTTAGATCTAAAGTTCCCTGTTCTGGAAACCATG 

6 £C A 5^™TL CTA7 ^^ 
£ c I7 c II AAAAQ ^;ri^ 

£2S5S2£J3S^^ 

CCCCCACGCCGGGCGCCAGATGAAGGGffiTGGCCTGCCCCTCCACACCTGTGGGTATTTCT 






A T CAG I GG ^ GGAG ^GAC<>GCA 

GAGTrCCCTCAGTrrTTATTGATTATGATTTTCATTATTTCAN 
>1328 

N NAGTG CT GCTTGCCGATGCTCATCT CCTCTCCTCG AAAG CT CCTGG AAATCATCAG 
GAACTGGTAGAATGCTCCATAGGGGAAGAGATCTTCGTCTGTTTGGTCCATTGCTGTCTTAC 
^^ZISLS^ 0 AGGTGACAG CGTGCTG GCAGTCCTTC^^CiCCCTC^CTT'GCTCTCGGCACCTC 
CTCTGCCTGGGCTCCCACTTTGG CGGCACTT G AGGAGCCCTTCAGCCCACCGCTGCACTGT 
GGGAGCCCCTTTCTGGGCTGGCOAAGGCCAGA<Kn'GGCACCCTCAGCTTGGAGGGAAGTG 
TGGAAGGAGTGGCGTGAGCGGGAACCGGGGCTGC<3TGTGGAGCTTGCGGGCCAGATGGA 
GTTCCGGGTGGGCGTG G GCTTG G CG GGCCCCGCACTGGGAGCAGCCGGCCGGCCCTGTC 



GCACCGAATGGAAATCCATGAGCAGCACGTAGGCAAACAGCAGGAGGAAGGCGATGTAGA 

gIdagc^cttgtgc^ 

ggtataataaacagacacaggataatcttccagtrcttggtgtctcgggaaatctctccatac 

C ^^9JI AGAAAGAAAA TTCTGGACCC<^^ 

CCACCGCt^CTCXiAGACAGTTGCrrCCACCCCAAGCTTCACAGGAATAGAC<>GC^ 
TTCTGCCAAGTCTTCATCGCTGCTGTAACACTCAGTGAACAGCTCAACAGCCCGGGTCTCGT 

^I GG 9j CGG i^ CAATGTCTGTGCCACATGGC>GATG ^ 

TGTGGACTCACC^CAGCCCGGGTCAGGTACTCATTAGCCAGCTCCTC»GAC«CAGCATTG 
ATGTCGTTCTTCACTTTGGCCAGAGTCTTCAGAAG CTTGCTGGCTCCCAGGGCTGCCAGAGT 
G CAGCCCCTGGTCT GCTCCCAAAT GACTTTG GAGAGTTCCTT CTT ATTCTGAAG AATGGCCC 
AGATGAAGAGAGCTTGCAGGGGGTGC«^GTAATAGGAGACAC<3TCGTGGAGTTCTATGTC 
CATCTCGTCCCGGCCATTTCTGTCTTCXrrTCCGGAAGCCTCr^ 

TCCAGACAAACGTGAG GAG G GCAT CATT AT AG G AATTCTT GGCGATCTTGCAGATTCCGC^AC 
ACAAGCGTGCTGAAGTGGTTGGAGAAGAGTTCAGTGAGGACATCATGGGTGAGAAACTTCC 
GTAGGTTCAAG<X^TrCTCCAGAAAGAGGC«GACAAACTTGGGTCTGTCCTTTATGAGAGCC 
GT AAACATGACTTCTTGAAG GTCAGCAGACTT G A G AACCACCAGCATATG AGAAGAGGCTTT 
GCCGTCAGTCACATCTTTGCTCTCAACCTTGATTCTACCATTTTC^ 

TATTGTGTCTCTGAGCCTCGGTTrGCTCTTCTCCCATCGGCGGTCATTGGTGAAAATCTCAT 

c^qctaagtccagctggtt^ 

ttgtcttgctcactggtgctgaa<k5ctttgtatagagcgtaggagatggcattgctcac^ 

AGCCATTTGATCCAACTCrrCAGTXTrCXn'CXrrC^GGCAGCCGGG^ 
AGOX*CeAGCTTCTCCTTGAGGGCAGAAGATGTCAGGGCAT^ 
AGC^TCACATCAGCQATCTCGC*^^ 
TGGAGGTATTGATGGCTTTCyAAGTCTCTTTrCCACCT 

^ c J7 G y c 3 A ^ c ^ AG ^ GGAATC ^ GAAtAGTG ^^ 

CGGAGCTTTGCrTCGACAGTGGGATGTCCATGACAGCCATTGTCCAC 

TGTGGTTGTTGTCX^AGGATATACAGTGGATCTCrrTGTGAAGTGATCCATAAGGTACTGGGCT 

AA>AAATAGCC<rrCAGGATC<aCAATTCCTGATX3AGGGTGTCXXX5GTTGG^ 

AAGCTGCTATGC<^TGGCCACAATATTCTCC^^ 

CCAOTCCCCGAGGTACTTCATC^GGCCAT^ 

TTTGGACTGC<SCGATGTAGATGAGCC<3^ 

GAAGI 1 01 1 GGCGCCCCCGGTCACAGAAATGACCAGGTrGGGTGTTTTCAGGTGCCAGTGC 

^ g J£ a ^ gg ! gg ; aa ^^ 

CTTTCJTCCCCAGTGTCTCAAACTGAATATCCCCAAA 

TOTTTCTTGTAGTTCCATTTCTCACTrrGGITGATCTGGGTGC^ 

GJ^TAGCCACACTTGCACACATTCTCCGTGGCCTTGGAATCTTTC 

TCGi i iui I AAAATTTGCTTGAATAAAATTCACCAAG I CGC 1 1 1 G ACTGT AAGACAAGTCTGTG 
CTCCGAGACGCGCT G GAG T ACAG G GTCCGGGTGCT GTCCAGAGTGTCATTCCTTCTGTTCC 
TCATGCTGAG CCTGG<rrGCCCGAAAGGACATL I 1 UUCI 1 1 ACTTCTGCAGG CTCCACGTCT 
TCTGTCCACCTGCATCCTCCAACCAN
>1329

NNGAAAGTGACCGTGTGCTTCGGACGGACCCGGGTGGTCGTGCCGTGCGGGGAC 
GGCCACATGAAAGTTTTCAGCCTCATCCAGC^GGCGGTGACXX^CTACCGGAAGGCCATCG
CCAAGGATCCAAACTACTGGATACAGGTGCATCGCTTGGAACATGGAGATGGAGGAATACT 
AGACCTTGATGACATTCmGTGATGTAGCAGACGATAAAGACAGACTGGTAGCAGTGTTTG 
ATGAGCAGGATCX^CATCACGGAGGTGATGGCACCAGTGCCAGTTCX^CGGGTACCCAGA 
GCCCAGAG ATATTTGGTAGTGAGCTTGGCACCAACAATGTCT CAGCCTTT CAGCCTTACCAA 
GCAACAAGTGAAATTGAG GTCACACCTTCAG T CCTTCGAG CAAAT ATG CCTCTTCATGTTCG 
ACGCAGTAGTGACCCAGCTCTAATTGGCCTCTCCACTTCTGTCAGTGATAGTAA7TTTTCCTC 
TGAAGAGCCTTCAAGGAAAAATCCCACAC^CTGGTCAACAACAGCTGGCTTCCTCAAGCAG 

aacactgctgggagtc«taaaacctgcgacaggaaggatgaggatgggacagaagaggata 

acagtostgttgaacctgttggacatgctgacacgggtttggagcatatacccaact^ 

ctgg atgata tggtaaagctc^agaagtcxxx^cgatggagggcctctgggaatccatg 

tagtgcctrrcagtgcrcgaggcggcagaaccctggggttattagtaaaacgattggagaa 

aggtggtaaagctgaacatgaaaatctttttcgtgagaatgattgc^ttgtcaggattaatga 

tggcgaccttcgaaat ag aagatttgaacaagcacaacatat gtttcg cc aag ccatgcg ta 

CACCCATCATTTGGTTOCATGTGGTTCCTGCAGCAAATAAAGAGCAGTATGAACAACTATCC 
CAAAGTGAGAAGAACAATTACT ATTCAAGCC<STTTTAGCCCTGACAGCXIAGTATATTGAGAAC 
AGGAGTGTGAACAGTGCAGGGCTTCACACGGTGCAGAGAGCACCCCGACTGAACC^ 



CATCCGCTCeAGCCTCGGCACCTCAGAATGTATTTAGTA^ 

ACCAAAAAAAT AGGCAAGAGGCTTAATATCCAGCTTAAG AAAG GTACAGAAGGTTTGGGATT 

CAGCATCACTTCCAXSAGATGTAACAATAGGTGGCTCAGCTCCAATCTATGTGA 

CCCCCG<^GGGCGGCCATTCAX>GATGGCCGACTrAAGGCAGG^GACAGACTT ATAGAGGT 

AAATGGAGT AG^TTT AGTGGGCAAATCCCAAG AGGAAG TTGTTTCG CT G TTG AGAA GCACCA 

AG J A J GGAAG /" AAG ^ G ^ GAG ^CTTCTGGTCTTrCGCCAG GAAG ACGCCTTCCACCCAAGGGA 

ACTGAAAGCAGAAGATGAGGATATTGTTCTT ACACCTGATGGCACXiAGGGAATTTCTGACAT 

TTGAAGTCCCACTTAATGATTCAGGATCTGCAGGCCTTGGTGTCAGTGTCAAAGGTAACCGG 

TCAAAAGAGAACCACG(^GATTTGGGAATCTTTGTC^AGTCCATTATTAATG 

ATCT AAAGATGGAAGGCTTCGGGTGAATGATCAACT GAT AGC AGT AAATG G AGAATCCCTGT 

TGGGCAAGACAAACCAAGATGCCATGGAAACCCTAAGAAGGTCT ATGTCTACTGAAGGCAAT 

AAACGAGGAATGATCCAGCTTA1TGTTGCAAGGAGAAT AAG CAAGTGCAATGAGCTGAAGTC 

ACCTGGGAGCCCXXXTGGACCTGAGCTGCCCATTGAAAC^ 

AGAATTTCCC^TTCCCTCTACAGTGGGATTGAG<5GGCTTGATG 

TGCC<rrCAGTAGGATAATGGGTAAATA<X^GCTGT(XXXTACAGTGAATATGCCCCAAGATG 

ACACTGTCATTATAGAAGATGACAGGTTGCCAGTGCTT(XrTCCACATCTCTCTGA 

TCTTCCAGCTCCCATGATGATGTGGGGTTTGTGA 

CTGCAATCAGTGATTCAGCCGACTGCTCTTTGAGTCCAGATGTTGATCC^ 

AACGAGAAG GATTTGGACGTCAGATAGCTGACGaGACT AAACTCAATACAGTGGATGACCA 

GAAAGCAGGTTCTCCCAGCAGAGATGTGGGTCCTTCCCTGGGTCTGA 

ttggagagtctgcagaccgcagttgccx^ggtgactttgaatggg GATATTCCTTTCCATCG 

TCCACGG CGGCGGATAATCAGAGGCAGG GGATGCAATGAGAGCTTCAGAGCTGCCATCGA 

caaatctt atgataaacccccggtagatgatgatc^tgaatx^tggagacc^ 
acmcagaagaaagttcaagatcagqgagagagtctgtatcca 

ccact ct ctggagagacaaatgaatggaaaccaagagaaai^tgataagactgat agaaaaa 

ag^taaaactggaaaagaaaagaagaaagatagagataaggagaaggataaaatgaaagcc 

aagaagggaatgctgaagggctrgggagacatgttcaggtttggcaaacatcgaaaagatg 

ACAAGATTGAGAAAACGGGTAAAATAAAAATACAGGAATCCTTT acatcagaagaggag agg 

ATACGAATGAAGCAGGAGCAGGAGAGGATTCAAGCCAAAACTCGAGAATT^ 

AAGCTCXMGAGCGTGACTATGCTGAAATTCAAGATTT^^ 

GAGTTAATGTATG6GGGAG 1 1 1 C 1 1 C 1 1 ATGAAGGTrCCATGGCTCTCAACGCTAGACCTCA 
GAGGGG ACGAGAAGGG£>TATGATGGATGCTTTGTATGCCCAACT CAAGAAGCCGCGGAAT 

tccaaaccctcacctgtagacagtaacagatcaactccta 

TCTGAGGCAAGAAi 1 1 oagcaaccaaagcaagatgaagatgtagaagatcgtcggcggacc 



GTGTCOGTGOAGGTGCAGATGCAGCiGGCAGCGOCAGGAGGAGCGCG 
GGCCCAGCGOCAGTACAGCTCTCTGCCTCGGCAAAGCAGGAAAAATGC^ 
CAGGACTCTTGGGAGCAGAACTACTCCCCTGGGGAAGGCTT 
CCAGGTACTCCAGCTACCAAGGCTCCAGGAACGGCTACCTGG<3AGGACATGGCTTCAACGC
CAGGGTCATGCTGG AAACTCAGGAGCTCCTTCG CCAGGAAC AGAGGCGG AAGGAGCAGCA 
GATGAAGAAGCAGCCTCCTTCCGAGGGGCCCAGCAACTATGACTCGTATAAGAAAGTCCAG 



AG<3TTGCGAGGCTGAACAGACTTCAGACTCCTGAGAAAGGGAGGCCCTTCrrA 

AC^CAA ATAACCG ATGCTTCATGTCGCGCAATAAAAGAGATTTTCCTATGAAGACTrGTATTT 

TGGGAGTTTTTTTAAAACCTC^ATGGTACTATGGAGTATTTCTGTTGTTGGTATCAGTGCCTT 

TAAGCGGTGTAGGCAAAGAAATGGAAGGCCTTAATGTCTTTGCCACTATGTCTCAAGTGTCT 

GTTTCATGGAAGGATTTCCCACCCTGTGACAATCATCTGTrrGAGGTGTrCATATGCTCTGC 

GG CTCTCCACAGTACCAGGAATCTCGGCCCTACTCATGAGTTGTCCGCGGCTTGGTTGTAA 

CATCCCTGCACCACTTGCAGTGACAAATTCACCTGAAGTGGAG^^TGACXJTGCGGCCCTGT 

TTCTCCCTCTAAGTTCTCTTAGCTATGGGATGACATCTTAGTCTCTGGTGGAGGAAAAGTGG 

GC^^CATACACCAAAAATTGGGGCTTTCTGGTACrrrCACAGCACAGCCATTTCTCGTA 

GTCAT&ACTGTGGTTrrCTCTrTCCTTTCT 

ttacagaagctaagccatagtccaacattgtrrggtcaccatgggggtccttttgtaactgc 
cttatgactcaacattaccaataaagtgatgatcctggtctgcgtttatacatacccttgttc 
ggtcctgttcctgacacgtgggttgagtcaccacagctctgtgtggggaacgtgggagaca 
ggagtggctcctgccgggggaagctgggcctgccattggccctgtgtctatcatgagggg 
agagct aagaaagaaattctcctaggaagagctcatggcccagtacatcctagtaattatttt 
aattag ttttt gtrctgacagcttgtcaggaagggcacagaatgggacagagataaaccaga 

CAGTCATTTTGATCT GCT CTCTACGGTTTTTCAAGTC^GAGGCAATTGATGCTT GTCT AATGC 
ATCCAGACACTGCATGTCTGACTGGCGATGCCAC^CTC<3TAAGTAGTTCTGCCATGAAACAT 
AAAAGACAAAGGAAAAGCCGTTACACATCACACAGAGAACATTTTC<K3GTC 
GGTGGCAGGAAGCTCACTCTCGCmCAGTATTAGAGTGTGTGTGTGGGTCTCGGGGATCTC 
GGTGGCTCCCATCTrCCTTCATTGTTCTGAACATCCTGTATTGT AAACCATGGCTGGGGTGC 
TAAAGT^CCTGTGAATCCCGATGTGGAAAAAGCT GGAGGTGAAAGCTCAGGATACCATGTAT 
TTACTTT AAAAACAGAAAAAAAG A CATGT ATGGAT ATGT CTA 1 1 1 1 1 1 1 1 1 IA TTGGCACATTGT 
Anil l GTGTTGACTrGTTTTTAGAAATGATGTGTCCACACACGTACCC<3TGTCTCTrCTGCA 
TTTCTGTGTCATGGTTCTGTTTCTTAATCACGTGCGGC^MTGTCTAAGTGGTGTTACCAGTGT 
ACGCGCAGTGACCTT GGATGACAGTGGCTCTTT CTCAC AGCCT CCCCT GAGCTGTGAGAAA 
CAGCTTTCT CT G T ACAT ATGCAACTCCT AAT AAAAGG CAT ATTTCTTCCT GTT CAAAACACAG 
AATGGGCCXiATGAGGACCTCAAGTGATAGTGACX^CAATG^TCTTTT AAATCCTCAGGGGCGC 

>1330 

nncatggtctttttatttticaaagagaaaattgactttgtaaa 
atatact atct acatgt ataacaaattcaatttcagagaaaacacaatgtaatatgt aagagaa 
gtacostgttttgatagttgactaacactgaorrgtaatggtcct 
tacact atctt aggt aaataagacttttattc^aagtgtgaattttcacaggaggagaaatct 
ggcagatagatcctcaccatcatctgaacactcgaactgga ci i ul i 1 1 1 ctgaattgacca 
gtcaaagagaaaggaaaagaaaaaaaaatatgacomttgaatttagagtatcaaagcatgg 

AGTATAGAATAAA I 1 1 G I II I I AAAGAGGAGCTATAAGTTGAATGGAAGGAAAAAGTTCTGGA 

AATGCGTTCCATGTAAGGATAGTAATCCCGCGTACCT 

>1331 

actgtttgcattaataaattaaagctccatagggtcttctc^c^ 

CCGCCTCTTCA03GGCAGGTCAATTT ACTGGTTAAAAGTAAGAG^ 

GT ACCACTGT AATCATTATTCCCAATGTTATGATrACATTGACAGATAACTCCAGTTTTGCT 

C<TTGAACTGATOTTATGGCCATAATATGTTGTTGATTC^^ 

TGATCCTu 1 1 1 1 1 CTCACAATGGTGGTGGAGGCCGGGAGCTT ATATGTTT ATTTATGTATGAA 

TGACGATAGTAAGAGATGG CAT ATAATCACCAGACTGATCATATTGGATTCTTT GGGGAACG 

GAGCCGGAAGGGAGTAAACAGAN 

>1332 

>1333 

>1334 

>1335 

>1336 

hWW>tt#iN>KSG>AAGTAAAACCA<3TATTTATTGAGTACCT ATTTGCCAGATACGGCTAG 
gtgctcocacatcccatctcttaatcctcacaacaacccrgtgaggtaggtaataatgatcc
ccattttt^aggtgaggaagtaggctcagagaaattaagcagctttcccaacttcacacag 
tgagtaagtacatctatctgaccccagagttacccttttctatcatgcccccgtaggatattg 
cctggggacacctgacaacagaaagtctaaggttttcatct aggattgggagttaccccaac 
accagcaggatgc agga aaaagtaact gaccggatggttgcctcaatctgttgattcttcag 
tgagttagctcagattttgtcxjaggaacagctttcagagccaaagatracgtattgaacttta 

CCAAGGCATCTGGTGACTAGAAAACTCCTGGAAGGTGGTCATAGCAGAAATTGTTGGGAAA 
GTTCTCAGCATAATAAAAGAGAAATTTTTATTTCCTTCATGATCCACTCCT ACAGGGAAAAATA 

aatggcaaatgaacccatgtatgtcaaactctgtaataaacaccagtgagatcacagtgtca 
ggaaatttca gcctgaatt aaa gat acccttgctctt aaaac 1 hci icm i gatgt atgtgaa 

GGAAGGGCTGGAGGGCAAAGTCAAAGGAAAAGGCAAAATAAATTAACAAAATGACTGNCAG 

GCAATGATAAATGNCCTCCCCTrCCTCCTCCCACCACGTACACTCGCCTTTCCTCCTTCCATT 

GGGCTCACACTGCCACXrTGCTGGCAATTCCTTGCATCTAATAGCTGATC^GTGGTGTTTGAA 

AGAAAAGTTTTTTCCCTGTCTAGGAATGATTGATTCCTCTGTATGCATTTATTTTAATAGGGAG 

GAGAAGNNM 

>1337 

NNAGTT T AACT TTGG GATAAAATCCTAGGACTGGGAATGGAGAGTGCAAAGGGT AAA 
GTGTGTATGTAA7TTTACTAGATATTGCCAAATTCCTCTCCAGAGGTGGCGXACCATTTTGCA 
TTCTCTTCAGTGGT ATTTGATCGG TCCAG GCTCACT ATT AGAAT ATGTT AATGAGCTTCTGGA 
1 1 1 1 1 II I 1 1 1 1 1 1 IGCAAACCTTATAAATAAAAAGTGGTATGCCAGTAAAGT7TCAATTTACAT 
TTCT CTTCrrGAATGAAACTGAGCATTTTCCATTTTCCTCCTAGATTCTT AGGAAGCCTTTGTAT 
CTGCGATAT A AGTT ACT TTGTC C I » C I 1 1 GTCATGTT GTTTAACTTTGCAC I 1 1 CHI I I AAAAC 
CTGCAGTAAATTTTAAATCTTTTCATTCAGTGCTTCTGGTTTTCAAATC^ 
CTCCCGAGTCAGAGGGTGTGACGAGAGTCTGTTCTGGTGCTTCTATG 
TTTGA ATCTCTG ACGTAGTTGGAATTTATTCTGGGCTATAAGGACCCGACTnATTT^ 
CAAAATTTTTTTAACAAATGTTAAACTTAACTT CCT AAAGGCAG AT7ATT ACTGGGACCATGTG 
TGAN 
>1338 
>1339 

NNCACCCACGCGTCCGCCGAGCCCTGAGCGCCGGCGCGGGACCGAGCTGGCGGC 
GGGCGGCGGGCGCCTTCCGAGGCTTCCT GCT GCTTCTGCCCGAGCCCGCGGCCCTCACGC 
GCGCCCTCTCCCGTGCCATGGCCTGCAGGC^GGAGCCGCAGCCG^ 
GCTGCTGGCGCCGTGG0CT<X7TATGACTACCTGGTGATCGGGGGCGGCTCGGGC6GGCT 
GGCCAGCGCGCGCAGGGCGGCCGAGCTGGGTGCCAGGGCCGC0GTGGT6GAGAGCCAC 
AAGCT GGGT GGCACTTGCGTGAATGTTGGATGTGT ACCCAAAAAGGTAATGTGGAACACAG
CTGTCX^CTCTGAATTCATGCATGATCATGCTGATTATGG CTTTCCAAGTTGTGAGGGTAAAT 
TCAATTGGCGTGTTATrAAGGAAAAGCGGGATGCCTATGTGAGCCGCCTGAATGCCATCTAT 
CAAAACAATCTCACCAAGTCCCATATAGAAATC^TXXGTGGCCATGCAGCCTTC 
TCCCAAGCCCACAATAGAGGTCAGTGGGAAAAAGT ACACCX5CCCCACACATGCTGATCGCC 
ACAGGTGCTAT GCCCTC CACCCCTCATGAGAGCCAGATCCCCGG 

CCAGCGATGGAI I ! I! ICAGCT GGAAGAATTGCCCGGCCGCAGCGTCATTGTTGGTGCAGG 
TTACATTGCT GTG^X^GATGGCAGGGATCCTGTCAGCCCTGGGTTCTAAGACATC^ACTGATGA 
TACGGCATGATAAGGTACTTAGAAGTrTTGATTCAATGATCAGCACCAACnrGCACGGAGGAG 
CTGGAGAACGCTGGCGTGGAGGTGCTGAAGTTCT CCCAGGTCAAGGAGGTTAAAAAGACTT 
TGTCGGGCTTGGAAGTCAGCATGGTTACTGCAGTTCCCGGT AGGCTACCAGTCATGACCAT 
GATTCCAGATGTTGACTGCCTGCTCTGGGC<^TTGGGCGGGTGCCGAAT 
AGTTTAAACAAACTGGGGATTCAAACCGATGACAAGGGTCATATCATOT 
GAATACCAACGTCAAAGGCATCT ATGCAGTTGGGGATGTATGTGGAAAAG C I C I fCI IACTC 
CAGTTGCAATAGCTGCTGGCCGAAAACTTGCCCATCGACTTTTTGAA 
AAATTAGATTATAACAA(^TCCCAACTGTGGTCTTCAGCCACCCCCCTAT^ 
A CTCA CGGAAGATGAAGCCATTCATAAATATGGAATAGAAAATGTG^OVGAiCCrATTCAAOGA 
GCTTTACCCCGATGTATCACG^iAGTT ACCAAAAGGAAAACAAAATGJGT GATGAAAATGGTC 1 
TGTOGT AACA AGGA AGAAAAGGTGGTTGGQATCCATATGCAGGGACTTGGGTGTGATGAAA 
TGCTGCAGGGTITTTGCTCTTGCAGTGAAGATGGGAGCAA 
AGTCGCCATTCACCCTACCTCTTCAGAAGAGCTGGTCACACTT 
CGTGTCGCGGGCAGTGGGACCCATAGATCrrrCTGAAATGAA^ 
CTGTTTGAGTTTTATGTATTTCTTTATTTTAATC^

ATAATAGAACTTATTTATGGAGTTAGAAATTTGTAGTGTTATCCAGGATTGATTTTCATTTGAT 
CACAT CTCAC AGTAATTAATATTTTCAAG I H H 1 It 1 1 A TTAACAGCTCT GTGCT A G I M l 1 1 1 
TTTCTGTTTT AGCCTCATCCCAAAT AT AAAG CTTTG TGAAGT ACAATTAACTT AAT GT ACTTGA 
ATGAAT AGAACTT GCT AC I 1 1 1 II 1 1 1 1 1 1 1 1 1 GGGGAGACGAGTTTGGCTCTCATTGCCCAG 
6CTGGAGTGCGGGGGTGCT ATTTCAGGTCTC^CCAAACCTGGGCTCCGG GGGTCAAGGGG 
AATCTCCT GACAT TT AGCCTCCCGAATTAGTGG AAATT ACGGGCAGGCACACCAAGTCCGGA 
ATATTTGGATTTTGGAACATGGGGCGTCCCCAAGTTGGCCGGGTGGTCTAATCCACTTGAG 
GATTCGCX^CCAGGCCTC TAGTG TGAATTCAGGGAC^AGGGACAGGTATTCACCACGGGGGAT 
CCAGTAAGGGCGGACCATTTTTGCACGGTGCTATACGTGAATACTAAAGTAAGCGATTTTAA 
AGAGAGG GCTT AAG ACAGACAAAACAT AT ACAAG ATACCNNN 
>1340 

NNNNNCATGAG^TTGCTCTATATTTTCNTTTTGGTA(3CATCCT^ 
TCAA TGCT ATNA CATTCATTTTATAAAATGAATTCAGGAGI MUCH IATTTG I rTTTTTTCTTI 
TTTTGGCACTrrGAGGTTCAAG^TTCTTTTCCTAGTCTCTGCAATTGTTT AAGCAAGAATGGG 
ATTATTTAATCTTAAAAGGTTTGGTAGAAATCGCCTTAAACAATCT GGGCTTG ATGCTTCTTTT 
GGTAAGG GG TACTT TT AACTATTT GTTTCT TCTACGGTAATTGGTTTGTTGTGACTTT ATCT AC 
CT AGAGTAAATTTTGGCAATTTGCATTrrTCTCAAAATAGTTTTTGAATTTATTGTGTA 
CTCAAAATAGTCAATTTAAACAAATTTCCTGTTTTACTATTTCC^ 

TATTTGTGCTTCCTCCI mTTTCl I AAATAGGTTAGCTGGTAGTTTATCTGACTCAACTCCCC 
GCACTACCCATCACCiAACTTGGATTTATTATTGCCGGACGCGTGGCGGACGCGNN 

>1341 

ACTTT GACTAi nil I AG^AACAAATTACTTTTGACACACAGCACAATTGATTTAACAC 
TTCCAATTTTGG AACT ATTGGATAAATAATGATGGGATTTAAATAAAGCAATCCGATTCTACTA 
TT ACAGGATAGGGTCrrCTTGTAGTCCTCTTAGTAAAAACT ATTGTGACACTTCX; 1 1 CM 1CTCC 
A^TATTCGGCCTGGAAAGACCTAAATACAATGCAGGGATTGAATCAAATTCAGACATTTTTT 
TTCCT ACGGAAACAACAACCTTTCTTGCTTATATTTAACAAAAACT AGTATAGATTCCCTTTAT 
ATTAAT AGTT AT ATGGTA 1 1 1 TT 1 t CTCAGAGTAGAAATGAGGTTT ATAGGCTAAGAATATAGG 
CT AATTTG GAG CAT AACACT AA CCAG CATGAACCT AAG TGAGT ACN 
>1342 

ocgc^agctGGGACTAGCGAAATCTGTCTGCACTGATCGCAGTCGTCCTCAGTTTCACC 
CT CT AGGAATCGGCCT G GGGGTGCACGTGCTACTCTCrr<XrTCCAGGCCGGTCCCCGGCG 
CGTGCGCGCGATCCATGTCGATGTCCGCGCCTATCAATAAAGTTGCTCA 
CCGCTAGCCCGAAAGGTTGCGCGCGCAGACCGAGAAGTCT CGCGATAGCCAGCCGCGGCT 
GCCCTTGCGCTTCCGCGAGCTGGCGGGGTCCGTGGTGCGKSGATC^ 
GCGCCGAAGGTTITTCGTCAGTACTGGGATATCGCCGATtSGCACGGATTGCCACCGC 
CCT ACAGCACCACCAGTATTGCCAGCGTCGCTGGCCTGACCGCCGCT GCCTACAGAGTCAC 
ACT CAAT<XTrCCGGGCACCTTCCTTGAAGGAGTGG<TrAAGGTT 
CAGCTGCTGTCGGGGCCGTGTTTGGCCTCACXiACCTGCATCAGCGCCC^ 
AGCCCX^CXSACCCCCrrGAACTACrTCCTCGGTG^ 
CACGCACGCACAACTACGGGATTGGCGCCGCCGCCTGCGTGTACTT^ 
CCCTGGTCAAGATGGGCCXSGCTGGAGGGCTGGGAGGTGTTTGCAA^ 
CCCTGTTJCCTGCCGGGACCTCCAGCCTGCAGAAT 
GTGTGAaacacscAAC 




>1343 
>1344

TATAGGGAGTCGACCCACTCGTCCGCGGAGTCTCCCCGGCG 
CCGACCGCGTCTGGGTCTCCGCGTCTGCC^^ 

CCGCGCCGG^GCAGTCTAGGCGGAGCGGGGGCTCAGGCGGCGGCGGCCT 
TGAGTGTCGTGGTTGGGGTGCTG6ACCCAGAGTGCCT ACCCTCGCCTGCCTGGGCCTCAG 
TTTCCACATCTGCACAATGGGGGTGACCATCCCTGCCCTGCTGGCTGCCAGGAGCGGCT
GAGTCTTCAGGCGTGGATGCAGCCTGGGGGAAG^CATAGGGCGCTTrCACAGGCCTGGCC 
TTCAC(^TGGCGGGAGGGAGACCGCATCTGAAGAGGAGTTTCTCCATCATCCCCTGCTTTG 
TCTTOGTGGAGTCGGTGCTGCTGGGCATTGTGATCCTGCTTGCTT ACCGCCTGGAGTTCAC 
GGACACCTTCCCTGTGCAGAC^CAGGGATTCTTCTGCTATGAC^ ACGCCAAGCCCT 
ACCCAGGGCCTGAGGCTGCCAGCCGAGTGCCTCCTGCTCTTGTCT ACGCACTGGTCACTGC 
CGGGCCCACCCTCACG A TCCTGCTGGGAG AG CTGGCG CG TG CCTTTTT CXXTTGCACCACCT 
TCAGCCGTCCCAGTCATCGGGGAGAGCACCATCGTGTCTGGGGCCTGCT GCCGCTTCAGC 
CCCCCAG TG CGGAGGCTGGTCCG CTTCCTG G G GG TCT ACT CCTTCG GCCT CTTCACCACGA 
CCATCTTCGCCAACGCGGGGCAGGTGGTGACCGGCAATCXXJAC^ 

GTGCGCCCAACTACAC6GCCCTGGGCTGCTGCACCTTCTCGGATCGGCAGGCCCCGGCTG 
CCTGGCCGOSCCC^TGCCTGCGCTGGCGGTCCAGCTCGTTGGC^ 

CCTGAAGGTGGGGCCCCTrGCTAGCGGCCCTAAOCGATGTACGTGACTCTCGTGTTCOGCG 
TGAAGGGCTCCCGCCT GGTCAAAOCCT CGCT CTGCCTG GCCTTG^GTGCCCGGCCTTCCT 
GGJGGGCGTGGTCCGCGTGGCCGAGTACCGAAACCACTGGTCGGA 

C^XrGACAGGGGCGGCCATCGCCACCTTTTTG^TC^CCTGCGTTGTG CATAACTTTCAGAG C 

CGGCCACCCTCTT^CCGAAGGCTCTCTCCCTGGGAGGACCT 

GATAGGCCXXTrCGAAAAGTTAAGTGTGGCGCAGGAAO^ 

CACCGGCACGGCTCACCCCATCCAAGTCGCAGAACTGCGCCOSCCGTGGCCACCTG 

CCAGCTGTGTCTCCTCCAGGGCCCCAGCCATGTGTTCGTCGCCCGGTGT^ 

ATTGAGGTCTGAGC CGACG CCCTTGCCCCT GCCCCT ACCCCTGCCAGCGCCCACCCCCAG 

CCAGGGCCCCTCGCCTTCCTCCCXTrGG 

GCCGGAAGCTGCTGCTGCCCACGCCCCTGCTGO^ 

ATOCCJOCCCCTTjCC^ 

GCTGTGAGGCCCGACCACCCACCCAGAATCTGCCCAGTCCC^^ 

TGTGTGTGCGTGTGCCACGTGAGTGCCAAAGTOCCCTGCCCCOCAAGCCAGCI^kGACCCA 
GACATTAGAAGATGGCT AGAAGGACATTTAGGAGACATCTG(XrrCTCTGGCCCTCTGAGATA 
TCCGGATGGGCACAAATGGAAGGTGCGCACTTGCCCCTACT ATTGCCCTTTTAAGGGCCAA 
AGCTTGACCCCATTG^CCATTGCCTGGCTAATGAGAACCCCTGGTT 
AAAGGAGTTGGCTCCAACCAATGGGAGCCrrCCCCTCACTTCTTAGAATCCTCCTG 
GGCAACTCCAGC^TGTTOGC^^ 

CTAGAGTGGGTGGGCATGATTCXJAGTCAATGGGGGACCGCCCGTGTCT AAGCATGTGCAAA 
GGAGAGGAGGGAGATGAGGTCATTGTTTGTCATTGAGTCnT 

AGCTGT AGGGT GGGGGGCAGGCTCCCCCATGGCAGGGTCCTTGGGGT ACCCC mTCCTC 
T^P^^J^^^P^^CGGCC TCTCCA XXTCTCACCCACTCTCTCCT AATCCOCTACTTA 

CATJWTGGGGGGCATGGGTGGAGCAGAGGGGCTCCCTO 
TCGGTAGAGGAGGCACTGCCCCCCTTTCCTGCCCCCTC^ 
TCT^TCTTTAATAAAGACCTCTTrGTAACAGAAAAAAA 

>1345 

>1346 

ACT AGATTGGGTG TGTG1 ATT AAGAGAAAGACAGGAGT CAAAGATAGTTCCAAAACT 

TTTGAACAGAACACT GGATGAATACTGTTTACTGAGATGtGGGAACACTT ACJ^AAAAATf^k 
TTrGGAAAGCA^^ACGATCA^S^ 

CATAGCT AT^ATAGAGGTATTAAATTGGCAGGACAAAAT(^TAGCTAGAGATAAAAATTTAGAG 

ES^Sactgtaa^t^^^ 

CATAGAGGAAAGATGTGAGGATTGAGCACCAGGGGACTTCAACATTGA^ 
G^AGAATTOCCAAGAGGATGAGGrrCCACCTTT AGGACCGOCAAAGAAGACTTOCCAGACA 

>1347

.^Jg£?£^G^TT«^ATATTTTa€TTrT 

I^^£2^II~!7™ T ^*^^ 1 1 m ui 1 1 attt u n 1 1 ii i icm 

TTTTGGCACTTTGAGGTTCAAGATTC 1 1 1 1 CC I AGTCT CTGCAATTGTTT AAGCAAGAATGGG 
ATTATTTAATCTTAAAAGGTTTGGTAGAAATCCCCTT AAACAATCTGGGCTTGATGO I rCTTT I
GGTAAGGGGTAu I 1 1 1 AACTATTTGTTTCTTCT ACGGTAATTGGTTTGTTGTGACTTTATCTAC 
CTAGAGTAAATTTTGGCSAATTTGCATTTTTCTCAAAATAGTTTTTGAATTTA 
CTC2AAAATAGTCAATTTAAACAAATTTCCTGTTTTA 

TATTTGTGCTTCCTCCI I 1 1 1 1 ICM AAATAGGTTAGCTGGT AGTTTATCT GACTCAACTCCCC 

GCACTACCCATCACCAACTTGGATTTATTATTGCCGGACGCGTGGCGGACGCGNN 

>1348 

>1349

>1350 

>1351 

ACAAGTATTATGTATCCATAAAAATTAAAAAATCTTTAAAAATGCATATGGGGGTCAG 
TAGGTAAAAG AAAAGAGAACC AAGAGAGCTGCAGCGGGGAGCACAGCTTGCTTT AAACATG 
AGATCCAGCTCAGTGATCATGCGGGGGAAAAGGCCCGGCATTGCTGGAACT CCT AATATTT 
AAAAA ^TJ^ GGAAAC ^ GAAA1 ^ 

TTGAAGACTTTGAAGCCTCTCTGCTGGTC^AACAAGATGTATCrrGTAGGCTGGATTTAGTCXD 
ACAGCTGGCCAGTTTGAAAACTGAATCCTGCTAGCCTTAATTTAAAT^ 
GCTTTGATTCCTGCACTCCTGCTCAAAAAAATCTTCAATGGCTCCC^ 
AGTCCAAACTTTGTCACCAGTCCTTCAAAGCAACCCATGACTATATCCAAGACCCCA 
ATTTCT ACCTT ATAGCCAGT CTC^TCTT<XACCX5CAACCAGJVATGATAGTTGAATTGTACTC 

taggaaggaaaatattcagaaaggcaccagtcct ggccatgagggctgcttctgggtccct 

AAGCTTTTT CCTTT CCTGCAGTGCCCTACACTGTGCATGCCTATCAATGAAACCCTGTCCATC 

ATTAACCATCCAGCTCAAATACCACCTCTCCACAAA^ 

C^CJTTGCTCTGAATTTOCATGGGACrrTATATCACTCACATGAC^ 

TGTTTTGGCATTACCT AC^TAGCTCATTTTCCCAAACACCTAATAAt ATTCACAGAAGGAAGA 
GTTTGGGTTTTGCACT ATATTGAATATCCCTGTAGTGCTTTGTACATAGCANh£NNN 
>1352 

NWAGTCGsAGCCACGCGTCCGCCGACCGCGGGCGCGGGGGCCG^JGCGGGTGACAG 
CCGATCCCCGCtXXTTGCTGCCCGCCACXSTCCCTCA^ 
AAACCTGGCGTACTGGCTGTGGCrrCTCTAGCGGGACTCG^ 
CTrOGCGGAGCCreCCTTGGCCGGCCC^ 

CAGCTTCAGCTCGGACTCGGGCTCCAGCCCGGC^^ _ 
TGGACTTCT ACGCGOGCTT CTCGOCGTCCCCGCTCTCCATGAAGCAGTTCCTGGACTTCGG 
ATCAGTGAATGCTT GTGAAAAGACCT CATTT ATG TTTCT GCGGCAAGAGTT GCCT GTCAGAC 
TGGCAAAT AT AATGAAAGAAAT AAGTCTCCTTCCAGAT AATCTT CTCAGGACACCATCCGTTC 
AATTGGT ACAAAGCTGGTATATCCAGAGTCTTCAGGAG C 1 1 C 1 1 GATTTTAAGGACAAAAGTG 
CT GAGGATGCT AAAGCT ATTT ATGACTTT ACAGJ^T ACTGTGATACGGATCAGAAACCGACAC 
GATGTCATTCCCACAATGGCCCAGGGTGT GATTGAAT AGAAGGAGAGCTTTGGGGTGG 
ATCCTGTCACX>GCCAGAATGTTCAGTAC 1 1 1 1 1 UGATOGATTCTACATGAGTCGCATTTCAA 
TT AGAATGGT ACT CAATC>AGCACTCTTTATTGGTTGGTGGAAAAGGCAAAG^ 
TCATCGAAAAACACATTGGAAGCATTAAATCCAAACTGCAATGTACTTTGA^ 
ATGGCTATGAAAAATGCTAGGCGTCTGGGTGAATTGGATT ATATTAAC i 1 1 rCCCGAACTAGA 
ACTTGAAGJ^AAATGCCC^T^ 

GTTT A*XXX*X 1 ATTCAAGTTCATGTCACGXrr(a^TAATGAGGATTTGACTGTGAAGATGAG 
TGACCGAGGAGGTjGGCt»IIUCT 1 1 GAGGAAAATTGACAGACTTTTCAACT ACAT6TATTCAA 
CTGCACCAAGACCT CX^TGTTGAGACCTCCCGCGCAGTGCCTCTGGCTGG 1 1 rTGGTTATGG 
ATTGCCCATATCACGTCTTTACGCACAATAOT 

TCCCAGTGT ATAACAAAGCT GCCITjGjAAGCATTACAACACCAACCACGAGGCTGATGACTG 
GTGCGTCCCCAGCAGAGAACCCAAAGACATGAOG^ 

GGACATCGGAAAATCCAAATGTGGCI 1 1 TGTATTAAATTTGGAAGTGTGGCCCAGAGTTGCT 
CAGAATTGJSAGCAGAGCCTGAGACGT ATCTGCAl^TOCTGT CATCAGCTGGCAAGTCCAGG 
AGACTGTGTCATTTAGAGACTGTGTTG 1 1 AGTTATCCCTCAACATCTTCT AAGGTGGCAGGAA 
ATAAT A^GGAAATAACATTTT AAAGT AAAAATTTTAAAGTTTAAAGAAGAGTTTTGCCACTTAA 
ACAGGGGAGC 1 1 1 G I C I GX5AAAATACACTGAGTTGAAACACTTCATCCTTGGAAGGATTATA . 
JAAGATGAACAGTTGT^TAAATGTGTAGATTAGA^ 
GCCCTCATTTAAGAG^OCAAGATCCTGATTCAGAGGAGGC^ 
GCTAftTCTGACCAAATGTTGGGAAAAATGTCTCACCTAACCCACTATTCCTTA.*TTATQ^^
TOTGAAAAACAATAGAACATGT^ 

GCTTTAAATTACAGTTTTCTTATAATGTTGAAATGTTTTAGAATOCTTrGAATCTAAGTATTTCT 
TTCCT AAATGAAACATTTGTACAACATTTGATGTTTTTACTTATGAAATATTCTCCTCCCCCAA 
GAAAATTTAAACTTTTTCTCTCTATTTAAAAGCTAAGAAATGTTTTAAAGGAAAAATGAAATTAT 
CTTCCrrTTAGCTTATTTTTAAGGT AAAACAGCTTTTTACTCTGTTATTGTGGTAATGGACAGAA 
TATTAGATACAAAAATATTCTGGGAGAGCTTTTTCCTAGTTGGTTTTAAATCATTGTGCCACCT 
GAAAGGTTTTTAGATTTTATAGGAGCTAATTTGTCCACCAGGATTAATGTAACACAGTGTAGT 
TATGAAAATATATTGAAGGACAGGAAGTGGACACGAAGTGATTnTGTAACCTGAGCAGTTAA 
TGAATGTGCCAACATTTTCTAGGAAGGGACAGCAAGAATATTCTGCTCTGTAGT TAAAAT ACT 
GGCTGGCI 1 1 1 GATGTCTTCATGCTTAATTGTGATCAC \ I I CI I G CACTGTGATGTTTTTACGT 
GAATATGTTGAAGTAGAAGTCTACCATATTATTTTATAAAATGTTTTCTGTATGGCAATAAACT 
GAAAACATGGATCAACCC I ICM I 1 GAAAATAAACTGAGTCAATTTAGCCC TTTAA AAATATAG 
TCCATCCCCTTTTAAATAGAATCCCTCTTCCCACCCATCAAGGCCTCAACATTTTGTAAGCAT 
CCAAAAAATTGGT AATTAGGGGGCTTGCACT AAATTTCACTATCTTCAGTAGAGAGGAACTGT 

TTGACTTACCTrAGATTTTTATTTTCCATACATACTGCAAATGATTGACTTGTTGCATAAATGA 

AGATCTTCTGTTGTGTGCTT7TCAAACACTGTAAATAAATTTGAAATTTGAATAACTTTCCACA 

GTATAACTGN 

>1353

>1354 

>1355 

nnnagccgttgagcagcaagcaagatggcagcttccgagacggttaggctacggc 
ttcaatttgattacccgccgccagctaccccgcactgtacggccttctggcttctggtcgac 
ttgaagagatgccgagtcgtcacagatctcattagtctcatccgccagcgcttcggcttcag 
ttctggggccttcctaggcctctacctggagggggggctc7tgccccccgccgagagcgc 
gcx3ccttgtgagagacaacx^ctgcctcagagttaaattagaagagagaggagttgctgag 

AATTCTGTAGTCATCAGTAATGGTGACA7TAATTTATCTCTTAGAAAAGCAAAGAAGCX5GGCA 
TTTCAGTTAGAGGAGG GTG AAGAAACTG AAGCAGATTG CAAAT ATTCAAAGAAGCATTG GAA 
GAGTCGAGAGAACAATAACAATAATGAGAAGGTCTTGGATCTGGAACCAAAAGCTGTCACAG 
ATCAGACTGTCAGCAAAAAAAACAAGAGAAAAAATAAAGCAACCTGTGGCACAGTGGGTGAT 
GATAACGAAGAGGCCAAAAGAAAATCACCAAAGAAAAAGGAGAAATGTGAATATAAAAAAAA 

ggctaagaatcccaagtctcogaaagta<>ggcagtgaaagactgggc<^tcagagatgt 
tYatca^ggaag^ 

TTTAGCCTT AGOCCCAGCAAGGGCAAGACCTCTGGAACAACATCTrOCAGTTCAGACTCTAG 
TGCAGAGTCAGACGACCAATGCTTGATCTCATCGAGCACCCCGGAGTGTGCTGCGGGTTTC 
TTAAAGACAGTAGGC CI 1 1 1 1 GCAGQAAGAGGTCGTCCAGGCCCAGGG CTGTCATCACAG A 
CTGCAGGTGCTGCTGGATGGAGGCGTTCTGGCTCAAATGGTGG7GGACAGGCTCCTGGTG 
CTTCTCCCAGTGTGTCTCTCCCTGCT AGTTT AGGAAGAGGATGGGGTAGAGAAGAGAACCTT 
1 1 F ICI I GGAAGGGAGCTAAGGGACGGGGCATGCGGGGGAGAGGTCGAGGACGAGGGCAT 
CCTGTTTCCTGTGTTGT AAATAGAAGCACTGAGAACCAGAGGCAACAGCAATTAAATGACGT 
GGTAAAAAATTCATCTACTATTATCCAGAATCCAGTAGAGACAOCCAAGAAGGACTATAGTCT 
GTTACCaCTGTT AGCAGCTGCCCCTGAAGTTGGAGAAAAGATTGCATTTAAGCTITrGGAGC 
TAACATCCAGTTAGTCTGCTGATGTCTCTGACTACAAGGAAGGAAGAATA7TAAGCCAGAATC 
CAGAGACCCAGCAAGTAGATATAGAAATTCTTTCATCCTTACCTGCCTTGAGAGAACCT 
AAATTTGATTTAGTTTATCACAATGAAAATGGAGCCGAGGTAGTGGAGTAGGCTGTGACACA 

ggagagcaagatcactgtattttggaaagagttgattgacccaagactgattattgaatctc 
caagtaacacatcaagtacagaacctgcctgagt atgacctctccagcttat agttt atgaat 
gtcttgtttgtgaaagtgact at aacccaaactm i! 1 1 1 1 1 1 1 1 aaagaggatttggaagtt 
gtatgg a i ftttttofl 1 a tcttcacttt actggat aggaaacaatctaoctcatcatttaaaat 
gacatgggtgtcggttttgtagatctttggi 1 1 t 1 1 1 gtcaggtttaatttgagttaacaaaat 

G TAAAACATG ACATTCCCTGCAGAT ATTGTTGTATACCAGT ATGGTTTCTT CTCTTTCTTT AAA 
TGTTTTTGGCCATCAAGTAGCAGTCGTCAGTAGGAGTTTATAATAGGAAGAATGTGCTGCGT 
ATCTTGTCTGAATAAGTTTTAAGTAACATTTAAAAATATTAAAGCATGTTATTTGACCTAATTTT
TTAGCATTTGAGTTGTTCCATTAAATGGAGCATCTTGTAAATTTCAAGTATTTTATACTTGCAA 
TTGT TAAGAGTTAACAGGTAGTTGGATTTGTCGCAGACAATGAGTTAAGGAATCCTTTCACGT 
TTTTCCCAACTTTAAAATTAAGGATTCTCAGGTCCCTGTGTAGAGCAGTGAAAATAAGATGTG 
CGTATGTGTGTGTATGCCTGGAGAATTGGTGTTTCACTTCAGTGAGAGGATTGGCTGTGAGC 

ttcagaccaggaaatgtgtcatcttgccaggcacctggctgagtgtgctggagtgaggatc 
ttgaagagaaacttccttttctgttattattcactacgaagctaaaatgggcaaatatataccg 
tgaaaattggtttcatttaacaaaagatcagatccctccttcagctgtacacatttttaaataa 
aatcatattgaactaaaaaaaaaaaaagtaaattctgcgnnnnn 

>1356 
>1357 
>1358
>1359 
>1360 

NNAGCTGCCGATACTACTACTAAATTCGCGCCCGGCTGCACCAAGGATTGGCCCAT 
ACCTGAT ATGATCCCTTCCAGTGCGGTTGCAG GGAATCTCTTATTCG ATGTTGTTTTCAG CCA 
ACAACATCTCCAGGTTGTAGG CCATGGTCTCTGTTACCTTTATCTCCCTG g agctcattgcg 
AACGGAATCCTTTATTAATTTGGCATTTATGGTAGGAAGTTTTGTGAGACXrrTGGCW 
GAGGATATCACCTTTAAGAAGTACAGGTTCATAAGCTCTTTTATCTAGCCACATAGGCCTTCT 
TGTTATTATTTCAAAGAAAGAGACTTGACGTTTTATGAGTGGGGTGGATTGTAGGTTGAGCA 

GAACTAATGGGAGAGGTGCTGGCT AG agaaagttaaaaatttctgttagctttgcattgagc 

TTTTTAATATCATTTGTTCATTTCACCAGTTCAGAGGATTGGGGGTGATGGGCACAACAGAAA 

tgatggaatataggccaaatgttacaaatagataaaattacctgaccagtgaagtgtgttcct 

CAGTCGCCATGGAGCTCAGANNNNN 
>1361 

TITttatrrTTTTTTGAAAAGCTAGTAGTCTATATATCTATATATCTATCTATATaaaCtAcTA 
GCTTTTGATAGATATTGTGTTTACATAGTCCAGAAGTTAAA7GCAGGTATCCATAAGAAGAGC 
ATTAACAATAAAAATACAATCTGTGTGTAGCCAAGTAGAGAGACTTAAAATGGTAAAACAGCA 
AAAAG G ATCTCACAAAAGT ACAAAT AT ACAGT ACAAATTG ATTT ATTT AAAAC AGTT ACAAAAA 
AGCACAACAAAATAGAGATTATCCTTAGAATTATTAATGCTTTGTTAAAGATCAGGTAGGATTA 

gagggtaagaaatggtattgggtggagggagcacctgactagttctaaggctttagataca 

atgcagttgattacatattaattcagccattacatactggggatagtagatgggggatcagta 

atttatctacagggaactcctacacagatc^tccagacctccccagtgc<^tccttctc^ 

ctccttaggaccctaaagccacctggatgacaac^ttcccatccri^ 

tatccgtatattcttccctcgcctaaggtgctgtgcaagctgagagcagtctgctctctgca 

GCTGGATCATATACTGTTTTGGCTACAGGGATC^GTGTGACTAAGAAGGTTGTAGGGGACA 
GCCTTATTTTAATACTTTAAGTTAGCCATGAGTACAGTCAGTCCCTCCC^ 
ATTCTCTTGAATGCAAGTATCAGTCAAGTGAGGGATCAAGCCTG<XrrTCAGCAATACCAGGA 
ATGCACTAACCACTCACAGCTCACAACTACGTGCAAACTCCCTGCCCAGGGGGTGGGTGTA 
AAACCACTGTGTCATGAGCTCAGAiCT CTTTCTCTGAAGAACCTCAGTCCCCATGACAGTCATA 
AAGACAAATAAATGCITGCAAGTTTGACA^CAGGTAAAGGCACATGGGCTGCTGCCACACAC 
ACTGCAGGCAAGAACCCAATTTGAGGCCATCTTCACTGCTGGGCATTTGCAGTGGAATGGC 
TTTGGGATCACAAAACTTTGAGTTCG^XATCTACCGGGGGGATCTAGGTTTAACTGGAGAAC 
TAATAAAGAAACCTTTAGTGTGAAAAGTGCATTTATGACT ATTGTTCCTGCTGGGGAGGCAG 

ATAGTCTGCAGCTA 



GCTTCTGGTCCAAAGATGGATOCT AGAGTTGAAAGAGCAACTGACAG CTGCAAAGACCCAG 
AT GCTT ATTTT ATT ATTGCATAATGTG^gaGACCTTCCocTACCCCAGGAAAGCTGGAGAGTGA 
AGGCTTCCCAGCAAAGGGCAAACAGGTTCCTGATCGACATCCAAGCTAATAAGGTTCAGCTC 
CTCAACTGT AGTGT AGA G 1 1 1 1 GCCCATGTGGAAGGTACTAT AGCTTCAGTGTGGTTTAGTAA 
ACTTAGCCT AGGAG<KX1AAGATGTCTCCCTAAAACTTAGTCTCTGTCCT ATTTACTTTGTTTA 
TAAGAC TGTGACC TAACTTCCCATGGCCAATTCAATC^ 

CCCAGGCC nil CCCAGTCAATCCATGTCCAACCCTTCATCTCCAGCGTGATCACT CAACTC • 

GCCCATGATGCCCGCAGGTAAAAATAAAACTAAACCCGACTTGAGGTGCTATT^ 
CTTTAAACCCTCTCCGAACTCCAGAATCCACCTAACTGGAGCTGGACG AACTTCACTG GGAT 
GCTG GCATGGGCCCT AATGTGGTTCoclTCATTTAGtTTTTaaacAGGATACaaG ATAOCACAAA
GGAAAGAaGTTAATacaecAGAAGACAccactagtagaaac^^ 

caaagggcTCocaaCAGCACTGaaGccTTATooctttggATTTACCAGCCCACCAGCAGCATTTCATAC 

CATOCCT ATCGAACATGACTTCGCTATAAAAT AACATGGCCT AACACAGGTAAGAATGAACC 

GGACGOCTACACTTAATCTACATGGOCAAAGACATGGAAGGATAGGGAGCTGTAGAGGACA 

GCAGGCAGGACAGrrGAGCGGACAGCAGCAGTGGCAAAGCAGCGCAGGGCAAAGCCCTTG 

GTGTTATGACCAACTGGGaCCAAGAATATAGAAGGACCAGCCCAGATTGGCTGGGGGCCCT 

CCCTCAGGTACAAGTACTCAGCTCTGAACTCCCAGAGCCCCTTCCCCCCATGCCACCTCTTC 

CTCATGTATACTAATCTTGATAGGAATGATTCAAGACTTGGCTTATTATCCATCCATGACTGG 

GTTCCCCTATTCCCAAATTGGTATACACA 1 1 1 1 1 ATACAGCGCCAGACAGTCAGCAGTCCTCC 

TTTCTATTCCTAAGAGCTGGAAATGTCCTACTGCTGACAAGGGACAGGCTCTGGCCCTTGTC 

AACTGCTCATTCTTAAAAAGTTGGCAACTTCCCACTAAAATACTCGCTAAAAACTGCTCTCAT 

GTCTCCTGCAAACTCAGGAAATTCCTATAAGGAAGAATGGACACACCACAGGTTAGTCATAT 

TTTTCATGCCTGGAGGAAAAGACTAAACTAGAGGTTAAAGGCAAGGGGGAGGCCACAGCTC 

TGACCGAGATTTAGAGCACAGAGATT O I M CCTGGGGAGCATATTTTATGACAGATACCTTT 

GCAGCTCAGCAAAATCAAGTATTAATATGTTATATATGCATGGAACGATATTGTGGTATTACA 

cc^ggtcctcaagaatttctctttccagaattcagtggaagttacc^ggttggaaagtcac 
aagtagcccaagtggaaggtgatttgatctaaactaattggtctgtatctggttagagatag 
gtttgccccataaggagaaagctcagcttgtgtccttgcagacact^ 



TATCCCCATrTGGCAGGTGGGGAGGAAGTG^AAGAGCCAGAGGTCACGAGACTTGOOGAAA 

gtccttgacagagcagagatc^gaoctcagaggtttggactcctgatgccaggccaggtct 
cggccaatccaagtataccttataaaggacacgccaggatcatagcctgggagaagtctaac 
cagagagctgcgtggacaataccctgagttgccttcaagagtctaaagacccccaaaaaaca 
aacaaaaaaactt acccacctgaaagct agtgggggagcctaattcccaatacccagtggaa 
ccccacggtctgagctgatgctaaaagttgtgaagaaaatgtgggattaagacattggtgg 
ggggtagggggggtggcaatgcaaccaagattgaaaata ci 1 1 1 i gcagaggaatatagca 
acacaaaatgacatctattcaggtactgaaatgttatcaagact atcctagaatcttagagct 

aggaggggcatatgggacatgtgaaaatgccatgatgggcactgcccagccacgcattcgc 

CCTGgcccaGGGCATAGTTCCCrrTCTTGTTTCXrCACX^AAgcK3 

TGaKleaQatmaAAAATegaoATTATAgcTGAAGGflgaaa^ 

AgaGGlcTTGGlCAGagtggacAAiacCAACTTTGCAGACAAGGAAA 

CCCAGGTGACGACACCATGGAgcGGTCAGCGCAGCAGmaGGGTGGGGGTGGCGCAGGCACT 
TGGGCACAACACAGACAAAGGTGGGGGGTCAGTGCrtaGAAAAAGGGTctACT AGTTCTeaagT 
caocattaocAGGAAGATTCTGTGAGATTtCAAAGTG GGlctggcaTG CTGAg«0aagg»GCTTTGTT AG 

gtgagct acctttcgagagggcctgcctagaacccagtcagcctggctatcaagtc i ucic 
ccaaagt cccctgactgagtctx^ggcaaggtagt acttca<actgaaccctgaaggggctc 
tccctccgggcccccgtgctgctagaagccttgtaggcagaatggtgaggagagatgggg 

AAGGGCT AT GAAGCAGAGGATCAAGCAG AGTCCTGCAGATGAGAGTG CTCCTCCCCCACAC 

ACXX^AGCCATAGACATctggAACAAAATCAGCCTTTAAAGTGCGCAT CTCTCACAAGAAGGGTT 

CTGGGTGGGAACCClcTTATTTCCAAOCTTGGGGCCTCCTGCCCTTCT ACCACAAAAAAGAAA 

AAAAACTTtnXaACAGTTT AAC^yctaeT AT AAGCTGTGGATCATCATTTTIggcT C I MCI I IA CTC 

CCTCCCCggAcgcGTgggtoGACTC 

>1362

>1363 

>1364

>1365

>1366

NNNNNNNNNAAATTTCCTATTCTGATAG CAATAATAAGAAGTAGGAAATTTCACATAG 
tcagcaacctaacactgtgagaaagaatccaagtagcaggtcattaaaacagatggcattta

ggaaagtaaacttattcaaaacatattcttgtaagtaaatatgtaccacaacgtttctactcta 

ttgtgtaagctttaaatacaaaaataccacaaccactcccggactcctccatta^ 

tactggctgccctagtttttgaggatacatcatgcaaataagttcttttatttttcaaat^ 

ttattcctaaagtatctttaa 1 h i icm i 1 1 g gttat acagcttatagaataaacaagtcacaa 

gaatcttcatttgtttctaaagtatataattctacaaaagttgttttactcaatgtgaattaaaa 

tttgcaagtctaaaaaaataaaaaaattttaaaaagtaaaaaaatttgcaagtcatttaacttg 

gtgtcttttata7tatcttcagatgacaattgctctcagatgactattaaactcaaagatgcaa 

tacagtt aaatcttc c 1 1 i c i 1 1 gtaaaatt actgttgttcaatt agc aagag nn 

>1367 

NCCGGTTCCCCATTrTTTTTTG AACCACAGTCTCaCTCTCACCCAGG CTGG AGTG CA 
GTGGCACAATCTCAGCTCACTGCAAGCTCCGCCTCCTGGGTTCACGCCATTCTCCCACCTC 
AGCCTCGCGAGTAGCTGGGACTACAGGCGTGCGCCACCATGCCCAGCT AATTTTTTGTACTT 
TTAGTAGAGACGGGGTTTCAGCGTGTTGG CCAGG ATGGTCTCGATCTCCTG ACCTT GTGATC 
TGCCCGCCTCAGCCTCCGAAAGTGCTGGGATTACAGGTGTGAGCTGCCGCACGCGGCCGT 
TCCCA T III! 1 ATTGGTGGCAAGATTTCAAAACTTCAT ACTGTTT ATAAACTGATTTTCTGGTTT 
ATTTCT ACTGTAAT CTCTCAAT ATTCAT AG AAGATG AA G AAAT AAG TCTG ACAACACT AAAG AA 
ATTTCTTAATATACTATTATTAGAATAGTAGAGGATAAAAGATAATGTTATCCAGTTACAAAGG 
AATTTGGATGATAATTATAACTTAATGTTAGAGATGGTAAAGCTGCACTTACTTGAACAGCCC 
AAAAGGATTAAAAATCTATTTGCAGAGGAATGTTTTGCTATGTAGGGAAATTTTGAAGTAGTTT 
TTGCT AAAAAATT ATTTCT AG AGTT GTGTTATTCTT AGCAGTAATCTTGGCTAGAAATGTTTTT 
GTGATTTATGCGATTAAAGGAAATGATAAAGATAAAACCATATTATTAAATACCTATATATCAC 
AGCTAAGTAGGAGAAAGGGAGACCTTTTT ATTTrGTTCAGGTAGGAGTATTTCTTGTCTATTT 
TAGAAACAATTATTTTTCCCATTATTTCTTATGAGATTAGTTGTTTC^AGTA 
GTTCTG H I m i I GGAAATGTAGGTTTTAGATGTATCTTrCAGTCAGCTCTCAGTCACTAGAC 
TATTTCATATTTTTTGAAATGTTTTTGAAAATTATGACTAGCCCATTTTT^ 
AT ACTTATCXTTCTGAACACAT AAAT AAGTGAATAAGATAAAAATATCTTT ACCTT ATT AATATTA 
TACATACTTGAAAAAAATTTCATTTAGAAACCTAGTTCAGAATCTGAGCAAAT^ 
CT CT GTAATAAATGGTATTGATAATCAGAGAATGT ATTTCAGAATACT ACAGT ACAAT ATATT A 
TGAAGCATGACCACTTTATTTTGAAACTT AGCAATTGTATTGCT GGGGTTTATTGTATCTGTA 
GCATGTCACTGATTATTrCAGTrAGTTTT ATAATGATTTTrAAAAAACATATCTATTTGGAATAA 
GATACAGCAAGAATCATTGCTACTGACTTGTTCAACCCCTT AGTTACACTGTATGATCCAGAT 
ATAACAAGATACAGTGGAATGGCCCATACAGTATATTACT 

TAATTTTTCCC C t I IGI I C I 1 1 1 ICGCGGGACAGGTGTTCTGQGGCX5GCGCCCCCAATTCCT 
GGCCTGTTTT ACACAACTCCCCCGX^CGGTTGAAGGCCCCTTGAGCCGTGAAAGCGCTGGA 
GNN 
>1368 

ACATATGATGGQGCCAATGCACAATACTTTTATCACAATCAA C I M MC I 1 1 GTATCC 
CT ATTTCAATGAGCAGTCAGTCT CAAGAGGTTACT GCATTTCAGTTCTTT CT GGACATGGGT A 
CTrGTGArCACACTACGOGAATCTCTGTGGT ATATACCTGGGGCCATTCTAGGCTCTTTCAA 
GTGXCT 1 1 f GGAAATCAACC 1 1 1 M I ATTTGGGGGGGAGGATGGGGAAAAGAGCTGAGAGTT 
TATGCTGAAATGGATTTATAGAATTTTTGGAAATCT ATTTTTAGTG I I IGI I CGI T 1 1 1 1 1 AACT 
GTTCATTCCTTTGTGCAGAGTGTAT ATCTCT GCCTGGGCAAGAGTGTGGAGGT GCCGAGG T 
GTCTTCATTCTCT CGCACATTrCCACAGCACCTGCT AAGTTTGT ATTT AATGG I 1 1 1 I GI 1 1 1 I 
CM I I IGI I IG I I IC I > bCTCTTTATTAAACCCC I ICCCCAGI I II 1 1 1 1 1 1 A TACTTT AAACCC 
CGCT CCTCATGGCCTT GGCCCTTrCTGAAGCTGCTTCCTCTT AT AAAATAGCTTTTGCCGAAA 
CAT AG I II I M 1 1 1 AGCAGATOCCAAAATATAATGAAQGGGATGGTGQGATATTTGTGTCTGT 
GTTCTT ATAATATATTATTATTC I IC C 1 1 GGTTCT AGAAAAAT AGAT AAAT AT A I 1 1 1 II fC AGG 
A^TAGTGT GGTGTT T CC^G^ 

AGTT GAGGT GGATGGTTCTACCCTTTCTGCCTT CTlii I iGGGACCCAGCTGGTGl IL.I J iGG 
GTTGC I I IC I I CAGGCTCT AGGGCTGT GCTATCCAAT ACAGT AACCACATGCGGCTGTTTAA 
AGTT GGGCCAATTTAAATCACAT AAGATTTACCATTCCTTCCT CAGTTGCACT AACCACGTTT 
CTAGAGGCGTCACT GTGTG TGGGTCCAGGGCT ACTGTACTGACAGCGAGAGCATGTCCTCC 
GGTTGGACAGCACT ATTCTAGAGAACTATGCTGGCGTGTCCAGTC^CAGCXn^CAGCnGTGC 
tgggacgaccc1 tgtci ccctgggtaggaggtgggggaatoganncggggmgtgnttgtg

tntggtgttgtcagaggatttamaa^tattattgtaatacagaactatgtacattntatt 

act ataccgtattatcrtacaaatatgattttggaagcatgataaatgtati^aatgtagccaac 

atgtgcaa^gttacatgaggggccagaggcactcatgggaaaatcggltatgggttcttt 

gtac acacccaagacccatgtggaggcatctgattatgaggccacgctcatattaagggga 

ttttacccacatgatgagctggctgccgattctcagttgaaatatggctctgcgogtattgg 

cgatcatttctacggcctttaccctccgcatttacctgt^ 

gtccn 

>1369 

knn nngg tttaggcataaagtttattcatccaaaaataaagatccccatatttccatc 
tact atgagttttccatcatgctt agtattaaccaataatttatactccactactgatacttgc 
taattgaattacacttctgatgatcai rTTTTTm i caaatgacatagtgagagttatgatcg 

GTTACrrGGGTTGTAAGGATTGAAAACATACCTAGATCATATAAATTTGTGAAGGTTTTGCCA 

TC^CAAGCATTATAGGGAATAATGAAGATCAACTATCCTACAGCTAAACCTAATGAAGAC^^ 

ATTGCCTa^GGTCAAAACAATAmCTTTGTGCTCAAAAGTGGTrCACATAATTGATGCTG 

CATTGACGCrGTCTATAGAGATTCTAGTTTTCTCCACATTrTCT 

TTTCACTGGGGTCTATTGTTCTTTAACAGAAGTAATGGCCTTrTC 

TTGT AAGCATTCTTTCCAAG CT AG CTG CCCATCAT GACGTTTTGG CTAGTCCCAGTCTCTG CT 
CATAGAACACTTGTCCACACTCTAAATTCTCTCTGTTCTCCATAGCCC 

CT AT AGTCTTTTAACTCAAAGCCTTCAACTTATGTACAGCTTTCT CTGCCTCACGTTTCAAGCT 
TAATGCATGATCTTAATTGATCTTTCGACATCrrATrTCTACTACATGCTGCTCTC M l C I C I A TC 
TTATATCTCOCAGAATATGTTTTATTTCAACAAATCGCTAATCTGT^^ 

CAAAATX^TAAGCCCTGCATGTAGCAAAGTTCCTGCCTTCACTGGATATGCATTAACAGCTCT 

GATTAGTCCACTT AAAAACCATTGTTCCTGTCGTGCAGAACTCCAN 

>1370

>1371 

ACTGTOGTTTC C TTCCT ACCTCGTCCTCACCCCATO 
TGAAC CTTA ui 1 1 1 1 iCCtA»nClCCTCAAGGCAGTTTGAACGACACAGGTTTGGAAGGAATA 
GTTAACTCTCCAGTATTATTGGAACATCTGGACACCACCAACAAAAAATCTT AGAAAAGGGTC 
ATTTAAGGCCT ATAAAAAGTGCCACCTTTCCCAGAATTAATTCAGAGAGAAAAATCTTATCTG 
CCTCCTGGCAQCTACAGCGCAGAAAGN 
>1372 

>1373

>1374

NNAAGACCTGTAGAC TCAAG CACTTTCTCTG^ ATCTTATCTTGGACACTT - 

atgctgaatt atcttcttattattn"cajctggtt ataagct attccttgt acataat attt aaac 
tatccaaatatccagcatacatttccatgtggctg<x^tcggctgttgatgccatgaatgaaa 

TG GCTGG TTAGAAAGCCAAAGG1CI IC1 1 1 1 1 1 CAATTCCTAATGAAT AAGTAAAATGCCAGA 
TCTCTTTGGT CT CAAAGCAAATTGCTCTGATCAGATCAAAT ATCT AT GT AATGAAAAGAGACT 
TACTT AAATTTGGGATTCTGCTTAAACTTT AAGTTGAATrrGACTGTAGTCATTCATTCTTTTTA 
TATATGTCTGTCAGGTATGACATTAATTACC 

TCACTTTCAGTTAAAl TTCIGI IGnATTGTGG CI H H 1 1 1 II 1 1 1 TIT l A ACCCCACTGAAGA 
TGGAAGTAGAGCAGTCTGCCATCTCCTG CT G I II I I IC IU IG TGOCCCCTAATTGCTTGGTA 
ATCTA AAATTA TTCTGACT TTCCA CATACTGTCTTT AAAAACCAATTGTAGCAAT C TGCCC G OG 
TCCACCCTTT AAGCT AAATCTTT CACTGTGCTGGAT ACATAGACACTCGATAAATGTGTGATG 
ATAGTTCTAAAATAATTATGCTATATATCAGGGTTCTGTATTTTTGAGCAATATC^ 
TGCTGTGATGTTAAATGGAATTTATTTTACACT AGTAATAGACTGCTTAAGTGGCTGATTTGAA 

ACCH I CI CI aaaaaaacaaaaacatctggcagatgaqtgacacaaattcattctctt aaagg 
ata gaatttcaagttaaggataagg cagaataaataggc^ 

TTTGGl 1 1 IGGI 1 1IUI 11 1 M I i i gtctc tctgactagctaaactgccaactagaagaatggt 
tgttatgtagtaactcagaagaatcttttaataag i i ii i rci 1 1 ror I ic aaaacaaacttga . 

GC TTCAI 1 1 CI rGOCACCTCTATTCI ITC1 IUXACX5TTGGGTGACTGAACTGATGCTTATTC 
TAGTTTTAGAAGAGGATCCACATTT AATGT AAGGCAT AAAT ATG CCTTTT GCAAATT ATTGCTT 
GTTCT CCAGTGAACATTTGATCATATTCAACAGAAATCCACCATATAAACCATAATCX2^ GAAT 
TGCCCCTTTT AAAAT AGATTGGTT AAAACCAAAAACCT AT AAGAGAGAAATGAT ATTTCATTGT 
ATTGTAGGTTTGGGAATCCT GGTT AGATTTATAATTTCTGGCCGGGTGCGGTG GCTGATGCC 






tgtattcccagtactttgggagcccaaggtgggtggatcacctgaggtaaggagtttgaga 

CCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAAGTTAGCTGGGCAT 

ggtggtgcgcgcctgtggtc<x^gctactcgggaggctgaggtgggagaatcccttgaac 
ccaggaggcacaagttgcagtgagctgagjvtcatgccattgcactccagcctgggc^ 
>1375 tgcggccgctagactagtcn 

acgcgtaK^T<>GCC^TGGCGCGC<^CGaXCCAC^ 
TTCCGtoc^^TTGATATGATTGGCCGGCGAATCGtjignCTCTTTTCCT 

gatagatcgccatcatgaaggacaccgtaactatccgcactagaaagttcatgacgaaccga 
ct acttgagaggaaacaaatggtcattgatgtccttc^ccccgggaaggcgacagtgcctaa 
gacagaaattcgggaaaaactagccaaaatgtacaagaccacacx:ggatgccatctttgtat 
ttggattcagaactcattttggtggtggcaagacaactggctttggcatgatttatgattccc 
tggattatgcaaagaaaaatgaacxx:aaacvvtagacttgcaagacatggcctgtatga<wvg 
aaaaagactttcaagaaagcaaccgaaaggaacgcaagaacagaatgaagaaagtcaggggg 
actgcaaagggccaatgttggtgctggcaaaaagaaatgaagtgtctagcagtgagctggag 

ATTGGATCACAGOCGAAGGAGT AAAGG TGCTGCAATG ATG TT AGCTGTG GCCACTGTG GAT 

TTTTCGCAAGaACattaaTaaaCtaaaaacttca 

>1376 

>1377 

. GCGTCCGAGGAAAAGGAGTCGT AT AGGGCT ATCAAT AGGT GCCTCCCACT AAATTTT 
CGGAAAGATGAATT AAAAGGCATTT AT AAAG ATCGAATGAAAATTGGAGCAAGCCTTG CCGA 

tgttgatccaatgcaactagattcttcagtacgattTgatagt^ttggtggc^tgtct AATCA 

TATAGCAGCTCTAAAAGAGATGGTGGTGTTTCCATTACTTTATCCAGAAGTCTTTGAAAAAT^ , 
TAAAATT(^ACCOCCAAGAGGTTGTTrGTTTTATGGCCCA^ 

TTGCCAGAGCACTTGCCAATGAGTGCAGTCAAGGGGATAAAAGAGTAGCATTTTTCATGAGG 
AAAGGTGCT GATTGTCTAAGTAAATGGGTAGGAGAATCT GAAAGACAGCTACGATTGCTGTT 
TGATCAGGCCT ATCAGATGCGCCCATCAATTA I I II 11 1 1 GACGAAATTGATGG TCTGGCTCC 
AGTACGGTCAAGCAGGCAAGATCAGATTCACAGTTCT ATTGTTTCCACCCTGCTAGCTCTT AT 
GGATGGATTGGACAGCAGAGGGGAAATTGTGGTCATTGGTGCTACGAACAGGCT AGATTCT 
ATAGATCCTGCTTTACGAAGGCCrrGGTCGCTTTGATAGAGAATTCXJTCTTT AGCCTGCCTGA 
TAAAGAGGCrCGAAAAGAGATTGTAAAGATTCACACCAGGGATTGQAATCCGAAACCACTGG 
ACACATTTrTAGAAGAGCTAGCAGAAAACTGTGTTGGATACTGTGGAGCAGATATTAAATCAA 
TATGTGCTGAAGCTGCTTT ATGTGCTTTACGACGACGCTACCCACAGATCT ATACCACT AGT 
GAGAAACTGCAGTTGG A I C TCTCn CAATTAATATCTCAGCT AAGGATTTCGAGGTAGCT ATG 
CAAAAGATGATAOMGCCTCCCAAAGAGCTGTGAX^TCA^ 

TTGTGAAACCACTCCTGCAAAACACTGTTG^CAAGATTTT AGAAGCC^ ATTTC 
CACATGCAfiAATTCAGAACAAATAAAACATTA^ 

GTGACTTGGCTTACAGTQATGATGATGTTCGATCAGTTTATGAAAATGGA^ 
CTTCTC ^TAA GGCAAAAGACAATTTTAA MUCH CATTTGAATAGAAATGCTTGTTACCAACC 
TATGTCTTTrCGACCAAGAATATTGATAGTAGX5AGAACCAGGATTTGGG<^ 
GGCACCAGCTGTCATTCATGCTT^ 

1 1 1 1 GGAGTTAGT ACTACATCCCCTGAAGAAACATGTGCCCAGGTGATTCGTGAAGCT AAGA 
GAACAGCACCAAGTATAGTGTAI GTTCC1 (JATATCCACGTGTGGTGGGAAATAGTTGGtACCG 
ACACTT AAAGCCACATTTACCACATTATT ACAGAAT ATTCCTTC A ITTOCI CC AGTTTT ACTAC 
TTGCAACTTCTGACAAACCCCATTCOQCTTTGCC^ 

GATTATGGAGAGATTTTTAATGTCCAGTTACCGGATAAAGAAGAACGGACAAA^ 

GATTT AATTCTAAAACAAGCTGCTAAGGCT C<n"ATATCAAAAAAGAAAGCAGTTTTGCAGGCT 

TTGGAGGTACTCCCAGTAGCACCACCACGTGAGCCAAGATCACTGACAGCAGAAGAAGTGA 

AACGACTAGAAGAACAAGAAGAAGATAC^TTTAGAGAACTGAGGATTTTC^ AAGAAATGTTA 

CAGATAGGCTTGCT ATTGACAAGCGATTCCGAG TGTTTACTAAGCCT GTTGACCCTGATGAG 

GTTCCTGATTATGTCACTGTAATAAAGCAACCAATGGACCTTTC^TCTGTAATCAGTA 

GATCTAGACAAGTATCTGACTGTGAAAGACT ATTTGAGAGATATTGATCTAATCTGTAGTAAT 

GCCTT agaatacaatccagatagagatcctggagatcgtcttattaggcatagagcct GTG c 

TTTAAGAGATACTGCCTATGCCATAATTAAAGAAGAACTTGATGA . 
TGAAGAAATTCAG GAATCT AGAAAGAAAAG AGGTTGT AGGTTCCTCX^AAT atgcgccgtctt 
ACT ACCATGTG ATGCCAAAGCAAAATTCCAjCTCTTGTTGGTGATAAAAGATCAGACCCAGAG 
CAGAATGAAAAGCTAAAGACACCGAGTACTCCTGTGGCTTGCAGCACTCXrrGCTCAGTTGAA

gaggaaaattcgcaaaaagtcaaactggtacttaggcaccataaaaaagcgaaggaagattt 

cacaggcaaaggatgatagccagaatgccatagatcjvc^aattgagagtgatacagaggaa 

actcaagac^caagtgtagatcataatgagacc«gaaacacaggagagtcttcggtggaag 

aaaatgaaaaacagcaaaatgcctctgaaagcaaactggaattgagaaataattcaaatactt 

gtaatatagagaatgagcttgaagactctaggaagactac^gcatgtac^g^ttgagagac 

aagattgct^gtaatggagatgcttctagctctcagataatacatatttctgatgaaaatgaa 

ggaaaagaaatgtgtgttctgcgaatgactcgagctagacgttcccaggtagaacagcagc 

agctcatcactgttgaaaaggctttggcaattctttctcagc^ 

atcatgagxx^ttaaaaaatcttttgaagactgttgttaaaaaaagtcaaaact 

tcagttggaaaatttgtatgc^gtaatcagccaatgtatttatc^catcgcaaggaxx^tga 

TAAAA£^TCACTTATTC^GAAAATGGAGC^GAGGTAGAAAACTTCAGTTGTTCC^ATGATG 

ATGTCATGGTATCGAGTATrCTTrATATTCAGTTCCTATTTAAGTCATTT^ 

TAATTGATGTAGTATGAAACCCTGCATCTTTA^ 

TTAAAC TrTCC TG^TATrrATGTACATATTAAGATAAATGTCATGTGTAAGATAACTG^TAAATA 
TTGGAACTTTG CTAGAA CAAGACCCT GTAG TAATAGTAATAATAGTTTGAAGTTTG 
TTAATAAAGTTATTTTG GTAACTAATG TTTTATGGCACTTAAGAATAATTAGCAGCGTTA 
TTGTTTGTATT AAGCACTTTTAATTTTATCCTTCCT AAAAATAGTTTATTGT ATCTGACAAGAAA 
CTT ACTT AACCATTGTGTCCTTCCCATCTTTTTTG TGATCTTTGTTTTCTTCAAATGCCCT CCT 
CCCATCTGCCTTGAGATTCCC I TOTCl I CACTTAAAAGCCAGAGTGCAAGTCATGATTTGCG 

TTTTTGAGACAAACATTTTTATAATTCTAATTTGGGTTAATAAA 

TACTT TTGTAAT TATATACACAACAAATGT ATTAATAACTACCTTGTTAAACACCTTTTAATAGC 
AC AAGGTTT TT AT ATT TGCAA GCTGTTGATA I C 1 1 ICf AAAACTCTTTAGGTTATAGTCTATTG 
AT AC 1 1 1 1 lATATACAATTTTATAAATATAAATATTATAArrTTATATTAATGGTACCAAAAATAC 
A 1 1 1 CU AAGOTT AAAAGCATGCACTTCCATGCATACTTGCTTTTGGGGAGAGTGGGGAGAA 
GAGATTCT AATAATCAGTTTGTGAAATAGCTTCTGTTGGAAACCTTTTG^GGGGAATAAGGAA 
TGGTCATCTAAAATGAGAGATTCTGGATTTTAATGCAGTTCAAAGTTGAGCT GT ATTTTTGTT 
GTTGATTTATCTGGA I HUM I AAAGCCTTCTAAAACCCAGTGAATTCAATACCTTAATTAGT , 
ACATACTATCTTATGTAATG^TAAAGCAATGCCAGTCACTGAGAACAT^ 
TTCCTGGAGAT ACACATT CT CATTTTTGTT GGTTTATT ATAAATT ATTCTTCTAGATGCATCTTT 
TATAACT AGGATTT CATTTTGTGTGTATAGCTTATGTAAT AAATTTT AAAGGT GAAAACTCTCT 
TAAAt 1 Uif I r CTATTTCTTAAGTATTTTAAAACATAGCAAAGATAATTCT GTGCAAAAGAAGT 
GTAAAGTCTGTTCTTTCAT ATATTACTGT GAAAAATTT AAAGTGCTGTT GACACT GGAATGTTT 
.TTATATGGTAhMNN 
>1378 

KKGTGCGATCTCGGTTACGTGCG I 1 1 I U I CGGAATTTTATTTCCGCTCCCTT ATTTG 
GGGGN OCCCGCGCGATTTCCCCACCGTGGGGACGAAAAAAAC^ 

ACCTT ATGACGCAATGGCGCAATTCCAAACGCCCG 1 1 1 I MC CGCOCGCreOCCCG<XS C CAT. 
TAGATAGOGCCTCGGGA I I I 1 1 1 1 1 1 H 1 1 1 1 1 1 1 1 1 1 I I I GGT AAAAAAAAATT ACTGTTTATT 
AAAGT ACAACCATAGAGGATGGTCTT ACA3GCAGGCAGTATCCTG1 1 1G AGGAAAGCAAGAAT 
CAGAGAAGGAACATACCOCTT ACAAATGAAAAATTCCACTCAAAATAGGGACT ATCT ATCTTA 
ATACTAAGGAACCAACAA I CI ICCIU I 1 1 AAAAAACCACATGGCACAGAGATTCTGAACTAAA 
GTSCI »jCACTCAAATGATGGGAAGTCCGGCCQCAGT ACACAGGGGCTT GACTTTTTGAACTT 
CGI I ICCI I IUI I GGAGTCAAAAAGAAGCAG 11 G 1 GG I I CT AAAAGGTGTGAAGGTGATTTAA 
GGGCCCAGGTCAGCCACTUN IU1 1 1 ACAAAATCAGGT AACTAACTGCAT ACA C U 1 1 I CIC I 
TTOCATGACATCAAGACTTTGCTAAAGACATGAAGCCACGGGTGCCAGAAGCT ACT GCGATG 
CCCCGGGAGTTACCCCCCTGCTAATAGCTGT AAACTTCCAATTTCTAGCCATACGCTCAGCT 

catccatgcctcagaagtgcatctggagagaacaggtttctaag 

agttggacttttt AAAAATTCAGCAAAGTGGTTCC C 1 c I c m agggacagtcaaaaccaagtc 

A CTT AGG T AGT ACCAAAATAAATAAGGAAAAGCTTAGCTTT AGAAACAGTGCAAjCACTGGTCT 
gctgttccagtggt AAGCT ATGTCCCAGGAATCAGTTTAAAAGCACGACAGTGGATGCTGG 
GTCCATATCACACACATTGCTGTGAACAGQAAACI 0 CTG1 G ACCACAACATGAGQCCACTGG . 
AGACGCATATGAGTAAGGGCACTGACGGACTCATGA 1 1 II I ICI I ACCAGATGCI I fOCTGT 
TCTTTAAGAGTTTAAAATCATCAGAAAG«AAAAACAAACT 

TACCACGCT AGGGCT GGCTCAATTGAAAGTGGGCAAAAGCTT ACAAAT ACT AAAAAGAAGTG 
CTGCCGCGCAGTGTGGAGGCCACT GTTTGGAAATAAATCTTCCT AACACTAGGACTTTTCAT .
GTTTTGGAGTGGACT I T T AAAAAAGTTTTCCAACCACAATGGAAACAGGTTGAACCACACTG C 
GCCTACACATCCATCCATACGTCAGCCATTATAATCAGAT^ 

TTCCCAAAGGAGCTGTATGCAGTCATCACCCGGAACT AATGGGAAAGTGTTCTTAAGCAGCA 

AGTGACTCTGGACCACCTCTCTCACACACTGACTCTTCCACAGCAT<5^^ 

TCACTTGAATTCATACAGTAGGCCACCATTTTATAACAGGTCTT AGTGTAAATTCTTCCAGAC 

CAAGTCTGAGATAGTCTTGGTTGT AAAGCAAGACCGTCACCCCTTCTCCCAAAGCCAACATT 

ATAAAAAGATTCCATTGTCTCAAAGACAAATGTTCCAGTCTTGAGCACTT 

ACT CGGGCATATCCT GGCAGTTGGTGGATTTTGCCCTGTAACACAATCCTTGTGGATATGAT 

TGAAAGGTGACTTT aattcagagaatgtgttttcctttctgttttaaaaatatttgaattaagt 
gatggaaattcttttgttaaacatcaaaagcttrctcctrg^ 

aagtcaggtgggacatttggtggattcagaaggtgaaatcattcct gacct ct cct cttagg 
accc agcattaactg ctgcg atactcaaggcttccttaaaggatca 

TTTTACTGTCCAGCAACTTT ATTAGTTGTGCTAGCTGAGAATTGGACCATAGAGGCCTTTGAC 
CCTGC<^TCTMTMTCAAACNAANNNCNNCCCCCC^ 

NNNNNNNNNNNNNNNNNNNW<NNNNN^ . 

NNNNNN 

>1379 

>1380

ACAGTAAt i i IGGAAAGCTCTITGATGTCTGGCTTATAGAAGACACCTGGGTTCTTAT 

ATCTGCT t c T GAATTCGATCT attgt aatgtgttattttggctgaagtatgtgaagaaaatact . 

ACCTT ACAAAGATATGTATTTTCAAAAGGAAATACATATCAGAAAGTTTGA^ 
GTGATACT AAGGTTGTCAGGATGGATGATGTCTATCTGGAGAGCTGGCAGGAAACAACCAA 
GCCOCTT GAGC GCTCCTTGCAGAGC AAGAAC T ATGATTGAT G 1 1 1 I CI I ACT CACTTGT AAAT 
TGCGTGATTTTAGCTCT ATTTCAATAACTTTTTCAGAAGACTGT AGAT ATT C11 C T T T GAT ACT 
ACAT CAGAACTTTGCAAGTAACTTCAAGTAGTAATTTOGTAATATTCTTTTT AATAATGTGT AC 
TTTTCCTATCCT ATTA CAGTAAAAGCCCTTTATATAATATCCTTGT ACCTTGGCTGGATCTTCT 
ACTT ATTCTT GATTTTG TAACATC^TCATTGGTCATCTGGAAAATATTGGTTCACTGAGTTATG 
AAGCTTCCAAATGTTGACAC GTGTC ATTATATAATATTrTTAAAAT 

ACCAATCTCATCAGATCAGTCTTTT AAGTGT CAAGGGAACCTGTCAAGTTCACAGTTTGCCAA 
AATTCT AATTTTCTTT CATCT GGAAGCTCXAGTTAT AT ATTGGCAATGAATACTGGGAGTTGTT 
TTCCTAGAAG TGACA GGCCCACCCACATCTGAATAACCAGCT AGTTCACTTTGCCATTCAGT 
CACACAAGT ACTTTTCCTTGAGGCXTTCX^TCAT AATCTCAAGT ACCAGA^ GCTTTGTATGTA 
TGTCCCGTTTATACACATACATT ATT AAAAAGACTGATGCT CAAGGGTAAAAATTT AACAAAAT 
TAATAATTTGCACT 
>1381 

NNNNCGCGGCCCCGTCGACACGGCTGCGAGAAGACG^ 
GGCGCTGCCX5GGTGAAATCGTAG<5ACAGTGAAGATGCT 

AGGAACACCTGGCCn CCI GOCTCAAGTOGACAAOGCGGTTOTOCKX^AflTITBGGOQGA 
TTGCTGTGGAATTCCTGAGACGCGGCGCAAACCCAAAAATCT ACGAAGGOGCCGCCAGAAA 
ACTCAATGTGAGTAGTGACACTTJTCCAGCATGGTGT 

AGAGCTCAAAGCTCAT GATTTC TGAACTGGATTTCCAAGA C I C I m I I 1 1 G I I C I G GGATTCT 
CTGAAGAATT AAACAAA I Hi 1 1 GCTTGAGCTTTATCTGGACAACAGAAAAGAGATCAGAACGA 
TTCTGAGTGAATTGGCACCAAGtX' 1 1 LXXAGTTATCATAAOCTITGAATGGCGACTAGATGTA 
CAGCTT GCAAGT AGAAGTCTCAGGCAACAGATT AAACCAGCAGTGACT 
TAATCAAAATGGAGATCACAACACCAAAGTTCTGCAGACAGACCCAGOCA^ 
TGGTTCAAGAACTGGAACAAGCATTGGAAGAGATGAAGACAAACCACT 
CGCAACATCAAGTAGTACCAGTTTTAAGGTTTTAATTCATT^ 

ATAGAGCAATTAC 1 1 1 1 CAAAATTAATTTTTTATTAATTCATGATGATAAATACATAGTATTCCT 
CAGTATCTATTOCAAOATACTGACGTC 

CCAGTTA1 CGCAGCAC IT 1 GGaAG^30GA«CT^^X! : A*ATT^T 1 Ti AfiGTCAGGAGATTCAGA 
CCTTOCTBGCTAACATGGTGAAACCCCATCTCT 
GTGGCACGCATCTATCAGAGTCCCAGCTACTCA 
CCTG^AGGTGjGAGGTTGCACT^^ 

AGAGTGAGACTCCATCTCAAAAATAATAATAATAATAAT AAAGT AAAAATAAAAATAAAAAAGT 
AATCAGAAGCT AAAGT AAA G 1 f C Cl I rCCTGGTGCTAACTGTGGTCI ICI 1 GACACATT AAGA 
TGTATTTTGTATTrrAAGAGTCTCATGCTCTAOCGTTGGAACTAGCCAGATGGCCATTATrTT
GTATTTTAAATACATAAATAGGATTGAATCAACTAGAAATGAATCTATATGTTCTGTATATATG 
AATGACTAT C I I U I I 1 1 IGCTA C I 1 CI I 1 1 GACTGCTTAATTTTATTATTTTCATCTTT ATTGATC 
AAATTTGAAAATAAAATTCACAATGTAATACTACTACTATGCAGAATTTTCTAAACAGTTCAGT 
ATTTTTGACTTTT AAAAAGACCCCACAGTGTTAATAGCCACAGAATATTGAACATCAATAGGAT 
TTTTAATGCTATATTGTTAT AGGCAGTTTATTCA 1 1 1 MCI 1 1 GTATATGAAGATGATAAGTATC 
ATATTGCCTAAGTTTGAGTGATCATGGTTAATTAATTGGCTTAAATAGTACTCAAA7TTGTGTG 
GTCGTATATTGTATTTTATCAAGGACTAATTCTTCCACCATACCCAAAGCATCTAGGAGACAC 
TCTGTCATTTACATTTACAAATAATGGATGCAGAGAAATATAATCAATTCTTGATTGTCCTGGA 
TACAAATTATAGTCACTTT7TATAATTTGAATAAACTATGATGAGGGAAGGAAGGCCATGTTTA 
TTACATTATAGGCAACTATCCCTTCTCAAAACATAACTTGATTTATG7TTTGTTTTA 1 1 1 1 1 JAG 
TAGGGAACCAAAAGCrmGGTCTGAAAAACTCAGGTGCTGATTTGTCAGGTCAGCCTGAGA 
GAGTGCAGAGATACTGAGGTCAGCTTAGGGAGAAGAATTGTATGGAAGGGTGGAAATGAGG 
GTTCAGTCTTAACTATGTGTCCAAGACCTrATCCTGAGAGTCTTCATGGAATAATCTAGAAAG 
GGCTGTAGAAGCAATGAAACTCTAAACTGCGCTCT AGACTTTGTCAATACTTTGCTTCGTAAG 
AATTAG GGATTTTCAT ATCAG AGTGAT ACCATGTTGCAATCT CTTT AG AG AAGAGAG AGCCT A 

gcagggaataggctagtggaagaaaaacatacttgagatggttacccattggtccctgcag 

GCATCTGGGCCACCCTTTTCCTGTAGGTATATAT7CTGCCGTTGTGAAACAACTATATTATTA 
GATGTCAGCAAACTATAATCAGAAATAATTTTTAATTATAGTGGCAACTTGATTAATGAAGCTC 
TCATATACAACCTCACAAACAAAAAGAAACTTTATTACGTGAAATAGTAGCATATGTCATTGG 
GTTTAAAAGCAACATGCCTGCAAAACATATGATAAAAAATACCTCTGTAAGGGTTTTTATTTGT 
CCACCTTOGATGGAAGGT AAACATG G CTG ATCCGAG CCAAG ATG AAACAAT AG CCCTTTG TC 
TCCTCAGAAGAACCT ATCTCGAACTCTTTGTCCACTGTTCTTACCCGAATTAAAACAAGAAAG 
CATGTGCTAATCCI I 1 1 IIAAI I II I ATCAACCTGGCTTGGGCTCCTTTCACTGTACTCCATTA 
TTCCTTCTQTGATGTTCT ACC I ICI iCACCI 1 1 I GAGACCAGGGTGAGTTGGGAGCAGTTTG 
CCTACUI ICU 1 1 AGAATCTGGGGCTT ATCTTTT AGTACAAGCCATTGAATAAGCCTCTTCCTT 
I 1 1 I 1 1 GCTCAAACATTCCACATCCTTGTGGATTCCCCTGCATTG II I U I M I ATATAACATTT 
GATATTTGTTGTAGCTTGT ATATGAACATA A 1 H I UI I I A G AGGT AGTCACTGTTCTCTCCAGT 
ATGACOCAGGI MC I 1 G ACTCTGAGTAATGCACCTTCTATAACTATCTAAATTTCTACTGAAG 

cum i ggattatgagtatgctgac i 1 1 1 caogattggctggtgcatgtttagacttaaatgt 
catatccttcatgtctcaaagccaaaatagtaacatctcatctcagaacagagctgtgaccac 
atgccaatatatgtgtcacaaagtctacatatgttacattccttggaagtctccttaaatgttt 
cacaaaatgtcaacaagc i i gi m i gttattgatatttccgagattgqgcacattt aagagag 
taaacgggaaaggtggtgaagatgctataagaagatgctgtatcttgagaattgaaaaatga 

GAATCTGACATGGTTTGAAAAATCATGAAGGGT ATATATAAAGGATGCATGTGTAGGAGCCA 
TTAAAA7TCATAAGAGTATGTGCCCCTTCAGCGTTTTAATCTT ATGAAGTGGTTAAGAGATAA 
GTCTTCGGAGTTGGAGAAAAGGATTTGAATTTAGGTTCTGTGGATTATTATGGGTCCTTGATA 
AATTCTCAAAGCCI I M M M I GTTCATCTCTAAAATAGGAATATAGGATTACCT ACCACATTG 
ACTCGTAAGGACTATATGAGACCATGTATATAGAATG7TTAACATAGT ACCAGGCTTTACAAA 
GAACTCAATAAATGTTTACCATTATTGNNNNI^N^ 



CCCGGGACGNt 



B^NNNNGGGGGGCCCTTAAATTTTTTA 



NCCGCTTTGATATGAATTATTATGCAAAGAATGACAGACATAGAGACTGGAAAGGCC 
AAAGGGAGATATGTCTGTCATTCTTTGCATAATAATTGATATCAAAGGGGAGAAACAG CTATT 
TTTTGTCXXn"AGAAGTTCTGCTCTTAAGGTTGAAAAGAAAGTTTG AAAAATGAT ATCATTTAGG 

ttagaagt actttgtgttgttggtatccaaaattaggactct gag attctt gtgtattcagag 
aatttttagtaggaaacaaggacaaatttgcatatgaaatgaaaatagttattacatgacaaaa 

TATGT AGATCTGATTTCTAGAAACTGAATTAGTCCAAAACAAGTAAGAGTGGGAAAAGCAGT A 
AAAA GMC I ICI TGAATATTGCTGTTGTCATCCAAAGT ATTCTT A f I I CI 1 1 I AQGTGAAAAATT 
TCCATTACICI 1 1 1 G I QATATTCTCAAAAGAAAQTTT AQGATTTTAGAGTGTTCTGAAATACTG 
AATCTTAATTCAGT ATTTCAATAGAGTATTATTGATTTGCTTCCTrATCAGTGGATTTTTAAATT 
ATTTATTTCT AGGCT ATTGAACCTTCT AAAATTTAATCCCAAGTAGGTTT AAAAGCCGGATATC 
TAG GACCAAAGGTTT AAAGACTC 1 1 1 1 1 C t GGGGCATGCTATTATTTAACAG I MM ICCTAAA 
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TTGGGTTTTGGTTTTGGGAAACATGAAATAATTGGTTCTTAAAGCAATCCTTAAGGTCTATTA 
GGAAGJTTTrCTAGGC^gA^^ 

>1384
KNWCGTCCGAGTGGTGCTGTTGCCGTGACTGTCTTCCTATTGGCGCCGTGCAGAG
AGGCGGAATGTTCAACTCCTAACTGCGGCGGAAACGTGGGAGCCGCGCGGGCCGCTGTCG 
TCCCAACCXXXXXXXSCXXnCGTGGCGCGCGGGGCCTCCGCGC^ 

GCCCCGCCCGGGAGCCAGATTTTGTGGAAGTATAATACTTTGTCATTATGAGATGTCGTCTC 
TCGGTGGCTCCTTTGTGCAAATTAAATTTGATGACTTGCAG I Mil I GAAAACTGCGGTGGAG 
G AAG TTTTGG GAGT GTTT AT CGAGCCAAATGG AT ATCACAGG ACAAGGAGGTG GCTGT AAA 
GAAGCTCCTCAAAATAGAGAAAGAGGCAGAAATACTCAGTGTCCTCAGTCACAGAAACATCA 
TCCAGTTTTATGGAGTAATTCTTGAACCTCCCAACTATGGCATTGTCACAGAATATGCTTCTC 
TGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAGGAGATGGATATGGATCACATTA 
TGACCTGGGCCACTGATGTAGCCAAAGGAATGCATTATTTACATATGGAGGCTCCTGTCAAG 
GTGATTCACAGAGACCTCAAGTCAAGAAAGGTTGTTATAGCTGCTGATGGAGTATTGAAGAT 
CTGTGACTTTGGTGCCTCTCGGTTCCATAACCATACAACACACATGTCCTTGGTTGGAACTTT 
CCCATGGATGGCTCCAGAAGTTATCCAGAGTCTCCCTGTGTCAGAAACTTGTGACACATATT 
CCT ATGGTGTG gttct CTGGG AG atgct aacaagggaggtccccttt aaaggtttggaagg 
attacaagtagcttggcttgtagtggaaaaaaacgag agatt aaccattccaagcagttg cc 
ccagaagttttgct gaactgtt acatcagtgttgg g aagctg atgccaag aaacggccatca 
ttcaagcaaatcatttcaatcctggagttccatgtcaaatgacacgagccttcctgacaagtgt 
aactcattcctacacaacaaggcggagtggaggtgcgaaattgaggcaactcttgagaggc 
t aaagaaactagag cgtgatctcagctttaagg agcaggag cttaaagaacgagaaagacgt 

agattggtgcatggacggaagacgatgtgtattgttgggttcagcagctcgtcagaaaagg 
tgactcttcagcagagatgagtgtatatgcaagcttgtttaaagaaaacaacattacagggaa 
gcgggtgctgctgctggaggaagaagacctgaaagacatgggcattgtctccaaggggca 
tatcattcacttcaagtcagccattgagaaattaaccc^^ 

CCACCACT AATT AAGGACT cag GAG G TGAACCT G AAG aaaat gaggaaaaaatagtg aacct 

ggaactggtttttggrrtttcacttgaaaccaggaactggcccacaggattgtaagtggaaaa 
tgtatatggagatggatggggatgaaattgcaataacctacataaaagatgtgacattcaaca 
ct aacct acctgatgcggagattttaaagatgacaaagccaccatttgtaatggagaagtgg 

ATTGTAGGAATAGCAAAAAGTC^GACTGTGGAGTGCACTGTCACATATGAGAGTGATGTTAG 
AACTCCAAAAAG CACT AAACATGTCCATTTGATTCAGTGGAGT AG AACAAAACCTCAGGATG 
AAGTGAAAGCAGTCCAACTTGCCAITCAGACATTATTCACCAATTCAG^ 
AGCAGGTCCGACTGAAGTGCTGATTGCCAGTGGTTAGATACTCTGAGGATGCGGCAGATTG 
CATGCAACA C I ICM I A CAGCGTTCCCAGAGCAATCCTATTCTGGGGTCACCGTTCTTCTCAC 
ACTTT GATGGCCAGGATrCCTACGCTGCTGCTGT GAGACGGOCCCAGGTGCCCATTAAGTA 

tcaacagattacacctgtgaaccagtcx^gaagctcgtctcctactcagtatggactgacca 

AAAACTTCTCTTCCCTACATCTCAACrrCTAGGGACAGTGGCnT^ 
ATCACTCAATTCTTCTCCTAGAGGAAGATACAGTGGAA^ 

GAGGAAGATACCCTGGAAAGTTCTACAGGGTTTCTCAGTCAGCACTCAATCCTCACCAGTC^ 
CCTGACTTCAAGAGAAGCCCCAGGGACCTCCACCAACCC 

TGCACCCT GAGACTGACTCAAGAGCCAGTGAAGAGGACAG CAAAGTCAGCG AAGGG GGCT 

G GACAAAAGTGGAATACCG gaaaaag ccccacaggccatctcccg ccaaaaccaat aaaga 

gagaggcagaggggaccaccgtggatggagaaacttttgatgaattgaacta^ 

ct aagcaggttaaaaaaaaaaaaaaaaaaagaaatgtaatggtttttgataatatgatcccttc 

agattgaattaacgaaaagacaacacttccagtttttggattggga 

actatagccaaaccagggccaaaattatggatattggt<^cccagt^ 

gaaaatcactacagatattttctgccttgagtgaacatttt^ 

TITTACCCTAACCACTGATATTCTG<rrTAGCAG 

caacaaaaaaatcaaatttttaggaaaagataagatgaatot 
aggctgcaatatggcctggcaaqgcactgttactgatcttgtctttaac^ 
ttcatc^taatttttgcatttai mill i aaatattgcatt aaaat atcatttagcttgattatcg 
agi i i i i ig gtttgagg1 mm igttg c1 i cm m i i cm i icm i ci i iccocct cm m i m ig 
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GATGTCCCCTTAAATTTTGTGCCCAAGGCAGGTACCTCACTCATCTCATCCTTGGCTCAGCC 
CTGCTGGTrAGTATTTAGTATTrATTTTAGTAAGATATTTGTGTCTGTATGATGGTCAGAGTTG 
AACTGATCTGGCTTGTCATTTTTCAGTAATAAAAAAAGTTACTGAATTTAATTGTTGAATATG^ 



TTTCTCTTCCTAAGTTCCCAGCGACTGCnrTTCAAATACTATTTTCTAAA 

AAAGAGGATAAAACAACACTCXIATAAAGGCCTCTTGGGATGTCAGAAATCTAAAATCT AAAAG 

AAAACAGACACAGAGCAAGACAATAACATCACAAGCTAAAAGCCAGAGAAATTTAAAATTACC 

AACATCCTTGTTGGAGTAAGACAGTAAAT ATCAG CCTTGCAGCAAGACAGCTCTGAGCAGCT 

GTGGGCAAAGAGGTAAACCAGTGGGGGTGCAAGGAGACTGTCTGCAGCTTAGGGCAGAAA 

TGGTGGGATCCAACTTGTGAAATGCTTCATGTTrTACAAACCAAAAAGTCAGGTAGCAACAAA 

CTrATTGTATGTCAAATCAATAAATGTTACTrrCAATATTCTGAGATTCTGTTCAGCATTCACT 

AACCCTGrrTAGCArrCACTAAGCCCTACATAGGATtCTCAGTGCTCTGN 

>1385 

>1386 

>1387 

>1388 

>1389 

>1390 

>1391 

nnnnngg ttgt gg gtggag ggtgttttaaataacagcttt acagagagatatcattc 
at aattcataaggttttaac 1 1 1 m mcm i m 1 1 aagacaaagttttaccttctgtcacattg a 
aaaatctcctatattcttgaagattctgagcaatacattcacgaccc 
tactattggagaaactxxrttcxxrgaagataaacacttciaagaatttgagaaaattaaaact aa 
aaccgaaa^acatgaacacaaaggcacaaaacattgcct aacattgcagaaaattactttaaat 
cctgatatgcttgtagaagaaatagi i i mci i g ttttgcttgcaaaactttgaggaatgcat 
gaacttcagaaggggaacaccatacaaatgagai iiiiii cactttagaggaatgagctaaaa 
gagaatatggtttagattaaagaattattttcttacagttgcagaagaattaagaagaacta 

N 

>1392 
>1393 

nnnnnnnnnnnnnaaaagtgattcaaac 
agaggataact gttac aggacacgctctcctaccataaacagctaga 
ttgaaataactgttttcagacattaaagaactggcagcacaggactgtgatctct 
ggaaatacacgaagtacaact gccctacatttct g cctaaag gcaatttccagact acacag 
acagagaggaaatgcacatagagcccaactgtctctgaaaagagacaagagaaatctaattt 



ttc^aaaatcatcatgagggaccccttgaatgtttgggagaatacaaatctgcttatgtatag 

ggtgaaagaccacagagctgatcaaagtgcaactatcaggatcaagaacaa^ 

ggtcaagtgtaagcnwnnnnn 

>1394
>1395 

f#wtcgagccccattcgcgagaag<xx3gccattgtgttggt acccgggaattcgcc 
aagtcgccgc^agcgcggaactggggttgcggcgtct aagtgtttccggtggattcccag 

GGACTGTCGGAGGTGTGGACTCTGCCTGCCTACCTGGTCTGGGAAGATGTTCT 
tgcctagaggacgaaatcctgctgcacccgogct ACTTCGGCCCCAACTTGCTGAACACGG 
tgaagcagaagctcttgaccgaggtggaggggacct gcacagggaagtatggcttt GTAAT 
tgctgtcaccaccattgacaatattggtgctggtgtgatccagcca 
ctttatccagttaagt acaaggccattgttttccggccattt aaaggggaggtcgtggatgc 
tgttgtcactcaggtcaacaaggttggactcttcacagaaattgggcccatg ca 
tctctc«acattccatcccttcagagatggagtttgatccta^ 
agacaatggatgaggatattgtg attcag caggacgatgagatccgctt aaagattgtggg 
gacccgtgtggacaagaatgacatttttgctattggctccct 
ttgtaagctgagcctggtggcctcct acccttggtcctactctaggaagtgtgattgtcaca 
ct7 atcat gttgtccagagg tccag tct g gctgct gttgt g gag gcaaggaaggcaact ca 

TtXX>GAAGGCATCTGGTGCTTCTTGTAGCTTAACTACTGOTCCT 
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TCTAAQTATAAAAAGTCCTrG GTTCT CAAAAAAAACAAAACAACAACACCACA 
>1396 

NGTACTCGGTGTGTAGTATTTAGTTCTGTGACTGTGTGGTCGTCTGCTCAGATAGTG 
TGCGATGTCGaGTGGCGGATCGCTTAGTGGGTTTCTCGCGCTGAGGGATGTATGGGCGTC 

gtgtcagt atgtgatcatg gactgttg atggtg tggttg acg nt acaacag ct ggn cg ggt 
atatcgattgctggctccctgggtgagaccgggaatgccctcaagccccatggggttgttc 
aaaaccagcccggtgcgccccggccg 1 1 ml 1 1 attgggtggaaacccacccctagttcgt 

GGCXX^GGTTCa^GGGCACGCGGGGCCCAGGGTTCACCACCCAGGGCCCGGCX^CC 
GAGGTTGTTAAAAGAui I I I I GGGGCCCCCAGGCCCCCAAAAAGTTG 1 1 1 i iGi « i I ( M M I 

1 1 GTTT AAAGGTTTCAGAAATTATTrATTGTAGTAGTACTTACACGTAAGATTTT AGCCT 

ATGGTCATTTTATAAAGATGACTGTTAGGATTTAATTO^TTTAAAGAAAATGAGATTCGTT^ 

TATTATGGTGTTrmATGACCTATAAAATACTTACCCCTACAAATTTCCATAAATGTAGTGGT 

T AGT AAAGCTTTTTT CTT ACT GAAAAAT AATG CCAGGTAACC AAGT ATT ATTCCTT CCATC ATT 

TATTTAGGAAAAAGTTTTATGTATTAGGGTAAAGTGGTAGAAGTTAACCTAGAATCTAATAATC 

TCCAATCACCCATTCXrrGATCTAATAGTAGCCATGAGAAAAAATCTCT AGAAAGAATCATACC 

TCT CAAAAAAT AAAAAAT AAAACAAAG G CTG GGT GC AGTGGCCAGCAT AGTATTTTT ATTT AT 

AACGTGGAAGCAATGAGGTGGGCAGATCGCTTGAGCCTAGGCATATCGCCGGAGGCGTGG 

GTCGN 

>1397 

>1398 

>1399 

NNNNNNNNCACAACCAGGACTGGCCCTTAACCAAATGGAGCCCCATTGGAAATAAA 
GCAAATCCCGCAAACGGGTACTC<XXX3AAAGCAGCAATACGGCC6CCTTCGGCCGGG 
GACTTTACAAAAATGAGGGCTTTTATTrrCCATTTGGAACGTGGGACAACAGAC 
AATTCCATTTTGCAAGTCTTrCCAAGGGAGAAGCTGTTCAACCACC 
GAGCCGACACTTrCCTTTGGTCTTTCTGAATC<3TAACTGCACTGCTrr 

G GGGGCCTTT ACAAGAAGACATTCCTGTCGGAGAG GAGGGT GG ACT TCGGAGAAATTCTCA 

TACT GAAGCATGAGCTTAGGaGTTTCT GTTAGTGGTAGTGGTGTTTTGGACACTTCATTCCTT 

GCAACACCGAGGTTTTGGGTGTTGACATAAAGTGGACCACACACCACATCT 

GACACI 1 1 M I I IG I I IGGTTGGTTTTGTTACATtJrrACATrATGCAGAACTATTTTTGTA^AAA 

TTGTTTAAAAGTTATTTATGCAAGGTTTGAATGCATACCAGTGTTTT^ 

CAATTTT CCT GATTTCCTT AAGGT AG GAGAGAATTT AACGTGT ACTTGATCGACACAACCCAT 
CT ACAAATGTGCCCAG ATCTAA CAAAGT AG G CT AAGACCTTCX^CTT AAAAGCATGTTT AACT 
GGAAGTTGAGAGTCTTGCTTTGTACCTCAAGAGTTACATGAGCATGTTGTGGATAAATGTAAAT 
TATAGTCAAAGTAAGATACTCTGCCAAGTTTCCTCTGTAGAGAATTCACT 
AAATTTCGACTrCAGCCTTTGCACTCAGGAGGTTCT 

T AGATCTAATrTATACAGTGAGTCAAGAGGTAGAATAAATGCTCCCACATAGCC 1 1 ICI M IG 
CI 1 1 1 GCTTCT CTCCTCTGAAGTGTGAGTTGAGTTCTCATTTAGG 1 1 PGTAAGATGGCTATTTC 
CT AGTTGTAAAGTTCTGCATTTATAAGTGCCATTGTTGT AAGGTGGTGTTTCCTAGACCTTCC 
CTGATGCGATTTTACCTTTGTTGAATTTGTATAAAX>ATTGTACAAAAAAAA 
TTTGXGGGTrTCTGTTCTAGGAGTGGACTAGAAGTTTAAG 
TTGAAGTCCAAACAATGTAATTTATGATTCATACGATGTTACCCT 

TTCTT AAAAT AGCAAGCAAGCTGTTTCT GGCTCGT AAAGGT AACGGAGAAGAGCTGCTCTTA 

CXACACATAATTAATTTCATGGCTACTTGCCATTACCANNN 

>1400 

>1401 

CCCTT AGGGTGGTCGCGGCCGAGGTACTCAATCAGATGTTAAATTCTTCAATGTAAA 
TGCTCTGTGATGCCATCX^ACCTCCTGTCTCCCCCACCCC^ 

CTGGGCACAGT AGTT ACACAATAAAOGCT AAAAfiCCTGATTTAACAACTGTATATAAACAAAC 
TA C1 1 U ATGTGACTACTATACCTCTGGGCATGGTATTAACCTATCC^ 



>1402 
>1403 
>1404 
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NNA" 



tTGGATTTATGCATAATTCCCCATTAOCCCCTT< 



1GGCAGGTA 



octggc^acagtaaatgctcaaggccctttgttattatttcagatggcc^agaataaatgttt 
ttcaaggatc trcttt ttgtagacaactgtgtagtcacagtttacagtcgtaaattatctggct 
ggcaagat acttttt aaaatt aaaatgt aaag aacctgag gggattcactcccaaatg ttt at 
ggacaaactgaaagggcatttacacagatattaccttctacatttatgtgagaaagtgct 

>1406 
>1407

gtttgaattat7taatttgatccatttatttaattaaaaaaaaaaggaaggggaaagag 
atcatggccaaaaaaatagtagttaaccgccaccccacccccaaagctctagccagtcatgt 
gagcatc^ccacatcccactcagtggctgatattcggatggtggcatactctgccccagg 

AGACTGCCTG^AGGCACGGGGCAATGGGTGCCAAT7TTAGCTCTCAGCAGGTTAGTCAACC 

agacaaactggtgggctaaagtccagaaattctttccaggttttctgctc^ttggctgagca 

catacaaactgtcataagc^tgtaaaatttaaggggagttggggtggggcgtaagagcaaaa 

ggacagcaggagaagagaaattac^gtcacccaagtttttcctgggctagtggctctgga 

tatagatttaaagagaggtcagagtaaatggagtccaggtttcttatcaaagaaaact atccc 

tcaatgaggagctgagatgtgccatgcaagagagttcttcctgcagaggcacaggagaaag 

ggcaggtgactctctcatgtggagagagtggcgaggaaggtcttctagtaccatgaagtaa 

gacaggcagagggagaatcctgaggtttgggccaaatgtg agactggtacacacacagtt a 

aag^ctaaaaagccatcaggaaccctcaaagccaaattctatctgcacactg 

gttcgcagcaagaatctcttgc^ccaagtcaagttagtggcctggctatgatggatggtgg 

ggagggcaatacattntattttcagagggatgaggtgggaagaacagccatgatctagtaa 

AAAGAGACCTGCAAGAAGCAGAAAATATTTAGAGAACATTTTAATATATATTTTCATATATATA 

tttaaagttaagaaaaataaaacrraattcaagccatgcccrrgtgcaaaaaaaaaaaaaaataaa 
gaaaaaagaaaaaaaaaggaaaagaaaacaaaaatttaaagtgagacgtttgctgctctggtc 
tcaatttaagaatcacagtcagcttgttacttttattttggaagaaaa 
tcaatcattcagaaggcaagtgtagccacttataaaaacagaatggcaggaacaaactagga 
aaggaaagtcagaagtaaagggcagagtgggaaaataattttcaaaaat aaaatt aacaagg 

TGACTGTTC CAGAA GAGGGCTGTGAAAAG GACAT GGTGGA<XX5AAGTCTGTTAGTCAAGTA 
ATGATTGAAC Mil AAATTATTCTCTTGTTC t T T I TTGTTGGGTGTTTTG7 TTGT 1 CAAGTCTAA 
GAT1TGGAAATGCTGACCCTTTGTTAAGAGCCAACAGGAGATATAGGATCCCTTCCCTCOCC 
CGGCCTGCCTOCGCTGAAGCCACCACCAGCGCCTCCTrGGCTGGATGCTGGAAGAGTGCT 
CCATGTGTAC GGACT C AGGAT GACAGGGCAGCCTCCTTCTGTGGTTGCTGGGCTrGTG 
GTTGCAGTATCTTTrGGCTTrCCACGTCTCTAAAATGTTTTTCAACT ATTTTGCGT ACATGGCT 
CAGTGCACTCCCCTCrrrrGCCTTTACAGTTTTCCACTTGATATGGGGGTGT AAT A C 
TTCCATGACTACGATGI III 1 1 1 CI 1 (aCCATTTACAGTCTTTAATGGTCTTGTGAATGGTCTGG 
AAGAGCTGCTGGOCCTCTAGAGAGACACCAGCACTGATTGCATAGGCC^ 

GGTCGACN 
>1408

ACGCGTCCGC GCTA ACCAG TCCCCCAGTTCAGTA GACTGGAGCtXAGAGCCTGCTT 
ACTTGTCAGGTGTTTATTTTGTCTTGCI I 1 1 1 1 1 I II 1 1 1 1 1 AAATGAAGTCAAAATGCCAATAA 
GACCAGATCTCGAGCAGTTGGAAAAATGCATTGATGATGCTTTAAGAAAAAA 
CI \ I GAAAACACTTTTGCAAATTGATATTTGTGAAGATGTX3AAGATTAAATGCAG CAAACAQ T 
TTTTCC ACAAG GTGGACAACGTTATATGCAGGGAACTT AATAAAGAGGATATCCACAATGTTT 
CAGCCATTTTGGTTTCTGTTGGAAGATGTGGCAAAAATATCAGTGTATTGGGGCAAGCTGGA 
CTTCTAACGATGATAAAACAAGGACTAATACAAAAGATGGTTGCCTG 

GACATT ATTCAGAGT CAAGGAAATTCAAAAGATGAAGCTGTTCT AAAT ATGAT AGAAGACTT A 
GTTGATCTTCTGCTGGTCATACATGATGTCAGTGATGAAGGTAAAAAACAAGTAGTGGAAAG 
TTTCGTACCTCGCAI I IGI I CCCTGGTTATTGACTCAAGAGTGAATATTTGTATTCAGCAAGA 
GATT AT AAAAAAAATGAATG CT ATGCTT GACAAAATGCCTCAAGATGCCCGGAAAAT ACT CT C 
TAACCAAGAAATGTTAATTCTCATGAGTAGTATG^GAGAAAGGATTTTAGATGGT GGAGATT a 
TGACTTACAGGTAGGCATTGTAGAAGCTTTGTGT AGAATGAGCACAGAAAAACAAAGACAAG 
AACTGGCACATCAGTGGTTTTCAATG^TTTTATTGCTAAGGCATTTAAAAGM 
CTGAATTTGAAACAGATTGCAGGATATTTCTCAACCTTGTAAATGGGATGCTTGGAGACAAAA 
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GAAGGGTCTTTACA I MCCI I GTTTATCAGCATTTCTTGATAAATATGAGCTGCAAATACCATC 
AGATGAAAAACTTGAGGAATTTTGGATTGATTTTAATCTTGGGAGTCAGACT ctct cattct a 
cattgctggagataatgatgatcatcaatgggaagcagttactgtgccagaggaaaaagtac 
aaatatacagcattgaagtgtgagaatcaaagaagctactgacaataattctgaaaaatacag 

T AAAAATT AGCAAAAGAGAAGGGAAAGAATTGCTTTTGT ATTTTGACGC ATCACT AGAAATCA 
CTAATGTAACTCAAAAAATTTTTGGTGCAACTAAACATAGGGAATCTATCAG 

tttcagttgccaaaacgtcgctg catatactttttgacgcaagtggatcacagattctagtgc 

cagaaagtgaaatctcaccagtcggagaagagctcgttagtttaaaggaaaaatcaaagtcc 

cgaaaggaatttgctaaaccttcaaaatatatcaaaaagagtgacaaagggaatagaaataat 

agtcagcttgagaaaactactcctagcaaaagaaaaatgtctgaagcatcaatgattgtttct 

ggtgcagatagatacactatgagaagtcf^gtgcttttcagcaacaca^ 

aagaagaagaattaaaccaccactgcaaatgacgagctctgcagagaaacctagtgtttctc 

aaacatcagaaaatagagtggataatgctgcatcactgaaatctagatcatcagaaggaaga 

catagaagagataatatagacaaacatatcaaaactgctaagtgtgtagaaaacacagaaaat 

aagaatgtigaattcccaaaccaaaattrragtgaactgcaggatgttataccagattcacag 

gcagcggaaaaaagagatcatactatattacctggtgttttagacaacatctgtggaaataaa 

AT ACACAGCAAATGGG CATGTTGGACACCTGT AACAAAC ATTGAACT ATGT AAT AACCAAAGA 

gcaagtactt03tcaggagacacattgaatcaagatattgttataaataaaaaacttactaaa 

caaaaatcatcctcttcaatatctgatcataattctgaaggaacaggaaaagtgaaatataag 

aaagaacaaaccgaccatatcaaaatagataaagcagaagtagaagtttgcaggaaacacaa 

tgagcaagaaaatcatcctt aaat att cagggcagaaaaatactgaaaatgocaagcagagtg 

attggcctgttgaatctgaaactacttttaaatcggttctcct aaataagacaattgaagaat 

cgctgatatataggaagaaatacatattgtcaaaagatgtgaatactgctacttgcgataaaa 

atccatctgctagcaaaaatgtgcaaagtcatagaaaagcagagaaagaattgacttct 

cttaattcctgggattcgaaacaaaaaaaaatgagagaaaagtcaaaagggaaagaatttagc 

aatgtagcagaatccttgataagccaaatcaataaaagatacaaaacaaaagatgacatcaag 

tctacaagaaaattaaaggagtc7ttgattaacagtggtttttcaaacaaacctgttgt acaa 

ct cagt aagg^aaaagttcagaaaaaaagctacagaaaactgaag actacctttgttaatgtt 

actt ctgaatgcccagtgaatgatgtttacaattttaatttgaatggagctg atgaccct atc 

at aaaacttggaatccaagagtttcaagct acagct aaag aagcttgtg cg g ataggtcaat 

tagattggtaggtcx^ggaatc*tgatgaacttaaatcrrrctgtca 

aattataacaaatcatcaaaagaaaaatctgtttagtgatactgaaagagagtacagatgtga 

TGACAGCAAGACTGATATTAGCTGGCTAAGAGAACCGAAATCAAAACCACAGCT 

AT AGCAGAAATAAAAATGTGAAGAATCATAAAAGTGGAAAATCAAGATCATGCTT GGAAAAGG 

GACAGCCAAGCTCTAAAATGACACCCAGTAAAAATATCACAAAAAAGATGGACAAGACAATT 

CCGGAAGGAAGAATCAGACTTCGACGAAAAGCAACCAAAACAAAAAAAAACTATAAAGATCT 

CTCAAATTCAGAATCAGAGTGTGAACAAGAATTTTCACATTCATTTAAAGAG AACAT ACCAG T 

AAAGGAGGAGAATATCCATTCCAGAATGAAAACGGTAAAGCTACGAAAGAAACA 

TCTTCT GTGCT GAAACAGAAAAG GAACT ATCAAAACAAT GGAAAAACT CATCT CT ACT AAAAG 

ATGCT ATACGAGATAATTGCCTTGACTTATCTCCCAGATCTTTATCTGGCAGTCCATCAT^ 

TAGAAGTAACGAGATGTATAGAGAAAATAACAGAAAAGGATTTTACTCAGGATTATGACTGCA 

T AACAAAATCTAT ATCAGCTTATCCAAAAACTTCATCACTT GAATCCTT AAATAGTAACAGTGG 

AGTTGGAG GT ACAAT AAAGTCACCCAAAAACAATGAGAAAAACTTCCTGTGTGCAAGTGAAA 

GTTGTTCACCAATTCCACGACCAC I G 1 1 1 1 1 GCCCAGACATACTCCAACT AAGAGTAATACTA 

TrGTAAATAGAAAAAAAATAAGTTCTCTGGTACTT ACACAAGAAACACAAAACAGTAACAGCT 

ATTCAGATGTAAGCAGTT AT AGTTTCAGAAGAACGGTTT ATGGAAATTGAATCTOCACATATCA 

ATGAAAATTATATACAAAGCAAAAGAGAGGAAAGTCATTTAGGATCTTCATTATCG 

GTGAAGGAAGAGAGAAAACGTGGTTTGACATGCCCTGTGATGGTACTC^ 

CACCCAACATCTTAGTCGCAAAAGAATAT ATATAGAAGAT AATCTAAGT AATTCCAATGAAGT 

AGAAATGGAAGAGAAAGGAGAAAGGAGAGCAAACTTGCTTCOCAAAAAACTGTGTAAAATTG 

AAGATGCAGATCATCATATCCACAAAATGTCTGAAAGTGTATCTTCATTATCAACAAATGACTT 

TTCT ATTCCTTGGGAGACCTGGCAAAATGAATTTGCAGGGATAGAGATGAGTTATGAGACTT 

ACGAGAGGCTCAATTCAGAATTTAAGAGAAGGAATAATATGGGACATAAAATGTTGAGTTATT 

TT ACT ACGCAGTCTTGGAAAAGAGCT GAGCAACATCTGAGAACAATGAATCATCAAAGTCAG 

GACTCT AGGATTAAAAAACTTGATAAATTCC^TTCATTATCATAGAGGAGCTGGAGAATTTT 

GAAAAAGATTC^CAGTCTTTAAAAGATTTGGAAAAGGAATTTGTGGACTTTTG GGAAAAGATA 



Table 4 



TTTCAGAAGTTCAGTGCATATCAAAAAAGCGAACAACAGAGGCTTCATCTTTT gaaaacttca 
TTGGCT AAAAGTGTCTTCTGTAATACTGATAGTGAAGAAACTGTTTTTACATCCGAGATGTGT 
TTGATGAAAGAAGATATGAAAGTGCTGCAAGACAGGCrrTCTTAAGGACATGCTAGAAGAGGA 
GCTTCTT AATGTACGGAGAGAACTGATGTCAGTATTCATGTCTCATGAAAGAAATGCT AATGT 
GTGAAATCTAG* 1 1 1 1 ATCACCATACTTTATCTAATTATTATTCTCTGTATATAACTGAGGAAAT 
AAGMTAGTCCTACAAAGAGAAAAATATACATGra 

CCCTCAAATTAAAAAAAAATGTCTTTT AATGGA TGAGAGGGAACCACTATAACATGAGTCCAA 
GCCCA GAAGACTTCTGTCTATACAATA I lllllll T AATTTTGGAGATAAAAGCTTT AAGAAAC 
1 1 1 1 I GAGTTAATTATAC T CAT A AAATGAGTTTCTTTAATAAATTAAATTTTATTGTGT AAAATGT 
ATTATTACATAAAATGTGTTTTTGAATGAATGCAGTTTGGGGATGAATATAATTAAAATATGTT 

taataactt agaattcaact aataaaaatttagccacacttacaagggggaggaagtccctag 

TTT AAAATGT AT AACTGAGTGGTAGATCAGTACTTTCAGCACACTGTTGGAAACATTTATTCA 

GATATGGCTCT AATGTATTAGGAAGCACTAAATGGCCT AAAAAAGCTACTACATTGCCTAAAT 

ATGTTAATTCAATATAGAAGTCCTATTTCATAACCAGGCTGTTTGAC^^TACTTTTAATCTAG 

TAGTCATTGTAATATCTTGCT AGATTAATTTATAAAAATGAGTATACATTTGATTTGCTTTT AAT 

GAAGTTGAAATAAATGCTTATGTCACTTGAATAAATATAAATCATTATANNNJN 

>1409 

>1410 

>1411

>1412 

>1413 

>1414 

>1415 

ACTT *pAAAAA l " " AACATTAGGAGGT AATTATAAGTAGATTCTGTGATTAGGACTT 
GATTCATGTATC 1 1 1 I GCT ACATAAACCTTTGTTAGATTAAATGGAAGACACCT GCT AGGTGA 
T ACTTTTT ATAAAACATATGAGT AAGTCATATATCTTTGTT AAATTTCTGTATGTT CTTTTTTGT 
ATAAAGATGGAGAGAAAGGATGGAGTGATACTAAGGACCCTAATAACATCTCTGTTCAAATTA 
ATT ACT AAGTGATAGAAGTATTCATATGCX^TTAAAGATTTGCCAATTCTrATTTGAATTTTAT^ 
GATAAACTTGAAAATCAAATAACCTAACAGCTGTC I fTTCTTTCn I CT AAACCCTTTT AAGAA 
TAGATTTAATAi I 1 1 I CTGAGTTTTCATTAAAGAG TTATTTATGTTACGG rTTGTTTTI ATAAAA 
GTAGCATCGCAAAAT AAAAAAGTCTGCAT CCTTGCAAG TT ATTCACT GCT CACG TGCTT GCT C 
TTCTCTGCTAAATTAAAAAAATAAAGATCAAGAAGAGT^ 

GATGGGTTGAATCCTGTGAGTAAGTGAAAGAGTAATAGGAAAAAAAACACT ATGGTCATGAA 
AGAACT GCATCCTGAAAGGTTGTG 

>1416

CAATTAA CTAAA C1 M I II 1 1 AAATTAACTAAAGTTGGTATTTCACTAGGACGATTTAG 
S^IS^I9 GGTm ^^ AGAACACCAI 1 rCTTCI I C CTTATTAC A I I ICCCI 1 1 1 1 AAT 

AAAGCU I CC I ACATTAATACTTGAGCATGATAACCTT ACAAT GAAAGGTAAATCATTAGTTG 

^I^I^^y^ 1 1 1 IC1 ' CCCCTTGTGGAGAAAAATCAAATTGC \ I M CCI CI GCTCTT ACAC 

CACAATttATC AAGACAX nAGACTTCTGT GACCAAATGTGTGGGG AI 4 I C I CCCCACCAACAA 

GAAGCAAJCAGTI C I GCAGAAGATACCAr^AGCTGAGCACCTTGT AATCCAATTCAATTCTCA 

T^^CTACCTGGAGATAGCATTAGATCCX^^ 

*?TGeCGCCACTra 

GCTTCAAGTCAGTGTTOCCCA 

cagaactcaggaaaacacgtgtact GGTTT ATT ATAAAGAATAGCACAAAGAATGCAGATGA 
AGAG^TIGCACAGAACAAGGCATGTGGGAAGGGGTGCAGAGCTCT 

GCACCACCC > CCAGGACCCTCCACGTGTT CAGCT ATCTGGAAGCTCTCTGTACCCGGTCC A 
GGACTTCACTGGACCGTCACGA/rTGAAGCATGGACAACCGGGT ANAAA T GATTGGAMMN 

>1417
>1418

TGAATTTCAGTGGCACCATACCCACAGGCAG^ 

CATGGC1 1 CCCAGCAAAT ICI CTI GGCATOU3CCTCCAGGTAGCTTCCAGAGAGTTCCACTA 
GCACACT AGAGGCCATCTTOCTTATAACGCCCAGAAAGTGTTGCAGGCACCC^ 
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CTGTCTAGTGATTTCCACCAGCACCCCCAGAAAGC^GGCCCCTGCTTGW 
GGACAACCCCACAGGCTTCCCCACTATCCAGTGGGCCGTGACCAAATCCTGTTTAATG^^ 

CCTTAGCCCAGCAGTTACAGATGCTCTTGCAATTTGCTGCTTAT^ AGAGTCCT CA 
GTCCCTCTAGCAGCTAATCCCATGTTACX^GTTGACGACTCTTCTAG 

GGTGTAAATTAGTACAAGCTCTATGGAAAACAGTATGGAGATTTCTCAAAGAACTAAACATAA 

ACCCCCATTCAATCCAGCAAGCCCACT ACTT GGCA 

>1419 

ACAC^T AAGTTCATTCTTGGCTTTTTAAATTTTATGGAAAGACT AAATAGATTTGTGTC 
TATTAATCAAAATATGAATTTAGAAGGAAATAATTTTGTGTAAAAAATTGTATGTGGTAAAATTT 
TACCT AATTT AAAATTGTTGTTCCAT AA 1 117111 AAAAAG AAAAATT ACAGAAATAAGACTT GG 
GGGGTGGGGGTTG AAAAG TGGTGAAAGAACT AAACAAGT AGAAGAGGATTTCT AAAGCACT 
GGTCTCATGAAAAAAGTTTCATGTGTGACTGGGTCCACTG AGATTGAAAAGiAAAT^ ATA 



AACAACAAAG TTTT ctt aaagt ATTGAT ACACG CTTTTTAAAAATTGCAAG AGGTTTTAAGTTTA 

ATTCAAAAATCTGTTTAACAGCCATTTTGTACCTGCCCNNNN 

>1420 

>1421 

>1422 

>1423 

NKffWNNNNWWNNNNhO^CACTCTGTATTAAAAGTAriAACTT ACT AAAAG AAAAGAGG 
TTTGTGTTCACATTAAATGGTTTTGGTTTGGC I ICI 1 1 1 AGTCAGGCTTTCT GAACATTGAGAT 
ATCCTGAACTTAGAGCT CTTCAATCCT AAGATTTTCATG AAAAGCCT CT CACTT GAACCCGAA 
CCAGAGTACGCGGGCAGTGTTCTATCTCTGAGGCTGAGCATTATAACLAAAATGAC^ GCT 
CCTTTTC*TTGCAGAAAATTGCCAGGGGCTTATTTCAGA 

GCTCT CAAACTCTCTAACTTATAAGTGTTGTGAACCCCCACCCAGGCAGTATCCATGAAAGC 

ACAAGTGACTAGTCCTATGATGT 

>1424 

>1425

>1426 

>1427 

>1428 

NHACCAT ATAGG TTQOACGACGGAOACAOG^ 
AATOCAGAOCAA ATGTItJCro 

TCOOCAATCTtX^ritnCTlCATTCCACACTTTXIA^ 

GTCAAATTTTTA1 1 1 1 1 H 1 1 H I I 1 ACAT AT ATAGCATAOTGCTCATGCTTAAAGA I UL I O I IO 

ACTGAAATAACTTOCAAGGAATATTAGTTCAOGGAAATATCAAAGTGTAAAAAT 

TAGTQAOAAAGAAAAACOCATCCAACACCAGCA^ 



OICAAOACI rOOCI ICIATTITTOCAAAGTAATCACTAACTQAOTCAOCTGT G AAATTO^ 
TGCTCCTTG AI CI I O I G J A ACATTT ATX^AATCATTT AGGCTGTITT3OGTrGAA0GAGG<nTAT 
TACAATAAGAACATCTTTCAGAGA CI 1 1 1 IC AAOAAAATTPTTCATTGTTGGOGGCAAAAAAC 
CCCTCATTTGCCCACAGGAACTAAGO > 1 1UUIL.TGT CAOCCAAAC<n'ATTTTtTCATTATTHfXZAA 
ACTOGOAAAGTOTAGCTOGTOOGGGGCAGTTAGAAG^ 
AGTTAGAAGAGACACTTTTAAACAGTATTTTTTATTTAAA^ 
TAC ATTXA ACATCAAATCAACTTAAAAAACCACCATtTC^ 

ACATCCTICACACTGCT ATATTACC n Ol C I G I GAT ATT ATGG A I UL I GO 1 1 AGGAAAAGGATT 
CATACCACAACCAATCACG i ULAjOCTATGCTQAGaaGTGTCCA 1 1 IC I 1 1 AOOCTGAAAATAA 
GACTCTACTTTCATAAACAAAT AGGAAAATGTGATCTA 1 loci IC I 1 1 ICI roOT O OGTQ COOC 
rCOCCl rOCTAAOTCAGl 1 1 1 ICti ACTGCOGIGT GJGl U l tJIOlOlOlUlOl O CATGIGrOl O 
TCQJTATOTtJTTTTCCAOAOO^^ 

TCACAAAGGTCrrAaAOGCCACT ACTOCGCTtXX3GA C 

rrCCTCTTCCCAATGTGCACTCTr^ 

TATTTTATGN 
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